Duplicate Image Replacement in São Paulo's Digital Infrastructure: The Key Decisions Ahead
City agencies and private platforms are under pressure to agree on standards for cleaning up redundant visual data—and the clock is ticking.
City agencies and private platforms are under pressure to agree on standards for cleaning up redundant visual data—and the clock is ticking.

São Paulo's municipal tech directorate is facing a critical inflection point over how the city's digital systems handle duplicate image files, a problem that has quietly ballooned into a storage and governance headache affecting everything from urban surveillance networks to the public health portal run out of the Secretaria Municipal da Saúde on Avenida do Estado. The immediate question is not whether to act, but who decides what counts as a duplicate—and what gets deleted permanently.
The timing matters because the city is midway through a broader digital overhaul tied to the Programa São Paulo Inteligente, which Mayor Ricardo Nunes launched as part of his 2025-2028 administrative plan. That program is integrating data from over forty municipal agencies into a unified cloud environment. When you merge that many legacy databases, duplicate image records multiply fast. Administrators who spoke generally about the challenge say the problem is structural, not accidental—decades of siloed procurement meant each secretariat bought its own document management software, and none of them used the same image fingerprinting standards.
Inside the Centro de Operações São Paulo—the city's main data command room near Largo do Arouche—analysts are already triaging redundant files across the municipal traffic camera archive and the city's cadastral mapping system. The cadastral issue is particularly acute in the Zona Leste, where rapid urbanisation in districts like Itaquera and Guaianases generated high volumes of aerial survey images, many of them uploaded multiple times by different contractors working on the same infrastructure projects. Storage costs are not trivial: cloud storage on Brazilian hyperscaler platforms was running at roughly R$0.08 per gigabyte per month as of the second quarter of 2026, and estimates circulating inside the Secretaria de Inovação e Tecnologia suggest the municipal archive holds tens of petabytes of unverified duplicate visual data.
Private sector platforms are watching closely. Movile, the São Paulo-based tech conglomerate whose subsidiaries include delivery and fintech operations across Latin America, has already standardised its own duplicate-image detection pipeline using perceptual hashing algorithms. Several São Paulo startups in the Cubo Itaú ecosystem on Avenida Brigadeiro Faria Lima have built similar tools and are pitching them to city procurement officers. The window for a city contract is real—and competitive.
Three choices now sit on the desks of municipal and state decision-makers. First, the city must decide whether to adopt an open-source deduplication standard or lock into a proprietary vendor solution. Open-source carries lower upfront cost but demands internal engineering capacity that the city's IT workforce, currently understaffed relative to demand, may not reliably supply. Second, there is the question of retention policy: Brazilian data law—Lei Geral de Proteção de Dados, which came into full force in 2021—requires that personal images, including those captured by public cameras, follow strict retention and deletion schedules. Any automated duplicate-removal system must be auditable, or the city risks LGPD enforcement actions from the Autoridade Nacional de Proteção de Dados. Third, and most politically sensitive, is who holds override authority when an algorithm flags an image for deletion that a human administrator wants to keep. That question touches union agreements with municipal IT workers, whose contract cycle reopens in March 2027.
The São Paulo State government is also a variable. The Poupatempo digital services network, administered at the state level and physically anchored at locations including the terminal on Avenida Paulista, shares some backend infrastructure with city systems. Any deduplication standard the city adopts will need at minimum a compatibility agreement with state-level data architects to avoid creating a new layer of redundancy at the integration seam.
Practical next steps are clearer than the politics. Municipal IT teams should run a full image audit across the three highest-volume databases—health, cadastral mapping, and traffic—before the end of the third quarter. Procurement officers should issue a request for information to vendors no later than September, with a formal RFP to follow in the first quarter of 2027. And the Câmara Municipal, which has a standing committee on digital governance, should schedule a public hearing before the contract is awarded. The city has done this before on smaller tech procurements. The scale this time is different.
How does this story make you feel?
Spread the word
About this article
Published by The Daily São Paulo
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News