São Paulo's city hall quietly began a formal duplicate-image-replacement drive in the first quarter of 2026, targeting the sprawling digital repositories maintained by the Secretaria Municipal de Gestão (SMG) and the urban-planning body SP Urbanismo. The effort, rooted in a broader digital-governance directive signed in late 2025, aims to strip redundant photographs, scanned documents and georeferenced imagery from public databases that, according to SMG's own internal audit framework, had grown by more than 40 percent in file volume between 2020 and 2024 — much of that growth driven by duplicated assets.
The timing matters. Cities across Latin America and Europe are now reckoning with storage costs that once seemed trivial but have ballooned alongside cloud-migration contracts. For São Paulo, which runs one of the largest municipal data infrastructures in the southern hemisphere, redundant image files inflate licensing fees paid to cloud providers and slow down the public-facing Geosampa mapping platform that urban planners, journalists and residents all rely on for flood-zone data, zoning records and street-level photography of the city's 96 subprefeituras.
What São Paulo Is Actually Doing
The SMG directive assigns deduplication work primarily to two units: the Centro de Processamento de Dados (CPD), the city's legacy IT body headquartered near the Bela Vista neighbourhood, and a newer data-quality team embedded within SP Urbanismo's offices on Rua São Bento in the historic centre. The CPD is running automated hash-matching scripts against image libraries to flag files with identical or near-identical pixel signatures, while the SP Urbanismo team is focused on the Geosampa layer specifically — where aerial survey photographs taken in different cycles frequently overlap and create redundant tiles covering the same Avenida Paulista block or Pinheiros riverside stretch multiple times over.
Officials have not publicly disclosed the total number of duplicate files identified so far, and the SMG did not respond to a request for comment by press time. What is clear from published budget documents filed with the Câmara Municipal de São Paulo in April 2026 is that the city allocated R$3.2 million within the digital-infrastructure line of the 2026 municipal budget to data-quality work broadly defined — though that figure covers more than just image deduplication.
How São Paulo Compares to Peers
The global picture is uneven. Mexico City's government launched a comparable image-deduplication programme for its CDMX Digital platform in 2023, backed by a partnership with a local university consortium, and has publicly reported reducing its aerial-imagery storage footprint. Bogotá's Catastro Distrital has moved aggressively toward single-instance storage for property-record images since 2022, partly driven by a World Bank urban-data grant. London's Greater London Authority consolidated its mapping image libraries following a 2021 infrastructure review, setting a benchmark that several European municipal governments have since cited.
São Paulo's approach sits somewhere in the middle of this spectrum. It is ahead of several Brazilian state capitals — Porto Alegre and Manaus, for instance, have no publicly documented deduplication programmes as of mid-2026 — but it is moving more cautiously than Bogotá or Mexico City, both of which published detailed methodology reports and measurable targets. The absence of a published São Paulo target figure or timeline makes independent evaluation difficult.
Part of the challenge is São Paulo's sheer size. The city covers 1,521 square kilometres and generates imagery constantly — from traffic-monitoring cameras along Marginal Pinheiros, from drone surveys of flood-risk zones in Itaquera and Grajaú, and from the urban-renewal documentation required whenever a project touches a Zona Especial de Interesse Social. Each of those streams feeds databases that were not originally designed to talk to each other, making automated deduplication harder than in smaller, more recently digitised cities.
For residents and organisations that use Geosampa or the city's open-data portal dados.prefeitura.sp.gov.br, the practical upshot is gradual: faster load times, more reliable search results and, eventually, lower risk of outdated imagery being mistaken for current conditions — a genuine problem in flood-mapping. The SMG has indicated it expects the first phase of the deduplication work to be complete before the end of 2026. Whether that schedule holds will depend on procurement timelines for additional processing capacity and, more fundamentally, on whether the two lead units — CPD and SP Urbanismo — can maintain coordination across a bureaucracy that has historically operated in silos.