São Paulo's municipal digital systems are carrying a hidden burden. Across government portals, public health databases and urban planning platforms managed by the Prefeitura de São Paulo, redundant and mismatched image files are degrading data quality, slowing services and in some cases attaching the wrong photographs to property records, public works bids and social program registrations. The problem has moved from a technical footnote to a policy conversation — and the people closest to it are losing patience.
The timing matters. Mayor Ricardo Nunes has staked a significant portion of his second-term agenda on digitising city services, channelling resources into the SP 156 citizen portal and the connected systems underneath it. Duplicate image replacement — the process of identifying, validating and systematically removing redundant or erroneous image files — sits at the foundation of that modernisation push. If the underlying data is dirty, the apps built on top of it inherit the same problems.
Where the Problem Shows Up
Three areas of city administration have emerged as particularly acute. Urban drainage mapping, maintained partly through the Departamento de Águas e Energia Elétrica do Estado de São Paulo and coordinated with city engineers, relies on georeferenced photo documentation of canals, piscinões and flood-prone streets in the Zona Leste. Duplicated images in those records have, in documented cases reviewed by municipal auditors, caused inspection reports to cross-reference outdated photographs of infrastructure in neighbourhoods like Itaquera and Guaianases, according to internal documentation described in a February 2026 transparency report published by the Controladoria Geral do Município. The CGM report did not specify a monetary cost but noted that corrective re-inspection visits added avoidable expense to the 2025 drainage maintenance budget.
The second pressure point is the Sistema de Informações para a Infância e Adolescência, the municipal child welfare registry. Caseworkers at CRAS units in Brasilândia and Cidade Tiradentes have flagged instances where profile photo fields pulled from a shared image repository returned mismatched or duplicated files after a server migration carried out in late 2024. Social service coordinators have raised the issue in working groups convened by the Secretaria Municipal de Assistência e Desenvolvimento Social, though no formal public statement has been issued confirming the scale of the problem.
Then there is the private sector. In the Vila Olímpia and Faria Lima tech corridor, several startups building on top of open city data APIs discovered the duplication issue when their computer vision models began producing classification errors traceable back to repeated training images sourced from the Prefeitura's open data portal, geosampa.prefeitura.sp.gov.br. The portal, which hosts satellite imagery, street-level photographs and cadastral records for the entire city, had not undergone a systematic deduplication audit since its 2019 relaunch.
What Needs to Happen, and Who Is Saying It
Specialists in data governance have been consistent on the remedy. Perceptual hashing — an algorithmic technique that generates a compact fingerprint for each image and flags near-identical files — combined with human review workflows, is the industry standard for large municipal repositories. The process requires an upfront investment in tooling and staff time but sharply reduces ongoing data maintenance costs. Brazilian federal guidelines issued by the Secretaria de Governo Digital in 2023 explicitly recommend deduplication protocols as part of any government data modernisation programme, providing a policy hook that municipal officials can use to justify budget allocation.
On the GeoSampa side, the portal's technical team has publicly acknowledged the backlog in general terms in its 2025 annual report, noting that the image catalogue had grown to more than 4.2 million files without a corresponding growth in curation capacity. Advocates at the open-data organisation Transparência Hacker, which holds regular meetings at Garagem Fab Lab on Rua Henrique Schaumann in Pinheiros, have called for a dedicated maintenance fund rather than one-off project contracts.
The practical path forward, as articulated in the CGM report and echoed by specialists working with the city, involves three steps: a full audit of the GeoSampa image archive by the end of the third quarter of 2026; a phased deduplication of welfare and drainage records through an open procurement tender; and the adoption of an image provenance standard that tags each file with source, date and verified location data at the moment of upload. None of that is complicated. The question is whether it becomes a budget line before the next flood season, or after.