São Paulo's municipal government is sitting on a ticking problem. Tens of thousands of duplicate image files — redundant photos, scanned documents, and infrastructure records stored across incompatible systems — are slowing down the city's digital services at precisely the moment urban managers need faster data. The question now is not whether to replace them, but who decides how, and at what cost to taxpayers.
The issue has crystallised over the past six months inside the Secretaria Municipal de Inovação e Tecnologia, which oversees digital infrastructure for a city of roughly 12 million people. Staff have flagged that legacy databases used by agencies ranging from the Centro de Gerenciamento de Emergências to the city's flood-monitoring network contain image libraries where the same file appears dozens of times under different identifiers. Every redundant file consumes storage, slows retrieval, and introduces the risk of decisions being made on outdated versions of the same photograph or engineering schematic.
Why the Timing Matters
The pressure is not abstract. São Paulo recorded more than 140 flooding events in the 2025–2026 wet season, according to the Centro de Gerenciamento de Emergências Climáticas, and rapid image retrieval — aerial drone photographs, drainage-map scans, street-level records from the Zona Leste and the Marginal Tietê corridor — is central to emergency coordination. When duplicate files force system operators to manually verify which image is current, response times stretch. That cost is measured in waterlogged streets and delayed rescue logistics.
At the same time, Mayor Ricardo Nunes's administration has committed to a broader smart-city push under the Programa São Paulo Inteligente, which earmarks investment in digital infrastructure through 2027. That program creates both an opening and a deadline: procurement decisions made in the next 90 days will determine which technology vendors, including several startups operating out of the Hub de Inovação Tecnológica on Avenida Paulista, get contracts to build the deduplication and image-management pipelines that the city needs.
Private-sector players are watching closely. Several São Paulo-based tech unicorns with computer-vision capabilities — firms that built their core products on image recognition for logistics and retail — have signalled interest in adapting their tools for municipal use. The Centro de Estudos e Sistemas Avançados do Recife (C.E.S.A.R), which has a São Paulo office near Pinheiros, has worked on similar data-hygiene projects for federal agencies and is considered a credible reference point for what a structured deduplication rollout looks like in a Brazilian public-sector context.
The Decisions That Cannot Wait
Three choices will define what happens next. First, the city must decide on a deduplication standard — whether to adopt a hash-based automated matching system or a hybrid approach that keeps human reviewers in the loop for sensitive infrastructure images. Automated tools are faster and cheaper upfront, but they carry a non-trivial error rate that, in the context of drainage-system schematics or building-permit photographs in dense neighbourhoods like Brás and Mooca, could mean critical records being incorrectly flagged as duplicates and deleted.
Second, procurement timelines matter. Under Brazilian federal law — specifically Lei 14.133/2021, the new public procurement framework that replaced the older Lei 8.666 — the city must publish formal tender notices with sufficient lead time. A contract awarded before September 2026 could realistically deliver a working deduplication system before the next wet season begins in November. A delay past that window pushes implementation into 2027, leaving emergency systems exposed for another full storm cycle.
Third, and politically most fraught, is the question of data governance. Which secretariat holds final authority over what gets deleted? Turf battles between the Secretaria de Infraestrutura Urbana and the Secretaria de Inovação are, according to public documents reviewed by this newspaper, already visible in internal procurement working groups.
The practical upshot for São Paulo residents is simple: if the city gets these decisions right in the coming weeks, the drainage-monitoring tools that track flooding on Avenida do Estado and the Córrego do Ipiranga should run measurably faster by the time the rains return. If it does not, the city will spend another wet season managing a digital archive that is simultaneously too large and too unreliable to trust.