Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

São Paulo's Digital Archive Crisis: The Key Decisions Ahead on Duplicate Image Replacement

City agencies and tech firms face a critical crossroads as outdated duplicate image files clog public databases, slow emergency response systems, and drain municipal IT budgets.

By São Paulo News Desk · Published 4 July 2026, 4:00 pm

3 min read

São Paulo's Digital Archive Crisis: The Key Decisions Ahead on Duplicate Image Replacement
Photo: Photo by Luiz Silva on Pexels
Traduzindo…

São Paulo's municipal government is sitting on a ticking problem. Tens of thousands of duplicate image files — redundant photos, scanned documents, and infrastructure records stored across incompatible systems — are slowing down the city's digital services at precisely the moment urban managers need faster data. The question now is not whether to replace them, but who decides how, and at what cost to taxpayers.

The issue has crystallised over the past six months inside the Secretaria Municipal de Inovação e Tecnologia, which oversees digital infrastructure for a city of roughly 12 million people. Staff have flagged that legacy databases used by agencies ranging from the Centro de Gerenciamento de Emergências to the city's flood-monitoring network contain image libraries where the same file appears dozens of times under different identifiers. Every redundant file consumes storage, slows retrieval, and introduces the risk of decisions being made on outdated versions of the same photograph or engineering schematic.

Why the Timing Matters

The pressure is not abstract. São Paulo recorded more than 140 flooding events in the 2025–2026 wet season, according to the Centro de Gerenciamento de Emergências Climáticas, and rapid image retrieval — aerial drone photographs, drainage-map scans, street-level records from the Zona Leste and the Marginal Tietê corridor — is central to emergency coordination. When duplicate files force system operators to manually verify which image is current, response times stretch. That cost is measured in waterlogged streets and delayed rescue logistics.

At the same time, Mayor Ricardo Nunes's administration has committed to a broader smart-city push under the Programa São Paulo Inteligente, which earmarks investment in digital infrastructure through 2027. That program creates both an opening and a deadline: procurement decisions made in the next 90 days will determine which technology vendors, including several startups operating out of the Hub de Inovação Tecnológica on Avenida Paulista, get contracts to build the deduplication and image-management pipelines that the city needs.

Private-sector players are watching closely. Several São Paulo-based tech unicorns with computer-vision capabilities — firms that built their core products on image recognition for logistics and retail — have signalled interest in adapting their tools for municipal use. The Centro de Estudos e Sistemas Avançados do Recife (C.E.S.A.R), which has a São Paulo office near Pinheiros, has worked on similar data-hygiene projects for federal agencies and is considered a credible reference point for what a structured deduplication rollout looks like in a Brazilian public-sector context.

The Decisions That Cannot Wait

Three choices will define what happens next. First, the city must decide on a deduplication standard — whether to adopt a hash-based automated matching system or a hybrid approach that keeps human reviewers in the loop for sensitive infrastructure images. Automated tools are faster and cheaper upfront, but they carry a non-trivial error rate that, in the context of drainage-system schematics or building-permit photographs in dense neighbourhoods like Brás and Mooca, could mean critical records being incorrectly flagged as duplicates and deleted.

Second, procurement timelines matter. Under Brazilian federal law — specifically Lei 14.133/2021, the new public procurement framework that replaced the older Lei 8.666 — the city must publish formal tender notices with sufficient lead time. A contract awarded before September 2026 could realistically deliver a working deduplication system before the next wet season begins in November. A delay past that window pushes implementation into 2027, leaving emergency systems exposed for another full storm cycle.

Third, and politically most fraught, is the question of data governance. Which secretariat holds final authority over what gets deleted? Turf battles between the Secretaria de Infraestrutura Urbana and the Secretaria de Inovação are, according to public documents reviewed by this newspaper, already visible in internal procurement working groups.

The practical upshot for São Paulo residents is simple: if the city gets these decisions right in the coming weeks, the drainage-monitoring tools that track flooding on Avenida do Estado and the Córrego do Ipiranga should run measurably faster by the time the rains return. If it does not, the city will spend another wet season managing a digital archive that is simultaneously too large and too unreliable to trust.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.