Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

São Paulo's Duplicate Image Crisis: The Key Decisions That Will Shape the City's Digital Infrastructure

Municipal systems, tech startups and federal databases are all grappling with what to do about duplicated image files—and the choices made in the next few months will have lasting consequences.

By São Paulo News Desk · Published 4 July 2026, 3:45 pm

3 min read

São Paulo's Duplicate Image Crisis: The Key Decisions That Will Shape the City's Digital Infrastructure
Photo: Photo by Kaique Rocha on Pexels
Traduzindo…

São Paulo's public administration is sitting on a problem it can no longer defer. Across municipal databases maintained by the Secretaria Municipal de Gestão, duplicated image files—scanned documents, urban survey photographs, permit attachments—have accumulated to the point where storage costs and data integrity failures are actively disrupting city services. The immediate question is not whether to act, but which of several competing technical and contractual paths the city will take before budget negotiations close in September 2026.

The issue has sharpened in recent months because the federal government's push to consolidate state and municipal data under the Rede Nacional de Dados em Saúde and related programs means São Paulo can no longer treat its internal data hygiene as a purely local concern. Duplicated records don't just waste storage—they generate conflicting entries in shared federal registries, slow licensing approvals and inflate costs charged back to taxpayers under per-gigabyte cloud contracts.

The Local Stakes: From Paulista to the Prefeitura's Server Rooms

The problem is visible in at least two critical operational layers. The first involves the city's urban planning directorate, which stores georeferenced images of every permitted construction site in districts from Vila Madalena to Itaquera. Engineers at the Secretaria Municipal de Urbanismo e Licenciamento have noted—without formal public disclosure—that file deduplication has been discussed internally as a prerequisite for the planned digitisation expansion set to cover all 96 subprefeituras by the end of 2027.

The second layer concerns the tech ecosystem clustered around Avenida Faria Lima and the broader Berrini corridor, where at least a dozen startups provide image-processing services to municipal and state contractors. Several of these companies, including firms registered with the Associação Paulistana de Startups, are competing for a contract renewal cycle that the city's information technology arm, PRODAM, is expected to open for bids in the third quarter of this year. The deduplication question sits at the centre of those procurement specs: whether the city mandates hash-based deduplication at ingestion, or adopts a retrospective clean-up model, will determine which vendors are technically qualified to bid.

Cloud storage is not cheap at municipal scale. Brazilian public cloud contracts negotiated under federal procurement framework Catálogo de Soluções de TI ran at roughly R$0.08 to R$0.12 per gigabyte per month for warm-tier storage as of contracts published in 2025. For a database running into hundreds of terabytes—which São Paulo's combined municipal image repositories almost certainly do—even a 15 percent duplication rate translates to millions of reais in avoidable annual expenditure. PRODAM has not published a specific figure for current duplication rates, but the fiscal argument for action is straightforward.

What Happens Next: Three Decisions the City Cannot Avoid

The first decision is technical: choose between perceptual hashing—which identifies near-duplicate images even when file sizes differ—and exact-match deduplication, which is cheaper to implement but misses the altered copies that accumulate through repeated scanning and compression. Procurement documents circulated internally at PRODAM suggest the city is leaning toward a hybrid approach, but that remains unconfirmed as of July 4.

The second decision is contractual. PRODAM must determine whether existing vendors under the current framework agreement, signed in 2024, carry liability for duplicates generated under their watch—or whether the city absorbs that cost as a transition expense. Legal teams at the Procuradoria Geral do Município are understood to be reviewing the relevant clauses.

The third and most politically consequential decision involves timeline. Mayor Ricardo Nunes' administration has signalled a preference for showing measurable digital modernisation results before the 2028 municipal election cycle begins to dominate the political calendar. That pressure compresses the realistic window for implementation to somewhere between 18 and 24 months—meaning a vendor selection by early 2027 is effectively the minimum viable schedule.

For São Paulo's tech sector and the civil servants who depend on functioning digital infrastructure, the coming weeks of procurement drafting will matter more than most public discussions acknowledge. The city that hosts Latin America's largest concentration of enterprise software companies cannot afford to let a solvable storage problem become a systemic one.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.