Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

São Paulo's Digital Archives Are Drowning in Duplicate Images — and the Numbers Reveal Why It's Costing Millions

From municipal servers in Bela Vista to e-commerce warehouses in Brás, redundant image files are quietly eating storage budgets and slowing down the city's digital infrastructure.

By São Paulo News Desk · Published 4 July 2026, 3:45 pm

3 min read

São Paulo's Digital Archives Are Drowning in Duplicate Images — and the Numbers Reveal Why It's Costing Millions
Photo: Photo by Gabriel Schincariol Cavalcante on Pexels
Traduzindo…

São Paulo's public and private digital systems are carrying an estimated 30 to 40 percent of their total image storage in pure duplication — the same files saved multiple times across different folders, servers and platforms — according to data from technology audits conducted across Brazilian municipal networks in 2025. For a city that hosts more than 14,000 registered tech startups and operates one of the largest urban digital bureaucracies in Latin America, that redundancy translates directly into wasted money.

The issue has sharpened in 2026 because the Ricardo Nunes administration is pushing forward with the Cidade Inteligente program, a municipal initiative to digitise city services and consolidate data from departments ranging from SABESP billing systems to traffic camera networks managed through the Centro de Controle Operacional on Rua Boa Vista, near the Sé neighbourhood. As those systems merge, IT teams are confronting duplicate image stockpiles that built up over years of siloed record-keeping.

The Storage Bill Nobody Budgeted For

Cloud storage costs in Brazil have risen sharply since 2023, with enterprise-tier pricing from domestic providers reaching between R$0.09 and R$0.18 per gigabyte per month depending on contract size and redundancy tier. A mid-sized São Paulo prefeitura department archiving scanned documents, permit photos and infrastructure inspection images can accumulate tens of terabytes over a three-year cycle. When 35 percent of that volume is duplicated, the monthly overspend compounds fast. For the private sector, the picture is equally blunt: e-commerce operators concentrated around the Brás and Bom Retiro wholesale districts — many running product catalogues with hundreds of thousands of SKU images — routinely report storage inefficiencies of between 25 and 45 percent before deduplication tools are deployed, according to technology consultancies working the São Paulo market.

The federal government's own digital transformation body, the Secretaria de Governo Digital, identified duplicate file management as a priority issue in its 2024 national assessment of municipal IT infrastructure, noting that Brazilian city governments collectively spend a disproportionate share of IT budgets on unoptimised storage rather than active services. São Paulo, as the largest municipal data operator in the country, sits at the centre of that problem.

What Deduplication Actually Fixes — and What It Doesn't

Duplicate image replacement is not simply about deleting files. The technical process involves hashing algorithms that identify pixel-identical or near-identical files, then replacing redundant copies with a single canonical version and updating every database reference pointing to the old paths. Done badly, it breaks links across websites, internal portals and printed-output systems. Done well, it can reduce active storage requirements by 20 to 35 percent within a single audit cycle, according to documentation published by the Brazilian Association of Information Technology and Communication Companies, Brasscom, in its 2025 sector report.

Organisations on Avenida Paulista's tech corridor — including several of São Paulo's recognised unicorn-tier companies in the fintech and insurtech sectors — have incorporated automated deduplication into their content management pipelines, treating it as routine hygiene rather than a one-off project. The difference in approach between those firms and older institutional systems, including many tied to state-level agencies operating out of buildings near the Anhangabaú valley, is stark. Older systems often lack the metadata architecture needed for clean automated deduplication, requiring manual review that multiplies labour costs.

For city government departments beginning to consolidate under the Cidade Inteligente framework, the practical next step is a structured image audit before any large-scale migration. Technology officers recommend running a hash-based deduplication scan across all image directories before moving data to shared cloud environments — not after. Once duplicates are embedded in a unified system, the cost and complexity of removal increases significantly. São Paulo's window for getting ahead of the problem, rather than cleaning it up retroactively, is the current migration phase, which prefeitura communications indicate is scheduled to advance through the second half of 2026.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.