Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

São Paulo's Digital Platforms Scramble to Fix Duplicate Image Problem That's Clogging City Services

A surge in duplicated image files across municipal databases and local tech platforms this week exposed deeper cracks in São Paulo's digital infrastructure push.

By São Paulo News Desk · Published 4 July 2026, 3:48 pm

3 min read

São Paulo's Digital Platforms Scramble to Fix Duplicate Image Problem That's Clogging City Services
Photo: Photo by Willian Santos on Pexels
Traduzindo…

Thousands of duplicate image files embedded in São Paulo's municipal systems and several Faria Lima corridor startups came to a head this week, forcing emergency audits and patch deployments across at least three major local platforms. The problem — redundant image assets multiplying inside databases and content management systems — is not glamorous, but it has real costs: slower load times, ballooning storage bills, and in at least one case, a public-facing city service displaying the wrong photograph alongside a citizen's records.

The timing is awkward. Mayor Ricardo Nunes has staked part of his second-term pitch on modernising São Paulo's digital services, and the city's Secretaria Municipal de Inovação e Tecnologia has been rolling out the SP156 platform upgrades through 2026. Duplicate image pollution — the term engineers use when identical or near-identical files proliferate without deduplication controls — is exactly the kind of technical debt that tends to surface when legacy systems are grafted onto newer infrastructure in a hurry.

What Happened This Week

The most visible incident involved a civic records portal linked to the Poupatempo unit in Itaquera, where users reported seeing mismatched profile images on documents pulled from the system between Monday and Wednesday. The Secretaria de Gestão did not issue a public statement by press time, and the nature and scale of the data mismatch has not been confirmed in official communications reviewed by The Daily São Paulo.

Separately, at least two São Paulo-based fintechs headquartered near Avenida Brigadeiro Faria Lima acknowledged internally — according to engineers active in the São Paulo tech Slack communities and GitHub discussions that are publicly visible — that deduplication scripts had failed following a June database migration. One repository, publicly accessible on GitHub as of Thursday, showed a commit history flagging the issue as introduced after a migration dated June 22, 2026. No user personal data was described as compromised in those public threads.

The broader tech ecosystem on Avenida Paulista and in the Vila Olímpia cluster has been wrestling with a related structural issue: the explosive growth of AI-generated imagery inside marketing and product platforms. When AI tools produce multiple near-identical images — slightly different crops, minor colour shifts — standard deduplication algorithms that rely on exact hash-matching fail to catch them. Perceptual hashing tools exist, but adoption has been uneven. iFood, Nubank and Mercado Livre all operate significant engineering hubs in São Paulo, though none of those companies confirmed any internal incidents this week.

The Cost of Getting It Wrong

Cloud storage is not free. Amazon Web Services S3 standard storage costs roughly R$0.023 per gigabyte per month in the South America East region as of mid-2026. For a mid-sized platform carrying several million product or profile images, duplicate bloat of even 20 percent translates to thousands of reais in avoidable monthly spend. For the municipal government, the calculation is different but the direction is the same: taxpayer-funded storage contracts that expand to accommodate files that should never have been saved twice.

São Paulo's digital infrastructure spending has grown significantly under the current federal and city administrations, with the federal Programa Cidades Digitais providing co-financing for municipal platform overhauls. That program requires recipient cities to meet data governance benchmarks, which technically include deduplication standards for image assets in citizen-facing systems. Whether São Paulo is currently meeting those benchmarks is a question the Secretaria had not publicly answered as of Friday morning.

Engineers and platform managers dealing with this problem have several concrete options. Perceptual hashing libraries — including open-source tools like ImageHash, available through Python's package index — can catch near-duplicate AI-generated images that byte-level comparison misses. Running a deduplication audit before any major database migration, rather than after, is the step most commonly skipped under deadline pressure. For São Paulo's municipal systems, the SP156 upgrade calendar has the next major deployment window scheduled for August, which gives the Secretaria roughly six weeks to tighten its image governance protocols before adding more complexity to the stack.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.