Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

How São Paulo's Digital Archives Became a Graveyard of Broken Images — and What Comes Next

The city's public and private institutions are grappling with years of neglected digital infrastructure, leaving thousands of records, news pages and government portals riddled with missing or duplicated images.

By São Paulo News Desk · Published 4 July 2026, 3:51 pm

3 min read

How São Paulo's Digital Archives Became a Graveyard of Broken Images — and What Comes Next
Photo: Photo by K on Pexels
Traduzindo…

The problem did not appear overnight. By mid-2026, a systematic audit of São Paulo's municipal digital portals revealed that roughly 40 percent of image assets hosted on city-government websites had either gone missing, loaded as broken placeholders, or been duplicated across multiple records — the result of more than a decade of fragmented content management decisions, platform migrations, and chronic underinvestment in digital archiving.

The scale of the issue matters now for a precise reason: the Nunes administration's push to consolidate municipal services under a single digital gateway — the Prefeitura de São Paulo's Sp156 platform — has made the inherited chaos newly visible. When engineers began migrating legacy data from older departmental servers in the Vila Madalena and Lapa administrative districts into the unified system earlier this year, duplicated and orphaned image files surfaced in volumes that surprised even the city's own IT contractors. Files that had been uploaded multiple times under different naming conventions, then linked redundantly across dozens of pages, were exposing structural failures stretching back to at least 2013.

A Decade of Patch Jobs and Platform Switches

São Paulo's digital record-keeping has passed through at least four major content management platforms since 2010, according to procurement documents filed with the city's Diário Oficial. Each transition — driven by budget cycles, political changes, and vendor contract endings — left behind a residue of unresolved image libraries. The Secretaria Municipal de Inovação e Tecnologia, based on Rua Líbero Badaró in the Centro histórico, has publicly acknowledged the migration challenge in budget presentations to the Câmara Municipal, though no comprehensive public tally of affected records has been released.

The private sector has not been immune. Major São Paulo-headquartered news organisations and tech companies operating out of the Faria Lima and Berrini corridors discovered similar problems when auditing their own content delivery networks. Image deduplication — the technical process of identifying and replacing redundant or broken visual files with single canonical versions — has become a line item in digital operations budgets that many organisations simply did not plan for. One commonly cited industry benchmark holds that duplicate files can inflate storage costs by 20 to 30 percent on content-heavy platforms, a figure that adds up quickly at the scale São Paulo's media and government ecosystems operate.

The roots of the duplication crisis trace partly to the era of cheap cloud storage that followed 2015, when falling costs encouraged teams to upload first and organise never. A journalist filing a photo report from Avenida Paulista might upload the same image in three different resolutions with three different filenames; a government communications officer publishing a press release on the Palácio do Governo portal in Morumbi might attach the same official headshot twelve times across twelve separate announcements. No one was penalised for it. No system flagged it. The debt accumulated silently.

What the Fix Actually Requires

Duplicate image replacement — the process of auditing a digital library, identifying redundant or broken assets, establishing a single authoritative file, and updating every reference that pointed to the discarded copies — is less glamorous than it sounds. It requires database access, consistent metadata standards, and human review at a scale that automated tools alone cannot handle. The Arquivo Público do Estado de São Paulo, which maintains historical records in the Bela Vista neighbourhood, began a formal digital asset management programme in 2023 precisely to address this kind of structural problem, though its mandate covers state-level historical records rather than municipal operational data.

For organisations now confronting the problem, the practical path forward involves three steps: a full crawl of existing content to map every image reference, a deduplication pass using hash-matching tools that can identify identical files regardless of filename, and a redirect or replacement protocol that updates broken links without destroying historical URLs. The city's SP156 consolidation project is expected to complete its first audit phase before the end of 2026. For private publishers and tech firms along Faria Lima, the calculus is simpler — the cost of fixing the problem is now measurably lower than the storage, bandwidth, and reputational cost of leaving it alone.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.