Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

How São Paulo's Public Archives Ended Up Drowning in Duplicate Images — and What It Cost to Get Here

A decades-long failure to standardise digital image management across city agencies has left municipal databases bloated, budgets strained, and a reform effort that is only now gaining traction.

By São Paulo News Desk · Published 4 July 2026, 3:40 pm

4 min read

How São Paulo's Public Archives Ended Up Drowning in Duplicate Images — and What It Cost to Get Here
Photo: Photo by Th2city Santana on Pexels
Traduzindo…

São Paulo's municipal government is sitting on hundreds of thousands of redundant digital image files — duplicated photographs, scanned documents, and graphic assets spread across at least a dozen separate departmental servers — and the bill for storing them has been climbing for years. The problem did not appear overnight. It is the accumulated result of more than two decades of uncoordinated digitisation drives, each launched by a different administration with different software, different naming conventions, and no shared protocol for checking whether an image already existed before uploading it again.

The issue matters now because the Nunes administration's ongoing overhaul of the city's digital infrastructure — centred on the Programa de Modernização da Gestão Municipal, which city hall has been rolling out in phases since 2024 — has made the bloated image repositories impossible to ignore. Technicians auditing the Secretaria Municipal de Gestão's internal systems found that a significant share of storage capacity across linked servers was consumed by files that were either exact copies or near-identical near-duplicates of assets held elsewhere in the network. The finding accelerated pressure on city hall to adopt a unified digital asset management standard before the next phase of the programme goes live.

The Long Road to a Broken System

The roots of the problem run back to at least the early 2000s, when individual secretarias began scanning paper records independently. The Arquivo Histórico Municipal, located on Rua Quirino de Andrade in the city centre, digitised tens of thousands of photographic prints from the late nineteenth and early twentieth centuries starting around 2003. Meanwhile, the Secretaria de Cultura separately commissioned its own digitisation of overlapping collections held at the Centro Cultural São Paulo on Rua Vergueiro in Paraíso. Nobody built a bridge between the two databases. By the time the Secretaria de Inovação e Tecnologia began pushing for integration in 2021, the duplication had compounded across multiple migration cycles — every time a server was replaced or a department upgraded its content management system, files were bulk-transferred without deduplication checks.

The problem is not unique to government. São Paulo's private sector has grappled with the same dynamics. The city's tech ecosystem, which includes more than 20 active unicorn-level startups according to Associação Brasileira de Startups figures published in 2025, built an entire product category around image deduplication tools aimed at e-commerce platforms managing product catalogues on marketplaces like Mercado Livre. The corporate solutions exist. The city simply never bought or built one at scale.

Storage costs for the municipal network are not publicly itemised in a single line of the orçamento, but procurement records filed with the Tribunal de Contas do Município show that the city renewed cloud storage contracts worth a combined R$ 47 million in 2025, covering services used by multiple secretarias. Independent IT specialists who have reviewed public contract documents have noted that deduplication alone, applied before such renewals, can reduce active storage requirements by between 20 and 40 percent in large institutional environments — a range consistent with findings published by the International Data Corporation in its 2024 storage efficiency report.

What Comes Next for the City's Image Stockpile

The Secretaria de Inovação e Tecnologia has been tasked with producing a unified asset registry by the end of the third quarter of 2026. The plan, as outlined in internal programme documents reviewed by A Diária São Paulo, calls for a phased deduplication sweep beginning with the Arquivo Histórico Municipal and the city's urban planning databases held by SMUL — the Secretaria Municipal de Urbanismo e Licenciamento — before expanding to satellite departments.

For residents and civil society groups that rely on municipal image archives — researchers at USP's Faculdade de Arquitetura e Urbanismo on Rua do Lago in Cidade Universitária regularly request access to historical city planning photographs, for example — the practical benefit will be a faster, better-indexed public portal. The current search interface, last updated in 2019, routinely returns the same image multiple times in a single query because the backend treats each duplicate as a distinct record.

City hall has not announced a public launch date for the cleaned archive. What is clear is that the longer the sweep takes, the more the next storage contract renewal will cost — and that renewal is already on the calendar for early 2027.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.