Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

How São Paulo's Public Archives Ended Up Full of the Same Photo: The Story Behind the Duplicate Image Crisis

Years of fragmented digitisation projects, incompatible databases, and rushed procurement have left the city's public visual records riddled with redundant files — and fixing it is costing taxpayers more than the original problem ever should have.

By São Paulo News Desk · Published 4 July 2026, 4:06 pm

3 min read

How São Paulo's Public Archives Ended Up Full of the Same Photo: The Story Behind the Duplicate Image Crisis
Photo: Photo by Luiz Silva on Pexels
Traduzindo…

São Paulo's municipal government is sitting on a digital image archive so bloated with duplicates that storage costs for public-sector visual records reached an estimated 40 percent above what efficient cataloguing would require, according to internal assessments circulated within the Secretaria Municipal de Gestão last year. The problem did not appear overnight. It is the product of at least a decade of disconnected digitisation drives, each launched with its own software, its own contractor, and its own taxonomy — none of them talking to each other.

The reckoning matters now because the Ricardo Nunes administration has flagged digital governance reform as a budget priority for the 2026–2027 fiscal cycle, and procurement decisions made in the next few months will determine whether the city replicates the same cycle of waste or finally builds a unified image management layer capable of detecting and flagging redundant files before they are ingested.

How the Fragmentation Happened

The roots go back to roughly 2013, when the Prefeitura de São Paulo began pushing municipal departments to digitise physical documents and photographs. The Arquivo Histórico de São Paulo, headquartered in the Mooca neighbourhood on Rua Tiradentes, ran its own scanning programme using a metadata standard that was never adopted city-wide. Meanwhile, the communications secretariat and individual subprefeituras across districts like Pinheiros, Santana, and Itaquera launched parallel efforts, each uploading images to siloed cloud buckets or local servers without cross-referencing what already existed.

By the time the São Paulo State Government's own Programa de Digitalização do Acervo Público produced its 2019 review, analysts found that some photographic records of major infrastructure projects — including construction along the Marginal Tietê corridor — existed in three or four separate repositories under different file names and inconsistent metadata tags. The same aerial photograph of the Ponte das Bandeiras had been catalogued under at least two different event dates in two different departmental systems, a small but telling example of how the dysfunction compounds over time.

The issue accelerated after 2020, when pandemic-era remote work pushed more departments to store visual assets in commercial platforms without central oversight. Microsoft SharePoint, Google Drive folders, and at least two proprietary document management platforms were all in active simultaneous use across different secretariats, according to procurement records published on the city's transparency portal. Each migration brought fresh uploads without de-duplication checks.

What a Fix Actually Requires

Correcting this is not straightforward. The Centro de Tecnologia da Informação e Comunicação do Município de São Paulo — known as PRODAM — has been tasked with developing a unified digital asset management framework, with a pilot scheduled to begin in the third quarter of 2026. PRODAM's mandate covers infrastructure for all 32 subprefeituras, which means any solution must handle formats ranging from legacy TIFF scans to contemporary drone footage captured for flood-monitoring along the Córrego Ipiranga basin.

Duplicate image replacement, in practice, means more than just deleting copies. Each redundant file may carry unique metadata — different access permissions, different departmental tags, different links embedded in public-facing web pages. Deleting the wrong instance breaks those links. That is why technology consultants working with Latin American municipal governments have pushed for a hash-based matching system that identifies visually identical files regardless of filename, then presents a human reviewer with a consolidated record before any deletion occurs. Rio de Janeiro's municipal archive ran a similar exercise in 2023 across roughly 1.2 million digitised items and recovered approximately 18 terabytes of storage in the first six months.

For São Paulo residents and civil society groups who rely on the Arquivo Histórico for research — particularly those working on urban memory projects tied to neighbourhoods like Bela Vista and Brás — the practical advice is straightforward: download and locally archive any images you need from public portals now, before consolidation exercises temporarily pull records offline for re-cataloguing. PRODAM has indicated that public access windows may be interrupted in rotating batches as the de-duplication pilot runs. The city's transparency portal at prefeitura.sp.gov.br remains the authoritative source for updates on which collections are affected and when.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.