Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

São Paulo's Digital Archives Face a Reckoning Over Duplicate Images: The Key Decisions Ahead

Municipal systems and cultural institutions are being forced to choose how — and how fast — to clean up sprawling databases bloated with redundant visual records.

By São Paulo News Desk · Published 4 July 2026, 3:40 pm

3 min read

São Paulo's Digital Archives Face a Reckoning Over Duplicate Images: The Key Decisions Ahead
Photo: Photo by Gabriel Schincariol Cavalcante on Pexels
Traduzindo…

São Paulo's public sector is sitting on a problem it has spent years avoiding. Across municipal platforms, from the Secretaria Municipal de Cultura's digital collections to the databases underpinning the Nota Fiscal Paulistana program, duplicate image files have accumulated into a significant administrative and financial drag — and the window for a clean, orderly solution is narrowing fast.

The issue surfaced with renewed urgency this year after Brazil's Lei Geral de Proteção de Dados, in force since 2020, began drawing stricter enforcement attention from the Autoridade Nacional de Proteção de Dados. Redundant records — including duplicate photographs of citizens, properties and public events — now carry legal exposure they did not a decade ago. Every extra copy of an image linked to an identifiable person is, in regulatory terms, a liability sitting in a server room.

Why São Paulo's Scale Makes This Harder Than Almost Anywhere Else

This is not a small-scale filing problem. São Paulo's municipal government operates across 96 subprefeituras and services a population of roughly 12 million residents in the city proper. The volume of imagery generated annually — from urban surveillance to property cadastre updates to public health campaigns — is enormous. Technology consultancies working with state-level clients in the Paulista Avenue corridor have described the city's institutional image repositories as among the most fragmented in Latin America, though the municipality has not published a consolidated audit figure.

The Museu da Imagem e do Som, on Avenida Europa in Jardins, has been grappling with its own version of this challenge since digitising thousands of analogue records over the past five years. Duplicate scans, inconsistent metadata and format mismatches have complicated the museum's goal of making its collection publicly searchable. The problem is technical, but the decision about how to resolve it — whether to invest in automated deduplication software, hire specialist archivists, or pursue a phased hybrid approach — is fundamentally a budget and governance question.

At the city's Centro Empresarial de São Paulo in Itaim Bibi, several technology firms in the document management sector have been pitching automated duplicate-detection tools to municipal and state clients since at least 2024. Pricing for enterprise-grade image deduplication platforms typically ranges from R$80,000 to R$400,000 annually for large-scale public sector deployments, according to publicly available vendor documentation, though contract terms vary widely by scope.

Three Decisions That Will Define the Outcome

The next six months are critical. Mayor Ricardo Nunes's administration faces three concrete choices that will shape how this plays out.

First, the city needs to decide whether to centralise the deduplication effort under a single coordinating body — most likely the Secretaria Municipal de Inovação e Tecnologia — or allow each department to manage its own cleanup. Decentralised approaches have historically produced inconsistent results in São Paulo's public administration, where inter-secretariat coordination has long been a structural weakness.

Second, the Câmara Municipal de São Paulo will need to determine whether any budget reallocation is required for the 2027 Plano Plurianual, which is scheduled to be drafted in the second half of this year. Technology infrastructure line items have routinely been among the first cut during fiscal tightening, and the current federal government's emphasis on digital governance under the Estratégia Federal de Governo Digital creates both pressure and a potential funding hook for the city to align its own programs.

Third, institutions like the MIS and the Pinacoteca do Estado, on Praça da Luz in the Luz neighbourhood, will need to make independent decisions about whether to adopt shared technical standards or continue managing image metadata in silos. A shared São Paulo cultural sector standard would reduce long-term costs and improve public access, but requires institutional trust that has not always been easy to build.

None of these decisions can be deferred indefinitely. As LGPD enforcement matures and as the city prepares for an intensified period of digital service delivery ahead of the 2028 municipal elections cycle, the administrative cost of doing nothing will compound. The images are already there. The question is who in the Palácio do Anhangabaú — and in the secretariats that ring it — takes ownership of the cleanup before the window closes.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.