The problem has been building quietly inside the servers of municipal agencies across São Paulo for years. Duplicate digital images — scanned documents, planning photos, public works records — now clog storage infrastructure maintained by the Prefeitura de São Paulo and its associated bodies, and a reckoning is arriving. City IT administrators and archival specialists are now pressing for a formal policy decision before the end of the third quarter of 2026, when the municipal budget review cycle reopens procurement windows for storage contracts.
The timing matters because São Paulo sits at the centre of Brazil's largest public digitisation push. The federal government's Programa de Digitalização de Acervos Públicos, linked to the Ministério da Gestão e da Inovação, has channelled funding toward municipal digitisation projects throughout Lula's current term. That federal pipeline puts local agencies under pressure to demonstrate they are managing what they've already digitised before requesting fresh resources. Duplicate images represent wasted spend — every redundant file occupies physical server rack space, consumes energy, and introduces version-control confusion when public records are requested under Brazil's Lei de Acesso à Informação.
Where the Problem Is Most Acute
The São Paulo Municipal Archive, housed on Rua Voluntários da Pátria in Santana on the north side of the city, holds digitised collections dating back to the early 2000s. Staff there have flagged that batch scanning operations — often contracted out during peak digitisation drives — routinely produced duplicate TIFF and PDF image files because scanning equipment lacked real-time deduplication checking. The same issue surfaced inside the Secretaria Municipal de Urbanismo e Licenciamento, which manages building permit photography and inspection records across all 96 sub-prefeituras. Sources familiar with the workflows — speaking generally about the structural problem, not attributing any specific finding — describe a situation where a single permit application can be associated with three or four identical image files stored in separate folder hierarchies.
The Instituto de Tecnologia e Sociedade do Rio de Janeiro estimated in a 2024 report on Brazilian public digital infrastructure that storage inefficiency linked to duplicate files costs municipal governments across the country hundreds of millions of reais annually in aggregate, though São Paulo-specific figures have not been independently published. The city's annual IT budget, as disclosed in the 2025 Lei Orçamentária Anual, allocated R$1.2 billion to the Secretaria Municipal de Inovação e Tecnologia — making even marginal efficiency gains in storage management financially significant.
Private sector pressure is also building. The cluster of tech companies along Avenida Faria Lima in Itaim Bibi, many of which hold public data-processing contracts with the city, has pushed for clearer standards on image file handling. Without a defined municipal policy on what constitutes an authoritative master image versus a working copy, contractors working on projects like the city's integrated CCTV expansion — managed partly through the Centro de Operações da Prefeitura de São Paulo in Barra Funda — face legal ambiguity about which files they are obligated to retain and for how long.
What Happens Next
The Secretaria Municipal de Inovação e Tecnologia is expected to circulate a draft internal directive on duplicate image identification and deletion protocols before September 2026. The directive is likely to mandate a hash-based deduplication approach — comparing file fingerprints rather than file names — across all agency-managed repositories. This is the same technical standard adopted by the Tribunal de Contas do Estado de São Paulo for its own document management systems in 2023.
Three decisions will define the outcome. First, whether the city mandates a single centralised repository or allows each secretaria to manage its own deduplication. Second, whether contracts with private scanning and storage vendors are renegotiated to assign liability for future duplicates. Third, whether files flagged as duplicates are permanently deleted or archived in a low-cost cold-storage tier before any deletion occurs — a safeguard archivists on Rua Voluntários da Pátria have specifically advocated for, given the irreversible nature of destroying public records.
For ordinary paulistanos, the stakes are practical. Delays in property licensing on Avenida Paulista, permit queries in Pinheiros, or public records requests under the Lei de Acesso à Informação can all stall when agency databases return conflicting or duplicate file versions. A clean, deduplicated archive is not an administrative luxury — it is the infrastructure underneath every interaction between a citizen and a government that has promised to be digital-first.