Assinatura gratuita
The Daily São Paulo

São Paulo news, every day

News

São Paulo Tackles Duplicate Image Sprawl in City Records — and Finds Itself Ahead of the Curve

As urban administrations worldwide scramble to clean up bloated digital archives, São Paulo's municipal technology office is quietly running one of Latin America's largest automated deduplication programmes.

By São Paulo News Desk · Published 4 July 2026, 4:45 pm

3 min read

São Paulo Tackles Duplicate Image Sprawl in City Records — and Finds Itself Ahead of the Curve
Photo: Photo by Giovanna Kamimura on Pexels
Traduzindo…

São Paulo's city hall has been sitting on a problem that sounds mundane until you see the numbers: tens of millions of scanned documents, permit photos, and infrastructure inspection images stored across municipal servers — many of them duplicated two, three, or even four times over. The Secretaria Municipal de Inovação e Tecnologia, based in the Centro district, confirmed earlier this year that its digital asset review programme had identified redundant image files consuming a significant share of storage allocated to the city's GeoSampa mapping platform and the SP156 citizen services portal.

The timing matters. Brazil's Lei Geral de Proteção de Dados, which took full enforcement effect in August 2021, requires public bodies to maintain accurate, non-redundant records. Duplication is no longer just a storage headache — it creates legal exposure when authorities cannot confirm which version of a document is authoritative. The federal government's push toward a unified digital identity infrastructure under the Meu Gov.br platform has added pressure on municipal administrations to clean house before integration deadlines.

What São Paulo Is Actually Doing

The city's approach centres on two overlapping initiatives. The first runs inside the Centro de Operações São Paulo, the so-called COR facility near the Viaduto do Chá that coordinates emergency response and urban monitoring. Engineers there use perceptual hashing — a technique that detects visually identical or near-identical images even when file names differ — to flag duplicate drone footage and flood-sensor photographs captured during the rainy season. The second programme operates through the Empresa de Tecnologia da Informação e Comunicação do Município de São Paulo, known as PRODAM, which manages back-end infrastructure for roughly 40 city departments. PRODAM began a phased deduplication rollout in March 2025 across servers handling building permit archives in districts including Pinheiros, Mooca, and Santana.

Neither programme is flashy. Neither has generated a ribbon-cutting on Paulista Avenue. But municipal IT staff describe a systematic effort to reduce storage costs and comply with federal data standards — an effort that, measured against comparable cities, puts São Paulo in a relatively strong position.

How São Paulo Compares Globally

The comparison is instructive. Mexico City's Agencia Digital de Innovación Pública, which oversees a city of similar scale, launched its own digital archive audit only in late 2024, meaning its deduplication work is roughly 18 months behind São Paulo's. Bogotá's Secretaría Distrital de Hacienda acknowledged publicly in 2025 that duplicate records in its cadastral image database were complicating property tax assessments in the Chapinero and Usaquén localities — a problem São Paulo's PRODAM says it largely resolved in its own Imposto Predial e Territorial Urbano systems by mid-2024. In Europe, Milan's municipal digital office began a similar exercise after the city migrated to a new cloud provider in January 2025, but the project remains in an audit phase rather than active deletion.

What gives São Paulo an edge is scale of prior investment. The city has spent over R$1.2 billion on digital infrastructure modernisation since 2019, according to budget documents published by the Câmara Municipal de São Paulo. That spending funded the server consolidation that made automated deduplication technically feasible in the first place. Smaller Latin American capitals lack the underlying infrastructure to run the same tools at municipal scale.

There are genuine gaps. Outer periphery administrations — the subprefeituras covering areas like Grajaú and Parelheiros — still rely on older document management systems that are not yet connected to PRODAM's central deduplication pipeline. Images uploaded locally in those areas are not currently screened, meaning the redundancy problem persists at the edges of the city even as it shrinks at the centre.

The next scheduled review of the programme is set for the fourth quarter of 2026, when PRODAM is expected to publish a progress report to the Secretaria Municipal de Inovação e Tecnologia. City residents who want to flag suspected errors in official records — including duplicate or conflicting property images — can file a request through the SP156 portal or in person at any Poupatempo unit, including the high-traffic facility at Parque Dom Pedro II in the Centro district. The process takes up to 30 working days under current service standards.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily São Paulo

This article was produced by the The Daily São Paulo editorial desk and covers news in São Paulo. See our editorial standards for how we use AI.

The Daily São Paulo brief

The day's São Paulo news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to São Paulo news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily São Paulo and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily São Paulo

More in News

Enjoyed this story? Get tomorrow's briefing free.