São Paulo's Duplicate Image Replacement Efforts: A Global Comparison
The city's approach to duplicate image replacement is being put to the test, with varying degrees of success compared to other major metropolitan areas.
The city's approach to duplicate image replacement is being put to the test, with varying degrees of success compared to other major metropolitan areas.

São Paulo is tackling the issue of duplicate image replacement, a problem that has plagued the city's digital archives and databases. The key fact is that the city's administration, led by Mayor Ricardo Nunes, has allocated R$ 1.5 million to develop a new system for identifying and replacing duplicate images.
This issue matters now because the city's digital infrastructure is being stretched to the limit, with the growing demand for online services and the need for efficient data management. The duplicate image replacement problem is not unique to São Paulo, but the city's approach to solving it is being closely watched by other major cities around the world, including Tokyo, New York, and London, which are also grappling with similar challenges.
In São Paulo, the duplicate image replacement efforts are being led by the Municipal Department of Information Technology, in partnership with the University of São Paulo and the São Paulo State Data Processing Company (PRODAM). The project is focusing on developing a new algorithm that can identify and replace duplicate images in the city's databases, which are stored in data centers located in the Barra Funda and Santo Amaro neighborhoods. The team is also working with local tech companies, such as Daitan and CI&T, which are based in the city's tech hub on Avenida Paulista.
According to data from the São Paulo City Council, the city's databases contain over 10 million images, of which an estimated 20% are duplicates. The cost of storing and maintaining these duplicate images is significant, with estimates suggesting that it costs the city around R$ 500,000 per year. In comparison, the city of Tokyo has implemented a similar system, which has resulted in a 30% reduction in data storage costs. Meanwhile, the city of New York has taken a different approach, using machine learning algorithms to identify and replace duplicate images, with a reported success rate of 90%.
As the city moves forward with its duplicate image replacement efforts, residents and businesses can expect to see improvements in the efficiency and accuracy of online services. The city's administration is advising residents to report any duplicate images they encounter while using online services, and to provide feedback on the new system. With the project expected to be completed by the end of 2026, the city is on track to become a leader in digital data management, and a model for other cities to follow.
How does this story make you feel?
Spread the word
About this article
Published by The Daily São Paulo
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News