Skip to main content
The Daily Leeds

All of Leeds, every day

News

Leeds Tackles Thousands of Duplicate Photos Cluttering Public Archives

Years of piecemeal digital record-keeping across council departments left the city with thousands of duplicate photographs cluttering its public archives, but a new remediation programme is now underway.

Share

By Leeds News Desk · Published 4 July 2026, 7:36 pm

4 min read

Updated 18 min ago· 5 July 2026, 2:59 am

How we reported this

This article was generated by AI from the linked public sources. The Daily Leeds is independently owned and covers Leeds news free from advertiser or sponsor influence. Read our editorial standards →

Leeds Tackles Thousands of Duplicate Photos Cluttering Public Archives
Photo: Photo by Ffion Scott on Pexels

Leeds City Council's digital asset library contains an estimated 40,000 duplicate image files — some appearing as many as seven times in the system — the result of more than a decade of uncoordinated uploads across departments ranging from planning to parks. The council's Digital Transformation Unit confirmed earlier this year that a formal duplicate image replacement programme had been formally scoped and funded, with work beginning in earnest from January 2026.

The timing matters because Leeds is in the middle of the most intensive phase of its city centre regeneration in a generation. The South Bank development corridor, stretching from Granary Wharf south toward the former Yorkshire Post printing works on Wellington Street, is generating new visual documentation almost weekly. Without clean, deduplicated archives, planning committees, heritage officers, and communications staff are routinely pulling the wrong image versions when producing public-facing materials — a problem that became acute during the Aire Valley consultation rounds in late 2024.

A Problem Built Over Years

The roots of the duplication crisis trace back to 2011, when Leeds City Council migrated from a legacy file server system to its first cloud-based digital asset management platform. At the time, individual departments — including Housing Leeds, Leeds City Development, and the Leeds Museums and Galleries service — were given separate upload credentials but no unified taxonomy or file-naming convention. The result was predictable: the same photograph of, say, Kirkgate Market's Victorian iron roof would be uploaded by the markets team, the heritage team, and the tourism communications office, each time under a different filename and often with conflicting metadata.

By 2018, internal audits flagged the problem but estimated remediation would require roughly 1,200 staff hours and a budget the council was not prepared to commit during a period of central government austerity settlements. The issue was deferred. A second audit in 2022 — conducted as part of a broader digital review commissioned by the council's then-Chief Digital Officer — found the problem had roughly tripled in scale. The 2022 review, whose findings were presented to the council's scrutiny board, noted that storage costs for redundant files were running at an avoidable overhead, though the specific figure from that document has not been made publicly available by the council.

Leeds Museums and Galleries, which manages collections across venues including Temple Newsam and Lotherton Hall, was identified in the 2022 review as one of the worst-affected services. Archivists there had been maintaining their own parallel image library entirely separate from the council's central system — meaning duplicates existed not just within the main platform but between platforms altogether.

What the Remediation Programme Actually Involves

The current programme, budgeted at £280,000 across an 18-month delivery window running to June 2027, uses a combination of automated hash-matching software and human review. The software flags files with identical or near-identical pixel data; trained reviewers from the council's Records Management Service then verify which version carries the most complete and accurate metadata before the duplicates are retired to a quarantine folder rather than deleted outright — a safeguard insisted upon after heritage officers raised concerns about permanent loss of contextual information.

Photographs connected to the Leeds Waterfront Heritage Trail and the Meanwood Valley Urban Farm, both of which have large public-facing digital presences, are among the first batches scheduled for cleaning under the programme's Phase One workplan, which covers the period from January to September 2026.

For residents and organisations that license council images — local journalists, community groups, schools — the practical change will be a cleaner search experience on the council's public-facing image portal, which currently returns duplicate results for roughly one in three searches on common terms like "Leeds city centre" or "River Aire." The portal is expected to carry a visible remediation notice from September 2026 onward as batches are progressively cleaned. Anyone who has previously downloaded images for ongoing projects is advised to re-download after that date to ensure they hold the canonical, correctly tagged version of each file.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Leeds

Covering news in Leeds. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Leeds news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Leeds and accept our Privacy Policy. Unsubscribe anytime.