Leeds City Council's digital asset management systems currently hold an estimated 340,000 image files across departmental servers, according to figures compiled by the council's internal digital transformation team as of March 2026 — and auditors believe roughly a third of those may be duplicates. That single statistic sits at the heart of a slow-moving but costly problem affecting public bodies and cultural institutions across the city.
The issue has sharpened in 2026 because the council is midway through its Digital Leeds 2030 programme, a five-year modernisation push that involves migrating legacy content onto a unified cloud platform. When you move decades of disorganised files into a single system, the duplicates don't disappear — they multiply. Every department that uploaded its own version of a planning consultation photograph, a civic event image or a public health campaign asset has created a tangled archive that now needs human and automated intervention to untangle.
What the Numbers Actually Show
The scale becomes clearer when you look at individual institutions. Leeds Museums and Galleries, which oversees sites including Kirkstall Abbey and the Leeds City Museum on Millennium Square, reported in its 2025–26 annual operations review that its digital image catalogue had grown to just over 87,000 files. Staff identified approximately 19,000 of those as likely duplicates — images scanned more than once, re-uploaded after system migrations or saved in multiple resolutions without a single master file being designated. At a conservative estimate of 15 minutes of staff time needed to assess and resolve each duplicate record, that is around 4,750 hours of work — more than two full-time employees for an entire year.
West Yorkshire Combined Authority, which funds regional transport and economic development programmes and is based at Wellington House on Wellington Street in Leeds city centre, faces a comparable challenge with its infrastructure photography. Planning documents submitted to the authority's development directorate often arrive with duplicate site images embedded across multiple PDF attachments. A pilot deduplication exercise run on transport corridor data between January and April 2026 found that 28 percent of image assets reviewed were redundant copies of files already held elsewhere in the system.
Storage costs compound the problem. Cloud storage for local government in the UK currently runs at roughly £18 to £22 per terabyte per month through standard public sector framework agreements. High-resolution photography — the kind used in planning applications and heritage documentation — can push individual files beyond 50 megabytes. Scale that across 100,000 unnecessary duplicate files and the annual storage bill for redundant images alone can reach tens of thousands of pounds. For institutions already operating under squeezed budgets, that is money that cannot go to frontline services.
What Leeds Institutions Are Doing About It
The Leeds Digital Festival, which runs its annual programme from venues across the city including the Cloth Hall Court conference centre on Quebec Street, has begun including digital asset management workshops in its schedule specifically because smaller creative businesses in the city face the same duplication problem at a more modest scale. Freelance photographers and design agencies working on projects for Leeds-based clients routinely accumulate image libraries that double or triple in size unnecessarily over multi-year contracts.
Leeds Beckett University's School of Computing began a structured research project in September 2025 examining automated deduplication tools — software that uses perceptual hashing algorithms to identify visually identical or near-identical images without requiring staff to open each file manually. Early results, presented internally in May 2026, suggested the best-performing tools could process around 10,000 images per hour with an accuracy rate above 94 percent on exact duplicates, though near-duplicate detection — catching slightly cropped or colour-corrected versions of the same image — remained harder to automate reliably.
For any Leeds organisation wrestling with this now, the practical steps are straightforward even if the workload is not: conduct a file count audit first, identify the worst-offending departments or project folders, and run a free perceptual hashing tool such as dupeGuru on a sample batch before committing to enterprise software. The Digital Leeds 2030 programme is expected to publish guidance for council departments on image governance standards by October 2026 — at which point institutions across the city will have a clearer framework to work from, rather than each one solving the same problem independently.