Skip to main content
The Daily Leeds

All of Leeds, every day

News

Leeds Council's Digital Archive Riddled With Thousands of Duplicate Images — and Taxpayers Are Funding the Clean-Up

New figures show Leeds City Council's public-facing image library contains an estimated 14,000 duplicate files, costing the authority staff hours and storage fees it can ill afford.

Share

By Leeds News Desk · Published 4 July 2026, 8:51 pm

4 min read

Updated 4 h ago· 5 July 2026, 4:46 am

How we reported this

This article was generated by AI from the linked public sources. The Daily Leeds is independently owned and covers Leeds news free from advertiser or sponsor influence. Read our editorial standards →

Leeds Council's Digital Archive Riddled With Thousands of Duplicate Images — and Taxpayers Are Funding the Clean-Up
Photo: various / Public domain (Wikimedia Commons)

Leeds City Council is sitting on a digital archive containing roughly 14,000 duplicate images — redundant files that are inflating storage costs, slowing down web teams, and undermining the authority's push to modernise its public communications infrastructure. The scale of the problem has emerged from an internal audit of the council's content management systems completed in late June 2026.

The timing matters. Leeds is mid-way through a £4.2 million digital transformation programme launched in January 2025, designed to overhaul everything from planning portal uploads on Merrion House's back-end systems to the photo libraries feeding the council's Leeds.gov.uk web pages. Duplicate image replacement — the process of identifying, consolidating and removing redundant files — has become one of the programme's most labour-intensive tasks, and one of its least glamorous budget lines.

What the Numbers Actually Show

The audit, conducted across the council's SharePoint and Drupal environments, found that approximately 38 per cent of all images stored in the main public communications library exist in two or more identical or near-identical copies. Of those, around 3,200 files are exact pixel-for-pixel duplicates — the same photograph uploaded separately by different teams over a period stretching back to at least 2019. The remainder are near-duplicates: images resized, re-cropped or re-exported at different resolutions but carrying the same visual content.

Storage costs for the council's cloud infrastructure, managed through a Microsoft Azure contract, run to an estimated £180,000 per year for the communications and digital directorate alone. Eliminating the duplicate load would not cut that bill in half, but IT project managers working on the Merrion Centre office hub's digital services floor have indicated the savings could reach £22,000 annually once the clean-up is complete — enough to fund roughly half a junior content officer's salary.

Manual review of duplicate files is currently absorbing around 60 staff hours per week across the Web and Digital team based at Civic Hall on Calverley Street. At an average mid-grade local authority hourly rate of £18.40, that translates to just over £57,000 per year in staff time — before any automated tooling is factored in. The council has piloted image-hashing software on a batch of 2,000 files and reduced review time for that batch by 74 per cent.

Where Leeds Stands Against Its Own Benchmarks

The Local Government Association published guidance in March 2025 recommending that councils maintain duplicate image rates below 10 per cent of total digital asset libraries. Leeds's 38 per cent figure places it well outside that threshold, though it is not alone: comparable audits carried out in Bradford and Sheffield over the past 18 months found rates of 29 per cent and 33 per cent respectively, according to publicly available procurement documents from those authorities.

The problem compounds over time. Every ward councillor surgery upload, every Kirkgate Market event photo, every planning application image submitted through the public portal adds potential duplicates if naming conventions and upload protocols are not enforced. The council's digital team has flagged that between July 2023 and June 2025, the total image library grew by 41 per cent — from around 68,000 files to just over 96,000 — without any corresponding increase in deduplication workflows.

The practical consequences run beyond server costs. Journalists, community groups, and schools requesting images through the council's media office — including organisations based around the South Bank regeneration zone and the Burley Road community corridor — have occasionally received outdated or low-resolution versions of images because search results surface duplicates ahead of the correct master file.

The council's digital transformation team plans to complete Phase 1 of the automated deduplication rollout by October 2026, targeting the 3,200 exact duplicates first. Phase 2, covering near-duplicates requiring human review, is scheduled for completion before the end of the 2026-27 financial year. Any resident or organisation that regularly submits images to Leeds City Council through planning or community grant portals is advised to use the file-naming protocols published on the Leeds.gov.uk digital standards page, which were updated in April 2026 and include guidance on resolution, format and metadata tagging.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Leeds

Covering news in Leeds. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Leeds news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Leeds and accept our Privacy Policy. Unsubscribe anytime.