Staff at the Northern Territory Library on Parliament Square and the Museum and Art Gallery of the Northern Territory on Conacher Street have spent this week running systematic audits after internal reviews found their combined digital collections contained thousands of duplicate image files—some records appearing four or more times in separate cataloguing systems. The clean-up, which began in earnest on Monday 30 June, is the most significant data remediation effort either institution has undertaken since both migrated to a shared collections management platform in 2023.
The timing is not coincidental. The NT Government has signalled it wants its cultural institutions ready for a digital access push ahead of the 2026 Garma Forum, where First Nations archival material is expected to feature prominently in discussions about cultural sovereignty. Duplicate records create real administrative damage: they fragment search results, complicate repatriation paperwork for Aboriginal communities, and can cause the same restricted image—material that communities have designated as sacred or secret—to appear in public-facing portals under a different file name. That last problem, archivists say, is not hypothetical.
What the Audit Found
The NT Library's digital team identified more than 8,400 candidate duplicate image records across its holdings as of Thursday, with roughly 1,200 of those flagged as high-priority because they involve photographs of Aboriginal ceremonial content held under cultural protocols. The Museum and Art Gallery of the Northern Territory, which houses one of the country's most significant collections of Yolŋu bark paintings and artefacts from Arnhem Land, is working through a parallel list of approximately 3,600 suspect duplicates in its visual media archive.
The problem has roots in at least three separate digitisation rushes: a grant-funded project in 2014, another round tied to the centenary of World War One commemorations in 2015 and 2016, and a COVID-era push in 2020 when volunteer scanning teams worked remotely and batch-uploaded files without standardised naming conventions. When the two institutions began sharing a single Axiell EMu cataloguing environment in 2023, pre-existing duplicates from both organisations merged into one database without automatic de-duplication.
The NT Government allocated $340,000 in the 2025–26 budget for digital infrastructure upgrades across its cultural institutions, a line item that covers the current audit software licences and contractor hours. That figure does not include staff time from permanent employees at either Mitchell Street-precinct organisation. Archivists are using a combination of perceptual hash matching software and manual review, a process they expect to complete by mid-August.
What Happens to the Flagged Files
Not all duplicates will simply be deleted. Where two versions of the same image carry different metadata—different donor names, different date estimates, or different community permissions—staff are merging the records rather than removing one outright. Darwin-based Indigenous data governance organisation First Nations Media Australia has a standing agreement with both institutions to be consulted before any image associated with Aboriginal communities is permanently removed or consolidated, a protocol that adds review time but is considered non-negotiable under the institutions' current community agreements.
For Darwin residents with family photographs or historical images lodged with either institution, the practical advice is straightforward: if you donated material before 2020 and have never received a formal cataloguing confirmation, contact the NT Library's Oral History and Private Records unit on Parliament Square or the MAGNT's Registration desk on Conacher Street before the end of July. Staff are cross-checking donor records as part of the audit and can confirm whether your material has been correctly attributed or is sitting inside a duplicate cluster awaiting resolution.
The broader lesson the institutions appear to be absorbing—slowly—is that speed and volume targets during digitisation projects create downstream costs that dwarf the original grant funding. At eight-and-a-half thousand problem records and counting, Darwin's archivists are this week learning exactly how expensive cutting corners on file naming can get.