Hacking and mashing preservation issues with AQuA

Attendees at the first AQuA Project event in Leeds, UK, are spending 3 days tackling some preservation issues with their digital collections. We’re trying to answer questions such as:

  • What does my digital collection consist of?
  • What preservation risks does it face?
  • Are any of the files broken?
  • Are there any duplicates?

We’re mid way through the event where we’ve speed dated collection owners, techies and digital preservation experts to form preservation issue solving teams. Some interesting solutions are in development with topics such as image and audio fingerprinting for validation and de-duplication, PDF font validation and risk assessment, metadata/content/ocr consistency checking and automatic identification of damaged video files.

We’re also trying to capture and define the various collections, problems and solutions so that the work we begin here can be taken forward elsewhere. Our work in progress can be found here:

http://wiki.opf-labs.org/display/AQuA/Collections%2C+Issues+and+Solutions

We’ll also be evaluating this approach to working and refining our programme for our next event in London on the 13th June (places are filling up fast, so book now!).

Recent comments

  • Thanks for the correction Gareth. I think that was probably my misunderstanding! Looking forward to...
    paul 1 day 2 hours ago
  • Hi Paul, thanks for the write-up. Just to clarify an aspect of my talk - it's the Autopsy front-end...
    garethknight 3 days 18 hours ago
  • And here's an update on the status of the UDFR from the LoC's excellent digital preservation blog,...
    andy jackson 2 weeks 5 days ago
  • Hi Johan and Andy,   I agree with you both that some formats are worse than others with this,...
    ecochrane 3 weeks 19 hours ago
  • I have to agree with Johan, in that this depends very much on the format in question. There have...
    andy jackson 3 weeks 21 hours ago

Follow Open Planets Foundation on: