Measuring Bigfoot
My previous blog Assessing file format risks: searching for Bigfoot? resulted in some interesting feedback from a number of people. There was a particularly elaborate response from Ross Spencer, and I originally wanted to reply to that directly using the comment fields.
Optimising archival JP2s for the derivation of access copies
Like many other organisations that are using JPEG 2000, the KB produces two representations of most of its digitised content (newspapers, books, periodicals):
Identification of PDF preservation risks: the sequel
SCAPE Planning and Watch: Two years and a bit more
ICC profiles and resolution in JP2: update on 2011 D-Lib paper
It’s been more than two years now since I wrote my D-Lib paper JPEG 2000 for Long-term Preservation: JP2 as a Preservation Format. From time to time people ask me about the status of the issues that are mentioned in that paper, so here’s a long overdue update.
Open Research Challenges in Digital Preservation: Call for contributions!
Following the community response to our workshop last year, we want to invite you again to contribute your future preservation challenge!
EPUB for archival preservation: an update
Last year (2012) the KB released a report on the suitability of the EPUB format for archival preservation. A substantial number of EPUB-related developments have happened since then, and as a result some of the report’s findings and conclusions have become outdated.
PDF Eh? – Another Hackathon Tale
“Characterization” can mean many things (I’m particularly fond, especially in this context, of the OED’s “creation of a fictitious character or fictitious characters”).
What do we mean by "embedded" files in PDF?
The most important new feature of the recently released PDF/A-3 standard is that, unlike PDF/A-2 and PDF/A-1, it allows you to embed any file you like. Whether this is a good thing or not is the subject of some heated on-line discussions. But what do we actually mean by embedded files?
Identification of PDF preservation risks with Apache Preflight: a first impression
The PDF format contains various features that may make it difficult to access content that is stored in this format in the long term. Examples include (but are not limited to):

