From Tuesday 19th November until Thursday 21st November the internal SCAPE Developer’s Workshop was held at the Brno University of Technology.
Learning to Think Like a Package Maintainer
Lots of great digital preservation applications and services exist, however very few are actively maintained and thus preserved! This is a big problem! By introducing the steps to develop these and engage the support of the community, this training course looks at what can be done to improve this situation. Specifically, this training course looks at how to prepare packages for submission into the very heart of many digital environments; the operating system and directly associated “app-stores”. Attendees will be given hands-on experience with developing and maintaining packages rather than software and key differences will be discussed and evaluated. Better preservation of preservation tools, means better preservation our digital history.
Learning Outcomes (by the end of the training event the attendees will be able to):
- Understand the complexities of package management and distinguish between the different practices relating to both package objectives and chosen programming language.
- Be able to carry out advanced package management operations in order to critically appraise current packages and propose changes.
- Understand the importance of clearly defined versioning and licenses and the role of clear documentation and examples.
- Apply best practice techniques in order to create a simple package suitable for long term maintenance.
- Evaluate a number of options for managing package configuration and behavior relating to package installation, removal, upgrade and re-installation.
- Analyse opportunities for automating package management and releases, maintaining a clear focus on the user and not the developer.
- Critically evaluate opportunities to generalise package management to allow the easy building and maintenance of packages on multiple platforms.
- Assess the potential to apply package management techniques in your own environment.
Delegates will receive a certificate of attendance for the training course.
The agenda can be seen here: http://wiki.opf-labs.org/display/SP/Agenda+-+Preserving+Your+Preservation+Tools.
Registration is now open! https://scape-preserving-tools.eventbrite.co.uk
What is preservation watch?
The reason why we should worry about preservation of digital content and why some preservation action needs to be done is closely related to the idea that content is at risk. The risk relates to the potential of losing something of value, weighted against the potential of gaining something of value.
From the very beginning of the SCAPE project on, it was a requirement that the SCAPE Execution Platform be able to leverage functionality of existing command line applications. The solution for this is ToMaR, a Hadoop-based application, which, amongst other things, allows for the execution of command line applications in a distributed way using a computer cluster. This blog post describes the combined usage of a set of SCAPE tools for characterising and profiling web-archive data sets.
It's been two weeks since the internal SCAPE developer workshop in Brno, Czech Republic. It was a great workshop. We had a lot of presentations and demos, and were brought up to date on what's going on in the other corners of the SCAPE project. We also had some (loud) discussions, but I think we came to some good agreements on where we as developers are going next. And we started a number of development and productisation activities.
- Travis compiles the projects and executes unit tests whenever a new commit is pushed to Github, or when a pull request is submitted to the project.
- Jenkins builds are generally scheduled once per day. After a build the software has its code quality analysed by Sonar
This webinar provides an introduction to file format identification and characterisation tools which have been developed or extended as part of the SCAPE Project.
It covers the basic principals of file format identification, and shows how format information drives digital preservation workflows.
Participants will be given an overview of file format registries, and their role in digital preservation, and will see demonstrations of identification and characterisation tools including fido and tika.
We will provide a Virtual Machine image with samples files and step-by-step worksheets to allow participants to try out these exercises for themselves after the webinar with support.
Learning outcomes (by the end of the webinar and exercises, participants
will be able to):
- Distinguish between different file types and identify the requirements for characterising each of them.
- Carry out identification and characterisation experiments on example files.
- Compare characterisation and identification tools and understand their advantages and disadvantages when used in different scenarios.
Session Lead: Carl Wilson, OPF
Date: Friday 25 October
Time: 12 noon BST / 13:00 CET
Duration: 1 hour (please note this includes the presentation and demonstrations. Practical exercises can be carried out after the webinar).
There are 25 places available which will be allocated on a first come, first serve basis.
My previous blog Assessing file format risks: searching for Bigfoot? resulted in some interesting feedback from a number of people. There was a particularly elaborate response from Ross Spencer, and I originally wanted to reply to that directly using the comment fields.