Tag Archives: software

The future of file format identification

From the Digital Preservation Coalition website:

The National Archives is proposing to launch a new phase of development of its DROID tool, and is seeking to engage with various user groups and stakeholders from the digital preservation community, government and the wider archives sector communities to help inform and discuss potential developments and user needs. As part of this process, The National Archives, in conjunction with the Digital Preservation Coalition, invites interested parties to attend a one day workshop, hosted at Kew, to discuss their experiences of using DROID and PRONOM in their respective disciplines, discuss how the tools fit their use case, and describe both positive and negative experiences of the tools and their interaction with The National Archives.

The conference will be at the National Archives in Kew, London, on November 28. Registration is free for DPC members and associates and cheap for everyone else.

JHOVE2 2.0.0

JHOVE2 2.0.0 has been released. Supported formats are ICC Color Profile, SGML, Shapefile, TIFF, UTF-8, WAVE, and XML. The first three of these aren’t supported by the old JHOVE. There’s also a Zip module which validates files within a Zip repository, but not the Zip file itself. JHOVE2 can be downloaded in Zip or Gzip form, or from the Mercurial repository.

Congratulations to everyone who worked on this project!

JHOVE2 tutorial at IS&T Archiving

Forwarded from Stephen Abrams:

The JHOVE2 project team will be presenting a one day tutorial on the use of JHOVE2 at the IS&T Archiving conference on May 16.

http://www.imaging.org/ist/conferences/archiving/index.cfm

Description

JHOVE2 is an open source framework and application for next generation format-aware characterization of digital objects. Characterization is the process of deriving representation information about a formatted digital object that is indicative of its significant nature and useful for purposes of classification, analysis, and use in digital curation, preservation, and repository contexts. JHOVE2 builds on the success of the original JHOVE characterization tool by addressing known limitations and offering significant new functions, including: object-focused, rather than file-focused, characterization; signature-based file level identification using DROID; aggregate-level identification based on configurable file system naming conventions; rules-based assessment to support determinations of object acceptability in addition to validation conformity; and extensive user configuration options.

The 2011 release of JHOVE2 represents the availability of a significant new tool for digital preservation; this course will provide a broad overview of JHOVE2, as well as detailed information on its functionality, architecture, use in local workflows, and open source community.

Course Objectives:

This short course will give attendees both a broad conceptual overview and detailed information on JHOVE2, and equip them to use the open source tool in their local environments. Specifically, the course will:

  • Define the role of file characterization, including identification, feature extraction, validation, and assessment, in digital curation and preservation workflows.
  • Review the functionality of the JHOVE2 application, including the significant enhancements relative to JHOVE, and new capabilities based on object- and aggregate-level characterization
  • Detail the architecture, componentry, design patterns and Java API’s of the JHOVE2 framework, as well as the configuration options for plug-in modules, characterization strategies and results formatting
  • Demonstrate the use of JHOVE2’s new rule-based assessment capabilities, and integrating these into local workflows to determine object acceptability
  • Cover the community framework for the project, and how individual institutions can both contribute new format modules as well as resources to help extend and sustain the open source project.

Intended Audience:

This course is designed for technologists and practitioners (developers, managers, analysts and administrators) engaged in digital curation, preservation, and repository activities, and whose work is dependent on an understanding of the format and pertinent characteristics of digital assets.

JHOVE CVS repository back up

Because of security issues at SourceForge, all CVS repositories, including the one for JHOVE, were down for a week or so. They’re back up now. SourceForge provides details here.

CVS is getting to be ancient technology, so I may migrate the repository to Subversion at some point.

JHOVE2 poll

There is a poll online for letting the developers of JHOVE2 know what plans you have for it. It just takes a couple of minutes to fill out and doesn’t even require Javascript.