In a previous post, I asked for input on what sort of video metadata FITS should produce, and I’ve gotten a number of responses. Here I’d like to dig in some more and hopefully get more feedback.
We’re talking here primarily about technical metadata. Following the Harvard Library definition, “Technical metadata focuses on how a digital object was created, its format, format-specific technical characteristics, storage and location, etc. Accurate technical metadata helps a repository deliver digital content appropriately to users and to manage digital objects over time and keep them usable.” Other kinds of metadata are important, but the job of FITS is primarily to report what kind of file a digital object is: its format, size, compression method, encoding, etc. Some other kinds of metadata, such as copyright information (rights metadata), are reported, but technical metadata is primary.
I’m interested in a model, not a specific schema or representation. FITS pulls information from many different tools; the question I’m looking into is what properties to report, and with what vocabulary.
For still image metadata, we can go to Exif as a model. For audio, we have the AES standards. In the video realm, things are less settled so far. MPEG-7 offers a standard, but it focuses on the characteristics of the sounds and images, not of the files. It’s technical metadata, but at a level of abstraction which is less relevant to file characterization software.
XMP Dynamic Media looks more useful. It covers low-level technical properties such as frame rate, pixel depth, and compression. The high-level properties for music (no flat keys and only a limited choice of time signatures) are poorly thought out, but they aren’t relevant here.
Some people have recommended MediaInfo‘s output as a model to follow. I’m really not very impressed. Its output isn’t particularly consistent. Archivematica’s list of significant characteristics of video files is more useful, offering some properties which XMP doesn’t cover.
I’ve put together a spreadsheet of properties which represent my current thinking. Suggestions and comments are welcome.
The future of WebM
Yesterday I posted about the WebP still image format, expressing some skepticism about how easily it will catch on. Its companion format for video, WebM, may stand a better chance, though. Images aren’t exciting any more; JPEG delivers photographs well enough, PNG does the same for line art, and there isn’t a compelling reason to change. Video is still in flux, though, and the high bandwidth requirements mean there’s a payoff for any improvements in compression and throughput. The long-running battle among HTML5 stakeholders over video shows that it’s far from being a settled area. Patents are a big issue; if you implement H.264, you have to pay money. Alternatives are attractive from both a technological and an economic standpoint.
With Google pushing WebM and having YouTube, there’s a clear reason for browser developers to support it. YouTube plans to use the new WebM codec, VP9, once it’s complete. I haven’t seen details of the plan, but most likely YouTube will make the same video available with multiple protocols and query the browser’s capabilities to determine whether it can accept VP9. If the advantage is real and users who can get it see fewer pauses in their videos, more browser makers will undoubtedly join the bandwagon.
Comments Off on The future of WebM
Posted in commentary
Tagged Google, standards, video