P. Petrov, C. Becker:
"Large-scale content profiling for preservation analysis";
Poster: 9th International Conference on Preservation of Digital Objects (IPRES 2012), Toronto; 2012-10-01 - 2012-10-05; in: "Proceedings of the 9th International Conference on Preservation of Digital Objects (IPRES 2012)", R. Moore, K. Ashley, S. Ross (ed.); (2012), 2 pages.

English abstract:
The starting point of any operational endeavor to preserve digital content is gaining a deep understanding of the characteristics of the objects. Systematic analysis of digital object sets and the identification of sample objects that are representative of a collection are critical steps towards preservation operations and a fundamental enabler for successful preservation planning: Without a full understanding of the properties and peculiarities of the content at hand, informed decisions and effective actions cannot be taken. This article presents a software tool prototype that is able to profile large sets of metadata in a scalable fashion and provide deeper insight into the digital collection at hand.

Keywords:
digital preservation, MapReduce, scalability, preservation planning, content profiling, collection profiling

