MoRe Quality by Vangelis Banos (Future LIbrary – Greece) is the winning prototype application of the LoCloud hackathon which took place on the 11th of February 2015 at the premises of the Google Cultural Institute in Paris, France.The hackathon was organised by LoCloud and Europeana in the context of EuropeanaTech 2015.
Vangelis Banos, winner of the LoCloud Hackathon
The Metadata & Object Repository (MoRe) is an easy and powerful tool to aggregate information and harvest metadata from multiple sources in multiple schemas. Such aggregation schemas usually create problematic situations regarding the quality of the harvested metadata.
Metadata may pass the standard Europeana XML validity tests but they may include problematic metadata values. For instance:
* a dc:date value could be formatted in the wrong way:
<dc:date>approximately 18th century</dc:date>
This format is not correct according to established date formats.
* an author name could be incomplete according to bibliographic standards.
* a URL may be invalid. E.g.: <ese:isShownAt>http://invalidurl.com/error-url</ese:isShownAt>
The aim of the MoRe Quality tool is to implement a validation system which could be able to catch these errors and produce useful reports to the collection administrators.
MoRe Quality is implemented using linux and python 2.7.
Some common python modules are utilised:
* Virtual environments
* Python Requests
The prototype is not currently running on a production server but the full source code freely available at: https://bitbucket.org/vbanos/more-quality/
Anyone interested in MoRe Quality should feel free to contact the author for more information.