Automatic clustering of digital objects in Europeana

In May 2012, a small team of experts from the Europeana Office and from Online Computer Library Center (OCLC) Research Europe started a project to investigate the automatic clustering (grouping) of cultural heritage objects in Europeana.

One of the objectives was to find a way to avoid ambiguity between original and derivative versions of the same object, and to avoid duplication when different providers give access to the same object. Examples of clusters include all parts of the same object (e.g. scanned pages of a book), translated copies of the same archive, and multiple letters belonging to the same set of correspondence.

The findings of this investigation will be presented at the next conference on Theory and Practice of Digital Libraries (TPDL) in September 2013.