Europeana Linked Open Data – data.europeana.eu
Submission in response to http://www.semantic-web-journal.net/blog/semantic-web-journal-special-ca...
Revised manuscript after "accept with minor revisions" - now accepted. The reviews from the first round are below.
Solicited review by Francois Scharffe:
The paper presents the pilot Europeana dataset. The dataset is important, rich and complex. It is a pilot as lessons learnt will enable to revise the publication. The paper is well written and gives a good overview of the dataset structure.
two minor remarks:
- Section 2 it is said that semantic markup is available on Web pages. It would be good to cite the technology used for the markup RDFa ? schema.org ?
-dereferencable -> dereferenceable
Solicited review by Dave Kolas:
This paper describes a prototype Linked Data version of the Europeana dataset.
* Quality of the dataset
The Europeana data on the museum / library resources is aggregated from a number of holders of the physical resources, thus the original providers have motivation to make the data accurate. It is possible that the aggregation of many sources means that some sources produce different subsets of data for the schema. The schema addresses the problem of multiple potentially conflicting records about a resource with proxies. It is not clear whether this is a better or worse approach than reification or named graphs for this purpose, but it appears sufficient. The other schema modeling is reasonable, though light on the interlinking (as noted in the paper). The authors do a good job of linking to other datasets, though it would be interesting to see percentages as well as raw links.
* Usefulness (or potential usefulness) of the dataset
This dataset could be potentially useful to a large number of people involved in or interested in the arts in Europe. It could also be combined with travel applications to know where to see particular works of interest. The prototype nature of the dataset leaves out much of the content currently in the non-linked-data Europeana dataset, somewhat mitigating its utility for the moment.
* Clarity and completeness of the descriptions
The paper is written clearly and concisely. The main classes in the data model are described well, and there is a good diagram of how these classes interact. An example record with properties might have been nice however.
Solicited review by Amit Joshi:
The paper is about the Europeana linked open data which contains open metadata with more than 2.4 million text,images, videos and sounds related to books, paintings, films, museum objects and archival objects throughout Europe. Data is gathered by Europeana from multiple data providers. Metadata is obtained from data providers, formatted according to ESE XML Schema and then converted to EDM for generated linked data version. The dataset is live and can be accessed either by downloading data dumps or executing SPARQL queries against the dataset. The significance of such unique dataset being open is, without any doubt, high. However, the paper has following weaknesses:
1. Use of provider proxy and Europeana proxy is not clear. Is it even required?
2. It would be good to provide examples of the items/resources in a dataset that uses existing ontologies and connects to other LOD datasets.
3. Number of references is very few (only two). Please revisit earlier sections and add additional references (ex: linked data principles)