Linked Data Quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO

Tracking #: 1465-2677

Authors: 
Michael Färber
Frederic Bartscherer
Carsten Menne
Achim Rettinger

Responsible editor: 
Guest Editors Quality Management of Semantic Web Assets

Submission type: 
Survey Article
Abstract: 
In recent years, several noteworthy large, cross-domain and openly available knowledge graphs (KGs) have been created. These include DBpedia, Freebase, OpenCyc, Wikidata, and YAGO. Although extensively in use, these KGs have not been subject to an in-depth comparison so far. In this survey, we provide data quality criteria according to which KGs can be analyzed and analyze and compare the above mentioned KGs. Furthermore, we propose a framework for finding the most suitable KG for a given setting.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Accept

Solicited Reviews:
Click to Expand/Collapse
Review #1
By Sebastian Mellor submitted on 13/Nov/2016
Suggestion:
Accept
Review Comment:

As with my previous review I would accept this article.

Having re-read the article in view of the authors' changes I would agree that they are mostly minor - e.g., addition of supporting information, formalisation or clarifiaction, adjustment of smaller headings and position of some text - and in general, do improve readability. The updated syntactic and semantic validity metric definitions are well described.

Looking at the differences in Table 16 (previously Table 14) and several earlier tables in Section 5 we can see the effects of any changes introduced by the authors' clarifications on the worked example and confirm that the overall ranking of KGs remains unaffected for this demonstration of the framework.

Review #2
Anonymous submitted on 14/Nov/2016
Suggestion:
Accept
Review Comment:

Very glad that comments for last version have been well-addressed by the authors.