Multidimensional Enrichment of Spatial RDF Data for SOLAP

Tracking #: 2563-3777

Authors: 
Nurefsan Gur
Torben Bach Pedersen
Katja Hose
Mikael Midtgaard

Responsible editor: 
Boyan Brodaric

Submission type: 
Full Paper
Abstract: 
Large volumes of spatial data and multidimensional data are being published on the Semantic Web, which has led to new opportunities for advanced analysis, such as Spatial Online Analytical Processing (SOLAP). The RDF Data Cube (QB) and QB4OLAP vocabularies have been widely used for annotating and publishing statistical and multidimensional RDF data. Although such statistical data sets might have spatial information, such as coordinates, the lack of spatial semantics and spatial multidimensional concepts in QB4OLAP and QB prevents users from employing SOLAP queries over spatial data using SPARQL. The QB4SOLAP vocabulary, on the other hand, fully supports annotating spatial and multidimensional data on the Semantic Web and enables users to query endpoints with SOLAP operators in SPARQL. To bridge the gap between QB/QB4OLAP and QB4SOLAP, we propose an RDF2SOLAP enrichment model that automatically annotates spatial multidimensional concepts with QB4SOLAP and in doing so enables SOLAP on existing QB and QB4OLAP data on the Semantic Web. Furthermore, we present and evaluate a wide range of enrichment algorithms and apply them on a non-trivial real-world use case involving governmental open data with complex geometry types.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Minor Revision

Solicited Reviews:
Click to Expand/Collapse
Review #1
By Alberto Abelló submitted on 25/Sep/2020
Suggestion:
Minor Revision
Review Comment:

The paper has been substantially improved, but still has some issues and typos:
1) "Problem motivation and definition" section (also in some other places) is plenty of "beliefs" that should be certainties. On the other hand, it is still not clear to me what happens if the anotations (or part of them) are never used or suffer more changes than queries. Is it still worth to do it?
2) It is not clear how development times for "GeoNorthWind" were estimated or how accurate/reliable that estimation is. If some reliability cannot be guaranteed, may be better not to report anything.

Typos:
- "This obviously a slow"
- "cahce"
- "ares"
- "use(s) existing"
- "result are"
- "is not be able"

Review #2
Anonymous submitted on 25/Sep/2020
Suggestion:
Accept
Review Comment:

The paper has significantly improved with respect to the previous version and the authors have addressed all made comments. Thus, I recommend the acceptance of the manuscript.