Review Comment:
The paper describes the best practice recommendations identified by the joint W3C/OGC working group on Spatial Data on the Web. Given the ubiquity of spatial data, the topic is of interest not just to those who are involved in creating and maintaining spatial data infrastructures but also to other data providers and users who are no experts on geospatial data.
It is difficult to think of a group of people who are in a better position for communicating the results of the working group. The first three authors of the paper have edited the OGC/W3C Working Group Note "Spatial Data on the Web Best Practices", a note that has been published as a W3C technical report in May 2017 (https://www.w3.org/TR/sdw-bp). The paper submitted to the SWJ shares almost half of its contributors with the technical report.
This raises the question of how the paper relates to the W3C document. As a reader of both texts who did not participate in the discussion process that led to the recommendations I have difficulties in figuring out what exactly the authors intend to add to the technical report by writing the paper. This is my main criticism of the otherwise very interesting and readable paper.
The abstract suggests that the paper provides a kind of digest, which abridges the 96 pages of the W3C document ("summarizes some of the efforts"). Not all readers of the SWJ might need that kind of assistance, though. W3C best practice recommendations are usually written in a quite accessible way. The technical report, for instance, numbers and lists the recommendations in an overview section. Unfortunately, such an overview is missing in the paper making it less readable in this respect than the W3C document. The fact that currently several of the paper's subsections refer to the recommendations by their numbering in the technical report creates confusion.
Besides abstracting the W3C document, the paper seems to pursue additional objectives, but they are less clear. The introduction mentions the objective of publishing the recommendations ("This paper is the first publication of the best practices"). I do not see a basis for this claim. After all, the technical report has already been published.
Another objective consists in providing background information ("presents the rationale underlying the selection of best practices"). While the technical report contains explanations of geospatial concepts, best practice examples, and a rationale for each recommendation, the paper contributes further material. This is where I see most of the value of this companion paper. For instance, the comment in section 2 on the "perceived RDF bias" of prior versions of the recommendations is interesting since it highlights a point of disagreement and helps to better understand that recommendation 3 on linking data is not necessarily one on linked data.
I assume that the authors endorse all 14 OGC/W3C recommendations, but they never say so explicitly. More importantly, they do not explain why they organize the recommendations in a different way as in the technical report. This refers to section 3 of the paper and its mapping onto section 12 of the report, which lists the best practice recommendations. From the current version of the paper, it is difficult identify the individual recommendations or to even tell their number. Some of the subsections (e.g. 3.6 Thematic layering and spatial semantics) seem to relate to several of the OGC/W3C recommendations while recommendation 9 on relative positioning is not covered at all.
The issues mentioned can be addressed by improving the paper's introduction and by providing additional information in some subsections of sections 3 and 4. Detailed remarks follow below.
In conclusion, I think that the paper deals with a timely and highly relevant topic but that it should undergo a major revision before being published.
---
Comments on individual sections
---
1 Introduction
The existence of the W3C document and the role of the submission as a companion paper should be mentioned in the introduction. Currently, the W3C document does not even appear in the list of bibliographic references.
The intention of the companion paper should be clearly stated: a digest? an update? a comment? As a reader, I should understand immediately whether with my specific interests I should read the paper, the W3C document or both.
To make the paper self-contained, provide a table with the 14 OGC/W3C best practice recommendations. It would help a lot if the table also shows how to map the subsections of section 3 onto the recommendations. Alternatively, the table could go into an appendix. In both cases, the subsections could continue to refer to the recommendations by the numbering used in the technical report.
3.1. Spatial Reasoning
"Spatial reasoning" is a misnomer since the section deals with how to best publish geometries.
Wherever best practices are identified which involve standards, like GML or GeoJSON in this section, references to the standards should be included.
This section mentions geospatial concepts, which are defined in the following section, e.g. "coordinate reference system". The text should inform the reader that definitions are going to follow. A definition of "reference geometry format" is missing altogether.
3. The Key Requirements and Best Practices
The introduction states as a criterion for the best practice cases that they are "linked to at least one example of a non-toy dataset". The subsections of this section should provide at least one reference to an example.
3.2. Coordinate Reference Systems and Projections
Curiously, the paper does not identify the most commonly used projected reference system, namely WGS 84 / Pseudo-Mercator (EPSG:3857). I am aware of the fact that the part of the cartographic community has still some objections to Pseudo-Mercator, but it is used by virtually all online map providers (Google, Bing, OSM, etc.) and it is mentioned in the W3C document as "de facto Web-standard CRS".
Fig. 3: It should be mentioned that the tool shows the distortions for a specific projected CRS, very likely WGS 84 / Pseudo-Mercator (EPSG:3857). Another flattening of the sphere may produce rather different distortions.
The text first refers to figure 4 and then to figure 3. This could be avoided by renumbering the figures.
3.3 Spatial identifiers
Provide references (links) to the examples of ressources (DBpedia, GeoNames) and, more importantly, to standards (R2RML) and tools (LDproxy).
3.6
"One of the best practices ..." Which one?
3.7. Temporal Dimension
The subsection repeatedly refers to individual recommendations of the W3C document by the numbering used in that document. As stated in the general comments, a table should list the recommendations together with their numbering to make the paper self-contained.
Why would an accuracy of 3m not be acceptable for street-level directions to a shop?
3.9. Crawlability
"Several examples of spatial things published in this way are provided in Best Practice 2." Include at least one example in the paper.
4.1. Representing geometry on the web
This repeats much of what has already been said in section 3.1: for instance, GML and GeoJSON as the most popular ways to publish geometries on the web or the explanation given for EPSG codes. While some of these redundancies are present also in the original technical report, they are more disturbing in the much shorter paper.
Note that not all common CRS have an EPSG code. Probably the most prominent case is the encrypted GCJ-02 datum, which causes the China GPS shift problem. This datum is massively used for feature geometries in that part of the world but it is very unlikely that it will ever be registered by EPSG because this would involve publishing the details of the encryption. For a different reason, it was far from clear that Pseudo-Mercator obtains an EPSG code.
4.2. A spatial data vocabulary
Add references to the data set examples from the Netherlands, similar to those provided for the data sets from France.
4.3. Spatial aspects of metadata
Add a reference for the national general data portal of the Netherlands mentioned in the text.
|