Review Comment:
Declaration: I reviewed the initial version of this paper (as Reviewer 3), but not the revision in between.
I appreciate the effort that went into the revisions, which helped to improve the paper and appreciate the changes and extensions; in particular:
*) I acknowledge that the authors aimed to address my previous concerns over the use of the term "usability" (by removeing the term "usability analysis" from the title and adding a reference to a definition). However, the use of the term and the claimed scope of the paper still remains somewhat murky:
- in the introduction, the authors provide a definition in terms of five components (i.e., learnability, efficiency, memorability, errors and falsification) and state that their scope is limited to errors and "to a considerable extent" learning experience.
- In other parts of the paper, however, they suggest that the paper addresses usability in a broader sense, e.g.,
- Abstract: "empirical study comparing two different mapping paradigms from the perspective of usability"
- p. 4: "We have focussed here on usability in the sense of Nielsen's [6] five components."; Given that only 1-2 out of the five components are considered, such statements are misleading.
The confusion may also arise from different levels of abstraction: usability assessments are typically performed for particuluar system implementations and include aspects such as UI, interaction design etc.
By contrast, the authors state that their goal is to contrast mapping *paradigms* in terms of their *conceptual* difficulties (using YARRML and Sparql Anything only as "examples" of these paradigms) and their methodology seems less focused on usability, than on user behavior. I recommend to clearly scope the paper and at a minimum consistently use the chosen terms.
*) Parts of the paper are still exceedingly verbose - this has not changed since the original version of the paper despite my and other reviewers concerns and suggestions. Sections 4-5 in particular are highly anecdotal and still read more like a technical report. I'm also not convinced that the large number of listings provide particular value. Therefore, I suggest to shorten Sections 4-5 and move the detailed listings to an appendix.
*) I appreciate the addition of a Recommendations section, which adds significantly to the value of the contribution. The "recommendations for future development" for both paradigms in particular should be useful.
The value of the "recommendations for users", on the other hand, are a bit less clear. I understand that these recommendations are synthesized from common errors made by users in the study, but (i) most of them are not a particular outcome of the study (e.g., that , but quite basic particularly insightful (i.e., mostly covered in the documentation), and (ii) the general validity of the prioritization is a bit unclear given the low n and the very particular mapping task. Nevertheless, the list of pitfalls to avoid might be helpful to some.
*) I appreciate the added "Comparing the paradigms" section.
*) I appreciate that the authors extended the "overview of the study" section and included some details on the methodology.
Some comments on that: I understand that the study is exploratory and qualitative and that the experimental design is therefore not as rigurous (or: narrow-focused) as a confirmatory study would be, but the description of the experimental design is still quite limited; open questions include:
*) "The particpants were free to choose whether to work with SPARQL Anything or YARRML":
- Was the equal number of participants (9) in each group then just a coincidence or how did the authors ensure a balanced sample across the groups?
- in the limitations section, the authors state that neither of the participant groups had much knowledge of the two specific technologies under trial → we therefore do not learn about their respective merits/usability in the hands of more advanced users.
*) The authors also note that "both sets of participants had difficutlies with JSON arrays". This leaves me to wonder whether/which of the reported problems/errors were due to the familiarity with the respective source format vs. truly attributable to characteristics of the mapping paradigm/language.
*) Thank you for adding a discussion on limitations;
As the goal of such qualitative, exploratory studies is to produce new hypotheses, I was hoping for more insights and directions for future work. Given that the stated goal of the paper was to discern "conceptual differences" between mapping paradigms (using SPARQL Anything and YARRRML only as "examples")"" - I was also expecting more general insights on that level.
*) Terminology: "Triplification" vs. "Path-based" paradigm
I'm still not sure if the distinction between "path-based" vs. "triplifcation" is clear and commonly accepted or if this may be a false dichotomy. The implication that e.g., R2RML is not a "triplification approach" seems to be inconsistent with common use of the term (cf., e.g., https://www.snap4city.org/drupal/node/359, https://pypi.org/project/triplify-csv/). Althought there does not seem to be a single commonly accepted definition in the literature, [1] for instance defines "triplification" as "The process of transforming data stored in relational databases (RDBs) into sets RDF triples is known as triplification" - which would include "path-based approaches".
Section 2.2 seems to confirm that, as the authors state that "All approaches discussed in this paper are used to create RDF triples. In that sense, they are examples of triplification."
As this distinction of two paradigms is the key premise of the the paper, I recommend to clarify this, clearly define the terms, and use them consistently.
[1] Marx, Edgard, et al. "RDB2RDF: A relational to RDF plug‐in for Eclipse." Software: Practice and Experience 43.4 (2013): 435-447.
-----------
# Strenghts
+ A comparison between mapping paradigms and their respective strenghts and weaknesses is relevant and highly useful both from a research and practical perspective.
+ The is now more readable and the added recommendations make it a more useful contribution.
+ Provides empirically derived recommendations for further development of mapping tools
+ Well written and clearly structured
+ Originality
+ Followed some reviewer suggestions and added sections and clarifications
# Weaknesses
- Limited general insights about the relative merits of mapping paradigms.
- Excessively verbose and anecdotal sections
- (Small n, no rigurous experimental design etc) -> ok, but for a qualitative study: limited general insights/hypotheses
- Inconsistent formatting of listings
- falls somwhat short of the goal to yield fundamental insights on mapping paradigms and their relative merits (rather than their particular implementations)
# Minor
I appreciate the minor changes already implemented.
*) The experiment was apparently structured into "questions", but it's not clear to me whether there were any "questions or whether this refers to the "mapping problems/tasks" to be performed by the study subjects.
*) Code listing formats:
I appreciate that the authors changed the listings from low-res images to text, but:
- the listings are still formatted inconsistently (tables with horizontal lines vs. box only)
- the tabular form makes them difficult to read
- I'd recommend to remove the table lines that appear in some listings, include line numbers consistently, and add syntax highlighting
*) The explanation of and reference to Figure 1 at the very end of the paper (beginning of conclusions) is unexpected and seems misplaced.
|