Review Comment:
The paper deals with conjunctive queries with OWL ontologies. Specifically, it combines some techniques for efficiently computing the upper and lower bounds of query answers, and use a full-fledged reasoner if there is a gap between the bounds. The approach has been implemented in the open-source systems RSAComb and ACQuA. Experiments show that the system outperforms the PAGOdA system.
The paper has an overall clear structure. The general idea is not difficult to follow. The notation is a little bit heavy, but I don't see this can be easily avoided. Still, the presentation can be improved (see the details below).
One major issue of the paper is that the contributions should be better explained. Due to the nature of the paper combining many existing techniques, the authors should be more explicit about the novelty of the paper. In particular, the introduction should do a better job of explaining the novelty. Is this "just" an improved version of PAGOdA? How novel is the system RSAComb? Did RSAComb use more techniques than [14, 15]? Are there already some results published as workshop/conference papers? If so, what is the difference with respect to them? All efforts in this direction would greatly help readers understand the novelty of this paper.
More detailed comments:
- In Section 2, all the definitions are without references, but the Theorems are always with references. This causes some confusion. Are there some new definitions introduced in this paper? Or are they all from existing literature? This needs to be clarified
- p4. Table 1.
- Technically, R^- is not an axiom.
- In the translation of axiom T1, V_{i=1}^n Bi(x). n should be m
- p4 l27. What are the "bottom and equality axiomatization rules"?
- p4 l28. What is "\Pi_K"? This seems to be undefined.
- Example 2.1. What is the purpose of this example? Could you explain what is the source of the exponential size of the model? Is R unsafe under Definition 2.1?
Theorem 2.5. How about RSA+?
- p8 l18. "oriented forest" has not been introduced before
- p8 l19. this place introduces RSA+, and it says RSA+ is an extension of RSA. Is RSA+ is a new language introduced in this paper? If I understood correctly, RSA+ is a sub-language of RSA. It is strange to call a sub-language as an extension.
- Definition 2.4.
- What does "confl" stand for? conflict?
- Let **prec** be ... -> \prec
- p10. Theorem 2.5. Do you have results about RSA+?
- Example 3.1.
- It would be very useful to provide some CQs together with the KB. Otherwise, this "Overview" section seems to be incomplete.
- p20. Footnote 12. The link does not point to the right fragment of the page
- Sec 8. Experiments. In general, I feel that the experiments should show distinguish better the **approach** of PAGOdA/RSAComb/ACQuA and the **implementation** of these systems. In several cases, the issues observed from the experiments with PAGOdA are not clear from the approach (limitation of the theory/methodology) or the implementation (e.g. bugs in the system).
- The authors may repeat here the experiments using CQ under certain answer semantics using the SPARQL syntax.
- UOBM: "We could not perform a direct comparison with UOBM since we were unable to reproduce the PAGOdA’s results [12]." -> this should be elaborated. E.g., Do you mean PAGOdA throws exceptions, timeouts, or returns wrong answers?
- Reactome: What happened with other queries other than query 65?
- In the paper, the authors claim that PAGODA and RSAComb are not comparable. Did you observe any case that PAGOdA performs better, which could be beyond the experiments described in the paper?
- 8.3. OOR batch. This batch is a large collection of data sets. The paper should provide more statistics in this part of the evaluation. For example, provide some values of L/U/T similar to Figure 12, but aggregated. This can lead to better insight.
- "PAGOdA was able to process 103 out of 126 ontologies considered", what happened with the ontologies that could not be processed? What are the issues here?
- "We were able to fix the issue in ACQuA". This is a bug?
- The queries are very simple, in terms of conjunctive queries. Most of the queries have a small number of atoms and very small number of existential variables. Can the authors comment on this?
- p37 l32. The last semicolon ";" should be period "."
|