Review Comment:
## Response to letter
Overall, I am generally satisfied with the authors' response to my comments. As some remaining observations:
** I thank the authors for clarifying what they mean by "full-mappings" operators (which is based on full-relations operators from the book by Garcia-Molina et al.). In general, I still see this as a practical/informal distinction that relates more to "physical operators" (how the operators are implemented) than "logical operators" (how they are defined). I think there are ways to implement "full-mappings" operators in a "tuple-at-a-time" ((index) nested-loop) fashion, which grays the line between both types of operators. As some minor related comments:
- "For example, the ORDER BY is a full-mappings operator as it needs to materialize all the mappings before sorting them." I think this needs to be revised as the mappings do not always *need* to be materalised or sorted (as discussed a little later).
- "However, in the general case, queries that use a full-mappings operator are not preemptable." Again, while I understand the intuition, this is a very specific claim that I don't see evidence for. For example, if we have the sorted indexes available, I do not see how preempting ORDER BY (in the general case) is any more difficult than preempting joins. Note, for example, that enumerating mappings in order is precisely how many worst-case-optimal join algorithms work. Unless you have a proof for this, I would suggest to simply weaken the claim to suggest, e.g., that in such cases, preemption is "not straightforward", or that it has not yet been explored in detail, or something similar, rather than suggesting it cannot be done.
- "hence it cannot be suspended and resumed in constant time." Again, this claim is strong, not proven, and should either be weakened, or a proof provided.
** I thank the authors for clarifying the combination of HLL and LinearCounting in HLL++ to deal with both small and large cardinalities, and also for adding error rates.
** I think that other comments have been addressed appropriately.
## Minor comments
- "allows processing" -> "supports processing"
- "Public endpoints such as DBPedia or Wikidata [support]"
- "registers [de]noted"
- "two third[s] of the queries"
- "estimator to estimate the result" -> "estimator for the result"
- "that [supports] estimating a distinct sum"
## Recommendation
I think this is an interesting paper, and the authors have clarified/addressed a number of key concerns regarding error rates, limitations, scalability, and readability. I think that the remaining fixes (including weakening some informal claims) relate to language, and thus the paper can be accepted.
|