Review Comment:
The authors introduce the modelling issue of referring to multiple unspecified objects of a type and propose a “multi-instance” pattern family (MIF).
The article starts from describing the problem, referring to the PURO modelling method, and then proposes three potential solutions in RDFS/OWL.
Each solution makes use of different modelling constructs: logical structures, naming conventions and annotations. Finally, the authors discuss the relation between the proposed pattern and the Class as Property Value pattern (CPV).
The problem is described as follows:
“There is a distinguished real-world entity that is related, by the same kind of relationship, to (possibly even one, but usually) multiple undistinguished objects of a certain type."
An example may be something like "Amazon sells Books”. Clearly the problem here is not to relate Amazon to the set of Books for sell, but to each of the undefined instances without materialising all of them (because still to be known, for example).
This is an interesting “encoding” problem, in the sense that it is obvious that modelling this in RDFS/OWL is not straightforward.
The authors propose three solutions, all extending existing partial solutions: 1) relying on an existential quantifier that restricts the type of the entity with the property and the target type 2) the usage of an intermediate entity that acts as a placeholder for all the unspecified objects and 3) the specialisation of the property making it pointing to the type of the target entities directly.
All these preexisting solutions have some limitations, the authors say. For example, the usage of an existential quantifier, while it is sound at the logical level, does not reflect the semantic of multiplicity rigorously. This is true, but the authors say nothing about why to express this semantic distinction is useful.
I think the main contribution of the article is the specification of the modelling issue. My main problem with this paper is that the usefulness (or potential usefulness) of the pattern is not clear. This is also a requirement in ontology pattern paper submissions.
In addition, reading the paper, it looks like there is the assumption that making explicit a semantic distinction in a model would be a value on itself, which I doubt - but this is my personal opinion.
Abstract:
Authors say that their problem is going to be solved in OWL, but two of the three solutions do not need OWL expressiveness.
1. Introduction:
The paper starts from saying that referring to multiple unspecified instances of a type is a common problem. While I agree that it is a common situation - as the authors demonstrated by reporting the presence of it in e-commerce datasets - I am less keen to see it as a problem without having clear the advantages of having it solved.
In other words, I am quite convinced that patterns should be (possibly optimal) solutions that bring some advantages. In this paper the authors discuss three ways of achieving their goal, but none of them are discussed in terms of concrete benefits. The lack of use cases where the raised limitations of existing solutions can cause problems might be the reason why, as a result, the reader can not be convinced that this is a problem at all.
As part of the motivation, the main argument about the limitation of the existential quantifier in OWL is that its semantic is strictly “at least one”. Authors say that the interpretation of existential quantifiers in OWL is psychologically biased against multiplicity. This is true, because the specification of the existential quantifier is focused on the logical implications of it and not on the psychological ones.
It should be proved that this limitation has important implications before searching for a solution to it. I do not see it at the moment.
The other alternative solution is the usage of a blank node as existential quantifier. It is written that blank nodes are "considered as bad practice by a significant part of the linked data community and have no meaning to the description logic community”. These two statements are a bit shallow. The first need a citation and the second is only the consequence of the fact that the authors focus on a solution at the OWL level. Blank nodes have no special meaning in OWL, and not for the DL community. RDF, RDFS and OWL layers have different semantic, that’s all.
The characteristics of the multi-instance fact pattern does not seem to satisfy clear requirements (competency questions). The authors list a number of reasons why the proposal should be a contribution. The only one that I see as a concrete requirement is "possibilities to approximate the cardinality of the relationship considered”. However, this paper does not go beyond proposing an annotation property to specify the kind of multiplicity, leaving out the analysis on the possible values of it as future work. Having this discussion here would have probably enforced the motivation for having the pattern.
2. Multi-instance fact: background model
This section is very clear, and specifies very well the modelling issue.
3. Pattern modeling inventory
In this section the authors summarise different aspects of pattern based modelling that have a role in the propose solution: logical constructs, naming conventions, annotations, entity reuse.
This is useful.
4. MIF pattern family
This section of the paper is focused on the description of the MIF pattern family. There are three alternative concrete solutions:
4.1. Existential restriction with annotation
This solution extends the OWL based existential restriction. The authors insist that "the existential restriction pattern does not allow to express the multiplicity of the relationship to anonymous instances at the logical level”. It is not clear to me whether it does have any sense. The value of the multiplicity property should be a fuzzy value - ‘many’^^xsd:string, for example. Why we should desire it? In addition I do not see how adding an annotation property that specifies the multiplicity as a string value should express the multiplicity at the logical level, going beyond the simple OWL-based solution.
4.2. Linking to placeholder individual
The second option is extracted from Good Relations, and make use of a placeholder individual as the range of the property, representing the unspecified multiplicity of entities. .
This method has been also proposed as “Template Instance" pattern [1] in a very similar situation. There, the problem was to collapse a number of equivalent entities to a single one to reduce the space of the data. Authors might want to also discuss their work with relation to it. Similarly to the previous solution, authors discuss the need of specify the kind of multiplicity and the various options about how to encode this in RDFS/OWL.
4.3. Shortcut property with name and annotation
This last option is based on a special property that points directly to the type of the unspecified objects. This solution is also extracted from Good Relations. However, the paper does not say much about the benefit of it with respect to the other two.
5. Overview and selection criteria and 6. Implicit pattern usage
Here is discussed the pros and cons of the three solutions. I think this should be extended, or a real comparison of the three solutions performed, thus to elect a single optimal solution. This is another reason why I feel the work to be not mature enough for publication in a journal.
7. Relationship to the CPV family
In this section the authors discuss the relation with CPV, and again the main advancement of the proposed method (section 4.3) is that it specifically accounts for the multiplicity of the relationship between the source object and target class.
---
As a summary:
On Quality of the pattern:
The problem is very interesting, and I believe it is a hard one in RDFS/OWL.
I do not like much the three alternative solutions. One of the motivations of having patterns is that they provide good practices. From the overview of the alternative patterns in Table 1 it seems that the OWL-based solution does not have special limitations (the consequence is a requirement instead).
On Usefulness (or potential usefulness) of the pattern
The proposed pattern does not seem to provide any significant advantage with respect to existing solutions.
On Clarity and completeness of the descriptions
The paper is well written and clear.
[1] Nyulas, Csongor, Tania Tudorache, and Samson W. Tu. "The Template Instance Pattern." WOP. 2012.
|
Comments
Submission in response to
Submission in response to http://www.semantic-web-journal.net/blog/special-call-ontology-design-pa...