Review Comment:
This manuscript was submitted as 'full paper' and should be reviewed along the usual dimensions for research contributions which include
(1) originality: high
(2) significance of the results: high
(3) quality of writing: good
The paper is well written and describes an innovative approach to infer the category of a person based on the automatically derived category of a clique of persons it belongs to. I like this paper and recommend its acceptance. Nevertheless, the evaluation in its current form is not convincing. I am unsure, whether this is a "minor revsion" or a "major revision", because I do not know, whether additional evaluations show good or bad results.
A minor concern is, when I read abstract and introduction, I did not really understand this contribution, because they were too general. It became clear in the evaluation and conclusion:
"Finally, we presented and evaluated a strategy to assign a category to the nodes in a clique and then,
by generalisation, to the whole clique. The approach yields good results, especially at clique level, and is able to classify also entities that are not present in Wikipedia. … To our knowledge, this hypothesis was never proved before, and the clique classification task based on DBpedia ontology is an original contribution of this work."
Therefore, the main contribution should be stated more clearly in abstract and introduction.
My main concern is the scope of the evaluation:
1) The authors used only 50 cliques manually labeled. 6 cliques belonged to the most general category "person". It would be helpful to state the categories of the other 44 cliques and also the categories of the addtional nodes being labeled based on their clique category. My concern is, if most of the additional non-linked entities being assigned a category based on their clique membership belong to the category "politician", the assignment of this class to non-linked entitities is not very specific, because in this domain most persons probably are politicians. A simple baseline would be to compute the precision and recall, if all non-linked entities get the category "politician" (and also all non-linked entities without highly ambiguous entities).
2) The evaluation should not only be based on the selection of 50 cliques, but also on the selection of x randomly selected groups of consecutive sentences, because already recognized cliques might imply a bias.
|