Review Comment:
The authors describe the conversion of previous work (in which a mapping between FrameNet roles and WordNet synsets was created) into RDF. The conversion to RDF seems trivial, since the mapping was already undertaken and published in previous work.
(1) Information on the data set
The data set contains a mapping from FrameNet roles to WordNet synsets, as well as a minor amount of statistical information about the number of examples for a specific frame / its linked hyponym. The WordNet 3.0 ontology is used to link the resources to the English WordNet. Because there is no LOD version of FrameNet online, there are no links to FrameNet resources, only strings identifying frames. There also is no FrameNet ontology available, so the authors developed their own. The data set is not interlinked with any other data sets.
(2) Usefulness of the data set
Although the authors try to explain the usefulness of the paper, I don't think the data is very useful, as long as there is no published LOD version of the FrameNet corpus and the data links to this corpus as well. A large part of the data has questionable usefulness. The number of examples / total examples for a given LearntSemType is only relevant in a small number of
contexts, if at all, as it could be obtained from the published data. Links to the FrameNet examples could be useful, but can't be provided due to the aforementioned lack of a LOD FrameNet. So without FrameNet to link to, this data set's statistical information is rather pointless, besides its use for the authors java tool. But this statistical information is most of the metadata the data set contains. If one removes it, the data set enriches WordNet synsets with resources like this:
without descriptions of the classes itself, thus hiding its meaning to everybody without access to FrameNet. This comes down to very little data. Especially the missing links to other data sets enhance the problem of lacking usefulness.
The eventual usefulness of their java-based tool has no bearing on the usefulness of the data set as Linked Open Data. Its possible applications and uses are thus no arguments in favor of the data set. With this in mind, if we take a look at section 4 of the paper, the possible use of the data set isn't justified. There is nearly no profit over the original mapping done by the authors, neither for linguists, nor for the Semantic Web community.
(3) Clarity of the descriptions
The description is not very clear. Regarding the modelling, Figure 1 does not declare the namespaces, concealing that fn: is defined as the authors own namespace, not FrameNet's. This is only made clear implicitly in the text, causing general confusion. The authors were forced to act this way, as: "For the structure of the OWL version of FrameNet 1.5, we initially have chosen to rely on [3]. However, as the populated ontology is not yet available, we used FrameNet 1.5 XML representation as a reference for defining frame and semantic role URIs within the ontology."
Thus, it is unclear if the data will be useful when (if) an LOD version of FrameNet or a FrameNet ontology will be published. This raises questions about data set maintenance that are not addressed in the paper.
There also is no evaluation section, although the problems of the automatic mapping are mentioned in the discussion. The authors address steps taken to mitigate the problems, although it is not clear, why this should be done with their java tool instead of on the data itself.
|
Comments
Last minute typo
"Dear Editors" line should, of course, be in the cover letter and not in the abstract. As I see no way to edit the submission, I apologize for the above here in the comments.
fixed
fixed :)