Review Comment:
This paper describes an effort for cataloguing Han Poetry resources with envisaging a layered semantically-enhanced architecture for this task. This contribution is relevant in modeling also thematic and physical aspects of Han Poetry works. The paper is well written and well structured.
My concerns are chiefly about the distribution of the produced datasets. They are accessible through a dedicated online repository. The OWL ontology http://example.org/HanpoetryOWL is provided at two different paths: Heian_db/Ontology/HanpoertyOntology.ttl and Tang_db/Ontology/HanpoetryOntology.ttl. Similarly for SKOS vocabularies, divided in core and local (what is the difference?). All the items are in a http//:example.org namespace. Following the Linked Data principle, every IRI should be deferenceable. So, for example, accessing to http://example.org/HanpoetryOWL one should download THE ontology. Or, accessing http://example.org/HanpoetryConcept# one should download an RDF file containing, among others, a resource with IRI http://example.org/HanpoetryConcept#CollectionConcept/TangPoetryCollection. To accomplish this principle, the repository should be restructured with avoiding repetition. In addition, substantial IRIs should be used to make the corresponding file accessible. I suggest, after appropriately restructuring the repository, to acquire persistent IRIs using the WebID service (https://www.w3.org/wiki/WebID), in such a way that, when accessed, provide the corresponding files on github.
Another concern is about the prefixes used in the paper. Prefix abbreviations are provided in Section 4, but some prefixes are used before (for example, in hpc:Moon in Section 3.1.2 or bf:Work in Section 3.2.1). Prefixe abbreviations should be declared in advance.
Othere minor suggestions are reported in the following.
========
Q1. IIIF should be briefly introduced here as a "generic" reader probably doesn't know what it is.
Q2. In Section 1.2, on page 2 one can found "Based on an assessment", but for the FEAR principles this assessment should be accessible to the reader so that she could verify the coherence with the data you are reporting. Addind a bibliographic reference to this assessment should be enough.
Q3. on page 2, "A smaller but growing" shouldn't be capitalized.
Q4. Please provide bibliographic references for IIIF, CBDB and JBDB or say in what sections they will be recalled.
Q5. In Section 2.2 on page 3 I'm not sure that "Ontology Intelligence Service Center" is correct w.r.t. the cited work.
Q6. In Section 2.3 on page 3, change the font to characterize as titles and add a full stop after the followings: "Undigitized or Image-Only Resources", "Unstructured or Minimally Structured Text Resources", "Poetry Knowledge Graph Systems", "Experimental Semantic Modeling of Han Poetry Using RDF/OWL".
Q7. In Section 3 on page 5 "RDF instance data (e.g., poems, poets, poetry collections) are constructed and
deployed across multiple triple stores" is reported. However, using "RDF instance data" may be misleading as instances (or individuals) is tipical of semantically "reach" formalisms (such as RDF Schema or OWL), where as RDF provide just a notion of resource. Thus this sentence may be rephrased, or you could use just "instance data" instead of "RDF instance data".
Q8. In Section 3.1 on page 5 the notion of "semantic consistency" has a precise meaning in the field of formal logics and OWL (in particular, consistency check is a reasoning task), thus I suggest to reformulate using different words here.
Q9. In Section 3.1.1 on page 6 "The vocabulary structure of the SKOS-based conceptual vocabulary system" is quite redundant, I suggest to remove the first occurrence of "vocabulary".
Q10. In Section 3.1.1 on page 6, why "Poem title concept"? Usually title is a metadata of a poem. Also, "knowledge" is a very broad notion, probably a more explicative name could be more appropriate.
Q11. In Section 3.1.1 on page 6, replace "struct ured using properties" with "structured using properties".
Q12. Section 3.1.1, where the thesaurus can be downloaded? Provide a persistent IRI.
Q13. In Section 3.1.1 on page 6, concerning the sentence "Each poem is modeled as an RDF instance (hp:Poem)", the same considerations in Q8 hold. In addition, I can't find the hp:Poem class or skos concept in the theasurus. There is some confusion between hp:Poem and Poem title concept.
Q14. In Section 3.1.2 on page 6, an "open data platform" is mentioned for the first time. Inspecting all the paper I just found a github repository
I propose to reformulate " The SKOS Tree Viewer is available on the open data platform" as " The SKOS Tree Viewer is available open-source" or similar.
Q15. In Section 3.1.2 on page 6: "This concept represents loneliness, home-sickness, and purity, symbolizing the philosophical and emotional resonance [...]". This assertion is out of the context of this section and, so, it is a bit confusing. These "represent" relations are not contained in the SKOS files. In my understanding, they will be presented later. Thus I suggest to add a reference to where these representations will be presented.
Q16. In Section 3.2 on page 7: " the Person class reuses the FOAF (Friend of a Friend) model". But the FOAF ontology is not imported by HanpoetryOWL, as well as all the constraints on foaf:Person in it. May be there are some reasons for this fact. In such a case, the motivations should be explained.
Q17. In Section 3.2.1 on page 7: "the Poem class reuses the Work concept from Bibframe". Analogously, the Bibframe ontology is not imported by HanpoetryOWL.
Q18. In Section 3.2.1 on page 7: " the Poem class is defined as a subclass of bf:Work and assigned a unique identifier to store metadata". I can't understand this sentence. Note that a "unique identifier(s) to store metadata" is assigned to Poem instances (individuals). Of course, the Poem class has itself a unique identifier, but does not contain metadata about poems.
Q19. The linking between Knowledge sub-classes, corresponding SKOS concepts (?) and Poem instances is really unclear. I can't deduce it from the provided example materiale. Figure 2 does not help. Probably, this is better explained in Section 4.2, and thus adding a reference to this section may be enough here.
Q20. In Section 3.3.1 on page 9, replace " database, , as illustrated in Figure 4" with " database, as illustrated in Figure 4".
Q21. In Figure 4 on page 9, replace "Collcetion" with "Collection".
Q22. In Section 3.3.1 on page 9, " These TEI files are linked to the corresponding RDF resources". Here how these connections are established should be explained in more details. Anticipating that this will be explained in details in Section 4.4 would be enough.
Q23. In Section 3.3.2 on page 9, "the system uses Virtuoso triple store". Here is unclear what "system" the authors are talking about. The
reader could think that there is an online service with a corresponding SPARQL endpoint, but I understood that the database is just provided as RDF files in the GitHub repository. I suggest to restructure this section in order to clarify that there is no publicly available "system" actually running.
Q24. In Section 4 on page 10. In my understanding, concept:moon should abbreviate http://example.org/HanpoetryConcept#KnowledgeConcept/NaturalScene/Celest.... But in the prefix declarations provided in the same section, concept is the abbreviation for http://example.org/HanpoetryConcept#. Probably this "simplified notation" should be explained in more details.
Q25. In Section 4.1 on page 11, "Figure 7 illustrates this modeling structure using Shu Dao Nan" but there is no Figure 7 in the paper. Authors probably are referring to the code snippet just before. Using the figure environment also for code snippets may be a good choice.
Q26. In Section 4.2 on page 12, the property hp:hasEmotion is mentioned. May be that Emotion would deserve its own class.
Q27. In Section 4.5 on page 14, "key entities such as hp:Poem,hp:Person, and hp:Collection" should be rephrased as Poem, Person and Collection are the classes, whereas here I suppose the authors are talking about the individuals belonging to these classes.
Q28. In Section 5.2.1 on page 15, the results of a SPARQL query are reported, but the query has to be reported as well.
Q29. In Section 5.2.1 on page 16, replace "capabilities:First, " with "capabilities. First,".
Q30. In Section 5.2.2 on page 16, similarly to Q29, the query has not been reported.
|