Review Comment:
This article addresses the challenge of harmonizing Java-based prevalent ontology APIs with the techniques of deep learning, which are primarily developed in Python. It introduces DeepOnto, a Python package tailored for ontology engineering.
Starting from the premise that deep learning methodologies have gained substantial traction across many research landscapes, the paper shows through examples how these methodologies have demonstrated notable superiority over traditional ontology engineering tools. DeepOnto emerges as a venture to bridge the existing gap by offering a robust and Python-compatible package to aid deep learning-oriented ontology engineering, marking its flexibility and extensibility for additional implementations. Nevertheless, the literature review provided does not overtly illustrate the importance and advantages of merging these two paradigms, an aspect which is later provided in the use cases section. The frameworks proposed by Pan et al. (2023; https://arxiv.org/abs/2306.08302) effectively highlights how large language models often are black-box models and may fail in capturing and accessing factual knowledge.
DeepOnto encompasses a core ontology processing module, introducing the features of the OWL API, such as accessing ontology entities, querying concepts, entity deletion, axiom modification, and annotation retrieval. Furthermore, the ontology class houses several crucial sub-modules spanning reasoning to projection, which are described in-depth in the Architecture segment of the article, supported by illustrative figures. While the term “user-friendly” and less verbosity of the method compared to existing solutions recur throughout the text, and its flexibility for easy updates is noted, a more in-depth explanation motivating these statements could benefit the narrative.
Moreover, DeepOnto is equipped with different tools and resources aimed at ontology engineering tasks like ontology matching and ontology completion, all of which are elaborated in depth with supporting links provided. Mention of how linguistic resources such as Framester (Gangemi et al., 2016; https://framester.github.io/), could potentially improve ontology matching alongside BERT-based models, could be an interesting addition.
In demonstrating DeepOnto's practical utility, the authors describe two use cases: Digital Health Coaching at Samsung Research UK, and the Bio-ML track of the Ontology Alignment Evaluation Initiative (OAEI). This second part of the paper underscores the inherent challenges of benchmarking in ontology matching research, particularly within biomedical ontologies. It critiques existing datasets for their inadequacy in accommodating machine learning-based models and for their focus solely on matching equivalent concepts, while possessing incomplete ground truth mappings. Bio-ML, on the other hand, aims to utilize human-curated mappings as ground truth, employing a broader spectrum of ranking metrics, and includes subsumption matching. The paper thoughtfully articulates different data split configurations and an intention to optimize computational resources. The results reflect promising scores concerning performance, although with an identified issue in matching medical concepts, where certain non-disease concepts are erroneously matched to disease concepts due to lexical overlaps. Although unaddressed in the paper, this issue could bear significant implications in digital health coaching scenarios. It is suggested to mention possible solutions for improving these cases.
Towards the end, the paper touches on how modules in DeepOnto support the prompt learning paradigm, a foundational aspect of large language models like ChatGPT. However, in the conclusion, the authors suggest expanding the existing toolset with newer models such as the GPT series, creating ambiguity regarding what has been implemented versus what is projected for future integration.
In conclusion, the paper is well-written with good support from figures and extensive resources, along with a solid bibliography. It could benefit from more detailed explanations on certain design choices and a clearer distinction between current implementations and future plans.
|