Review Comment:
The topic of this paper, LLMs for Ontology Engineering, is a timely and important subject within the fields of Artificial Intelligence and the Semantic Web. The authors clearly identify the gaps in existing research within the Introduction. They explicitly list four research objectives, mapping them to four Research Questions (RQs), which form the core structure of the entire results section. Crucially, the paper does not merely list the 30 included publications; instead, it extracts 41 distinct studies and provides a synthesized analysis of these data points structured around the RQs. The logical structure and overall presentation in the current manuscript are generally clear. The authors also provide relatively rich and well-organized data files via GitHub and Zenodo, though a clarifying README file is missing. Overall, this paper is suitable as an introductory text for practitioners in Semantic Web. However, before it meets the requirements for publication, I have several concerns and recommendations, detailed below:
1. The paper mentions that this work is an extension of an earlier overview article (Garijo et al. 2024). It is essential that the authors clearly and explicitly articulate the unique, novel contributions of the current manuscript. This clarification must go beyond a simple increase in the number of papers and should include a clear statement of the substantive new analyses and new insights gained in this extended work.
2. In a high-profile area like LLMs and the Semantic Web, filtering the initial 11,985 retrieved results from 2018 to 2024 down to only 30 core papers (which appear to be predominantly from 2023 to 2024 based on the materials provided) seems a comparatively low number. This may be a consequence of the search terms (MR Terms) being restricted to "Language Model," "LM," and "LLM*." While these terms cover the general concepts, this strategy likely missed relevant work conducted between 2018 and 2021 (before "LLM" became common usage) that used specific models (e.g., BERT, T5) for ontology engineering but did not explicitly use "LM" or "LLM" in the abstract or keywords. The authors should discuss this potential limitation.
3. While the paper demonstrates critical thought in the Evaluation (RQ3) and Discussion (Section 5), there is a problem of excessive description in certain parts. For example, Section 4.4 (RQ4: Application Domains) largely constitutes a list of which papers were applied in which fields (healthcare, cultural heritage, finance, etc.). It lacks a deeper analysis of why these domains are hot spots (e.g., due to data availability, clear commercial needs, or the maturity of existing ontologies). Furthermore, there is no analysis of the commonalities or differences in the tasks addressed across these various domains. Similarly, Section 4.2.4 (RQ2.e: Role of the Human) is very brief. The finding that only four studies explicitly involve human participants is a significant observation, yet the paper's analysis of this crucial point could be much more profound.
4. The paper suffers from noticeable redundancy. Each subsection of Section 4 (Results) concludes with a detailed Summary section. While these summaries are well-written, they cause Section 5 (Discussion) to feel repetitive. Many of the core arguments presented in the Discussion have already been thoroughly emphasized in the Section 4 summaries. I recommend that the authors reorganize the relevant sections to convey their interpretive insights.
5. For a systematic review of this length and depth, the overall visual presentation is insufficient. In particular, the paper is missing a comprehensive taxonomic framework diagram. Such a visualization is crucial for helping the reader gain a clear, "at-a-glance" understanding of how the authors have organized and categorized the entire research field.
|