Semantic Web technologies in sensor-based personal health monitoring systems: A systematic mapping study

Tracking #: 3735-4949

Authors: 
Mbithe Nzomo
Deshendran Moodley

Responsible editor: 
Oshani Seneviratne

Submission type: 
Survey Article
Abstract: 
In recent years, there has been an increased focus on early detection, prevention, and prediction of diseases. This, together with advances in sensor technology and the Internet of Things, has led to accelerated efforts in the development of personal health monitoring systems. This study analyses the state of the art in the use of Semantic Web technologies in sensor-based personal health monitoring systems. Using a systematic approach, a total of 43 systems are selected as representative of the current state of the art. We critically analyse the extent to which the selected systems address seven key challenges: interoperability, context awareness, situation detection, situation prediction, decision support, explainability, and uncertainty handling. We discuss the role and limitations of Semantic Web technologies in managing each challenge. We then conduct a quality assessment of the selected systems based on the data and devices used, system and components development, rigour of evaluation, and accessibility of research outputs. Finally, we propose a reference architecture to provide guidance for the design and development of new systems. This study provides a comprehensive mapping of the field, identifies inadequacies in the state of the art, and provides recommendations for future research.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Minor Revision

Solicited Reviews:
Click to Expand/Collapse
Review #1
Anonymous submitted on 28/Oct/2024
Suggestion:
Minor Revision
Review Comment:

The paper provides a comprehensive analysis of Semantic Web technologies within sensor-based health monitoring systems, addressing a critical area in healthcare technology. The systematic mapping approach applied to select and evaluate 43 representative systems is thorough and enhances the study’s credibility. The focus on key challenges offers a well-rounded view of the current limitations and advantages of Semantic Web technologies in this field. In particular, evaluating systems according to development rigour, device compatibility, and accessibility of outputs is practical and helps translate research insights into actionable considerations.

Minor Issues and Recommendations:

- section 2.2.2 would benefit from incorporating a description of the concept of the Personal Health Knowledge Graph (PHKG). PHKGs are becoming central to personalized health data management, and their inclusion would provide a more comprehensive view of Semantic Web applications in this context;

- the section "Languages and Standards" (currently Section 2.2.4) should precede the discussion on ontologies and knowledge graphs. Positioning this section earlier would give readers a foundational understanding of the technical standards before delving into specific implementations, thereby improving the overall flow;

- a more detailed discussion of the limitations of the proposed study (Section 8.3), including potential publication bias and the generalizability of the findings, would enhance transparency and help contextualize the results;

- while future recommendations are provided, expanding these to include advancements such as the integration of Generative AI and Large Language Models (LLMs) with personal health monitoring systems would provide valuable insights for ongoing research;

- the GitHub repository should include, alongside PDFs, direct links to the published papers on their respective reference platforms (e.g., IEEE, Elsevier). This would facilitate easier access to official sources, enabling readers to reference the original publications directly and explore related research more conveniently.

Review #2
Anonymous submitted on 17/Jul/2025
Suggestion:
Minor Revision
Review Comment:

I thank the authors for resolving many of my comments about the previous version of the paper. As I have already mentioned before, the paper constitutes a large and useful body of work. I recommend its acceptance given that my remaining concerns (of which there are quite a few, but none of them major) are addressed.

I wanted to point out the following positives in particular:
- a good in-depth discussion on relevant ontologies and semantic interoperability
- useful extra sections on explainability and accessibility
- Table 16 is an excellent contribution.
- new parts that discuss the challenges supported by the systems, and the quality of the systems. This is a huge improvement over just showing huge tables (which are now moved to appendix).
- the improved and more granular reference architecture.

The new version now also outlines the particular role of SW technologies for overcoming the challenges. But, I found the discussion being too sparse in some cases, and there are many missing citations. I list some of these below.

Section 5.3.3:
The part on "rule-based reasoning" should be accompanied by citations; which systems use rule-based reasoning, and which ones rely on SW rules in particular?

Section 5.4.3
This section is very sparse. If the reviewed systems rely on SW rules for decision support - as it is currently implied - they should be cited here and this aspect of the systems discussed in more detail.

Section 5.5.3
"Explainability relies on domain knowledge". Not as a general rule - for instance, post-hoc XAI methods such as SHAP, LIME do not rely on domain knowledge.

"Additionally, many ontology reasoners provide explanations of the reasoning process, although it is not clear whether these explanations are made available to the end users." Yes, explanations are made available to end-users, but they still need to be formulated in a human-interpretable way to be useful. For instance, Protege formulates explanations on the reasoning behind an inference. The eye reasoner generates proofs for inferences, which can similarly be formulated in a human readable way [https://link.springer.com/chapter/10.1007/978-3-031-54303-6_7].

Section 5.6.5.
"[..] an approach used by a few of the selected systems." Citations needed.

I also still feel that the scoring perspective requires nuancing. Some challenge aspects may not be be relevant to all types of systems (such as DSS supporting more than one type of user). An excellent system that specifically focusses on end-users, and not clinicians, would thus score lower here (although some would argue that systems should always be tailored to a specific user group). This could be added as a limitation at the end of the paper, one that is shared with many other scoring / benchmarking efforts - it may not be fully applicable to all possible systems, and consequently some people may want to adapt it (such as single-user systems).

Other comments:

- It is unclear how process interoperability applies to the content of Subsection 5.1.4, as it does not pertain to integrating data (e.g., sensor data, EMR data). Perhaps an option here is to integrate the contents of this subsection into the prior subsections.

- There are some inaccuracies in the SW overview:
"RDF-star, which allows the subject or object of a triple to refer to another triple"
This is not wholly accurate, as the embedded subject/object triple is not necessarily asserted (and thus not "referred" to).
https://www.w3.org/TR/rdf12-concepts/#section-triple-terms-reification

"Other important standards in the Semantic Web community are: eXtensible Markup Language (XML), a markup language and file format". I would not call XML a SW standard.

- "This explicit provision of explanations can also be considered a form of post hoc explainability"
Both interpretable ML models, and post hoc explainability, aim to provide such explanations. Hence, it seems incorrect to put them in the same category as post-hoc explainability. It would be more appropriate to put this into a separate "explanation quality" section.

- The last part of section 5.5.2, starting with "Only six of the selected systems report", pertains to explanations in general and thus does not belong in the subsection either.

- It is strange to put "challenges assessment" under the "summary" section 5.8, which should be, well, a summary of what came before and thus not introduce new content. I would propose adding it as a separate subsection before the summary section. Same comment for "quality assessment" under the summary section 6.5.

- Section 8 provides useful summaries on SW usage, quality assessment, and future research directions. That said, it seems that "new" takeaways from section 8.2 (e.g., such as lack of resource re-use) do not belong under future research directions; the section should not introduce new observations.

Minor comments:

- "explainability is gaining traction as a pivotal aspect of AI-driven health systems" I don't believe that AI-driven health systems have been discussed / introduced yet, so it's a bit strange to describe aspects of it here.

- "Group 4: Other reviews related to AI and technology in the health domain"
Similar to before; I don't think AI-driven health systems have been introduced yet.

- "In contrast to SSN, SAREF is targeted at industry developers rather than ontology experts [25], making it practical for real-world applications." Surely the development of an ontology by experts does not precluded from being practical in the real world.

- The objectives listed under Section 4.1 do not fully correspond to the contributions listed in the introduction - namely, a quality assessment of the selected systems based on data, devices used, etc.

- "Semantic Web technologies can also contribute to syntactic interoperability, albeit in an indirect capacity." By offering RDF as a uniform data description language, SW Technologies can also offer syntactic interoperability. E.g., FHIR offers a Turtle syntax, which is a serialization format of RDF.

- "Although several situation-focused ontologies have been developed [..] none of the selected systems extend any such ontologies." How about systems using (not necessarily extending) existing ontologies?

- "The limitations of semantic-based approaches, such as scaling difficulties and inability to handle uncertainty, can be mitigated by combining them with complimentary techniques such as ML and Bayesian networks."
As the authors themselves mention later on (Section 5.6.5), there exist extensions of SW technology specifically to cope with uncertainty.

- "However, the use of these technologies does not guarantee explainability." Can the authors give an example here of where SW technology does not lead to explainability?

- "While ML is often criticized for its susceptibility to producing black box models" I don't think "susceptibility" is the right word to use here.

- "An additional ethical concern is the cascade of care, a phenomenon in which incidental findings from screenings or monitoring result in further clinical care."
Please consider elaborating on how personal health monitoring systems can exacerbate the cascade of care.

- "Using platforms like GitHub rather than static files has the advantage of version control". The drawback here is that GitHub repo's can easily be deleted or made private. I have experienced this personally for several papers. A Zenodo record has the benefit that it cannot be deleted after creation, but new versions of the record can still be added. Another solution to improve accessibility is for journals to have a data availability policy.

- Please clarify the following: "This can be attributed to the fact that there is a gap in tooling support for Semantic Web representations such as RDF with standards such as FHIR." FHIR has an RDF representation (using Turtle serialization).

- In general, Turtle is a serialization format (there are others, such as N-TRIPLES) of RDF, the latter being the abstract data representation language. This should be clarified throughout the paper.

Review #3
By Evan W. Patton submitted on 29/Jul/2025
Suggestion:
Minor Revision
Review Comment:

Overall, the manuscript covered substantial ground. I am assuming in the remainder of my review that the expected audience at least knows the basics of Semantic Web technologies, e.g., RDF. As such, I do believe it suitably targets the audience of the Semantic Web Journal. The authors were structured in their approach, and while I have a few reservations (see my detailed notes below), I believe they were thorough in their survey. The text itself is a bit dense but that is unsurprising given the breadth of topics covered. It is good for the semantic web community to see how their technologies are being adopted and the areas for future research and implementation. The linked GitHub repository contains copies of the final selection of 43 papers, but does not include any of the intermediate steps. While this may not be necessarily important I did find (and noted below) how jumps from larger sets to smaller sets of papers is hard to follow and may be difficult to replicate. I have noted a number of issues, but I anticipate that most can be resolved with minor revisions and clarifications.

General notes follow:

The authors look to analyze semantically-enabled Internet of Things health systems against seven challenges:

1. Interoperability
2. Context awareness
3. Situation detection
4. Situation prediction
5. Decision support
6. Explainability
7. Uncertainty handling

It's possible that I missed it, but it would be helpful if the authors stated up front why they chose to do a "systematic mapping" rather than a more traditional "systematic review." The latter is much more common in medicine adjacent fields.

Note that this list is not in the same order as that presented in the first full paragraph of page 2. It might be worth aligning them.

Is the order of the enumeration of challenges important? For example, every measurement has a degree of uncertainty, so is uncertainty handling the "last" thing that needs to be considered (versus, say, interoperability), or is every challenge equally valid. If they are equally weighted, it would be good for the authors to state this up front.

The authors claim on page 2 that "Semantic Web technologies, which are widely used in the health domain, can alleviate some of these key challenges." They don't seem to quantify what "widely used" means, nor do they really summarize which of the subset of challenges SW technologies alleviate.

I feel like the term "systematic mapping" could be misinterpreted given that "mapping" can mean multiple things in the semantic literature. It sounds more like in this content the authors are discussing a systematic review (in which case they could say that).

It may be worth clarifying on page 2 where you say "The goal of this study is to systematically map the state of the art in the use of Semantic Web technologies in sensor-based personal health monitoring systems." are you focused only on production level systems or are you considering research systems?

On page 3: "Health monitoring sensors are generally either wearable or implantable." While I overall agree with this sentiment, there is work by people like Prof. Katabi at MIT [1] that focus on gait and fall detection through Wifi perturbances that suggest a third mode of health monitoring. As I understand it they are not using semantic technologies, so their particular work would not fit the inclusion criteria, but it does suggest there is a class of research on personal health monitoring that goes beyond wearable sensors that should at least be acknowledged.

This is a stylistic thing, but given that section 2.2 opens with 3 technologies, the addition of section 2.2.4 Languages and standards seems like it could be elevated up to its own section or the intro amended to add "... and the languages and standards for constructing them."

One question about §2.2.4 for the authors: What criteria were used to select this list? For example, FHIR is provided with RDF mappings (FHIR doesn't get introduced until Sec 5.1.2 on page 14). There is also PROV-O for tracking resource provenance. Both seem relevant to this review but neither are mentioned. Yet, people were modeling medical and IOT data in RDF before the introduction of languages like SHACL.

Section 4.3: "those without a well-defined semantic technique" - it would be helpful to clarify what this means since a definition isn't given.

Section 4.4: How many papers were excluded for the purposes of extensions? How many due to pay walls? It's unclear how the authors go specifically from 186 papers down to 43. Since this is supposed to be a systematic paper, I do not believe I have enough information to filter down to the same set of 43 papers.

Section 5.1.3: "Most systems represent sensor and sensor data concepts in ontologies..." Why not provide the exact number out of the 43? Do you have a good sense/justification as to why so many did not reuse an existing ontology (you mention 13 reused)?

Section 5.3: In the introduction 7 dimensions were introduced, but here in section 5.3 you combined situation detection and situation prediction into a single section. Does it therefore make sense to reduce the 7 criteria to 6 and replace the two situational entries with situational analysis to reflect the structure of this section as currently it seems they are demoted to less than the other 5 dimensions.

Section 5.6.2: "Bayesian networks are well known for managing uncertainty" Do you mean *modeling* uncertainty?

Section 5.8.1: What is the motivation for using a 5 point scale here, especially when it results in 3 of the 7 criteria not dimensions not having a "high" category. If the selection of aspects is so limited then it seems just as valid to use a 4 point scale in which every point in the scale is valid. In figure 4 you also show the scores out of 100%, not out of 4 or 5 so it's unclear what the grouping offers the reader. Also, the intervals are not the same size, so it seems like going from low to medium would be more work than to go from high to very high.

Section 7.1: Given these architectures are defined in [186, 187]. you may want to mention that upfront in this section.

Section 7.2: This sounds very similar to the Model-View-Controller design. What in particular differentiates it? For example, it seems like one could map the data layer as model, presentation layer as view, and the analysis/decision support layer as the controller.

Section 8.1.1: You mention interpretability in the first paragraph, but some of the papers mentioned the use of LLM style chat technologies. You may want to add a sentence about that.

Section 8.2: "...majority of the systems are not build with..." build should be built

[1] Yonglong Tian, Guang-He Lee, Hao He, Chen-Yu Hsu, and Dina Katabi. 2018. RF-Based Fall Monitoring Using Convolutional Neural Networks. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2, 3, Article 137 (September 2018), 24 pages. https://doi.org/10.1145/3264947