A Knowledge Graph for Semantic-Driven Healthiness Evaluation of Online Recipes

Tracking #: 3534-4748

Authors: 
Charalampos Chelmis
Bedirhan Gergin

Responsible editor: 
Mehwish Alam

Submission type: 
Dataset Description
Abstract: 
The proliferation of recipes on the Web presents an opportunity for developing AI methods to promote healthy nutrition of people using the Internet as a source of food inspiration. Recent research endeavors have resulted in the development of ontologies related to food, and algorithmic solutions for ingredient substitution. However, there is a lack of a resource oriented towards promoting research in semantic-based algorithmic meal plan recommendation and/or individual ingredient substitution that explicitly incorporates healthiness into the recommendation process. To address this gap, we present a knowledge graph comprising a large collection of recipes sourced from Allrecipes.com, their ingredients and corresponding nutritional information, social interactions metadata, and healthiness information calculated based on two international nutritional standards. We describe the construction process of our knowledge graph, and show its utility in quantitatively evaluating the healthiness of online recipes.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Minor Revision

Solicited Reviews:
Click to Expand/Collapse
Review #1
Anonymous submitted on 18/Jan/2024
Suggestion:
Reject
Review Comment:

I thank the authors for considering my feedback and addressing some of the mentioned aspects in their new version of the paper. In general the new revision of this paper only seems to contain minor changes and most of my original comments are still valid. The author's response and new version sufficiently address the questions 1-5, 9 and 12-14 in their response to reviewer 3. However, the other questions are not sufficiently addressed from my perspective. In the following I would like to elaborate on each question and the provided answer:

Question 6
The authors state that they collected the dataset in compliance with the terms of service applicable at that time from Dotdash Meredith. The terms of service changed in the meantime, which means that no updates from Allrecipes.com can be included in RecipeKG. Given that this is the only data source presented in the paper, the KG can not be updated in a straightforward way, i.e. following the presented pipeline. Personally, I would assess the likelihood of copyright-related issues a bit higher, especially given that the terms of service changed and already prohibit the authors from updating their KG, but I did not look into the mentioned precedent. For my review I will still consider this as a small risk for the long-term availability. The authors could provide actual lawsuits or legal expert opinions on this matter. Just the existence of other datasets with potential copyright-issues does not count as precedent in a legal sense.

Question 7
Section 3.3 now provides a better description of the modelling process in general. However, the added text provides insights on how blank nodes are used, but I am still not sure why they are used.

Question 8
I agree with the authors that outdated datasets can support research. But these datasets provide support despite being outdated. It is still a drawback of such a dataset. Providing a single example of an outdated dataset with significant attention does not change that.
Of course the publication of a paper describing a KG can improve the visibility, but for papers in this journal the usefulness of the dataset should be shown by corresponding third-party uses. The git repository does not show this kind of third-party uses.

Question 10
According to the authors the novelty of their work is modelling of health scores by using SWRL rules and the extraction of the Allrecipes.com category system. Both aspects are not described nor investigated in detail as part of this work.

Question 11
But how would such SWRL rules look like? Given that this is one of the claimed novelties of this paper the authors should focus on this aspect in more detail. A user of RecipeKG would need to write these rules, but the paper does not show how these SWRL rules are created. Especially, if the KG should be used by domain experts a really detailed description of this is necessary.

Question 12
I overlooked the mention of the data provenance in the paper. Of course this addresses my initial question. In order to really support such comparisons, it might be useful to model the data provenance in the KG, too.

Question 15
The extracted categories are defined by Allrecipes.com to support navigation and search on their website and not designed for any kind of research. The authors seem to just assume that any kind of category helps researchers. For me this claim remains unsupported.

Overall, the new version of the paper and dataset still lacks a clear use-case, is not maintained nor was the linking towards existing datasets in this area (mapping only 89 from 6309 ingredients without further explanation) improved. The description is improved to some extent, but the paper still does not go beyond defining a few rules and it still lacks any discussion on this. This work seems to be more suitable for a workshop or a short conference paper.

Review #2
Anonymous submitted on 28/Jan/2024
Suggestion:
Accept
Review Comment:

The improvements provided by the authors in the revision of the manuscript are satisfying.
From my side, no further actions are required and the paper may be accepted.

Review #3
By Andrea Mannocci submitted on 26/Mar/2025
Suggestion:
Accept
Review Comment:

The paper introduces the RecipeKG a comprehensive collection of all recipes on an online platform, comprehending their ingredients, quantities, nutritional facts and health scores computed against two international standards.
The scope of the paper is clear, and the paper is well-structured and easy to understand.
I found the literature review thorough and well-organised. The methodology sounds appropriate.
The resource is interesting and complementary to what can be found already in the SoA.
The remarks provided in the previous round of review appear to be adequately addressed.

Minor remarks
- Sec. 2 could be improved by dividing it into more paragraphs. E.g., "health scores. // Applications" and "given user. // Similarly,"
- "well structured" missing hyphen
- In Tab 2, you could lay out rows in the same order provided in the text (black, amber, red). Also, is it BW-friendly?
- I could not access the SPARQL endpoint, https://recipekg.arcc.albany.edu so I could not replicate the queries included in the manuscript. Other reviewers in the previous round did it and their remarks were addressed in the authors' reply. I could not validate this, but I assume the authors are consistent.

Review #4
By Kyle Hamilton submitted on 02/Jun/2025
Suggestion:
Major Revision
Review Comment:

The authors introduce a knowledge graph consisting of recipes and nutritional information from allrecipes.com. According to the authors a gap exists in the literature when it comes to recipe recommendation procedures which take into consideration the healthiness of a given recipe.

Strengths
1. The dataset puts structure on semi-structured data which makes it easier to query across various nutritional facets.
2. The authors make a strong case for the usefulness of the dataset for such downstream tasks as healthy recipe recommendations, and for ingredient substitution recommendations. Albeit, the latter would require subsequent research and development.
3. The manuscript is well structured and clearly written.

Challenges:

Major:
1. The SPARQL endpoint https://recipekg.arcc.albany.edu/ cannot be reached.
2. Something seems off in Table 5. Why is more protein considered unhealthy? Shouldn’t this be the other way around? Of all the food categories, how did “bread” garner the best FSA score? This does not make common sense. What is the cause of these unexpected results?
3. An evaluation section of the ontology and knowledge graph is missing. Much has been written on this topic, including the following: https://www.semantic-web-journal.net/system/files/swj657.pdf, https://www.semantic-web-journal.net/system/files/swj1167.pdf
An evaluation of the ontology and KG could potentially answer the previous question.

Minor:
1. The use of color to report results is problematic. In particular, in Table 2, red and amber are indistinguishable. At minimum color should be chosen from a palette suitable for colorblind readers. Ideally, a different visualization technique or demarcation should be used.
2. Pg.10, line 41. “Not as healthy as one would expect” is not based on evidence. What does one expect? I’m not sure that there is an expectation of healthiness in online recipes.

Review #5
By Mehwish Alam submitted on 20/Jun/2025
Suggestion:
Minor Revision
Review Comment:

Meta-review:
Overall, the paper has been improved from the previous version. However, one of the reviewers has complained that the SPARQL endpoint it not working. Another reviewer has also concerns about longterm sustainability and maintenance of the resource. Even though I am adding a minor review, these two aspects should be considered since this is an important aspect.