Review Comment:
The work introduces an anytime algorithm for the bottom-up discovery of multimodal graph patterns in existing knowledge graphs. The authors evaluate the method from a user perspective with a task-based questionnaire. The aim is to support scholars in finding potentially interesting patterns in the data that can spark new research questions.
It uses generalised graph patterns and pays particular attention to multimodal knowledge graphs, while also mitigating the curse of dimensionality.
Patterns are organised according to depth, length, width and support. A facet browser assists scholars, and it’s very interesting.
Strengths
- Aim and scope fit the journal; very interesting idea and implementation, though perhaps not the most original one
- Good technical quality
- The facet browser proposal is helpful to reduce barriers to usage
- Good writing, clarity also in figures and listings
Weaknesses
- Lack of extensive literature on patterns. For instance, there is a connection to earlier KG-exploration UIs such as Aemoo (Nuzzolese et al., 2016) that already let users navigate DBpedia via automatically learned patterns and contextual panels.
- Exploratory analysis is missing; e.g. how many patterns were found for which type, etc., would be interesting.
- Evaluation through a user-based study is good in theory, but some details are missing. It does not say how the users were selected, invited, who they are (broadly) and even how many they are until Section 6.3; this should be introduced earlier. Participants' self-report experience might raise some problems connected to self awareness. People might also have biases towards automated pattern discovery for research, especially if they are not from technical backgrounds. Likewise, familiarity with the domain is unclear. Indeed, the results on the usefulness of the interface or its clarity depends on those backgrounds.
- Pattern browser not thoroughly explored, e.g. design principles.
- The approach does not capture and highlight infrequent yet semantically important regularities that may otherwise be overlooked due to low support.
Minor comments:
- Section 3: Perquisites is a typo for Prerequisites.
- Quotation marks in Listing 2 are incorrect, and also on line 30 of p. 13.
- Modelling interestingness as future work: "Palma, Cosimo, et al. Modelling Interestingness: Stories as L-Systems and Magic Squares. In: Text2Story @ ECIR. 2023. pp. 127-133" could be a good starting point
- The framework seems to have a name in the Github repository, HypoDisc, but it is never cited in the paper.
Overall, this paper tackles a timely and practically relevant problem—helping scholars surface meaningful structures in large, heterogeneous knowledge graphs—through a well-engineered, anytime pattern-mining pipeline and a thoughtfully designed facet browser. The technical core is solid and the presentation clear. Nonetheless, the manuscript would benefit from a deeper situating in prior KG-exploration literature, fuller reporting of the exploratory statistics behind the discovered patterns, and a more transparent description of the user-study protocol and participant profile. Addressing these points, along with the minor corrections listed, should be straightforward and will significantly strengthen both the empirical credibility and the broader impact of the work. I therefore recommend minor revision.
|