Fans Reconstruct Heroes: Modeling Fictional Characters in Participatory Culture

Tracking #: 3885-5099

Authors: 
Xiaoyan Yang
Federico Pianzola

Responsible editor: 
Guest Editors 2025 OD+CH

Submission type: 
Full Paper
Abstract: 
The study of fictional characters has always been a crucial part of literary research. Characters are one of the core elements that drive the development of narratives and convey the themes and messages of literary works. With the rise of participatory culture, especially fan cultures, the way people interact with and reinterpret fictional characters has become more complex and dynamic, a change that poses unique challenges and opportunities for analyzing character representation across cultures, genres, and platforms. This paper reports on, documents, and discusses a formal ontology that enables the systematic modeling and comparison of fictional characters across cultures and reading communities. Building upon literary theory and incorporating insights from crowd-sourced online databases, the GOLEM ontology provides a framework for analyzing both canonical characterizations and fan-driven reinterpretations.
Full PDF Version: 
Tags: 
Reviewed

Decision/Status: 
Major Revision

Solicited Reviews:
Click to Expand/Collapse
Review #1
Anonymous submitted on 23/Jul/2025
Suggestion:
Major Revision
Review Comment:

The paper describes and discusses the GOLEM ontology, which was mainly developed to model fictional characters and their reinterpretations in fan fictions. The paper evaluates the ontology by considering the case study of Greek Mythology in fan fictions, populating a dataset semi-manually and running SPARQL queries corresponding to 10 competency questions.

The first two sections (Introduction, Background & State of the Art) are well written and a pleasure to read.
The third section (Existing Ontological Models for Fictional Characters) is informative and quite comprehensive. However, given the paper's focus, I expected a more detailed and formal comparison of GOLEM with respect to the existing ontologies (maybe in a following section), which is lacking.
The fourth section (Modeling Fictional Character with the GOLEM Ontology) – arguably the central section of the paper — is too imprecise and incomplete to provide a detailed explanation of the GOLEM ontology (see detailed comments below).
The section on the case study is interesting but it lacks some details, especially regarding how the dataset and competency questions has been collected.

The space for adding details could be found by moving the SPARQL encoding of the CQs to an online file, while keeping the explanation of the rationale behind the encoding of the CQs in terms of the concepts and relations included in the GOLEM ontology.

My main concern regarding this paper is how to evaluate its novelty. The authors cite their paper "Pianzola F, Cheng L, Yang X and Panache F (2025) The golem ontology for narrative and fiction. Under review.", which, from the title, appears to be devoted to describing the GOLEM ontology. Without having additional details, it is unclear how the paper submitted to SWJ differs from the cited one, and whether these differences are significant enough. For instance, I don't know if the case study is also present in the other paper. If not, I'm indeed unsure whether this difference is sufficient for publication in the SWJ (see also below my comments on the case study).

Still regarding novelty and originality, the new categories in GOLEM modules described in the submitted paper are: G0_Character_Stoof, G1_Character, G2_Character_Feature, G3_Psychological_State, G4_Social_Relationship, G5_Narrative_Event, G6_Relationship_Role. The authors introduce these notions without precisely positioning their choices within the existing debates in literature and without a precise ontological characterisation: (1) the ontological distinction between G0_Character_Stoof and G1_Character is not clearly stated (see also comments below on that point); (2) G2_Character_Feature seems a trivial reification of a concept or (quality-)type, it is not clear why crm:E55_Type is not enough; (3) G3_Psychological_State, G4_Social_Relationship, G5_Narrative_Event, and G6_Relationship_Role seem restrictions of dlp categories to characters, no additional ontological analysis is provided.

From the methodological perspective, the authors import concepts from quite different ontologies (e.g., DOLCE and CIDOC-CRM). The difficulty of this integration and the extent to which the original meaning of the imported concepts is preserved in GOLEM are unclear. For instance, if I correctly understand, G1_Character is a subclass of both crm:E89_Propositional_Object and dlp:agentive-social-object. Did the authors checked whether the first concept is compatible with the second one?
Importantly, it is not clear in which language the ontology is implemented. The author say that "[t]he finalized controlled vocabulary is implemented as a SKOS (Simple Knowledge Organization System) concept scheme" but for the other modules of the ontology the linked website only contains graphs. Do the authors have an OWL implementation? Have they checked the consistency of the whole ontology? I think a clearer picture of the implementation aspects would be important.
Finally the pages linked in the SPARQL query gd: "https://data.golemlab.eu/entity/" and gt: are empty.

DETAILED COMMENTS

- p.6 "GOLEM is designed as an extension of CIDOC-CRM and LRMoo with alignment to DOLCE-Lite-Plus."
What exactly does "extension" mean? Does GOLEM import the whole CIDOC-CRM and LRMoo? What exactly does alignment to dlp mean? How are formal consistency and conceptual/ontological coherence guaranteed? Please add technical details.

- p.6 "G1 Character that extends CIDOC CRM’s E89 Propositional Object (...) G1 Character enriches this model by incorporating DOLCE’s social object framework. This dual classification acknowledges characters as socially constructed entities whose existence depends on people’s imagination within narrative communities." (and following)
In what sense does G1 Character extend E89 Propositional Object? If G1 is intended as a subclass of both E89 and dlp social agentive object, how can a propositional object be agentive? What is a dual classification? What does it mean "grounding characters in E28 while extending their classification to DOLCE’s socially constructed agents"? Please add technical details.

- p.6 "For significant objects showing some agency (e.g., magical artifacts), we employ G16 Object as a subclass of DOLCE’s social-object"
From an ontological point of view, what distinguishes G16 Object from DOLCE’s social-object?

- p.7 Figure 1 (erroneously referred as figure 3 in several places)
(A) If this is a class diagram I don't understand why several classes have been duplicated. The caption states "examples of related characters" but I can't see any examples or instances of characters.
(B) The prefixes of the names of the concepts/classes must be explained. In particular, I don't understand what gd: refers to.
(C) The intended semantics for the adopted graphical language has not been introduced. In particular, it is unclear why some boxes are divided into three and other into two. What is the semantics and the purpose of this division?
(D) Why is the reification of types necessary? Why are two subclasses of G1_Character (FanfictionCharacter and OriginalCharacter) not sufficient?
(E) Why features are not types?
(F) It is unclear to me why the instances of Character_Stoff do not have features; this would enable the relation features_are_also_found_on to be defined in terms of the features attributed to the stoffs and to the characters. Is features_are_also_found_on also defined between characters? If not, why?
(G) It is unclear whether a Character can be associated with only a work via is_character_in [this is also linked to CQ1 that is described as "cross-narrative identity tracking"]. Why are cardinality constraints not considered in the diagram?
If the work is unique, I don't understand the claim at p.1 "the same character often exists across disparate narratives, including canonical works, fan adaptations, and transmedia expansions, each offering conflicting or evolving traits, relationships, and backstories". Do the authors here refer to the stoff?

- p.7 "we introduce the G2 Feature class as an overarching category for attributes, with subclasses that reflect the diverse nature of these textual or hermeneutic elements, which can either be found in the narrative or be the outcome of reader interpretation."
In which sense features are textual or hermeneutic elements? I can't see how the reader interpretation is represented in the framework.

- p.7 "This epistemological distinction necessitated a departure from DOLCE’s conventional quality-based approach"
As far as I understand, the qualities of DOLCE can have an epistemological nature, e.g., colors. I don't see how the epistemological dimension of features is represented in GOLEM. Furthermore, according to table 1, the name is a feature of a character. A clarification on the nature of features would be useful (also explaining why clothes is included in physical description)

- p.8 "this fundamental characteristic can be radically transformed in alternative interpretations while the character is still being recognized as a variant of Harry Potter."
On the what basis the character is recognised as a variant of Harry Potter? Just because it is still named "Harry Potter"? I expected more discussion on this crucial point, which is widely debated in the philosophical literature on the identity of characters.

- p.8 "As illustrated in Figure 3, this approach allows conflicting or divergent traits to coexist within the same overarching character concept."
I can't see how figure 1 illustrates this.

- p.8 I would move section "Controlled Vocabulary Construction of Character Features" to the case study section. To me, it seems to be more about how the ontology is populated than the (structure of the) ontology itself.

- p.8 "The G17 Character Feature class captures qualitative character attributes through a bottom-up approach to controlled vocabulary construction."
Per se, character features do not imply a bottom-up approach. This seems to be an additional assumption of the proposed framework. Furthermore, I'm not sure to understand the strategy of developing a controlled vocabulary that covers all possible character features. In a bottom-up perspective, why not focus on the features necessary for the instances of the G1_Character class considered in the examples and adding new features as needed for new instances of G1_Character?

- p.10 Figure 3
(A) It is unclear to me whether narrative events only include characters and objects as participants. Consider, for example, the London in the Sherlock Holmes novels. Is London an instance of G16_object?
(B) Again, why must the types of events be reified rather than introducing different subclasses of G5?
(C) As far as I know dlp, relations dlp:follows, dlp:precedes, etc. have a temporal nature. It is unclear if the G7_Narrative_ Sequence class is characterised in terms of these relations, which do not encode causality. Furthermore, why are these relations defined between narrative events rather than time intervals?
(D) I don't understand why the duration and the temporal location of a narrative event are both dlp:time-intervals. It seems to me that duration is a quality of temporal locations / time intervals. Is it possible to have two different events with the same temporal location but different duration?
(E) Why psychological states do not have a temporal location and a duration (but they have temporal relations)?
(F) Why is the relation between characters and psychological states has_state rather than simply participation?

- p.10 "A psychological state represents a temporal mental condition, characterized by relatively stable qualities such as emotions, motivations, beliefs, and goals."
Are beliefs, goals, etc. qualities of psychological states or are they specific kinds of psychological states? I don't understand how they are represented.

- p.11 "By structuring competency questions (CQs) as SPARQL queries, we assess GOLEM’s capacity to represent character identities, relationships, and evolving traits, revealing its advantages over other frameworks."
First, I'm unsure if CQ1-CQ10 can be classified as competency questions. The authors did not collect them from domain experts; they just stated them. In this case, I would say they are examples of queries that can be done in the proposed framework.
Secondly, I don't see how answering these CGs reveals the advantages of GOLEM over other frameworks. A serious comparison would require proving that such CQs cannot be expressed and/or answered in other frameworks.
Thirdly, the reader would benefit from seeing some concrete examples of how "evolving traits" can be represented in the proposed framework. The authors later refer to "enabling roles like “lover” and “beloved” to be temporally bound" via Social Relationships. A more explicit example would be useful. Furthermore, it is unclear how the evolution of traits such as height, weight, personality, etc., which are represented by features, is captured.

Review #2
Anonymous submitted on 13/Dec/2025
Suggestion:
Reject
Review Comment:

The paper presents a module for characters within the GOLEM ontology. While the motivation for developing GOLEM is clearly articulated, several descriptions remain at a high level of abstraction and lack concrete examples. This may hinder comprehension for readers who are not experts in literary theory. For example in the introduction when mentioning the "fan adaptations, and transmedia expansions, each offering conflicting or evolving..." or in page 7 "These psychological dimensions prove particularly challenging to model..." it would better to have specific examples.

The introduction appears to refer to the GOLEM ontology as a whole, even though only a single module is actually described in the paper (see the last comment).

Existing and related models are described and reused, but it is unclear why the Provenance Ontology (PROV-O) is neither used nor mentioned to model provenance in the evolution of entities.

The last of the introduction is confusing as it says that future directions of work consider the non-Western narrative traditions however, earlier in the introduction it seem to be one of the motivation for developing this ontology. So it is not clear whether it is finally modeled.

Authors mention a methodology for organizing and acquiring knowledge to build the ontology, but no ontology development methodology is mentioned. For example, how is the quality of the modeling evaluated? That is how it is checked that the ontology represents properly the domain intended to be modelled. Note that this is different from the use of CQs, which in fact seems to be later defined to check the ontology use rather than to drive the ontology development.

The ontology is available at https://ontology.golemlab.eu but its URI (https://w3id.org/golem/ontology) and its elements URIs (https://w3id.org/golem/ontology#G1_Character)are not derreferenceable. See https://www.w3.org/TR/swbp-vocab-pub/ and https://arxiv.org/abs/2003.13084

Page 6 "For example, as shown in Figure 3, Harry Potter...": This figure is not provided, Figure 3 is about the event module.

Page 6: Is there a list or hierarchy that operationalizes the features mentioned in "Characters’ features are the stable descriptive elements that collectively construct a character’s narrative identity."

Page 7: Is this provenance information modelled or captured "... which can either be found in the narrative or be outcome of reader interpretation"?

Is there an exhaustive version of the information in Table 1 registered? Are update and maintenance protocols defined to manage the evolution of the controlled vocabulary?

Further justification is required for the decision to model the interpersonal connenctions within a controlled vocabulary instead of part of the ontology modules.

Page 9: Please clarify what is meant by using SHACL to check the data conformtity witht he established controlled vocabulary constraints. Also, how are these constraints defined and maintained?

Page 9-10: It is unclear the benefit of attaching all phsychological state tot he character level. More detailes would be needed.

Page 11: Additional informaton about how this is modelled is required: "The GOLEM ontology offers a flexible method for modeling narrative roles, accommodating various storytelling traditions and analytical approaches."

Page 11: It is not clear who are the annotators and their relation with the presented work. Also, which % of the ontology is being validated with this dataset.

Page 11: "The query doesn’t merely map names to texts but links instances of G1 Characters to specific narrative works through the gc:GP1 is character in property. GOLEM allows us to maintain the semantic continuity of, say, ”Orpheus” across dozens of retellings, while still treating each as a distinct interpretive act tied to a particular text": Does this method need of reconciling names across sources to use the unique URIs for characters? if yes: how is this done and planned to be maintained considerig scalability and collaborative generation of data?

The link to the SKOS generated should be provided.

Figure 1 to 3 are difficult to understand, it should be stated what is a class, a datatype property, an instance, declaration vs use of properties, etc. For example Figure 1 is about examples of related characters but they doesn't seem to be instances. Include also namespaces.

Overall, I find the resouce highly interesting but I would recommend authors to publish this character module with the rest of the ontology description to provide readers a unique point of reference instead of spread and fragmented publications. Considering that the "The golem ontology for narrative and fiction" uner review paper mentioned might have been already published (https://www.mdpi.com/2076-0787/14/10/193 ?) and that there is some overlapping information, the need for this partial publication should be further explained. To include more example and user oriented documentation authors can improve the HTML version of the ontology.

Minor:

- Explain what is "database driven perception"
- Provide a diagram in the HTML documentation and if possible diagrams and ttl with examples about how to use the ontology.
- What does it refer to "Page 63" in page 5?
- Figure 3 relations (follows, precedes, et.) represented with arrows with 2 directions: Indicate the correct direction for each property.

Review #3
Anonymous submitted on 03/Apr/2026
Suggestion:
Minor Revision
Review Comment:

The paper is very interesting, and the contribution is novel. The GitHub repository and SPARQL checkpoint are well organised, valid, and ready to use. While I have little doubt regarding the technical applicability of this work, I would like to suggest some revisions regarding the premise, theorization, and contextualization. My suggestion is between a minor revision and a major revision.

1. Introduction
(1) The authors make a great effort in establishing the need for a formal, systematic approach to representing literary characters in contemporary fanfiction, using it as a key example of the various new literary formats arising from participatory culture. However, the relationship discussed between fanfiction and canon remains unclear. These two concepts are generally perceived as distinct rather than closely related. It would be helpful to clearly define the scope and establish these boundaries early on to ground readers in your narrative and make a convincing argument.

(2) The claim that "older theories may not be suitable for contemporary media landscapes" needs better justification. For readers unfamiliar with these theories, it would be helpful to clarify exactly what factors make them unsuitable today (assuming they are not so old as to be considered 'ancient'). You mention later that characters are not static -- does this imply that older frameworks define characters strictly by static qualities? If so, please elaborate on this point.

(3) "Despite the proliferation of new characters, many narratives still draw upon deep-rooted traditions. Blockbuster films ....This interplay between tradition and innovation highlights how cultural heritage continues to shape—and be reshaped by—popular media." 
This paragraph feels somewhat out of place. How exactly does "storytelling" influence character theories in this context, and how does cultural heritage relate to the main topic? The authors need to better justify these sentences if they intend to bridge these slightly discrete fields. As it stands, the link between cultural heritage and literary characters in fanfiction remains very vague.

(4) "To systematically analyze these dynamic character representations, a unified conceptual framework is essential....existing ontologies for fictional characters remain underdeveloped and exhibit several significant limitations when applied to contemporary participatory culture." 
This argument can be better justified. At the end of this paragraph, I think the directly relevant concept is fanfiction, which is under the influence of participatory culture.

(5) You argued again in the following paragraph: "existing ontological frameworks demonstrate insufficient engagement with established literary theories of character" (which feels like repetition). I am not an expert in this field, but after a bit of searching, I found some preceding work such as:
Zöllner-Weber, A. (2009). Ontologies and logic reasoning as tools in humanities? Digital Humanities Quarterly, 3(4)
Hastings, J., & Schulz, S. (2019). Representing Literary Characters and their Attributes in an Ontology. In JOWO
Sheridan, P., Mikael, O. N. S. J., & Hastings, J. The Literary Theme Ontology for Media Annotation and Information Retrieval
There are likely more. Could you elaborate on how the existing research (not necessarily those I randomly quoted) does not sufficiently engage with literary theories, if they truly do not?

Engagement with Existing Research: You state that existing frameworks demonstrate "insufficient engagement with established literary theories." I suggest looking into preceding work such as Zöllner-Weber (2009), Hastings & Schulz (2019), and the Literary Theme Ontology (Sheridan et al.). Additionally, FRBROO is a widely used standard in literary studies. Could you elaborate on how existing research fails to sufficiently engage with literary theories?

2. Methodology & Methods

(1) As mentioned earlier, I find the link between cultural heritage and literary characters in fanfiction to be very vague. Could you justify why you chose to extend a cultural heritage ontology rather than a literary ontology? They do not appear to serve the same community or handle the same types of corpora.

(2) I also noticed that there is some overlap between the major part of the technical development (the ontology) and what has already been reported in https://www.mdpi.com/2076-0787/14/10/193. It would be helpful to explicitly state what you are positioning as the new contributions in this current article.