The InTaVia Knowledge Graph – European National Biographical and Cultural Heritage Object Data

Tracking #: 4090-5304

This paper is currently under review
Authors: 
Matthias Schlögl
Jouni Tuominen1
Joonas Kesäniemi
Petri Leskinen
Go Sugimoto
Victor de Boer

Responsible editor: 
Guest Editors 2025 OD+CH

Submission type: 
Dataset Description
Abstract: 
The InTaVia Knowledge Graph (IKG) is a large Knowledge Graph containing heterogeneous multilingual data from four European national biographies, connected to related cultural heritage objects. A key motivation for the construction of the resource is that this biographical information was fragmented across heterogeneous sources, and integrating it into a unified knowledge graph enables cross-source analysis, discovery of hidden historical patterns, and reusable infrastructure for large-scale digital humanities and linked data research. The IKG provides researchers, heritage professionals, and the informed public access to such biographical information. This paper describes the source data, the data model, the pipeline components for managing and harmonizing the data and the resulting knowledge graph. The data model combines domain standards CIDOC CRM and Bio CRM with elements to represent multiple perspectives on biographical information. The knowledge graph was consolidated from four prosopographical databases (PDBs) and enriched with links to Cultural Heritage Objects (CHOs) from Europeana and Wikidata. The resulting knowledge graph as information about 112,050 persons, described by 257,673 person proxies. In addition to the data model and the data itself, we also describe the infrastructure used to harmonize and maintain this heterogeneous knowledge graph.
Full PDF Version: 
Tags: 
Under Review