The NIKAW Project: An Infrastructure of Texts, Entities and Language Models to Study the Circulation of Knowledge in the Ancient World
Identifier (Artikel)
Abstract
This paper presents the foundational work of the interdisciplinary project NIKAW (Networks of Ideas and Knowledge in the Ancient World), which aims to analyse social networks in ancient Greek and Latin texts through mentions of historical figures. As a critical first step, we address the challenge of Named Entity Recognition (NER) for these languages by leveraging transformer-based models enriched with domain-specific knowledge. Our experiments highlight data sparsity and annotation inconsistencies as key bottlenecks for model performance. In the second phase, we introduce a pipeline for Named Entity Linking (NEL), utilizing the Wikisource edition of the Pauly-Wissowa Encyclopedy as a knowledge base. We detail the creation of silver-standard (automatically annotated) and gold-standard (human-verified) training datasets, and report preliminary results from fine-tuning the BLINK model for NEL.
Statistiken

Lizenz

Dieses Werk steht unter der Lizenz Creative Commons Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International.


