FAIR Derived Data in TEI and Its Publication in the TextGrid Repository

Many research projects face legal restrictions on the publication of texts. In recent decades, several projects have circumvented these restrictions by both deleting some parts of the data and publishing derived data from the original files. We discuss the limitations of the commonly used ad-hoc sol...

Bibliographic Details
Main Authors: José Calvo Tello, Mathias Göbel, Ubbo Veentjer, Stefan E. Funk, Nanette Rißler-Pipka, Keli Du
Format: Article
Language: deu
Published: Text Encoding Initiative Consortium 2025-03-01
Series: Journal of the Text Encoding Initiative
Subjects: legal issues, derived data, copyright, literature, FAIR principles, repository
Online Access: https://journals.openedition.org/jtei/5622
author José Calvo Tello
Mathias Göbel
Ubbo Veentjer
Stefan E. Funk
Nanette Rißler-Pipka
Keli Du
collection DOAJ
description Many research projects face legal restrictions on the publication of texts. In recent decades, several projects have circumvented these restrictions by both deleting some parts of the data and publishing derived data from the original files. We discuss the limitations of the commonly used ad-hoc solutions and the deprecation of the FAIR status that they cause. In contrast, we propose to model derived data in TEI, and present several variants with five corpora from different languages, genres, and periods. We also present the implementation of several features for publishing such data in the TextGrid Repository and the publication of derived data from a corpus of Spanish novels and a corpus of American plays.
format Article
id doaj-art-e8e4a8e80d1343d8b274fb879095a5e8
institution OA Journals
issn 2162-5603
language deu
publishDate 2025-03-01
publisher Text Encoding Initiative Consortium
record_format Article
series Journal of the Text Encoding Initiative
title FAIR Derived Data in TEI and Its Publication in the TextGrid Repository
topic legal issues
derived data
copyright
literature
FAIR principles
repository
url https://journals.openedition.org/jtei/5622