Keeping It Open: A TEI-based Publication Pipeline for Historical Documents

Following the emergence of numerous projects to make use of historical archives, books, or other materials, as well as the exponentially growing needs for digital tools tailored for those tasks, the DAHN project (Dispositif de soutien à l’Archivistique et aux Humanités Numériques) developed a comple...

Full description

Saved in:
Bibliographic Details
Main Author: Floriane Chiffoleau
Format: Article
Language:deu
Published: Text Encoding Initiative Consortium 2024-11-01
Series:Journal of the Text Encoding Initiative
Subjects:
Online Access:https://journals.openedition.org/jtei/5306
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832578460776660992
author Floriane Chiffoleau
author_facet Floriane Chiffoleau
author_sort Floriane Chiffoleau
collection DOAJ
description Following the emergence of numerous projects to make use of historical archives, books, or other materials, as well as the exponentially growing needs for digital tools tailored for those tasks, the DAHN project (Dispositif de soutien à l’Archivistique et aux Humanités Numériques) developed a complete open-source pipeline made of tools and methods making it possible to present a digital scholarly edition of scanned handwritten material. Composed of six steps (digitization, segmentation, transcription, post-OCR processing, encoding, and publication) and centered on historical documents, and more particularly on ego documents, this pipeline has been built around TEI, which works as a pivot format, to ensure its robustness, sustainability, and reusability. Beyond encoding in TEI, we also chose tools compatible with it, such as eScriptorium for segmentation/transcription and TEI Publisher for the publication. To further help the people working with the pipeline, we also heavily documented the development of the pipeline, as well as its steps, to ease its reuse.
format Article
id doaj-art-18ed860344194ac39c369a4fc67fa44f
institution Kabale University
issn 2162-5603
language deu
publishDate 2024-11-01
publisher Text Encoding Initiative Consortium
record_format Article
series Journal of the Text Encoding Initiative
spelling doaj-art-18ed860344194ac39c369a4fc67fa44f2025-01-30T13:56:45ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032024-11-011510.4000/12s01Keeping It Open: A TEI-based Publication Pipeline for Historical DocumentsFloriane ChiffoleauFollowing the emergence of numerous projects to make use of historical archives, books, or other materials, as well as the exponentially growing needs for digital tools tailored for those tasks, the DAHN project (Dispositif de soutien à l’Archivistique et aux Humanités Numériques) developed a complete open-source pipeline made of tools and methods making it possible to present a digital scholarly edition of scanned handwritten material. Composed of six steps (digitization, segmentation, transcription, post-OCR processing, encoding, and publication) and centered on historical documents, and more particularly on ego documents, this pipeline has been built around TEI, which works as a pivot format, to ensure its robustness, sustainability, and reusability. Beyond encoding in TEI, we also chose tools compatible with it, such as eScriptorium for segmentation/transcription and TEI Publisher for the publication. To further help the people working with the pipeline, we also heavily documented the development of the pipeline, as well as its steps, to ease its reuse.https://journals.openedition.org/jtei/5306digital editionencoding pipelinehistorical manuscriptspublication workflow
spellingShingle Floriane Chiffoleau
Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
Journal of the Text Encoding Initiative
digital edition
encoding pipeline
historical manuscripts
publication workflow
title Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
title_full Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
title_fullStr Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
title_full_unstemmed Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
title_short Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
title_sort keeping it open a tei based publication pipeline for historical documents
topic digital edition
encoding pipeline
historical manuscripts
publication workflow
url https://journals.openedition.org/jtei/5306
work_keys_str_mv AT florianechiffoleau keepingitopenateibasedpublicationpipelineforhistoricaldocuments