Keeping It Open: A TEI-based Publication Pipeline for Historical Documents
Following the emergence of numerous projects to make use of historical archives, books, or other materials, as well as the exponentially growing needs for digital tools tailored for those tasks, the DAHN project (Dispositif de soutien à l’Archivistique et aux Humanités Numériques) developed a comple...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | deu |
Published: |
Text Encoding Initiative Consortium
2024-11-01
|
Series: | Journal of the Text Encoding Initiative |
Subjects: | |
Online Access: | https://journals.openedition.org/jtei/5306 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832578460776660992 |
---|---|
author | Floriane Chiffoleau |
author_facet | Floriane Chiffoleau |
author_sort | Floriane Chiffoleau |
collection | DOAJ |
description | Following the emergence of numerous projects to make use of historical archives, books, or other materials, as well as the exponentially growing needs for digital tools tailored for those tasks, the DAHN project (Dispositif de soutien à l’Archivistique et aux Humanités Numériques) developed a complete open-source pipeline made of tools and methods making it possible to present a digital scholarly edition of scanned handwritten material. Composed of six steps (digitization, segmentation, transcription, post-OCR processing, encoding, and publication) and centered on historical documents, and more particularly on ego documents, this pipeline has been built around TEI, which works as a pivot format, to ensure its robustness, sustainability, and reusability. Beyond encoding in TEI, we also chose tools compatible with it, such as eScriptorium for segmentation/transcription and TEI Publisher for the publication. To further help the people working with the pipeline, we also heavily documented the development of the pipeline, as well as its steps, to ease its reuse. |
format | Article |
id | doaj-art-18ed860344194ac39c369a4fc67fa44f |
institution | Kabale University |
issn | 2162-5603 |
language | deu |
publishDate | 2024-11-01 |
publisher | Text Encoding Initiative Consortium |
record_format | Article |
series | Journal of the Text Encoding Initiative |
spelling | doaj-art-18ed860344194ac39c369a4fc67fa44f2025-01-30T13:56:45ZdeuText Encoding Initiative ConsortiumJournal of the Text Encoding Initiative2162-56032024-11-011510.4000/12s01Keeping It Open: A TEI-based Publication Pipeline for Historical DocumentsFloriane ChiffoleauFollowing the emergence of numerous projects to make use of historical archives, books, or other materials, as well as the exponentially growing needs for digital tools tailored for those tasks, the DAHN project (Dispositif de soutien à l’Archivistique et aux Humanités Numériques) developed a complete open-source pipeline made of tools and methods making it possible to present a digital scholarly edition of scanned handwritten material. Composed of six steps (digitization, segmentation, transcription, post-OCR processing, encoding, and publication) and centered on historical documents, and more particularly on ego documents, this pipeline has been built around TEI, which works as a pivot format, to ensure its robustness, sustainability, and reusability. Beyond encoding in TEI, we also chose tools compatible with it, such as eScriptorium for segmentation/transcription and TEI Publisher for the publication. To further help the people working with the pipeline, we also heavily documented the development of the pipeline, as well as its steps, to ease its reuse.https://journals.openedition.org/jtei/5306digital editionencoding pipelinehistorical manuscriptspublication workflow |
spellingShingle | Floriane Chiffoleau Keeping It Open: A TEI-based Publication Pipeline for Historical Documents Journal of the Text Encoding Initiative digital edition encoding pipeline historical manuscripts publication workflow |
title | Keeping It Open: A TEI-based Publication Pipeline for Historical Documents |
title_full | Keeping It Open: A TEI-based Publication Pipeline for Historical Documents |
title_fullStr | Keeping It Open: A TEI-based Publication Pipeline for Historical Documents |
title_full_unstemmed | Keeping It Open: A TEI-based Publication Pipeline for Historical Documents |
title_short | Keeping It Open: A TEI-based Publication Pipeline for Historical Documents |
title_sort | keeping it open a tei based publication pipeline for historical documents |
topic | digital edition encoding pipeline historical manuscripts publication workflow |
url | https://journals.openedition.org/jtei/5306 |
work_keys_str_mv | AT florianechiffoleau keepingitopenateibasedpublicationpipelineforhistoricaldocuments |