Open Bibliographical Data Workflows and the Multilinguality Challenge

The aim of the paper is to present and analyze workflows for bibliographical data curation and research that were created during the ‘Open Bibliodata Workflows’ project realised by the Bibliographical Data Working Group from the DARIAH ERIC consortium. These workflows are available via SSH Open Mark...

Bibliographic Details
Main Authors: Vojtěch Malínek, Tomasz Umerle, Edward Gray, Ivan Heibi, Péter Király, Christiane Klaes, Przemysław Korytkowski, David Lindemann, Arianna Moretti, Charlotte Panušková, Róbert Péter, Mikko Tolonen, Aldona Tomczyńska, Ondřej Vimr
Format: Article
Language: English
Published: Ubiquity Press 2024-03-01
Series: Journal of Open Humanities Data
Subjects: bibliographic data, workflows, multilingualism, data analysis, data driven research
Online Access: https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/190
_version_ 1850151677896687616
author Vojtěch Malínek
Tomasz Umerle
Edward Gray
Ivan Heibi
Péter Király
Christiane Klaes
Przemysław Korytkowski
David Lindemann
Arianna Moretti
Charlotte Panušková
Róbert Péter
Mikko Tolonen
Aldona Tomczyńska
Ondřej Vimr
author_sort Vojtěch Malínek
collection DOAJ
description This paper presents and analyzes workflows for bibliographical data curation and research created during the ‘Open Bibliodata Workflows’ project, carried out by the Bibliographical Data Working Group of the DARIAH ERIC consortium. The workflows are available via the SSH Open Marketplace, whose role in the SSH infrastructural system is briefly introduced. Bibliodata-related workflows are needed at different levels of data creation and research, both for specific software features or data sources and for consolidating methodological aspects of bibliographical data curation. A set of five workflows showcasing various models of bibliodata-related workflows is then discussed. The first, ‘From Library Data to Research Data’, describes the conversion of library data into a dataset for data-based research. The other four centre on leveraging existing tools and services. ‘AVOBMAT: how to analyze and visualize bibliographical data and texts’ showcases a tool for combining text analysis and metadata-based research. ‘Metadata crosswalk for citation data production in OpenCitations’ gives step-by-step instructions for using the OpenCitations infrastructure, a state-of-the-art service for sharing open citation data. ‘LODification of bibliographical data: Zotero to Wikibase migration’ illustrates current developments concerning metadata in the field of Linked Open Data. Finally, the National Information Processing Institute from Poland (OPI PIB) prepared the workflow ‘Studies on science and higher education system in Poland using the RAD-on platform’, which discusses how to use its dataset for research. Analysis of these workflows reveals a particular need to address the multilinguality challenge in the bibliodata field. At the level of curation, this challenge is met by applying international standards for bibliographical data processing, which often do not prioritise the harmonization of multilingual datasets. The main curatorial techniques for resolving multilingual issues in bibliographical data are briefly outlined. The multilinguality challenge is even more prominent when tackling research questions. The article therefore closes with a proposal for a preliminary workflow for processing multilingual bibliodata.
format Article
id doaj-art-0aef7516a88849e38bb37bf677b4530c
institution OA Journals
issn 2059-481X
language English
publishDate 2024-03-01
publisher Ubiquity Press
record_format Article
series Journal of Open Humanities Data
spelling doaj-art-0aef7516a88849e38bb37bf677b4530c
2025-08-20T02:26:09Z
eng
Ubiquity Press
Journal of Open Humanities Data
2059-481X
2024-03-01
10
27
27
10.5334/johd.190
190
Open Bibliographical Data Workflows and the Multilinguality Challenge
Vojtěch Malínek (https://orcid.org/0000-0002-9553-5993), Institute of Czech Literature, Czech Academy of Sciences, Prague
Tomasz Umerle (https://orcid.org/0000-0002-7335-0568), Institute of Literary Research, Polish Academy of Sciences, Poznań
Edward Gray (https://orcid.org/0000-0002-5201-1014), DARIAH-EU/IR* Huma-Num (CNRS UAR 3598)
Ivan Heibi (https://orcid.org/0000-0001-5366-5194), Department of Classical Philology and Italian Studies – FICLIT, University of Bologna, Bologna
Péter Király (https://orcid.org/0000-0002-8749-4597), Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen
Christiane Klaes (https://orcid.org/0000-0003-4870-4392), Technische Universität Braunschweig, Universitätsbibliothek, Braunschweig
Przemysław Korytkowski (https://orcid.org/0000-0003-3504-7282), Faculty of Computer Science, West Pomeranian University of Technology in Szczecin; National Information Processing Institute, Warsaw
David Lindemann (https://orcid.org/0000-0002-8261-6882), UPV/EHU University of the Basque Country, Faculty of Arts, Vitoria-Gasteiz
Arianna Moretti (https://orcid.org/0000-0001-5486-7070), Department of Classical Philology and Italian Studies – FICLIT, University of Bologna, Bologna
Charlotte Panušková (https://orcid.org/0000-0002-3534-8440), Institute of Czech Literature, Czech Academy of Sciences, Prague
Róbert Péter (https://orcid.org/0000-0002-7972-4751), Institute of English and American Studies, University of Szeged
Mikko Tolonen (https://orcid.org/0000-0003-2892-8911), Department of Digital Humanities, University of Helsinki
Aldona Tomczyńska (https://orcid.org/0000-0002-0832-8081), National Information Processing Institute, Warsaw
Ondřej Vimr (https://orcid.org/0000-0002-9364-0685), Institute of Czech Literature, Czech Academy of Sciences, Prague
https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/190
bibliographic data
workflows
multilingualism
data analysis
data driven research
title Open Bibliographical Data Workflows and the Multilinguality Challenge
topic bibliographic data
workflows
multilingualism
data analysis
data driven research
url https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/190