Open Bibliographical Data Workflows and the Multilinguality Challenge
The aim of the paper is to present and analyze workflows for bibliographical data curation and research that were created during the ‘Open Bibliodata Workflows’ project realised by the Bibliographical Data Working Group from the DARIAH ERIC consortium. These workflows are available via SSH Open Mark...
Saved in:
| Main Authors: | , , , , , , , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Ubiquity Press
2024-03-01
|
| Series: | Journal of Open Humanities Data |
| Subjects: | |
| Online Access: | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/190 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850151677896687616 |
|---|---|
| author | Vojtěch Malínek Tomasz Umerle Edward Gray Ivan Heibi Péter Király Christiane Klaes Przemysław Korytkowski David Lindemann Arianna Moretti Charlotte Panušková Róbert Péter Mikko Tolonen Aldona Tomczyńska Ondřej Vimr |
| author_facet | Vojtěch Malínek Tomasz Umerle Edward Gray Ivan Heibi Péter Király Christiane Klaes Przemysław Korytkowski David Lindemann Arianna Moretti Charlotte Panušková Róbert Péter Mikko Tolonen Aldona Tomczyńska Ondřej Vimr |
| author_sort | Vojtěch Malínek |
| collection | DOAJ |
| description | The aim of the paper is to present and analyze workflows for bibliographical data curation and research that were created during the ‘Open Bibliodata Workflows’ project realised by the Bibliographical Data Working Group from the DARIAH ERIC consortium. These workflows are available via SSH Open Marketplace. Its role in the SSH infrastructural system is subsequently shortly introduced. Bibliodata-related workflows are needed at different levels of data creation and research, both for specific software features or data sources as well as for consolidating methodological aspects of bibliographical data curation. Set of five workflows showcasing various models of bibliodata related workflows is discussed afterwards. First of these workflows, From Library Data to Research Data describes conversion of library data into a dataset for data-based research. The other four are centred around leveraging existing tools and services. AVOBMAT: how to analyze and visualize bibliographical data and texts showcases a tool for combining text analysis and metadata-based research. Metadata crosswalk for citation data production in OpenCitations is a step-by-step instruction for using the OpenCitations infrastructure, a state-of-the-art service for sharing open citation data. LODification of bibliographical data: Zotero to Wikibase migration illustrates current dynamic developments concerning metadata in the field of Linked Open Data. Finally, the National Information Processing Institute from Poland (OPI PIB) prepared a workflow Studies on science and higher education system in Poland using the RAD-on platform, discussing how to use their dataset for research. Analysis of these workflows reveals particular needs to address the multilinguality challenge in the bibliodata field. On the level of curation this challenge is met with application of international standards for bibliographical data processing that on many occasions do not prioritise harmonization of multilingual datasets. The main curatorial techniques on how to solve multilingual issues in bibliographical data are briefly outlined. When we are tackling research questions the multilinguality challenge is even more prominent. Hence we are closing this article with a proposal for a preliminary workflow for processing multilingual bibliodata. |
| format | Article |
| id | doaj-art-0aef7516a88849e38bb37bf677b4530c |
| institution | OA Journals |
| issn | 2059-481X |
| language | English |
| publishDate | 2024-03-01 |
| publisher | Ubiquity Press |
| record_format | Article |
| series | Journal of Open Humanities Data |
| spelling | doaj-art-0aef7516a88849e38bb37bf677b4530c2025-08-20T02:26:09ZengUbiquity PressJournal of Open Humanities Data2059-481X2024-03-0110272710.5334/johd.190190Open Bibliographical Data Workflows and the Multilinguality ChallengeVojtěch Malínek0https://orcid.org/0000-0002-9553-5993Tomasz Umerle1https://orcid.org/0000-0002-7335-0568Edward Gray2https://orcid.org/0000-0002-5201-1014Ivan Heibi3https://orcid.org/0000-0001-5366-5194Péter Király4https://orcid.org/0000-0002-8749-4597Christiane Klaes5https://orcid.org/0000-0003-4870-4392Przemysław Korytkowski6https://orcid.org/0000-0003-3504-7282David Lindemann7https://orcid.org/0000-0002-8261-6882Arianna Moretti8https://orcid.org/0000-0001-5486-7070Charlotte Panušková9https://orcid.org/0000-0002-3534-8440Róbert Péter10https://orcid.org/0000-0002-7972-4751Mikko Tolonen11https://orcid.org/0000-0003-2892-8911Aldona Tomczyńska12https://orcid.org/0000-0002-0832-8081Ondřej Vimr13https://orcid.org/0000-0002-9364-0685Institute of Czech Literature, Czech Academy of Sciences, PragueInstitute of Literary Research, Polish Academy of Sciences, PoznańDARIAH-EU/IR* Huma-Num (CNRS UAR 3598)Department of Classical Philology and Italian Studies – FICLIT, University of Bologna, BolognaGesellschaft für wissenschaftliche Datenverarbeitung mbH GöttingenTechnische Universität Braunschweig, Universitätsbibliothek, BraunschweigFaculty of Computer Science, West Pomeranian University of Technology in Szczecin; National Information Processing Institute, WarsawUPV/EHU University of the Basque Country, Faculty of Arts, Vitoria-GasteizDepartment of Classical Philology and Italian Studies – FICLIT, University of Bologna, BolognaInstitute of Czech Literature, Czech Academy of Sciences, PragueInstitute of English and American Studies, University of SzegedDepartment of Digital Humanities, University of HelsinkiNational Information Processing Institute, WarsawInstitute of Czech Literature, Czech Academy of Sciences, PragueThe aim of the paper is to present and analyze workflows for bibliographical data curation and research that were created during the ‘Open Bibliodata Workflows’ project realised by the Bibliographical Data Working Group from the DARIAH ERIC consortium. These workflows are available via SSH Open Marketplace. Its role in the SSH infrastructural system is subsequently shortly introduced. Bibliodata-related workflows are needed at different levels of data creation and research, both for specific software features or data sources as well as for consolidating methodological aspects of bibliographical data curation. Set of five workflows showcasing various models of bibliodata related workflows is discussed afterwards. First of these workflows, From Library Data to Research Data describes conversion of library data into a dataset for data-based research. The other four are centred around leveraging existing tools and services. AVOBMAT: how to analyze and visualize bibliographical data and texts showcases a tool for combining text analysis and metadata-based research. Metadata crosswalk for citation data production in OpenCitations is a step-by-step instruction for using the OpenCitations infrastructure, a state-of-the-art service for sharing open citation data. LODification of bibliographical data: Zotero to Wikibase migration illustrates current dynamic developments concerning metadata in the field of Linked Open Data. Finally, the National Information Processing Institute from Poland (OPI PIB) prepared a workflow Studies on science and higher education system in Poland using the RAD-on platform, discussing how to use their dataset for research. Analysis of these workflows reveals particular needs to address the multilinguality challenge in the bibliodata field. On the level of curation this challenge is met with application of international standards for bibliographical data processing that on many occasions do not prioritise harmonization of multilingual datasets. The main curatorial techniques on how to solve multilingual issues in bibliographical data are briefly outlined. When we are tackling research questions the multilinguality challenge is even more prominent. Hence we are closing this article with a proposal for a preliminary workflow for processing multilingual bibliodata.https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/190bibliographic dataworkflowsmultilingualismdata analysisdata driven research |
| spellingShingle | Vojtěch Malínek Tomasz Umerle Edward Gray Ivan Heibi Péter Király Christiane Klaes Przemysław Korytkowski David Lindemann Arianna Moretti Charlotte Panušková Róbert Péter Mikko Tolonen Aldona Tomczyńska Ondřej Vimr Open Bibliographical Data Workflows and the Multilinguality Challenge Journal of Open Humanities Data bibliographic data workflows multilingualism data analysis data driven research |
| title | Open Bibliographical Data Workflows and the Multilinguality Challenge |
| title_full | Open Bibliographical Data Workflows and the Multilinguality Challenge |
| title_fullStr | Open Bibliographical Data Workflows and the Multilinguality Challenge |
| title_full_unstemmed | Open Bibliographical Data Workflows and the Multilinguality Challenge |
| title_short | Open Bibliographical Data Workflows and the Multilinguality Challenge |
| title_sort | open bibliographical data workflows and the multilinguality challenge |
| topic | bibliographic data workflows multilingualism data analysis data driven research |
| url | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/190 |
| work_keys_str_mv | AT vojtechmalinek openbibliographicaldataworkflowsandthemultilingualitychallenge AT tomaszumerle openbibliographicaldataworkflowsandthemultilingualitychallenge AT edwardgray openbibliographicaldataworkflowsandthemultilingualitychallenge AT ivanheibi openbibliographicaldataworkflowsandthemultilingualitychallenge AT peterkiraly openbibliographicaldataworkflowsandthemultilingualitychallenge AT christianeklaes openbibliographicaldataworkflowsandthemultilingualitychallenge AT przemysławkorytkowski openbibliographicaldataworkflowsandthemultilingualitychallenge AT davidlindemann openbibliographicaldataworkflowsandthemultilingualitychallenge AT ariannamoretti openbibliographicaldataworkflowsandthemultilingualitychallenge AT charlottepanuskova openbibliographicaldataworkflowsandthemultilingualitychallenge AT robertpeter openbibliographicaldataworkflowsandthemultilingualitychallenge AT mikkotolonen openbibliographicaldataworkflowsandthemultilingualitychallenge AT aldonatomczynska openbibliographicaldataworkflowsandthemultilingualitychallenge AT ondrejvimr openbibliographicaldataworkflowsandthemultilingualitychallenge |