Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities

This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection...

Full description

Saved in:
Bibliographic Details
Main Authors: Renata Vieira, Fernanda Olival, Helena Freire Cameron, Joaquim Santos, Ofélia Sequeira, Ivo Santos
Format: Article
Language:English
Published: Ubiquity Press 2021-09-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://openhumanitiesdata.metajnl.com/articles/43
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850177742445740032
author Renata Vieira
Fernanda Olival
Helena Freire Cameron
Joaquim Santos
Ofélia Sequeira
Ivo Santos
author_facet Renata Vieira
Fernanda Olival
Helena Freire Cameron
Joaquim Santos
Ofélia Sequeira
Ivo Santos
author_sort Renata Vieira
collection DOAJ
description This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection where two researchers annotated a portion of it manually for evaluation purposes. In this dataset, we provide the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians, allowing, for instance, comparative analyses between parishes and regions or to calculate the area of influence of a locality. The paper describes the creation and evaluation of the corpus, discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation.
format Article
id doaj-art-296843aedd364e8db2054ff51618c756
institution OA Journals
issn 2059-481X
language English
publishDate 2021-09-01
publisher Ubiquity Press
record_format Article
series Journal of Open Humanities Data
spelling doaj-art-296843aedd364e8db2054ff51618c7562025-08-20T02:18:55ZengUbiquity PressJournal of Open Humanities Data2059-481X2021-09-01710.5334/johd.4339Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named EntitiesRenata Vieira0Fernanda Olival1Helena Freire Cameron2Joaquim Santos3Ofélia Sequeira4Ivo Santos5CIDEHUS, University of ÉvoraCIDEHUS, University of Évora, Portugal; Department of History – University of ÉvoraCIDEHUS, University of Évora, Portugal; VALORIZA-Polytechnics of PortalegreDepartment of Informatics, University of ÉvoraCIDEHUS, University of ÉvoraCIDEHUS, University of ÉvoraThis work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection where two researchers annotated a portion of it manually for evaluation purposes. In this dataset, we provide the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians, allowing, for instance, comparative analyses between parishes and regions or to calculate the area of influence of a locality. The paper describes the creation and evaluation of the corpus, discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation.https://openhumanitiesdata.metajnl.com/articles/43digital historyhistorical sources (18th century)named entity recognitioninformation extractionnamed entity evaluationnamed entity labelled corpus
spellingShingle Renata Vieira
Fernanda Olival
Helena Freire Cameron
Joaquim Santos
Ofélia Sequeira
Ivo Santos
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
Journal of Open Humanities Data
digital history
historical sources (18th century)
named entity recognition
information extraction
named entity evaluation
named entity labelled corpus
title Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_full Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_fullStr Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_full_unstemmed Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_short Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_sort enriching the 1758 portuguese parish memories alentejo with named entities
topic digital history
historical sources (18th century)
named entity recognition
information extraction
named entity evaluation
named entity labelled corpus
url https://openhumanitiesdata.metajnl.com/articles/43
work_keys_str_mv AT renatavieira enrichingthe1758portugueseparishmemoriesalentejowithnamedentities
AT fernandaolival enrichingthe1758portugueseparishmemoriesalentejowithnamedentities
AT helenafreirecameron enrichingthe1758portugueseparishmemoriesalentejowithnamedentities
AT joaquimsantos enrichingthe1758portugueseparishmemoriesalentejowithnamedentities
AT ofeliasequeira enrichingthe1758portugueseparishmemoriesalentejowithnamedentities
AT ivosantos enrichingthe1758portugueseparishmemoriesalentejowithnamedentities