A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers

We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870.

Bibliographic Details
Main Authors: Mariona Coll Ardanuy, David Beavan, Kaspar Beelen, Kasra Hosseini, Jon Lawrence, Katherine McDonough, Federico Nanni, Daniel van Strien, Daniel C. S. Wilson
Format: Article
Language: English
Published: Ubiquity Press 2022-01-01
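Series: Journal of Open Humanities Data
Subjects: benchmark, dataset, geographic information retrieval, newspapers, nineteenth-century english, toponym resolution
Online Access: https://openhumanitiesdata.metajnl.com/articles/56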
collection DOAJ
description We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated with mentions of places, which are linked—whenever possible—to their corresponding entry on Wikipedia. The dataset consists of 3,364 annotated toponyms, of which 2,784 have been provided with a link to Wikipedia. The dataset is published in the British Library shared research repository, and is especially of interest to researchers working on improving semantic access to historical newspaper content.
format Article
id doaj-art-be47ac23f6a24dfebcf30578bee2b6f3
institution OA Journals
issn 2059-481X
language English
publishDate 2022-01-01
publisher Ubiquity Press
record_format Article
series Journal of Open Humanities Data
spelling doaj-art-be47ac23f6a24dfebcf30578bee2b6f3; 2025-08-20T02:18:51Z; eng; Ubiquity Press; Journal of Open Humanities Data; 2059-481X; 2022-01-01; 8; 10.5334/johd.56; 54
A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
Mariona Coll Ardanuy (The Alan Turing Institute, London; Queen Mary University of London, London)
David Beavan (The Alan Turing Institute, London)
Kaspar Beelen (The Alan Turing Institute, London; Queen Mary University of London, London)
Kasra Hosseini (The Alan Turing Institute, London)
Jon Lawrence (The University of Exeter, Exeter)
Katherine McDonough (The Alan Turing Institute, London; Queen Mary University of London, London)
Federico Nanni (The Alan Turing Institute, London)
Daniel van Strien (The British Library, London)
Daniel C. S. Wilson (The Alan Turing Institute, London; Queen Mary University of London, London)
title A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
topic benchmark
dataset
geographic information retrieval
newspapers
nineteenth-century english
toponym resolution
url https://openhumanitiesdata.metajnl.com/articles/56