One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset

Previous studies have highlighted the importance of having long term data for the study of cities, but such sources are relatively scarce. This is especially the case for data about relations between cities, which is a crucial aspect of urban dynamics. Over the last two decades, many efforts have be...

Full description

Saved in:
Bibliographic Details
Main Authors: Antoine Peris, Willem Jan Faber, Evert Meijers, Maarten van Ham
Format: Article
Language:deu
Published: Unité Mixte de Recherche 8504 Géographie-cités 2020-01-01
Series:Cybergeo
Subjects:
Online Access:https://journals.openedition.org/cybergeo/33747
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849338018109849600
author Antoine Peris
Willem Jan Faber
Evert Meijers
Maarten van Ham
author_facet Antoine Peris
Willem Jan Faber
Evert Meijers
Maarten van Ham
author_sort Antoine Peris
collection DOAJ
description Previous studies have highlighted the importance of having long term data for the study of cities, but such sources are relatively scarce. This is especially the case for data about relations between cities, which is a crucial aspect of urban dynamics. Over the last two decades, many efforts have been made to digitalize texts, including books and newspapers, which are primary sources on most of our societies. Researchers have shown that these massive digital archives can be used to identify macroscopic trends related to historical and cultural changes. The wealth of geographic information in such digital archives has not been used much, while they are very valuable for the study of cities. In this paper, we present DIGGER, a newly developed dataset that we built on Delpher, the digital archive of historical newspapers of the National Library of the Netherlands, by extracting geographical information from a selection of 102 million of news items. This dataset allowed us to study the spatial diffusion of information on and between the Dutch cities from a corpus of 81 newspapers published in 29 different cities between 1869 and 1994. This paper presents the method developed to build the dataset as well as the validation steps for the accuracy of the place name recognition. This dataset can be used to study the evolution of the Dutch urban system as well as aspects related to the spatial diffusion of information and geographical bias in media coverage.
format Article
id doaj-art-e9372eb1d696402ab13a947c1f4219a7
institution Kabale University
issn 1278-3366
language deu
publishDate 2020-01-01
publisher Unité Mixte de Recherche 8504 Géographie-cités
record_format Article
series Cybergeo
spelling doaj-art-e9372eb1d696402ab13a947c1f4219a72025-08-20T03:44:32ZdeuUnité Mixte de Recherche 8504 Géographie-citésCybergeo1278-33662020-01-0110.4000/cybergeo.33747One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER datasetAntoine PerisWillem Jan FaberEvert MeijersMaarten van HamPrevious studies have highlighted the importance of having long term data for the study of cities, but such sources are relatively scarce. This is especially the case for data about relations between cities, which is a crucial aspect of urban dynamics. Over the last two decades, many efforts have been made to digitalize texts, including books and newspapers, which are primary sources on most of our societies. Researchers have shown that these massive digital archives can be used to identify macroscopic trends related to historical and cultural changes. The wealth of geographic information in such digital archives has not been used much, while they are very valuable for the study of cities. In this paper, we present DIGGER, a newly developed dataset that we built on Delpher, the digital archive of historical newspapers of the National Library of the Netherlands, by extracting geographical information from a selection of 102 million of news items. This dataset allowed us to study the spatial diffusion of information on and between the Dutch cities from a corpus of 81 newspapers published in 29 different cities between 1869 and 1994. This paper presents the method developed to build the dataset as well as the validation steps for the accuracy of the place name recognition. This dataset can be used to study the evolution of the Dutch urban system as well as aspects related to the spatial diffusion of information and geographical bias in media coverage.https://journals.openedition.org/cybergeo/33747historydiffusiondatabasesystem of citiesflows
spellingShingle Antoine Peris
Willem Jan Faber
Evert Meijers
Maarten van Ham
One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset
Cybergeo
history
diffusion
database
system of cities
flows
title One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset
title_full One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset
title_fullStr One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset
title_full_unstemmed One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset
title_short One century of information diffusion in the Netherlands derived from a massive digital archive of historical newspapers: the DIGGER dataset
title_sort one century of information diffusion in the netherlands derived from a massive digital archive of historical newspapers the digger dataset
topic history
diffusion
database
system of cities
flows
url https://journals.openedition.org/cybergeo/33747
work_keys_str_mv AT antoineperis onecenturyofinformationdiffusioninthenetherlandsderivedfromamassivedigitalarchiveofhistoricalnewspapersthediggerdataset
AT willemjanfaber onecenturyofinformationdiffusioninthenetherlandsderivedfromamassivedigitalarchiveofhistoricalnewspapersthediggerdataset
AT evertmeijers onecenturyofinformationdiffusioninthenetherlandsderivedfromamassivedigitalarchiveofhistoricalnewspapersthediggerdataset
AT maartenvanham onecenturyofinformationdiffusioninthenetherlandsderivedfromamassivedigitalarchiveofhistoricalnewspapersthediggerdataset