An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News

Abstract The rapid evolution of artificial intelligence (AI), together with the increased availability of social media and news for epidemiological surveillance, is marking a pivotal moment in epidemiology and public health research. By harnessing the capabilities of generative AI, we use an ensembl...

Full description

Saved in:
Bibliographic Details
Main Authors: Sergio Consoli, Pietro Coletti, Peter V. Markov, Lia Orfei, Indaco Biazzo, Lea Schuh, Nicolas Stefanovitch, Lorenzo Bertolini, Mario Ceresa, Nikolaos I. Stilianakis
Format: Article
Language:English
Published: Nature Portfolio 2025-06-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-05276-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849691612287860736
author Sergio Consoli
Pietro Coletti
Peter V. Markov
Lia Orfei
Indaco Biazzo
Lea Schuh
Nicolas Stefanovitch
Lorenzo Bertolini
Mario Ceresa
Nikolaos I. Stilianakis
author_facet Sergio Consoli
Pietro Coletti
Peter V. Markov
Lia Orfei
Indaco Biazzo
Lea Schuh
Nicolas Stefanovitch
Lorenzo Bertolini
Mario Ceresa
Nikolaos I. Stilianakis
author_sort Sergio Consoli
collection DOAJ
description Abstract The rapid evolution of artificial intelligence (AI), together with the increased availability of social media and news for epidemiological surveillance, is marking a pivotal moment in epidemiology and public health research. By harnessing the capabilities of generative AI, we use an ensemble approach which incorporates multiple Large Language Models (LLMs) to extract useful epidemiological information for analysis from the World Health Organization (WHO) Disease Outbreak News (DONs). DONs is a collection of regular reports on global outbreaks curated by the WHO with the adopted decision-making processes to respond to them. The extracted information is made available in a knowledge graph, referred to as eKG, derived to provide a nuanced representation of the public health domain knowledge. We provide an overview of this new dataset and describe the structure of eKG, along with the services and tools used to access and utilize the data that we are building on top. These innovative data resources open altogether new opportunities for epidemiological research, and the analysis and surveillance of disease outbreaks.
format Article
id doaj-art-5df01193f2b047deb8011ef15c8f0a94
institution DOAJ
issn 2052-4463
language English
publishDate 2025-06-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-5df01193f2b047deb8011ef15c8f0a942025-08-20T03:20:59ZengNature PortfolioScientific Data2052-44632025-06-0112111910.1038/s41597-025-05276-2An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak NewsSergio Consoli0Pietro Coletti1Peter V. Markov2Lia Orfei3Indaco Biazzo4Lea Schuh5Nicolas Stefanovitch6Lorenzo Bertolini7Mario Ceresa8Nikolaos I. Stilianakis9European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)European Commission, Joint Research Centre (JRC)Abstract The rapid evolution of artificial intelligence (AI), together with the increased availability of social media and news for epidemiological surveillance, is marking a pivotal moment in epidemiology and public health research. By harnessing the capabilities of generative AI, we use an ensemble approach which incorporates multiple Large Language Models (LLMs) to extract useful epidemiological information for analysis from the World Health Organization (WHO) Disease Outbreak News (DONs). DONs is a collection of regular reports on global outbreaks curated by the WHO with the adopted decision-making processes to respond to them. The extracted information is made available in a knowledge graph, referred to as eKG, derived to provide a nuanced representation of the public health domain knowledge. We provide an overview of this new dataset and describe the structure of eKG, along with the services and tools used to access and utilize the data that we are building on top. These innovative data resources open altogether new opportunities for epidemiological research, and the analysis and surveillance of disease outbreaks.https://doi.org/10.1038/s41597-025-05276-2
spellingShingle Sergio Consoli
Pietro Coletti
Peter V. Markov
Lia Orfei
Indaco Biazzo
Lea Schuh
Nicolas Stefanovitch
Lorenzo Bertolini
Mario Ceresa
Nikolaos I. Stilianakis
An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News
Scientific Data
title An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News
title_full An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News
title_fullStr An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News
title_full_unstemmed An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News
title_short An epidemiological knowledge graph extracted from the World Health Organization’s Disease Outbreak News
title_sort epidemiological knowledge graph extracted from the world health organization s disease outbreak news
url https://doi.org/10.1038/s41597-025-05276-2
work_keys_str_mv AT sergioconsoli anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT pietrocoletti anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT petervmarkov anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT liaorfei anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT indacobiazzo anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT leaschuh anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT nicolasstefanovitch anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT lorenzobertolini anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT marioceresa anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT nikolaosistilianakis anepidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT sergioconsoli epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT pietrocoletti epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT petervmarkov epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT liaorfei epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT indacobiazzo epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT leaschuh epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT nicolasstefanovitch epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT lorenzobertolini epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT marioceresa epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews
AT nikolaosistilianakis epidemiologicalknowledgegraphextractedfromtheworldhealthorganizationsdiseaseoutbreaknews