Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System

BackgroundObservational health data are collected in different formats and structures, making it challenging to analyze with common tools. The Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM) is a standardized data model that can harmonize observational health data.ObjectiveT...

Full description

Saved in:
Bibliographic Details
Main Authors: Michael Ochola, Sylvia Kiwuwa-Muyingo, Tathagata Bhattacharjee, David Amadi, Maureen Ng’etich, Damazo Kadengye, Henry Owoko, Boniface Igumba, Jay Greenfield, Jim Todd, Agnes Kiragga
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Digital Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fdgth.2025.1423621/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832577866079928320
author Michael Ochola
Sylvia Kiwuwa-Muyingo
Tathagata Bhattacharjee
David Amadi
Maureen Ng’etich
Damazo Kadengye
Henry Owoko
Boniface Igumba
Jay Greenfield
Jim Todd
Agnes Kiragga
Agnes Kiragga
author_facet Michael Ochola
Sylvia Kiwuwa-Muyingo
Tathagata Bhattacharjee
David Amadi
Maureen Ng’etich
Damazo Kadengye
Henry Owoko
Boniface Igumba
Jay Greenfield
Jim Todd
Agnes Kiragga
Agnes Kiragga
author_sort Michael Ochola
collection DOAJ
description BackgroundObservational health data are collected in different formats and structures, making it challenging to analyze with common tools. The Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM) is a standardized data model that can harmonize observational health data.ObjectiveThis paper demonstrates the use of the OMOP CDM to harmonize COVID-19 sero-surveillance data from the Nairobi Urban Health and Demographic Surveillance System (HDSS).MethodsIn this study, we extracted data from the Nairobi Urban HDSS COVID-19 sero-surveillance database and mapped it to the OMOP CDM. We used open-source Observational Health Data Sciences and Informatics (OHDSI) tools like WhiteRabbit, RabbitInAHat, and USAGI. The steps included data profiling (scanning), mapping the vocabularies using the offline USAGI and online ATHENA, and designing the extract, transform, and load (ETL) process using RabbitInAHat. The ETL process was implemented using Pentaho Data Integration community edition software and structured query language (SQL). The target OMOP CDM can now be used to analyze the prevalence of COVID-19 antibodies in the Nairobi Urban HDSS population.ResultsWe successfully mapped the Nairobi Urban HDSS COVID-19 sero-surveillance data to the OMOP CDM. The standardized dataset included information on demographics, COVID-19 symptoms, vaccination, and COVID-19 antibody test results.ConclusionsThe OMOP CDM is a valuable tool for harmonizing observational health data. Using the OMOP CDM facilitates the sharing and analysis of observational health data, leading to a better understanding of disease conditions and trends and improving evidence-based population health strategies.
format Article
id doaj-art-400363d8387344a0931650d5d16f6bd4
institution Kabale University
issn 2673-253X
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Digital Health
spelling doaj-art-400363d8387344a0931650d5d16f6bd42025-01-30T14:30:09ZengFrontiers Media S.A.Frontiers in Digital Health2673-253X2025-01-01710.3389/fdgth.2025.14236211423621Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance SystemMichael Ochola0Sylvia Kiwuwa-Muyingo1Tathagata Bhattacharjee2David Amadi3Maureen Ng’etich4Damazo Kadengye5Henry Owoko6Boniface Igumba7Jay Greenfield8Jim Todd9Agnes Kiragga10Agnes Kiragga11Data Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaData Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaDepartment of Population Health, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, University of London, London, United KingdomDepartment of Population Health, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, University of London, London, United KingdomData Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaData Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaData Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaData Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaMachine Learning (AI and ML), Committee on Data of the International Science Council (CODATA), Paris, FranceDepartment of Population Health, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, University of London, London, United KingdomData Science Program, African Population and Health Research Center (APHRC), Nairobi, KenyaImplementation Network for Sharing Population Information from Research Entities (INSPIRE Network), Nairobi, KenyaBackgroundObservational health data are collected in different formats and structures, making it challenging to analyze with common tools. The Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM) is a standardized data model that can harmonize observational health data.ObjectiveThis paper demonstrates the use of the OMOP CDM to harmonize COVID-19 sero-surveillance data from the Nairobi Urban Health and Demographic Surveillance System (HDSS).MethodsIn this study, we extracted data from the Nairobi Urban HDSS COVID-19 sero-surveillance database and mapped it to the OMOP CDM. We used open-source Observational Health Data Sciences and Informatics (OHDSI) tools like WhiteRabbit, RabbitInAHat, and USAGI. The steps included data profiling (scanning), mapping the vocabularies using the offline USAGI and online ATHENA, and designing the extract, transform, and load (ETL) process using RabbitInAHat. The ETL process was implemented using Pentaho Data Integration community edition software and structured query language (SQL). The target OMOP CDM can now be used to analyze the prevalence of COVID-19 antibodies in the Nairobi Urban HDSS population.ResultsWe successfully mapped the Nairobi Urban HDSS COVID-19 sero-surveillance data to the OMOP CDM. The standardized dataset included information on demographics, COVID-19 symptoms, vaccination, and COVID-19 antibody test results.ConclusionsThe OMOP CDM is a valuable tool for harmonizing observational health data. Using the OMOP CDM facilitates the sharing and analysis of observational health data, leading to a better understanding of disease conditions and trends and improving evidence-based population health strategies.https://www.frontiersin.org/articles/10.3389/fdgth.2025.1423621/fullobservational health dataOMOP CDMpopulation health dataCOVID-19sero-surveillanceNairobi Urban HDSS
spellingShingle Michael Ochola
Sylvia Kiwuwa-Muyingo
Tathagata Bhattacharjee
David Amadi
Maureen Ng’etich
Damazo Kadengye
Henry Owoko
Boniface Igumba
Jay Greenfield
Jim Todd
Agnes Kiragga
Agnes Kiragga
Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
Frontiers in Digital Health
observational health data
OMOP CDM
population health data
COVID-19
sero-surveillance
Nairobi Urban HDSS
title Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
title_full Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
title_fullStr Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
title_full_unstemmed Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
title_short Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
title_sort harmonizing population health data into omop common data model a demonstration using covid 19 sero surveillance data from nairobi urban health and demographic surveillance system
topic observational health data
OMOP CDM
population health data
COVID-19
sero-surveillance
Nairobi Urban HDSS
url https://www.frontiersin.org/articles/10.3389/fdgth.2025.1423621/full
work_keys_str_mv AT michaelochola harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT sylviakiwuwamuyingo harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT tathagatabhattacharjee harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT davidamadi harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT maureenngetich harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT damazokadengye harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT henryowoko harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT bonifaceigumba harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT jaygreenfield harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT jimtodd harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT agneskiragga harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem
AT agneskiragga harmonizingpopulationhealthdataintoomopcommondatamodelademonstrationusingcovid19serosurveillancedatafromnairobiurbanhealthanddemographicsurveillancesystem