CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.

<h4>Objective</h4>This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.<h4>Materials and methods</h4>The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. D...

Full description

Saved in:
Bibliographic Details
Main Authors: Gowtham A Rao, Azza Shoaibi, Rupa Makadia, Jill Hardin, Joel Swerdel, James Weaver, Erica A Voss, Mitchell M Conover, Stephen Fortin, Anthony G Sena, Chris Knoll, Nigel Hughes, James P Gilbert, Clair Blacketer, Alan Andryc, Frank DeFalco, Anthony Molinaro, Jenna Reps, Martijn J Schuemie, Patrick B Ryan
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0310634
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540227723329536
author Gowtham A Rao
Azza Shoaibi
Rupa Makadia
Jill Hardin
Joel Swerdel
James Weaver
Erica A Voss
Mitchell M Conover
Stephen Fortin
Anthony G Sena
Chris Knoll
Nigel Hughes
James P Gilbert
Clair Blacketer
Alan Andryc
Frank DeFalco
Anthony Molinaro
Jenna Reps
Martijn J Schuemie
Patrick B Ryan
author_facet Gowtham A Rao
Azza Shoaibi
Rupa Makadia
Jill Hardin
Joel Swerdel
James Weaver
Erica A Voss
Mitchell M Conover
Stephen Fortin
Anthony G Sena
Chris Knoll
Nigel Hughes
James P Gilbert
Clair Blacketer
Alan Andryc
Frank DeFalco
Anthony Molinaro
Jenna Reps
Martijn J Schuemie
Patrick B Ryan
author_sort Gowtham A Rao
collection DOAJ
description <h4>Objective</h4>This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.<h4>Materials and methods</h4>The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date. We test our framework by evaluating one PA for systemic lupus erythematosus (SLE) and two PAs for Alzheimer's disease (AD) across 10 different observational data sources.<h4>Results</h4>By utilizing CohortDiagnostics, we found that the population-level characteristics of individuals in the cohort of SLE closely matched the disease's anticipated clinical profile. Specifically, the incidence rate of SLE was consistently higher in occurrence among females. Moreover, expected clinical events like laboratory tests, treatments, and repeated diagnoses were also observed. For AD, although one PA identified considerably fewer patients, absence of notable differences in clinical characteristics between the two cohorts suggested similar specificity.<h4>Discussion</h4>We provide a practical and data-driven approach to evaluate PAs, using two clinical diseases as examples, across a network of OMOP data sources. Cohort Diagnostics can ensure the subjects identified by a specific PA align with those intended for inclusion in a research study.<h4>Conclusion</h4>Diagnostics based on large-scale population-level characterization can offer insights into the misclassification errors of PAs.
format Article
id doaj-art-077688a1cb5d470f945bdd2c68246334
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-077688a1cb5d470f945bdd2c682463342025-02-05T05:31:19ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01201e031063410.1371/journal.pone.0310634CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.Gowtham A RaoAzza ShoaibiRupa MakadiaJill HardinJoel SwerdelJames WeaverErica A VossMitchell M ConoverStephen FortinAnthony G SenaChris KnollNigel HughesJames P GilbertClair BlacketerAlan AndrycFrank DeFalcoAnthony MolinaroJenna RepsMartijn J SchuemiePatrick B Ryan<h4>Objective</h4>This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.<h4>Materials and methods</h4>The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date. We test our framework by evaluating one PA for systemic lupus erythematosus (SLE) and two PAs for Alzheimer's disease (AD) across 10 different observational data sources.<h4>Results</h4>By utilizing CohortDiagnostics, we found that the population-level characteristics of individuals in the cohort of SLE closely matched the disease's anticipated clinical profile. Specifically, the incidence rate of SLE was consistently higher in occurrence among females. Moreover, expected clinical events like laboratory tests, treatments, and repeated diagnoses were also observed. For AD, although one PA identified considerably fewer patients, absence of notable differences in clinical characteristics between the two cohorts suggested similar specificity.<h4>Discussion</h4>We provide a practical and data-driven approach to evaluate PAs, using two clinical diseases as examples, across a network of OMOP data sources. Cohort Diagnostics can ensure the subjects identified by a specific PA align with those intended for inclusion in a research study.<h4>Conclusion</h4>Diagnostics based on large-scale population-level characterization can offer insights into the misclassification errors of PAs.https://doi.org/10.1371/journal.pone.0310634
spellingShingle Gowtham A Rao
Azza Shoaibi
Rupa Makadia
Jill Hardin
Joel Swerdel
James Weaver
Erica A Voss
Mitchell M Conover
Stephen Fortin
Anthony G Sena
Chris Knoll
Nigel Hughes
James P Gilbert
Clair Blacketer
Alan Andryc
Frank DeFalco
Anthony Molinaro
Jenna Reps
Martijn J Schuemie
Patrick B Ryan
CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
PLoS ONE
title CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
title_full CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
title_fullStr CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
title_full_unstemmed CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
title_short CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
title_sort cohortdiagnostics phenotype evaluation across a network of observational data sources using population level characterization
url https://doi.org/10.1371/journal.pone.0310634
work_keys_str_mv AT gowthamarao cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT azzashoaibi cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT rupamakadia cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT jillhardin cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT joelswerdel cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT jamesweaver cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT ericaavoss cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT mitchellmconover cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT stephenfortin cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT anthonygsena cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT chrisknoll cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT nigelhughes cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT jamespgilbert cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT clairblacketer cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT alanandryc cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT frankdefalco cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT anthonymolinaro cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT jennareps cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT martijnjschuemie cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization
AT patrickbryan cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization