CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.
<h4>Objective</h4>This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.<h4>Materials and methods</h4>The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. D...
Saved in:
Main Authors: | , , , , , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2025-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0310634 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832540227723329536 |
---|---|
author | Gowtham A Rao Azza Shoaibi Rupa Makadia Jill Hardin Joel Swerdel James Weaver Erica A Voss Mitchell M Conover Stephen Fortin Anthony G Sena Chris Knoll Nigel Hughes James P Gilbert Clair Blacketer Alan Andryc Frank DeFalco Anthony Molinaro Jenna Reps Martijn J Schuemie Patrick B Ryan |
author_facet | Gowtham A Rao Azza Shoaibi Rupa Makadia Jill Hardin Joel Swerdel James Weaver Erica A Voss Mitchell M Conover Stephen Fortin Anthony G Sena Chris Knoll Nigel Hughes James P Gilbert Clair Blacketer Alan Andryc Frank DeFalco Anthony Molinaro Jenna Reps Martijn J Schuemie Patrick B Ryan |
author_sort | Gowtham A Rao |
collection | DOAJ |
description | <h4>Objective</h4>This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.<h4>Materials and methods</h4>The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date. We test our framework by evaluating one PA for systemic lupus erythematosus (SLE) and two PAs for Alzheimer's disease (AD) across 10 different observational data sources.<h4>Results</h4>By utilizing CohortDiagnostics, we found that the population-level characteristics of individuals in the cohort of SLE closely matched the disease's anticipated clinical profile. Specifically, the incidence rate of SLE was consistently higher in occurrence among females. Moreover, expected clinical events like laboratory tests, treatments, and repeated diagnoses were also observed. For AD, although one PA identified considerably fewer patients, absence of notable differences in clinical characteristics between the two cohorts suggested similar specificity.<h4>Discussion</h4>We provide a practical and data-driven approach to evaluate PAs, using two clinical diseases as examples, across a network of OMOP data sources. Cohort Diagnostics can ensure the subjects identified by a specific PA align with those intended for inclusion in a research study.<h4>Conclusion</h4>Diagnostics based on large-scale population-level characterization can offer insights into the misclassification errors of PAs. |
format | Article |
id | doaj-art-077688a1cb5d470f945bdd2c68246334 |
institution | Kabale University |
issn | 1932-6203 |
language | English |
publishDate | 2025-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj-art-077688a1cb5d470f945bdd2c682463342025-02-05T05:31:19ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01201e031063410.1371/journal.pone.0310634CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.Gowtham A RaoAzza ShoaibiRupa MakadiaJill HardinJoel SwerdelJames WeaverErica A VossMitchell M ConoverStephen FortinAnthony G SenaChris KnollNigel HughesJames P GilbertClair BlacketerAlan AndrycFrank DeFalcoAnthony MolinaroJenna RepsMartijn J SchuemiePatrick B Ryan<h4>Objective</h4>This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.<h4>Materials and methods</h4>The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date. We test our framework by evaluating one PA for systemic lupus erythematosus (SLE) and two PAs for Alzheimer's disease (AD) across 10 different observational data sources.<h4>Results</h4>By utilizing CohortDiagnostics, we found that the population-level characteristics of individuals in the cohort of SLE closely matched the disease's anticipated clinical profile. Specifically, the incidence rate of SLE was consistently higher in occurrence among females. Moreover, expected clinical events like laboratory tests, treatments, and repeated diagnoses were also observed. For AD, although one PA identified considerably fewer patients, absence of notable differences in clinical characteristics between the two cohorts suggested similar specificity.<h4>Discussion</h4>We provide a practical and data-driven approach to evaluate PAs, using two clinical diseases as examples, across a network of OMOP data sources. Cohort Diagnostics can ensure the subjects identified by a specific PA align with those intended for inclusion in a research study.<h4>Conclusion</h4>Diagnostics based on large-scale population-level characterization can offer insights into the misclassification errors of PAs.https://doi.org/10.1371/journal.pone.0310634 |
spellingShingle | Gowtham A Rao Azza Shoaibi Rupa Makadia Jill Hardin Joel Swerdel James Weaver Erica A Voss Mitchell M Conover Stephen Fortin Anthony G Sena Chris Knoll Nigel Hughes James P Gilbert Clair Blacketer Alan Andryc Frank DeFalco Anthony Molinaro Jenna Reps Martijn J Schuemie Patrick B Ryan CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization. PLoS ONE |
title | CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization. |
title_full | CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization. |
title_fullStr | CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization. |
title_full_unstemmed | CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization. |
title_short | CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization. |
title_sort | cohortdiagnostics phenotype evaluation across a network of observational data sources using population level characterization |
url | https://doi.org/10.1371/journal.pone.0310634 |
work_keys_str_mv | AT gowthamarao cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT azzashoaibi cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT rupamakadia cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT jillhardin cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT joelswerdel cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT jamesweaver cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT ericaavoss cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT mitchellmconover cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT stephenfortin cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT anthonygsena cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT chrisknoll cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT nigelhughes cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT jamespgilbert cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT clairblacketer cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT alanandryc cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT frankdefalco cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT anthonymolinaro cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT jennareps cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT martijnjschuemie cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization AT patrickbryan cohortdiagnosticsphenotypeevaluationacrossanetworkofobservationaldatasourcesusingpopulationlevelcharacterization |