Real-time data in cancer registries: Validation of an automated data extraction system

Summary: Timely surveillance of cancer treatment requires real-time integration of electronic health records (EHR) data into population-based registries. We validated data from the Datagateway, an automated system that harmonizes structured EHR data across hospitals into a common model to support ne...

Full description

Saved in:
Bibliographic Details
Main Authors: Sylvie A.M. Langhout, Sjoerd J.F. Hermans, Anna J.T. Smit, Elizabeth Berkx, Sophie A. Kurk, Keetje J. Schade, Eduardus F.M. Posthuma, Otto Visser, Jan J. Cornelissen, Peter C. Huijgens, Jurjen Versluis, Maarten van der Wilt, Avinash G. Dinmohamed
Format: Article
Language:English
Published: Elsevier 2025-08-01
Series:iScience
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2589004225013173
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Summary: Timely surveillance of cancer treatment requires real-time integration of electronic health records (EHR) data into population-based registries. We validated data from the Datagateway, an automated system that harmonizes structured EHR data across hospitals into a common model to support near real-time enrichment of the Netherlands Cancer Registry (NCR). Data from patients with acute myeloid leukemia, multiple myeloma, lung cancer, and breast cancer were extracted via the Datagateway and compared to NCR data and EHR source data. The system achieved 100% accuracy compared to registered NCR diagnoses, and an accuracy of 95% when comparing new diagnoses to the NCR inclusion criteria. Treatment was correctly identified in all cases, with only 3% of combination therapies misclassified. Laboratory values matched virtually completely; toxicity indicators showed 72%–100% accuracy. Automated real-time EHR data integration using a harmonized model is feasible and reliable, enabling scalable, high-quality support for real-world oncology research.
ISSN:2589-0042