Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification

Continuous monitoring of fetal heart rate (FHR) and uterine contractions (UC), otherwise known as cardiotocography (CTG), is often used to assess the risk of fetal compromise during labor. However, interpreting CTG recordings visually is challenging for clinicians, given the complexity of CTG patter...

Full description

Saved in:

Bibliographic Details
Main Authors:	Lochana Mendis, Debjyoti Karmakar, Marimuthu Palaniswami, Fiona Brownfoot, Emerson Keenan
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Journal of Translational Engineering in Health and Medicine
Subjects:	Cardiotocography deep learning fetal compromise fetal heart rate time-series classification
Online Access:	https://ieeexplore.ieee.org/document/10912500/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850064291710894080
author	Lochana Mendis Debjyoti Karmakar Marimuthu Palaniswami Fiona Brownfoot Emerson Keenan
author_facet	Lochana Mendis Debjyoti Karmakar Marimuthu Palaniswami Fiona Brownfoot Emerson Keenan
author_sort	Lochana Mendis
collection	DOAJ
description	Continuous monitoring of fetal heart rate (FHR) and uterine contractions (UC), otherwise known as cardiotocography (CTG), is often used to assess the risk of fetal compromise during labor. However, interpreting CTG recordings visually is challenging for clinicians, given the complexity of CTG patterns, leading to poor sensitivity. Efforts to address this issue have focused on data-driven deep-learning methods to detect fetal compromise automatically. However, their progress is impeded by limited CTG training datasets and the absence of a standardized evaluation workflow, hindering algorithm comparisons. In this study, we use a private CTG dataset of 9,887 CTG recordings with pH measurements and 552 CTG recordings from the open-access CTU-UHB dataset to conduct a cross-database evaluation of six deep-learning models for fetal compromise detection. We explore the impact of input selection of FHR and UC signals, signal pre-processing, downsampling frequency, and the influence of removing intermediate pH samples from the training dataset. Our findings reveal that using only FHR and pre-processing FHR with artefact removal and interpolation provides a significant improvement to classification performance for some model architectures while excluding intermediate pH samples did not significantly improve performance for any model. From our comparison of the six models, ResNet exhibited the strongest fetal compromise classification performance across both databases at a downsampling rate of 1Hz. Finally, class activation maps from highly contributing signal regions in the ResNet model aligned with clinical knowledge of compromised FHR patterns, highlighting the model’s interpretability. These insights may serve as a standardized reference for developing and comparing future works in this domain. Clinical and Translational Impact: This study provides a standardized workflow for comparing deep-learning methods for CTG classification. Ensuring new methods show generalizability and interpretability will improve their robustness and applicability in clinical settings.
format	Article
id	doaj-art-4acf503492264be7bbd5c11d879bdbc2
institution	DOAJ
issn	2168-2372
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Journal of Translational Engineering in Health and Medicine
spelling	doaj-art-4acf503492264be7bbd5c11d879bdbc22025-08-20T02:49:20ZengIEEEIEEE Journal of Translational Engineering in Health and Medicine2168-23722025-01-011312313510.1109/JTEHM.2025.354840110912500Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography ClassificationLochana Mendis0https://orcid.org/0009-0003-6670-6719Debjyoti Karmakar1https://orcid.org/0009-0007-2301-1326Marimuthu Palaniswami2https://orcid.org/0000-0002-3635-4252Fiona Brownfoot3Emerson Keenan4https://orcid.org/0000-0003-1966-2293Department of Electrical and Electronic Engineering, The University of Melbourne, Parkville, VIC, AustraliaDepartment of Obstetrics and Gynaecology, Obstetric Diagnostics and Therapeutics Group, The University of Melbourne, Heidelberg, VIC, AustraliaDepartment of Electrical and Electronic Engineering, The University of Melbourne, Parkville, VIC, AustraliaDepartment of Obstetrics and Gynaecology, Obstetric Diagnostics and Therapeutics Group, The University of Melbourne, Heidelberg, VIC, AustraliaDepartment of Electrical and Electronic Engineering, The University of Melbourne, Parkville, VIC, AustraliaContinuous monitoring of fetal heart rate (FHR) and uterine contractions (UC), otherwise known as cardiotocography (CTG), is often used to assess the risk of fetal compromise during labor. However, interpreting CTG recordings visually is challenging for clinicians, given the complexity of CTG patterns, leading to poor sensitivity. Efforts to address this issue have focused on data-driven deep-learning methods to detect fetal compromise automatically. However, their progress is impeded by limited CTG training datasets and the absence of a standardized evaluation workflow, hindering algorithm comparisons. In this study, we use a private CTG dataset of 9,887 CTG recordings with pH measurements and 552 CTG recordings from the open-access CTU-UHB dataset to conduct a cross-database evaluation of six deep-learning models for fetal compromise detection. We explore the impact of input selection of FHR and UC signals, signal pre-processing, downsampling frequency, and the influence of removing intermediate pH samples from the training dataset. Our findings reveal that using only FHR and pre-processing FHR with artefact removal and interpolation provides a significant improvement to classification performance for some model architectures while excluding intermediate pH samples did not significantly improve performance for any model. From our comparison of the six models, ResNet exhibited the strongest fetal compromise classification performance across both databases at a downsampling rate of 1Hz. Finally, class activation maps from highly contributing signal regions in the ResNet model aligned with clinical knowledge of compromised FHR patterns, highlighting the model’s interpretability. These insights may serve as a standardized reference for developing and comparing future works in this domain. Clinical and Translational Impact: This study provides a standardized workflow for comparing deep-learning methods for CTG classification. Ensuring new methods show generalizability and interpretability will improve their robustness and applicability in clinical settings.https://ieeexplore.ieee.org/document/10912500/Cardiotocographydeep learningfetal compromisefetal heart ratetime-series classification
spellingShingle	Lochana Mendis Debjyoti Karmakar Marimuthu Palaniswami Fiona Brownfoot Emerson Keenan Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification IEEE Journal of Translational Engineering in Health and Medicine Cardiotocography deep learning fetal compromise fetal heart rate time-series classification
title	Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification
title_full	Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification
title_fullStr	Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification
title_full_unstemmed	Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification
title_short	Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification
title_sort	cross database evaluation of deep learning methods for intrapartum cardiotocography classification
topic	Cardiotocography deep learning fetal compromise fetal heart rate time-series classification
url	https://ieeexplore.ieee.org/document/10912500/
work_keys_str_mv	AT lochanamendis crossdatabaseevaluationofdeeplearningmethodsforintrapartumcardiotocographyclassification AT debjyotikarmakar crossdatabaseevaluationofdeeplearningmethodsforintrapartumcardiotocographyclassification AT marimuthupalaniswami crossdatabaseevaluationofdeeplearningmethodsforintrapartumcardiotocographyclassification AT fionabrownfoot crossdatabaseevaluationofdeeplearningmethodsforintrapartumcardiotocographyclassification AT emersonkeenan crossdatabaseevaluationofdeeplearningmethodsforintrapartumcardiotocographyclassification

Cross-Database Evaluation of Deep Learning Methods for Intrapartum Cardiotocography Classification

Similar Items