Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study

Abstract Objectives The study aimed to compare the diagnostic efficacy of the machine learning models with expert subjective assessment (SA) in assessing the malignancy risk of ovarian tumors using transvaginal ultrasound (TVUS). Methods The retrospective single-center diagnostic study included 1555...

Full description

Saved in:
Bibliographic Details
Main Authors: Xin He, Xiang-Hui Bai, Hui Chen, Wei-Wei Feng
Format: Article
Language:English
Published: BMC 2024-11-01
Series:Journal of Ovarian Research
Subjects:
Online Access:https://doi.org/10.1186/s13048-024-01544-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850061954598567936
author Xin He
Xiang-Hui Bai
Hui Chen
Wei-Wei Feng
author_facet Xin He
Xiang-Hui Bai
Hui Chen
Wei-Wei Feng
author_sort Xin He
collection DOAJ
description Abstract Objectives The study aimed to compare the diagnostic efficacy of the machine learning models with expert subjective assessment (SA) in assessing the malignancy risk of ovarian tumors using transvaginal ultrasound (TVUS). Methods The retrospective single-center diagnostic study included 1555 consecutive patients from January 2019 to May 2021. Using this dataset, Residual Network(ResNet), Densely Connected Convolutional Network(DenseNet), Vision Transformer(ViT), and Swin Transformer models were established and evaluated separately or combined with Cancer antigen 125 (CA 125). The diagnostic performance was then compared with SA. Results Of the 1555 patients, 76.9% were benign, while 23.1% were malignant (including borderline). When differentiating the malignant from ovarian tumors, the SA had an AUC of 0.97 (95% CI, 0.93–0.99), sensitivity of 87.2%, and specificity of 98.4%. Except for Vision Transformer, other machine learning models had diagnostic performance comparable to that of the expert. The DenseNet model had an AUC of 0.91 (95% CI, 0.86–0.95), sensitivity of 84.6%, and specificity of 95.1%. The ResNet50 model had an AUC of 0.91 (0.85–0.95). The Swin Transformer model had an AUC of 0.92 (0.87–0.96), sensitivity of 87.2%, and specificity of 94.3%. There was a statistically significant difference between the Vision Transformer and SA, and between the Vision Transformer and Swin Transformer models (AUC: 0.87 vs. 0.97, P = 0.01; AUC: 0.87 vs. 0.92, P = 0.04). Adding CA125 did not improve the diagnostic performance of the models in distinguishing benign and malignant ovarian tumors. Conclusion The deep learning model of TVUS can be used in ovarian cancer evaluation, and its diagnostic performance is comparable to that of expert assessment.
format Article
id doaj-art-3fb817d6d8cb4569be726eb6a00f516c
institution DOAJ
issn 1757-2215
language English
publishDate 2024-11-01
publisher BMC
record_format Article
series Journal of Ovarian Research
spelling doaj-art-3fb817d6d8cb4569be726eb6a00f516c2025-08-20T02:50:04ZengBMCJournal of Ovarian Research1757-22152024-11-0117111210.1186/s13048-024-01544-8Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative studyXin He0Xiang-Hui Bai1Hui Chen2Wei-Wei Feng3Department of Obstetrics and Gynecology, Ruijin Hospital, Shanghai Jiaotong University School of MedicinePhilips Health Technology (China) Co., Ltd. Shanghai BranchDepartment of Obstetrics and Gynecology, Ruijin Hospital, Shanghai Jiaotong University School of MedicineDepartment of Obstetrics and Gynecology, Ruijin Hospital, Shanghai Jiaotong University School of MedicineAbstract Objectives The study aimed to compare the diagnostic efficacy of the machine learning models with expert subjective assessment (SA) in assessing the malignancy risk of ovarian tumors using transvaginal ultrasound (TVUS). Methods The retrospective single-center diagnostic study included 1555 consecutive patients from January 2019 to May 2021. Using this dataset, Residual Network(ResNet), Densely Connected Convolutional Network(DenseNet), Vision Transformer(ViT), and Swin Transformer models were established and evaluated separately or combined with Cancer antigen 125 (CA 125). The diagnostic performance was then compared with SA. Results Of the 1555 patients, 76.9% were benign, while 23.1% were malignant (including borderline). When differentiating the malignant from ovarian tumors, the SA had an AUC of 0.97 (95% CI, 0.93–0.99), sensitivity of 87.2%, and specificity of 98.4%. Except for Vision Transformer, other machine learning models had diagnostic performance comparable to that of the expert. The DenseNet model had an AUC of 0.91 (95% CI, 0.86–0.95), sensitivity of 84.6%, and specificity of 95.1%. The ResNet50 model had an AUC of 0.91 (0.85–0.95). The Swin Transformer model had an AUC of 0.92 (0.87–0.96), sensitivity of 87.2%, and specificity of 94.3%. There was a statistically significant difference between the Vision Transformer and SA, and between the Vision Transformer and Swin Transformer models (AUC: 0.87 vs. 0.97, P = 0.01; AUC: 0.87 vs. 0.92, P = 0.04). Adding CA125 did not improve the diagnostic performance of the models in distinguishing benign and malignant ovarian tumors. Conclusion The deep learning model of TVUS can be used in ovarian cancer evaluation, and its diagnostic performance is comparable to that of expert assessment.https://doi.org/10.1186/s13048-024-01544-8Ovarian cancerUltrasoundMachine learningDiagnostic models
spellingShingle Xin He
Xiang-Hui Bai
Hui Chen
Wei-Wei Feng
Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study
Journal of Ovarian Research
Ovarian cancer
Ultrasound
Machine learning
Diagnostic models
title Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study
title_full Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study
title_fullStr Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study
title_full_unstemmed Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study
title_short Machine learning models in evaluating the malignancy risk of ovarian tumors: a comparative study
title_sort machine learning models in evaluating the malignancy risk of ovarian tumors a comparative study
topic Ovarian cancer
Ultrasound
Machine learning
Diagnostic models
url https://doi.org/10.1186/s13048-024-01544-8
work_keys_str_mv AT xinhe machinelearningmodelsinevaluatingthemalignancyriskofovariantumorsacomparativestudy
AT xianghuibai machinelearningmodelsinevaluatingthemalignancyriskofovariantumorsacomparativestudy
AT huichen machinelearningmodelsinevaluatingthemalignancyriskofovariantumorsacomparativestudy
AT weiweifeng machinelearningmodelsinevaluatingthemalignancyriskofovariantumorsacomparativestudy