The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis

Abstract Introduction There is currently no satisfactory model for predicting malignant transformation of endometriosis. The aim of this study was to construct and evaluate a risk model incorporating noninvasive clinical parameters to predict endometriosis‐associated ovarian cancer (EAOC) in patient...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaopei Chao, Shu Wang, Jinghe Lang, Jinhua Leng, Qingbo Fan
Format: Article
Language:English
Published: Wiley 2022-12-01
Series:Acta Obstetricia et Gynecologica Scandinavica
Subjects:
Online Access:https://doi.org/10.1111/aogs.14462
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850115262684069888
author Xiaopei Chao
Shu Wang
Jinghe Lang
Jinhua Leng
Qingbo Fan
author_facet Xiaopei Chao
Shu Wang
Jinghe Lang
Jinhua Leng
Qingbo Fan
author_sort Xiaopei Chao
collection DOAJ
description Abstract Introduction There is currently no satisfactory model for predicting malignant transformation of endometriosis. The aim of this study was to construct and evaluate a risk model incorporating noninvasive clinical parameters to predict endometriosis‐associated ovarian cancer (EAOC) in patients with endometriosis. Material and Methods We enrolled 6809 patients with endometriosis confirmed by pathology, and randomly allocated them to training (n = 4766) and testing cohorts (n = 2043). The proportion of patients with EAOC in each cohort was similar. We extracted a total of 94 demographic and clinicopathologic features from the medical records using natural language processing. We used a machine learning method – gradient‐boosting decision tree – to construct a predictive model for EAOC and to evaluate the accuracy of the model. We also constructed a multivariate logistic regression model inclusive of the EAOC‐associated risk factors using a back stepwise procedure. Then we compared the performance of the two risk‐predicting models using DeLong's test. Results The occurrence of EAOC was 1.84% in this study. The logistic regression model comprised 10 selected features and demonstrated good discrimination in the testing cohort, with an area under the curve (AUC) of 0.891 (95% confidence interval [CI] 0.821–0.960), sensitivity of 88.9%, and specificity of 76.7%. The risk model based on machine learning had an AUC of 0.942 (95% CI 0.914–0.969), sensitivity of 86.8%, and specificity of 86.7%. The machine learning‐based risk model performed better than the logistic regression model in DeLong's test (p = 0.036). Furthermore, in a prospective dataset, the machine learning‐based risk model had an AUC of 0.8758, a sensitivity of 94.4%, and a specificity of 73.8%. Conclusions The machine learning‐based risk model was constructed to predict EAOC and had high sensitivity and specificity. This model could be of considerable use in helping reduce medical costs and designing follow‐up schedules.
format Article
id doaj-art-2ee64311e5a54973a43341da8cdb2e1a
institution OA Journals
issn 0001-6349
1600-0412
language English
publishDate 2022-12-01
publisher Wiley
record_format Article
series Acta Obstetricia et Gynecologica Scandinavica
spelling doaj-art-2ee64311e5a54973a43341da8cdb2e1a2025-08-20T02:36:38ZengWileyActa Obstetricia et Gynecologica Scandinavica0001-63491600-04122022-12-01101121440144910.1111/aogs.14462The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosisXiaopei Chao0Shu Wang1Jinghe Lang2Jinhua Leng3Qingbo Fan4Department of Obstetrics and Gynecology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences (CAMS) & Peking Union Medical College Beijing ChinaDepartment of Obstetrics and Gynecology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences (CAMS) & Peking Union Medical College Beijing ChinaDepartment of Obstetrics and Gynecology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences (CAMS) & Peking Union Medical College Beijing ChinaDepartment of Obstetrics and Gynecology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences (CAMS) & Peking Union Medical College Beijing ChinaDepartment of Obstetrics and Gynecology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences (CAMS) & Peking Union Medical College Beijing ChinaAbstract Introduction There is currently no satisfactory model for predicting malignant transformation of endometriosis. The aim of this study was to construct and evaluate a risk model incorporating noninvasive clinical parameters to predict endometriosis‐associated ovarian cancer (EAOC) in patients with endometriosis. Material and Methods We enrolled 6809 patients with endometriosis confirmed by pathology, and randomly allocated them to training (n = 4766) and testing cohorts (n = 2043). The proportion of patients with EAOC in each cohort was similar. We extracted a total of 94 demographic and clinicopathologic features from the medical records using natural language processing. We used a machine learning method – gradient‐boosting decision tree – to construct a predictive model for EAOC and to evaluate the accuracy of the model. We also constructed a multivariate logistic regression model inclusive of the EAOC‐associated risk factors using a back stepwise procedure. Then we compared the performance of the two risk‐predicting models using DeLong's test. Results The occurrence of EAOC was 1.84% in this study. The logistic regression model comprised 10 selected features and demonstrated good discrimination in the testing cohort, with an area under the curve (AUC) of 0.891 (95% confidence interval [CI] 0.821–0.960), sensitivity of 88.9%, and specificity of 76.7%. The risk model based on machine learning had an AUC of 0.942 (95% CI 0.914–0.969), sensitivity of 86.8%, and specificity of 86.7%. The machine learning‐based risk model performed better than the logistic regression model in DeLong's test (p = 0.036). Furthermore, in a prospective dataset, the machine learning‐based risk model had an AUC of 0.8758, a sensitivity of 94.4%, and a specificity of 73.8%. Conclusions The machine learning‐based risk model was constructed to predict EAOC and had high sensitivity and specificity. This model could be of considerable use in helping reduce medical costs and designing follow‐up schedules.https://doi.org/10.1111/aogs.14462endometriosismachine learningmalignant transformationovarian cancerrisk model
spellingShingle Xiaopei Chao
Shu Wang
Jinghe Lang
Jinhua Leng
Qingbo Fan
The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis
Acta Obstetricia et Gynecologica Scandinavica
endometriosis
machine learning
malignant transformation
ovarian cancer
risk model
title The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis
title_full The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis
title_fullStr The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis
title_full_unstemmed The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis
title_short The application of risk models based on machine learning to predict endometriosis‐associated ovarian cancer in patients with endometriosis
title_sort application of risk models based on machine learning to predict endometriosis associated ovarian cancer in patients with endometriosis
topic endometriosis
machine learning
malignant transformation
ovarian cancer
risk model
url https://doi.org/10.1111/aogs.14462
work_keys_str_mv AT xiaopeichao theapplicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT shuwang theapplicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT jinghelang theapplicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT jinhualeng theapplicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT qingbofan theapplicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT xiaopeichao applicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT shuwang applicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT jinghelang applicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT jinhualeng applicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis
AT qingbofan applicationofriskmodelsbasedonmachinelearningtopredictendometriosisassociatedovariancancerinpatientswithendometriosis