A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application
BackgroundOvarian cancer (OC) remains a highly lethal gynecological malignancy, often diagnosed at advanced stages with a poor prognosis. Lymph node involvement is a critical prognostic factor and significantly influences treatment planning. However, accurately predicting lymph node positivity remai...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Frontiers Media S.A.
2025-07-01
|
| Series: | Frontiers in Oncology |
| Subjects: | |
| Online Access: | https://www.frontiersin.org/articles/10.3389/fonc.2025.1527674/full |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849423273580822528 |
|---|---|
| author | QingYong Guo Jinji Wang Ru Chen LiPing Hu Wenqiang You |
| author_facet | QingYong Guo Jinji Wang Ru Chen LiPing Hu Wenqiang You |
| author_sort | QingYong Guo |
| collection | DOAJ |
| description | BackgroundOvarian cancer (OC) remains a highly lethal gynecological malignancy, often diagnosed at advanced stages with a poor prognosis. Lymph node involvement is a critical prognostic factor and significantly influences treatment planning. However, accurately predicting lymph node positivity remains challenging due to the disease’s heterogeneity and the limitations of traditional models in handling high-dimensional and imbalanced data.MethodsA retrospective analysis was conducted using the SEER database (2000–2021), including 26,844 OC patients with complete clinical information. We developed a machine learning model incorporating multiple algorithms, with XGBoost demonstrating superior performance. SMOTE was used to address class imbalance, and LASSO regression aided in selecting key predictors such as tumor size, histology, chemotherapy, and surgery. Model performance was assessed via accuracy, sensitivity, specificity, F1 score, and AUC, with external validation performed using an independent cohort from Fujian Provincial Maternity and Children’s Hospital.ResultsThe XGBoost model achieved an AUC of 0.98 (95% CI: 0.975–0.986) in the training set and 0.847 (95% CI: 0.823–0.871) in external validation. The model demonstrated high sensitivity and robust performance in identifying lymph node-positive cases. Tumor size ≥5 cm, histological subtype, and chemotherapy were key predictive features, with SHAP analysis identifying tumor size as the most influential factor.ConclusionWe present the first machine learning model specifically developed for predicting lymph node positivity in OC, validated across large, diverse cohorts. To facilitate clinical translation, we developed a free, user-friendly online calculator, which allows clinicians to quickly estimate lymph node positivity risk using patient-specific clinical parameters. This tool can be accessed at http://127.0.0.1:6818 and serves as a practical, evidence-based aid to support individualized treatment decisions and potentially improve patient outcomes. Future studies should integrate molecular data and broaden external validation to enhance generalizability. |
| format | Article |
| id | doaj-art-ec53f966895e44bca0d456c12df64d34 |
| institution | Kabale University |
| issn | 2234-943X |
| language | English |
| publishDate | 2025-07-01 |
| publisher | Frontiers Media S.A. |
| record_format | Article |
| series | Frontiers in Oncology |
| spelling | doaj-art-ec53f966895e44bca0d456c12df64d342025-08-20T03:30:40ZengFrontiers Media S.A.Frontiers in Oncology2234-943X2025-07-011510.3389/fonc.2025.15276741527674A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical applicationQingYong Guo0Jinji Wang1Ru Chen2LiPing Hu3Wenqiang You4Obstetrics & Gynecology, Fujian Maternity and Child Health Hospital College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, ChinaMedical Record Statistics, Fujian Maternity and Child Health Hospital College of Clinical Medicine for Medical Record Statistics, Fujian Medical University, Fuzhou, Fujian, ChinaObstetrics & Gynecology, Fujian Maternity and Child Health Hospital College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, ChinaObstetrics & Gynecology, Fujian Maternity and Child Health Hospital College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, ChinaObstetrics & Gynecology, Fujian Maternity and Child Health Hospital College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, ChinaBackgroundOvarian cancer (OC) remains a highly lethal gynecological malignancy, often diagnosed at advanced stages with a poor prognosis. Lymph node involvement is a critical prognostic factor and significantly influences treatment planning. However, accurately predicting lymph node positivity remains challenging due to the disease’s heterogeneity and the limitations of traditional models in handling high-dimensional and imbalanced data.MethodsA retrospective analysis was conducted using the SEER database (2000–2021), including 26,844 OC patients with complete clinical information. We developed a machine learning model incorporating multiple algorithms, with XGBoost demonstrating superior performance. SMOTE was used to address class imbalance, and LASSO regression aided in selecting key predictors such as tumor size, histology, chemotherapy, and surgery. Model performance was assessed via accuracy, sensitivity, specificity, F1 score, and AUC, with external validation performed using an independent cohort from Fujian Provincial Maternity and Children’s Hospital.ResultsThe XGBoost model achieved an AUC of 0.98 (95% CI: 0.975–0.986) in the training set and 0.847 (95% CI: 0.823–0.871) in external validation. The model demonstrated high sensitivity and robust performance in identifying lymph node-positive cases. Tumor size ≥5 cm, histological subtype, and chemotherapy were key predictive features, with SHAP analysis identifying tumor size as the most influential factor.ConclusionWe present the first machine learning model specifically developed for predicting lymph node positivity in OC, validated across large, diverse cohorts. To facilitate clinical translation, we developed a free, user-friendly online calculator, which allows clinicians to quickly estimate lymph node positivity risk using patient-specific clinical parameters. This tool can be accessed at http://127.0.0.1:6818 and serves as a practical, evidence-based aid to support individualized treatment decisions and potentially improve patient outcomes. Future studies should integrate molecular data and broaden external validation to enhance generalizability.https://www.frontiersin.org/articles/10.3389/fonc.2025.1527674/fullovarian cancerlymph node positivitymachine learningXGBoostprognosis |
| spellingShingle | QingYong Guo Jinji Wang Ru Chen LiPing Hu Wenqiang You A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application Frontiers in Oncology ovarian cancer lymph node positivity machine learning XGBoost prognosis |
| title | A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application |
| title_full | A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application |
| title_fullStr | A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application |
| title_full_unstemmed | A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application |
| title_short | A machine learning model for predicting lymph node positivity in ovarian cancer: development, validation, and clinical application |
| title_sort | machine learning model for predicting lymph node positivity in ovarian cancer development validation and clinical application |
| topic | ovarian cancer lymph node positivity machine learning XGBoost prognosis |
| url | https://www.frontiersin.org/articles/10.3389/fonc.2025.1527674/full |
| work_keys_str_mv | AT qingyongguo amachinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT jinjiwang amachinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT ruchen amachinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT lipinghu amachinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT wenqiangyou amachinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT qingyongguo machinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT jinjiwang machinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT ruchen machinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT lipinghu machinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication AT wenqiangyou machinelearningmodelforpredictinglymphnodepositivityinovariancancerdevelopmentvalidationandclinicalapplication |