Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models

PurposeThis study aimed to develop three types of machine learning (ML) models based on gradient boosting decision tree (GBDT), random forest (RF), and extreme gradient boosting (XGBoost) to explore their predictive value for the stone-free rate after percutaneous nephrolithotomy (PCNL).Patients and...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhao Rong Liu, Zhan Jiang Yu, Jie Zhou, Jian Biao Huang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-08-01
Series:Frontiers in Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmed.2025.1559613/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849338993756340224
author Zhao Rong Liu
Zhao Rong Liu
Zhan Jiang Yu
Jie Zhou
Jian Biao Huang
author_facet Zhao Rong Liu
Zhao Rong Liu
Zhan Jiang Yu
Jie Zhou
Jian Biao Huang
author_sort Zhao Rong Liu
collection DOAJ
description PurposeThis study aimed to develop three types of machine learning (ML) models based on gradient boosting decision tree (GBDT), random forest (RF), and extreme gradient boosting (XGBoost) to explore their predictive value for the stone-free rate after percutaneous nephrolithotomy (PCNL).Patients and methodsA retrospective analysis was conducted on 160 patients who underwent PCNL. The patients were randomly divided into a training set and a test set in a 7:3 ratio. Clinical data were collected, and univariate analysis was performed to identify important data significantly associated with the stone-free rate after PCNL. Three ML models (GBDT, RF, and XGBoost) were developed using the training set. The predictive performance of these models was evaluated using the area under the curve (AUC) of the receiver operating characteristic (ROC) on the test set, confusion matrix, specificity, sensitivity, accuracy, and F1 score. For the top-performing prediction model, the study further employed the SHapley Additive exPlanations (SHAP) method to enhance model interpretability by elucidating the contribution of individual features to the prediction outcomes and ranking the relative importance of the predictive data. Finally, the clinical utility of the model was assessed through decision curve analysis (DCA), which quantified the net clinical benefit of applying the model across various risk thresholds.ResultsPostoperative statistics indicated a stone-free rate of 70.6% (n = 113) among the patients. The data significantly associated with the absence of residual stones included the number of stones, stone diameter, stone CT value, history of previous stone surgery, stone location, and stone shape (p < 0.05). All three models demonstrated strong predictive effects in the validation set, with the GBDT model showing superior performance [AUC: 0.836 (95% CI: 0.785–0.873); accuracy: 0.854; sensitivity: 0.853; specificity: 0.857] compared to the XGBoost [AUC: 0.830 (95% CI: 0.792–0.868); accuracy: 0.771; sensitivity: 0.824; specificity: 0.643] and RF models [AUC: 0.803 (95% CI: 0.763–0.837); accuracy: 0.792; sensitivity: 0.824; specificity: 0.714]. The F1 scores for the GBDT, RF, and XGBoost models were 0.892, 0.836, and 0.849, respectively. The DCA decision curve analysis confirmed that the GBDT model offers a favorable net clinical benefit. In addition, the SHAP analysis identified the number of stones and the stone CT value as the most critical features influencing the model’s predictions, contributing significantly to its overall predictive performance.ConclusionThe prediction models developed based on three machine learning algorithms can accurately predict the stone-free rate after PCNL in patients with urinary tract stones. Among these, the GBDT model can effectively identify patients who are most likely to achieve successful outcomes from PCNL based on demographic and stone characteristics, thereby assisting in clinical treatment decision-making.
format Article
id doaj-art-7c3b9aabd2d5405ab8ddffbb90803764
institution Kabale University
issn 2296-858X
language English
publishDate 2025-08-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Medicine
spelling doaj-art-7c3b9aabd2d5405ab8ddffbb908037642025-08-20T03:44:14ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2025-08-011210.3389/fmed.2025.15596131559613Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning modelsZhao Rong Liu0Zhao Rong Liu1Zhan Jiang Yu2Jie Zhou3Jian Biao Huang4Jiangxi Cancer Hospital & Institute, Jiangxi Clinical Research Center for Cancer, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, ChinaDepartment of Urology, Yudu County People’s Hospital, Yudu, ChinaThe Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, ChinaThe Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, ChinaJiangxi Cancer Hospital & Institute, Jiangxi Clinical Research Center for Cancer, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, ChinaPurposeThis study aimed to develop three types of machine learning (ML) models based on gradient boosting decision tree (GBDT), random forest (RF), and extreme gradient boosting (XGBoost) to explore their predictive value for the stone-free rate after percutaneous nephrolithotomy (PCNL).Patients and methodsA retrospective analysis was conducted on 160 patients who underwent PCNL. The patients were randomly divided into a training set and a test set in a 7:3 ratio. Clinical data were collected, and univariate analysis was performed to identify important data significantly associated with the stone-free rate after PCNL. Three ML models (GBDT, RF, and XGBoost) were developed using the training set. The predictive performance of these models was evaluated using the area under the curve (AUC) of the receiver operating characteristic (ROC) on the test set, confusion matrix, specificity, sensitivity, accuracy, and F1 score. For the top-performing prediction model, the study further employed the SHapley Additive exPlanations (SHAP) method to enhance model interpretability by elucidating the contribution of individual features to the prediction outcomes and ranking the relative importance of the predictive data. Finally, the clinical utility of the model was assessed through decision curve analysis (DCA), which quantified the net clinical benefit of applying the model across various risk thresholds.ResultsPostoperative statistics indicated a stone-free rate of 70.6% (n = 113) among the patients. The data significantly associated with the absence of residual stones included the number of stones, stone diameter, stone CT value, history of previous stone surgery, stone location, and stone shape (p < 0.05). All three models demonstrated strong predictive effects in the validation set, with the GBDT model showing superior performance [AUC: 0.836 (95% CI: 0.785–0.873); accuracy: 0.854; sensitivity: 0.853; specificity: 0.857] compared to the XGBoost [AUC: 0.830 (95% CI: 0.792–0.868); accuracy: 0.771; sensitivity: 0.824; specificity: 0.643] and RF models [AUC: 0.803 (95% CI: 0.763–0.837); accuracy: 0.792; sensitivity: 0.824; specificity: 0.714]. The F1 scores for the GBDT, RF, and XGBoost models were 0.892, 0.836, and 0.849, respectively. The DCA decision curve analysis confirmed that the GBDT model offers a favorable net clinical benefit. In addition, the SHAP analysis identified the number of stones and the stone CT value as the most critical features influencing the model’s predictions, contributing significantly to its overall predictive performance.ConclusionThe prediction models developed based on three machine learning algorithms can accurately predict the stone-free rate after PCNL in patients with urinary tract stones. Among these, the GBDT model can effectively identify patients who are most likely to achieve successful outcomes from PCNL based on demographic and stone characteristics, thereby assisting in clinical treatment decision-making.https://www.frontiersin.org/articles/10.3389/fmed.2025.1559613/fullurinary tract stonespercutaneous nephrolithotomymachine learningstone-free ratepredict
spellingShingle Zhao Rong Liu
Zhao Rong Liu
Zhan Jiang Yu
Jie Zhou
Jian Biao Huang
Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models
Frontiers in Medicine
urinary tract stones
percutaneous nephrolithotomy
machine learning
stone-free rate
predict
title Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models
title_full Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models
title_fullStr Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models
title_full_unstemmed Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models
title_short Predictive value of the stone-free rate after percutaneous nephrolithotomy based on multiple machine learning models
title_sort predictive value of the stone free rate after percutaneous nephrolithotomy based on multiple machine learning models
topic urinary tract stones
percutaneous nephrolithotomy
machine learning
stone-free rate
predict
url https://www.frontiersin.org/articles/10.3389/fmed.2025.1559613/full
work_keys_str_mv AT zhaorongliu predictivevalueofthestonefreerateafterpercutaneousnephrolithotomybasedonmultiplemachinelearningmodels
AT zhaorongliu predictivevalueofthestonefreerateafterpercutaneousnephrolithotomybasedonmultiplemachinelearningmodels
AT zhanjiangyu predictivevalueofthestonefreerateafterpercutaneousnephrolithotomybasedonmultiplemachinelearningmodels
AT jiezhou predictivevalueofthestonefreerateafterpercutaneousnephrolithotomybasedonmultiplemachinelearningmodels
AT jianbiaohuang predictivevalueofthestonefreerateafterpercutaneousnephrolithotomybasedonmultiplemachinelearningmodels