Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
Objective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 p...
Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
BMJ Publishing Group
2025-02-01
|
| Series: | BMJ Open |
| Online Access: | https://bmjopen.bmj.com/content/15/2/e087427.full |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850079184668327936 |
|---|---|
| author | Ying Zhou Fuyuan Li Zhan Wang Zhanjin Wang Ruiling Bian Zhangtuo Xue Junjie Cai |
| author_facet | Ying Zhou Fuyuan Li Zhan Wang Zhanjin Wang Ruiling Bian Zhangtuo Xue Junjie Cai |
| author_sort | Ying Zhou |
| collection | DOAJ |
| description | Objective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 patients with acute pancreatitis complicated by septicaemia from the US Intensive Care Database.Methods From the MIMIC database, data of patients with acute pancreatitis and sepsis were obtained to construct machine learning models, which were internally and externally validated. The Boruta algorithm was used to select variables. Then, eight machine learning algorithms were used to construct prediction models for acute kidney injury (AKI) occurrence in intensive care unit (ICU) patients. A new stacked ensemble model was developed using the Stacking ensemble method. Model evaluation was performed using area under the receiver operating characteristic curve (AUC), precision-recall (PR) curve, accuracy, recall and F1 score. The Shapley additive explanation (SHAP) method was used to explain the models.Main outcome measures AKI in patients with acute pancreatitis complicated by sepsis.Results The final study included 1295 patients with acute pancreatitis complicated by sepsis, among whom 893 cases (68.9%) developed acute kidney injury. We established eight base models, including Logit, SVM, CatBoost, RF, XGBoost, LightGBM, AdaBoost and MLP, as well as a stacked ensemble model called Multimodel. Among all models, Multimodel had an AUC value of 0.853 (95% CI: 0.792 to 0.896) in the internal validation dataset and 0.802 (95% CI: 0.732 to 0.861) in the external validation dataset. This model demonstrated the best predictive performance in terms of discrimination and clinical application.Conclusion The stack ensemble model developed by us achieved AUC values of 0.853 and 0.802 in internal and external validation cohorts respectively and also demonstrated excellent performance in other metrics. It serves as a reliable tool for predicting AKI in patients with acute pancreatitis complicated by sepsis. |
| format | Article |
| id | doaj-art-ab8bc2c42a2e450bbb32ef304263239f |
| institution | DOAJ |
| issn | 2044-6055 |
| language | English |
| publishDate | 2025-02-01 |
| publisher | BMJ Publishing Group |
| record_format | Article |
| series | BMJ Open |
| spelling | doaj-art-ab8bc2c42a2e450bbb32ef304263239f2025-08-20T02:45:19ZengBMJ Publishing GroupBMJ Open2044-60552025-02-0115210.1136/bmjopen-2024-087427Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC databaseYing Zhou0Fuyuan Li1Zhan Wang2Zhanjin Wang3Ruiling Bian4Zhangtuo Xue5Junjie Cai63 Qinghai University Affiliated Hospital, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China4 Department of Hepatopancreatobiliary Surgery, the Affiliated Hospital of Qinghai University, Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China2 Medical School of Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, ChinaObjective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 patients with acute pancreatitis complicated by septicaemia from the US Intensive Care Database.Methods From the MIMIC database, data of patients with acute pancreatitis and sepsis were obtained to construct machine learning models, which were internally and externally validated. The Boruta algorithm was used to select variables. Then, eight machine learning algorithms were used to construct prediction models for acute kidney injury (AKI) occurrence in intensive care unit (ICU) patients. A new stacked ensemble model was developed using the Stacking ensemble method. Model evaluation was performed using area under the receiver operating characteristic curve (AUC), precision-recall (PR) curve, accuracy, recall and F1 score. The Shapley additive explanation (SHAP) method was used to explain the models.Main outcome measures AKI in patients with acute pancreatitis complicated by sepsis.Results The final study included 1295 patients with acute pancreatitis complicated by sepsis, among whom 893 cases (68.9%) developed acute kidney injury. We established eight base models, including Logit, SVM, CatBoost, RF, XGBoost, LightGBM, AdaBoost and MLP, as well as a stacked ensemble model called Multimodel. Among all models, Multimodel had an AUC value of 0.853 (95% CI: 0.792 to 0.896) in the internal validation dataset and 0.802 (95% CI: 0.732 to 0.861) in the external validation dataset. This model demonstrated the best predictive performance in terms of discrimination and clinical application.Conclusion The stack ensemble model developed by us achieved AUC values of 0.853 and 0.802 in internal and external validation cohorts respectively and also demonstrated excellent performance in other metrics. It serves as a reliable tool for predicting AKI in patients with acute pancreatitis complicated by sepsis.https://bmjopen.bmj.com/content/15/2/e087427.full |
| spellingShingle | Ying Zhou Fuyuan Li Zhan Wang Zhanjin Wang Ruiling Bian Zhangtuo Xue Junjie Cai Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database BMJ Open |
| title | Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database |
| title_full | Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database |
| title_fullStr | Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database |
| title_full_unstemmed | Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database |
| title_short | Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database |
| title_sort | predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model a retrospective study based on the mimic database |
| url | https://bmjopen.bmj.com/content/15/2/e087427.full |
| work_keys_str_mv | AT yingzhou predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT fuyuanli predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT zhanwang predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT zhanjinwang predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT ruilingbian predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT zhangtuoxue predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT junjiecai predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase |