Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database

Objective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 p...

Full description

Saved in:
Bibliographic Details
Main Authors: Ying Zhou, Fuyuan Li, Zhan Wang, Zhanjin Wang, Ruiling Bian, Zhangtuo Xue, Junjie Cai
Format: Article
Language:English
Published: BMJ Publishing Group 2025-02-01
Series:BMJ Open
Online Access:https://bmjopen.bmj.com/content/15/2/e087427.full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850079184668327936
author Ying Zhou
Fuyuan Li
Zhan Wang
Zhanjin Wang
Ruiling Bian
Zhangtuo Xue
Junjie Cai
author_facet Ying Zhou
Fuyuan Li
Zhan Wang
Zhanjin Wang
Ruiling Bian
Zhangtuo Xue
Junjie Cai
author_sort Ying Zhou
collection DOAJ
description Objective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 patients with acute pancreatitis complicated by septicaemia from the US Intensive Care Database.Methods From the MIMIC database, data of patients with acute pancreatitis and sepsis were obtained to construct machine learning models, which were internally and externally validated. The Boruta algorithm was used to select variables. Then, eight machine learning algorithms were used to construct prediction models for acute kidney injury (AKI) occurrence in intensive care unit (ICU) patients. A new stacked ensemble model was developed using the Stacking ensemble method. Model evaluation was performed using area under the receiver operating characteristic curve (AUC), precision-recall (PR) curve, accuracy, recall and F1 score. The Shapley additive explanation (SHAP) method was used to explain the models.Main outcome measures AKI in patients with acute pancreatitis complicated by sepsis.Results The final study included 1295 patients with acute pancreatitis complicated by sepsis, among whom 893 cases (68.9%) developed acute kidney injury. We established eight base models, including Logit, SVM, CatBoost, RF, XGBoost, LightGBM, AdaBoost and MLP, as well as a stacked ensemble model called Multimodel. Among all models, Multimodel had an AUC value of 0.853 (95% CI: 0.792 to 0.896) in the internal validation dataset and 0.802 (95% CI: 0.732 to 0.861) in the external validation dataset. This model demonstrated the best predictive performance in terms of discrimination and clinical application.Conclusion The stack ensemble model developed by us achieved AUC values of 0.853 and 0.802 in internal and external validation cohorts respectively and also demonstrated excellent performance in other metrics. It serves as a reliable tool for predicting AKI in patients with acute pancreatitis complicated by sepsis.
format Article
id doaj-art-ab8bc2c42a2e450bbb32ef304263239f
institution DOAJ
issn 2044-6055
language English
publishDate 2025-02-01
publisher BMJ Publishing Group
record_format Article
series BMJ Open
spelling doaj-art-ab8bc2c42a2e450bbb32ef304263239f2025-08-20T02:45:19ZengBMJ Publishing GroupBMJ Open2044-60552025-02-0115210.1136/bmjopen-2024-087427Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC databaseYing Zhou0Fuyuan Li1Zhan Wang2Zhanjin Wang3Ruiling Bian4Zhangtuo Xue5Junjie Cai63 Qinghai University Affiliated Hospital, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China4 Department of Hepatopancreatobiliary Surgery, the Affiliated Hospital of Qinghai University, Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China2 Medical School of Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, ChinaObjective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 patients with acute pancreatitis complicated by septicaemia from the US Intensive Care Database.Methods From the MIMIC database, data of patients with acute pancreatitis and sepsis were obtained to construct machine learning models, which were internally and externally validated. The Boruta algorithm was used to select variables. Then, eight machine learning algorithms were used to construct prediction models for acute kidney injury (AKI) occurrence in intensive care unit (ICU) patients. A new stacked ensemble model was developed using the Stacking ensemble method. Model evaluation was performed using area under the receiver operating characteristic curve (AUC), precision-recall (PR) curve, accuracy, recall and F1 score. The Shapley additive explanation (SHAP) method was used to explain the models.Main outcome measures AKI in patients with acute pancreatitis complicated by sepsis.Results The final study included 1295 patients with acute pancreatitis complicated by sepsis, among whom 893 cases (68.9%) developed acute kidney injury. We established eight base models, including Logit, SVM, CatBoost, RF, XGBoost, LightGBM, AdaBoost and MLP, as well as a stacked ensemble model called Multimodel. Among all models, Multimodel had an AUC value of 0.853 (95% CI: 0.792 to 0.896) in the internal validation dataset and 0.802 (95% CI: 0.732 to 0.861) in the external validation dataset. This model demonstrated the best predictive performance in terms of discrimination and clinical application.Conclusion The stack ensemble model developed by us achieved AUC values of 0.853 and 0.802 in internal and external validation cohorts respectively and also demonstrated excellent performance in other metrics. It serves as a reliable tool for predicting AKI in patients with acute pancreatitis complicated by sepsis.https://bmjopen.bmj.com/content/15/2/e087427.full
spellingShingle Ying Zhou
Fuyuan Li
Zhan Wang
Zhanjin Wang
Ruiling Bian
Zhangtuo Xue
Junjie Cai
Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
BMJ Open
title Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_full Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_fullStr Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_full_unstemmed Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_short Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_sort predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model a retrospective study based on the mimic database
url https://bmjopen.bmj.com/content/15/2/e087427.full
work_keys_str_mv AT yingzhou predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase
AT fuyuanli predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase
AT zhanwang predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase
AT zhanjinwang predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase
AT ruilingbian predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase
AT zhangtuoxue predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase
AT junjiecai predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase