Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database

Objective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 p...

Full description

Saved in:

Bibliographic Details
Main Authors:	Ying Zhou, Fuyuan Li, Zhan Wang, Zhanjin Wang, Ruiling Bian, Zhangtuo Xue, Junjie Cai
Format:	Article
Language:	English
Published:	BMJ Publishing Group 2025-02-01
Series:	BMJ Open
Online Access:	https://bmjopen.bmj.com/content/15/2/e087427.full
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850079184668327936
author	Ying Zhou Fuyuan Li Zhan Wang Zhanjin Wang Ruiling Bian Zhangtuo Xue Junjie Cai
author_facet	Ying Zhou Fuyuan Li Zhan Wang Zhanjin Wang Ruiling Bian Zhangtuo Xue Junjie Cai
author_sort	Ying Zhou
collection	DOAJ
description	Objective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 patients with acute pancreatitis complicated by septicaemia from the US Intensive Care Database.Methods From the MIMIC database, data of patients with acute pancreatitis and sepsis were obtained to construct machine learning models, which were internally and externally validated. The Boruta algorithm was used to select variables. Then, eight machine learning algorithms were used to construct prediction models for acute kidney injury (AKI) occurrence in intensive care unit (ICU) patients. A new stacked ensemble model was developed using the Stacking ensemble method. Model evaluation was performed using area under the receiver operating characteristic curve (AUC), precision-recall (PR) curve, accuracy, recall and F1 score. The Shapley additive explanation (SHAP) method was used to explain the models.Main outcome measures AKI in patients with acute pancreatitis complicated by sepsis.Results The final study included 1295 patients with acute pancreatitis complicated by sepsis, among whom 893 cases (68.9%) developed acute kidney injury. We established eight base models, including Logit, SVM, CatBoost, RF, XGBoost, LightGBM, AdaBoost and MLP, as well as a stacked ensemble model called Multimodel. Among all models, Multimodel had an AUC value of 0.853 (95% CI: 0.792 to 0.896) in the internal validation dataset and 0.802 (95% CI: 0.732 to 0.861) in the external validation dataset. This model demonstrated the best predictive performance in terms of discrimination and clinical application.Conclusion The stack ensemble model developed by us achieved AUC values of 0.853 and 0.802 in internal and external validation cohorts respectively and also demonstrated excellent performance in other metrics. It serves as a reliable tool for predicting AKI in patients with acute pancreatitis complicated by sepsis.
format	Article
id	doaj-art-ab8bc2c42a2e450bbb32ef304263239f
institution	DOAJ
issn	2044-6055
language	English
publishDate	2025-02-01
publisher	BMJ Publishing Group
record_format	Article
series	BMJ Open
spelling	doaj-art-ab8bc2c42a2e450bbb32ef304263239f2025-08-20T02:45:19ZengBMJ Publishing GroupBMJ Open2044-60552025-02-0115210.1136/bmjopen-2024-087427Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC databaseYing Zhou0Fuyuan Li1Zhan Wang2Zhanjin Wang3Ruiling Bian4Zhangtuo Xue5Junjie Cai63 Qinghai University Affiliated Hospital, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China4 Department of Hepatopancreatobiliary Surgery, the Affiliated Hospital of Qinghai University, Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China2 Medical School of Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, China1 Clinical Medical College of Qinghai University, Xining, Qinghai, ChinaObjective This study developed and validated a stacked ensemble machine learning model to predict the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis.Design A retrospective study based on patient data from public databases.Participants This study analysed 1295 patients with acute pancreatitis complicated by septicaemia from the US Intensive Care Database.Methods From the MIMIC database, data of patients with acute pancreatitis and sepsis were obtained to construct machine learning models, which were internally and externally validated. The Boruta algorithm was used to select variables. Then, eight machine learning algorithms were used to construct prediction models for acute kidney injury (AKI) occurrence in intensive care unit (ICU) patients. A new stacked ensemble model was developed using the Stacking ensemble method. Model evaluation was performed using area under the receiver operating characteristic curve (AUC), precision-recall (PR) curve, accuracy, recall and F1 score. The Shapley additive explanation (SHAP) method was used to explain the models.Main outcome measures AKI in patients with acute pancreatitis complicated by sepsis.Results The final study included 1295 patients with acute pancreatitis complicated by sepsis, among whom 893 cases (68.9%) developed acute kidney injury. We established eight base models, including Logit, SVM, CatBoost, RF, XGBoost, LightGBM, AdaBoost and MLP, as well as a stacked ensemble model called Multimodel. Among all models, Multimodel had an AUC value of 0.853 (95% CI: 0.792 to 0.896) in the internal validation dataset and 0.802 (95% CI: 0.732 to 0.861) in the external validation dataset. This model demonstrated the best predictive performance in terms of discrimination and clinical application.Conclusion The stack ensemble model developed by us achieved AUC values of 0.853 and 0.802 in internal and external validation cohorts respectively and also demonstrated excellent performance in other metrics. It serves as a reliable tool for predicting AKI in patients with acute pancreatitis complicated by sepsis.https://bmjopen.bmj.com/content/15/2/e087427.full
spellingShingle	Ying Zhou Fuyuan Li Zhan Wang Zhanjin Wang Ruiling Bian Zhangtuo Xue Junjie Cai Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database BMJ Open
title	Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_full	Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_fullStr	Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_full_unstemmed	Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_short	Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database
title_sort	predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model a retrospective study based on the mimic database
url	https://bmjopen.bmj.com/content/15/2/e087427.full
work_keys_str_mv	AT yingzhou predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT fuyuanli predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT zhanwang predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT zhanjinwang predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT ruilingbian predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT zhangtuoxue predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase AT junjiecai predictingtheriskofacutekidneyinjuryinpatientswithacutepancreatitiscomplicatedbysepsisusingastackedensemblemachinelearningmodelaretrospectivestudybasedonthemimicdatabase

Predicting the risk of acute kidney injury in patients with acute pancreatitis complicated by sepsis using a stacked ensemble machine learning model: a retrospective study based on the MIMIC database

Similar Items