Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database
Abstract Objectives This study aimed to develop and validate an explainable machine learning (ML) model to predict 28-day all-cause mortality in immunocompromised patients admitted to the intensive care unit (ICU). Accurate and interpretable mortality prediction is crucial for clinical decision-maki...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
BMC
2025-05-01
|
| Series: | European Journal of Medical Research |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s40001-025-02622-3 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850042801766531072 |
|---|---|
| author | Zhengqiu Yu Lexin Fang Yueping Ding |
| author_facet | Zhengqiu Yu Lexin Fang Yueping Ding |
| author_sort | Zhengqiu Yu |
| collection | DOAJ |
| description | Abstract Objectives This study aimed to develop and validate an explainable machine learning (ML) model to predict 28-day all-cause mortality in immunocompromised patients admitted to the intensive care unit (ICU). Accurate and interpretable mortality prediction is crucial for clinical decision-making and optimal allocation of critical care resources for this vulnerable patient population. Methods We utilized retrospective clinical data from the MIMIC-IV (version 2.2) database, encompassing ICU admissions at Beth Israel Deaconess Medical Center from 2008 to 2019. Eligible immunocompromised patients, including those with primary immunodeficiencies and chronic acquired conditions, such as hematological malignancies, solid tumors, and organ transplantation, were selected. Data were randomly split into training (80%) and testing (20%) cohorts. Ten ML models (logistic regression, XGBoost, LightGBM, AdaBoost, Random Forest, Gradient Boosting, Gaussian Naive Bayes, Complement Naive Bayes, Multilayer Perceptron, and Support Vector Machine) were developed and evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision–recall curve (AUPRC), sensitivity, specificity, accuracy, and F1 score. Model explainability was achieved through SHapley Additive exPlanations (SHAP), and decision curve analysis (DCA) assessed clinical utility. In addition, Cox proportional hazards regression was conducted to evaluate the impact of predictive factors on time-to-event outcomes. Results Among the evaluated models, the Support Vector Machine (SVM) demonstrated the highest AUROC of 0.863 (95% CI 0.834–0.890) and a notable AUPRC of 0.678 (95% CI 0.624–0.736). Key predictive factors consistently identified across multiple ML models included 24-h urine output, blood urea nitrogen (BUN) levels, presence of metastatic solid tumors, Charlson Comorbidity Index (CCI), and international normalized ratio (INR). SHAP analyses provided detailed insights into how these features influenced model predictions. Conclusions The explainable ML models based on various artificial intelligence methods demonstrated promising clinical applicability in predicting 28-day mortality risk among immunocompromised ICU patients. Factors such as urine output, BUN, metastatic solid tumors, CCI, and INR significantly contributed to prediction outcomes and may serve as important predictors in clinical practice. |
| format | Article |
| id | doaj-art-cb7225138ecc419ba65f3c87f18329f6 |
| institution | DOAJ |
| issn | 2047-783X |
| language | English |
| publishDate | 2025-05-01 |
| publisher | BMC |
| record_format | Article |
| series | European Journal of Medical Research |
| spelling | doaj-art-cb7225138ecc419ba65f3c87f18329f62025-08-20T02:55:25ZengBMCEuropean Journal of Medical Research2047-783X2025-05-0130111510.1186/s40001-025-02622-3Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV databaseZhengqiu Yu0Lexin Fang1Yueping Ding2School of Medicine, Xiamen UniversityDepartment of Critical Care Medicine, The Second Affiliated Hospital of Zhejiang Chinese Medical UniversityDepartment of Critical Care Medicine, The Second Affiliated Hospital of Zhejiang Chinese Medical UniversityAbstract Objectives This study aimed to develop and validate an explainable machine learning (ML) model to predict 28-day all-cause mortality in immunocompromised patients admitted to the intensive care unit (ICU). Accurate and interpretable mortality prediction is crucial for clinical decision-making and optimal allocation of critical care resources for this vulnerable patient population. Methods We utilized retrospective clinical data from the MIMIC-IV (version 2.2) database, encompassing ICU admissions at Beth Israel Deaconess Medical Center from 2008 to 2019. Eligible immunocompromised patients, including those with primary immunodeficiencies and chronic acquired conditions, such as hematological malignancies, solid tumors, and organ transplantation, were selected. Data were randomly split into training (80%) and testing (20%) cohorts. Ten ML models (logistic regression, XGBoost, LightGBM, AdaBoost, Random Forest, Gradient Boosting, Gaussian Naive Bayes, Complement Naive Bayes, Multilayer Perceptron, and Support Vector Machine) were developed and evaluated using area under the receiver operating characteristic curve (AUROC), area under the precision–recall curve (AUPRC), sensitivity, specificity, accuracy, and F1 score. Model explainability was achieved through SHapley Additive exPlanations (SHAP), and decision curve analysis (DCA) assessed clinical utility. In addition, Cox proportional hazards regression was conducted to evaluate the impact of predictive factors on time-to-event outcomes. Results Among the evaluated models, the Support Vector Machine (SVM) demonstrated the highest AUROC of 0.863 (95% CI 0.834–0.890) and a notable AUPRC of 0.678 (95% CI 0.624–0.736). Key predictive factors consistently identified across multiple ML models included 24-h urine output, blood urea nitrogen (BUN) levels, presence of metastatic solid tumors, Charlson Comorbidity Index (CCI), and international normalized ratio (INR). SHAP analyses provided detailed insights into how these features influenced model predictions. Conclusions The explainable ML models based on various artificial intelligence methods demonstrated promising clinical applicability in predicting 28-day mortality risk among immunocompromised ICU patients. Factors such as urine output, BUN, metastatic solid tumors, CCI, and INR significantly contributed to prediction outcomes and may serve as important predictors in clinical practice.https://doi.org/10.1186/s40001-025-02622-3Mortality predictionImmunocompromisedIntensive care unitMachine learningMIMIC-IVSHAP value |
| spellingShingle | Zhengqiu Yu Lexin Fang Yueping Ding Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database European Journal of Medical Research Mortality prediction Immunocompromised Intensive care unit Machine learning MIMIC-IV SHAP value |
| title | Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database |
| title_full | Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database |
| title_fullStr | Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database |
| title_full_unstemmed | Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database |
| title_short | Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database |
| title_sort | explainable machine learning model for prediction of 28 day all cause mortality in immunocompromised patients in the intensive care unit a retrospective cohort study based on mimic iv database |
| topic | Mortality prediction Immunocompromised Intensive care unit Machine learning MIMIC-IV SHAP value |
| url | https://doi.org/10.1186/s40001-025-02622-3 |
| work_keys_str_mv | AT zhengqiuyu explainablemachinelearningmodelforpredictionof28dayallcausemortalityinimmunocompromisedpatientsintheintensivecareunitaretrospectivecohortstudybasedonmimicivdatabase AT lexinfang explainablemachinelearningmodelforpredictionof28dayallcausemortalityinimmunocompromisedpatientsintheintensivecareunitaretrospectivecohortstudybasedonmimicivdatabase AT yuepingding explainablemachinelearningmodelforpredictionof28dayallcausemortalityinimmunocompromisedpatientsintheintensivecareunitaretrospectivecohortstudybasedonmimicivdatabase |