Random forest-driven mortality prediction in critical IBD care: a dual-database model integrating comorbidity patterns and real-time physiometrics

BackgroundInflammatory bowel disease (IBD) poses significant mortality risks for critically ill patients requiring intensive care unit (ICU) admission, driven by complications such as malnutrition, thromboembolism, and multi-organ dysfunction. Current prognostic tools for mortality prediction in thi...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhenze Zhang, Caiqing Zhao, Yijun Zhou, Ling Yao, Peng Liu, Ziling Fang, Nian Fang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-08-01
Series:Frontiers in Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmed.2025.1624899/full
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:BackgroundInflammatory bowel disease (IBD) poses significant mortality risks for critically ill patients requiring intensive care unit (ICU) admission, driven by complications such as malnutrition, thromboembolism, and multi-organ dysfunction. Current prognostic tools for mortality prediction in this population remain limited. Machine learning (ML) offers advantages in handling complex clinical data but has not been systematically applied to this high-risk cohort. This multicenter study aimed to develop and validate ML-based models for mortality risk stratification in critically ill IBD patients using large-scale ICU databases.MethodsData from 551 IBD patients in the MIMIC-IV database (2008–2019) were analyzed, with external validation using the eICU dataset. Nine ML algorithms (XGBoost, logistic regression, LightGBM, random forest, decision tree, elastic net, MLP, KNN, RSVM) were trained to predict 1-year mortality. Predictors included demographics, comorbidities, laboratory parameters, vital signs, and disease severity scores. Missing data (<30%) were imputed using random forest. The cohort was split into training (75%) and internal testing (25%) sets, with hyperparameter optimization via 5-fold cross-validation. Model performance was evaluated using AUC, sensitivity, specificity, and calibration curves. The SHAP framework was integrated with predictive analytics to systematically evaluate key determinants of mortality risk through quantitative feature importance analysis. A nomogram was constructed based on key predictors identified through logistic regression.ResultsThe random forest model achieved superior discrimination in internal validation (AUC > 0.8). Nine predictors were identified: malignancy history, Charlson Comorbidity Index (CCI), Red Cell Distribution Width (Rdw), Glasgow Coma Scale (GCS), Sequential Organ Failure Assessment (Sofa), age, heart rate, weight and gender. The nomogram demonstrated robust external validation performance in the eICU cohort (AUC > 0.8).ConclusionWe developed and validated a machine learning-based nomogram to predict mortality in critically ill IBD patients, integrating interpretable predictors from multicenter ICU data.
ISSN:2296-858X