Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis

Sepsis remains a major global health concern, causing high mortality rates, prolonged hospital stays, and substantial economic burdens. The accurate prediction of clinical outcomes, such as mortality and length of stay (LOS), is critical for optimizing hospital resource allocation and improving pati...

Full description

Saved in:
Bibliographic Details
Main Authors: Houssem Ben Khalfallah, Mariem Jelassi, Jacques Demongeot, Narjès Bellamine Ben Saoud
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Computation
Subjects:
Online Access:https://www.mdpi.com/2079-3197/13/1/8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588792059396096
author Houssem Ben Khalfallah
Mariem Jelassi
Jacques Demongeot
Narjès Bellamine Ben Saoud
author_facet Houssem Ben Khalfallah
Mariem Jelassi
Jacques Demongeot
Narjès Bellamine Ben Saoud
author_sort Houssem Ben Khalfallah
collection DOAJ
description Sepsis remains a major global health concern, causing high mortality rates, prolonged hospital stays, and substantial economic burdens. The accurate prediction of clinical outcomes, such as mortality and length of stay (LOS), is critical for optimizing hospital resource allocation and improving patient management. The present study investigates the potential of machine learning (ML) models to predict these outcomes using a dataset of 1492 sepsis patients with clinical, physiological, and demographic features. After rigorous preprocessing to address missing data and ensure consistency, multiple classifiers, including Random Forest, Extra Trees, and Gradient Boosting, were trained and validated. The results demonstrate that Random Forest and Extra Trees achieve high accuracy for LOS prediction, while Gradient Boosting and Bernoulli Naïve Bayes effectively predict mortality. Feature importance analysis identified ICU stay duration (ICU_DAYS_OBS) as the most influential predictor for both outcomes, alongside vital signs, white blood cell counts, and lactic acid levels. These findings highlight the potential of ML-driven clinical decision support systems (CDSSs) to enhance early risk assessment, optimize ICU resource planning, and support timely interventions. Future research should refine predictive features, integrate advanced biomarkers, and validate models across larger and more diverse datasets to improve scalability and clinical impact.
format Article
id doaj-art-b3790a65b816456c8ba54424d9080af5
institution Kabale University
issn 2079-3197
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Computation
spelling doaj-art-b3790a65b816456c8ba54424d9080af52025-01-24T13:27:47ZengMDPI AGComputation2079-31972025-01-01131810.3390/computation13010008Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in SepsisHoussem Ben Khalfallah0Mariem Jelassi1Jacques Demongeot2Narjès Bellamine Ben Saoud3RIADI Laboratory, Ecole Nationale des Sciences de l’Informatique, Manouba University, La Manouba 2010, TunisiaRIADI Laboratory, Ecole Nationale des Sciences de l’Informatique, Manouba University, La Manouba 2010, TunisiaAGEIS Laboratory, University Grenoble Alpes, 38700 La Tronche, FranceRIADI Laboratory, Ecole Nationale des Sciences de l’Informatique, Manouba University, La Manouba 2010, TunisiaSepsis remains a major global health concern, causing high mortality rates, prolonged hospital stays, and substantial economic burdens. The accurate prediction of clinical outcomes, such as mortality and length of stay (LOS), is critical for optimizing hospital resource allocation and improving patient management. The present study investigates the potential of machine learning (ML) models to predict these outcomes using a dataset of 1492 sepsis patients with clinical, physiological, and demographic features. After rigorous preprocessing to address missing data and ensure consistency, multiple classifiers, including Random Forest, Extra Trees, and Gradient Boosting, were trained and validated. The results demonstrate that Random Forest and Extra Trees achieve high accuracy for LOS prediction, while Gradient Boosting and Bernoulli Naïve Bayes effectively predict mortality. Feature importance analysis identified ICU stay duration (ICU_DAYS_OBS) as the most influential predictor for both outcomes, alongside vital signs, white blood cell counts, and lactic acid levels. These findings highlight the potential of ML-driven clinical decision support systems (CDSSs) to enhance early risk assessment, optimize ICU resource planning, and support timely interventions. Future research should refine predictive features, integrate advanced biomarkers, and validate models across larger and more diverse datasets to improve scalability and clinical impact.https://www.mdpi.com/2079-3197/13/1/8sepsismachine learninglength of stay (LOS)mortality predictionclinical decision support systems (CDSSs)intensive care unit (ICU)
spellingShingle Houssem Ben Khalfallah
Mariem Jelassi
Jacques Demongeot
Narjès Bellamine Ben Saoud
Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis
Computation
sepsis
machine learning
length of stay (LOS)
mortality prediction
clinical decision support systems (CDSSs)
intensive care unit (ICU)
title Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis
title_full Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis
title_fullStr Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis
title_full_unstemmed Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis
title_short Advancements in Predictive Analytics: Machine Learning Approaches to Estimating Length of Stay and Mortality in Sepsis
title_sort advancements in predictive analytics machine learning approaches to estimating length of stay and mortality in sepsis
topic sepsis
machine learning
length of stay (LOS)
mortality prediction
clinical decision support systems (CDSSs)
intensive care unit (ICU)
url https://www.mdpi.com/2079-3197/13/1/8
work_keys_str_mv AT houssembenkhalfallah advancementsinpredictiveanalyticsmachinelearningapproachestoestimatinglengthofstayandmortalityinsepsis
AT mariemjelassi advancementsinpredictiveanalyticsmachinelearningapproachestoestimatinglengthofstayandmortalityinsepsis
AT jacquesdemongeot advancementsinpredictiveanalyticsmachinelearningapproachestoestimatinglengthofstayandmortalityinsepsis
AT narjesbellaminebensaoud advancementsinpredictiveanalyticsmachinelearningapproachestoestimatinglengthofstayandmortalityinsepsis