Empowering Diagnostics: An Ensemble Machine Learning Model for Early Liver Disease Detection

Bibliographic Details
Main Authors: Abdulrahman Ahmed Jasim, Hajer Alwindawi, Layth Rafea Hazim
Format: Article
Language: English
Published: Al-Iraqia University - College of Engineering 2025-06-01
Series: Al-Iraqia Journal for Scientific Engineering Research
Online Access: https://ijser.aliraqia.edu.iq/index.php/ijser/article/view/314
Description
Summary: Early and accurate detection of liver disease is critical to improving patient outcomes, yet it remains challenging because clinical data are noisy and class-imbalanced. In this study, we present a robust ensemble learning framework applied to the Indian Liver Patient Dataset. Preprocessing combines systematic data cleaning and normalization, which handle missing values and outliers, with the Synthetic Minority Over-sampling Technique (SMOTE), which corrects class skew. We then perform correlation-based feature reduction before training a stacking classifier that combines Random Forest, XGBoost, and ExtraTrees base learners with an ExtraTrees meta-learner. Using stratified 10-fold cross-validation on the balanced cohort (n = 792), our ensemble achieves 91.6% accuracy, a 92% F1-score, and a high area under the ROC curve, outperforming individual models and prior published approaches. These results demonstrate the potential of heterogeneous ensembles for clinical decision support in hepatology and lay the groundwork for prospective validation in diverse patient populations.
ISSN: 2710-2165
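
The summary names SMOTE as the step that balances the classes. The core idea can be illustrated with a minimal sketch that uses only the Python standard library. This is not the authors' implementation: the function name `smote`, its parameters (`n_new`, `k`, `seed`), and the toy data are hypothetical, and in practice a maintained library such as imbalanced-learn would normally be used instead. Each synthetic sample is placed at a random position on the line segment between a minority-class sample and one of its `k` nearest minority-class neighbours.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic samples interpolated between minority neighbours.

    Illustrative sketch only; names and defaults are assumptions.
    """
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    if k < 1:
        raise ValueError("need at least two minority samples")

    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority samples to `base` (excluding `base` itself),
        # ranked by Euclidean distance
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other, in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Toy minority class with two features per sample (made-up values)
minority = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
new_points = smote(minority, n_new=6, k=2)
print(len(new_points))
```

Because every synthetic point is a convex combination of two real minority samples, it always lies within the per-feature range of the original minority class.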