Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective

Heart disease stands as a leading cause of morbidity and mortality globally, presenting a significant public health challenge. Therefore, early prediction and detection are critical, leading to timely and appropriate interventions at early stages. Four ensemble tree-based algorithms were used in thi...

Full description

Saved in:
Bibliographic Details
Main Authors: Enoch Sakyi-Yeboah, Edmund Fosu Agyemang, Vincent Agbenyeavu, Akua Osei-Nkwantabisa, Priscilla Kissi-Appiah, Lateef Moshood, Lawrence Agbota, Ezekiel N. N. Nortey
Format: Article
Language:English
Published: Wiley 2025-01-01
Series:Applied Computational Intelligence and Soft Computing
Online Access:http://dx.doi.org/10.1155/acis/1989813
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849730665333915648
author Enoch Sakyi-Yeboah
Edmund Fosu Agyemang
Vincent Agbenyeavu
Akua Osei-Nkwantabisa
Priscilla Kissi-Appiah
Lateef Moshood
Lawrence Agbota
Ezekiel N. N. Nortey
author_facet Enoch Sakyi-Yeboah
Edmund Fosu Agyemang
Vincent Agbenyeavu
Akua Osei-Nkwantabisa
Priscilla Kissi-Appiah
Lateef Moshood
Lawrence Agbota
Ezekiel N. N. Nortey
author_sort Enoch Sakyi-Yeboah
collection DOAJ
description Heart disease stands as a leading cause of morbidity and mortality globally, presenting a significant public health challenge. Therefore, early prediction and detection are critical, leading to timely and appropriate interventions at early stages. Four ensemble tree-based algorithms were used in this study: adaptive boosting, extreme gradient boosting, random forest, and extremely randomized trees, investigating their ability to predict heart disease. Data related to heart disease clinical features was obtained from the open Kaggle Machine Learning Dataset repository. Adaptive Boosting stands out as the highest performer, achieving an average testing accuracy of 93.70%, precision of 93.71%, recall of 93.70%, and F1 score of 93.69%, along with the highest AUC score of 0.9708, across all competing models considered in the study. These metrics indicate a superior ability to distinguish between patients with and without heart disease, effectively making it particularly valuable for clinical applications where early detection can save lives. The SHapley Additive exPlanations (SHAP) framework adopted to investigate the relative importance of the features in predicting heart disease revealed the most influential predictors (ST slope, chest pain type, old peak, and cholesterol), further aiding the understanding of heart disease mechanisms. Future work should explore the integration of ensemble learning algorithms with real-time patient monitoring systems. This integration could allow for continuous health status updates, equipping predictive models with the information necessary to facilitate dynamic, real-time interventions that are more closely aligned with patient needs.
format Article
id doaj-art-a5f8c5ca51334230baa2fd2adae0bcef
institution DOAJ
issn 1687-9732
language English
publishDate 2025-01-01
publisher Wiley
record_format Article
series Applied Computational Intelligence and Soft Computing
spelling doaj-art-a5f8c5ca51334230baa2fd2adae0bcef2025-08-20T03:08:47ZengWileyApplied Computational Intelligence and Soft Computing1687-97322025-01-01202510.1155/acis/1989813Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning PerspectiveEnoch Sakyi-Yeboah0Edmund Fosu Agyemang1Vincent Agbenyeavu2Akua Osei-Nkwantabisa3Priscilla Kissi-Appiah4Lateef Moshood5Lawrence Agbota6Ezekiel N. N. Nortey7Department of Statistics and Actuarial ScienceDepartment of Statistics and Actuarial ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceDepartment of Statistics and Actuarial ScienceHeart disease stands as a leading cause of morbidity and mortality globally, presenting a significant public health challenge. Therefore, early prediction and detection are critical, leading to timely and appropriate interventions at early stages. Four ensemble tree-based algorithms were used in this study: adaptive boosting, extreme gradient boosting, random forest, and extremely randomized trees, investigating their ability to predict heart disease. Data related to heart disease clinical features was obtained from the open Kaggle Machine Learning Dataset repository. Adaptive Boosting stands out as the highest performer, achieving an average testing accuracy of 93.70%, precision of 93.71%, recall of 93.70%, and F1 score of 93.69%, along with the highest AUC score of 0.9708, across all competing models considered in the study. These metrics indicate a superior ability to distinguish between patients with and without heart disease, effectively making it particularly valuable for clinical applications where early detection can save lives. The SHapley Additive exPlanations (SHAP) framework adopted to investigate the relative importance of the features in predicting heart disease revealed the most influential predictors (ST slope, chest pain type, old peak, and cholesterol), further aiding the understanding of heart disease mechanisms. Future work should explore the integration of ensemble learning algorithms with real-time patient monitoring systems. This integration could allow for continuous health status updates, equipping predictive models with the information necessary to facilitate dynamic, real-time interventions that are more closely aligned with patient needs.http://dx.doi.org/10.1155/acis/1989813
spellingShingle Enoch Sakyi-Yeboah
Edmund Fosu Agyemang
Vincent Agbenyeavu
Akua Osei-Nkwantabisa
Priscilla Kissi-Appiah
Lateef Moshood
Lawrence Agbota
Ezekiel N. N. Nortey
Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
Applied Computational Intelligence and Soft Computing
title Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
title_full Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
title_fullStr Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
title_full_unstemmed Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
title_short Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
title_sort heart disease prediction using ensemble tree algorithms a supervised learning perspective
url http://dx.doi.org/10.1155/acis/1989813
work_keys_str_mv AT enochsakyiyeboah heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT edmundfosuagyemang heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT vincentagbenyeavu heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT akuaoseinkwantabisa heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT priscillakissiappiah heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT lateefmoshood heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT lawrenceagbota heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective
AT ezekielnnnortey heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective