Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
Heart disease stands as a leading cause of morbidity and mortality globally, presenting a significant public health challenge. Therefore, early prediction and detection are critical, leading to timely and appropriate interventions at early stages. Four ensemble tree-based algorithms were used in thi...
Saved in:
| Main Authors: | , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Wiley
2025-01-01
|
| Series: | Applied Computational Intelligence and Soft Computing |
| Online Access: | http://dx.doi.org/10.1155/acis/1989813 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849730665333915648 |
|---|---|
| author | Enoch Sakyi-Yeboah Edmund Fosu Agyemang Vincent Agbenyeavu Akua Osei-Nkwantabisa Priscilla Kissi-Appiah Lateef Moshood Lawrence Agbota Ezekiel N. N. Nortey |
| author_facet | Enoch Sakyi-Yeboah Edmund Fosu Agyemang Vincent Agbenyeavu Akua Osei-Nkwantabisa Priscilla Kissi-Appiah Lateef Moshood Lawrence Agbota Ezekiel N. N. Nortey |
| author_sort | Enoch Sakyi-Yeboah |
| collection | DOAJ |
| description | Heart disease stands as a leading cause of morbidity and mortality globally, presenting a significant public health challenge. Therefore, early prediction and detection are critical, leading to timely and appropriate interventions at early stages. Four ensemble tree-based algorithms were used in this study: adaptive boosting, extreme gradient boosting, random forest, and extremely randomized trees, investigating their ability to predict heart disease. Data related to heart disease clinical features was obtained from the open Kaggle Machine Learning Dataset repository. Adaptive Boosting stands out as the highest performer, achieving an average testing accuracy of 93.70%, precision of 93.71%, recall of 93.70%, and F1 score of 93.69%, along with the highest AUC score of 0.9708, across all competing models considered in the study. These metrics indicate a superior ability to distinguish between patients with and without heart disease, effectively making it particularly valuable for clinical applications where early detection can save lives. The SHapley Additive exPlanations (SHAP) framework adopted to investigate the relative importance of the features in predicting heart disease revealed the most influential predictors (ST slope, chest pain type, old peak, and cholesterol), further aiding the understanding of heart disease mechanisms. Future work should explore the integration of ensemble learning algorithms with real-time patient monitoring systems. This integration could allow for continuous health status updates, equipping predictive models with the information necessary to facilitate dynamic, real-time interventions that are more closely aligned with patient needs. |
| format | Article |
| id | doaj-art-a5f8c5ca51334230baa2fd2adae0bcef |
| institution | DOAJ |
| issn | 1687-9732 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | Wiley |
| record_format | Article |
| series | Applied Computational Intelligence and Soft Computing |
| spelling | doaj-art-a5f8c5ca51334230baa2fd2adae0bcef2025-08-20T03:08:47ZengWileyApplied Computational Intelligence and Soft Computing1687-97322025-01-01202510.1155/acis/1989813Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning PerspectiveEnoch Sakyi-Yeboah0Edmund Fosu Agyemang1Vincent Agbenyeavu2Akua Osei-Nkwantabisa3Priscilla Kissi-Appiah4Lateef Moshood5Lawrence Agbota6Ezekiel N. N. Nortey7Department of Statistics and Actuarial ScienceDepartment of Statistics and Actuarial ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceSchool of Mathematical and Statistical ScienceDepartment of Statistics and Actuarial ScienceHeart disease stands as a leading cause of morbidity and mortality globally, presenting a significant public health challenge. Therefore, early prediction and detection are critical, leading to timely and appropriate interventions at early stages. Four ensemble tree-based algorithms were used in this study: adaptive boosting, extreme gradient boosting, random forest, and extremely randomized trees, investigating their ability to predict heart disease. Data related to heart disease clinical features was obtained from the open Kaggle Machine Learning Dataset repository. Adaptive Boosting stands out as the highest performer, achieving an average testing accuracy of 93.70%, precision of 93.71%, recall of 93.70%, and F1 score of 93.69%, along with the highest AUC score of 0.9708, across all competing models considered in the study. These metrics indicate a superior ability to distinguish between patients with and without heart disease, effectively making it particularly valuable for clinical applications where early detection can save lives. The SHapley Additive exPlanations (SHAP) framework adopted to investigate the relative importance of the features in predicting heart disease revealed the most influential predictors (ST slope, chest pain type, old peak, and cholesterol), further aiding the understanding of heart disease mechanisms. Future work should explore the integration of ensemble learning algorithms with real-time patient monitoring systems. This integration could allow for continuous health status updates, equipping predictive models with the information necessary to facilitate dynamic, real-time interventions that are more closely aligned with patient needs.http://dx.doi.org/10.1155/acis/1989813 |
| spellingShingle | Enoch Sakyi-Yeboah Edmund Fosu Agyemang Vincent Agbenyeavu Akua Osei-Nkwantabisa Priscilla Kissi-Appiah Lateef Moshood Lawrence Agbota Ezekiel N. N. Nortey Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective Applied Computational Intelligence and Soft Computing |
| title | Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective |
| title_full | Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective |
| title_fullStr | Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective |
| title_full_unstemmed | Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective |
| title_short | Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective |
| title_sort | heart disease prediction using ensemble tree algorithms a supervised learning perspective |
| url | http://dx.doi.org/10.1155/acis/1989813 |
| work_keys_str_mv | AT enochsakyiyeboah heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT edmundfosuagyemang heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT vincentagbenyeavu heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT akuaoseinkwantabisa heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT priscillakissiappiah heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT lateefmoshood heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT lawrenceagbota heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective AT ezekielnnnortey heartdiseasepredictionusingensembletreealgorithmsasupervisedlearningperspective |