Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection

Heart disease or cardiovascular disease is one of the leading causes of death in the world. Based on WHO data, in 2019, as many as 17.9 million people died from cardiovascular disease. If early prevention is not carried out immediately, of course, the victims will increase every year. Therefore, wit...

Full description

Saved in:
Bibliographic Details
Main Authors: Hirmayanti, Ema Utami
Format: Article
Language:English
Published: Ikatan Ahli Informatika Indonesia 2025-04-01
Series:Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Subjects:
Online Access:https://jurnal.iaii.or.id/index.php/RESTI/article/view/6175
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849711563740545024
author Hirmayanti
Ema Utami
author_facet Hirmayanti
Ema Utami
author_sort Hirmayanti
collection DOAJ
description Heart disease or cardiovascular disease is one of the leading causes of death in the world. Based on WHO data, in 2019, as many as 17.9 million people died from cardiovascular disease. If early prevention is not carried out immediately, of course, the victims will increase every year. Therefore, with the increasingly rapid development of technology, especially in the health sector, it is hoped that it can help medical personnel in treating patients suffering from various diseases, especially heart disease. So in this study, it will be more focused on the selection of relevant features or attributes to increase the accuracy value of the Machine Learning algorithm. The algorithms used include Random Forest and SVM. Meanwhile, for feature selection, several feature selection techniques are used, including information gain (IG), Chi-square (Chi2) and correlation feature selection (CFS). The use of these three techniques aims to obtain the main features so that they can minimize irrelevant features that can slow down the machine process. Based on the results of the experiment with a comparison of 70:30, it shows that CFS-SVM is superior by using nine features, which obtain the highest accuracy of 92.19%, while CFS-RF obtains the best value with eight features of 91.88%. By using feature selection and hyperparameter techniques, SVM obtained an increase of 10.88%, and RF obtained an increase of 9.47%. Based on the performance of the model using the selected relevant features, it shows that the proposed CFS-SVM shows good and efficient performance in diagnosing heart disease.
format Article
id doaj-art-998efc2bd4d448bf8af4f6ca72dda756
institution DOAJ
issn 2580-0760
language English
publishDate 2025-04-01
publisher Ikatan Ahli Informatika Indonesia
record_format Article
series Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
spelling doaj-art-998efc2bd4d448bf8af4f6ca72dda7562025-08-20T03:14:35ZengIkatan Ahli Informatika IndonesiaJurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)2580-07602025-04-019238539210.29207/resti.v9i2.61756175Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature SelectionHirmayanti0Ema Utami1Universitas Amikom YogyakartaUniversitas Amikom YogyakartaHeart disease or cardiovascular disease is one of the leading causes of death in the world. Based on WHO data, in 2019, as many as 17.9 million people died from cardiovascular disease. If early prevention is not carried out immediately, of course, the victims will increase every year. Therefore, with the increasingly rapid development of technology, especially in the health sector, it is hoped that it can help medical personnel in treating patients suffering from various diseases, especially heart disease. So in this study, it will be more focused on the selection of relevant features or attributes to increase the accuracy value of the Machine Learning algorithm. The algorithms used include Random Forest and SVM. Meanwhile, for feature selection, several feature selection techniques are used, including information gain (IG), Chi-square (Chi2) and correlation feature selection (CFS). The use of these three techniques aims to obtain the main features so that they can minimize irrelevant features that can slow down the machine process. Based on the results of the experiment with a comparison of 70:30, it shows that CFS-SVM is superior by using nine features, which obtain the highest accuracy of 92.19%, while CFS-RF obtains the best value with eight features of 91.88%. By using feature selection and hyperparameter techniques, SVM obtained an increase of 10.88%, and RF obtained an increase of 9.47%. Based on the performance of the model using the selected relevant features, it shows that the proposed CFS-SVM shows good and efficient performance in diagnosing heart disease.https://jurnal.iaii.or.id/index.php/RESTI/article/view/6175heart diseasefeature selectionrandom foresthyperparameter
spellingShingle Hirmayanti
Ema Utami
Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
heart disease
feature selection
random forest
hyperparameter
title Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection
title_full Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection
title_fullStr Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection
title_full_unstemmed Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection
title_short Enhanced Heart Disease Diagnosis Using Machine Learning Algorithms: A Comparison of Feature Selection
title_sort enhanced heart disease diagnosis using machine learning algorithms a comparison of feature selection
topic heart disease
feature selection
random forest
hyperparameter
url https://jurnal.iaii.or.id/index.php/RESTI/article/view/6175
work_keys_str_mv AT hirmayanti enhancedheartdiseasediagnosisusingmachinelearningalgorithmsacomparisonoffeatureselection
AT emautami enhancedheartdiseasediagnosisusingmachinelearningalgorithmsacomparisonoffeatureselection