An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection

Introduction: High-dimensional datasets often contain an abundance of features, many of which are irrelevant to the subject of interest. This issue is compounded by the frequently low number of samples and imbalanced class samples. These factors can negatively impact the performance of classificatio...

Full description

Saved in:
Bibliographic Details
Main Authors: Jamshid Pirgazi, Mohammad Mehdi Pourhashem Kallehbasti, Ali Ghanbari Sorkhi, Ali Kermani
Format: Article
Language:English
Published: Tabriz University of Medical Sciences 2024-10-01
Series:BioImpacts
Subjects:
Online Access:https://bi.tbzmed.ac.ir/PDF/bi-15-30340.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850034193024679936
author Jamshid Pirgazi
Mohammad Mehdi Pourhashem Kallehbasti
Ali Ghanbari Sorkhi
Ali Kermani
author_facet Jamshid Pirgazi
Mohammad Mehdi Pourhashem Kallehbasti
Ali Ghanbari Sorkhi
Ali Kermani
author_sort Jamshid Pirgazi
collection DOAJ
description Introduction: High-dimensional datasets often contain an abundance of features, many of which are irrelevant to the subject of interest. This issue is compounded by the frequently low number of samples and imbalanced class samples. These factors can negatively impact the performance of classification algorithms, necessitating feature selection before classification. The primary objective of feature selection algorithms is to identify a minimal subset of features that enables accurate classification. Methods: In this paper, we propose a two-stage hybrid method for the optimal selection of relevant features. In the first stage, a filter method is employed to assign weights to the features, facilitating the removal of redundant and irrelevant features and reducing the computational cost of classification algorithms. A subset of high-weight features is retained for further processing in the second stage. In this stage, an enhanced Harris Hawks Optimization algorithm and GRASP, augmented with crossover and mutation operators from genetic algorithms, are utilized based on the weights calculated in the first stage to identify the optimal feature set. Results: Experimental results demonstrate that the proposed algorithm successfully identifies the optimal subset of features. Conclusion: The two-stage hybrid method effectively selects the optimal subset of features, improving the performance of classification algorithms on high-dimensional datasets. This approach addresses the challenges posed by the abundance of features, low number of samples, and imbalanced class samples, demonstrating its potential for application in various fields.
format Article
id doaj-art-1d16aebe94e142d7b8aa49bf1618fb94
institution DOAJ
issn 2228-5652
2228-5660
language English
publishDate 2024-10-01
publisher Tabriz University of Medical Sciences
record_format Article
series BioImpacts
spelling doaj-art-1d16aebe94e142d7b8aa49bf1618fb942025-08-20T02:57:54ZengTabriz University of Medical SciencesBioImpacts2228-56522228-56602024-10-01151303403034010.34172/bi.30340bi-30340An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selectionJamshid Pirgazi0Mohammad Mehdi Pourhashem Kallehbasti1Ali Ghanbari Sorkhi2Ali Kermani3Department of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, IranDepartment of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, IranDepartment of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, IranDepartment of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, Behshahr, IranIntroduction: High-dimensional datasets often contain an abundance of features, many of which are irrelevant to the subject of interest. This issue is compounded by the frequently low number of samples and imbalanced class samples. These factors can negatively impact the performance of classification algorithms, necessitating feature selection before classification. The primary objective of feature selection algorithms is to identify a minimal subset of features that enables accurate classification. Methods: In this paper, we propose a two-stage hybrid method for the optimal selection of relevant features. In the first stage, a filter method is employed to assign weights to the features, facilitating the removal of redundant and irrelevant features and reducing the computational cost of classification algorithms. A subset of high-weight features is retained for further processing in the second stage. In this stage, an enhanced Harris Hawks Optimization algorithm and GRASP, augmented with crossover and mutation operators from genetic algorithms, are utilized based on the weights calculated in the first stage to identify the optimal feature set. Results: Experimental results demonstrate that the proposed algorithm successfully identifies the optimal subset of features. Conclusion: The two-stage hybrid method effectively selects the optimal subset of features, improving the performance of classification algorithms on high-dimensional datasets. This approach addresses the challenges posed by the abundance of features, low number of samples, and imbalanced class samples, demonstrating its potential for application in various fields.https://bi.tbzmed.ac.ir/PDF/bi-15-30340.pdffeature selectionhigh-dimensional dataharris hawks optimizationglobal search
spellingShingle Jamshid Pirgazi
Mohammad Mehdi Pourhashem Kallehbasti
Ali Ghanbari Sorkhi
Ali Kermani
An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection
BioImpacts
feature selection
high-dimensional data
harris hawks optimization
global search
title An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection
title_full An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection
title_fullStr An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection
title_full_unstemmed An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection
title_short An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection
title_sort efficient hybrid filter wrapper method based on improved harris hawks optimization for feature selection
topic feature selection
high-dimensional data
harris hawks optimization
global search
url https://bi.tbzmed.ac.ir/PDF/bi-15-30340.pdf
work_keys_str_mv AT jamshidpirgazi anefficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT mohammadmehdipourhashemkallehbasti anefficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT alighanbarisorkhi anefficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT alikermani anefficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT jamshidpirgazi efficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT mohammadmehdipourhashemkallehbasti efficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT alighanbarisorkhi efficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection
AT alikermani efficienthybridfilterwrappermethodbasedonimprovedharrishawksoptimizationforfeatureselection