MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics

In bioinformatics, medical diagnosis models might be significantly impacted by high-dimensional data generated by high-throughput technologies. This data includes redundant or irrelevant genes, making it challenging to identify the relevant genes from such high-dimensional data. Therefore, an effect...

Full description

Saved in:
Bibliographic Details
Main Authors: Hudhaifa Mohammed Abdulwahab, S. Ajitha, Mufeed Ahmed Naji Saif, Belal Abdullah Hezam Murshed, Fahd A. Ghanem
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10419347/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850195001457246208
author Hudhaifa Mohammed Abdulwahab
S. Ajitha
Mufeed Ahmed Naji Saif
Belal Abdullah Hezam Murshed
Fahd A. Ghanem
author_facet Hudhaifa Mohammed Abdulwahab
S. Ajitha
Mufeed Ahmed Naji Saif
Belal Abdullah Hezam Murshed
Fahd A. Ghanem
author_sort Hudhaifa Mohammed Abdulwahab
collection DOAJ
description In bioinformatics, medical diagnosis models might be significantly impacted by high-dimensional data generated by high-throughput technologies. This data includes redundant or irrelevant genes, making it challenging to identify the relevant genes from such high-dimensional data. Therefore, an effective feature selection (FS) technique is crucial to mitigate dimensionality, thereby enhancing the performance and accuracy of medical diagnosis. The Cuckoo Search Algorithm (CSA) has proven effective in gene selection, demonstrating prowess in exploitation, exploration, and convergence. However, most of the current CSA-based FS techniques deal with gene selection problems as a single objective rather than adopting a multi-objective mechanism. This article proposes the Multi-Objective Binary Cuckoo Search Algorithm (MOBCSA) for gene selection. MOBCSA extends the standard CSA by incorporating multiple objectives, including accuracy of classification and number of selected genes. MOBCSA utilizes an S-shaped transfer function for transforming the algorithm’s search space from a continuous to a binary search space. MOBCSA integrates two components: an external archive to save the pareto optimal solutions attained during the search process, and an adaptive crowding distance updating mechanism integrated into the archive to maintain diversity and increase the coverage of optimal solutions. To assess MOBCSA’s performance, evaluation experiments were conducted on six benchmark biomedical datasets using three different classifiers. Then, the obtained experimental results were compared against four multi-objective-based state-of-the art FS methods. The findings prove that MOBCSA surpasses the other methods in both accuracy of classification and number of selected genes, where it has obtained an average accuracy ranging from 92.79% to 98.42% and an average number of selected genes ranging from 15.67 to 27.88 for different classifiers and datasets.
format Article
id doaj-art-25c7751d294147d98aaa4638d7d2d53e
institution OA Journals
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-25c7751d294147d98aaa4638d7d2d53e2025-08-20T02:13:52ZengIEEEIEEE Access2169-35362024-01-0112218402186710.1109/ACCESS.2024.336222810419347MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in BioinformaticsHudhaifa Mohammed Abdulwahab0https://orcid.org/0000-0001-6631-051XS. Ajitha1https://orcid.org/0000-0002-1458-1411Mufeed Ahmed Naji Saif2https://orcid.org/0000-0002-0399-6339Belal Abdullah Hezam Murshed3https://orcid.org/0000-0003-2187-5044Fahd A. Ghanem4https://orcid.org/0000-0002-5055-0137Department of Computer Science and IT, University of Science and Technology, Taizz, YemenDepartment of Computer Application, Ramaiah Institute of Technology (affiliated to VTU), Bengaluru, Karnataka, IndiaDepartment of Computer Applications, Sri Jayachamarajendra College of Engineering (affiliated to VTU), JSS TI Campus, Mysore, Karnataka, IndiaDepartment of Computer Science, College of Engineering and IT, Amran University, Amran, YemenDepartment of Computer Science and Engineering, PES College of Engineering, Mysore University, Mandya, IndiaIn bioinformatics, medical diagnosis models might be significantly impacted by high-dimensional data generated by high-throughput technologies. This data includes redundant or irrelevant genes, making it challenging to identify the relevant genes from such high-dimensional data. Therefore, an effective feature selection (FS) technique is crucial to mitigate dimensionality, thereby enhancing the performance and accuracy of medical diagnosis. The Cuckoo Search Algorithm (CSA) has proven effective in gene selection, demonstrating prowess in exploitation, exploration, and convergence. However, most of the current CSA-based FS techniques deal with gene selection problems as a single objective rather than adopting a multi-objective mechanism. This article proposes the Multi-Objective Binary Cuckoo Search Algorithm (MOBCSA) for gene selection. MOBCSA extends the standard CSA by incorporating multiple objectives, including accuracy of classification and number of selected genes. MOBCSA utilizes an S-shaped transfer function for transforming the algorithm’s search space from a continuous to a binary search space. MOBCSA integrates two components: an external archive to save the pareto optimal solutions attained during the search process, and an adaptive crowding distance updating mechanism integrated into the archive to maintain diversity and increase the coverage of optimal solutions. To assess MOBCSA’s performance, evaluation experiments were conducted on six benchmark biomedical datasets using three different classifiers. Then, the obtained experimental results were compared against four multi-objective-based state-of-the art FS methods. The findings prove that MOBCSA surpasses the other methods in both accuracy of classification and number of selected genes, where it has obtained an average accuracy ranging from 92.79% to 98.42% and an average number of selected genes ranging from 15.67 to 27.88 for different classifiers and datasets.https://ieeexplore.ieee.org/document/10419347/Features selectionmulti-objective optimizationcuckoo search algorithmmachine learningdata miningbioinformatics
spellingShingle Hudhaifa Mohammed Abdulwahab
S. Ajitha
Mufeed Ahmed Naji Saif
Belal Abdullah Hezam Murshed
Fahd A. Ghanem
MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
IEEE Access
Features selection
multi-objective optimization
cuckoo search algorithm
machine learning
data mining
bioinformatics
title MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
title_full MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
title_fullStr MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
title_full_unstemmed MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
title_short MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
title_sort mobcsa multi objective binary cuckoo search algorithm for features selection in bioinformatics
topic Features selection
multi-objective optimization
cuckoo search algorithm
machine learning
data mining
bioinformatics
url https://ieeexplore.ieee.org/document/10419347/
work_keys_str_mv AT hudhaifamohammedabdulwahab mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics
AT sajitha mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics
AT mufeedahmednajisaif mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics
AT belalabdullahhezammurshed mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics
AT fahdaghanem mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics