MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics
In bioinformatics, medical diagnosis models might be significantly impacted by high-dimensional data generated by high-throughput technologies. This data includes redundant or irrelevant genes, making it challenging to identify the relevant genes from such high-dimensional data. Therefore, an effect...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2024-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10419347/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850195001457246208 |
|---|---|
| author | Hudhaifa Mohammed Abdulwahab S. Ajitha Mufeed Ahmed Naji Saif Belal Abdullah Hezam Murshed Fahd A. Ghanem |
| author_facet | Hudhaifa Mohammed Abdulwahab S. Ajitha Mufeed Ahmed Naji Saif Belal Abdullah Hezam Murshed Fahd A. Ghanem |
| author_sort | Hudhaifa Mohammed Abdulwahab |
| collection | DOAJ |
| description | In bioinformatics, medical diagnosis models might be significantly impacted by high-dimensional data generated by high-throughput technologies. This data includes redundant or irrelevant genes, making it challenging to identify the relevant genes from such high-dimensional data. Therefore, an effective feature selection (FS) technique is crucial to mitigate dimensionality, thereby enhancing the performance and accuracy of medical diagnosis. The Cuckoo Search Algorithm (CSA) has proven effective in gene selection, demonstrating prowess in exploitation, exploration, and convergence. However, most of the current CSA-based FS techniques deal with gene selection problems as a single objective rather than adopting a multi-objective mechanism. This article proposes the Multi-Objective Binary Cuckoo Search Algorithm (MOBCSA) for gene selection. MOBCSA extends the standard CSA by incorporating multiple objectives, including accuracy of classification and number of selected genes. MOBCSA utilizes an S-shaped transfer function for transforming the algorithm’s search space from a continuous to a binary search space. MOBCSA integrates two components: an external archive to save the pareto optimal solutions attained during the search process, and an adaptive crowding distance updating mechanism integrated into the archive to maintain diversity and increase the coverage of optimal solutions. To assess MOBCSA’s performance, evaluation experiments were conducted on six benchmark biomedical datasets using three different classifiers. Then, the obtained experimental results were compared against four multi-objective-based state-of-the art FS methods. The findings prove that MOBCSA surpasses the other methods in both accuracy of classification and number of selected genes, where it has obtained an average accuracy ranging from 92.79% to 98.42% and an average number of selected genes ranging from 15.67 to 27.88 for different classifiers and datasets. |
| format | Article |
| id | doaj-art-25c7751d294147d98aaa4638d7d2d53e |
| institution | OA Journals |
| issn | 2169-3536 |
| language | English |
| publishDate | 2024-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Access |
| spelling | doaj-art-25c7751d294147d98aaa4638d7d2d53e2025-08-20T02:13:52ZengIEEEIEEE Access2169-35362024-01-0112218402186710.1109/ACCESS.2024.336222810419347MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in BioinformaticsHudhaifa Mohammed Abdulwahab0https://orcid.org/0000-0001-6631-051XS. Ajitha1https://orcid.org/0000-0002-1458-1411Mufeed Ahmed Naji Saif2https://orcid.org/0000-0002-0399-6339Belal Abdullah Hezam Murshed3https://orcid.org/0000-0003-2187-5044Fahd A. Ghanem4https://orcid.org/0000-0002-5055-0137Department of Computer Science and IT, University of Science and Technology, Taizz, YemenDepartment of Computer Application, Ramaiah Institute of Technology (affiliated to VTU), Bengaluru, Karnataka, IndiaDepartment of Computer Applications, Sri Jayachamarajendra College of Engineering (affiliated to VTU), JSS TI Campus, Mysore, Karnataka, IndiaDepartment of Computer Science, College of Engineering and IT, Amran University, Amran, YemenDepartment of Computer Science and Engineering, PES College of Engineering, Mysore University, Mandya, IndiaIn bioinformatics, medical diagnosis models might be significantly impacted by high-dimensional data generated by high-throughput technologies. This data includes redundant or irrelevant genes, making it challenging to identify the relevant genes from such high-dimensional data. Therefore, an effective feature selection (FS) technique is crucial to mitigate dimensionality, thereby enhancing the performance and accuracy of medical diagnosis. The Cuckoo Search Algorithm (CSA) has proven effective in gene selection, demonstrating prowess in exploitation, exploration, and convergence. However, most of the current CSA-based FS techniques deal with gene selection problems as a single objective rather than adopting a multi-objective mechanism. This article proposes the Multi-Objective Binary Cuckoo Search Algorithm (MOBCSA) for gene selection. MOBCSA extends the standard CSA by incorporating multiple objectives, including accuracy of classification and number of selected genes. MOBCSA utilizes an S-shaped transfer function for transforming the algorithm’s search space from a continuous to a binary search space. MOBCSA integrates two components: an external archive to save the pareto optimal solutions attained during the search process, and an adaptive crowding distance updating mechanism integrated into the archive to maintain diversity and increase the coverage of optimal solutions. To assess MOBCSA’s performance, evaluation experiments were conducted on six benchmark biomedical datasets using three different classifiers. Then, the obtained experimental results were compared against four multi-objective-based state-of-the art FS methods. The findings prove that MOBCSA surpasses the other methods in both accuracy of classification and number of selected genes, where it has obtained an average accuracy ranging from 92.79% to 98.42% and an average number of selected genes ranging from 15.67 to 27.88 for different classifiers and datasets.https://ieeexplore.ieee.org/document/10419347/Features selectionmulti-objective optimizationcuckoo search algorithmmachine learningdata miningbioinformatics |
| spellingShingle | Hudhaifa Mohammed Abdulwahab S. Ajitha Mufeed Ahmed Naji Saif Belal Abdullah Hezam Murshed Fahd A. Ghanem MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics IEEE Access Features selection multi-objective optimization cuckoo search algorithm machine learning data mining bioinformatics |
| title | MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics |
| title_full | MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics |
| title_fullStr | MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics |
| title_full_unstemmed | MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics |
| title_short | MOBCSA: Multi-Objective Binary Cuckoo Search Algorithm for Features Selection in Bioinformatics |
| title_sort | mobcsa multi objective binary cuckoo search algorithm for features selection in bioinformatics |
| topic | Features selection multi-objective optimization cuckoo search algorithm machine learning data mining bioinformatics |
| url | https://ieeexplore.ieee.org/document/10419347/ |
| work_keys_str_mv | AT hudhaifamohammedabdulwahab mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics AT sajitha mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics AT mufeedahmednajisaif mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics AT belalabdullahhezammurshed mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics AT fahdaghanem mobcsamultiobjectivebinarycuckoosearchalgorithmforfeaturesselectioninbioinformatics |