A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware

The advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving And...

Full description

Saved in:
Bibliographic Details
Main Authors: Fandi Kurniawan, Deris Stiawan, Darius Antoni, Mohd Yazid Idris, Rahmat Budiarto
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11082145/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849417146582433792
author Fandi Kurniawan
Deris Stiawan
Darius Antoni
Mohd Yazid Idris
Rahmat Budiarto
author_facet Fandi Kurniawan
Deris Stiawan
Darius Antoni
Mohd Yazid Idris
Rahmat Budiarto
author_sort Fandi Kurniawan
collection DOAJ
description The advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving Android Package Kit (.apk) files related to electronic wedding invitations. This study aims to develop a robust and efficient static analysis-based machine learning framework for early detection of Android malware. Six machine learning algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Naive Bayes, AdaBoost, and Gradient Boosting are used to identify malicious behavior in APK files. The dataset used consists of 2,084 Android applications, including 1,314 malware samples and 770 benign applications, obtained through a reverse engineering process. Data pre-processing, feature extraction, and training using supervised learning are carried out to optimize detection accuracy. The experimental results show that the Random Forest algorithm achieves the best performance with 97% accuracy and 95% precision, surpassing the performance of other algorithms.
format Article
id doaj-art-e15527139c794f598cc7fa986f5d9d63
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-e15527139c794f598cc7fa986f5d9d632025-08-20T03:32:55ZengIEEEIEEE Access2169-35362025-01-011312718312722010.1109/ACCESS.2025.358965611082145A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android MalwareFandi Kurniawan0https://orcid.org/0009-0003-3012-387XDeris Stiawan1https://orcid.org/0000-0002-9302-1868Darius Antoni2Mohd Yazid Idris3https://orcid.org/0000-0001-7702-6610Rahmat Budiarto4https://orcid.org/0000-0002-6374-4731Faculty of Engineering, Universitas Sriwijaya, Palembang, IndonesiaFaculty of Computer Science, Universitas Sriwijaya, Palembang, IndonesiaFaculty of Computer Science, Universitas Indo Global Mandiri, Palembang, IndonesiaFaculty of Computing, Universiti Teknologi Malaysia, Johor Bahru, Johor, MalaysiaCollege of Computing and Information, Al-Baha University, Al Aqiq, Saudi ArabiaThe advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving Android Package Kit (.apk) files related to electronic wedding invitations. This study aims to develop a robust and efficient static analysis-based machine learning framework for early detection of Android malware. Six machine learning algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Naive Bayes, AdaBoost, and Gradient Boosting are used to identify malicious behavior in APK files. The dataset used consists of 2,084 Android applications, including 1,314 malware samples and 770 benign applications, obtained through a reverse engineering process. Data pre-processing, feature extraction, and training using supervised learning are carried out to optimize detection accuracy. The experimental results show that the Random Forest algorithm achieves the best performance with 97% accuracy and 95% precision, surpassing the performance of other algorithms.https://ieeexplore.ieee.org/document/11082145/Reverse engineeringmalware androidmachine learningmalware detectionstatic malware analysis
spellingShingle Fandi Kurniawan
Deris Stiawan
Darius Antoni
Mohd Yazid Idris
Rahmat Budiarto
A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
IEEE Access
Reverse engineering
malware android
machine learning
malware detection
static malware analysis
title A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
title_full A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
title_fullStr A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
title_full_unstemmed A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
title_short A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
title_sort robust and efficient machine learning framework for enhancing early detection of android malware
topic Reverse engineering
malware android
machine learning
malware detection
static malware analysis
url https://ieeexplore.ieee.org/document/11082145/
work_keys_str_mv AT fandikurniawan arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT derisstiawan arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT dariusantoni arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT mohdyazididris arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT rahmatbudiarto arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT fandikurniawan robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT derisstiawan robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT dariusantoni robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT mohdyazididris robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware
AT rahmatbudiarto robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware