A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware
The advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving And...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11082145/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849417146582433792 |
|---|---|
| author | Fandi Kurniawan Deris Stiawan Darius Antoni Mohd Yazid Idris Rahmat Budiarto |
| author_facet | Fandi Kurniawan Deris Stiawan Darius Antoni Mohd Yazid Idris Rahmat Budiarto |
| author_sort | Fandi Kurniawan |
| collection | DOAJ |
| description | The advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving Android Package Kit (.apk) files related to electronic wedding invitations. This study aims to develop a robust and efficient static analysis-based machine learning framework for early detection of Android malware. Six machine learning algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Naive Bayes, AdaBoost, and Gradient Boosting are used to identify malicious behavior in APK files. The dataset used consists of 2,084 Android applications, including 1,314 malware samples and 770 benign applications, obtained through a reverse engineering process. Data pre-processing, feature extraction, and training using supervised learning are carried out to optimize detection accuracy. The experimental results show that the Random Forest algorithm achieves the best performance with 97% accuracy and 95% precision, surpassing the performance of other algorithms. |
| format | Article |
| id | doaj-art-e15527139c794f598cc7fa986f5d9d63 |
| institution | Kabale University |
| issn | 2169-3536 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Access |
| spelling | doaj-art-e15527139c794f598cc7fa986f5d9d632025-08-20T03:32:55ZengIEEEIEEE Access2169-35362025-01-011312718312722010.1109/ACCESS.2025.358965611082145A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android MalwareFandi Kurniawan0https://orcid.org/0009-0003-3012-387XDeris Stiawan1https://orcid.org/0000-0002-9302-1868Darius Antoni2Mohd Yazid Idris3https://orcid.org/0000-0001-7702-6610Rahmat Budiarto4https://orcid.org/0000-0002-6374-4731Faculty of Engineering, Universitas Sriwijaya, Palembang, IndonesiaFaculty of Computer Science, Universitas Sriwijaya, Palembang, IndonesiaFaculty of Computer Science, Universitas Indo Global Mandiri, Palembang, IndonesiaFaculty of Computing, Universiti Teknologi Malaysia, Johor Bahru, Johor, MalaysiaCollege of Computing and Information, Al-Baha University, Al Aqiq, Saudi ArabiaThe advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving Android Package Kit (.apk) files related to electronic wedding invitations. This study aims to develop a robust and efficient static analysis-based machine learning framework for early detection of Android malware. Six machine learning algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Naive Bayes, AdaBoost, and Gradient Boosting are used to identify malicious behavior in APK files. The dataset used consists of 2,084 Android applications, including 1,314 malware samples and 770 benign applications, obtained through a reverse engineering process. Data pre-processing, feature extraction, and training using supervised learning are carried out to optimize detection accuracy. The experimental results show that the Random Forest algorithm achieves the best performance with 97% accuracy and 95% precision, surpassing the performance of other algorithms.https://ieeexplore.ieee.org/document/11082145/Reverse engineeringmalware androidmachine learningmalware detectionstatic malware analysis |
| spellingShingle | Fandi Kurniawan Deris Stiawan Darius Antoni Mohd Yazid Idris Rahmat Budiarto A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware IEEE Access Reverse engineering malware android machine learning malware detection static malware analysis |
| title | A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware |
| title_full | A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware |
| title_fullStr | A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware |
| title_full_unstemmed | A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware |
| title_short | A Robust and Efficient Machine Learning Framework for Enhancing Early Detection of Android Malware |
| title_sort | robust and efficient machine learning framework for enhancing early detection of android malware |
| topic | Reverse engineering malware android machine learning malware detection static malware analysis |
| url | https://ieeexplore.ieee.org/document/11082145/ |
| work_keys_str_mv | AT fandikurniawan arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT derisstiawan arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT dariusantoni arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT mohdyazididris arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT rahmatbudiarto arobustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT fandikurniawan robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT derisstiawan robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT dariusantoni robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT mohdyazididris robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware AT rahmatbudiarto robustandefficientmachinelearningframeworkforenhancingearlydetectionofandroidmalware |