Detection and Analysis of Malicious Software Using Machine Learning Models

The continuous evolution of malware poses a significant challenge in cybersecurity, adapting to technological advancements despite implemented security measures. This paper introduces an innovative approach to enhance the detection of obfuscated malware through the integration of machine learning (M...

Full description

Saved in:

Bibliographic Details
Main Authors:	Selman Hızal, Ahmet Öztürk
Format:	Article
Language:	English
Published:	Sakarya University 2024-08-01
Series:	Sakarya University Journal of Computer and Information Sciences
Subjects:	information security software analysis malware detection system machine learning
Online Access:	https://dergipark.org.tr/en/download/article-file/3952776
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849724256710033408
author	Selman Hızal Ahmet Öztürk
author_facet	Selman Hızal Ahmet Öztürk
author_sort	Selman Hızal
collection	DOAJ
description	The continuous evolution of malware poses a significant challenge in cybersecurity, adapting to technological advancements despite implemented security measures. This paper introduces an innovative approach to enhance the detection of obfuscated malware through the integration of machine learning (ML). Utilizing a real-world dataset of prevalent malware types such as spyware, ransomware, and trojan horses, our study addresses the evolving challenges of cybersecurity. In this study, we evaluate the performance of ML algorithms for obfuscated malware detection using the CIC-MalMem-2022 dataset. Our analysis encompasses binary and multi-class classification tasks under various experimental conditions, including percentage splits and 10-fold cross-validation. The evaluated algorithms include Random Tree (RT), Random Forest (RF), J-48 (C4.5), Naive Bayes (NB), and XGBoost. Experimental results demonstrate the effectiveness of RF, J-48, and XGBoost in achieving high accuracy rates across different classification tasks. NB also shows competitive performance but faces challenges in handling imbalanced datasets and multi-class classification. Our findings highlight the importance of employing advanced ML techniques for enhancing obfuscated malware detection capabilities and provide valuable insights for cybersecurity practitioners and researchers. Future research directions include fine-tuning model hyperparameters, exploring ensemble learning approaches, and expanding evaluation to diverse datasets and real-world scenarios.
format	Article
id	doaj-art-e1f8d2177d0a4fd7ab837be988a4100e
institution	DOAJ
issn	2636-8129
language	English
publishDate	2024-08-01
publisher	Sakarya University
record_format	Article
series	Sakarya University Journal of Computer and Information Sciences
spelling	doaj-art-e1f8d2177d0a4fd7ab837be988a4100e2025-08-20T03:10:47ZengSakarya UniversitySakarya University Journal of Computer and Information Sciences2636-81292024-08-017226427610.35377/saucis...148923728Detection and Analysis of Malicious Software Using Machine Learning ModelsSelman Hızal0https://orcid.org/0000-0001-6345-0066Ahmet Öztürk1https://orcid.org/0009-0009-5228-7596SAKARYA UNIVERSITY OF APPLIED SCIENCESSAKARYA UYGULAMALI BİLİMLER ÜNİVERSİTESİThe continuous evolution of malware poses a significant challenge in cybersecurity, adapting to technological advancements despite implemented security measures. This paper introduces an innovative approach to enhance the detection of obfuscated malware through the integration of machine learning (ML). Utilizing a real-world dataset of prevalent malware types such as spyware, ransomware, and trojan horses, our study addresses the evolving challenges of cybersecurity. In this study, we evaluate the performance of ML algorithms for obfuscated malware detection using the CIC-MalMem-2022 dataset. Our analysis encompasses binary and multi-class classification tasks under various experimental conditions, including percentage splits and 10-fold cross-validation. The evaluated algorithms include Random Tree (RT), Random Forest (RF), J-48 (C4.5), Naive Bayes (NB), and XGBoost. Experimental results demonstrate the effectiveness of RF, J-48, and XGBoost in achieving high accuracy rates across different classification tasks. NB also shows competitive performance but faces challenges in handling imbalanced datasets and multi-class classification. Our findings highlight the importance of employing advanced ML techniques for enhancing obfuscated malware detection capabilities and provide valuable insights for cybersecurity practitioners and researchers. Future research directions include fine-tuning model hyperparameters, exploring ensemble learning approaches, and expanding evaluation to diverse datasets and real-world scenarios.https://dergipark.org.tr/en/download/article-file/3952776information securitysoftware analysismalware detection systemmachine learning
spellingShingle	Selman Hızal Ahmet Öztürk Detection and Analysis of Malicious Software Using Machine Learning Models Sakarya University Journal of Computer and Information Sciences information security software analysis malware detection system machine learning
title	Detection and Analysis of Malicious Software Using Machine Learning Models
title_full	Detection and Analysis of Malicious Software Using Machine Learning Models
title_fullStr	Detection and Analysis of Malicious Software Using Machine Learning Models
title_full_unstemmed	Detection and Analysis of Malicious Software Using Machine Learning Models
title_short	Detection and Analysis of Malicious Software Using Machine Learning Models
title_sort	detection and analysis of malicious software using machine learning models
topic	information security software analysis malware detection system machine learning
url	https://dergipark.org.tr/en/download/article-file/3952776
work_keys_str_mv	AT selmanhızal detectionandanalysisofmalicioussoftwareusingmachinelearningmodels AT ahmetozturk detectionandanalysisofmalicioussoftwareusingmachinelearningmodels

Detection and Analysis of Malicious Software Using Machine Learning Models

Similar Items