Comparison of Supervised Machine Learning Algorithms for Malware Detection

Comparison of Supervised Machine Learning Algorithms for Malware Detection

Authors

  • Mohd Faris Mohd Fuzi Mr
  • Syamir Mohd Shahirudin
  • Iman Hazwam Abd Halim
  • Muhammad Nabil Fikri Jamaluddin

DOI:

https://doi.org/10.24191/jcrinn.v8i2.329

Keywords:

malware, machine learning, detection accuracy, machine learning algorithm, malware detection model

Abstract

Due to the prevalence of security issues and cyberattacks, cybersecurity is crucial in today's environment. Malware has also evolved significantly over the past few years. With the advancement of malware analysis, Machine Learning (ML) is increasingly being used to detect malware. This study's major objective is to compare the best-supervised ML algorithms for malware detection based on detection accuracy. This study includes the scripting and development of supervised ML techniques such as Decision Tree (DT), K-Nearest Neighbors (KNN), Naive Bayes, Random Forest, and Neural Networks. This study was solely concerned with the Windows malware dataset. The malware classification was determined by testing and training the supervised ML algorithms using the extracted features from the malware dataset. Then, the percentage of detection accuracy was used to compare the detection performance of all five algorithms. The detection accuracy is calculated using the confusion matrix, which includes the False Positive Rate (FPR), the True Positive Rate (TPR), and the False Negative Rate (FNR). The results indicated that the Decision Tree and Random Forest algorithms provided the best detection accuracy at 96%, followed by the K-NN algorithm at 95%. To improve the detection accuracy for future research, it is suggested that the malware dataset be enhanced using several architectures, such as Linux and Android, and use additional supervised and unsupervised machine learning algorithms.

Downloads

Download data is not yet available.

References

Irfan, Ariffin, Naz & Anuar, S. (2020). A Malware Detection Framework Based on Forensic and Unsupervised Machine Learning Methodologies. 194–200.

Moubarak, Joanna, and Tony Feghali. Comparing Machine Learning Techniques for Malware Detection - Researchgate.

https://www.researchgate.net/publication/340214601_Comparing_Machine_Learning_Techniques_for_Malware_Detection.

Pavithra & Josephin. “IOPscience.” IOP Conference Series: Materials Science and Engineering, IOP Publishing, 1 Dec. 2020, https://iopscience.iop.org/article/10.1088/1757 899X/993/1/012099/meta.

Samy, Ganthan Narayana, et al. “Analysis of Feature Categories for Malware Visualization.” Journal of Telecommunication, Electronic and Computer Engineering (JTEC), https://jtec.utem.edu.my/jtec/article/view/4703.

Selamat & Ali. (2019). Comparison of malware detection techniques using machine learning algorithm. Indonesian Journal of Electrical

Zakaria, Abdollah, Mohd & Ariffin. (2017). The Rise of Ransomware. Proceedings of the 2017 International Conference on Software and E-Business - ICSEB 2017. https://doi.org/10.1145/3178212.3178224

Downloads

Published

2023-09-01

How to Cite

Mohd Fuzi, M. F., Mohd Shahirudin, S., Abd Halim, I. H., & Jamaluddin, M. N. F. (2023). Comparison of Supervised Machine Learning Algorithms for Malware Detection . Journal of Computing Research and Innovation, 8(2), 67–73. https://doi.org/10.24191/jcrinn.v8i2.329

Issue

Section

General Computing

Most read articles by the same author(s)

1 2 3 > >> 
Loading...