Performance analysis of machine learning classi ers for detecting PE malware

Azmee, ABM.Adnan; Choudhury, Pranto Protim; Alam, Md.Aosaful; Dutta, Orko

View/Open

16101155, 16101062, 16101061, 16101022_CSE.pdf (3.182Mb)

Date

2019-12

Publisher

Brac University

Abstract

In this modern era of technology, securing and protecting one's data has been a major concern and needs to be focused on. Malware is a program that is designed to cause harm and malware analysis is one of the paramount focused points under the sight of cyber forensic professionals and network administrations. The degree of the harm brought about by malignant programming varies to a great extent. If this happens at home to a random person then that may lead to some loss of irrel- evant or unimportant information but for a corporate network, it can lead to loss of valuable business data. The existing research does focus on some few machine learning algorithms to detect malware and very few of them worked with Portable Executables (PE) les. However, we worked on the PE les and also for real-time computation, a client-server model was developed by using Flask to detect malware or benign. In this paper, we mainly focused on top classi cation algorithms and compare their accuracy to nd out which one is giving the best result according to the dataset and also compare among these algorithms. Top machine learning clas- si cation algorithms were used alongside neural networks such as Arti cial Neural Network, XGBoost, Support Vector Machine, Extra Tree Classi er, etc. The exper- imental result shows that XGBoost achieved the highest accuracy of 98.62 percent when compared with other approaches. Thus, to provide a better solution for this kind of anomalies, we have been interested in researching malware detection and want to contribute to building strong and protective cybersecurity.

Keywords

Malware detection; Machine learning; Data protection; XGBoost; Support Vector Machine; Extra Tree Classi er; Client- Server Model

LC Subject Headings

Neural networks; Machine learning

Description

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.

Cataloged from PDF version of thesis.

Includes bibliographical references (pages 58-60).

Department

Department of Computer Science and Engineering, Brac University

Type

Thesis

Collections

Thesis & Report, BSc (Computer Science and Engineering) [1588]