Show simple item record

dc.contributor.advisorChakrabarty, Amitabha
dc.contributor.advisorRodoshi, Ahanaf Hassan
dc.contributor.authorBushra, Raisa Hasan
dc.contributor.authorAlam, Md Taukir
dc.contributor.authorSaha, Aniruddho
dc.contributor.authorFahim, Nazmus Sakib
dc.contributor.authorBinty, Nabila Mourium
dc.date.accessioned2023-10-15T10:39:29Z
dc.date.available2023-10-15T10:39:29Z
dc.date.copyright©2022
dc.date.issued2022-09-29
dc.identifier.otherID 18301064
dc.identifier.otherID 18301277
dc.identifier.otherID 18201117
dc.identifier.otherID 18201166
dc.identifier.otherID 19101082
dc.identifier.urihttp://hdl.handle.net/10361/21825
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2022.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 32-36).
dc.description.abstractMalware detection research has been popular over the years as the variations and complexity of malware attacks are increasing daily. Using variously Supervised and Unsupervised machine learning algorithms to detect, identify, or classify malware attacks has been proven a very effective technique for some past years. Some com- mon and widely concerning malware attacks are Trojan, Adware, Ransomware, and Zero-day. In this paper, we used ten ML algorithms such as AdaBoost, Stochastic Gradient Descent (SGD), Naïve Bayes (NB), Decision Tree (DT), Random For- est (RF), XGBoost, Logistic Regression (LR), Multi-Layer Perceptron (MLP), K- Nearest Neighbour(KNN), Support Vector Machine (SVM) for classifying software- based Trojan attacks, Ransomware, Adware and Zero-day attacks. This research was conducted on a dataset having a total sample of 12863 malware, consisting of the malware categories mentioned above, to extract features and learn patterns. Also, we showed a comparison between these ML methods and analysis based on how they classify these popular malware in this paper after testing each classifier on the selected dataset. After implementation, RF achieved the highest accuracy of 86.97%, and Gaussian NB achieved the lowest accuracy of 47.84%. MLP, XGBoost, KNN, DT, AdaBoost, SVM, LR, SGD got 83.60%, 82.59%, 80.68%, 79.63%, 73.30%, 73.22%, 67.08%, 64.40% accuracy respectively. Other than accuracy, our analysis was based on individual accuracy, precision, and F1-score, TPR, TNR, FPR, and FNR of malware classes for each ML classifier.en_US
dc.description.statementofresponsibilityRaisa Hasan Bushra
dc.description.statementofresponsibilityMd Taukir Alam
dc.description.statementofresponsibilityAniruddho Saha
dc.description.statementofresponsibilityNazmus Sakib Fahim
dc.description.statementofresponsibilityNabila Mourium Binty
dc.format.extent47 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectMachine learningen_US
dc.subjectTrojanen_US
dc.subjectAdwareen_US
dc.subjectRansomwareen_US
dc.subjectClassificationen_US
dc.subjectMalwareen_US
dc.subjectZero-dayen_US
dc.subjectNaïve Bayesen_US
dc.subjectStochastic gradient descenten_US
dc.subjectRandom foresten_US
dc.subjectDecision treeen_US
dc.subjectAdaBoosten_US
dc.subjectXGBoosten_US
dc.subjectLogistic regressionen_US
dc.subjectMulti-layer perceptronen_US
dc.subjectK- nearest neighbouren_US
dc.subjectSupport vector machineen_US
dc.subject.lcshRegression analysis
dc.subject.lcshComputer algorithms
dc.titlePerformance analysis of machine learning algorithms for Malware classificationen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB.Sc. in Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record