Show simple item record

dc.contributor.advisorArif, Hossain
dc.contributor.authorOishi, Fayza Rezwana
dc.contributor.authorAl Mahadi, Mehnaj
dc.contributor.authorParvez, Omar Bin
dc.date.accessioned2020-02-02T04:47:45Z
dc.date.available2020-02-02T04:47:45Z
dc.date.copyright2018
dc.date.issued2018-12
dc.identifier.otherID 18201215
dc.identifier.otherID 13201076
dc.identifier.otherID 18241031
dc.identifier.urihttp://hdl.handle.net/10361/13694
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 33-34).
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2018.
dc.description.abstractThe world of Machine Learning is expanding everyday through its implementations in modern day healthcare. Researchers have sketched out many ways to implement Machine Learning algorithms and droned into ways to make them work in their utmost efficiencies. As there will always be the need for healthcare in the world, we believe that there will always be a need of comparison between Machine Learning algorithms in terms of their performance and relevance to make healthcare more reliable through Machine Learning. For this study, we have picked up the most commonly used Machine Learning algorithms, Logistic Regression, Support Vector Machine, Decision Tree and Random Forest to produce a comparative analysis on a dataset of Framingham Heart Study which is dedicated to the prediction of risk of Coronary Heart Disease (CHD). We have used a combination of Data Preprocessing and Feature Selection methods, namely The Row Elimination method and Recursive Feature Elimination respectively. To understand the impact of each prevailing features in the dataset on the target feature, we have applied the Chi Squared Technique which is a highly recommended technique when it comes to classification problems. To compare and analyze performance of the algorithms, we applied concepts of the Confusion Matrix, Precision, Recall and F1 Scores; we have plotted ROC curves using Sensitivity and Specificity scores to categorize the algorithms’ behavior. We have found out that the highest average accuracy in our study was given by the Logistic Regression algorithm (83.9%) while the other algorithms have come fairly close.en_US
dc.description.statementofresponsibilityFayza Rezwana Oishi
dc.description.statementofresponsibilityMehnaj Al Mahadi
dc.description.statementofresponsibilityOmar Bin Parvez
dc.format.extent34 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectMachine learning algorithmsen_US
dc.subjectCoronary Heart Disease (CHD)en_US
dc.subjectHealthcareen_US
dc.subjectChi Squared Techniqueen_US
dc.subject.lcshMachine learning
dc.subject.lcshComputer algorithms
dc.subject.lcshMachine learning--Mathematical models
dc.titleComparative analysis between machine learning algorithms in efficiency of Coronary Heart Disease (CHD) predictionen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record