dc.contributor.advisor | Arif, Hossain | |
dc.contributor.author | Oishi, Fayza Rezwana | |
dc.contributor.author | Al Mahadi, Mehnaj | |
dc.contributor.author | Parvez, Omar Bin | |
dc.date.accessioned | 2020-02-02T04:47:45Z | |
dc.date.available | 2020-02-02T04:47:45Z | |
dc.date.copyright | 2018 | |
dc.date.issued | 2018-12 | |
dc.identifier.other | ID 18201215 | |
dc.identifier.other | ID 13201076 | |
dc.identifier.other | ID 18241031 | |
dc.identifier.uri | http://hdl.handle.net/10361/13694 | |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 33-34). | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2018. | |
dc.description.abstract | The world of Machine Learning is expanding everyday through its implementations in
modern day healthcare. Researchers have sketched out many ways to implement
Machine Learning algorithms and droned into ways to make them work in their utmost
efficiencies. As there will always be the need for healthcare in the world, we believe that
there will always be a need of comparison between Machine Learning algorithms in
terms of their performance and relevance to make healthcare more reliable through
Machine Learning. For this study, we have picked up the most commonly used Machine
Learning algorithms, Logistic Regression, Support Vector Machine, Decision Tree and
Random Forest to produce a comparative analysis on a dataset of Framingham Heart
Study which is dedicated to the prediction of risk of Coronary Heart Disease (CHD). We
have used a combination of Data Preprocessing and Feature Selection methods, namely
The Row Elimination method and Recursive Feature Elimination respectively. To understand
the impact of each prevailing features in the dataset on the target feature, we have
applied the Chi Squared Technique which is a highly recommended technique when it
comes to classification problems. To compare and analyze performance of the
algorithms, we applied concepts of the Confusion Matrix, Precision, Recall and F1
Scores; we have plotted ROC curves using Sensitivity and Specificity scores to categorize
the algorithms’ behavior. We have found out that the highest average accuracy in our
study was given by the Logistic Regression algorithm (83.9%) while the other algorithms
have come fairly close. | en_US |
dc.description.statementofresponsibility | Fayza Rezwana Oishi | |
dc.description.statementofresponsibility | Mehnaj Al Mahadi | |
dc.description.statementofresponsibility | Omar Bin Parvez | |
dc.format.extent | 34 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Machine learning algorithms | en_US |
dc.subject | Coronary Heart Disease (CHD) | en_US |
dc.subject | Healthcare | en_US |
dc.subject | Chi Squared Technique | en_US |
dc.subject.lcsh | Machine learning | |
dc.subject.lcsh | Computer algorithms | |
dc.subject.lcsh | Machine learning--Mathematical models | |
dc.title | Comparative analysis between machine learning algorithms in efficiency of Coronary Heart Disease (CHD) prediction | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B. Computer Science | |