Early detection of chronic kidney disease using machine learning

Abrar, Tahmid; Tasnim, Samiha; Hossain, Md. Mehrab

dc.contributor.advisor	Alam, Md Ashraful
dc.contributor.author	Abrar, Tahmid
dc.contributor.author	Tasnim, Samiha
dc.contributor.author	Hossain, Md. Mehrab
dc.date.accessioned	2019-10-29T10:13:07Z
dc.date.available	2019-10-29T10:13:07Z
dc.date.copyright	2019
dc.date.issued	2019-09
dc.identifier.other	ID 14301051
dc.identifier.other	ID 18341027
dc.identifier.other	ID 131010 43
dc.identifier.uri	http://hdl.handle.net/10361/12817
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 41-44).
dc.description.abstract	Chronic kidney disease (CKD) is a globally prevalent ailment that claims a large number of lives. CKD is the 11th leading cause of global mortality, with 1.2 million deaths each year, and according to the Kidney Foundation of Bangladesh, around 40,000 people with CKD experience kidney failure annually, while several thousand die at an early age because of CKD. Predictive analytics for healthcare using machine learning is a challenging task that can help doctors decide on the right treatments to save lives. Researchers have studied chronic kidney disease collaboratively, but the majority of their work relies on purely statistical models, leaving numerous gaps in the development of machine learning models. In this article we discuss the current methods and propose an improved technique based on XGBoost (Extreme Gradient Boosting), which combines significant features identified by their F scores and is evaluated under four pre-processing scenarios. In addition, we present machine learning methods for predicting chronic kidney disease from clinical information. The machine learning techniques explored include Support Vector Regressor (SVR), Logistic Regressor (LR), AdaBoost, Gradient Boosting Tree and Decision Tree Regressor. The models are built on the UCI chronic kidney disease dataset, and their results are compared to determine the best regression model for the prediction. Of the four pre-processing cases, replacing missing values with the mean value of each column and selecting important features was the most logical, as it allows training on more data without dropping records. XGBoost gave the best results in all four cases: it obtained 98% accuracy in case one, where null values are dropped; 98.75% testing accuracy in both cases two and three, where null values were replaced with the minimum and maximum values of each column; and 100% accuracy in case four, where null values are replaced with mean values.
Thus, the system can be implemented for early-stage CKD prediction in a cost-efficient way, which will be helpful for underdeveloped and developing countries.	en_US
dc.description.statementofresponsibility	Tahmid Abrar
dc.description.statementofresponsibility	Samiha Tasnim
dc.description.statementofresponsibility	Md. Mehrab Hossain
dc.format.extent	44 pages
dc.language.iso	en	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Chronic kidney disease	en_US
dc.subject	XGBoost	en_US
dc.subject	Support Vector Regressor (SVR)	en_US
dc.subject	Logistic Regressor (LR)	en_US
dc.subject	AdaBoost	en_US
dc.subject	Gradient Boosting Tree and Decision Tree Regressor	en_US
dc.subject.lcsh	Machine learning
dc.title	Early detection of chronic kidney disease using machine learning	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B. Computer Science

Files in this item

Name: 14301051, 18341027, 131010 ...
Size: 2.172Mb
Format: PDF

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1589]
