Analysis on dengue severity using machine learning approach
Abstract
Dengue is a viral disease that spreads in tropical and subtropical regions and is especially prevalent in South-East Asia. To some certain extent, this mosquito-borne
disease triggers nationwide epidemics, which results in large number of fatalities. In
our study, we mainly worked with two data sets from two countries: Bangladesh and
Vietnam. For the Vietnamese data set we have used supervised learning and implemented different prediction models like Decision Tree Classifier, Random Forest,
Gradient Boosting, Ada Boosting, XG-Boosting Classifier Model and have taken the
best fitted one (that being XG-Boosting Classifier) to predict the severity amongst
the dengue infected patients. After predicting severity we analyzed the data set
further to identify what might be the possible cause leading towards the DSS or
the DHF for the clinical data. In parallel, for the Bangladeshi data set we applied
the unsupervised learning technique, Hierarchical Clustering, to find the different
clusters of various vital components of the patients according to their blood report.
We then analyzed the clusters further to find the severity among the patients, which
led them to DSS or DHF.