Breast cancer prediction using different machine learning models
Abstract
Breast cancer is often the most lethal diseases with a large mortality rate especially among women. Despite the severe effect of the disease, it is possible to pinpoint the genre of breast cancer using diff t machine learning algorithms. However, many of these algorithms perform differenttly depending on their types and complexities. In our work, we have analyzed and compared the classification results of various ma- chine learning models and fi out the best model to classify between diff t types of breast cancers. We have used Logistic Regression, SVM, Random Forest, AdaBoost Tree, NaA˜ ve Bayes, K neighbor classifier, Decision Tree and Gaussian Process classifiers for our comparative study. Additionally, we applied dimensional- ity reduction in order to simplify our dataset from 30 features to 2 features so that the computation time can be reduced. Our task is to critically analysis different data and to classify them with respect to the efficacy of each algorithm in terms of accuracy, precision, recall and F1 Score. Without dimensionality reduction, our best accuracy was 97.36 percent which was found using SVM. Then again, with dimensionality reduction, the prime accurate result was 98.24 percent which was achieved by SVM and the computation time also decreased.