Show simple item record

dc.contributor.advisorKarim, Dewan Ziaul
dc.contributor.advisorSaha, Ramkrishna
dc.contributor.authorArobi, Ritu
dc.contributor.authorRijon, Rakibul Hassan
dc.contributor.authorHaque, Nimul
dc.contributor.authorSafkat, T.I.M. Sakir
dc.contributor.authorSafoan, Sk
dc.date.accessioned2023-03-30T03:44:27Z
dc.date.available2023-03-30T03:44:27Z
dc.date.copyright2022
dc.date.issued2022-05
dc.identifier.otherID 17101330
dc.identifier.otherID 17301090
dc.identifier.otherID 17301136
dc.identifier.otherID 17201112
dc.identifier.otherID 17301160
dc.identifier.urihttp://hdl.handle.net/10361/18038
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 52-56).
dc.description.abstractOpinions from others can be important when a choice is necessary, mainly when valuable resources such as time and wealth are involved. People often depend on their peers’ past experiences. The analysis of sentiments or opinions is a computation using text analysis of people’s emotions, thoughts, and feelings. In recent years, it became the most significant natural language processing and sentiment analysis research approach. Because it is founded on people’s views and as all people have different viewpoints to see things, it is becoming more popular every day. In this digital world, when a person decides to purchase a product or utilize a service, they get access to many customer evaluations. Still, it is a tiresome process to read and analyze them all. Moreover, when an organization wants to make a profit, find new possibilities, anticipate sales trends, and manage its reputation via public opinion or sell its product, it also needs to address many customer remarks accessible to its customers. Therefore, our goal is to show that it is feasible to do so with sentiment analysis approaches. With sentiment analysis, it is easy to analyze and extract a vast number of accessible data comments from both consumers that can aid in fulfilling the objectives of the organization. The dataset utilized here is obtained from the Ali-Express e-commerce website’s online product reviews. We use various data processing techniques such as tokenization, removal of punctuation marks and stop words, stemming, TF-IDF, and parts of speech tagging. The results of our study comprise several methods of machine learning techniques. In this work, we examined eight distinct types of Machine Learning Algorithms, including Naive Bayes Classifier, SVM, Random Forest Classifier, Logistic Regression (L.R.), K- nearest neighbors, XGBoost Classifier, Decision Tree Classifier, and Gradient Boosting, and compared their precision and accuracy to find the most accurate one. According to our analysis, Logistic Regression is performing better among all the other seven classifiers having an accuracy of around 98%, and KNN is performing lowest, having an accuracy of 46%. Other classifiers such as Decision Tree is, giving 69%, Gradient Boosting 63%, Naive Bayes and SVM 82%, and Random Forest and XGBoost are giving 80% of accuracy.en_US
dc.description.statementofresponsibilityRitu Arobi
dc.description.statementofresponsibilityRakibul Hassan Rijon
dc.description.statementofresponsibilityNimul Haque
dc.description.statementofresponsibilityT.I.M. Sakir Safkat
dc.description.statementofresponsibilitySk Safoan
dc.format.extent56 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectTokenizationen_US
dc.subjectNaive Bayes Classifieren_US
dc.subjectSVMen_US
dc.subjectRandom Forest Classifieren_US
dc.subjectLogistic Regression (L.R.)en_US
dc.subjectK-nearest neighborsen_US
dc.subjectXGBoost classifieren_US
dc.subjectDecision Tree Classifier(DTC)en_US
dc.subjectGradient boosting classifieren_US
dc.subjectTF-IDFen_US
dc.subject.lcshMachine learning
dc.subject.lcshMathematical statistics--Data processing
dc.titleSentiment analysis on E-commerce based product reviews using machine learning algorithmsen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record