Show simple item record

dc.contributor.advisorRasel, Annajiat Alim
dc.contributor.advisorKarim, Dewan Ziaul
dc.contributor.authorShifullah, Khalid
dc.contributor.authorIslam, Nuzhat
dc.contributor.authorRaihan, Hasin
dc.contributor.authorRakibullah, H.M.
dc.contributor.authorIqbal, Md. Ashik
dc.date.accessioned2024-11-28T05:21:29Z
dc.date.available2024-11-28T05:21:29Z
dc.date.copyright2022
dc.date.issued2022-09
dc.identifier.otherID 18101062
dc.identifier.otherID 18101374
dc.identifier.otherID 19301276
dc.identifier.otherID 18101371
dc.identifier.otherID 19341033
dc.identifier.urihttp://hdl.handle.net/10361/24836
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.en_US
dc.descriptionCatalogued from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 34-36).
dc.description.abstractSocial media has become essential for people all over the world. It has given a platform for people to share thoughts, emotions, opinions, and ideas, causing a huge deal of data upsurge. Such an amount of data could be analyzed based on sentiment analysis and text classification via construction of an effective machine learning model. The concept gets more insight into it through analysis of the data, which is nearly impossible to conduct manually due to its huge configuration. This research focuses on the user’s comments, and reviews about different hotels to predict their sentiment. As for the datasets, comments and reviews of hotels from online sites have been utilized. Moreover, text pre-processing techniques like tokenization, case folding, stopword removal, lemmatization, and duplicate data removal have been applied. TF-IDF and Bag of Words has been applied for word embedding. Furthermore, the effectiveness of supervised machine learning algorithms like, Support Vector Machine, Na¨ıve Bayes, Random Forest, and Logistic Regression was evaluated and from the comparative analysis, it was observed that the Logistic Regression provided the most accuracy ranging from 86 to 89 percent.en_US
dc.description.statementofresponsibilityKhalid Shifullah
dc.description.statementofresponsibilityNuzhat Islam
dc.description.statementofresponsibilityHasin Raihan
dc.description.statementofresponsibilityH.M. Rakibullah
dc.description.statementofresponsibilityMd. Ashik Iqbal
dc.format.extent36 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectSentiment analysisen_US
dc.subjectWord embeddingen_US
dc.subjectClassifieren_US
dc.subjectTokenizationen_US
dc.subjectDecision treeen_US
dc.subjectRandom foresten_US
dc.subjectLogistic regressionen_US
dc.subject.lcshMachine learning
dc.titleClassification of hotel reviews using sentiment analysis and machine learningen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB.Sc. in Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record