dc.contributor.advisor | Rasel, Annajiat Alim | |
dc.contributor.advisor | Karim, Dewan Ziaul | |
dc.contributor.author | Shifullah, Khalid | |
dc.contributor.author | Islam, Nuzhat | |
dc.contributor.author | Raihan, Hasin | |
dc.contributor.author | Rakibullah, H.M. | |
dc.contributor.author | Iqbal, Md. Ashik | |
dc.date.accessioned | 2024-11-28T05:21:29Z | |
dc.date.available | 2024-11-28T05:21:29Z | |
dc.date.copyright | 2022 | |
dc.date.issued | 2022-09 | |
dc.identifier.other | ID 18101062 | |
dc.identifier.other | ID 18101374 | |
dc.identifier.other | ID 19301276 | |
dc.identifier.other | ID 18101371 | |
dc.identifier.other | ID 19341033 | |
dc.identifier.uri | http://hdl.handle.net/10361/24836 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022. | en_US |
dc.description | Catalogued from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 34-36). | |
dc.description.abstract | Social media has become essential for people all over the world. It gives people a platform to share thoughts, emotions, opinions, and ideas, which has caused a huge upsurge in data. Data at this scale can be analyzed through sentiment analysis and text classification by constructing an effective machine learning model. Such analysis yields insight that is nearly impossible to obtain manually because of the volume of data. This research focuses on users' comments and reviews about different hotels in order to predict their sentiment. The datasets consist of hotel comments and reviews collected from online sites. Text pre-processing techniques such as tokenization, case folding, stopword removal, lemmatization, and duplicate data removal have been applied. TF-IDF and Bag of Words have been applied for word embedding. Furthermore, the effectiveness of supervised machine learning algorithms such as Support Vector Machine, Naïve Bayes, Random Forest, and Logistic Regression was evaluated, and the comparative analysis showed that Logistic Regression provided the highest accuracy, ranging from 86 to 89 percent. | en_US |
dc.description.statementofresponsibility | Khalid Shifullah | |
dc.description.statementofresponsibility | Nuzhat Islam | |
dc.description.statementofresponsibility | Hasin Raihan | |
dc.description.statementofresponsibility | H.M. Rakibullah | |
dc.description.statementofresponsibility | Md. Ashik Iqbal | |
dc.format.extent | 36 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Sentiment analysis | en_US |
dc.subject | Word embedding | en_US |
dc.subject | Classifier | en_US |
dc.subject | Tokenization | en_US |
dc.subject | Decision tree | en_US |
dc.subject | Random forest | en_US |
dc.subject | Logistic regression | en_US |
dc.subject.lcsh | Machine learning | |
dc.title | Classification of hotel reviews using sentiment analysis and machine learning | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B.Sc. in Computer Science | |