Show simple item record

dc.contributor.advisorAlam, Md. Golam Rabiul
dc.contributor.authorShahadat, Ashraf Bin
dc.contributor.authorRony, Md. Mizanur Rahman
dc.contributor.authorAnwar, Md. Adnanul
dc.contributor.authorJoy, Eialid Ahmed
dc.date.accessioned2020-10-12T06:16:45Z
dc.date.available2020-10-12T06:16:45Z
dc.date.copyright2019
dc.date.issued2019-12
dc.identifier.otherID: 16101199
dc.identifier.otherID: 16101184
dc.identifier.otherID: 16101005
dc.identifier.otherID: 16101182
dc.identifier.urihttp://hdl.handle.net/10361/14057
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 62-65).
dc.description.abstractThe increasing growth of social networks and microblogging websites have enabled people from different backgrounds and diverse moral codes to communicate with each other quite easily. While social media promotes communication and sharing of information, these are also used to initiate heinous and negative campaigns. Social networks although discourage such act but people often use these social platforms to propagate offensive and hatred towards individuals or specific groups. Therefore,detecting hate speech has become a serious issue that needs considerable attention. The goal of this research is to detect such campaigns of hate. In this paper, two different approaches have been proposed for detecting hate and offensive language on social platforms. The paper proposes Natural language processing with CNN architecture and XGBoost classifier which will be explicitly effective for capturing the context and the semantics of hate speech. The proposed classifiers distinguish hate speech from neutral text and can achieve a higher quality of classification than current state-of-the-art algorithms.Using CNN,the accuracy that has been obtained on detecting if a tweet is offensive or neutral is 89.18% and on another datasetcontaining hateful, offensive and neutral comments, the accuracy is 84.74%.The later approach of using XGBoost classifier has achieved an accuracy of 93.10% and 80.51% respectively.In addition,2333 tweets have been collected from twitter and labelled using annotators.On that dataset, using CNN model the accuracy is 76.70% and for XGBoost the accuracy is 78.14%.en_US
dc.description.statementofresponsibilityAshraf Bin Shahadat
dc.description.statementofresponsibilityMd. Mizanur Rahman Rony
dc.description.statementofresponsibilityMd. Adnanul Anwar
dc.description.statementofresponsibilityEialid Ahmed Joy
dc.format.extent65 pages
dc.language.isoen_USen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectNatural Language processingen_US
dc.subjectHatespeechen_US
dc.subjectOffensive Languageen_US
dc.subjectConvolutional Neural Network(CNN)en_US
dc.subjectXGBoosten_US
dc.titleHate speech detection from social networking posts using CNN and XGBoosten_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record