• Login
    • Library Home
    View Item 
    •   BracU IR
    • School of Data and Sciences (SDS)
    • Department of Computer Science and Engineering (CSE)
    • Thesis & Report, BSc (Computer Science and Engineering)
    • View Item
    •   BracU IR
    • School of Data and Sciences (SDS)
    • Department of Computer Science and Engineering (CSE)
    • Thesis & Report, BSc (Computer Science and Engineering)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Hate speech detection from social networking posts using CNN and XGBoost

    Thumbnail
    View/Open
    16101199, 16101184, 16101005, 16101182_CSE.pdf (9.531Mb)
    Date
    2019-12
    Publisher
    Brac University
    Author
    Shahadat, Ashraf Bin
    Rony, Md. Mizanur Rahman
    Anwar, Md. Adnanul
    Joy, Eialid Ahmed
    Metadata
    Show full item record
    URI
    http://hdl.handle.net/10361/14057
    Abstract
    The increasing growth of social networks and microblogging websites have enabled people from different backgrounds and diverse moral codes to communicate with each other quite easily. While social media promotes communication and sharing of information, these are also used to initiate heinous and negative campaigns. Social networks although discourage such act but people often use these social platforms to propagate offensive and hatred towards individuals or specific groups. Therefore,detecting hate speech has become a serious issue that needs considerable attention. The goal of this research is to detect such campaigns of hate. In this paper, two different approaches have been proposed for detecting hate and offensive language on social platforms. The paper proposes Natural language processing with CNN architecture and XGBoost classifier which will be explicitly effective for capturing the context and the semantics of hate speech. The proposed classifiers distinguish hate speech from neutral text and can achieve a higher quality of classification than current state-of-the-art algorithms.Using CNN,the accuracy that has been obtained on detecting if a tweet is offensive or neutral is 89.18% and on another datasetcontaining hateful, offensive and neutral comments, the accuracy is 84.74%.The later approach of using XGBoost classifier has achieved an accuracy of 93.10% and 80.51% respectively.In addition,2333 tweets have been collected from twitter and labelled using annotators.On that dataset, using CNN model the accuracy is 76.70% and for XGBoost the accuracy is 78.14%.
    Keywords
    Natural Language processing; Hatespeech; Offensive Language; Convolutional Neural Network(CNN); XGBoost
     
    Description
    This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.
     
    Cataloged from PDF version of thesis.
     
    Includes bibliographical references (pages 62-65).
    Department
    Department of Computer Science and Engineering, Brac University
    Collections
    • Thesis & Report, BSc (Computer Science and Engineering)

    Copyright © 2008-2019 Ayesha Abed Library, Brac University 
    Contact Us | Send Feedback
     

     

    Policy Guidelines

    • BracU Policy
    • Publisher Policy

    Browse

    All of BracU Institutional RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Copyright © 2008-2019 Ayesha Abed Library, Brac University 
    Contact Us | Send Feedback