Show simple item record

dc.contributor.advisorAlam, Md. Golam Rabiul
dc.contributor.advisorRahman, Rafeed
dc.contributor.authorShanto, Hasibul Hossain
dc.contributor.authorFarooqui, Farhan
dc.contributor.authorRafi, Abdullah Al
dc.contributor.authorFeona, Maisha Maliha
dc.contributor.authorPhul, Progya Talukder
dc.date.accessioned2025-02-23T05:02:56Z
dc.date.available2025-02-23T05:02:56Z
dc.date.copyright2024
dc.date.issued2024
dc.identifier.otherID 19301217
dc.identifier.otherID 19301230
dc.identifier.otherID 19301213
dc.identifier.otherID 20101339
dc.identifier.otherID 19301170
dc.identifier.urihttp://hdl.handle.net/10361/25531
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2024.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 42-44).
dc.description.abstract"Grooming children on social media is a dangerous side effect of modern internet era. AI models, specially NLP have the potential to play a critical role in detecting grooming behavior. Even though, there have been studies in the past to build a grooming detection system, there is limited research on building such systems us- ing modern NLP techniques. In this paper, we propose a modern sexual grooming detection system using state-of-the-art NLP models and techniques that can detect and alert users to potentially dangerous online interactions between groomers and their targets. Our detection system is a ConversationClassifier which is able to clas- sify conversations, whether they are grooming or not. With over 19,000 grooming sentences collected from PervertedJustice grooming conversations, we created an annotated dataset exhibiting the grooming characteristics. Conversational data was also collected from both PervertedJustice and PAN12 dataset. With the sentence- level annotated dataset, we trained a SentenceClassifier model based on RoBERTa & DeBERTa to be able to accurately predict if a sentence has grooming character- istics or not. The ConversationClassifier was built on top of the SentenceClassifier with LSTM & GRU to capture the sequential features in the conversation. Further- more, a self-attention mechanism was added so that the model can focus on relevant sentences. Our models achieved promising results. In case of the SentenceClassifier, it displayed an accuracy of 93% for RoBERTa and 94% for DeBERTa. We paired the RoBERTa based SentenceClassifier with LSTM which yielded an accuracy of 97% and DeBERTa based SentenceClassifier with GRU which yielded an accuracy of 95%."en_US
dc.description.statementofresponsibilityHasibul Hossain Shanto
dc.description.statementofresponsibilityFarhan Farooqui
dc.description.statementofresponsibilityAbdullah Al Rafi
dc.description.statementofresponsibilityMaisha Maliha Feona
dc.description.statementofresponsibilityProgya Talukder Phul
dc.format.extent51 pages
dc.language.isoenen_US
dc.publisherBRAC Universityen_US
dc.rightsBRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectGroomingen_US
dc.subjectKidsen_US
dc.subjectPedophilesen_US
dc.subjectRoBERTaen_US
dc.subjectPredatorsen_US
dc.subjectOnlineen_US
dc.subjectAIen_US
dc.subjectNLPen_US
dc.subjectClassificationen_US
dc.subjectGRUen_US
dc.subjectLSTMen_US
dc.subjectDeBERTAen_US
dc.subject.lcshArtificial intelligence
dc.subject.lcshNatural language processing (Computer science)
dc.titleAn AI and NLP approach for detecting grooming behavioren_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, BRAC University
dc.description.degreeB.Sc. in Computer Science and Engineering


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record