Show simple item record

dc.contributor.advisorTurzo, Esfar E Alam
dc.contributor.authorNahar, Fatiha Binte Kamrun
dc.contributor.authorAfsana, Umme Halima
dc.contributor.authorChowdhury, Azizul Muktadir
dc.contributor.authorHasnaen, Maha
dc.contributor.authorJahan, Sumaya
dc.date.accessioned2023-12-18T06:35:07Z
dc.date.available2023-12-18T06:35:07Z
dc.date.copyright2023
dc.date.issued2023-01
dc.identifier.otherID: 19101500
dc.identifier.otherID: 19101427
dc.identifier.otherID: 22341040
dc.identifier.otherID: 19141002
dc.identifier.otherID: 22241182
dc.identifier.urihttp://hdl.handle.net/10361/22003
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 39-40).
dc.description.abstractIn this research, we propose a hybrid model for predicting suicide risk from text data that incorporates BERT, VADER, and a Random Forest classifier for sentiment analysis. This model aims to identify individuals who may be at risk of committing suicide based on the tone of the text. The model is trained on a labelled dataset of text data that is either classified as ”suicide” or ”not suicide,” which provides the model with instances of text data that are linked with high or low suicide risk respectively. In order to extract feature representations of the text data, the BERT model is utilized, and the VADER model is utilized in order to extract sentiment ratings for each individual text. These features are integrated into a single feature vector for each text, and then the Random Forest classifier is trained using this feature vector. A number of different metrics, including accuracy, precision, recall, and F1-score, are utilized in order to assess the performance of the model. The findings of this research indicate that the hybrid model that was suggested is capable of accurately predicting the risk of suicide based on text data and that it is suitable for use as a tool to help clinical decision-making. The performance of the model to recognize patterns and trends in text data that are indicative of suicide risk holds promise for future research in the subject. Our novel composite model combining BERT, VADER with Random Forest Classifier has the accuracy of 82 percent.en_US
dc.description.statementofresponsibilityFatiha Binte Kamrun Nahar
dc.description.statementofresponsibilityUmme Halima Afsana
dc.description.statementofresponsibilityAzizul Muktadir Chowdhury
dc.description.statementofresponsibilityMaha Hasnaen
dc.description.statementofresponsibilitySumaya Jahan
dc.format.extent40 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectData analyticsen_US
dc.subjectMachine learningen_US
dc.subjectNatural language processingen_US
dc.subjectRandom foresten_US
dc.subjectSuicideen_US
dc.subjectDetection of suicideen_US
dc.subjectAlgorithmsen_US
dc.subjectBerten_US
dc.subjectVaderen_US
dc.subjectText-preprocessingen_US
dc.subjectDepressionen_US
dc.subjectArtificial neural networken_US
dc.subjectNatural language processingen_US
dc.subject.lcshMachine learning.
dc.subject.lcshArtificial intelligence.
dc.titleA machine learning-based approach for data analysis to ascertain suicidal individuals from Social media usersen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB.Sc. in Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record