Show simple item record

dc.contributor.advisorSadeque, Farig Yousuf
dc.contributor.authorAhmed, Anika
dc.contributor.authorChowdhury, Nafis
dc.contributor.authorHaque, Moinul
dc.date.accessioned2025-01-14T04:54:51Z
dc.date.available2025-01-14T04:54:51Z
dc.date.copyright©2024
dc.date.issued2024-10
dc.identifier.otherID 21101029
dc.identifier.otherID 21101034
dc.identifier.otherID 21101186
dc.identifier.urihttp://hdl.handle.net/10361/25152
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2024.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 43-44).
dc.description.abstractLarge Language models are artificial intelligence models that hold the capability to understand and generate natural language text as they are trained using large amounts of data for a lot of languages. The sources these models are trained on include books, articles, websites, and many more. As the large language models know the languages along with their syntax and structures thoroughly, we can expect them to work well for the Bengali language and compose enough knowledge related to the Bengali culture. One of the challenges of working with the Bengali language is the lack of Natural Language Processing methods such as Semantic Parsing, Parts of Speech tagging, and Named Entity Recognition. Our motive was to test the effectiveness of large language models in answering Bengali culture and languagebased queries, alongside analyzing which fields of knowledge require improvement. As we do not need Natural Language Processing tools while working with large language models, these models could serve our purpose. Therefore, through our research, we formed a corpus to analyze the utility of large language models for the Bengali language. This corpus aided us in recognizing the gaps of the large language models in terms of factual and cultural commonsense knowledge through natural language processing tasks such as question-answering and masked prediction.en_US
dc.description.statementofresponsibilityAnika Ahmed
dc.description.statementofresponsibilityNafis Chowdhury
dc.description.statementofresponsibilityMoinul Haque
dc.format.extent52 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectLarge language modelen_US
dc.subjectNatural language processingen_US
dc.subjectNLPen_US
dc.subjectBengali languageen_US
dc.subject.lcshNatural language processing (Computer science).
dc.subject.lcshNatural language generation (Computer science).
dc.subject.lcshArtificial intelligence.
dc.titleUtility of large language models to extract commonsense knowledgeen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB.Sc. in Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record