Utility of large language models to extract commonsense knowledge

Ahmed, Anika; Chowdhury, Nafis; Haque, Moinul

dc.contributor.advisor	Sadeque, Farig Yousuf
dc.contributor.author	Ahmed, Anika
dc.contributor.author	Chowdhury, Nafis
dc.contributor.author	Haque, Moinul
dc.date.accessioned	2025-01-14T04:54:51Z
dc.date.available	2025-01-14T04:54:51Z
dc.date.copyright	©2024
dc.date.issued	2024-10
dc.identifier.other	ID 21101029
dc.identifier.other	ID 21101034
dc.identifier.other	ID 21101186
dc.identifier.uri	http://hdl.handle.net/10361/25152
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2024.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 43-44).
dc.description.abstract	Large Language models are artificial intelligence models that hold the capability to understand and generate natural language text as they are trained using large amounts of data for a lot of languages. The sources these models are trained on include books, articles, websites, and many more. As the large language models know the languages along with their syntax and structures thoroughly, we can expect them to work well for the Bengali language and compose enough knowledge related to the Bengali culture. One of the challenges of working with the Bengali language is the lack of Natural Language Processing methods such as Semantic Parsing, Parts of Speech tagging, and Named Entity Recognition. Our motive was to test the effectiveness of large language models in answering Bengali culture and languagebased queries, alongside analyzing which fields of knowledge require improvement. As we do not need Natural Language Processing tools while working with large language models, these models could serve our purpose. Therefore, through our research, we formed a corpus to analyze the utility of large language models for the Bengali language. This corpus aided us in recognizing the gaps of the large language models in terms of factual and cultural commonsense knowledge through natural language processing tasks such as question-answering and masked prediction.	en_US
dc.description.statementofresponsibility	Anika Ahmed
dc.description.statementofresponsibility	Nafis Chowdhury
dc.description.statementofresponsibility	Moinul Haque
dc.format.extent	52 pages
dc.language.iso	en	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Large language model	en_US
dc.subject	Natural language processing	en_US
dc.subject	NLP	en_US
dc.subject	Bengali language	en_US
dc.subject.lcsh	Natural language processing (Computer science).
dc.subject.lcsh	Natural language generation (Computer science).
dc.subject.lcsh	Artificial intelligence.
dc.title	Utility of large language models to extract commonsense knowledge	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B.Sc. in Computer Science

Files in this item

Name:: 21101029, 21101034, 21101186_C ...
Size:: 448.7Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1566]

Show simple item record