dc.contributor.advisor: Islam, Md Saiful
dc.contributor.author: Zariyat, Tahreema Rahman
dc.contributor.author: Ahmed, Fahim Irfan
dc.contributor.author: Oishi, Tahsina Tajrim
dc.contributor.author: Morshed, Maruf
dc.date.accessioned: 2024-08-19T06:13:34Z
dc.date.available: 2024-08-19T06:13:34Z
dc.date.copyright: 2024
dc.date.issued: 2024-01
dc.identifier.other: ID 20101433
dc.identifier.other: ID 20101508
dc.identifier.other: ID 20101394
dc.identifier.other: ID 20101299
dc.identifier.uri: http://hdl.handle.net/10361/23795
dc.description: This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2024. (en_US)
dc.description: Cataloged from PDF version of thesis.
dc.description: Includes bibliographical references (pages 32-34).
dc.description.abstract: "In this fast-paced world, everyone relies on technology to get their work done quickly and efficiently, since technology greatly simplifies every task. Most publications are lengthy and packed with crucial data. However, in many instances, extra words are added to boost the word count, which makes it difficult to find the desired information. For the English language, numerous tools are available to summarize text and present it in tabular form. However, the same is not true for our mother tongue, Bangla. Despite being the 5th most-spoken native language in the world, there is no tool available to ease this workload in the Bengali language. Our research will assist in such circumstances by summarizing the given information in tabular form within the shortest possible time. Since no dataset suitable for our research is available, we have prepared the dataset ourselves. We then used the mBART-50-large, mT5-base, mT5-m2m-CrossSum and BanglaT5 models for the implementation. Finding the appropriate table headers in light of the context and order of the data is the most important task in this study. To sum up, our main goal is to develop a benchmark dataset for a text-to-table model for the betterment of the NLP research community." (en_US)
dc.description.statementofresponsibility: Tahreema Rahman Zariyat
dc.description.statementofresponsibility: Fahim Irfan Ahmed
dc.description.statementofresponsibility: Tahsina Tajrim Oishi
dc.description.statementofresponsibility: Maruf Morshed
dc.format.extent: 34 pages
dc.language.iso: en (en_US)
dc.publisher: Brac University (en_US)
dc.rights: Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject: Bangla NLP (en_US)
dc.subject: Text2Table (en_US)
dc.subject: Summarizer (en_US)
dc.subject: mBART (en_US)
dc.subject: Transformer (en_US)
dc.subject: Information extraction (en_US)
dc.subject: T5 (en_US)
dc.subject: mT5 (en_US)
dc.subject.lcsh: Computation and Language
dc.title: BnText2Table – dataset and Text-to-Table generation in Bangla (en_US)
dc.type: Thesis (en_US)
dc.contributor.department: Department of Computer Science and Engineering, Brac University
dc.description.degree: B.Sc. in Computer Science

