Show simple item record

dc.contributor.advisorParvez, Mohammad Zavid
dc.contributor.advisorAhmed, Sajjad
dc.contributor.authorChowdhury, Mohammad Fahim
dc.contributor.authorSultana, Zakia
dc.contributor.authorJahan, Nusrat
dc.contributor.authorAlavi, Safkat Hasin
dc.date.accessioned2021-10-19T06:35:39Z
dc.date.available2021-10-19T06:35:39Z
dc.date.copyright2021
dc.date.issued2021-01
dc.identifier.otherID 17101293
dc.identifier.otherID 17301182
dc.identifier.otherID 17101332
dc.identifier.otherID 17101515
dc.identifier.urihttp://hdl.handle.net/10361/15435
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 48-49).
dc.description.abstractSpeech to text conversion is a remarkable topic in the field of Artificial Intelligence which is undoubtedly a significant medium of expressing human feelings and thoughts. However, if we compare it with text to speech, work in speech to text conversion has been done less. Among those works, many languages got priority but the numerical value of work in Bengali language is little. Previously a similar work has been done in that language where they got 82.35% accuracy using LSTM[15]. Our approach was to avail more accuracy in speech to text conversion using Neural Network models. We build a novel dataset for research purposes. We tried both GRU and LSTM and focused on LSTM later on. The reason behind it is, GRU showed an unstable and started fluctuating where LSTM is much more stable and minimized errors in case of loss function and the accuracy was also less compared to LSTM. An increasing number of datasets was giving better accuracy and on the whole dataset, the accuracy on testing data is around 90%. In terms of loss function, testing loss is less than 40%. We did data testing manually to justify the result with the given output and we got 90% accuracy rate in a dataset which the model never fed before. In the future, we would like to work with automatic sentence recognition, the process of preparing the response basis of the statement, and also changing sentiment depending on it.en_US
dc.description.statementofresponsibilityMohammad Fahim Chowdhury
dc.description.statementofresponsibilityZakia Sultana
dc.description.statementofresponsibilityNusrat Jahan
dc.description.statementofresponsibilitySafkat Hasin Alavi
dc.format.extent49 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject.lcshLong-term memory
dc.titleConversion of Bengali speech to text using long short-term memory(LSTM)en_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record