Show simple item record

dc.contributor.advisorMostakim, Moin
dc.contributor.authorRashid, Warida
dc.contributor.authorReza, Mohi
dc.date.accessioned2018-01-15T05:23:18Z
dc.date.available2018-01-15T05:23:18Z
dc.date.copyright2017
dc.date.issued2017
dc.identifier.otherID 14301026
dc.identifier.otherID 14101040
dc.identifier.urihttp://hdl.handle.net/10361/9059
dc.descriptionThis thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.en_US
dc.descriptionCataloged from PDF version of thesis report.
dc.descriptionIncludes bibliographical references (pages 31-33).
dc.description.abstractWe have created an isolated-word dataset - Prodorshok 1, which consists of 34 Bengali words related to navigation with 1011 voice samples. The word set is intended to help design speaker dependent/independent, voice-command driven automated speech recognition (ASR) systems that can potentially improve human-computer interaction. This paper presents the results of an objective analysis that was undertaken using a subset of words from Prodorshok I to help assess its reliability in ASR systems that utilize Hidden Markov Models (HMM) with Gaussian emissions and Deep Neural Networks (DNN). The results show that simple data augmentation involving a small pitch shift can make surprisingly tangible improvements to accuracy levels in speech recognition, even when working with small datasets. Prodorshok I will be expanded upon and made publicly available for others to use under an Open Data License (ODbL).en_US
dc.description.statementofresponsibilityWarida Rashid
dc.description.statementofresponsibilityMohi Reza
dc.format.extent34 pages
dc.language.isoenen_US
dc.publisherBRAC Univeristyen_US
dc.rightsBRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectData augmentationen_US
dc.subjectSpeech recognitionen_US
dc.titleBengali isolated speech recognition : a comparative analysis of the effects of data augmentation on HMM and DNN based acoustic modelsen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, BRAC University
dc.description.degreeB. Computer Science and Engineering


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record