
dc.contributor.advisor: Chakrabarty, Dr. Amitabha
dc.contributor.author: Akhter, Nahid
dc.date.accessioned: 2017-07-26T10:35:54Z
dc.date.available: 2017-07-26T10:35:54Z
dc.date.copyright: 2016
dc.date.issued: 2016
dc.identifier.other: ID 14166001
dc.identifier.uri: http://hdl.handle.net/10361/8366
dc.description: This thesis report is submitted in partial fulfilment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2016. [en_US]
dc.description: Cataloged from PDF version of thesis report.
dc.description: Includes bibliographical references (pages 46-50).
dc.description.abstract: Automatic Speech Recognition plays an important role in human-computer interaction and can be applied in vital areas such as crime-fighting and helping the hearing-impaired. It consists of two domains – Audio Speech Recognition and Visual Speech Recognition. This thesis addresses speech recognition in the visual domain only, i.e. recognizing speech without the presence or support of any auditory signal. So far, a lot of research has been done on lip-reading in English and some on French and Chinese, as well as a few other languages, but not much has been done on lip-reading in Bengali. This thesis provides a new approach to lip-reading Bengali vowels using a combination of the curvature of the inner and outer lips and Neural Networks. The method detects the lip contour with an algorithm that is faster and more robust than the conventional methods used so far, such as the Active Contour Model, Active Appearance Model and Active Shape Model. The feature extraction method is also new: it uses the coefficients of the curves of the inner and outer lips, and therefore needs fewer parameters to represent the shape of the lips when a vowel is pronounced. Moreover, the method is robust to lips aligned at different angles and also works with low-resolution pictures. Finally, for recognition of the viseme, a Backpropagation Neural Network is trained and simulated using the gradient descent method. [en_US]
dc.description.statementofresponsibility: Nahid Akhter
dc.format.extent: 50 pages
dc.language.iso: en [en_US]
dc.publisher: BRAC University [en_US]
dc.rights: BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject: Neural network [en_US]
dc.subject: Lip curvature [en_US]
dc.subject: Viseme recognition [en_US]
dc.title: A viseme recognition system using lip curvature and neural networks to detect Bangla vowels [en_US]
dc.type: Thesis [en_US]
dc.contributor.department: Department of Computer Science and Engineering, BRAC University
dc.description.degree: M. Computer Science and Engineering