Show simple item record

dc.contributor.authorHasnat, Md. Abul
dc.contributor.authorHabib, S. M. Murtoza
dc.contributor.authorKhan, Mumit
dc.date.accessioned2010-10-04T10:44:46Z
dc.date.available2010-10-04T10:44:46Z
dc.date.copyright2007
dc.date.issued2007
dc.identifier.urihttp://hdl.handle.net/10361/327
dc.descriptionIncludes bibliographical references (page 5).
dc.description.abstractResearch on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR for recognizing Bengali script. We select the training data set from the script of the specified domain. We choose Hidden Markov Model (HMM) for character classification due to its simple and straightforward way of representation. We examine the primary error types that mainly occurred at preprocessing level and carefully handled those errors by adding special error correcting module as a part of recognizer. Finally we added a dictionary and some error specific rules to correct the probable errors after the word formation is done. The entire technique significantly increases the performance of the OCR for a specific domain to a great extent.en_US
dc.description.statementofresponsibilityMd. Abul Hasnat
dc.description.statementofresponsibilityS. M. Murtoza Habib
dc.description.statementofresponsibilityMumit Khan
dc.format.extent5 pages
dc.language.isoenen_US
dc.publisherBRAC Universityen_US
dc.subjectOptical character reader (OCR)
dc.subjectBangla language processing
dc.titleA high performance domain specific OCR for Bangla scripten_US
dc.typeArticleen_US
dc.contributor.departmentCenter for Research on Bangla Language Processing (CRBLP), BRAC University


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record