BRAC University Institutional Repository

A high performance domain specific OCR for Bangla script

DSpace/Manakin Repository

Show simple item record

dc.contributor.author Hasnat, Md. Abul
dc.contributor.author Habib, S. M. Murtoza
dc.contributor.author Khan, Mumit
dc.date.accessioned 2010-10-27T04:32:33Z
dc.date.available 2010-10-27T04:32:33Z
dc.date.issued 2008
dc.identifier.uri http://hdl.handle.net/10361/641
dc.description.abstract Abstract-Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR for recognizing Bengali script. We select the training data set from the script of the specified domain. We choose Hidden Markov Model (HMM) for character classification due to its simple and straightforward way of representation. We examine the primary error types that mainly occurred at preprocessing level and carefully handled those errors by adding special error correcting module as a part of recognizer. Finally we added a dictionary and some error specific rules to correct the probable errors after the word formation is done. The entire technique significantly increases the performance of the OCR for a specific domain to a great extent. en_US
dc.language.iso en en_US
dc.publisher Center for research on Bangla language processing (CRBLP), BRAC University en_US
dc.title A high performance domain specific OCR for Bangla script en_US
dc.type Technical Report en_US


Files in this item

Files Size Format View
A High Performa ... OCRF or Bangla Script.pdf 319.3Kb PDF View/Open or Preview

This item appears in the following Collection(s)

Show simple item record

Policy Guidelines

Search DSpace


Browse

My Account