dc.contributor.author: Shatil, Adnan Mohammad Shoeb
dc.contributor.author: Khan, Mumit
dc.date.accessioned: 2010-10-24T04:57:57Z
dc.date.available: 2010-10-24T04:57:57Z
dc.date.copyright: 2006
dc.date.issued: 2006
dc.identifier.uri: http://hdl.handle.net/10361/630
dc.description: Includes bibliographical references (page 5).
dc.description.abstract: This paper presents a method to use a Kohonen neural network based classifier in a Bangla Optical Character Recognition (OCR) system, providing much higher performance than traditional neural network based classifiers. It describes how Bangla characters are processed, trained and then recognized with the use of a Kohonen network. While there have been significant efforts in using the various types of Artificial Neural Networks (ANN) in optical character recognition, this is the first published account of a segmentation-free optical character recognition system for Bangla using a Kohonen network. The methodology presented here assumes that the OCR pre-processor has minimally segmented the input words into easily segmentable chunks and presents each of these as an image to the classification engine described here. The size and the font face used to render the characters are also significant in both training and classification. The images are first converted into grayscale and then to binary images; these images are then scaled to fit a predetermined area with a fixed but significant number of pixels. The feature vectors are then extracted from the rectangular pixel map; each vector is simply a fixed-length series of 0s and 1s. Finally, a Kohonen neural network is chosen for the training and classification process. Although the steps are simple, and the simplest network is chosen for the training and recognition process, the resulting classifier is accurate to better than 98%, depending on the quality of the input images. [en_US]
dc.description.statementofresponsibility: Adnan Mohammad Shoeb Shatil
dc.description.statementofresponsibility: Mumit Khan
dc.format.extent: 5 pages
dc.language.iso: en [en_US]
dc.publisher: BRAC University [en_US]
dc.subject: Bangla optical character recognition
dc.title: Minimally segmenting performance Bangla optical character recognition using Kohonen network [en_US]
dc.type: Article [en_US]
dc.contributor.department: Center for Research on Bangla Language Processing (CRBLP), BRAC University
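
The pipeline summarized in the abstract (grayscale to binary, scaling to a fixed area, flattening to a fixed-length vector of 0s and 1s, then a Kohonen network) can be sketched roughly as follows. This is a minimal illustration in Python, not the authors' implementation: the function and class names, the threshold of 128, nearest-neighbour scaling, the winner-take-all update without a neighbourhood function, and seeding each neuron from a training sample are all assumptions made for the example.

```python
def binarize(gray, threshold=128):
    """Map a grayscale grid (0-255) to 0/1; dark (ink) pixels become 1.
    The threshold value is an assumption, not taken from the paper."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def scale_nearest(img, out_h, out_w):
    """Nearest-neighbour resize of a 2D grid to out_h x out_w."""
    in_h, in_w = len(img), len(img[0])
    return [[img[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
            for r in range(out_h)]


def feature_vector(gray, size=(8, 8)):
    """Grayscale grid -> fixed-length list of 0s and 1s (row-major order)."""
    scaled = scale_nearest(binarize(gray), size[0], size[1])
    return [px for row in scaled for px in row]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


class KohonenClassifier:
    """Winner-take-all competitive layer; each neuron is labelled after training."""

    def __init__(self, n_neurons):
        self.n_neurons = n_neurons
        self.weights = []
        self.labels = {}

    def winner(self, x):
        """Index of the neuron whose weight vector is closest to x."""
        return min(range(len(self.weights)),
                   key=lambda j: sq_dist(self.weights[j], x))

    def train(self, samples, epochs=20, lr=0.5):
        """samples: list of (vector, label) pairs, at least n_neurons long.
        Neurons are seeded from the first n_neurons samples (an assumption)."""
        self.weights = [list(vec) for vec, _ in samples[:self.n_neurons]]
        for epoch in range(epochs):
            rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
            for x, _ in samples:
                w = self.weights[self.winner(x)]
                for i, xi in enumerate(x):
                    w[i] += rate * (xi - w[i])  # pull the winner toward the input
        # Label each neuron with the class of a sample that it wins.
        for x, label in samples:
            self.labels[self.winner(x)] = label

    def classify(self, x):
        """Return the label of the winning neuron, or None if it is unlabelled."""
        return self.labels.get(self.winner(x))
```

A caller would build one feature vector per training image with `feature_vector`, pass the `(vector, label)` pairs to `train`, and then call `classify` on the vector of an unseen image. A full self-organizing map would also update the winner's neighbours; that step is omitted here to keep the sketch short.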

