dc.contributor.author: Hasnat, Md. Abul
dc.contributor.author: Chowdhury, Muttakinur Rahman
dc.contributor.author: Khan, Mumit
dc.date.accessioned: 2010-10-25T06:03:34Z
dc.date.available: 2010-10-25T06:03:34Z
dc.date.copyright: 2009
dc.date.issued: 2009
dc.identifier.uri: http://hdl.handle.net/10361/635
dc.description: Includes bibliographical references (page 5).
dc.description.abstract: Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract is capable of only recognizing English, French, Italian, German, Spanish and Dutch. However, it is possible to make Tesseract recognize other scripts if the engine is trained with the requisite data. In this paper, we present a complete methodology to integrate Bangla script recognition support in Tesseract. [en_US]
dc.description.statementofresponsibility: Md. Abul Hasnat
dc.description.statementofresponsibility: Muttakinur Rahman Chowdhury
dc.description.statementofresponsibility: Mumit Khan
dc.format.extent: 5 pages
dc.language.iso: en [en_US]
dc.publisher: BRAC University [en_US]
dc.subject: Optical character reader (OCR)
dc.subject: Bangla language processing
dc.title: Integrating Bangla script recognition support in tesseract OCR [en_US]
dc.type: Article [en_US]
dc.contributor.department: Center for Research on Bangla Language Processing (CRBLP), BRAC University

