Show simple item record

dc.contributor.advisorArif, Hossain
dc.contributor.authorFaruque, MD Yamin
dc.contributor.authorAdeeb, MD Zahin
dc.contributor.authorKamal, Muhammad Maswood
dc.contributor.authorAhmed, Redwan
dc.date.accessioned2021-10-26T05:02:39Z
dc.date.available2021-10-26T05:02:39Z
dc.date.copyright2021
dc.date.issued2021-01
dc.identifier.otherID: 16201059
dc.identifier.otherID: 19241013
dc.identifier.otherID: 16201038
dc.identifier.otherID: 16241005
dc.identifier.urihttp://hdl.handle.net/10361/15541
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 44-45).
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.en_US
dc.description.abstractOptical Character Recognition or OCR is a technology that enables us to detect and extract text from images. In our project, we are designing our OCR system around the Bangla language. This is primarily because, there are many models of text recognition of the English language in the market but there are very few on Bangla. Ourproposedsystemcomprisesofacquiringtheinputimage,pre-processing it,passingitintotheTesseractOCRengine(thebackboneofoursystem)andfinally getting digital output of the text. We have used the latest version of Tesseract, that is, version 5 and even though this is in its alpha stage, it is still stable for endusers. Next, to improve accuracy, we have focused on pre-processing the image as thoroughly as possible and laid out our chosen algorithm in each step. For example, forbinarization,wehaveusedOtsu’sThresholdingalgorithmasthisgaveusthebest results. For segmentation, we have used the Fully Automatic Page Segmentation from Tesseracts own repertoire of segmentation modes. Then we have done our training through Tesseract’s new LSTM engine and improved upon their existing trainedfilewithourfonts. Wehaveselectedthesefontsbasedontheirpopularityof use. Wecalculatedouraccuracyatthewordlevelandourmodelgaveusanaverage accuracy of 95.9% on multiple fonts and on multiple real life scenarios. At best case scenariowehaveevenmanagedtosecure100%accuracy. Finally, wehavediscussed future improvements like the addition of a custom dictionary in our model and how it would increase the overall accuracy in all cases.en_US
dc.description.statementofresponsibilityMD Yamin Faruque
dc.description.statementofresponsibilityMD Zahin Adeeb
dc.description.statementofresponsibilityMuhammad Maswood Kamal
dc.description.statementofresponsibilityRedwan Ahmed
dc.format.extent45 Pages
dc.language.isoen_USen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectOptical Character Recognitionen_US
dc.subjectBangla Languageen_US
dc.subjectBangla OCRen_US
dc.subjectTesseracten_US
dc.subjectRNNen_US
dc.subjectLSTMen_US
dc.subjectOpen CVen_US
dc.subjectOtsu’s Thresholding Algorithmen_US
dc.subjectPythonen_US
dc.subjectjTessEditorFXen_US
dc.subjectImage Processingen_US
dc.subjectCustom Dictionaryen_US
dc.titleBangla optical character recognition from printed text using Tesseract Engineen_US
dc.typeThesis
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record