• Login
    • Library Home
    View Item 
    •   BracU IR
    • Centre for Research on Bangla Language Processing (CRBLP)
    • Conference Papers (Centre for Research on Bangla Language Processing)
    • View Item
    •   BracU IR
    • Centre for Research on Bangla Language Processing (CRBLP)
    • Conference Papers (Centre for Research on Bangla Language Processing)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Segmentation free Bangla OCR using HMM: Training and recognition

    Thumbnail
    View/Open
    Segmentation free Bangla OCR using HMM.pdf (177.2Kb)
    Date
    2007
    Publisher
    BRAC University
    Author
    Hasnat, Md. Abul
    Habib, S. M. Murtoza
    Khan, Mumit
    Metadata
    Show full item record
    URI
    http://hdl.handle.net/10361/666
    Abstract
    The wide area of the application of HMM is in Speech Recognition where each spoken word is considered as a single unit to be recognized from the trained word network. Using this concept some research has been done for character recognition. In this paper, we present the training and recognition mechanism of a Hidden Markov Model (HMM) based multi font supported Optical Character Recognition (OCR) system for Bangla character. In our approach the central idea is separate HMM model for each segmented character or word. We emphasize on word level segmentation and like to consider the single character as a word when the character appears alone after segmentation process is done. The system uses HTK toolkit for data preparation, model training from multiple samples and recognition. Features of each trained character are calculated by applying Discrete Cosine Transform (DCT) to each pixel value of the character image where the image is divided into several frames according to its size. The extracted features of each frame are used as discrete probability distributions that will be given as input parameter to each HMM model. In case of recognition a model for each separated character or word is build up using the same approach. This model is given to the HTK toolkit to perform the recognition using Viterbi Decoding. The experimental result shows significant performance.
    Keywords
    Optical character recognition (OCR); Hidden Markov Model (HMM); HTK; Discrete cosine transform (DCT)
     
    Description
    Includes bibliographical references (page 7-8).
    Department
    Center for Research on Bangla language processing (CRBLP), BRAC University
    Type
    Article
    Collections
    • Conference Papers (Centre for Research on Bangla Language Processing)

    Copyright © 2008-2023 Ayesha Abed Library, Brac University 
    Contact Us | Send Feedback
     

     

    Policy Guidelines

    • BracU Policy
    • Publisher Policy

    Browse

    All of BracU Institutional RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Copyright © 2008-2023 Ayesha Abed Library, Brac University 
    Contact Us | Send Feedback