Now showing items 26-45 of 63

    • A high performance domain specific OCR for Bangla script 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
      Abstract-Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain ...
    • A high performance domain specific OCR for Bangla script 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
      Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR ...
    • History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? 

      Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However in Bangla it appears ...
    • Infrastructure for Bangla information retrieval in context of ICT for development 

      Haque, Nafid; Ali, M Hammad; Abduallah, Matin Saad (BRAC University, 2006)
      In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...
    • Integrating Bangla computing support in openoffice.org 

      Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2007)
      This paper addresses the issues of Integrating Bangla Computing support for OpenOffice.org office suite and in the process, Identifies and describes the different problems associated with OpenOffice.org and that should ...
    • Integrating Bangla script recognition support in tesseract OCR 

      Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)
      Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
    • Isolated and continuous bangla speech recognition: implementation, performance and application perspective 

      Hasnat, Md. Abul; Mowla, Jabir; Khan, Mumit (BRAC University, 2007)
      Research on automatic speech recognition has been approach progressively since 1930 and the major advances are since 1980 with the introduction of the statistical modeling of speech with the key technology Hidden Markov ...
    • JKimmo: A Multilingual computational mophology frame work for PC-KIMMO 

      Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
      Morphological analysis is of fundamental interest in computational linguistics and language processing. While there are established morphological analyzers for mostly Western and a few other languages using localized ...
    • A light weight stemmer for Bengali and its use in spelling checker 

      Islam, Md. Zahurul; Uddin, Md. Nizam; Khan, Mumit (BRAC University, 2007)
      Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
    • Localization birdging the digital divide 

      Haque, Nafid (BRAC University, 2007)
      In this paper, a proposal has been given to make an equitable access of education and technology among the people of the underdeveloped and developing countries of the world. Here a concept called localization has been ...
    • Minimally segmenting performance Bangla optical character recognition using Kohonen network 

      Shatil, Adnan Mohammad Shoeb; Khan, Mumit (BRAC University, 2006)
      This paper presents a method to use Kohonen neural network based classifier in Bangla Optical Character Recognition (OCR) system, providing much higher performance than the traditional neural network based ones. It describes ...
    • Morphological analysis of inflecting compound words in Bangla 

      Dasgupta, Sajib; Khan, Naira; Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2005)
      The addition of inflectional suffixes in Bangla com-pound words is fairly complex. A compound is a word that is formed by two or more different words acting as a single entity. One of the key distinguishing features of ...
    • Morphological parsing of Bangla wods using PC-KIMMO 

      Dasgupta, Sajib; Khan,Mumit (BRAC University, 2004)
      This paper describes Morphological parsing of Bangla words using PC-KIMMO, based on Kimmo Koskeniemil's model of two-level Morphology. There are three sections in the PC-KIMMO: rules section lexicon section and grammar ...
    • N-gram based statistical grammar checker for Bangla and English 

      Alam, Md. Jahangir; UzZaman, Naushad; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2006)
      This paper describes a statistical grammar checker, which considers the n-gram based analysis of words and POS tags to decide whether the sentence is grammatically correct or not. We employed this technique for both Bangla ...
    • Optical character recognition for Bangla documents using HMM 

      Monjel, Md. Sheemam; Khan, Mumit (BRAC University, 2007)
      In this paper we have described an OCR program made for Bangla documents. This program uses HMM for the recognition process. The description of full OCR program is too large to present here. So, we have given emphasis on ...
    • A proposed automated extraction procedure of Bangla text for corpus creation in unicode 

      Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of automated Bangla corpus creation, which will significantly help the processes of lexicon development, morphological analysis, automatic parts of speech detection and automatic grammar ...
    • Research report on Bangla optical character recognition using Kohonen network 

      Shatil, Adnan Md. Shoeb (BRAC University, 2007)
      This report discusses the theory and implementation of an Optical Character Recognition (OCR) for Bangla. The principal idea is to convert images of text documents such as those obtained from scanning a document into ...
    • Research report on Bangla wordnet development challenges and solutions 

      Khan, Mumit (BRAC University, 2007-10-08)
      We describe the initial design of Bangla WordNet (BWN), based on the English WordNet 2.2 distribution from Princeton University. Our goal is to develop a 5,000 entry Bangla WordNet over the next two years. At present, we ...
    • Research report on Bengali NLP engine for TTS 

      Alam, Firoj (BRAC University, 2008-04-07)
      This report describes the Bengali NLP processor for TTS, along with the challenges faced in developing the NLP processor.
    • Research report on Bengla lexicon 

      Hayder, Kamrul (BRAC University, 2007)
      We report on the compilation of a comprehensive Bangla word list lexicon. The current list contains 80,969 words from the Standard Chalita Bhasha (SCB) vocabulary. The word list is currently being used by the BRAC University ...