    • Acoustic analysis of Bangla vowel inventory 

      Alam, Firoj; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
      This paper describes the acoustic characteristics of Bangla vowels, obtained by analyzing the recordings of male and female voices. First, the duration of each phoneme was identified by averaging both the male and female ...
    • Analysis of and observations from a Bangla News Corpus 

      Majumder, Khair Md. Yasir Arafat (BRAC University, 2006)
      In this paper we present the compilation methodology and some statistical analysis of a Bangla news corpus, “Prothom-Alo”, which is the first of its kind for Bangla. We compare some of the statistics with the CIIL Bangla ...
    • Bangla text input and rendering support for short message service on mobile devices 

      Rownok, Tofazzal; Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
      Technology is one of the most important things in our everyday life. It is involved in almost every aspect of life, such as communication, work, shopping and recreation. Communication through mobile devices is the most ...
    • A comprehensive Bangla spelling checker 

      Naushad UzZaman; Khan, Mumit (BRAC University, 2006)
      We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling present a significant challenge in producing suggestions for a ...
    • Example based English-Bengali machine translation using WordNet 

      Salm, Khan Md. Anwarus Salam; Khan, Mumit; Nishino, Tetsuro (BRAC University, 2009)
      In this paper we propose an architecture of English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...
    • A high performance domain specific OCR for Bangla script 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
      Research on recognizing Bengali script began in the mid-1980s. A variety of techniques have been applied and their performance examined. In this paper we present a high performance domain specific OCR ...
    • A high performance domain specific OCR for Bangla script 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
      Research on recognizing Bengali script began in the mid-1980s. A variety of techniques have been applied and their performance examined. In this paper we present a high performance domain ...
    • Infrastructure for Bangla information retrieval in context of ICT for development 

      Haque, Nafid; Ali, M Hammad; Abduallah, Matin Saad (BRAC University, 2006)
      In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...
    • Integrating Bangla computing support in OpenOffice.org 

      Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2007)
      This paper addresses the issues of integrating Bangla computing support into the OpenOffice.org office suite and, in the process, identifies and describes the different problems associated with OpenOffice.org and that should ...
    • Integrating Bangla script recognition support in Tesseract OCR 

      Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)
      Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
    • Isolated and continuous Bangla speech recognition: implementation, performance and application perspective 

      Hasnat, Md. Abul; Mowla, Jabir; Khan, Mumit (BRAC University, 2007)
      Research on automatic speech recognition has progressed steadily since 1930, with the major advances coming since 1980 with the introduction of statistical modeling of speech using the key technology Hidden Markov ...
    • Morphological analysis of inflecting compound words in Bangla 

      Dasgupta, Sajib; Khan, Naira; Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2005)
      The addition of inflectional suffixes in Bangla compound words is fairly complex. A compound is a word that is formed by two or more different words acting as a single entity. One of the key distinguishing features of ...
    • Morphological parsing of Bangla words using PC-KIMMO 

      Dasgupta, Sajib; Khan, Mumit (BRAC University, 2004)
      This paper describes morphological parsing of Bangla words using PC-KIMMO, based on Kimmo Koskenniemi's model of two-level morphology. There are three sections in PC-KIMMO: rules section, lexicon section and grammar ...
    • Optical character recognition for Bangla documents using HMM 

      Monjel, Md. Sheemam; Khan, Mumit (BRAC University, 2007)
      In this paper we describe an OCR program for Bangla documents. The program uses HMM for the recognition process. A description of the full OCR program is too large to present here, so we have given emphasis on ...
    • Research report on Bangla optical character recognition using Kohonen network 

      Shatil, Adnan Md. Shoeb (BRAC University, 2007)
      This report discusses the theory and implementation of an Optical Character Recognition (OCR) system for Bangla. The principal idea is to convert images of text documents such as those obtained from scanning a document into ...
    • Research report on Bangla OCR training and testing methods 

      Hasnat, Md. Abul (BRAC University, 2007)
      In this paper we present the training and recognition mechanism of a Hidden Markov Model (HMM) based multi-font Optical Character Recognition (OCR) system for Bengali characters. In our approach, the central idea is to ...
    • Research report on Bangla tagged lexicon 

      Hayder, Kamrul; Islam, Md Zahurul; Khan, Mumit (BRAC University, 2007)
      This report describes the design and implementation of a Bangla tagged lexicon. The resulting lexicon contains 144,770 entries, out of which 58,145 are verbs. The tags used in the lexicon are reproduced here from a previous ...
    • Research report on Bangla tagset 

      Mahmud, Altaf; Khan, Mumit (BRAC University, 2007)
      This report describes the design of a POS tagset for Bangla, based on the Penn Treebank design. The resulting tagset contains 53 morpho-syntactic tags.
    • Research report on Bangla verb and noun morphological analysis 

      Islam, Md. Zahurul (BRAC University, 2007)
      This report describes the inflectional morphology of Bangla verbs and nouns, together with the rules, lexicons and grammar for Bangla morphological analysis.
    • Research report on parallel corpus translation challenges and processes 

      Khan, Mumit (BRAC University, 2007-10-08)
      We describe some of the challenges in developing English-Bangla parallel corpora, and look at some of the established processes used by other language corpora for solutions to some of these challenges.