BRAC University Institutional Repository

Browsing Centre for Research on Bangla Language Processing (CRBLP) by Author "Khan, Mumit"

DSpace/Manakin Repository

Browsing Centre for Research on Bangla Language Processing (CRBLP) by Author "Khan, Mumit"

Sort by: Order: Results:

  • Salm, Khan Md. Anwarus Salam; Khan, Mumit; Nishino, Tetsuro (Center for research on Bangla language processing (CRBLP), BRAC University, 2009)
    In this paper we propose an architecture of English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...
  • Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (Center for research on Bangla language processing (CRBLP), 2007)
    Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR ...
  • Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2008)
    Abstract-Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain ...
  • Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (Center for Research on Bangla Language Processing, BRAC University, 2006)
    This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However in Bangla it appears ...
  • Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
    This paper addresses the issues of Integrating Bangla Computing support for OpenOffice.org office suite and in the process, Identifies and describes the different problems associated with OpenOffice.org and that should ...
  • Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2009)
    Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
  • Hasnat, Md. Abul; Mowla, Jabir; Khan, Mumit (Center for research on Bangla language processing (CRBLP), 2007)
    Research on automatic speech recognition has been approach progressively since 1930 and the major advances are since 1980 with the introduction of the statistical modeling of speech with the key technology Hidden Markov ...
  • Islam, Md. Zahurul; Uddin, Md. Nizam; Khan, Mumit (Center for research on Bangla language processing (CRBLP), 2007)
    Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
  • Shatil, Adnan Mohammad Shoeb; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2006)
    This paper presents a method to use Kohonen neural network based classifier in Bangla Optical Character Recognition (OCR) system, providing much higher performance than the traditional neural network based ones. It describes ...
  • Dasgupta, Sajib; Khan, Naira; Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2005)
    The addition of inflectional suffixes in Bangla com-pound words is fairly complex. A compound is a word that is formed by two or more different words acting as a single entity. One of the key distinguishing features of ...
  • Alam, Md. Jahangir; UzZaman, Naushad; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2006)
    This paper describes a statistical grammar checker, which considers the n-gram based analysis of words and POS tags to decide whether the sentence is grammatically correct or not. We employed this technique for both Bangla ...
  • Monjel, Md. Sheemam; Khan, Mumit (2007)
    In this paper we have described an OCR program made for Bangla documents. This program uses HMM for the recognition process. The description of full OCR program is too large to present here. So, we have given emphasis on ...
  • Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007-10-08)
    We describe the initial design of Bangla WordNet (BWN), based on the English WordNet 2.2 distribution from Princeton University. Our goal is to develop a 5,000 entry Bangla WordNet over the next two years. At present, we ...
  • Hayder, Kamrul; Islam, Md Zahurul; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
    This report describes the design and implementation of a Bangla tagged lexicon. The resulting lexicon contains 144,770 entries, out of which 58,145 are verbs. The tags used in the lexicon are reproduced here from a previous ...
  • Mahmud, Altaf; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
    This report describes the design of a POS tagset for Bangla, based on the Penn Treebank design. The resulting tagset contains 53 morpho-syntactic tags.
  • Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007-10-08)
    We describe some of the challenges in developing English-Bangla parallel corpora, and look some of the established processes used by other language corpora for solutions to some of these challenges.
  • Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007-10-08)
    This report describes the initial translations of gTLDs and ccTLDs in Bengali, along with the challenges faced in creating the translations.
  • Mosaddeque, Ayesha Binte; UzZaman, Naushad; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2006)
    This paper presents a rule based ronunciation generator for Bangla words. It takes a word and finds the pronunciations for the graphemes of the word. A grapheme is a unit in writing that cannot be analyzed into smaller ...
  • Hasnat, Md. Abul; Khan, Mumit (Center for research on Bangla language processing (CRBLP), 2009)
    Segmentation is the most challenging part of Bangla optical character recognition (OCR). To solve the problems of joining errors, several algorithms have been proposed in the literature, with varying degrees of accuracy. ...
  • Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
    The wide area of the application of HMM is in Speech Recognition where each spoken word is considered as a single unit to be recognized from the trained word network. Using this concept some research has been done for ...

Policy Guidelines

Search DSpace


Browse

My Account