Recent Submissions

  • Detecting flames and insults in text 

    Mahmud, Altaf; Ahmed, Kazi Zubair; Khan, Mumit (BRAC University, 2008-12)
    While the internet has become the leading source of information, it is also become the medium for flames, insults and other forms of abusive language, which add nothing to the quality of information available. A human ...
  • Text to speech for Bangla language using festival 

    Alam, Firoj; Nath, Promila Kanti; Khan, Mumit (BRAC University, 2007)
    In this paper, we present a Text to Speech (TTS) synthesis system for Bangla language using the open-source Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing ...
  • Collaborative lexicon development for Bangla 

    Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Shah, Faisal Muhammad; Khan, Mumit (BRAC University, 2006)
    This paper addresses the issue of building a Bangla lexicon with a collaborative effort through stand alone application and web based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
  • A proposed automated extraction procedure of Bangla text for corpus creation in unicode 

    Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Khan, Mumit (BRAC University, 2006)
    This paper addresses the issue of automated Bangla corpus creation, which will significantly help the processes of lexicon development, morphological analysis, automatic parts of speech detection and automatic grammar ...
  • A comprehensive roman (English)-to-Bangla transliteration scheme 

    Naushad UzZaman,; Zaheen, Arnab; Khan, Mumit (BRAC University, 2006)
    A transliteration scheme from Roman (English) to Bangla can help increase the use of Bangla in essential and diverse computing areas such as word processing, Internet and mobile communication and information query and ...
  • Acoustic analysis of Bangla consonants 

    Alam, Firoj; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
    This paper describes the acoustic characteristics of Bangla consonants, obtained by analyzing the recordings of male and female voices. First, the duration of each phoneme was identified by averaging both the male and ...
  • A comprehensive Bangla spelling checker 

    Naushad UzZaman,; Khan, Mumit (BRAC University, 2006)
    We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling presents a significant challenge in producing suggestions for a ...
  • Segmentation free Bangla OCR using HMM: Training and recognition 

    Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
    The wide area of the application of HMM is in Speech Recognition where each spoken word is considered as a single unit to be recognized from the trained word network. Using this concept some research has been done for ...
  • BWN- A software platform for developing Bengali wordnet 

    Khan, Mumit; Faruqe, Farhana (BRAC University, 2008)
    Advanced Natural Language Processing (NLP) applications are increasingly dependent on the availability of linguistic resources, ranging from digital lexica to rich tagged and annotated corpora. While these resources are ...
  • A decentralised approach to information retrieval for a developing country like Bangladesh 

    Ali, Hammad; Haque, Nafid (BRAC University, 2007)
    In this paper, we talk about a decentralised information retrieval system which would be suitable for the developing countries that face the problem of limited bandwidth. In this paper we came up with an implementation ...
  • Example based English-Bengali machine translation using wordnet 

    Salm, Khan Md. Anwarus Salam; Khan, Mumit; Nishino, Tetsuro (BRAC University, 2009)
    In this paper we propose an architecture of English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...
  • Development of annotated Bangla speech corpora 

    Alam, Firoj; Habib, S. M. Murtoza; Sultana, Dil Afroza; Khan, Mumit (BRAC University, 2010)
    This paper describes the development procedure of three different Bangla read speech corpora which can be used for phonetic research and developing speech applications. Several criteria were maintained in the corpora ...
  • Integrating Bangla script recognition support in tesseract OCR 

    Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)
    Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
  • Rule based automated pronunciation generator 

    Mosaddeque, Ayesha Binte; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
    This paper presents a rule based ronunciation generator for Bangla words. It takes a word and finds the pronunciations for the graphemes of the word. A grapheme is a unit in writing that cannot be analyzed into smaller ...
  • N-gram based statistical grammar checker for Bangla and English 

    Alam, Md. Jahangir; UzZaman, Naushad; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2006)
    This paper describes a statistical grammar checker, which considers the n-gram based analysis of words and POS tags to decide whether the sentence is grammatically correct or not. We employed this technique for both Bangla ...
  • Minimally segmenting performance Bangla optical character recognition using Kohonen network 

    Shatil, Adnan Mohammad Shoeb; Khan, Mumit (BRAC University, 2006)
    This paper presents a method to use Kohonen neural network based classifier in Bangla Optical Character Recognition (OCR) system, providing much higher performance than the traditional neural network based ones. It describes ...
  • JKimmo: A Multilingual computational mophology frame work for PC-KIMMO 

    Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
    Morphological analysis is of fundamental interest in computational linguistics and language processing. While there are established morphological analyzers for mostly Western and a few other languages using localized ...
  • Infrastructure for Bangla information retrieval in context of ICT for development 

    Haque, Nafid; Ali, M Hammad; Abduallah, Matin Saad (BRAC University, 2006)
    In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...
  • History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? 

    Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
    This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However in Bangla it appears ...
  • Developing a computational grammar for Bengali using the HPSG formalism 

    Khan, Naira; Khan, Mumit (BRAC University, 2006)
    This paper describes the first phase of developing a computational grammar for Bengali using the Head- Driven Phrase Structure Grammar (HPSG) formalism. The HPSG formalism is a highly developed framework that combines ...

View more