Now showing items 1-20 of 40

    • Morphological parsing of Bangla words using PC-KIMMO 

      Dasgupta, Sajib; Khan, Mumit (BRAC University, 2004)
      This paper describes morphological parsing of Bangla words using PC-KIMMO, based on Kimmo Koskenniemi's model of two-level morphology. There are three sections in PC-KIMMO: rules section, lexicon section, and grammar ...
    • Feature unification for morphological parsing in Bangla 

      Dasgupta, Sajib; Khan, Mumit (BRAC University, 2004)
      This paper describes a Feature Unification Based Word Grammar model for the morphological parsing of Bangla words. While a normal morphological parsing strategy is adequate to decompose a word into morphemes, it is not able ...
    • Teaching compiler development to undergraduates using a template based approach 

      Islam, Md Zahurul; Khan, Mumit (BRAC University, 2005)
      Compiler Design remains one of the most dreaded courses in any undergraduate Computer Science curriculum, due in part to the complexity and the breadth of the material covered in a typical 14-15 week semester time frame. ...
    • A double metaphone encoding for approximate name searching and matching in Bangla 

      UzZaman, Naushad; Khan, Mumit (BRAC University, 2005)
      Almost any word can be a Bangali name, and the name in turn is often spelled in many different ways, all of which are considered correct and interchangeable. The reason for the spelling complication is two-fold: (1) there ...
    • A double metaphone encoding for Bangla and its application in spelling checker 

      UzZaman, Naushad; Khan, Mumit (BRAC University, 2005)
      We present a Double Metaphone encoding for Bangla that can be used by spelling checkers to improve the quality of suggestions for misspelled words. The complex rules of Bangla spelling present a significant challenge in ...
    • T12: an advanced text input system with phonetic support for mobile devices 

      UzZaman, Naushad; Khan, Mumit (BRAC University, 2005)
      The popular T9 text input system for mobile devices uses a predictive dictionary-based disambiguation scheme, enabling a user to type in commonly-used words with low overhead. We present a new text input system called ...
    • Morphological analysis of inflecting compound words in Bangla 

      Dasgupta, Sajib; Khan, Naira; Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2005)
      The addition of inflectional suffixes in Bangla compound words is fairly complex. A compound is a word that is formed by two or more different words acting as a single entity. One of the key distinguishing features of ...
    • A comprehensive roman (English)-to-Bangla transliteration scheme 

      UzZaman, Naushad; Zaheen, Arnab; Khan, Mumit (BRAC University, 2006)
      A transliteration scheme from Roman (English) to Bangla can help increase the use of Bangla in essential and diverse computing areas such as word processing, Internet and mobile communication and information query and ...
    • GIS Based Real Time Traveler Information System: An Efficient Approach to Minimize Travel Time Using Available Media 

      Hasnat, Md. Abul; Haque, Mohammad Mahmudul; Khan, Mumit (BRAC University, 2006)
    • Skew angle detection of Bangla script using Radon transform 

      Habib, S. M. Murtoza; Noor, Nawsher Ahamed; Khan, Mumit (BRAC University, 2006)
      Skew angle detection and correction are an integral part of any OCR system. Without proper skew correction, the performance of an OCR will simply not be acceptable for most scanned images. We propose an innovative method for ...
    • A proposed automated extraction procedure of Bangla text for corpus creation in Unicode 

      Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of automated Bangla corpus creation, which will significantly help the processes of lexicon development, morphological analysis, automatic parts of speech detection and automatic grammar ...
    • Collaborative lexicon development for Bangla 

      Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Shah, Faisal Muhammad; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of building a Bangla lexicon with a collaborative effort through a stand-alone application and a web-based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
    • Infrastructure for Bangla information retrieval in context of ICT for development 

      Haque, Nafid; Ali, M Hammad; Abdullah, Matin Saad (BRAC University, 2006)
      In this paper, we discuss the development of a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...
    • A comprehensive Bangla spelling checker 

      UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling present a significant challenge in producing suggestions for a ...
    • N-gram based statistical grammar checker for Bangla and English 

      Alam, Md. Jahangir; UzZaman, Naushad; Khan, Mumit (Center for Research on Bangla Language Processing (CRBLP), BRAC University, 2006)
      This paper describes a statistical grammar checker, which considers the n-gram based analysis of words and POS tags to decide whether the sentence is grammatically correct or not. We employed this technique for both Bangla ...
    • Analysis of and observations from a Bangla News Corpus 

      Majumder, Khair Md. Yasir Arafat (BRAC University, 2006)
      In this paper we present the compilation methodology and some statistical analysis of a Bangla news corpus, “Prothom-Alo”, which is the first of its kind for Bangla. We compare some of the statistics with the CIIL Bangla ...
    • History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? 

      Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However, in Bangla it appears ...
    • Developing a computational grammar for Bengali using the HPSG formalism 

      Khan, Naira; Khan, Mumit (BRAC University, 2006)
      This paper describes the first phase of developing a computational grammar for Bengali using the Head-Driven Phrase Structure Grammar (HPSG) formalism. The HPSG formalism is a highly developed framework that combines ...
    • Rule based automated pronunciation generator 

      Mosaddeque, Ayesha Binte; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      This paper presents a rule-based pronunciation generator for Bangla words. It takes a word and finds the pronunciations for the graphemes of the word. A grapheme is a unit in writing that cannot be analyzed into smaller ...
    • Minimally segmenting high performance Bangla optical character recognition using Kohonen network 

      Shatil, Adnan Mohammad Shoeb; Khan, Mumit (BRAC University, 2006)
      This paper presents a method for using a Kohonen neural network based classifier in a Bangla Optical Character Recognition (OCR) system, providing much higher performance than the traditional neural network based ones. It describes ...