Now showing items 1-20 of 63

    • Morphological parsing of Bangla wods using PC-KIMMO 

      Dasgupta, Sajib; Khan,Mumit (BRAC University, 2004)
      This paper describes Morphological parsing of Bangla words using PC-KIMMO, based on Kimmo Koskeniemil's model of two-level Morphology. There are three sections in the PC-KIMMO: rules section lexicon section and grammar ...
    • Feature unification for morphological parsing in Bangla 

      Dasgupta, Sajib; Khan, Dr. Mumit (BRAC University, 2004)
      This paper describes a Feature Unification Based Word Grammar model for the morphological parsing of Bangla words. While normal morphological parsing strategy is adequate to decompose a word into morphemes, it is not able ...
    • A Bangla phonetic encoding for better spelling suggesions 

      Naushad UzZaman; Khan, Mumit (BRAC University, 2004)
      We present a phonetic encoding for Bangla that can be used by spelling checkers to provide better suggestions for misspelled words. The encoding is based on the Soundex algorithm, modified to match Bangla phonetics. We ...
    • T12: an advanced text input system with phonetic support for mobile devices 

      Naushad UzZaman, Khan Mumit (BRAC University, 2005)
      The popular T9 text input system for mobile devices uses a predictive dictionary-based disambiguation scheme, enabling a user to type in commonly-used words with low overhead. We present a new text input system called ...
    • A double metaphone encoding for Bangla and its application in spelling checker 

      Naushad UzZaman; Khan, Mumit (BRAC University, 2005)
      We present a Double Metaphone encoding for Bangla that can be used by spelling checkers to improve the quality of suggestions for misspelled words. The complex rules of Bangla spelling present a significant challenge in ...
    • A double metaphone encoding for approximate name searching and matching in Bangla 

      Naushad UzZaman,; Khan, Mumit (BRAC University, 2005)
      Almost any word can be a Bangali name, and the name in turn is often spelled in many different ways, all of which are considered correct and interchangeable. The reason for the spelling complication is two-fold: (1) there ...
    • Morphological analysis of inflecting compound words in Bangla 

      Dasgupta, Sajib; Khan, Naira; Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2005)
      The addition of inflectional suffixes in Bangla com-pound words is fairly complex. A compound is a word that is formed by two or more different words acting as a single entity. One of the key distinguishing features of ...
    • Teaching compiler development to undergraduates using a template based approach 

      Islam, Md Zahurul; Khan, Mumit (BRAC University, 2005)
      Compiler Design remains one of the most dreaded courses in any undergraduate Computer Science curriculum, due in part to the complexity and the breadth of the material covered in a typical 14-15 week semester time frame. ...
    • Bangla text input and rendering supports for short message sevice on Mobile devices 

      Rownok, Tofazzal; Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
      Technology is the most important thing that involve in our everyday life. It is involving in almost every aspect of life like communication, work, shopping, recreation etc. Communication through mobile devices is the most ...
    • Analysis of N-Gram based text categorization for Bangla in a newspaper 

      Mansur, Munirul; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      In this paper, we study the outcome of using ngram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology we used one year Prothom-Alo news corpus. Our results show that n-grams of ...
    • Analysis of and observations from a Bangla News Corpus 

      Majumder, Khair Md. Yasir Arafat (BRAC University, 2006)
      In this paper we present the compilation methodology and some statistical analysis on a Bangla news corpus-“Prothom-Alo”, which is the first of its kind for Bangla. We compare some of the statistics with the CIIL Bangla ...
    • A proposed automated extraction procedure of Bangla text for corpus creation in unicode 

      Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of automated Bangla corpus creation, which will significantly help the processes of lexicon development, morphological analysis, automatic parts of speech detection and automatic grammar ...
    • Infrastructure for Bangla information retrieval in context of ICT for development 

      Haque, Nafid; Ali, M Hammad; Abduallah, Matin Saad (BRAC University, 2006)
      In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...
    • A comprehensive roman (English)-to-Bangla transliteration scheme 

      Naushad UzZaman,; Zaheen, Arnab; Khan, Mumit (BRAC University, 2006)
      A transliteration scheme from Roman (English) to Bangla can help increase the use of Bangla in essential and diverse computing areas such as word processing, Internet and mobile communication and information query and ...
    • Rule based automated pronunciation generator 

      Mosaddeque, Ayesha Binte; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      This paper presents a rule based ronunciation generator for Bangla words. It takes a word and finds the pronunciations for the graphemes of the word. A grapheme is a unit in writing that cannot be analyzed into smaller ...
    • JKimmo: A Multilingual computational mophology frame work for PC-KIMMO 

      Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
      Morphological analysis is of fundamental interest in computational linguistics and language processing. While there are established morphological analyzers for mostly Western and a few other languages using localized ...
    • Skew angle detection of bangla script using radon transform 

      Habib, S. M. Murtoza; Noor, Nawsher Ahamed; Khan, Mumit (BRAC University, 2006)
      Skew angle detection and correction an integral part of any OCR system. Without proper skew correction, the performance of an OCR will simply not be acceptable for most scanned images. We propose an innovative method for ...
    • GIS Based Real Time Traveler Information System: An Efficient Approach to Minimize Travel Time Using Available Media 

      Hasnat, Md. Abul; Haque, Mohammad Mahmudul; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of building a Bangla lexicon with a collaborative effort through stand alone application and web based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
    • History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? 

      Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However in Bangla it appears ...
    • A comprehensive Bangla spelling checker 

      Naushad UzZaman,; Khan, Mumit (BRAC University, 2006)
      We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling presents a significant challenge in producing suggestions for a ...