Now showing items 21-40 of 63

    • Developing a computational grammar for Bengali using the HPSG formalism 

      Khan, Naira; Khan, Mumit (BRAC University, 2006)
      This paper describes the first phase of developing a computational grammar for Bengali using the Head- Driven Phrase Structure Grammar (HPSG) formalism. The HPSG formalism is a highly developed framework that combines ...
    • N-gram based statistical grammar checker for Bangla and English 

      Alam, Md. Jahangir; UzZaman, Naushad; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2006)
      This paper describes a statistical grammar checker, which considers the n-gram based analysis of words and POS tags to decide whether the sentence is grammatically correct or not. We employed this technique for both Bangla ...
    • Collaborative lexicon development for Bangla 

      Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Shah, Faisal Muhammad; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of building a Bangla lexicon with a collaborative effort through stand alone application and web based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
    • Comparion of different POS tagging technique (N-Gram, HMM and Brill's tagger) for Bangla 

      Hasan, Fahim Muhammad; Naushad UzZaman; Khan, Mumit (BRAC University, 2006)
      There are different approaches to the problem of assigning each word of a text with a parts-of-speech tag, which is known as Part-Of-Speech (POS) tagging. In this paper we compare the performance of a few POS tagging ...
    • Minimally segmenting performance Bangla optical character recognition using Kohonen network 

      Shatil, Adnan Mohammad Shoeb; Khan, Mumit (BRAC University, 2006)
      This paper presents a method to use Kohonen neural network based classifier in Bangla Optical Character Recognition (OCR) system, providing much higher performance than the traditional neural network based ones. It describes ...
    • Skew angle detection of bangla script using radon transform 

      Habib, S. M. Murtoza; Noor, Nawsher Ahamed; Khan, Mumit (BRAC University, 2006)
      Skew angle detection and correction an integral part of any OCR system. Without proper skew correction, the performance of an OCR will simply not be acceptable for most scanned images. We propose an innovative method for ...
    • A comprehensive Bangla spelling checker 

      Naushad UzZaman; Khan, Mumit (BRAC University, 2006)
      We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling presents a significant challenge in producing suggestions for a ...
    • Research report on Bengla lexicon 

      Hayder, Kamrul (BRAC University, 2007)
      We report on the compilation of a comprehensive Bangla word list lexicon. The current list contains 80,969 words from the Standard Chalita Bhasha (SCB) vocabulary. The word list is currently being used by the BRAC University ...
    • Optical character recognition for Bangla documents using HMM 

      Monjel, Md. Sheemam; Khan, Mumit (BRAC University, 2007)
      In this paper we have described an OCR program made for Bangla documents. This program uses HMM for the recognition process. The description of full OCR program is too large to present here. So, we have given emphasis on ...
    • Integrating Bangla computing support in openoffice.org 

      Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2007)
      This paper addresses the issues of Integrating Bangla Computing support for OpenOffice.org office suite and in the process, Identifies and describes the different problems associated with OpenOffice.org and that should ...
    • A survey on script segmentation for Bangla OCR 

      Abduallah, Arif Billah Al-Mahmud; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
      Script segmentation is an important primary task for any Optical Character Recognition (OCR) software. Especially, in case of off-line OCR for printed character, it has more importance. Through script segmentation a big ...
    • Building a foundation of HPSG-based treebank on Bangla language 

      Mahmud, Altaf; Khan, Mumit (BRAC University, 2007)
      Now a day, the importance of a large annotated corpus for NLP researchers is widely known. In this paper, we describe an initial phase of developing a linguistically annotated corpus for non-configurational ‘Bangla’ language. ...
    • Research report on Bangla optical character recognition using Kohonen network 

      Shatil, Adnan Md. Shoeb (BRAC University, 2007)
      This report discusses the theory and implementation of an Optical Character Recognition (OCR) for Bangla. The principal idea is to convert images of text documents such as those obtained from scanning a document into ...
    • Text to speech for Bangla language using festival 

      Alam, Firoj; Nath, Promila Kanti; Khan, Mumit (BRAC University, 2007)
      In this paper, we present a Text to Speech (TTS) synthesis system for Bangla language using the opensource Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing ...
    • Segmentation free Bangla OCR using HMM: Training and recognition 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
      The wide area of the application of HMM is in Speech Recognition where each spoken word is considered as a single unit to be recognized from the trained word network. Using this concept some research has been done for ...
    • Automatic Bangla corpus creation 

      Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2007)
      This paper addresses the issue of automatic Bangla corpus creation, which will significantly help the processes of Lexicon development, Morphological Analysis, Automatic Parts of Speech Detection and Automatic grammar ...
    • Text to speech for Bangla language using festival 

      Alam, Firoj; Nath, Promila Kanti; Khan, Mumit (BRAC University, 2007)
      In this paper, we present a Text to Speech (TTS) synthesis system for Bangla language using the open-source Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing ...
    • Research report on Bengla OCR training and testing methods 

      Hasnat, Md. Abul (BRAC University, 2007)
      In this paper we present the training and recognition mechanism of a Hidden Markov Model (HMM) based multi-font Optical Character Recognition (OCR) system for Bengali character. In our approach, the central idea is to ...
    • A light weight stemmer for Bengali and its use in spelling checker 

      Islam, Md. Zahurul; Uddin, Md. Nizam; Khan, Mumit (BRAC University, 2007)
      Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
    • Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages 

      Hasan, Muhammad Fahim; Naushad UzZaman; Khan, Mumit (BRAC University, 2007)
      Part-of-Speech (POS) Tagging is a process that attaches each word in a sentence with a suitable tag from a given set of tags. POS Tagging is important in various areas of Natural Language Processing. Different methods of ...