    • Infrastructure for Bangla information retrieval in context of ICT for development 

      Haque, Nafid; Ali, M Hammad; Abduallah, Matin Saad (BRAC University, 2006)
      In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...
    • History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? 

      Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
      This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However, in Bangla it appears ...
    • Bangla text input and rendering supports for short message service on mobile devices 

      Rownok, Tofazzal; Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
      Technology is one of the most important things in our everyday life. It is involved in almost every aspect of life, such as communication, work, shopping and recreation. Communication through mobile devices is the most ...
    • A proposed automated extraction procedure of Bangla text for corpus creation in Unicode 

      Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Khan, Mumit (BRAC University, 2006)
      This paper addresses the issue of automated Bangla corpus creation, which will significantly help the processes of lexicon development, morphological analysis, automatic parts of speech detection and automatic grammar ...
    • Building a foundation of HPSG-based treebank on Bangla language 

      Mahmud, Altaf; Khan, Mumit (BRAC University, 2007)
      Nowadays, the importance of a large annotated corpus for NLP researchers is widely known. In this paper, we describe an initial phase of developing a linguistically annotated corpus for the non-configurational ‘Bangla’ language. ...
    • Segmentation free Bangla OCR using HMM: Training and recognition 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
      HMMs are most widely applied in speech recognition, where each spoken word is considered a single unit to be recognized from the trained word network. Using this concept some research has been done for ...
    • A light weight stemmer for Bengali and its use in spelling checker 

      Islam, Md. Zahurul; Uddin, Md. Nizam; Khan, Mumit (BRAC University, 2007)
      Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
    • Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages 

      Hasan, Muhammad Fahim; UzZaman, Naushad; Khan, Mumit (BRAC University, 2007)
      Part-of-Speech (POS) Tagging is a process that attaches each word in a sentence with a suitable tag from a given set of tags. POS Tagging is important in various areas of Natural Language Processing. Different methods of ...
    • Localization bridging the digital divide 

      Haque, Nafid (BRAC University, 2007)
      This paper proposes a way to provide equitable access to education and technology for the people of the underdeveloped and developing countries of the world. Here a concept called localization has been ...
    • Isolated and continuous Bangla speech recognition: implementation, performance and application perspective 

      Hasnat, Md. Abul; Mowla, Jabir; Khan, Mumit (BRAC University, 2007)
      Research on automatic speech recognition has progressed steadily since 1930, and the major advances have come since 1980 with the introduction of the statistical modeling of speech with the key technology Hidden Markov ...
    • A high performance domain specific OCR for Bangla script 

      Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
      Research on recognizing Bengali script began in the mid-1980s. A variety of techniques have been applied and their performance examined. In this paper we present a high performance domain specific OCR ...
    • A decentralised approach to information retrieval for a developing country like Bangladesh 

      Ali, Hammad; Haque, Nafid (BRAC University, 2007)
      In this paper, we talk about a decentralised information retrieval system which would be suitable for the developing countries that face the problem of limited bandwidth. In this paper we came up with an implementation ...
    • Text to speech for Bangla language using Festival 

      Alam, Firoj; Nath, Promila Kanti; Khan, Mumit (BRAC University, 2007)
      In this paper, we present a Text to Speech (TTS) synthesis system for Bangla language using the open-source Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing ...
    • BWN- A software platform for developing Bengali wordnet 

      Khan, Mumit; Faruqe, Farhana (BRAC University, 2008)
      Advanced Natural Language Processing (NLP) applications are increasingly dependent on the availability of linguistic resources, ranging from digital lexica to rich tagged and annotated corpora. While these resources are ...
    • Acoustic analysis of Bangla consonants 

      Alam, Firoj; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
      This paper describes the acoustic characteristics of Bangla consonants, obtained by analyzing the recordings of male and female voices. First, the duration of each phoneme was identified by averaging both the male and ...
    • Detecting flames and insults in text 

      Mahmud, Altaf; Ahmed, Kazi Zubair; Khan, Mumit (BRAC University, 2008-12)
      While the internet has become the leading source of information, it has also become the medium for flames, insults and other forms of abusive language, which add nothing to the quality of information available. A human ...
    • Integrating Bangla script recognition support in Tesseract OCR 

      Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)
      Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
    • Rule based segmentation of lower modifiers in complex Bangla scripts 

      Hasnat, Md. Abul; Khan, Mumit (BRAC University, 2009)
      Segmentation is the most challenging part of Bangla optical character recognition (OCR). To solve the problems of joining errors, several algorithms have been proposed in the literature, with varying degrees of accuracy. ...
    • Example based English-Bengali machine translation using WordNet 

      Salm, Khan Md. Anwarus Salam; Khan, Mumit; Nishino, Tetsuro (BRAC University, 2009)
      In this paper we propose an architecture of English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...
    • Development of annotated Bangla speech corpora 

      Alam, Firoj; Habib, S. M. Murtoza; Sultana, Dil Afroza; Khan, Mumit (BRAC University, 2010)
      This paper describes the development procedure of three different Bangla read speech corpora which can be used for phonetic research and developing speech applications. Several criteria were maintained in the corpora ...