BRAC University Institutional Repository

Browsing Conference Papers (Centre for Research on Bangla Language Processing) by Issue Date

  • Naushad UzZaman; Khan, Mumit (BRAC University, 2006)
    We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling present a significant challenge in producing suggestions for a ...
  • Naushad UzZaman; Zaheen, Arnab; Khan, Mumit (BRAC University, 2006)
    A transliteration scheme from Roman (English) to Bangla can help increase the use of Bangla in essential and diverse computing areas such as word processing, Internet and mobile communication and information query and ...
  • Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Khan, Mumit (BRAC University, 2006)
    This paper addresses the issue of automated Bangla corpus creation, which will significantly help the processes of lexicon development, morphological analysis, automatic parts of speech detection and automatic grammar ...
  • Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Shah, Faisal Muhammad; Khan, Mumit (BRAC University, 2006)
    This paper addresses the issue of building a Bangla lexicon through a collaborative effort, using a stand-alone application and a web-based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
  • Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
    HMMs are most widely applied in speech recognition, where each spoken word is considered as a single unit to be recognized from the trained word network. Using this concept some research has been done for ...
  • Islam, Md. Zahurul; Uddin, Md. Nizam; Khan, Mumit (BRAC University, 2007)
    Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
  • Mahmud, Altaf; Khan, Mumit (BRAC University, 2007)
    Nowadays, the importance of a large annotated corpus for NLP researchers is widely known. In this paper, we describe an initial phase of developing a linguistically annotated corpus for the non-configurational ‘Bangla’ language. ...
  • Hasan, Muhammad Fahim; Naushad UzZaman; Khan, Mumit (BRAC University, 2007)
    Part-of-Speech (POS) Tagging is a process that attaches each word in a sentence with a suitable tag from a given set of tags. POS Tagging is important in various areas of Natural Language Processing. Different methods of ...
  • Hasnat, Md. Abul; Mowla, Jabir; Khan, Mumit (BRAC University, 2007)
    Research on automatic speech recognition has progressed steadily since 1930, and the major advances have come since 1980 with the introduction of the statistical modeling of speech with the key technology Hidden Markov ...
  • Haque, Nafid (BRAC University, 2007)
    This paper proposes a way to provide equitable access to education and technology among the people of the underdeveloped and developing countries of the world. Here a concept called localization has been ...
  • Ali, Hammad; Haque, Nafid (BRAC University, 2007)
    In this paper, we describe a decentralised information retrieval system suitable for developing countries that face the problem of limited bandwidth. We came up with an implementation ...
  • Alam, Firoj; Nath, Promila Kanti; Khan, Dr Mumit (Center for research on Bangla language processing (CRBLP), BRAC University, 2007)
    In this paper, we present a Text to Speech (TTS) synthesis system for the Bangla language using the open-source Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing ...
  • Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
    Research on recognizing Bengali script began in the mid-1980s. A variety of techniques have been applied and their performance examined. In this paper we present a high performance domain specific OCR ...
  • Khan, Mumit; Faruqe, Farhana (BRAC University, 2008)
    Advanced Natural Language Processing (NLP) applications are increasingly dependent on the availability of linguistic resources, ranging from digital lexica to rich tagged and annotated corpora. While these resources are ...
  • Alam, Firoj; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
    This paper describes the acoustic characteristics of Bangla consonants, obtained by analyzing the recordings of male and female voices. First, the duration of each phoneme was identified by averaging both the male and ...
  • Mahmud, Altaf; Ahmed, Kazi Zubair; Khan, Mumit (BRAC University, 2008-12)
    While the internet has become the leading source of information, it has also become the medium for flames, insults and other forms of abusive language, which add nothing to the quality of information available. A human ...
  • Hasnat, Md. Abul; Khan, Mumit (Center for research on Bangla language processing (CRBLP), 2009)
    Segmentation is the most challenging part of Bangla optical character recognition (OCR). To solve the problems of joining errors, several algorithms have been proposed in the literature, with varying degrees of accuracy. ...
  • Salm, Khan Md. Anwarus Salam; Khan, Mumit; Nishino, Tetsuro (BRAC University, 2009)
    In this paper we propose an architecture for English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...
  • Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)
    Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
  • Alam, Firoj; Habib, S. M. Murtoza; Sultana, Dil Afroza; Khan, Mumit (BRAC University, 2010)
    This paper describes the development procedure of three different Bangla read speech corpora which can be used for phonetic research and for developing speech applications. Several criteria were maintained in the corpora ...
