BRAC University Institutional Repository

Browsing Conference Papers (Centre for Research on Bangla Language Processing) by Author "Khan, Mumit"

  • Alam, Firoj; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2008)
    This paper describes the acoustic characteristics of Bangla consonants, obtained by analyzing the recordings of male and female voices. First, the duration of each phoneme was identified by averaging both the male and ...
  • Mansur, Munirul; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
    In this paper, we study the outcome of using an n-gram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology, we used a one-year Prothom-Alo news corpus. Our results show that n-grams of ...
  • Rownok, Tofazzal; Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)
    Technology is the most important thing involved in our everyday life. It is involved in almost every aspect of life, such as communication, work, shopping, and recreation. Communication through mobile devices is the most ...
  • Mahmud, Altaf; Khan, Mumit (BRAC University, 2007)
    Nowadays, the importance of a large annotated corpus for NLP researchers is widely known. In this paper, we describe an initial phase of developing a linguistically annotated corpus for the non-configurational ‘Bangla’ language. ...
  • Khan, Mumit; Faruqe, Farhana (BRAC University, 2008)
    Advanced Natural Language Processing (NLP) applications are increasingly dependent on the availability of linguistic resources, ranging from digital lexica to rich tagged and annotated corpora. While these resources are ...
  • Pavel, Dewan Shahriar Hossain; Sarkar, Asif Iqbal; Shah, Faisal Muhammad; Khan, Mumit (BRAC University, 2006)
    This paper addresses the issue of building a Bangla lexicon with a collaborative effort through a stand-alone application and a web-based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
  • Hasan, Fahim Muhammad; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
    There are different approaches to the problem of assigning a part-of-speech tag to each word of a text, which is known as Part-Of-Speech (POS) tagging. In this paper we compare the performance of a few POS tagging ...
  • Hasan, Muhammad Fahim; UzZaman, Naushad; Khan, Mumit (BRAC University, 2007)
    Part-of-Speech (POS) Tagging is a process that attaches a suitable tag from a given set of tags to each word in a sentence. POS Tagging is important in various areas of Natural Language Processing. Different methods of ...
  • UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
    We present a comprehensive Bangla spelling checker that improves the quality of suggestions for misspelled words. The complex rules for Bangla spelling present a significant challenge in producing suggestions for a ...
  • UzZaman, Naushad; Zaheen, Arnab; Khan, Mumit (BRAC University, 2006)
    A transliteration scheme from Roman (English) to Bangla can help increase the use of Bangla in essential and diverse computing areas such as word processing, Internet and mobile communication and information query and ...
  • Mahmud, Altaf; Ahmed, Kazi Zubair; Khan, Mumit (BRAC University, 2008-12)
    While the internet has become the leading source of information, it has also become the medium for flames, insults and other forms of abusive language, which add nothing to the quality of information available. A human ...
  • Khan, Naira; Khan, Mumit (BRAC University, 2006)
    This paper describes the first phase of developing a computational grammar for Bengali using the Head-Driven Phrase Structure Grammar (HPSG) formalism. The HPSG formalism is a highly developed framework that combines ...
  • Alam, Firoj; Habib, S. M. Murtoza; Sultana, Dil Afroza; Khan, Mumit (BRAC University, 2010)
    This paper describes the development procedure of three different Bangla read speech corpora which can be used for phonetic research and developing speech applications. Several criteria were maintained in the corpora ...
  • UzZaman, Naushad; Khan, Mumit (BRAC University, 2005)
    Almost any word can be a Bangali name, and the name in turn is often spelled in many different ways, all of which are considered correct and interchangeable. The reason for the spelling complication is two-fold: (1) there ...
  • UzZaman, Naushad; Khan, Mumit (BRAC University, 2005)
    We present a Double Metaphone encoding for Bangla that can be used by spelling checkers to improve the quality of suggestions for misspelled words. The complex rules of Bangla spelling present a significant challenge in ...
  • Salam, Khan Md. Anwarus; Khan, Mumit; Nishino, Tetsuro (BRAC University, 2009)
    In this paper we propose an architecture of English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...
  • Hasnat, Md. Abul; Haque, Mohammad Mahmudul; Khan, Mumit (BRAC University, 2006)
    This paper addresses the issue of building a Bangla lexicon with a collaborative effort through a stand-alone application and a web-based interface. The words in the lexicon will be annotated with a combination of tags addressing ...
  • Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)
    Research on recognizing Bengali script has been under way since the mid-1980s. A variety of different techniques have been applied and their performance examined. In this paper we present a high-performance, domain-specific OCR ...
  • Khan, Naira; Habib, Md. Tarek; Alam, Md. Jahangir; Rahman, Rajib; UzZaman, Naushad; Khan, Mumit (BRAC University, 2006)
    This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However, in Bangla it appears ...
  • Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)
    Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...
