Browsing Conference Papers (Centre for Research on Bangla Language Processing) by Subject "Bangla language processing"

Analysis of and observations from a Bangla News Corpus

Majumder, Khair Md. Yasir Arafat (BRAC University, 2006)

In this paper we present the compilation methodology and some statistical analysis on a Bangla news corpus-“Prothom-Alo”, which is the first of its kind for Bangla. We compare some of the statistics with the CIIL Bangla ...

Bangla text input and rendering supports for short message sevice on Mobile devices

Rownok, Tofazzal; Islam, Md. Zahurul; Khan, Mumit (BRAC University, 2006)

Technology is the most important thing that involve in our everyday life. It is involving in almost every aspect of life like communication, work, shopping, recreation etc. Communication through mobile devices is the most ...

Example based English-Bengali machine translation using wordnet

Salm, Khan Md. Anwarus Salam; Khan, Mumit; Nishino, Tetsuro (BRAC University, 2009)

In this paper we propose an architecture of English-Bengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using ...

A high performance domain specific OCR for Bangla script

Hasnat, Md. Abul; Habib, S. M. Murtoza; Khan, Mumit (BRAC University, 2007)

Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR ...

Infrastructure for Bangla information retrieval in context of ICT for development

Haque, Nafid; Ali, M Hammad; Abduallah, Matin Saad (BRAC University, 2006)

In this paper, we talk about developing a search engine and information retrieval system for Bangla. Current work done in this area assumes the use of a particular type of encoding or the availability of particular facilities ...

Integrating Bangla script recognition support in tesseract OCR

Hasnat, Md. Abul; Chowdhury, Muttakinur Rahman; Khan, Mumit (BRAC University, 2009)

Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract ...

Isolated and continuous bangla speech recognition: implementation, performance and application perspective

Hasnat, Md. Abul; Mowla, Jabir; Khan, Mumit (BRAC University, 2007)

Research on automatic speech recognition has been approach progressively since 1930 and the major advances are since 1980 with the introduction of the statistical modeling of speech with the key technology Hidden Markov ...

Morphological analysis of inflecting compound words in Bangla

Dasgupta, Sajib; Khan, Naira; Sarkar, Asif Iqbal; Pavel, Dewan Shahriar Hossain; Khan, Mumit (BRAC University, 2005)

The addition of inflectional suffixes in Bangla com-pound words is fairly complex. A compound is a word that is formed by two or more different words acting as a single entity. One of the key distinguishing features of ...

Morphological parsing of Bangla wods using PC-KIMMO

Dasgupta, Sajib; Khan,Mumit (BRAC University, 2004)

This paper describes Morphological parsing of Bangla words using PC-KIMMO, based on Kimmo Koskeniemil's model of two-level Morphology. There are three sections in the PC-KIMMO: rules section lexicon section and grammar ...