Building a foundation of HPSG-based treebank on Bangla language
(BRAC University, 2007)
Nowadays, the importance of a large annotated corpus for NLP researchers is widely known. In this paper, we describe an initial phase of developing a linguistically annotated corpus for the non-configurational Bangla language. ...
Text to speech for Bangla language using festival
(BRAC University, 2007)
In this paper, we present a Text to Speech (TTS) synthesis system for the Bangla language using the open-source Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing ...
Segmentation free Bangla OCR using HMM: Training and recognition
(BRAC University, 2007)
HMMs are widely applied in speech recognition, where each spoken word is treated as a single unit to be recognized from the trained word network. Using this concept, some research has been done for ...
Research report on Bangla wordnet development challenges and solutions
(BRAC University, 2007-10-08)
We describe the initial design of Bangla WordNet (BWN), based on the English WordNet 2.2 distribution from Princeton University. Our goal is to develop a 5,000-entry Bangla WordNet over the next two years. At present, we ...
Automatic Bangla corpus creation
(BRAC University, 2007)
This paper addresses the issue of automatic Bangla corpus creation, which will significantly help the processes of Lexicon development, Morphological Analysis, Automatic Parts of Speech Detection and Automatic grammar ...
Research report on Translations of gTLDs and ccTLDs in Bangla
(BRAC University, 2007-10-08)
This report describes the initial translations of gTLDs and ccTLDs in Bengali, along with the challenges faced in creating the translations.
Research report on parallel corpus translation challenges and processes
(BRAC University, 2007-10-08)
We describe some of the challenges in developing English-Bangla parallel corpora, and look at some of the established processes used by other language corpora for solutions to these challenges.
A light weight stemmer for Bengali and its use in spelling checker
(BRAC University, 2007)
Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
(BRAC University, 2007)
Part-of-Speech (POS) Tagging is a process that assigns each word in a sentence a suitable tag from a given set of tags. POS Tagging is important in various areas of Natural Language Processing. Different methods of ...
Research report on Bangla tagged lexicon
(BRAC University, 2007)
This report describes the design and implementation of a Bangla tagged lexicon. The resulting lexicon contains 144,770 entries, out of which 58,145 are verbs. The tags used in the lexicon are reproduced here from a previous ...