Search
Now showing items 11-17 of 17
Research report on parallel corpus translation challenges and processes
(BRAC University, 2007-10-08)
We describe some of the challenges in developing English-Bangla parallel corpora, and look some of the established processes used by other language corpora for solutions to some of these challenges.
A light weight stemmer for Bengali and its use in spelling checker
(BRAC University, 2007)
Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval ...
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
(BRAC University, 2007)
Part-of-Speech (POS) Tagging is a process that attaches each word in a sentence with a suitable tag from a given set of tags. POS Tagging is important in various areas of Natural Language Processing. Different methods of ...
Research report on Bengla tagged lexicon
(BRAC University, 2007)
This report describes the design and
implementation of a Bangla tagged lexicon. The resulting lexicon contains 144,770 entries, out of which 58,145 are verbs. The tags used in the lexicon are reproduced here from a previous ...
Research report on Bengla tagset
(BRAC University, 2007)
This report describes the design of a POS tagset for Bangla, based on the Penn Treebank design. The resulting tagset contains 53 morpho-syntactic tags.
Isolated and continuous bangla speech recognition: implementation, performance and application perspective
(BRAC University, 2007)
Research on automatic speech recognition has been approach progressively since 1930 and the major advances are since 1980 with
the introduction of the statistical modeling of speech with the key technology Hidden Markov ...
A high performance domain specific OCR for Bangla script
(BRAC University, 2007)
Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR ...