BRAC University Institutional Repository

History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla?

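dc.contributor.author Khan, Naira
dc.contributor.author Habib, Md. Tarek
dc.contributor.author Alam, Md. Jahangir
dc.contributor.author Rahman, Rajib
dc.contributor.author UzZaman, Naushad
dc.contributor.author Khan, Mumit
dc.date.accessioned 2010-10-24T04:28:50Z
dc.date.available 2010-10-24T04:28:50Z
dc.date.issued 2006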
dc.description Includes bibliographical references (page 5).
dc.description.abstract This paper presents a directional advantage in n-gram modeling for Bangla, comparing backward and forward n-gram models. The n-gram analysis in common use is predominantly a forward n-gram. In Bangla, however, a backward n-gram appears to be repeatedly more successful and to yield more grammatical results than a forward n-gram. This paper hypothesizes that the rationale behind this success is the syntactic ordering of constituents in Bangla. Bangla is a head-final, specifier-initial language, as opposed to English, which is head-initial and specifier-initial. Hence in Bangla the head comes after its argument in a phrase. If an n-gram analysis begins with a head and moves backwards, it stretches to the head's own argument, whereas if it moves forwards it will probably pick up the argument of another head. As the probability of occurrence of heads is higher, the probability of depending on a head is also higher, and hence a backward n-gram will probably have a greater chance of yielding grammatical results. We carried out several experiments comparing the two directions in different applications, and the results showed an advantage in the backward direction. This should prove a useful linguistic insight for n-gram based analysis, depending upon variations of constituent analysis. en_US
dc.description.statementofresponsibility Naira Khan
dc.description.statementofresponsibility Md. Tarek Habib
dc.description.statementofresponsibility Md. Jahangir Alam
dc.description.statementofresponsibility Rajib Rahman
dc.description.statementofresponsibility Naushad UzZaman
dc.description.statementofresponsibility Mumit Khan
dc.format.extent 5 pages
dc.language.iso en en_US
dc.publisher BRAC University en_US
dc.subject N-Gram analysis
dc.title History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? en_US
dc.type Article en_US
dc.contributor.department Center for Research on Bangla Language Processing, BRAC University
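
The directional contrast described in the abstract can be illustrated with a minimal bigram sketch. This is not the authors' implementation: the function names (count_bigrams, forward_prob, backward_prob) and the placeholder tokens are assumptions made here for illustration, and the estimates are plain unsmoothed relative frequencies. A forward bigram conditions a word on the word before it (history); a backward bigram conditions a word on the word after it (future).

```python
from collections import Counter


def count_bigrams(sentences):
    """Count adjacent word pairs in tokenized sentences.

    Returns (left, right, pairs), where left[w] is how often w opens a
    pair, right[w] is how often w closes a pair, and pairs[(a, b)] is
    how often a is immediately followed by b.
    """
    left, right, pairs = Counter(), Counter(), Counter()
    for tokens in sentences:
        for a, b in zip(tokens, tokens[1:]):
            pairs[(a, b)] += 1
            left[a] += 1
            right[b] += 1
    return left, right, pairs


def forward_prob(prev, word, left, pairs):
    """Forward bigram: estimate P(word | previous word)."""
    return pairs[(prev, word)] / left[prev] if left[prev] else 0.0


def backward_prob(word, nxt, right, pairs):
    """Backward bigram: estimate P(word | next word)."""
    return pairs[(word, nxt)] / right[nxt] if right[nxt] else 0.0


# Hypothetical toy corpus with placeholder tokens (not real Bangla data):
# each "sentence" is an argument followed by its head, mimicking
# head-final order.
toy = [["arg_a", "head"], ["arg_b", "head"], ["arg_a", "head"]]
left, right, pairs = count_bigrams(toy)

p_forward = forward_prob("arg_a", "head", left, pairs)
p_backward = backward_prob("arg_a", "head", right, pairs)
```

In this sketch the backward model starts from the head and asks which word precedes it, which is the direction the abstract argues reaches the head's own argument in a head-final language. Any real comparison would need a Bangla corpus and smoothing, neither of which is shown here.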
