dc.contributor.author | Khan, Naira | |
dc.contributor.author | Habib, Md. Tarek | |
dc.contributor.author | Alam, Md. Jahangir | |
dc.contributor.author | Rahman, Rajib | |
dc.contributor.author | UzZaman, Naushad | |
dc.contributor.author | Khan, Mumit | |
dc.date.accessioned | 2010-10-24T04:28:50Z | |
dc.date.available | 2010-10-24T04:28:50Z | |
dc.date.issued | 2006 | |
dc.identifier.uri | http://hdl.handle.net/10361/627 | |
dc.description | Includes bibliographical references (page 5). | |
dc.description.abstract | This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However in Bangla it appears that a backward n-gram
is repeatedly more successful and yields more
grammatical results than a forward n-gram. This paper hypothesizes that the rationale behind this success is the syntactic ordering of constituents in Bangla. Bangla is a head-final specifier-initial language as opposed to English, which is head-initial
specifier-initial. Hence in Bangla, the head comes after its argument in a phrase. If an n-gram analysis begins with a head and moves backwards it will stretch to its own argument but if you move for-wards
then you'll probably grab the argument of an-other head. As probability of occurrence of heads is higher, probability of depending on a head is also higher and hence a backward n-gram will probably have a
greater chance of yielding grammatical results. We carried out several experiments to compare different directional results in different applications with an advantage
in the backward direction. This will prove a
useful linguistic insight in terms of n-gram based analysis depending upon variations of constituent analysis. | en_US |
dc.description.statementofresponsibility | Naira Khan | |
dc.description.statementofresponsibility | Md. Tarek Habib | |
dc.description.statementofresponsibility | Md. Jahangir Alam | |
dc.description.statementofresponsibility | Rajib Rahman | |
dc.description.statementofresponsibility | Naushad UzZaman | |
dc.description.statementofresponsibility | Mumit Khan | |
dc.format.extent | 5 pages | |
dc.language.iso | en | en_US |
dc.publisher | BRAC University | en_US |
dc.subject | N-Gram analysis | |
dc.title | History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? | en_US |
dc.type | Article | en_US |
dc.contributor.department | Center for Research on Bangla Language Processing, BRAC University | |