BRAC University Institutional Repository

History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla?

DSpace/Manakin Repository

Show simple item record

dc.contributor.author Khan, Naira
dc.contributor.author Habib, Md. Tarek
dc.contributor.author Alam, Md. Jahangir
dc.contributor.author Rahman, Rajib
dc.contributor.author UzZaman, Naushad
dc.contributor.author Khan, Mumit
dc.date.accessioned 2010-10-24T04:28:50Z
dc.date.available 2010-10-24T04:28:50Z
dc.date.issued 2006
dc.identifier.uri http://hdl.handle.net/10361/627
dc.description Includes bibliographical references (page 5).
dc.description.abstract This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However in Bangla it appears that a backward n-gram is repeatedly more successful and yields more grammatical results than a forward n-gram. This paper hypothesizes that the rationale behind this success is the syntactic ordering of constituents in Bangla. Bangla is a head-final specifier-initial language as opposed to English, which is head-initial specifier-initial. Hence in Bangla, the head comes after its argument in a phrase. If an n-gram analysis begins with a head and moves backwards it will stretch to its own argument but if you move for-wards then you'll probably grab the argument of an-other head. As probability of occurrence of heads is higher, probability of depending on a head is also higher and hence a backward n-gram will probably have a greater chance of yielding grammatical results. We carried out several experiments to compare different directional results in different applications with an advantage in the backward direction. This will prove a useful linguistic insight in terms of n-gram based analysis depending upon variations of constituent analysis. en_US
dc.description.statementofresponsibility Naira Khan
dc.description.statementofresponsibility Md. Tarek Habib
dc.description.statementofresponsibility Md. Jahangir Alam
dc.description.statementofresponsibility Rajib Rahman
dc.description.statementofresponsibility Naushad UzZaman
dc.description.statementofresponsibility Mumit Khan
dc.format.extent 5 pages
dc.language.iso en en_US
dc.publisher BRAC University en_US
dc.subject N-Gram analysis
dc.title History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? en_US
dc.type Article en_US
dc.contributor.department Center for Research on Bangla Language Processing, BRAC University


Files in this item

Files Size Format View
History (forward N-Gram).pdf 153.7Kb PDF View/Open or Preview

This item appears in the following Collection(s)

Show simple item record

Policy Guidelines

Search DSpace


Browse

My Account