dc.contributor.author: Khan, Naira
dc.contributor.author: Habib, Md. Tarek
dc.contributor.author: Alam, Md. Jahangir
dc.contributor.author: Rahman, Rajib
dc.contributor.author: UzZaman, Naushad
dc.contributor.author: Khan, Mumit
dc.date.accessioned: 2010-10-24T04:28:50Z
dc.date.available: 2010-10-24T04:28:50Z
dc.date.issued: 2006
dc.identifier.uri: http://hdl.handle.net/10361/627
dc.description: Includes bibliographical references (page 5).
dc.description.abstract: This paper presents a directional advantage of n-gram modeling in terms of backward or forward n-gram modeling in Bangla. The most commonly used n-gram analysis is predominantly a forward n-gram. However, in Bangla it appears that a backward n-gram is repeatedly more successful and yields more grammatical results than a forward n-gram. This paper hypothesizes that the rationale behind this success is the syntactic ordering of constituents in Bangla. Bangla is a head-final, specifier-initial language, as opposed to English, which is head-initial and specifier-initial. Hence in Bangla, the head comes after its argument in a phrase. If an n-gram analysis begins with a head and moves backwards, it will stretch to its own argument, but if it moves forwards, it will probably grab the argument of another head. As the probability of occurrence of heads is higher, the probability of depending on a head is also higher, and hence a backward n-gram will probably have a greater chance of yielding grammatical results. We carried out several experiments to compare different directional results in different applications, with an advantage in the backward direction. This will prove a useful linguistic insight in terms of n-gram based analysis depending upon variations of constituent analysis. [en_US]
dc.description.statementofresponsibility: Naira Khan
dc.description.statementofresponsibility: Md. Tarek Habib
dc.description.statementofresponsibility: Md. Jahangir Alam
dc.description.statementofresponsibility: Rajib Rahman
dc.description.statementofresponsibility: Naushad UzZaman
dc.description.statementofresponsibility: Mumit Khan
dc.format.extent: 5 pages
dc.language.iso: en [en_US]
dc.publisher: BRAC University [en_US]
dc.subject: N-Gram analysis
dc.title: History (Forward N-Gram) or future (Backward N-Gram)? Which model to consider for N-Gram analysis in Bangla? [en_US]
dc.type: Article [en_US]
dc.contributor.department: Center for Research on Bangla Language Processing, BRAC University
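
The forward/backward distinction described in the abstract can be illustrated with a minimal bigram sketch. This is not the authors' implementation: the function name `bigram_model`, the maximum-likelihood estimate without smoothing, and the three-sentence romanized toy corpus are all assumptions made for illustration. A forward model conditions each word on the word before it; a backward model conditions each word on the word after it, which here is done by reversing each sentence before counting.

```python
from collections import Counter


def bigram_model(sentences, backward=False):
    """Maximum-likelihood bigram estimates as {(context, word): probability}.

    Forward:  context is the preceding word,  P(w_i | w_{i-1}).
    Backward: context is the following word,  P(w_i | w_{i+1}).
    Hypothetical helper for illustration only; no smoothing is applied.
    """
    pair_counts = Counter()
    context_counts = Counter()
    for tokens in sentences:
        # Reversing the sentence turns "next word" into "previous word".
        seq = list(reversed(tokens)) if backward else list(tokens)
        for context, word in zip(seq, seq[1:]):
            pair_counts[(context, word)] += 1
            context_counts[context] += 1
    return {
        pair: count / context_counts[pair[0]]
        for pair, count in pair_counts.items()
    }


# Toy romanized corpus (an assumption, not data from the paper).
# Each sentence is subject, object, verb, so the verb (head) comes last.
corpus = [
    ["ami", "bhat", "khai"],
    ["ami", "boi", "pori"],
    ["tumi", "bhat", "khao"],
]

forward = bigram_model(corpus)
backward = bigram_model(corpus, backward=True)

# Forward: probability of the object "bhat" given the preceding subject "ami".
print(forward[("ami", "bhat")])
# Backward: probability of the object "bhat" given the following verb "khai".
print(backward[("khai", "bhat")])
```

In this toy corpus the backward model conditions the object on its own verb, while the forward model conditions it on the subject. The numbers themselves reflect only the three invented sentences and say nothing about the paper's experimental results.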

