
dc.contributor.author: Hasan, Muhammad Fahim
dc.contributor.author: Naushad UzZaman
dc.contributor.author: Khan, Mumit
dc.date.accessioned: 2010-10-05T05:03:09Z
dc.date.available: 2010-10-05T05:03:09Z
dc.date.copyright: 2007
dc.date.issued: 2007
dc.identifier.uri: http://hdl.handle.net/10361/330
dc.description: Includes bibliographical references (pages 6-8).
dc.description.abstract: Part-of-Speech (POS) tagging is the process of assigning each word in a sentence a suitable tag from a given set of tags. POS tagging is important in various areas of Natural Language Processing. Different methods of automating the process have been developed and employed for English and other Western languages. Some similar work, most of which uses stochastic approaches to POS tagging, has also been done for South Asian languages. We experimented with some of the widely used approaches to POS tagging on three South Asian languages, Bangla, Hindi and Telegu, using corpora of different sizes. We observed the performance of the approaches and found that Brill’s transformation-based tagger outperformed the other approaches in all of our experiments, though the use of this approach has been very limited until recently. [en_US]
dc.description.statementofresponsibility: Fahim Muhammad Hasan
dc.description.statementofresponsibility: Naushad UzZaman
dc.description.statementofresponsibility: Mumit Khan
dc.format.extent: 8 pages
dc.language.iso: en [en_US]
dc.publisher: BRAC University [en_US]
dc.subject: Part-of-speech tagging
dc.subject: Language processing
dc.title: Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages [en_US]
dc.type: Article [en_US]
dc.contributor.department: Center for Research on Bangla language Processing (CRBLP), BRAC University

