Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages

Hasan, Muhammad Fahim; Naushad UzZaman; Khan, Mumit

dc.contributor.author	Hasan, Muhammad Fahim
dc.contributor.author	Naushad UzZaman
dc.contributor.author	Khan, Mumit
dc.date.accessioned	2010-10-05T05:03:09Z
dc.date.available	2010-10-05T05:03:09Z
dc.date.copyright	2007
dc.date.issued	2007
dc.identifier.uri	http://hdl.handle.net/10361/330
dc.description	Includes bibliographical references (pages 6-8).
dc.description.abstract	Part-of-Speech (POS) tagging is the process of assigning each word in a sentence a suitable tag from a given set of tags. POS tagging is important in various areas of Natural Language Processing. Different methods of automating the process have been developed and employed for English and other Western languages. Some similar work, most of which uses stochastic approaches to POS tagging, has also been done for South Asian languages. We experimented with some of the widely used approaches to POS tagging on three South Asian languages, Bangla, Hindi and Telugu, using corpora of different sizes. We observed the performance of the approaches and found the performance of Brill’s transformation-based tagger to be superior to that of the other approaches in all of our experiments, though the use of this approach has been very limited until recently.	en_US
dc.description.statementofresponsibility	Fahim Muhammad Hasan
dc.description.statementofresponsibility	Naushad UzZaman
dc.description.statementofresponsibility	Mumit Khan
dc.format.extent	8 pages
dc.language.iso	en	en_US
dc.publisher	BRAC University	en_US
dc.subject	Part-of-speech tagging
dc.subject	Language processing
dc.title	Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages	en_US
dc.type	Article	en_US
dc.contributor.department	Center for Research on Bangla Language Processing (CRBLP), BRAC University

Files in this item

Name: Comparison of Unigram, Bigram, ...
Size: 140.2Kb
Format: PDF

This item appears in the following Collection(s)

Conference Papers (Centre for Research on Bangla Language Processing) [40]
