dc.contributor.author | Mahmud, Altaf | |
dc.contributor.author | Khan, Mumit | |
dc.date.accessioned | 2010-10-04T10:57:39Z | |
dc.date.available | 2010-10-04T10:57:39Z | |
dc.date.copyright | 2007 | |
dc.date.issued | 2007 | |
dc.identifier.uri | http://hdl.handle.net/10361/329 | |
dc.description | Includes bibliographical references (page 6). | |
dc.description.abstract | Nowadays, the importance of a large annotated corpus for NLP researchers is widely known. In this paper, we describe an initial phase of developing a linguistically annotated corpus for the non-configurational language Bangla. Since the formalism differs from those posited for configurational languages, several features have been added for constraint-based parsing through an HPSG-based formalism. We propose an outline of a semi-automated process that applies both a case-marking approach and some morphological analysis to constrain the parsing of a relatively free word order language, in order to create a linguistically rich, highly lexicalized annotated corpus. | en_US |
dc.format.extent | 6 pages | |
dc.language.iso | en | en_US |
dc.publisher | BRAC University | en_US |
dc.subject | Treebank | en_US |
dc.subject | HPSG | en_US |
dc.subject | parsing | en_US |
dc.subject | non-configurational | en_US |
dc.subject | Treebanking | en_US |
dc.title | Building a foundation of HPSG-based treebank on Bangla language | en_US |
dc.type | Article | en_US |
dc.contributor.department | Center for Research on Bangla Language Processing (CRBLP), BRAC University | |