Analysis of and observations from a Bangla News Corpus
Abstract
In this paper we present the compilation methodology and some statistical analysis of “Prothom-Alo”, a Bangla news corpus that is the first of its kind for Bangla. We compare some of its statistics with those of the CIIL Bangla corpus and also report our observation of atypical behavior of the Zipf curve for the Prothom-Alo corpus.