Bangla grammar and spelling check using machine learning
Abstract
Bangla, or Bengali, is one of the world’s most spoken languages, with hundreds of
millions of native speakers worldwide. Thousands of books are written in the Bangla
language every year, and millions of people register in Bangla daily. But there are
only a few researches conducted on Bangla Grammar and Spelling correction because
of the lack of Bangla resources and the complexity of the Bangla language. This
paper is concerned with implementing a Machine Learning based model to detect
grammar and spelling errors in Bangla writing. There are many machine learning
algorithms to see mistakes in writing. This research uses Levenshtein distance and
Double Metaphone algorithms to detect spelling errors. For grammar, Recurrent
Neural Network based sequential model is used with an accuracy of 89%. We have
created a Bangla monolingual corpus containing three hundred thousand sentences
for this paper. Therefore, we expect this research to make Bangla writing easier and
more fascinating for everyone.