Bangla hate speech detection on social media using attention-based recurrent neural network
Citation
Das, A. K., Al Asif, A., Paul, A., & Hossain, M. N. (2021). Bangla hate speech detection on social media using attention-based recurrent neural network. Journal of Intelligent Systems, 30(1), 578-591. doi:10.1515/jisys-2020-0060Abstract
Hate speech has spread more rapidly through the daily use of technology and, most notably, by
sharing your opinions or feelings on social media in a negative aspect. Although numerous works have
been carried out in detecting hate speeches in English, German, and other languages, very few works have
been carried out in the context of the Bengali language. In contrast, millions of people communicate on
social media in Bengali. The few existing works that have been carried out need improvements in both
accuracy and interpretability. This article proposed encoder–decoder-based machine learning model, a
popular tool in NLP, to classify user’s Bengali comments from Facebook pages. A dataset of 7,425 Bengali
comments, consisting of seven distinct categories of hate speeches, was used to train and evaluate our
model. For extracting and encoding local features from the comments, 1D convolutional layers were used.
Finally, the attention mechanism, LSTM, and GRU-based decoders have been used for predicting hate
speech categories. Among the three encoder–decoder algorithms, attention-based decoder obtained the
best accuracy (77%).