Towards devising an efficient VQA in the Bengali Language
Date
2021-12Publisher
Brac UniversityAuthor
Islam, S M ShahriarAuntor, Riyad Ahsan
Islam, Minhajul
Chowdhury, Tahmin Haider
Hossain Anik, Mohammad Yousuf
Metadata
Show full item recordAbstract
This paper aims to provide insight into how Visual question answering might work
on Bangla datasets versus English datasets. Several studies have been conducted on
deep learning methods applied to Bangla datasets up to this point. However, a Bangla
dataset with images and questions embedded in each of them has yet to be created.
We attempted to create a Bangla dataset suitable for such implementation through
our re search. The step-by-step procedures in our work demonstrate how various bar riers can be overcome while developing datasets. We attempted to use existing visual
question answering datasets because there are no actual Bangla datasets created for
this specific task.In the end we successfully created our own Bangla visual question an swering datasets and proposed a model to train and compare among existing datasets.
Following that, the comparison was provided to show how the Bangla dataset differs
from the English datasets in terms of the VQA model. Our work should make more
than enough room for future research and implementation of visual question answering
tasks in Bangla.