Enhanced hate speech detection in social media using transformer-based models
Abstract
Hate speech on social media can escalate into ”cyber conflict,” detrimentally impacting
social life. With the exponential growth of Internet users and media content,
identifying abusive language in audio and video content has become increasingly
challenging. The nuances of human communication mean that individuals might
employ seemingly non-hateful language in derogatory ways, often accompanied by
specific voice tones and gestures that aren’t captured when converting multimedia
into text. This research delves deep into the realm of hate speech detection, aiming
to automatically identify harmful content across various social media platforms.
Initially focused on text, our study utilized remote supervision for automatically labeled
dataset creation and employed word embeddings with a bias toward hate. We
analyzed datasets from Twitter, testing various machine-learning models to gauge
the representation of hate speech and abusive language. Any tweet or online post
exhibiting racist or sexist sentiments was categorized as ”hate speech.” Our objective
was to classify such messages for better content moderation systematically. With
advancements in our research, we have extended our detection capabilities to audio
content. By leveraging Simple Feed-forward Neural Networks, RNNs, and CNNs, we
can now discern hate speech patterns in audio with enhanced accuracy. However, the
vastness of content on social media platforms means not every piece can be manually
moderated. This underscores the importance of our automated hate speech detection,
especially when dealing with content in linguistically challenging languages.
However, social media networks cannot control every piece of user content. Because
of this, it is necessary to identify hate speech automatically. This desire is heightened
when the content is written in challenging languages. Our study provides a
unique transformer-based methodology for detecting hate speech in social media.
The proposed model uses Natural Language Processing (NLP) approaches to assess
text and audio input. To increase the accuracy of hate speech identification, we use
sophisticated deep learning architectures such as attention methods and transformers.
Our model is trained on a huge dataset of tweets and audio recordings, and
its performance is measured using a variety of criteria. Our transformer-based approach
beats existing state-of-the-art hate speech identification methods, according
to the results. Our study makes an essential addition to the field of computer science
and engineering by addressing the critical issue of hate speech on social media and
proposing an effective solution based on modern machine learning techniques.