Voice impersonation detection using LSTM based RNN and explainable AI

Barua, Kawshik; Rahim, Abdur; Parizat, Prantozit Saha; Noor, Md.Asad Uzzaman; Jannah, Miftahul

View/Open

17201034, 17301164, 17301171, 17301128, 13201085_CSE.pdf (1.169Mb)

Date

2021-10

Publisher

Brac University

Abstract

The advancing eld of arti cial synthetic media introduced deepfakes which made it easier to synthesize a person's voice, identical to their original voice mechanically to use it for negative means. People's voices are exposed to public as it is a pro - cient and more convenient media of exchanging information over various mediums, entertainment, speech delivering, news reading and so on, making it easier to collect voice samples for creating fake yet almost identical voice samples to trick people. So it has become vital to prevent this crime which led us to do this research paper for saving the victims of voice impersonation attacks where we used LSTM based RNN model in order to distinguished between real and synthesize voice.Furthermore, to compare the results we got from the mentioned process, we build a SVM classi er and nally we've explained the predicted outputs(fake or real) of both LSTM and SVM model by using an Explainable AI method named LIME. Our research resulted in 98.33% accuracy rate through our proposed model and very low percentage of error in detecting fake/synthesized voices.

Keywords

Deepfakes; Voice impersonation detection; LSTM based RNN; Feature extraction; SVM; LIME; Explainable AI

LC Subject Headings

Artificial intelligence; Machine learning; Automatic speech recognition.

Description

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2022.

Cataloged from PDF version of thesis.

Includes bibliographical references (pages 33-35).

Department

Department of Computer Science and Engineering, Brac University

Type

Thesis

Collections

Thesis & Report, BSc (Computer Science and Engineering) [1589]