RansomListener: Ransom call sound investigation using LSTM and CNN Architectures

Rahman, Rafeed; Rahman, Mehfuz A; Hossain, Shahriar; Hossain, Sajid

dc.contributor.advisor	Milon, Md.Iqbal Hossain
dc.contributor.advisor	Akhond, Mostafijur Rahman
dc.contributor.author	Rahman, Rafeed
dc.contributor.author	Rahman, Mehfuz A
dc.contributor.author	Hossain, Shahriar
dc.contributor.author	Hossain, Sajid
dc.date.accessioned	2021-07-15T04:20:45Z
dc.date.available	2021-07-15T04:20:45Z
dc.date.copyright	2020
dc.date.issued	2020-12
dc.identifier.other	ID: 17101502
dc.identifier.other	ID: 17101378
dc.identifier.other	ID: 17101370
dc.identifier.other	ID: 17101352
dc.identifier.uri	http://hdl.handle.net/10361/14804
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2020.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 27-30).
dc.description.abstract	Getting calls for ransoms are common phenomena in kidnapping and abduction related incidents where the life of the victim remains extremely vulnerable. These phone calls are often analyzed in real-time by law enforcement authorities to quickly identify the suspects and get crucial information for quick action. However, it is often difficult to manually analyze those phone calls due to the quality of sounds and the presence of several background noises. Even with much high-end software in their inventory, it is futile to accurately refine the incoming calls as it takes a huge amount of time to declutter the different layers of noises in the call. This paper proposes a model based on deep convolutional neural network and signal processing for automatic classification of crucial sounds in ransom related phone calls. We have proposed LSTM and 2D CNN customized models and compared their outputs with VGG16 and AlexNet. Moreover, this paper also presents a unique dataset of different sounds in terms of voices like male or female and the environmental sounds where the victim might be in which can be a probable clue for investigation purposes consisting of 17650 audio clips collected from verified online sources. Finally, the models produced very high classification accuracy with the accuracy of LSTM reaching around 93.4%.	en_US
dc.description.statementofresponsibility	Rafeed Rahman
dc.description.statementofresponsibility	Mehfuz A Rahman
dc.description.statementofresponsibility	Shahriar Hossain
dc.description.statementofresponsibility	Sajid Hossain
dc.format.extent	30 Pages
dc.language.iso	en_US	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Convolution	en_US
dc.subject	AlexNET	en_US
dc.subject	VGG16	en_US
dc.subject	LSTM	en_US
dc.subject	Neural Network	en_US
dc.title	RansomListener: Ransom call sound investigation using LSTM and CNN Architectures	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B. Computer Science

Files in this item

Name:: 17101502, 17101378, 17101370, ...
Size:: 7.111Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1589]

Show simple item record