dc.contributor.advisor | Milon, Md. Iqbal Hossain | |
dc.contributor.advisor | Akhond, Mostafijur Rahman | |
dc.contributor.author | Rahman, Rafeed | |
dc.contributor.author | Rahman, Mehfuz A | |
dc.contributor.author | Hossain, Shahriar | |
dc.contributor.author | Hossain, Sajid | |
dc.date.accessioned | 2021-07-15T04:20:45Z | |
dc.date.available | 2021-07-15T04:20:45Z | |
dc.date.copyright | 2020 | |
dc.date.issued | 2020-12 | |
dc.identifier.other | ID: 17101502 | |
dc.identifier.other | ID: 17101378 | |
dc.identifier.other | ID: 17101370 | |
dc.identifier.other | ID: 17101352 | |
dc.identifier.uri | http://hdl.handle.net/10361/14804 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2020. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 27-30). | |
dc.description.abstract | Ransom calls are common in kidnapping and abduction incidents, where the victim's life remains extremely vulnerable. Law enforcement authorities often analyze these phone calls in real time to quickly identify the suspects and obtain crucial information for quick action. However, manual analysis is often difficult because of poor sound quality and the presence of several background noises. Even with high-end software at their disposal, accurately refining the incoming calls is impractical because separating the different layers of noise in a call takes a huge amount of time. This paper proposes a model based on deep convolutional neural networks and signal processing for automatic classification of crucial sounds in ransom-related phone calls. We propose customized LSTM and 2D CNN models and compare their outputs with VGG16 and AlexNet. Moreover, this paper presents a unique dataset of 17650 audio clips collected from verified online sources, covering voices (such as male or female) and the environmental sounds of the places where the victim might be, which can be a probable clue for investigation purposes. Finally, the models produced very high classification accuracy, with the LSTM reaching around 93.4%. | en_US |
dc.description.statementofresponsibility | Rafeed Rahman | |
dc.description.statementofresponsibility | Mehfuz A Rahman | |
dc.description.statementofresponsibility | Shahriar Hossain | |
dc.description.statementofresponsibility | Sajid Hossain | |
dc.format.extent | 30 Pages | |
dc.language.iso | en_US | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Convolution | en_US |
dc.subject | AlexNet | en_US |
dc.subject | VGG16 | en_US |
dc.subject | LSTM | en_US |
dc.subject | Neural Network | en_US |
dc.title | RansomListener: Ransom call sound investigation using LSTM and CNN Architectures | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B. Computer Science | |