Analysis of real-time hostile activitiy detection from spatiotemporal features using time distributed deep convolutional neural networks, recurrent neural networks and attention-based mechanisms

Siddique, Labib Ahmed; Junhai, Rabita; Islam, Moshfeka; Qader, Shafinaz

dc.contributor.advisor	Chakrabarty, Dr. Amitabha
dc.contributor.advisor	Reza, Tanzim
dc.contributor.advisor	Rahman, Tanvir
dc.contributor.author	Siddique, Labib Ahmed
dc.contributor.author	Junhai, Rabita
dc.contributor.author	Islam, Moshfeka
dc.contributor.author	Qader, Shafinaz
dc.date.accessioned	2022-12-14T09:22:07Z
dc.date.available	2022-12-14T09:22:07Z
dc.date.copyright	2022
dc.date.issued	2022-05
dc.identifier.other	ID: 18101478
dc.identifier.other	ID: 18101259
dc.identifier.other	ID: 18101432
dc.identifier.other	ID: 18141006
dc.identifier.uri	http://hdl.handle.net/10361/17652
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2022.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 48-51).
dc.description.abstract	Throughout time, there has been a surge of hostile activities in public places across the globe. With the advancement in technology, it has been possible to monitor public places through real time surveillance. Video surveillance has become essential for ensuring public safety as it provides a significant benefit in lowering the crime rate, as well as monitoring the facility within its reach. Hence, CCTV cameras are installed in all areas where security is a priority. Although CCTV cameras help a lot in increasing security, the main drawback in these surveillance systems is that it requires constant human interaction and monitoring. To eradicate this issue, an automated surveillance system can be built using artificial intelligence, deep learning and IoT (Internet of things). So in this research we explore deep learn ing video classification techniques that can help us automate surveillance systems to detect violence as they are happening. Traditional machine learning or image classification techniques fall short when it comes to classifying videos as they attempt to classify each frame separately for which the predictions start to flicker. So many researchers are coming up with video classification techniques that consider spatiotemporal features while classifying. However, deploying these deep learning models are not always practical in an IoT environment. For this reason we cannot use techniques that are acquired like skeleton points and optical flow through technologies like pose estimation or depth sensors. Although these techniques ensure a higher accuracy score, they are computationally heavy. Keeping these constraints in mind, we experimented with various video classification and action recognition techniques such as ConvLSTM, LRCN (with both custom CNN layers and VGG-16 as feature extractor) CNN-Transformer and C3D (3D-CNN). We achieved a test accuracy of 80% on ConvLSTM, 83.33% on CNN-BiLSTM, 70% on VGG16-BiLstm ,76.76% on CNN-Transformer and 80% on C3D model.	en_US
dc.description.statementofresponsibility	Labib Ahmed Siddique
dc.description.statementofresponsibility	Rabita Junhai
dc.description.statementofresponsibility	Moshfeka Islam
dc.description.statementofresponsibility	Shafinaz Qader
dc.format.extent	51 Pages
dc.language.iso	en_US	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Artificial Intelligence	en_US
dc.subject	Deep learning	en_US
dc.subject	Neural network	en_US
dc.subject	Violence detection	en_US
dc.subject	Video classification	en_US
dc.subject	Attention based encoder	en_US
dc.subject	LRCN	en_US
dc.subject	ConvLSTM	en_US
dc.subject	Transformer	en_US
dc.subject	C3D	en_US
dc.subject.lcsh	Neural networks (Computer science)
dc.subject.lcsh	Neural network.
dc.subject.lcsh	Deep learning (Machine learning)
dc.title	Analysis of real-time hostile activitiy detection from spatiotemporal features using time distributed deep convolutional neural networks, recurrent neural networks and attention-based mechanisms	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B. Computer Science and Engineering

Files in this item

Name:: 17101473, 17101195, 17101329, ...
Size:: 2.760Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1480]

Show simple item record