TinyML for emotion detection in voice signals: evaluating and proposing algorithms for IoT wearable devices

Ahmed, Hasibul Hasan; Ahmed, Zain; Choden, Tshewang; Chaudhary, Nutan

dc.contributor.advisor	Chakrabarty, Amitabha
dc.contributor.advisor	Automatic speech recognition.
dc.contributor.author	Ahmed, Hasibul Hasan
dc.contributor.author	Ahmed, Zain
dc.contributor.author	Choden, Tshewang
dc.contributor.author	Chaudhary, Nutan
dc.date.accessioned	2024-10-17T09:07:40Z
dc.date.available	2024-10-17T09:07:40Z
dc.date.copyright	©2024
dc.date.issued	2024-05
dc.identifier.other	ID 24141144
dc.identifier.other	ID 20101117
dc.identifier.other	ID 20201207
dc.identifier.other	ID 20201199
dc.identifier.uri	http://hdl.handle.net/10361/24346
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2024.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 56-58).
dc.description.abstract	In today’s digital world, voice emotion recognition is essential for applications like intelligent tutoring, audio mining, security, telecommunication, HCI, lie detection, and human-machine interactions in various settings. Voice, which is used to express one’s perspective and communicate inter-personally, is one of the characteristics that differentiate humans. The rise of IoT and wearable technology offers new opportunities for real-time, remote emotion detection through voice. In the context of voice processing-based emotion recognition, particularly in the Internet of Things wearable, this thesis investigates the possibilities of tiny machine learning or TinyML. To accomplish this goal, we evaluated Bidirectional-LSTM and CNN on both vector quantization and raw data gave us notable accuracy of 88%, 80%, 85%, and 81% respectively and LSTM, Random Forest, Logistic Regression, KNN and GRU on only raw data shows accuracy rates of 86%, 89%, 89%, 86% and 82% using the composite dataset that includes well-known datasets such as RAVDESS, CREMA-D, TESS, and SAVEE. Furthermore, the models with the best accuracy were selected to be implemented within the TinyML framework, Tensorflow-lite. Our benchmarks highlighted that most of the best performing models were Recurrent Neural Network (RNN) based, notably BiLSTM, LSTM, GRU alongside the CNN model. Finally, after validating the findings through hardware implementation on Raspberry Pi 4, the study concludes that BiLSTM model would be most suitable for speech emotion recognition tasks (SER) in the TinyML domain . The hardware performance of the model illustrates how confident the model actually is in predicting emotions from raw voice input within significant resource and power constraints . These findings contribute to the ongoing discourse on the intersection of voice emotion recognition, TinyML, and IoT, showcasing the potential for enhanced human-machine interactions in a wide variety of practical domains.	en_US
dc.description.statementofresponsibility	Hasibul Hasan Ahmed
dc.description.statementofresponsibility	Zain Ahmed
dc.description.statementofresponsibility	Tshewang Choden
dc.description.statementofresponsibility	Nutan Chaudhary
dc.format.extent	70 pages
dc.language.iso	en	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Tiny machine learning	en_US
dc.subject	Emotion detection	en_US
dc.subject	SER	en_US
dc.subject	Voice signals	en_US
dc.subject	Wearable IoT devices	en_US
dc.subject	BiLSTM	en_US
dc.subject	Convolutional neural network	en_US
dc.subject	KNN	en_US
dc.subject.lcsh	Internet of things.
dc.subject.lcsh	Emotion recognition.
dc.subject.lcsh	Neural networks (Computer science).
dc.subject.lcsh	Speech processing systems.
dc.title	TinyML for emotion detection in voice signals: evaluating and proposing algorithms for IoT wearable devices	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B.Sc. in Computer Science

Files in this item

Name:: 24141144, 20101117, 20201207, ...
Size:: 1.005Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1475]

Show simple item record