dc.contributor.advisor | Chakrabarty, Amitabha | |
dc.contributor.advisor | Automatic speech recognition. | |
dc.contributor.author | Ahmed, Hasibul Hasan | |
dc.contributor.author | Ahmed, Zain | |
dc.contributor.author | Choden, Tshewang | |
dc.contributor.author | Chaudhary, Nutan | |
dc.date.accessioned | 2024-10-17T09:07:40Z | |
dc.date.available | 2024-10-17T09:07:40Z | |
dc.date.copyright | ©2024 | |
dc.date.issued | 2024-05 | |
dc.identifier.other | ID 24141144 | |
dc.identifier.other | ID 20101117 | |
dc.identifier.other | ID 20201207 | |
dc.identifier.other | ID 20201199 | |
dc.identifier.uri | http://hdl.handle.net/10361/24346 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2024. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 56-58). | |
dc.description.abstract | In today’s digital world, voice emotion recognition is essential for applications
like intelligent tutoring, audio mining, security, telecommunication,
HCI, lie detection, and human-machine interactions in various
settings. Voice, which is used to express one’s perspective and communicate
inter-personally, is one of the characteristics that differentiate
humans. The rise of IoT and wearable technology offers new opportunities
for real-time, remote emotion detection through voice. In the context
of voice processing-based emotion recognition, particularly in the
Internet of Things wearable, this thesis investigates the possibilities of
tiny machine learning or TinyML. To accomplish this goal, we evaluated
Bidirectional-LSTM and CNN on both vector quantization and raw data
gave us notable accuracy of 88%, 80%, 85%, and 81% respectively and
LSTM, Random Forest, Logistic Regression, KNN and GRU on only raw
data shows accuracy rates of 86%, 89%, 89%, 86% and 82% using the
composite dataset that includes well-known datasets such as RAVDESS,
CREMA-D, TESS, and SAVEE. Furthermore, the models with the best
accuracy were selected to be implemented within the TinyML framework,
Tensorflow-lite. Our benchmarks highlighted that most of the best performing
models were Recurrent Neural Network (RNN) based, notably
BiLSTM, LSTM, GRU alongside the CNN model. Finally, after validating
the findings through hardware implementation on Raspberry Pi
4, the study concludes that BiLSTM model would be most suitable for
speech emotion recognition tasks (SER) in the TinyML domain . The
hardware performance of the model illustrates how confident the model
actually is in predicting emotions from raw voice input within significant
resource and power constraints . These findings contribute to the ongoing
discourse on the intersection of voice emotion recognition, TinyML, and
IoT, showcasing the potential for enhanced human-machine interactions
in a wide variety of practical domains. | en_US |
dc.description.statementofresponsibility | Hasibul Hasan Ahmed | |
dc.description.statementofresponsibility | Zain Ahmed | |
dc.description.statementofresponsibility | Tshewang Choden | |
dc.description.statementofresponsibility | Nutan Chaudhary | |
dc.format.extent | 70 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Tiny machine learning | en_US |
dc.subject | Emotion detection | en_US |
dc.subject | SER | en_US |
dc.subject | Voice signals | en_US |
dc.subject | Wearable IoT devices | en_US |
dc.subject | BiLSTM | en_US |
dc.subject | Convolutional neural network | en_US |
dc.subject | KNN | en_US |
dc.subject.lcsh | Internet of things. | |
dc.subject.lcsh | Emotion recognition. | |
dc.subject.lcsh | Neural networks (Computer science). | |
dc.subject.lcsh | Speech processing systems. | |
dc.title | TinyML for emotion detection in voice signals: evaluating and proposing algorithms for IoT wearable devices | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B.Sc. in Computer Science | |