Multimodal emotion recognition from Speech and text using heterogeneous ensemble techniques

Rafidul Islam, Sheikh Md; Gomes, Maria; Hossain, Mehran; Raihana, Ramisha

dc.contributor.advisor	Esfar-E-Alam, A. M.
dc.contributor.advisor	Monim, Mobashir
dc.contributor.author	Rafidul Islam, Sheikh Md
dc.contributor.author	Gomes, Maria
dc.contributor.author	Hossain, Mehran
dc.contributor.author	Raihana, Ramisha
dc.date.accessioned	2022-11-23T06:16:38Z
dc.date.available	2022-11-23T06:16:38Z
dc.date.copyright	2022
dc.date.issued	2022-05
dc.identifier.other	ID: 22141059
dc.identifier.other	ID: 22141070
dc.identifier.other	ID: 22141043
dc.identifier.other	ID: 21241077
dc.identifier.uri	http://hdl.handle.net/10361/17613
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 29-30).
dc.description.abstract	Emotion recognition and sentiment analysis serves many purposes from analyzing human behavior under specific conditions to enhancement of customer experience for various services. In this paper, a multimodal approach is used to identify 4 classes of emotions by combining both speech and text features to improve classification accuracy. The methodology involves the implementation of several models for both audio and text domains combined using 4 different heterogeneous ensemble tech niques - hard voting, soft voting, blending and stacking. The effects of the different ensemble learning methods on the accuracy for the multimodal classification task are also investigated. The results of this study show that stacking is the highest performing ensemble technique, and the implementation outperforms several exist ing methods for 4-class emotion detection on the IEMOCAP dataset, obtaining a weighted accuracy of 81.2%.	en_US
dc.description.statementofresponsibility	Sheikh Md Rafidul Islam
dc.description.statementofresponsibility	Maria Gomes
dc.description.statementofresponsibility	Mehran Hossain
dc.description.statementofresponsibility	Ramisha Raihana
dc.format.extent	30 Pages
dc.language.iso	en_US	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Multimodal	en_US
dc.subject	Ensemble learning	en_US
dc.subject	Emotion recognition	en_US
dc.subject	Speech	en_US
dc.subject	Text	en_US
dc.subject	Stacking	en_US
dc.subject	IEMOCAP	en_US
dc.subject.lcsh	Emotions--Computer simulation
dc.subject.lcsh	Emotions -- Computer simulation.
dc.title	Multimodal emotion recognition from Speech and text using heterogeneous ensemble techniques	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B. Computer Science

Files in this item

Name:: 22141059, 22141070, 22141043, ...
Size:: 1.145Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1480]

Show simple item record