dc.contributor.advisor | Esfar-E-Alam, A. M. | |
dc.contributor.advisor | Monim, Mobashir | |
dc.contributor.author | Rafidul Islam, Sheikh Md | |
dc.contributor.author | Gomes, Maria | |
dc.contributor.author | Hossain, Mehran | |
dc.contributor.author | Raihana, Ramisha | |
dc.date.accessioned | 2022-11-23T06:16:38Z | |
dc.date.available | 2022-11-23T06:16:38Z | |
dc.date.copyright | 2022 | |
dc.date.issued | 2022-05 | |
dc.identifier.other | ID: 22141059 | |
dc.identifier.other | ID: 22141070 | |
dc.identifier.other | ID: 22141043 | |
dc.identifier.other | ID: 21241077 | |
dc.identifier.uri | http://hdl.handle.net/10361/17613 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 29-30). | |
dc.description.abstract | Emotion recognition and sentiment analysis serves many purposes from analyzing
human behavior under specific conditions to enhancement of customer experience for
various services. In this paper, a multimodal approach is used to identify 4 classes
of emotions by combining both speech and text features to improve classification
accuracy. The methodology involves the implementation of several models for both
audio and text domains combined using 4 different heterogeneous ensemble tech niques - hard voting, soft voting, blending and stacking. The effects of the different
ensemble learning methods on the accuracy for the multimodal classification task
are also investigated. The results of this study show that stacking is the highest
performing ensemble technique, and the implementation outperforms several exist ing methods for 4-class emotion detection on the IEMOCAP dataset, obtaining a
weighted accuracy of 81.2%. | en_US |
dc.description.statementofresponsibility | Sheikh Md Rafidul Islam | |
dc.description.statementofresponsibility | Maria Gomes | |
dc.description.statementofresponsibility | Mehran Hossain | |
dc.description.statementofresponsibility | Ramisha Raihana | |
dc.format.extent | 30 Pages | |
dc.language.iso | en_US | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Multimodal | en_US |
dc.subject | Ensemble learning | en_US |
dc.subject | Emotion recognition | en_US |
dc.subject | Speech | en_US |
dc.subject | Text | en_US |
dc.subject | Stacking | en_US |
dc.subject | IEMOCAP | en_US |
dc.subject.lcsh | Emotions--Computer simulation | |
dc.subject.lcsh | Emotions -- Computer simulation. | |
dc.title | Multimodal emotion recognition from Speech and text using heterogeneous ensemble techniques | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B. Computer Science | |