dc.contributor.advisor | Alam, Md. Ashraful | |
dc.contributor.author | Hakim, Talukder Juhaer | |
dc.contributor.author | Monsur, Sayema Binte | |
dc.contributor.author | Shuvo, Abtahi Maskawath | |
dc.contributor.author | Azrine, Tasmia | |
dc.contributor.author | Labib, Md. Zarif | |
dc.date.accessioned | 2024-07-02T06:20:57Z | |
dc.date.available | 2024-07-02T06:20:57Z | |
dc.date.copyright | ©2023 | |
dc.date.issued | 2023 | |
dc.identifier.other | ID 19301134 | |
dc.identifier.other | ID 19301030 | |
dc.identifier.other | ID 19301131 | |
dc.identifier.other | ID 20301165 | |
dc.identifier.other | ID 19301165 | |
dc.identifier.uri | http://hdl.handle.net/10361/23636 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 48-50). | |
dc.description.abstract | Gender classification based on voice analysis is one of the elemental tasks in speech
and audio processing, with various applications such as speech recognition systems,
voice assistants, call center analytics, etc. For speech synthesis, speaker identification,
and human-computer interaction- gender recognition plays a vital role. Although
extensive research on this topic has been done in various languages, any
studies can hardly be found regarding gender classification in the Bangla language.
Our research paper aims to recognize gender in the Bangla language using deep
learning approaches and voice analysis. The core of our approach involves the use of
CNN models (ResNet50, EfficientNetB0, InceptionV3, and DenseNet-121) for our
data training. The Mel-Frequency Cepstral Coefficients (MFCC) and short-time
Fourier transforms (STFT) were computed from audio recordings and used as input
features to the neural network model. The system’s excellent accuracy rate
demonstrates its potential for use in practical settings. By providing light on the
application of deep learning techniques in the context of the Bangla language, this
study advances the area of gender identification. 95% accuracy was achieved in the
InspectionV3 and EfficientNetB0 models with the MFCC input. | en_US |
dc.description.statementofresponsibility | Talukder Juhaer Hakim | |
dc.description.statementofresponsibility | Sayema Binte Monsur | |
dc.description.statementofresponsibility | Abtahi Maskawath Shuvo | |
dc.description.statementofresponsibility | Tasmia Azrine | |
dc.description.statementofresponsibility | Md. Zarif Labib | |
dc.format.extent | 59 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Deep learning | en_US |
dc.subject | Machine learning | en_US |
dc.subject | Bangla language | en_US |
dc.subject | F1-score | en_US |
dc.subject | Decision tree | en_US |
dc.subject | Inception V3 | en_US |
dc.subject | DenseNet-121 | en_US |
dc.subject | STFT | en_US |
dc.subject | MFCC | en_US |
dc.subject.lcsh | Neural networks (Computer science) | |
dc.subject.lcsh | Cognitive learning theory (Deep learning) | |
dc.subject.lcsh | Automatic speech recognition--Data processing | |
dc.title | Gender classification in Bangla language using deep learning-based voice analysis | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B.Sc. in Computer Science | |