dc.contributor.advisor | Alam, Md. Ashraful | |
dc.contributor.author | Zaman, K M Tahzeem | |
dc.contributor.author | Hasan, Zahid | |
dc.contributor.author | Hossain, Mohd. Ibrahim | |
dc.date.accessioned | 2023-08-13T06:47:47Z | |
dc.date.available | 2023-08-13T06:47:47Z | |
dc.date.copyright | 2023 | |
dc.date.issued | 2023-01 | |
dc.identifier.other | ID: 17101212 | |
dc.identifier.other | ID: 17101466 | |
dc.identifier.other | ID: 17201021 | |
dc.identifier.uri | http://hdl.handle.net/10361/19385 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 24-25). | |
dc.description.abstract | In recent years, the primary solution to sound enhancement has gained popularity.
There is a rich research contribution from academia and industry to remove noise
and enhance sound quality. With the advance in machine learning and deep learning algorithms, well-performing audio enhancement models now exist. But such a
sophisticated and well-researched model has not existed utilizing the language of
Bangla. Although there have been models trained and tested to comprehend the
language, no such model exists that can process real-time Bangla speech. Also,
no such dataset exists that contains a substantial amount of speeches conducted
in the Bangla language spanning over multiple hours. In this research, we studied the existing models that are working to separate noise in composite auditory
environments, and on the basis of that study, we designed and implemented a U-Net
architecture model that has been trained in the Bangla language and is able
to isolate and separate external noise from Bangla language speeches providing a
clean feed to the listeners. Implementation of convolutional neural networks in digital
signal processing is a different approach and we achieved our desired results through
it. | en_US |
dc.description.statementofresponsibility | K M Tahzeem Zaman | |
dc.description.statementofresponsibility | Zahid Hasan | |
dc.description.statementofresponsibility | Mohd. Ibrahim Hossain | |
dc.format.extent | 25 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Short-time Fourier Transform (STFT) | en_US |
dc.subject | U-Net | en_US |
dc.subject | Signal to Distortion Ratio (SDR) | en_US |
dc.subject | Speech separation | en_US |
dc.subject.lcsh | Neural networks (Computer science) | |
dc.title | Bangla speech isolation from noisy auditory environment using convolutional neural network | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B. Computer Science | |