Bangla speech isolation from noisy auditory environment using convolutional neural network

Zaman, K M Tahzeem; Hasan, Zahid; Hossain, Mohd. Ibrahim

dc.contributor.advisor	Alam, Md. Ashraful
dc.contributor.author	Zaman, K M Tahzeem
dc.contributor.author	Hasan, Zahid
dc.contributor.author	Hossain, Mohd. Ibrahim
dc.date.accessioned	2023-08-13T06:47:47Z
dc.date.available	2023-08-13T06:47:47Z
dc.date.copyright	2023
dc.date.issued	2023-01
dc.identifier.other	ID: 17101212
dc.identifier.other	ID: 17101466
dc.identifier.other	ID: 17201021
dc.identifier.uri	http://hdl.handle.net/10361/19385
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 24-25).
dc.description.abstract	In recent years, the primary solution to sound enhancement has gained popularity. There is a rich research contribution from academia and industry to remove noise and enhance sound quality. With the advance in machine learning and deep learn ing algorithms, well-performing audio enhancement models now exist. But such a sophisticated and well-researched model has not existed utilizing the language of Bangla. Although there have been models trained and tested to comprehend the language, no such model exists that can process real-time Bangla speech. Also, no such dataset exists that contains a substantial amount of speeches conducted in the Bangla language spanning over multiple hours. In this research, we stud ied the existing models that are working to separate noise in composite auditory environments, and on the basis of that study, we designed and implemented a U Net architecture model that has been trained in the Bangla language and is able to isolate and separate external noise from Bangla language speeches providing a clean feed to the listeners. Implementation of convolution neural networks in digital signal processing is a different approach and we achieved our desired results through it.	en_US
dc.description.statementofresponsibility	K M Tahzeem Zaman
dc.description.statementofresponsibility	Zahid Hasan
dc.description.statementofresponsibility	Mohd. Ibrahim Hossain
dc.format.extent	25 pages
dc.language.iso	en	en_US
dc.publisher	Brac University	en_US
dc.rights	Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Short-time Fourier Transform (STFT)	en_US
dc.subject	U-Net	en_US
dc.subject	Singal to Distortion Ratio (SDR)	en_US
dc.subject	Speech separation	en_US
dc.subject.lcsh	Neural networks (Computer science)
dc.title	Bangla speech isolation from noisy auditory environment using convolutional neural network	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, Brac University
dc.description.degree	B. Computer Science

Files in this item

Name:: 17101212, 17101466, 17201021_C ...
Size:: 11.38Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1588]

Show simple item record