    • Automated image caption generator in Bangla using multimodal learning 

      Rodoshi, Mashiat Hasin; Ahmed, Moin Uddin; Ashraf, Md. Sobhan; Mim, Md. Galib Hasan; Khanam, Ashfia (Brac University, 2023-01)
      Experiencing an image on-screen is a privilege that we often seem not to care about. A visually impaired person does not have that luxury. A system that can automatically produce closed captions of an image can thus help ...
    • Bangla speech to text conversion using CMU sphinx 

      Bristy, Israt Jerin; Shakil, Nadim Imtiaz; Musavee, Tesnim; Choton, Akibur Rahman (Brac University, 2019-08)
      Speech is the most natural form of communication and interaction between people, while text and images are the most basic forms of exchange in computer systems. Therefore, interest in the transformation ...
    • Personal information from Bangla speech signal using MFCC and GMM 

      Hridy, Maisha Munawara; Hasan, Md. Hasib; Emon, Mahfuz Al (Brac University, 2019-08)
      Our system extracts personal information from Bangla speech. The dataset used consists of real-life voice inputs from different age and gender groups. A set of Bengali speech samples from YouTube was used as input ...