Bangla speech to text conversion using CMU sphinx

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Bristy, Israt Jerin, Shakil, Nadim Imtiaz, Musavee, Tesnim, Choton, Akibur Rahman
مؤلفون آخرون: Arif, Hossain
التنسيق: أطروحة
اللغة:English
منشور في: Brac University 2020
الموضوعات:
الوصول للمادة أونلاين:http://hdl.handle.net/10361/13632
id 10361-13632
record_format dspace
spelling 10361-136322022-01-26T10:08:20Z Bangla speech to text conversion using CMU sphinx Bristy, Israt Jerin Shakil, Nadim Imtiaz Musavee, Tesnim Choton, Akibur Rahman Arif, Hossain Department of Computer Science and Engineering, Brac University Bangla Voice recognition CMUSphinx Acoustic model Language model Machine learning. This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019. Cataloged from PDF version of thesis. Includes bibliographical references (pages 30-32). Speech is the most normal type of communication and association between people while content (text) and images are the most basic types of exchange in the computer system. Therefore, enthusiasm in regards to transformation between speech and text is expanding day by day for integrating the human-computer relation. Understanding speech for a human is not a challenge but for a machine it is a big deal because a machine does not catch expression or human nature. For the conversion of speech into text, this proposed model requires the usage of the open sourced framework Sphinx 4 which is written in Java. For the proposed system, it requires certain steps which are training an acoustic model, creating a language model and building a dictionary with CMUSphinx. For training, the audio files were recorded by 8 speakers both male and female for more accuracy. Among them, 6 speakers recorded each word 3 times. To test the accuracy, we took audio recordings from 2 speakers among them one speaker is unknown to the system. After testing, we got the accuracy around 59.01%. For known speakers we got 78.57% accuracy. We gave audio files as input only to check accuracy as our main purpose was to make a system which works in real time. In our system, user can speak in real time and the system converts it into text. Israt Jerin Bristy Nadim Imtiaz Shakil Tesnim Musavee Akibur Rahman Choton B. Computer Science 2020-01-20T04:24:59Z 2020-01-20T04:24:59Z 2019 2019-08 Thesis ID 15301006 ID 15301037 ID 15101110 ID 15301102 http://hdl.handle.net/10361/13632 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 32 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic Bangla
Voice recognition
CMUSphinx
Acoustic model
Language model
Machine learning.
spellingShingle Bangla
Voice recognition
CMUSphinx
Acoustic model
Language model
Machine learning.
Bristy, Israt Jerin
Shakil, Nadim Imtiaz
Musavee, Tesnim
Choton, Akibur Rahman
Bangla speech to text conversion using CMU sphinx
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.
author2 Arif, Hossain
author_facet Arif, Hossain
Bristy, Israt Jerin
Shakil, Nadim Imtiaz
Musavee, Tesnim
Choton, Akibur Rahman
format Thesis
author Bristy, Israt Jerin
Shakil, Nadim Imtiaz
Musavee, Tesnim
Choton, Akibur Rahman
author_sort Bristy, Israt Jerin
title Bangla speech to text conversion using CMU sphinx
title_short Bangla speech to text conversion using CMU sphinx
title_full Bangla speech to text conversion using CMU sphinx
title_fullStr Bangla speech to text conversion using CMU sphinx
title_full_unstemmed Bangla speech to text conversion using CMU sphinx
title_sort bangla speech to text conversion using cmu sphinx
publisher Brac University
publishDate 2020
url http://hdl.handle.net/10361/13632
work_keys_str_mv AT bristyisratjerin banglaspeechtotextconversionusingcmusphinx
AT shakilnadimimtiaz banglaspeechtotextconversionusingcmusphinx
AT musaveetesnim banglaspeechtotextconversionusingcmusphinx
AT chotonakiburrahman banglaspeechtotextconversionusingcmusphinx
_version_ 1814307344097476608