Bangla speech to text conversion using CMU sphinx
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.
المؤلفون الرئيسيون: | , , , |
---|---|
مؤلفون آخرون: | |
التنسيق: | أطروحة |
اللغة: | English |
منشور في: |
Brac University
2020
|
الموضوعات: | |
الوصول للمادة أونلاين: | http://hdl.handle.net/10361/13632 |
id |
10361-13632 |
---|---|
record_format |
dspace |
spelling |
10361-136322022-01-26T10:08:20Z Bangla speech to text conversion using CMU sphinx Bristy, Israt Jerin Shakil, Nadim Imtiaz Musavee, Tesnim Choton, Akibur Rahman Arif, Hossain Department of Computer Science and Engineering, Brac University Bangla Voice recognition CMUSphinx Acoustic model Language model Machine learning. This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019. Cataloged from PDF version of thesis. Includes bibliographical references (pages 30-32). Speech is the most normal type of communication and association between people while content (text) and images are the most basic types of exchange in the computer system. Therefore, enthusiasm in regards to transformation between speech and text is expanding day by day for integrating the human-computer relation. Understanding speech for a human is not a challenge but for a machine it is a big deal because a machine does not catch expression or human nature. For the conversion of speech into text, this proposed model requires the usage of the open sourced framework Sphinx 4 which is written in Java. For the proposed system, it requires certain steps which are training an acoustic model, creating a language model and building a dictionary with CMUSphinx. For training, the audio files were recorded by 8 speakers both male and female for more accuracy. Among them, 6 speakers recorded each word 3 times. To test the accuracy, we took audio recordings from 2 speakers among them one speaker is unknown to the system. After testing, we got the accuracy around 59.01%. For known speakers we got 78.57% accuracy. We gave audio files as input only to check accuracy as our main purpose was to make a system which works in real time. In our system, user can speak in real time and the system converts it into text. Israt Jerin Bristy Nadim Imtiaz Shakil Tesnim Musavee Akibur Rahman Choton B. Computer Science 2020-01-20T04:24:59Z 2020-01-20T04:24:59Z 2019 2019-08 Thesis ID 15301006 ID 15301037 ID 15101110 ID 15301102 http://hdl.handle.net/10361/13632 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 32 pages application/pdf Brac University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Bangla Voice recognition CMUSphinx Acoustic model Language model Machine learning. |
spellingShingle |
Bangla Voice recognition CMUSphinx Acoustic model Language model Machine learning. Bristy, Israt Jerin Shakil, Nadim Imtiaz Musavee, Tesnim Choton, Akibur Rahman Bangla speech to text conversion using CMU sphinx |
description |
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019. |
author2 |
Arif, Hossain |
author_facet |
Arif, Hossain Bristy, Israt Jerin Shakil, Nadim Imtiaz Musavee, Tesnim Choton, Akibur Rahman |
format |
Thesis |
author |
Bristy, Israt Jerin Shakil, Nadim Imtiaz Musavee, Tesnim Choton, Akibur Rahman |
author_sort |
Bristy, Israt Jerin |
title |
Bangla speech to text conversion using CMU sphinx |
title_short |
Bangla speech to text conversion using CMU sphinx |
title_full |
Bangla speech to text conversion using CMU sphinx |
title_fullStr |
Bangla speech to text conversion using CMU sphinx |
title_full_unstemmed |
Bangla speech to text conversion using CMU sphinx |
title_sort |
bangla speech to text conversion using cmu sphinx |
publisher |
Brac University |
publishDate |
2020 |
url |
http://hdl.handle.net/10361/13632 |
work_keys_str_mv |
AT bristyisratjerin banglaspeechtotextconversionusingcmusphinx AT shakilnadimimtiaz banglaspeechtotextconversionusingcmusphinx AT musaveetesnim banglaspeechtotextconversionusingcmusphinx AT chotonakiburrahman banglaspeechtotextconversionusingcmusphinx |
_version_ |
1814307344097476608 |