Automatic subtitle generation for Bengali multimedia using deep learning
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
প্রধান লেখক: | , , |
---|---|
অন্যান্য লেখক: | |
বিন্যাস: | গবেষণাপত্র |
ভাষা: | English |
প্রকাশিত: |
Brac University
2024
|
বিষয়গুলি: | |
অনলাইন ব্যবহার করুন: | http://hdl.handle.net/10361/23554 |
id |
10361-23554 |
---|---|
record_format |
dspace |
spelling |
10361-235542024-06-25T21:03:38Z Automatic subtitle generation for Bengali multimedia using deep learning Rhythm, Ehsanur Rahman Arnob, Shafakat Sowroar Shuvo, Rajvir Ahmed Jahan,Sifat E Rasel, Annajiat Alim Department of Computer Science and Engineering, Brac University Automatic subtitle generation Bengali audio Deep learning Natural language processing Natural language processing (Computer science) Computational linguistics Data mining This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. Cataloged from PDF version of thesis. Includes bibliographical references (pages 51-53). For audio or video material to be more inclusive and accessible, automatic subtitle generation is essential. Nevertheless, implementing this technology into Bengali presents significant challenges due to scarce resources and linguistic difficulty. In this study, a new deep learning based system for creating Subtitles for Bengali multimedia automatically is introduced. The suggested approach makes use of the Wav2vec2 and the Common Voice Bengali Dataset, a large collection of Bengali audio recordings. This study uses the Common Voice Dataset Bengali to train and tune the Wav2vec2 model in order to accurately convert Bengali audio into text. Current automatic speech recognition approaches are combined with Bengali language-specific factors in the created system to give accurate and reliable transcription works. The transcribed text is synced with the matching audio parts throughout the subtitle production process. The produced subtitles are enhanced using post-processing approaches, similar to capitalization and punctuation restoration, to ensure readability and consistency. The findings of this study might greatly improve Bengali language media’s usability and availability across a range of sectors. The created subtitles may enhance the watching experience for Bengali multimedia by easing greater understanding, and expanding availability. The study demonstrates the potential of using deep learning and ASR methods to get over the difficulties of automated subtitle production in the Bengali language, advancing multimedia availability and inclusion. Ehsanur Rahman Rhythm Shafakat Sowroar Arnob Rajvir Ahmed Shuvo B.Sc in Computer Science 2024-06-25T03:29:58Z 2024-06-25T03:29:58Z ©2023 2023-09 Thesis ID 22241163 ID 20101129 ID 20141003 http://hdl.handle.net/10361/23554 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 62 pages application/pdf Brac University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Automatic subtitle generation Bengali audio Deep learning Natural language processing Natural language processing (Computer science) Computational linguistics Data mining |
spellingShingle |
Automatic subtitle generation Bengali audio Deep learning Natural language processing Natural language processing (Computer science) Computational linguistics Data mining Rhythm, Ehsanur Rahman Arnob, Shafakat Sowroar Shuvo, Rajvir Ahmed Automatic subtitle generation for Bengali multimedia using deep learning |
description |
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. |
author2 |
Jahan,Sifat E |
author_facet |
Jahan,Sifat E Rhythm, Ehsanur Rahman Arnob, Shafakat Sowroar Shuvo, Rajvir Ahmed |
format |
Thesis |
author |
Rhythm, Ehsanur Rahman Arnob, Shafakat Sowroar Shuvo, Rajvir Ahmed |
author_sort |
Rhythm, Ehsanur Rahman |
title |
Automatic subtitle generation for Bengali multimedia using deep learning |
title_short |
Automatic subtitle generation for Bengali multimedia using deep learning |
title_full |
Automatic subtitle generation for Bengali multimedia using deep learning |
title_fullStr |
Automatic subtitle generation for Bengali multimedia using deep learning |
title_full_unstemmed |
Automatic subtitle generation for Bengali multimedia using deep learning |
title_sort |
automatic subtitle generation for bengali multimedia using deep learning |
publisher |
Brac University |
publishDate |
2024 |
url |
http://hdl.handle.net/10361/23554 |
work_keys_str_mv |
AT rhythmehsanurrahman automaticsubtitlegenerationforbengalimultimediausingdeeplearning AT arnobshafakatsowroar automaticsubtitlegenerationforbengalimultimediausingdeeplearning AT shuvorajvirahmed automaticsubtitlegenerationforbengalimultimediausingdeeplearning |
_version_ |
1814308647191183360 |