Automatic subtitle generation for Bengali multimedia using deep learning

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.

Bibliographic Details
Main Authors: Rhythm, Ehsanur Rahman, Arnob, Shafakat Sowroar, Shuvo, Rajvir Ahmed
Other Authors: Jahan, Sifat E
Format: Thesis
Language: English
Published: Brac University 2024
Subjects:
Online Access: http://hdl.handle.net/10361/23554
id 10361-23554
record_format dspace
spelling 10361-23554
2024-06-25T21:03:38Z
Automatic subtitle generation for Bengali multimedia using deep learning
Rhythm, Ehsanur Rahman
Arnob, Shafakat Sowroar
Shuvo, Rajvir Ahmed
Jahan, Sifat E
Rasel, Annajiat Alim
Department of Computer Science and Engineering, Brac University
Automatic subtitle generation
Bengali audio
Deep learning
Natural language processing
Natural language processing (Computer science)
Computational linguistics
Data mining
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
Cataloged from PDF version of thesis.
Includes bibliographical references (pages 51-53).
Automatic subtitle generation is essential for making audio and video material more inclusive and accessible. Applying this technology to Bengali, however, presents significant challenges because of scarce resources and linguistic complexity. This study introduces a new deep-learning-based system that automatically creates subtitles for Bengali multimedia. The proposed approach uses the Wav2vec2 model and the Common Voice Bengali dataset, a large collection of Bengali audio recordings; the model is trained and tuned on this dataset to convert Bengali audio into text accurately. The system combines current automatic speech recognition (ASR) approaches with Bengali language-specific factors to produce accurate and reliable transcriptions. During subtitle production, the transcribed text is synced with the matching audio segments. The produced subtitles are then refined with post-processing techniques, such as capitalization and punctuation restoration, to ensure readability and consistency. The findings of this study might greatly improve the usability and availability of Bengali-language media across a range of sectors. The generated subtitles may enhance the viewing experience for Bengali multimedia by aiding comprehension and expanding availability. The study demonstrates the potential of deep learning and ASR methods to overcome the difficulties of automated subtitle production in Bengali, advancing multimedia availability and inclusion.
Ehsanur Rahman Rhythm
Shafakat Sowroar Arnob
Rajvir Ahmed Shuvo
B.Sc in Computer Science
2024-06-25T03:29:58Z
2024-06-25T03:29:58Z
©2023
2023-09
Thesis
ID 22241163
ID 20101129
ID 20141003
http://hdl.handle.net/10361/23554
en
Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
62 pages
application/pdf
Brac University
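The abstract in this record mentions that transcribed text is synced with the matching audio segments during subtitle production. The record does not say how the thesis does this or which subtitle format it emits, so the following is only a minimal sketch, assuming the recognizer yields (start, end, text) segments timed in seconds and that SubRip (SRT) is an acceptable output format. The helper names `srt_timestamp` and `to_srt` are hypothetical and are not taken from the thesis.

```python
from typing import Iterable, Tuple

# Assumption: each segment is (start_seconds, end_seconds, transcribed_text).
Segment = Tuple[float, float, str]


def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render timed transcript segments as numbered SRT subtitle blocks."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}"
        )
    # SRT separates blocks with a blank line.
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    # Illustrative segments only; these are not results from the thesis.
    demo = [(0.0, 1.5, "আমি"), (1.5, 3.0, "ভাত খাই")]
    print(to_srt(demo))
```

In a sketch like this, the segment boundaries would come from whatever alignment step the recognizer provides; the formatter only turns already-timed text into subtitle blocks.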
institution Brac University
collection Institutional Repository
language English
topic Automatic subtitle generation
Bengali audio
Deep learning
Natural language processing
Natural language processing (Computer science)
Computational linguistics
Data mining
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
author2 Jahan, Sifat E
author_facet Jahan, Sifat E
Rhythm, Ehsanur Rahman
Arnob, Shafakat Sowroar
Shuvo, Rajvir Ahmed
format Thesis
author Rhythm, Ehsanur Rahman
Arnob, Shafakat Sowroar
Shuvo, Rajvir Ahmed
author_sort Rhythm, Ehsanur Rahman
title Automatic subtitle generation for Bengali multimedia using deep learning
publisher Brac University
publishDate 2024
url http://hdl.handle.net/10361/23554