Tri-modal ensemble for enhanced Bangla sign language recognition

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.

Dades bibliogràfiques
Autors principals: Shams, Khan Abrar, Reaz, Md. Rafid, Islam, Sanjida, Rafi, Mohammad Ryan Ur
Altres autors: Rahman, Md. Shahriar
Format: Thesis
Idioma:English
Publicat: Brac University 2024
Matèries:
Accés en línia:http://hdl.handle.net/10361/23577
id 10361-23577
record_format dspace
spelling 10361-235772024-06-25T21:01:13Z Tri-modal ensemble for enhanced Bangla sign language recognition Shams, Khan Abrar Reaz, Md. Rafid Islam, Sanjida Rafi, Mohammad Ryan Ur Rahman, Md. Shahriar Rahman, Rafeed Department of Computer Science and Engineering, Brac University Bangla sign language Convolutional neural network Ensemble method Neural networks (Computer science) Computer linguistics This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. Cataloged from PDF version of thesis. Includes bibliographical references (pages 49-52). Sign language is the most common method of communication for people with disabling hearing loss. Bangladesh, where BdSL is prominently used among the disabling people, finds communicating with the general mass challenging. Thus, a system to understand BdSL accurately and efficiently has become a popular demand. Deep learning architectures such as CNN, ANN, RNN, and Axis Independent LSTM can interpret Bangla Sign Language into readable digital wording. Commonly, an image-based sign language recognition system contains a recording camera that continuously sends images to a model. The model then provides a prediction based on those images. However, it creates a lot of uncertainty variables, such as the lighting issue, noisy background, skin color, and hand orientations. To this end, we propose a procedure that can reduce this uncertainty variable by considering three different modalities, spatial information, skeleton awareness, and edge awareness. We propose three image pre-processing techniques and integrate three convolutional neural network models. Finally, we tested out nine different ensemble meta-learning algorithms where five of the algorithms are modifications of averaging and voting techniques. As a result, our proposed model achieved higher training accuracy at 99.77%, 98.11%, and 99.30% than any other state-of-the-art image classification architectures except for ResNet50 at 99.87%. We achieved the highest accuracy of 95.13% on the testing set. This research shows that considering multiple modalities can improve the system’s overall performance. Khan Abrar Shams Md. Rafid Reaz Sanjida Islam Mohammad Ryan Ur Rafi B.Sc in Computer Science 2024-06-25T06:48:54Z 2024-06-25T06:48:54Z ©2023 2023-09 Thesis ID 19201052 ID 19201044 ID 20101615 ID 22241152 http://hdl.handle.net/10361/23577 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 61 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic Bangla sign language
Convolutional neural network
Ensemble method
Neural networks (Computer science)
Computer linguistics
spellingShingle Bangla sign language
Convolutional neural network
Ensemble method
Neural networks (Computer science)
Computer linguistics
Shams, Khan Abrar
Reaz, Md. Rafid
Islam, Sanjida
Rafi, Mohammad Ryan Ur
Tri-modal ensemble for enhanced Bangla sign language recognition
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
author2 Rahman, Md. Shahriar
author_facet Rahman, Md. Shahriar
Shams, Khan Abrar
Reaz, Md. Rafid
Islam, Sanjida
Rafi, Mohammad Ryan Ur
format Thesis
author Shams, Khan Abrar
Reaz, Md. Rafid
Islam, Sanjida
Rafi, Mohammad Ryan Ur
author_sort Shams, Khan Abrar
title Tri-modal ensemble for enhanced Bangla sign language recognition
title_short Tri-modal ensemble for enhanced Bangla sign language recognition
title_full Tri-modal ensemble for enhanced Bangla sign language recognition
title_fullStr Tri-modal ensemble for enhanced Bangla sign language recognition
title_full_unstemmed Tri-modal ensemble for enhanced Bangla sign language recognition
title_sort tri-modal ensemble for enhanced bangla sign language recognition
publisher Brac University
publishDate 2024
url http://hdl.handle.net/10361/23577
work_keys_str_mv AT shamskhanabrar trimodalensembleforenhancedbanglasignlanguagerecognition
AT reazmdrafid trimodalensembleforenhancedbanglasignlanguagerecognition
AT islamsanjida trimodalensembleforenhancedbanglasignlanguagerecognition
AT rafimohammadryanur trimodalensembleforenhancedbanglasignlanguagerecognition
_version_ 1814307299570745344