Gender classification in Bangla language using deep learning-based voice analysis
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
Главные авторы: | , , , , |
---|---|
Другие авторы: | |
Формат: | Диссертация |
Язык: | English |
Опубликовано: |
Brac University
2024
|
Предметы: | |
Online-ссылка: | http://hdl.handle.net/10361/23636 |
id |
10361-23636 |
---|---|
record_format |
dspace |
spelling |
10361-236362024-07-03T05:09:15Z Gender classification in Bangla language using deep learning-based voice analysis Hakim, Talukder Juhaer Monsur, Sayema Binte Shuvo, Abtahi Maskawath Azrine, Tasmia Labib, Md. Zarif Alam, Md. Ashraful Department of Computer Science and Engineering, Brac University Deep learning Machine learning Bangla language F1-score Decision tree Inception V3 DenseNet-121 STFT MFCC Neural networks (Computer science) Cognitive learning theory (Deep learning) Automatic speech recognition--Data processing This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. Cataloged from PDF version of thesis. Includes bibliographical references (pages 48-50). Gender classification based on voice analysis is one of the elemental tasks in speech and audio processing, with various applications such as speech recognition systems, voice assistants, call center analytics, etc. For speech synthesis, speaker identification, and human-computer interaction- gender recognition plays a vital role. Although extensive research on this topic has been done in various languages, any studies can hardly be found regarding gender classification in the Bangla language. Our research paper aims to recognize gender in the Bangla language using deep learning approaches and voice analysis. The core of our approach involves the use of CNN models (ResNet50, EfficientNetB0, InceptionV3, and DenseNet-121) for our data training. The Mel-Frequency Cepstral Coefficients (MFCC) and short-time Fourier transforms (STFT) were computed from audio recordings and used as input features to the neural network model. The system’s excellent accuracy rate demonstrates its potential for use in practical settings. By providing light on the application of deep learning techniques in the context of the Bangla language, this study advances the area of gender identification. 95% accuracy was achieved in the InspectionV3 and EfficientNetB0 models with the MFCC input. Talukder Juhaer Hakim Sayema Binte Monsur Abtahi Maskawath Shuvo Tasmia Azrine Md. Zarif Labib B.Sc. in Computer Science 2024-07-02T06:20:57Z 2024-07-02T06:20:57Z ©2023 2023 Thesis ID 19301134 ID 19301030 ID 19301131 ID 20301165 ID 19301165 http://hdl.handle.net/10361/23636 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 59 pages application/pdf Brac University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Deep learning Machine learning Bangla language F1-score Decision tree Inception V3 DenseNet-121 STFT MFCC Neural networks (Computer science) Cognitive learning theory (Deep learning) Automatic speech recognition--Data processing |
spellingShingle |
Deep learning Machine learning Bangla language F1-score Decision tree Inception V3 DenseNet-121 STFT MFCC Neural networks (Computer science) Cognitive learning theory (Deep learning) Automatic speech recognition--Data processing Hakim, Talukder Juhaer Monsur, Sayema Binte Shuvo, Abtahi Maskawath Azrine, Tasmia Labib, Md. Zarif Gender classification in Bangla language using deep learning-based voice analysis |
description |
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. |
author2 |
Alam, Md. Ashraful |
author_facet |
Alam, Md. Ashraful Hakim, Talukder Juhaer Monsur, Sayema Binte Shuvo, Abtahi Maskawath Azrine, Tasmia Labib, Md. Zarif |
format |
Thesis |
author |
Hakim, Talukder Juhaer Monsur, Sayema Binte Shuvo, Abtahi Maskawath Azrine, Tasmia Labib, Md. Zarif |
author_sort |
Hakim, Talukder Juhaer |
title |
Gender classification in Bangla language using deep learning-based voice analysis |
title_short |
Gender classification in Bangla language using deep learning-based voice analysis |
title_full |
Gender classification in Bangla language using deep learning-based voice analysis |
title_fullStr |
Gender classification in Bangla language using deep learning-based voice analysis |
title_full_unstemmed |
Gender classification in Bangla language using deep learning-based voice analysis |
title_sort |
gender classification in bangla language using deep learning-based voice analysis |
publisher |
Brac University |
publishDate |
2024 |
url |
http://hdl.handle.net/10361/23636 |
work_keys_str_mv |
AT hakimtalukderjuhaer genderclassificationinbanglalanguageusingdeeplearningbasedvoiceanalysis AT monsursayemabinte genderclassificationinbanglalanguageusingdeeplearningbasedvoiceanalysis AT shuvoabtahimaskawath genderclassificationinbanglalanguageusingdeeplearningbasedvoiceanalysis AT azrinetasmia genderclassificationinbanglalanguageusingdeeplearningbasedvoiceanalysis AT labibmdzarif genderclassificationinbanglalanguageusingdeeplearningbasedvoiceanalysis |
_version_ |
1814307020387385344 |