A viseme recognition system using lip curvature and neural networks to detect Bangla vowels

This thesis report is submitted in partial fulfilment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2016.

Bibliografske podrobnosti
Glavni avtor: Akhter, Nahid
Drugi avtorji: Chakrabarty, Dr. Amitabha
Format: Thesis
Jezik:English
Izdano: BRAC Univeristy 2017
Teme:
Online dostop:http://hdl.handle.net/10361/8366
id 10361-8366
record_format dspace
spelling 10361-83662022-01-26T07:38:48Z A viseme recognition system using lip curvature and neural networks to detect Bangla vowels Akhter, Nahid Chakrabarty, Dr. Amitabha Department of Computer Science and Engineering, BRAC University Neural network Lip curvature Viseme recognition This thesis report is submitted in partial fulfilment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2016. Cataloged from PDF version of thesis report. Includes bibliographical references (page 46-50). Automatic Speech Recognition plays an important role in human-computer interaction, which can be applied in various vital applications like crime-fighting and helping the hearing-impaired. It consists of two domains – Audio Speech Recognition and Visual Speech Recognition. This thesis is based on Recognition of Speech in the visual domain only, i.e. it involves recognizing speech without the presence or support of any auditory signal. So far, a lot of research has been done on lip-reading in English and some amount on French and Chinese, as well as few other languages, but not much research has been done on lip-reading in Bengali. This thesis work provides a new approach to lip reading Bengali vowels using a combination of the curvature of the inner and outer lips and Neural Networks. The method uses a more robust and faster algorithm to detect the lip contour than conventional methods used so far, such as Active Contour Model, Active Appearance Model and Active Shape Models. The method used for feature extraction is also new. It makes use of coefficients of the curves of the inner and outer lips. This way, it makes use of a lesser number of parameters to represent the shape of the lip when pronouncing a vowel. Moreover, the method is also robust to alignment of lips at different angles and can work with low resolution pictures also. Finally, for recognition of the viseme, a Backpropagation Neural Network is trained and simulated using gradient descent method. Nahid Akhter M. Computer Science and Engineering 2017-07-26T10:35:54Z 2017-07-26T10:35:54Z 2016 2016 Thesis ID 14166001 http://hdl.handle.net/10361/8366 en BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 50 pages application/pdf BRAC Univeristy
institution Brac University
collection Institutional Repository
language English
topic Neural network
Lip curvature
Viseme recognition
spellingShingle Neural network
Lip curvature
Viseme recognition
Akhter, Nahid
A viseme recognition system using lip curvature and neural networks to detect Bangla vowels
description This thesis report is submitted in partial fulfilment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2016.
author2 Chakrabarty, Dr. Amitabha
author_facet Chakrabarty, Dr. Amitabha
Akhter, Nahid
format Thesis
author Akhter, Nahid
author_sort Akhter, Nahid
title A viseme recognition system using lip curvature and neural networks to detect Bangla vowels
title_short A viseme recognition system using lip curvature and neural networks to detect Bangla vowels
title_full A viseme recognition system using lip curvature and neural networks to detect Bangla vowels
title_fullStr A viseme recognition system using lip curvature and neural networks to detect Bangla vowels
title_full_unstemmed A viseme recognition system using lip curvature and neural networks to detect Bangla vowels
title_sort viseme recognition system using lip curvature and neural networks to detect bangla vowels
publisher BRAC Univeristy
publishDate 2017
url http://hdl.handle.net/10361/8366
work_keys_str_mv AT akhternahid avisemerecognitionsystemusinglipcurvatureandneuralnetworkstodetectbanglavowels
AT akhternahid visemerecognitionsystemusinglipcurvatureandneuralnetworkstodetectbanglavowels
_version_ 1814309618044633088