Early threat warning via speech and emotion recognition from voice calls

This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.

Bibliografiska uppgifter
Huvudupphovsmän: Ishtiak, Ifaz, Rahman, Mohammad Mazedur, Usmani, Md.Razaul Haque
Övriga upphovsmän: Arif, Hossain
Materialtyp: Lärdomsprov
Språk:English
Publicerad: BRAC University 2019
Ämnen:
Länkar:http://hdl.handle.net/10361/11412
id 10361-11412
record_format dspace
spelling 10361-114122022-01-26T10:08:18Z Early threat warning via speech and emotion recognition from voice calls Ishtiak, Ifaz Rahman, Mohammad Mazedur Usmani, Md.Razaul Haque Arif, Hossain Department of Computer Science and Engineering, BRAC University Emotion recognition Vector machines Speech to Text Random forest Feature extraction MFCC Human-computer interaction. Artificial intelligence. Emotions -- Computer simulation. This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. Includes bibliographical references (pages 53-56). Cataloged from PDF version of thesis. The aim of this system is to identify potential cases of threats, and provide an early warning or alert to such cases. This will be based on voice such as voice chat over telecommunication networks or social media. The intended result will be achieved in three major steps. At first, the conversion of speech to text from both real time audio recordings and from accent groups will be applied using primarily IBM Watson’s Speech to Text. This will then be used to identify possible trigger words or word patterns from a classified selection of threat-related and negative words. And finally, the same audio source will be utilized for detecting emotions from the frequency shifts through vocal feature extraction from audio input and processing it using multiple classifier algorithms such as Support Vector Machines (SVMs), Random Forests and Naïve Bayes. Libraries such as LibROSA will be applied to extract primary audio features such as Mel Frequency Cepstral Coefficients (MFCC) to generate accurate predictions. The system yields a result of approximately 84% using the SVM RBF (Radial Basis Function) kernel, which highlights the accuracy of emotion detected based on the speech. Keywords— Emotion Recognition; Support Vector Machines; Speech to Text; Random Forest; Feature Extraction; MFCC Ifaz Ishtiak Rahman, Mohammad Mazedur Md.Razaul Haque Usmani B. Computer Science and Engineering 2019-02-14T05:37:49Z 2019-02-14T05:37:49Z 2018 2018-12 Thesis ID 15101118 ID 15101043 ID 14241005 http://hdl.handle.net/10361/11412 en BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 63 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
topic Emotion recognition
Vector machines
Speech to Text
Random forest
Feature extraction
MFCC
Human-computer interaction.
Artificial intelligence.
Emotions -- Computer simulation.
spellingShingle Emotion recognition
Vector machines
Speech to Text
Random forest
Feature extraction
MFCC
Human-computer interaction.
Artificial intelligence.
Emotions -- Computer simulation.
Ishtiak, Ifaz
Rahman, Mohammad Mazedur
Usmani, Md.Razaul Haque
Early threat warning via speech and emotion recognition from voice calls
description This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.
author2 Arif, Hossain
author_facet Arif, Hossain
Ishtiak, Ifaz
Rahman, Mohammad Mazedur
Usmani, Md.Razaul Haque
format Thesis
author Ishtiak, Ifaz
Rahman, Mohammad Mazedur
Usmani, Md.Razaul Haque
author_sort Ishtiak, Ifaz
title Early threat warning via speech and emotion recognition from voice calls
title_short Early threat warning via speech and emotion recognition from voice calls
title_full Early threat warning via speech and emotion recognition from voice calls
title_fullStr Early threat warning via speech and emotion recognition from voice calls
title_full_unstemmed Early threat warning via speech and emotion recognition from voice calls
title_sort early threat warning via speech and emotion recognition from voice calls
publisher BRAC University
publishDate 2019
url http://hdl.handle.net/10361/11412
work_keys_str_mv AT ishtiakifaz earlythreatwarningviaspeechandemotionrecognitionfromvoicecalls
AT rahmanmohammadmazedur earlythreatwarningviaspeechandemotionrecognitionfromvoicecalls
AT usmanimdrazaulhaque earlythreatwarningviaspeechandemotionrecognitionfromvoicecalls
_version_ 1814307255869243392