Efficient Spatio-temporal feature extraction for human action recognition

This thesis is submitted in partial fulfilment of the requirements for the degree of Master of Engineering in Computer Science and Engineering, 2021.

书目详细资料
主要作者:	Ghosh, Dipon Kumar
其他作者:	Chakrabarty, Amitabha
格式:	Thesis
语言:	English
出版:	Brac University 2022
主题:	Human action recognition (HAR) Surveillance systems Violence detection Skeleton-based human action recognition Convolutional neural network (CNN) Graph convolutional networks (GCN) Feature fusion Human activity recognition Neural network (Computer Science)
在线阅读:	http://hdl.handle.net/10361/15946

id	10361-15946
record_format	dspace
spelling	10361-159462022-01-26T07:38:45Z Efficient Spatio-temporal feature extraction for human action recognition Ghosh, Dipon Kumar Chakrabarty, Amitabha Department of Computer Science and Engineering, Brac University Human action recognition (HAR) Surveillance systems Violence detection Skeleton-based human action recognition Convolutional neural network (CNN) Graph convolutional networks (GCN) Feature fusion Human activity recognition Neural network (Computer Science) This thesis is submitted in partial fulfilment of the requirements for the degree of Master of Engineering in Computer Science and Engineering, 2021. Cataloged from PDF version of thesis. Includes bibliographical references (pages 67-75). Human actuation recognition (HAR) has been performed using current deep learning (DL) algorithms using a variety of input formats, including video footage, optical flow, and even skeleton points, which may be acquired via depth sensors or pose estimation technologies. Recent techniques, on the other hand, are computationally costly and have a high memory footprint, making them unsuitable for use in realworld environments. Furthermore, the design of existing techniques does not allow for the full extraction of spatial and temporal characteristics of an action, and as a result, information is lost throughout the recognition process. Here, we present a novel framework for action recognition that extracts spatial and temporal characteristics separately while reducing the amount of information lost by a substantial amount. The multi-dimensional convolutional network (MDCN) and the redefined spatio-temporal graph convolutional network (RSTCN) are two models developed in accordance with this framework. In both cases, spatial and temporal information are extracted irrespective of the precise spatio-temporal location. Our approach was evaluated in two particular aspects of human action recognition, namely violence detection and skeleton-based action recognition, in order to ensure that our models were accurate and reliable. In spite of being cost e↵ective and having less parameters, our proposed MDCN achieved 87.5% accuracy in the largest violence detection benchmark dataset and RST-GCN obtained 92.2% accuracy on the skeleton dataset. The performance of our models edge devices with limited resources, which are suitable for deploying at real-world environments is also also analyze and compare, such as surveillance system and smart healthcare system. The proposed MDCN model processes 80 frames per second on edge device such as, Nvidia Jetson Nano and RST-GCN performs at a speed of 993 frames per second. Our proposed methods o↵er a strong balance between accuracy, memory consumption, and processing time, which make them suitable for deploying at real-world environments. Dipon Kumar Ghosh M. Computer Science and Engineering 2022-01-17T06:37:05Z 2022-01-17T06:37:05Z 2021 2021-11 Thesis ID 19366007 http://hdl.handle.net/10361/15946 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 75 pages application/pdf Brac University
institution	Brac University
collection	Institutional Repository
language	English
topic	Human action recognition (HAR) Surveillance systems Violence detection Skeleton-based human action recognition Convolutional neural network (CNN) Graph convolutional networks (GCN) Feature fusion Human activity recognition Neural network (Computer Science)
spellingShingle	Human action recognition (HAR) Surveillance systems Violence detection Skeleton-based human action recognition Convolutional neural network (CNN) Graph convolutional networks (GCN) Feature fusion Human activity recognition Neural network (Computer Science) Ghosh, Dipon Kumar Efficient Spatio-temporal feature extraction for human action recognition
description	This thesis is submitted in partial fulfilment of the requirements for the degree of Master of Engineering in Computer Science and Engineering, 2021.
author2	Chakrabarty, Amitabha
author_facet	Chakrabarty, Amitabha Ghosh, Dipon Kumar
format	Thesis
author	Ghosh, Dipon Kumar
author_sort	Ghosh, Dipon Kumar
title	Efficient Spatio-temporal feature extraction for human action recognition
title_short	Efficient Spatio-temporal feature extraction for human action recognition
title_full	Efficient Spatio-temporal feature extraction for human action recognition
title_fullStr	Efficient Spatio-temporal feature extraction for human action recognition
title_full_unstemmed	Efficient Spatio-temporal feature extraction for human action recognition
title_sort	efficient spatio-temporal feature extraction for human action recognition
publisher	Brac University
publishDate	2022
url	http://hdl.handle.net/10361/15946
work_keys_str_mv	AT ghoshdiponkumar efficientspatiotemporalfeatureextractionforhumanactionrecognition
_version_	1814307065894535168

Efficient Spatio-temporal feature extraction for human action recognition

相似书籍