Skeleton based Activity Recognition by Fusing Part-wise Spatio-temporal and Attention Driven Residues

by   Chhavi Dhiman, et al.

There exist a wide range of intra class variations of the same actions and inter class similarity among the actions, at the same time, which makes the action recognition in videos very challenging. In this paper, we present a novel skeleton-based part-wise Spatiotemporal CNN RIAC Network-based 3D human action recognition framework to visualise the action dynamics in part wise manner and utilise each part for action recognition by applying weighted late fusion mechanism. Part wise skeleton based motion dynamics helps to highlight local features of the skeleton which is performed by partitioning the complete skeleton in five parts such as Head to Spine, Left Leg, Right Leg, Left Hand, Right Hand. The RIAFNet architecture is greatly inspired by the InceptionV4 architecture which unified the ResNet and Inception based Spatio-temporal feature representation concept and achieving the highest top-1 accuracy till date. To extract and learn salient features for action recognition, attention driven residues are used which enhance the performance of residual components for effective 3D skeleton-based Spatio-temporal action representation. The robustness of the proposed framework is evaluated by performing extensive experiments on three challenging datasets such as UT Kinect Action 3D, Florence 3D action Dataset, and MSR Daily Action3D datasets, which consistently demonstrate the superiority of our method


page 6

page 9

page 10

page 11

page 15


Skepxels: Spatio-temporal Image Representation of Human Skeleton Joints for Action Recognition

Human skeleton joints are popular for action analysis since they can be ...

Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

The task of skeleton-based action recognition remains a core challenge i...

Action Capsules: Human Skeleton Action Recognition

Due to the compact and rich high-level representations offered, skeleton...

Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling

This paper simultaneously addresses three limitations associated with co...

Skeleton-Based Online Action Prediction Using Scale Selection Network

Action prediction is to recognize the class label of an ongoing activity...

Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition

3D action recognition - analysis of human actions based on 3D skeleton d...

HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment

Human actions involve complex pose variations and their 2D projections c...

Please sign up or login with your details

Forgot password? Click here to reset