Action Machine: Rethinking Action Recognition in Trimmed Videos

12/14/2018
by   Jiagang Zhu, et al.
26

Existing methods in video action recognition mostly do not distinguish human body from the environment and easily overfit the scenes and objects. In this work, we present a conceptually simple, general and high-performance framework for action recognition in trimmed videos, aiming at person-centric modeling. The method, called Action Machine, takes as inputs the videos cropped by person bounding boxes. It extends the Inflated 3D ConvNet (I3D) by adding a branch for human pose estimation and a 2D CNN for pose-based action recognition, being fast to train and test. Action Machine can benefit from the multi-task training of action recognition and pose estimation, the fusion of predictions from RGB images and poses. On NTU RGB-D, Action Machine achieves the state-of-the-art performance with top-1 accuracies of 97.2 cross-subject respectively. Action Machine also achieves competitive performance on another three smaller action recognition datasets: Northwestern UCLA Multiview Action3D, MSR Daily Activity3D and UTD-MHAD. Code will be made available.

READ FULL TEXT

page 1

page 3

page 6

page 8

page 12

page 13

research
12/18/2022

2D Pose Estimation based Child Action Recognition

We present a graph convolutional network with 2D pose estimation for the...
research
04/08/2017

First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations

In this work we study the use of 3D hand poses to recognize first-person...
research
12/09/2019

Synthetic Humans for Action Recognition from Unseen Viewpoints

Our goal in this work is to improve the performance of human action reco...
research
04/03/2023

On the Benefits of 3D Pose and Tracking for Human Action Recognition

In this work we study the benefits of using tracking and 3D poses for ac...
research
12/11/2018

Loss Guided Activation for Action Recognition in Still Images

One significant problem of deep-learning based human action recognition ...
research
10/29/2018

ActionXPose: A Novel 2D Multi-view Pose-based Algorithm for Real-time Human Action Recognition

We present ActionXPose, a novel 2D pose-based algorithm for posture-leve...
research
07/04/2017

Learning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition

We propose Human Pose Models that represent RGB and depth images of huma...

Please sign up or login with your details

Forgot password? Click here to reset