Multi-Modal Three-Stream Network for Action Recognition

09/08/2019
by   Muhammad Usman Khalid, et al.
0

Human action recognition in video is an active yet challenging research topic due to high variation and complexity of data. In this paper, a novel video based action recognition framework utilizing complementary cues is proposed to handle this complex problem. Inspired by the successful two stream networks for action classification, additional pose features are studied and fused to enhance understanding of human action in a more abstract and semantic way. Towards practices, not only ground truth poses but also noisy estimated poses are incorporated in the framework with our proposed pre-processing module. The whole framework and each cue are evaluated on varied benchmarking datasets as JHMDB, sub-JHMDB and Penn Action. Our results outperform state-of-the-art performance on these datasets and show the strength of complementary cues.

READ FULL TEXT
research
04/03/2017

Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection

General human action recognition requires understanding of various visua...
research
05/22/2018

Pose-Based Two-Stream Relational Networks for Action Recognition in Videos

Recently, pose-based action recognition has gained more and more attenti...
research
06/11/2015

P-CNN: Pose-based CNN Features for Action Recognition

This work targets human action recognition in video. While recent method...
research
08/15/2016

Depth2Action: Exploring Embedded Depth for Large-Scale Action Recognition

This paper performs the first investigation into depth for large-scale h...
research
04/01/2016

Learning a Pose Lexicon for Semantic Action Recognition

This paper presents a novel method for learning a pose lexicon comprisin...
research
08/29/2023

IndGIC: Supervised Action Recognition under Low Illumination

Technologies of human action recognition in the dark are gaining more an...
research
09/15/2019

Multitask Learning to Improve Egocentric Action Recognition

In this work we employ multitask learning to capitalize on the structure...

Please sign up or login with your details

Forgot password? Click here to reset