Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos

03/23/2016
by   Amir Shahroudy, et al.
0

Single modality action recognition on RGB or depth sequences has been extensively explored recently. It is generally accepted that each of these two modalities has different strengths and limitations for the task of action recognition. Therefore, analysis of the RGB+D videos can help us to better study the complementary properties of these two types of modalities and achieve higher levels of performance. In this paper, we propose a new deep autoencoder based shared-specific feature factorization network to separate input multimodal signals into a hierarchy of components. Further, based on the structure of the features, a structured sparsity learning machine is proposed which utilizes mixed norms to apply regularization within components and group selection between them for better classification performance. Our experimental results show the effectiveness of our cross-modality feature analysis framework by achieving state-of-the-art accuracy for action classification on five challenging benchmark datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2018

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks

Different from RGB videos, depth data in RGB-D videos provide key comple...
research
07/31/2015

Multimodal Multipart Learning for Action Recognition in Depth Videos

The articulated and complex nature of human actions makes the task of ac...
research
02/28/2017

Scene Flow to Action Map: A New Representation for RGB-D based Action Recognition with Convolutional Neural Networks

Scene flow describes the motion of 3D objects in real world and potentia...
research
06/19/2018

Modality Distillation with Multiple Stream Networks for Action Recognition

Diverse input data modalities can provide complementary cues for several...
research
12/23/2019

DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition

In this work, we address the problem of learning an ensemble of speciali...
research
03/16/2022

Know your sensORs x2013 A Modality Study For Surgical Action Classification

The surgical operating room (OR) presents many opportunities for automat...
research
08/10/2023

Ensemble Modeling for Multimodal Visual Action Recognition

In this work, we propose an ensemble modeling approach for multimodal ac...

Please sign up or login with your details

Forgot password? Click here to reset