THORN: Temporal Human-Object Relation Network for Action Recognition

04/20/2022
by   Mohammed Guermal, et al.
0

Most action recognition models treat human activities as unitary events. However, human activities often follow a certain hierarchy. In fact, many human activities are compositional. Also, these actions are mostly human-object interactions. In this paper we propose to recognize human action by leveraging the set of interactions that define an action. In this work, we present an end-to-end network: THORN, that can leverage important human-object and object-object interactions to predict actions. This model is built on top of a 3D backbone network. The key components of our model are: 1) An object representation filter for modeling object. 2) An object relation reasoning module to capture object relations. 3) A classification layer to predict the action labels. To show the robustness of THORN, we evaluate it on EPIC-Kitchen55 and EGTEA Gaze+, two of the largest and most challenging first-person and human-object interaction datasets. THORN achieves state-of-the-art performance on both datasets.

READ FULL TEXT

page 1

page 3

page 7

research
12/20/2019

Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks

Human action is naturally compositional: humans can easily recognize and...
research
10/05/2017

A self-organizing neural network architecture for learning human-object interactions

The visual recognition of transitive actions comprising human-object int...
research
12/19/2016

Asynchronous Temporal Fields for Action Recognition

Actions are more than just movements and trajectories: we cook to eat an...
research
10/11/2019

Interaction Relational Network for Mutual Action Recognition

Person-person mutual action recognition (also referred to as interaction...
research
01/17/2016

Face-space Action Recognition by Face-Object Interactions

Action recognition in still images has seen major improvement in recent ...
research
07/31/2020

LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities

Understanding and interpreting human actions is a long-standing challeng...
research
09/10/2019

Reasoning About Human-Object Interactions Through Dual Attention Networks

Objects are entities we act upon, where the functionality of an object i...

Please sign up or login with your details

Forgot password? Click here to reset