UntrimmedNets for Weakly Supervised Action Recognition and Detection

03/09/2017
by   Limin Wang, et al.

Current action recognition methods rely heavily on trimmed videos for model training. However, acquiring a large-scale trimmed video dataset is expensive and time-consuming. This paper presents a new weakly supervised architecture, called UntrimmedNet, which learns action recognition models directly from untrimmed videos without requiring temporal annotations of action instances. UntrimmedNet couples two components: a classification module, which learns the action models, and a selection module, which reasons about the temporal duration of action instances. Both components are implemented with feed-forward networks, so UntrimmedNet is end-to-end trainable. We apply the learned models to weakly supervised action recognition (WSR) and detection (WSD) on the untrimmed video datasets THUMOS14 and ActivityNet. Although UntrimmedNet uses only weak supervision, it achieves performance superior or comparable to that of strongly supervised approaches on these two datasets.
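The coupling of the two modules can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a soft, attention-style selection in which per-clip class probabilities are averaged using weights normalized across clips, and the names `video_prediction`, `class_scores`, and `selection_scores` are hypothetical. In practice both sets of scores would come from feed-forward networks applied to clip features; here they are passed in directly.

```python
import math
from typing import List, Tuple


def softmax(xs: List[float]) -> List[float]:
    """Numerically stable softmax over a list of raw scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def video_prediction(
    class_scores: List[List[float]],
    selection_scores: List[float],
) -> Tuple[List[float], List[float]]:
    """Combine per-clip outputs into one video-level prediction.

    class_scores:     raw class scores per clip, shape [num_clips][num_classes]
                      (assumed output of the classification module).
    selection_scores: one raw importance score per clip
                      (assumed output of the selection module).
    Returns (video_probs, clip_weights).
    """
    # Classification module: normalize over classes within each clip.
    clip_probs = [softmax(row) for row in class_scores]
    # Selection module (soft variant, an assumption): normalize over clips.
    clip_weights = softmax(selection_scores)
    num_classes = len(class_scores[0])
    # Video-level prediction: selection-weighted average of clip predictions.
    video_probs = [
        sum(w * p[c] for w, p in zip(clip_weights, clip_probs))
        for c in range(num_classes)
    ]
    return video_probs, clip_weights


# Two clips, two classes: clip 0 favors class 0 and is rated more relevant.
probs, weights = video_prediction([[2.0, 0.0], [0.0, 2.0]], [3.0, 0.0])
```

Because only the video-level label supervises `video_probs`, the clip weights are learned without temporal annotations. Under this sketch, the clips with the largest weights would be the natural candidates for locating the action in time.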


research
01/21/2020

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understa...
research
01/16/2020

Learning Spatiotemporal Features via Video and Text Pair Discrimination

Current video representations heavily rely on learning from manually ann...
research
02/20/2019

Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision

Action recognition in videos has attracted a lot of attention in the pas...
research
11/11/2019

Guided weak supervision for action recognition with scarce data to assess skills of children with autism

Diagnostic and intervention methodologies for skill assessment of autism...
research
07/19/2017

Discriminative convolutional Fisher vector network for action recognition

In this work we propose a novel neural network architecture for the prob...
research
12/19/2016

Asynchronous Temporal Fields for Action Recognition

Actions are more than just movements and trajectories: we cook to eat an...
research
04/09/2019

Action Recognition from Single Timestamp Supervision in Untrimmed Videos

Recognising actions in videos relies on labelled supervision during trai...
