Action Recognition from Single Timestamp Supervision in Untrimmed Videos

04/09/2019
by   Davide Moltisanti, et al.
0

Recognising actions in videos relies on labelled supervision during training, typically the start and end times of each action instance. This supervision is not only subjective, but also expensive to acquire. Weak video-level supervision has been successfully exploited for recognition in untrimmed videos, however it is challenged when the number of different actions in training videos increases. We propose a method that is supervised by single timestamps located around each action instance, in untrimmed videos. We replace expensive action bounds with sampling distributions initialised from these timestamps. We then use the classifier's response to iteratively update the sampling distributions. We demonstrate that these distributions converge to the location and extent of discriminative action segments. We evaluate our method on three datasets for fine-grained recognition, with increasing number of different actions per video, and show that single timestamps offer a reasonable compromise between recognition performance and labelling effort, performing comparably to full temporal supervision. Our update method improves top-1 test accuracy by up to 5.4

READ FULL TEXT

page 2

page 3

page 5

page 8

research
08/15/2021

Temporal Action Segmentation with High-level Complex Activity Labels

Over the past few years, the success in action recognition on short trim...
research
09/09/2016

Image and Video Mining through Online Learning

Within the field of image and video recognition, the traditional approac...
research
03/30/2020

Speech2Action: Cross-modal Supervision for Action Recognition

Is it possible to guess human action from dialogue alone? In this work w...
research
04/19/2021

Temporal Query Networks for Fine-grained Video Understanding

Our objective in this work is fine-grained classification of actions in ...
research
03/09/2017

UntrimmedNets for Weakly Supervised Action Recognition and Detection

Current action recognition methods heavily rely on trimmed videos for mo...
research
05/19/2020

Retrieving and Highlighting Action with Spatiotemporal Reference

In this paper, we present a framework that jointly retrieves and spatiot...
research
11/11/2019

Guided weak supervision for action recognition with scarce data to assess skills of children with autism

Diagnostic and intervention methodologies for skill assessment of autism...

Please sign up or login with your details

Forgot password? Click here to reset