Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization

05/01/2022
by   Qinying Liu, et al.
0

In weakly-supervised temporal action localization (WS-TAL), the methods commonly follow the "localization by classification" procedure, which uses the snippet predictions to form video class scores and then optimizes a video classification loss. In this procedure, the snippet predictions (or snippet attention weights) are used to separate foreground and background. However, the snippet predictions are usually inaccurate due to absence of frame-wise labels, and then the overall performance is hindered. In this paper, we propose a novel C^3BN to achieve robust snippet predictions. C^3BN includes two key designs by exploring the inherent characteristics of video data. First, because of the natural continuity of adjacent snippets, we propose a micro data augmentation strategy to increase the diversity of snippets with convex combination of adjacent snippets. Second, we propose a macro-micro consistency regularization strategy to force the model to be invariant (or equivariant) to the transformations of snippets with respect to video semantics, snippet predictions and snippet features. Experimental results demonstrate the effectiveness of our proposed method on top of baselines for the WS-TAL tasks with video-level and point-level supervision.

READ FULL TEXT
research
06/22/2022

Weakly-supervised Action Localization via Hierarchical Mining

Weakly-supervised action localization aims to localize and classify acti...
research
05/04/2023

Weakly-supervised Micro- and Macro-expression Spotting Based on Multi-level Consistency

Most micro- and macro-expression spotting methods in untrimmed videos su...
research
07/14/2022

Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization

With video-level labels, weakly supervised temporal action localization ...
research
08/14/2021

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization

As a challenging task of high-level video understanding, weakly supervis...
research
02/04/2020

Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks

We present a method for weakly-supervised action localization based on g...
research
09/14/2021

Tribrid: Stance Classification with Neural Inconsistency Detection

We study the problem of performing automatic stance classification on so...
research
03/27/2020

Weakly-Supervised Action Localization by Generative Attention Modeling

Weakly-supervised temporal action localization is a problem of learning ...

Please sign up or login with your details

Forgot password? Click here to reset