Rethinking the Faster R-CNN Architecture for Temporal Action Localization

04/20/2018
by   Yu-Wei Chao, et al.
0

We propose TAL-Net, an improved approach to temporal action localization in video that is inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key shortcomings of existing approaches: (1) we improve receptive field alignment using a multi-scale architecture that can accommodate extreme variation in action durations; (2) we better exploit the temporal context of actions for both proposal generation and action classification by appropriately extending receptive fields; and (3) we explicitly consider multi-stream feature fusion and demonstrate that fusing motion late is important. We achieve state-of-the-art performance for both action proposal and localization on THUMOS'14 detection benchmark and competitive performance on ActivityNet challenge.

READ FULL TEXT

page 8

page 12

page 13

research
07/21/2017

Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017

In this notebook paper, we describe our approach in the submission to th...
research
07/29/2019

Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2

This technical report presents an overview of our solution used in the s...
research
09/09/2019

Gaussian Temporal Awareness Networks for Action Localization

Temporally localizing actions in a video is a fundamental challenge in v...
research
01/04/2021

Global2Local: Efficient Structure Search for Video Action Segmentation

Temporal receptive fields of models play an important role in action seg...
research
08/02/2019

Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos

Temporal action localization is a recently-emerging task, aiming to loca...
research
07/03/2022

SSD-Faster Net: A Hybrid Network for Industrial Defect Inspection

The quality of industrial components is critical to the production of sp...
research
02/18/2020

Constraining Temporal Relationship for Action Localization

Recently, temporal action localization (TAL), i.e., finding specific act...

Please sign up or login with your details

Forgot password? Click here to reset