PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition

06/09/2020
by   Yuecong Xu, et al.
0

Long-range spatiotemporal dependencies capturing plays an essential role in improving video features for action recognition. The non-local block inspired by the non-local means is designed to address this challenge and have shown excellent performance. However, the non-local block brings significant increase in computation cost to the original network. It also lacks the ability to model regional correlation in videos. To address the above limitations, we propose Pyramid Non-Local (PNL) module, which extends the non-local block by incorporating regional correlation at multiple scales through a pyramid structured module. This extension upscales the effectiveness of non-local operation by attending to the interaction between different regions. Empirical results prove the effectiveness and efficiency of our PNL module, which achieves state-of-the-art performance of 83.09 with decreased computation cost compared to the non-local block.

READ FULL TEXT

page 3

page 18

page 19

page 21

research
08/21/2019

Asymmetric Non-local Neural Networks for Semantic Segmentation

The non-local module works as a particularly useful technique for semant...
research
10/31/2018

Compact Generalized Non-local Network

The non-local module is designed for capturing long-range spatio-tempora...
research
08/22/2020

PNEN: Pyramid Non-Local Enhanced Networks

Existing neural networks proposed for low-level image processing tasks a...
research
03/20/2021

Efficient Spatialtemporal Context Modeling for Action Recognition

Contextual information plays an important role in action recognition. Lo...
research
06/11/2020

Disentangled Non-Local Neural Networks

The non-local block is a popular module for strengthening the context mo...
research
03/11/2019

Spatial-Aware Non-Local Attention for Fashion Landmark Detection

Fashion landmark detection is a challenging task even using the current ...
research
11/16/2018

Relational Long Short-Term Memory for Video Action Recognition

Spatial and temporal relationships, both short-range and long-range, bet...

Please sign up or login with your details

Forgot password? Click here to reset