Location Dependency in Video Prediction

10/11/2018
by Niloofar Azizi, et al.

Deep convolutional neural networks are used to address many computer vision problems, including video prediction. The task of video prediction requires analyzing the video frames, temporally and spatially, and constructing a model of how the environment evolves. Convolutional neural networks are spatially invariant, though, which prevents them from modeling location-dependent patterns. In this work, the authors propose location-biased convolutional layers to overcome this limitation. The effectiveness of location bias is evaluated on two architectures: Video Ladder Network (VLN) and Convolutional Predictive Gating Pyramid (Conv-PGP). The results indicate that encoding location-dependent features is crucial for the task of video prediction. The proposed methods significantly outperform spatially invariant models.
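
The abstract does not spell out how the location bias is implemented, so the following is only a minimal sketch under one assumption: that a location-biased layer adds a learned per-pixel bias map to the output of an ordinary convolution. The function names (`conv2d_same`, `location_biased_conv2d`), the kernel, and the bias values are hypothetical and chosen purely for illustration; they are not taken from the paper. The sketch shows why a plain convolution responds identically to the same pattern at two positions, while the biased variant can tell the positions apart.

```python
def conv2d_same(image, kernel):
    """Zero-padded, same-size 2D cross-correlation of a single-channel image."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for a in range(kh):
                for b in range(kw):
                    y, x = i + a - ph, j + b - pw
                    if 0 <= y < h and 0 <= x < w:
                        acc += kernel[a][b] * image[y][x]
            out[i][j] = acc
    return out


def location_biased_conv2d(image, kernel, location_bias):
    """Assumed formulation: convolution output plus one bias value per pixel."""
    conv = conv2d_same(image, kernel)
    return [
        [conv[i][j] + location_bias[i][j] for j in range(len(conv[0]))]
        for i in range(len(conv))
    ]


def blank(h, w):
    """Return an h-by-w image filled with zeros."""
    return [[0.0] * w for _ in range(h)]


# The same single-pixel pattern placed at two different locations.
img_a = blank(5, 5)
img_a[1][1] = 1.0
img_b = blank(5, 5)
img_b[3][3] = 1.0

# Illustrative kernel: doubles the centre pixel and ignores its neighbours.
kernel = [[0.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 0.0]]

# Hypothetical "learned" bias map: 0 in the top three rows, 1 in the bottom two.
bias = [[0.0] * 5 if i < 3 else [1.0] * 5 for i in range(5)]

# A plain convolution gives the same response wherever the pattern sits.
plain_a = conv2d_same(img_a, kernel)[1][1]  # 2.0
plain_b = conv2d_same(img_b, kernel)[3][3]  # 2.0

# With the per-pixel bias, the response depends on the location.
biased_a = location_biased_conv2d(img_a, kernel, bias)[1][1]  # 2.0
biased_b = location_biased_conv2d(img_b, kernel, bias)[3][3]  # 3.0
```

In a trained network the bias map would be a learned parameter, typically one map per output channel, rather than a hand-written constant as in this sketch.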

research
10/10/2015

DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations

Understanding and predicting the human visual attentional mechanism is a...
research
02/02/2020

Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks

A number of techniques for interpretability have been presented for deep...
research
04/12/2016

Video Description using Bidirectional Recurrent Neural Networks

Although traditionally used in the machine translation field, the encode...
research
09/21/2023

Video Scene Location Recognition with Neural Networks

This paper provides an insight into the possibility of scene recognition...
research
10/16/2016

Location Sensitive Deep Convolutional Neural Networks for Segmentation of White Matter Hyperintensities

The anatomical location of imaging features is of crucial importance for...
research
05/28/2017

Continuous Video to Simple Signals for Swimming Stroke Detection with Convolutional Neural Networks

In many sports, it is useful to analyse video of an athlete in competiti...
research
07/03/2017

Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video

We present an automatic method to describe clinically useful information...
