CortexNet: a Generic Network Family for Robust Visual Temporal Representations

06/08/2017
by   Alfredo Canziani, et al.
0

In the past five years we have observed the rise of incredibly well performing feed-forward neural networks trained supervisedly for vision related tasks. These models have achieved super-human performance on object recognition, localisation, and detection in still images. However, there is a need to identify the best strategy to employ these networks with temporal visual inputs and obtain a robust and stable representation of video data. Inspired by the human visual system, we propose a deep neural network family, CortexNet, which features not only bottom-up feed-forward connections, but also it models the abundant top-down feedback and lateral connections, which are present in our visual cortex. We introduce two training schemes - the unsupervised MatchNet and weakly supervised TempoNet modes - where a network learns how to correctly anticipate a subsequent frame in a video clip or the identity of its predominant subject, by learning egomotion clues and how to automatically track several objects in the current scene. Find the project website at https://engineering.purdue.edu/elab/CortexNet/.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/12/2015

Basic Level Categorization Facilitates Visual Object Recognition

Recent advances in deep learning have led to significant progress in the...
research
11/21/2017

Deep Sparse Coding for Invariant Multimodal Halle Berry Neurons

Deep feed-forward convolutional neural networks (CNNs) have become ubiqu...
research
04/21/2016

Humans and deep networks largely agree on which kinds of variation make object recognition harder

View-invariant object recognition is a challenging problem, which has at...
research
01/25/2019

A Neurally-Inspired Hierarchical Prediction Network for Spatiotemporal Sequence Learning and Prediction

In this paper we developed a hierarchical network model, called Hierarch...
research
03/10/2021

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, d...
research
10/16/2019

Adaptive and Iteratively Improving Recurrent Lateral Connections

The current leading computer vision models are typically feed forward ne...
research
12/08/2021

Symmetry Perception by Deep Networks: Inadequacy of Feed-Forward Architectures and Improvements with Recurrent Connections

Symmetry is omnipresent in nature and perceived by the visual system of ...

Please sign up or login with your details

Forgot password? Click here to reset