Deep Quantization: Encoding Convolutional Activations with Deep Generative Model

11/29/2016
by   Zhaofan Qiu, et al.
0

Deep convolutional neural networks (CNNs) have proven highly effective for visual recognition, where learning a universal representation from activations of convolutional layer plays a fundamental problem. In this paper, we present Fisher Vector encoding with Variational Auto-Encoder (FV-VAE), a novel deep architecture that quantizes the local activations of convolutional layer in a deep generative model, by training them in an end-to-end manner. To incorporate FV encoding strategy into deep generative models, we introduce Variational Auto-Encoder model, which steers a variational inference and learning in a neural network which can be straightforwardly optimized using standard stochastic gradient method. Different from the FV characterized by conventional generative models (e.g., Gaussian Mixture Model) which parsimoniously fit a discrete mixture model to data distribution, the proposed FV-VAE is more flexible to represent the natural property of data for better generalization. Extensive experiments are conducted on three public datasets, i.e., UCF101, ActivityNet, and CUB-200-2011 in the context of video action recognition and fine-grained image classification, respectively. Superior results are reported when compared to state-of-the-art representations. Most remarkably, our proposed FV-VAE achieves to-date the best published accuracy of 94.2 UCF101.

READ FULL TEXT

page 9

page 10

research
05/19/2017

VAE with a VampPrior

Many different methods to train deep generative models have been introdu...
research
10/13/2021

The deep generative decoder: Using MAP estimates of representations

A deep generative model is characterized by a representation space, its ...
research
06/07/2019

Resampling-based Assessment of Robustness to Distribution Shift for Deep Neural Networks

A novel resampling framework is proposed to evaluate the robustness and ...
research
05/10/2018

Unsupervised Deep Representations for Learning Audience Facial Behaviors

In this paper, we present an unsupervised learning approach for analyzin...
research
12/05/2016

Authoring image decompositions with generative models

We show how to extend traditional intrinsic image decompositions to inco...
research
10/20/2019

Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models

This paper describes a framework for the development of an integrative c...
research
04/26/2018

Generative Model for Heterogeneous Inference

Generative models (GMs) such as Generative Adversary Network (GAN) and V...

Please sign up or login with your details

Forgot password? Click here to reset