Deep Clustered Convolutional Kernels

03/06/2015
by   Minyoung Kim, et al.
0

Deep neural networks have recently achieved state of the art performance thanks to new training algorithms for rapid parameter estimation and new regularization methods to reduce overfitting. However, in practice the network architecture has to be manually set by domain experts, generally by a costly trial and error procedure, which often accounts for a large portion of the final system performance. We view this as a limitation and propose a novel training algorithm that automatically optimizes network architecture, by progressively increasing model complexity and then eliminating model redundancy by selectively removing parameters at training time. For convolutional neural networks, our method relies on iterative split/merge clustering of convolutional kernels interleaved by stochastic gradient descent. We present a training algorithm and experimental results on three different vision tasks, showing improved performance compared to similarly sized hand-crafted architectures.

READ FULL TEXT
research
10/10/2018

Automatic Configuration of Deep Neural Networks with EGO

Designing the architecture for an artificial neural network is a cumbers...
research
02/09/2019

An Algorithm Unrolling Approach to Deep Image Deblurring

While neural networks have achieved vastly enhanced performance over tra...
research
10/08/2019

Differentiable Sparsification for Deep Neural Networks

A deep neural network has relieved the burden of feature engineering by ...
research
07/20/2020

ThriftyNets : Convolutional Neural Networks with Tiny Parameter Budget

Typical deep convolutional architectures present an increasing number of...
research
06/21/2018

Lamarckian Evolution of Convolutional Neural Networks

Convolutional neural networks belong to the most successul image classif...
research
03/22/2018

A Comprehensive Analysis of Deep Regression

Deep learning revolutionized data science, and recently, its popularity ...
research
05/24/2019

A Polynomial-Based Approach for Architectural Design and Learning with Deep Neural Networks

In this effort we propose a novel approach for reconstructing multivaria...

Please sign up or login with your details

Forgot password? Click here to reset