An Effective and Efficient Training Algorithm for Multi-layer Feedforward Neural Networks

05/16/2020
by   Zebin Yang, et al.
0

Network initialization is the first and critical step for training neural networks. In this paper, we propose a novel network initialization scheme based on the celebrated Stein's identity. By viewing multi-layer feedforward neural networks as cascades of multi-index models, the projection weights to the first hidden layer are initialized using eigenvectors of the cross-moment matrix between the input's second-order score function and the response. The input data is then forward propagated to the next layer and such a procedure can be repeated until all the hidden layers are initialized. Finally, the weights for the output layer are initialized by generalized linear modeling. Such a proposed SteinGLM method is shown through extensive numerical results to be much faster and more accurate than other popular methods commonly used for training neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2020

An Effective and Efficient Initialization Scheme for Training Multi-layer Feedforward Neural Networks

Network initialization is the first and critical step for training neura...
research
04/05/2020

Backprojection for Training Feedforward Neural Networks in the Input and Feature Spaces

After the tremendous development of neural networks trained by backpropa...
research
06/11/2014

Explicit Computation of Input Weights in Extreme Learning Machines

We present a closed form expression for initializing the input weights i...
research
06/11/2014

Techniques for Learning Binary Stochastic Feedforward Neural Networks

Stochastic binary hidden units in a multi-layer perceptron (MLP) network...
research
12/08/2014

Provable Methods for Training Neural Networks with Sparse Connectivity

We provide novel guaranteed approaches for training feedforward neural n...
research
08/11/2023

Automated Sizing and Training of Efficient Deep Autoencoders using Second Order Algorithms

We propose a multi-step training method for designing generalized linear...
research
10/09/2020

Neural Random Projection: From the Initial Task To the Input Similarity Problem

In this paper, we propose a novel approach for implicit data representat...

Please sign up or login with your details

Forgot password? Click here to reset