The Rate of Convergence of Variation-Constrained Deep Neural Networks

06/22/2021
by Gen Li, et al.

Multi-layer feedforward networks have been used to approximate a wide range of nonlinear functions. An important and fundamental problem is to understand the learnability of a network model through its statistical risk, that is, the expected prediction error on future data. To the best of our knowledge, the rate of convergence of neural networks shown by existing works is no faster than the order of n^{-1/4} for a sample size of n. In this paper, we show that a class of variation-constrained neural networks, with arbitrary width, can achieve the near-parametric rate n^{-1/2+δ} for an arbitrarily small positive constant δ. This is equivalent to n^{-1+2δ} under the mean squared error. This rate is also observed in numerical experiments. The result indicates that the neural function space needed for approximating smooth functions may not be as large as is often perceived. Our result also offers insight into the phenomenon that deep neural networks do not easily suffer from overfitting when the number of neurons and learning parameters grows rapidly with n or even surpasses n. We also discuss how the rate of convergence depends on other network parameters, including the input dimension, network layers, and coefficient norm.
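
The equivalence between the two rates quoted in the abstract comes from squaring. The step below is a minimal sketch, assuming the first rate is stated for an unsquared error measure; the symbols E(n) for that error and C for a constant independent of n are introduced here for illustration and do not come from the paper.

```latex
% Assumption: E(n) is the unsquared error at sample size n, and C > 0 is a
% hypothetical constant that does not depend on n.
\[
  E(n) \le C\, n^{-1/2+\delta}
  \quad\Longrightarrow\quad
  E(n)^2 \le C^2\, n^{2(-1/2+\delta)} = C^2\, n^{-1+2\delta}.
\]
% Same step applied to the earlier rate quoted in the abstract:
\[
  E(n) \le C\, n^{-1/4}
  \quad\Longrightarrow\quad
  E(n)^2 \le C^2\, n^{-1/2}.
\]
```

Under the same assumption, the earlier n^{-1/4} rate corresponds to n^{-1/2} for the squared error, while the rate claimed here approaches n^{-1} as δ shrinks toward zero.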

research
03/15/2021

Function approximation by deep neural networks with parameters {0,±1/2, ± 1, 2}

In this paper it is shown that C_β-smooth functions can be approximated ...
research
09/10/2018

Approximation and Estimation for High-Dimensional Deep Learning Networks

It has been experimentally observed in recent years that multi-layer art...
research
05/19/2017

The Landscape of Deep Learning Algorithms

This paper studies the landscape of empirical risk of deep neural networ...
research
04/22/2022

On Feature Learning in Neural Networks with Global Convergence Guarantees

We study the optimization of wide neural networks (NNs) via gradient flo...
research
12/09/2019

Over-parametrized deep neural networks do not generalize well

Recently it was shown in several papers that backpropagation is able to ...
research
09/02/2018

On overcoming the Curse of Dimensionality in Neural Networks

Let H be a reproducing Kernel Hilbert space. For i=1,...,N, let x_i∈R^d ...
research
06/01/2018

The Nonlinearity Coefficient - Predicting Overfitting in Deep Neural Networks

For a long time, designing neural architectures that exhibit high perfor...
