Skip connections and normalisation layers form two standard architectura...
Many important problems involving molecular property prediction from 3D
...
Training very deep neural networks is still an extremely challenging tas...
In this note we extend kernel function approximation results for neural
...
Increasing the batch size is a popular way to speed up neural network
tr...
Adversarial training is an effective methodology for training deep neura...
Natural gradient descent has proven effective at mitigating the effects ...
Deep learning is built on the foundational guarantee that gradient desce...
The recently proposed Unbiased Online Recurrent Optimization algorithm (...
The cornerstone underpinning deep learning is the guarantee that gradien...
Second-order optimization methods such as natural gradient descent have ...
Deep feedforward and recurrent networks have achieved impressive results...
We propose an efficient method for approximating natural gradient descen...
Natural gradient descent is an optimization method traditionally motivat...
Sum Product Networks (SPNs) are a recently developed class of deep gener...
In this work we develop Curvature Propagation (CP), a general technique ...