research
∙
02/08/2021
Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise
The empirical success of deep learning is often attributed to SGD's myst...
research
∙
10/31/2017