research ∙ 07/28/2020
Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Stochastic gradient descent (SGD) and its variants have been the dominat...
research ∙ 02/26/2020
Stagewise Enlargement of Batch Size for SGD-based Learning
Existing research shows that the batch size can seriously affect the per...
research ∙ 05/30/2019