This paper considers the problem of understanding the behavior of a gene...
Algorithmic stability is an important notion that has proven powerful fo...
Cyclic and randomized stepsizes are widely used in the deep learning pra...
Heavy-tail phenomena in stochastic gradient descent (SGD) have been repo...
We consider a distributionally robust stochastic optimization problem an...
We consider the constrained sampling problem where the goal is to sample...
Recent studies have shown that heavy tails can emerge in stochastic
opti...
Recent theoretical studies have shown that heavy-tails can emerge in
sto...
This work proposes a distributed algorithm for solving empirical risk
mi...
Understanding generalization in deep learning has been one of the major
...
This work proposes a time-efficient Natural Gradient Descent method, cal...
Recent studies have provided both empirical and theoretical evidence
ill...
Gaussian noise injections (GNIs) are a family of simple and widely-used
...
We present two classes of differentially private optimization algorithms...
Stochastic gradient Langevin dynamics (SGLD) and stochastic gradient
Ham...
We introduce a framework for designing primal methods under the decentra...
We consider a distributionally robust formulation of stochastic optimiza...
In recent years, various notions of capacity and complexity have been
pr...
A common approach to initialization in deep neural networks is to sample...
Stochastic gradient Langevin dynamics (SGLD) is a poweful algorithm for
...
Stochastic gradient descent with momentum (SGDm) is one of the most popu...
The gradient noise (GN) in the stochastic gradient descent (SGD) algorit...
Stochastic gradient descent (SGD) has been widely used in machine learni...
We study the problem of minimizing a strongly convex and smooth function...
Momentum methods such as Polyak's heavy ball (HB) method, Nesterov's
acc...
The gradient noise (GN) in the stochastic gradient descent (SGD) algorit...
Langevin dynamics (LD) has been proven to be a powerful technique for
op...
Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of
stoc...
The fast iterative soft thresholding algorithm (FISTA) is used to solve
...