We study the size of a neural network needed to approximate the maximum
...
We consider 1-dimensional location estimation, where we estimate a param...
We revisit the problem of estimating the mean of a real-valued distribut...
We introduce a framework for studying how distributional assumptions on ...
Given a mixture between two populations of coins, "positive" coins that ...
We consider deep networks, trained via stochastic gradient descent to
mi...