We introduce TeraHAC, a (1+ϵ)-approximate hierarchical agglomerative
clu...
Understanding when and how much a model gradient leaks information about...
The Superfacility model is designed to leverage HPC for experimental sci...
In overparametrized models, the noise in stochastic gradient descent (SG...
We propose an efficient inference procedure for non-autoregressive machi...
Many sequence-to-sequence generation tasks, including machine translatio...
As part of a feasibility study, this paper shows the NASA Valkyrie human...
Emergent multi-agent communication protocols are very different from nat...
Although neural machine translation models reached high translation qual...
A recent line of work studies overparametrized neural networks in the
"k...
In neural dialogue modeling, a neural network is trained to predict the ...
Recently machine learning algorithms based on deep layered artificial ne...
We show that gradient descent on full-width linear convolutional network...
The implicit bias of gradient descent is not fully understood even in si...
We study the bias of generic optimization methods, including Mirror Desc...
We propose a conditional non-autoregressive neural sequence model based ...
Visapult is a prototype application and framework for remote visualizati...
While most machine translation systems to date are trained on large para...
Most existing machine translation systems operate at the level of words,...
The matrix-completion problem has attracted a lot of attention, largely ...