Large language models have been shown to achieve remarkable performance
...
Recent neural network-based language models have benefited greatly from
...
We present the design of a new large scale orchestration layer for
accel...
Model-free reinforcement learning (RL) can be used to learn effective
po...
Batch-splitting (data-parallelism) is the dominant distributed Deep Neur...
Tensor2Tensor is a library for deep learning models that is well-suited ...
We show that generating English Wikipedia articles can be approached as ...