We present GSPMD, an automatic, compiler-based parallelization system fo...
Recent results in language understanding using neural networks have requ...
Neural network scaling has been critical for improving the model quality...
In data-parallel synchronous training of deep neural networks, different...
The recent submission of Google TPU-v3 Pods to the industry-wide MLPerf ...
Lingvo is a TensorFlow framework offering a complete solution for collab...
GPipe is a scalable pipeline parallelism library that enables learning o...
Batch-splitting (data-parallelism) is the dominant distributed Deep Neur...