In response to innovations in machine learning (ML) models, production
w...
We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE)
t...
Recent results in language understanding using neural networks have requ...
Scientific workloads have traditionally exploited high levels of sparsit...
Conventional neural accelerators rely on isolated self-sufficient functi...
Machine learning is experiencing an explosion of software and hardware
s...
Batch-splitting (data-parallelism) is the dominant distributed Deep Neur...
Many architects believe that major improvements in cost-energy-performan...
Neural Machine Translation (NMT) is an end-to-end learning approach for
...