Large multimodal models trained on natural documents, which interleave i...
As language models grow ever larger, the need for large-scale high-quali...
The crystallization of modeling methods around the Transformer architect...
The infrastructure necessary for training state-of-the-art models is bec...
Modern deep learning applications require increasingly more compute to t...