The current trend of scaling language models involves increasing both
pa...
As language models grow ever larger, the need for large-scale high-quali...
A key feature of neural models is that they can produce semantic vector
...
Multitask prompted finetuning (MTF) has been shown to help large languag...
The crystallization of modeling methods around the Transformer architect...
Large pretrained Transformer language models have been shown to exhibit
...
Large language models have recently been shown to attain reasonable zero...
The scale, variety, and quantity of publicly-available NLP datasets has ...
When fine-tuning pretrained models for classification, researchers eithe...
Although Neural Differential Equations have shown promise on toy problem...
In this paper, we propose the use of in-training matrix factorization to...