Despite the power of Large Language Models (LLMs) like GPT-4, they still...
Large language models (LLMs) have achieved widespread success on a varie...
Humans possess an extraordinary ability to create and utilize tools, all...
Language models (LMs) are pretrained to imitate internet text, including...
Fine-tuning large language models for different tasks can be costly and
...
The crystallization of modeling methods around the Transformer architect...
The use of language-model-based question-answering systems to aid humans...
Over the past two years, EleutherAI has established itself as a radicall...
We present the results of the NLP Community Metasurvey. Run from May to ...
While large pretrained Transformer models have proven highly capable at
...
Summarization datasets are often assembled either by scraping naturally
...
We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive languag...
Current QA systems can generate reasonable-sounding yet false answers wi...
To enable building and testing models on long-document comprehension, we...
More capable language models increasingly saturate existing task benchma...
It is well documented that NLP models learn social biases present in the...
Despite the success of fine-tuning pretrained language encoders like BER...
Recent years have seen numerous NLP datasets introduced to evaluate the
...
Recent work has demonstrated that increased training dataset diversity
i...
Saliency maps that identify the most informative regions of an image for...
Breast cancer is the most common cancer in women, and hundreds of thousa...
Intermediate-task training has been shown to substantially improve pretr...
While pretrained models such as BERT have shown large gains across natur...
We introduce jiant, an open source toolkit for conducting multitask and
...
Medical images differ from natural images in significantly higher resolu...
We investigate the extent to which individual attention heads in pretrai...
Though state-of-the-art sentence representation models can perform tasks...
We trained and evaluated a localization-based deep CNN for breast cancer...
Radiologists typically compare a patient's most recent breast cancer
scr...
Deep learning models designed for visual classification tasks on natural...
We present a deep convolutional neural network for breast cancer screeni...
Pretraining with language modeling and related unsupervised tasks has
re...
In sentence compression, the task of shortening sentences while retainin...