Augmenting large language models (LLM) to use external tools enhances th...
Large Language Models (LLMs) may hallucinate and generate fake informati...
With growing capabilities of large language models, prompting them has b...
Despite their impressive performance on diverse tasks, large language mo...
How reliably can we trust the scores obtained from social bias benchmark...
We present UnifiedQA-v2, a QA model built with the same process as Unifi...
When an NLP model is trained on text data from one time period and teste...
Humans often solve complex problems by interacting (in natural language)...
How can model designers turn task instructions into effective prompts fo...
Is it possible to use natural language to intervene in a model's behavio...
Can we enable NLP models to appropriately respond to instructional promp...
While day-to-day questions come with a variety of answer types, the curr...
We present the ARC-DA dataset, a direct-answer ("open response", "freefo...
Leaderboards have eased model development for many NLP datasets by
stand...
A key limitation in current datasets for multi-hop reasoning is that the...
While language embeddings have been shown to have stereotyping biases, h...
A common approach to solve complex tasks is by breaking them down into s...
Temporal common sense (e.g., duration and frequency of events) is crucia...
Question answering (QA) tasks have been posed using a variety of formats...
Commonsense knowledge acquisition is a key problem for artificial
intell...
While recent models have achieved human-level scores on many NLP dataset...
Standard test sets for supervised learning evaluate in-distribution
gene...
Empirical research in Natural Language Processing (NLP) has adopted a na...
Understanding time is crucial for understanding events expressed in natu...
Natural language understanding (NLU) of text is a fundamental challenge ...
Coreference resolution is a key problem in natural language understandin...
The problem of entity-typing has been studied predominantly in supervise...
This work presents PerspectroScope, a web-based system which lets users ...
We propose a novel method for exploiting the semantic structure of text ...
One key consequence of the information revolution is a significant incre...
Recent systems for natural language understanding are strong at overcomi...
Many real world systems need to operate on heterogeneous information net...
We consider the problem of strongly-convex online optimization in presen...
Answering science questions posed in natural language is an important AI...
In this paper, we propose a model-based clustering method (TVClust) that...
In this work we propose a heteroscedastic generalization to RVM, a fast
...
Dribbling an opponent player in digital soccer environment is an importa...