Modern language models often exhibit powerful but brittle behavior, lead...
Human ratings are treated as the gold standard in NLG evaluation. The
st...
We examine whether some countries are more richly represented in embeddi...
Cosine similarity of contextual embeddings is used in many NLP tasks (e....
Probing experiments investigate the extent to which neural representatio...
Shapley Values, a solution to the credit assignment problem in cooperati...
How does word frequency in pre-training data affect the behavior of
simi...
Benchmarks such as GLUE have helped drive advances in NLP by incentivizi...
Evaluation is a bottleneck in the development of natural language genera...
Most NLP datasets are not annotated with protected attributes such as ge...
Replacing static word embeddings with contextualized word representation...
A notable property of word embeddings is that word relationships can exi...
Word embeddings are often criticized for capturing undesirable word
asso...
A surprising property of word vectors is that vector algebra can often b...