William Huang | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Kyunghyun Cho
218 publications
Samuel R. Bowman
79 publications
Tal Linzen
52 publications
He He
33 publications
Jason Phang
33 publications
Alex Warstadt
18 publications
Nikita Nangia
17 publications
Haokun Liu
16 publications
Angelica Chen
16 publications
Clara Vania
15 publications
Phu Mon Htut
14 publications

research

∙ 11/16/2021

Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair

More capable language models increasingly saturate existing task benchma...

12 Jason Phang, et al. ∙

research

∙ 09/14/2021

Types of Out-of-Distribution Texts and How to Detect Them

Despite agreement on the importance of detecting out-of-distribution (OO...

7 Udit Arora, et al. ∙

research

∙ 06/01/2021

Comparing Test Sets with Item Response Theory

Recent years have seen numerous NLP datasets introduced to evaluate the ...

7 Clara Vania, et al. ∙

research

∙ 04/15/2021

Does Putting a Linguist in the Loop Improve NLU Data Collection?

Many crowdsourced NLP datasets contain systematic gaps and biases that a...

0 Alicia Parrish, et al. ∙

research

∙ 10/09/2020

Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data

A growing body of work shows that models exploit annotation artifacts to...

0 William Huang, et al. ∙

research

∙ 10/08/2020

Precise Task Formalization Matters in Winograd Schema Evaluations

Performance on the Winograd Schema Challenge (WSC), a respected English ...

0 Haokun Liu, et al. ∙

Success!

An error occurred