Many useful tasks on scientific documents, such as topic classification ...
Is the output softmax layer, which is adopted by most language models (L...
Ensembling BERT models often significantly improves accuracy, but at the...
For many business applications, we often seek to analyze sentiments asso...
Universal schema (USchema) assumes that two sentence patterns that share...
Large Transformer-based language models can aid human authors by suggest...
Most unsupervised NLP models represent each word with a single point or ...
Can one build a knowledge graph (KG) for all products in the world? Know...
Existing deep active learning algorithms achieve impressive sampling eff...
Materials science literature contains millions of materials synthesis pr...
Leveraging new data sources is a key step in accelerating the pace of ma...
Word sense induction (WSI), which addresses polysemy by unsupervised dis...
Computational synthesis planning approaches have achieved recent success...
Modeling hypernymy, such as poodle is-a dog, is an important generalizat...
Self-paced learning and hard example mining re-weight training instances...