Supersized pre-trained language models have pushed the accuracy of vario...
In the era of deep learning, modeling for most NLP tasks has converged t...
Knowledge distillation (KD) has gained much attention due to its
effecti...
Both performance and efficiency are crucial factors for sequence labelin...
As a simple technique to accelerate inference of large-scale pre-trained...
Aspect-based Sentiment Analysis (ABSA), aiming at predicting the polarit...
With the emerging branch of incorporating factual knowledge into pre-tra...
Recently, the emergence of pre-trained models (PTMs) has brought natural...
Most existing deep multi-task learning models are based on parameter sha...