Recently, the contrastive language-image pre-training, e.g., CLIP, has
d...
Few-shot dialogue state tracking (DST) is a realistic problem that train...
Fusion technique is a key research topic in multimodal sentiment analysi...
In this paper, we present GEM as a General Evaluation benchmark for
Mult...
Video-text retrieval plays an essential role in multi-modal research and...
Span extraction is an essential problem in machine reading comprehension...
In this paper, we focus on the imbalance issue, which is rarely studied ...
We propose UniViLM: a Unified Video and Language pre-training Model for
...
This paper focuses on two related subtasks of aspect-based sentiment
ana...
This paper uses the weather forecasting as an application background to
...
Currently there exists a gap between deep learning and the techniques
re...
Aspect term extraction is one of the important subtasks in aspect-based
...