In this work, we investigate the problem of out-of-distribution (OOD)
ge...
Albeit having gained significant progress lately, large-scale graph
repr...
Grounded video description (GVD) encourages captioning models to attend ...
This paper investigates the feasibility of learning good representation ...
It is a consensus that small models perform quite poorly under the parad...
The challenge of the Class Incremental Learning (CIL) lies in difficulty...
The journey of reducing noise from distant supervision (DS) generated
tr...
Recently, a newly proposed self-supervised framework Bootstrap Your Own
...
Visual navigation is a task of training an embodied agent by intelligent...
Existing methods in the Visual Storytelling field often suffer from the
...