We study the task of zero-shot vision-and-language navigation (ZS-VLN), ...
We address a practical yet challenging problem of training robot agents ...
Temporal action localization has long been researched in computer vision...
We addressed the challenging task of video question answering, which req...
We address the problem of video grounding from natural language queries....
Recently, deep neural networks (DNNs) have made great progress on automa...
Most state-of-the-art action localization systems process each action
pr...
Deep reinforcement learning has made significant progress in the field o...