The development of biologically-plausible learning algorithms is importa...
Natural language instructions for visual navigation often use scene
desc...
Recent Visual Question Answering (VQA) models have shown impressive
perf...
Recent works have proposed several long term tracking benchmarks and
hig...
In this paper, we propose a new long video dataset (called Track Long an...