The aim of audio-visual segmentation (AVS) is to precisely differentiate...
Sequence modeling has important applications in natural language process...
We explore a new task for audio-visual-language modeling called fine-gra...
In this work, we study two self-play training schemes, Chainer and Pool,...
Learning continuous control in high-dimensional sparse reward settings, ...
As one of the most popular techniques for solving the ranking problem in...