This paper aims to establish a generic multi-modal foundation model that...
Spatio-Temporal video grounding (STVG) focuses on retrieving the
spatio-...
The conversational recommender systems (CRSs) have received extensive
at...
Rule-based dialogue management is still the most popular solution for
in...
Steganography represents the art of unobtrusively concealing a secrete
m...