The popular VQ-VAE models reconstruct images through learning a discrete...
Video summarization aims to distill the most important information from ...
The exploration of mutual-benefit cross-domains has shown great potentia...
In computer vision, multi-label classification, including zero-shot mult...
It is critical to obtain high resolution features with long range depend...
In recent years, most of the accuracy gains for video action recognition...