Vision transformers (ViTs) are changing the landscape of object detectio...
Large-scale vision-language pre-training has achieved promising results ...
While self-supervised representation learning (SSL) has received widespr...
Conventional semi-supervised learning (SSL) methods, e.g., MixMatch, ach...
Neural networks are susceptible to catastrophic forgetting. They fail to...
Liquify is a common technique for image editing, which can be used for i...
Self-supervised learning has shown great potentials in improving the vid...
One significant factor we expect the video representation learning to
ca...
An essential task in predictive maintenance is the prediction of the
Rem...
In this paper, we propose Double Supervised Network with Attention Mecha...
In this paper, we introduce a novel end-end framework for multi-oriented...