Multimodal Large Language Model (MLLM) relies on the powerful LLM to per...
We introduce MQ-Det, an efficient architecture and pre-training strategy...
Open-vocabulary object detection (OVD) aims to scale up vocabulary size ...
Vision transformers (ViTs) are changing the landscape of object detectio...
This paper proposes an Any-time super-Resolution Method (ARM) to tackle ...
Unsupervised domain adaptation (UDA) aims to address the domain-shift pr...
Domain generalization (DG) serves as a promising solution to handle pers...
Recently, the applications of person re-identification in visual surveil...
In most probabilistic topic models, a document is viewed as a collection...
We present a novel method for hierarchical topic detection where topics ...
Hierarchical latent tree analysis (HLTA) is recently proposed as a new m...