CLIP (Contrastive Language-Image Pretraining) is well-developed for
open...
Non-maximum suppression (NMS) is widely used in object detection pipelin...
Temporal action detection (TAD) aims to determine the semantic label and...
Object detection has recently experienced substantial progress. Yet, the...