While developing a new vision-language LLM (VL-LLM) by pre-training on t...
Large-scale commonsense knowledge bases empower a broad range of AI appl...
This paper summarizes the outcomes from the ISCSLP 2022 Intelligent Cock...
Speech command recognition (SCR) has been commonly used on resource cons...
Vision-language pre-training (VLP) has shown impressive performance on a...
Recent works have shown promising results of prompt tuning in stimulatin...
Scene graph generation (SGG) aims to extract (subject, predicate, object...
Pre-Trained Vision-Language Models (VL-PTMs) have shown promising capabi...
Collecting large-scale annotated satellite imagery datasets is essential...
Object detection using automotive radars has not been explored with deep...
Scene graph generation aims to identify objects and their relations in i...
Data augmentation has attracted a lot of research attention in the deep ...
Graph neural networks (GNNs) achieve remarkable performance for tasks on...
Learning a classifier with control on the false-positive rate plays a cr...