False negatives (FN) in 3D object detection, e.g., missing predictions
o...
This technical report summarizes the winning solution for the 3D Occupan...
Augmenting pretrained language models (LMs) with a vision encoder (e.g.,...
We propose Mask Auto-Labeler (MAL), a high-quality Transformer-based mas...
This report describes the winning solution to the semantic segmentation ...
Built on top of self-attention mechanisms, vision transformers have
demo...
We introduce DiscoBox, a novel framework that jointly learns instance
se...
We present a novel architecture for 3D object detection, M3DeTR, which
c...
Real-time 3D object detection is crucial for autonomous cars. Achieving
...
Object detection is an essential step towards holistic scene understandi...
Recent advances in deep convolutional neural networks (CNNs) have motiva...
Objects appear to scale differently in natural images. This fact require...