SSD: Single Shot MultiBox Detector

by Wei Liu, et al.

We present a method for detecting objects in images using a single deep neural network. Our approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location. At prediction time, the network generates scores for the presence of each object category in each default box and produces adjustments to the box to better match the object shape. Additionally, the network combines predictions from multiple feature maps with different resolutions to naturally handle objects of various sizes. Our SSD model is simple relative to methods that require object proposals because it completely eliminates proposal generation and the subsequent pixel or feature resampling stages, and encapsulates all computation in a single network. This makes SSD easy to train and straightforward to integrate into systems that require a detection component. Experimental results on the PASCAL VOC, MS COCO, and ILSVRC datasets confirm that SSD has comparable accuracy to methods that utilize an additional object proposal step and is much faster, while providing a unified framework for both training and inference. Compared to other single-stage methods, SSD has much better accuracy, even with a smaller input image size. For 300×300 input, SSD achieves 72.1% mAP, and for 500×500 input, SSD achieves 75.1% mAP, outperforming a comparable state-of-the-art Faster R-CNN model. Code is available at .
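The default-box idea in the abstract can be illustrated with a minimal sketch. The abstract gives no configuration, so everything concrete here is an assumption made for illustration: the function name `default_boxes`, the feature map sizes, the scales, the aspect ratios, and the choice to keep box area fixed at scale² while varying the width-to-height ratio. It shows only how boxes might be tiled over several feature maps, not the authors' implementation.

```python
import math


def default_boxes(fmap_size, scale, aspect_ratios):
    """Return (cx, cy, w, h) boxes in normalized [0, 1] image coordinates
    for one square feature map (hypothetical helper, illustrative only)."""
    boxes = []
    for row in range(fmap_size):
        for col in range(fmap_size):
            # Center of this feature map cell, in normalized coordinates.
            cx = (col + 0.5) / fmap_size
            cy = (row + 0.5) / fmap_size
            for ar in aspect_ratios:
                # Assumed parameterization: area stays scale**2, w / h equals ar.
                w = scale * math.sqrt(ar)
                h = scale / math.sqrt(ar)
                boxes.append((cx, cy, w, h))
    return boxes


# Assumed configuration: (feature map size, box scale) pairs, coarse maps get larger boxes.
FEATURE_MAPS = [(8, 0.2), (4, 0.5), (2, 0.8)]
ASPECT_RATIOS = [1.0, 2.0, 0.5]

all_boxes = []
for size, scale in FEATURE_MAPS:
    all_boxes.extend(default_boxes(size, scale, ASPECT_RATIOS))

print(len(all_boxes))  # (64 + 16 + 4) cells * 3 ratios = 252
```

In a detector of this kind, the network would output one set of category scores and one set of box adjustments for each entry in `all_boxes`; finer feature maps contribute many small boxes and coarser maps contribute a few large ones.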




Single Multi-feature detector for Amodal 3D Object Detection in RGB-D Images

This paper aims at fast and high-accuracy amodal 3D object detections in...

Ensemble-based Adaptive Single-shot Multi-box Detector

We propose two improvements to the SSD (single shot multibox detector). ...

Residual Features and Unified Prediction Network for Single Stage Detection

Recently, a lot of single stage detectors using multi-scale features hav...

Precise Single-stage Detector

There are still two problems in SSD causing some inaccurate results: (1)...

FastMask: Segment Multi-scale Object Candidates in One Shot

Objects appear to scale differently in natural images. This fact require...

IMMVP: An Efficient Daytime and Nighttime On-Road Object Detector

It is hard to detect on-road objects under various lighting conditions. ...

Random Boxes Are Open-world Object Detectors

We show that classifiers trained with random region proposals achieve st...
