Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art

09/10/2023
by   Aref Miri Rekavandi, et al.
0

Transformers have rapidly gained popularity in computer vision, especially in the field of object recognition and detection. Upon examining the outcomes of state-of-the-art object detection methods, we noticed that transformers consistently outperformed well-established CNN-based detectors in almost every video or image dataset. While transformer-based approaches remain at the forefront of small object detection (SOD) techniques, this paper aims to explore the performance benefits offered by such extensive networks and identify potential reasons for their SOD superiority. Small objects have been identified as one of the most challenging object types in detection frameworks due to their low visibility. We aim to investigate potential strategies that could enhance transformers' performance in SOD. This survey presents a taxonomy of over 60 research studies on developed transformers for the task of SOD, spanning the years 2020 to 2023. These studies encompass a variety of detection applications, including small object detection in generic images, aerial images, medical images, active millimeter images, underwater images, and videos. We also compile and present a list of 12 large-scale datasets suitable for SOD that were overlooked in previous studies and compare the performance of the reviewed studies using popular metrics such as mean Average Precision (mAP), Frames Per Second (FPS), number of parameters, and more. Researchers can keep track of newer studies on our web page, which is available at <https://github.com/arekavandi/Transformer-SOD>.

READ FULL TEXT
research
06/07/2023

2D Object Detection with Transformers: A Review

Astounding performance of Transformers in natural language processing (N...
research
06/26/2023

CST-YOLO: A Novel Method for Blood Cell Detection Based on Improved YOLOv7 and CNN-Swin Transformer

Blood cell detection is a typical small-scale object detection problem i...
research
06/23/2023

Bridging the Performance Gap between DETR and R-CNN for Graphical Object Detection in Document Images

This paper takes an important step in bridging the performance gap betwe...
research
07/09/2023

A Survey and Approach to Chart Classification

Charts represent an essential source of visual information in documents ...
research
05/16/2022

Transformers in 3D Point Clouds: A Survey

In recent years, Transformer models have been proven to have the remarka...
research
06/27/2023

Taming Detection Transformers for Medical Object Detection

The accurate detection of suspicious regions in medical images is an err...
research
07/21/2022

Focused Decoding Enables 3D Anatomical Detection by Transformers

Detection Transformers represent end-to-end object detection approaches ...

Please sign up or login with your details

Forgot password? Click here to reset