Dynamic Zoom-in Network for Fast Object Detection in Large Images

11/14/2017
by   Mingfei Gao, et al.
0

We introduce a generic framework that reduces the computational cost of object detection while retaining accuracy for scenarios where objects with varied sizes appear in high resolution images. Detection progresses in a coarse-to-fine manner, first on a down-sampled version of the image and then on a sequence of higher resolution regions identified as likely to improve the detection accuracy. Built upon reinforcement learning, our approach consists of a model (R-net) that uses coarse detection results to predict the potential accuracy gain for analyzing a region at a higher resolution and another model (Q-net) that sequentially selects regions to zoom in. Experiments on the Caltech Pedestrians dataset show that our approach reduces the number of processed pixels by over 50 of our approach become more significant on a high resolution test set collected from YFCC100M dataset where our approach maintains high detection performance while reducing the number of processed pixels by about 70 time by over 50

READ FULL TEXT

page 1

page 6

page 7

research
03/02/2023

A Coarse to Fine Framework for Object Detection in High Resolution Image

Object detection is a fundamental problem in computer vision, aiming at ...
research
07/21/2021

You Better Look Twice: a new perspective for designing accurate detectors with reduced computations

General object detectors use powerful backbones that uniformly extract f...
research
12/09/2019

Efficient Object Detection in Large Images using Deep Reinforcement Learning

Traditionally, an object detector is applied to every part of the scene ...
research
11/18/2020

TJU-DHD: A Diverse High-Resolution Dataset for Object Detection

Vehicles, pedestrians, and riders are the most important and interesting...
research
10/24/2018

Fast and accurate object detection in high resolution 4K and 8K video using GPUs

Machine learning has celebrated a lot of achievements on computer vision...
research
03/27/2023

Learning to Zoom and Unzoom

Many perception systems in mobile computing, autonomous navigation, and ...
research
05/22/2016

Automated Resolution Selection for Image Segmentation

It is well-known in image processing that computational cost increases r...

Please sign up or login with your details

Forgot password? Click here to reset