The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models

06/16/2023
by   Roy Voetman, et al.
0

Despite the notable accomplishments of deep object detection models, a major challenge that persists is the requirement for extensive amounts of training data. The process of procuring such real-world data is a laborious undertaking, which has prompted researchers to explore new avenues of research, such as synthetic data generation techniques. This study presents a framework for the generation of synthetic datasets by fine-tuning pretrained stable diffusion models. The synthetic datasets are then manually annotated and employed for training various object detection models. These detectors are evaluated on a real-world test set of 331 images and compared against a baseline model that was trained on real-world images. The results of this study reveal that the object detection models trained on synthetic data perform similarly to the baseline model. In the context of apple detection in orchards, the average precision deviation with the baseline ranges from 0.09 to 0.12. This study illustrates the potential of synthetic data generation techniques as a viable alternative to the collection of extensive training data for the training of deep models.

READ FULL TEXT

page 5

page 7

page 14

page 15

page 16

page 17

page 18

page 19

research
06/20/2023

Exploring the Effectiveness of Dataset Synthesis: An application of Apple Detection in Orchards

Deep object detection models have achieved notable successes in recent y...
research
04/28/2019

Synthetic Data Generation and Adaption for Object Detection in Smart Vending Machines

This paper presents an improved scheme for the generation and adaption o...
research
01/29/2021

Synthetic Data and Hierarchical Object Detection in Overhead Imagery

The performance of neural network models is often limited by the availab...
research
05/28/2019

LeagueAI: Improving object detector performance and flexibility through automatically generated training data and domain randomization

In this technical report I present my method for automatic synthetic dat...
research
02/26/2019

An Annotation Saved is an Annotation Earned: Using Fully Synthetic Training for Object Instance Detection

Deep learning methods typically require vast amounts of training data to...
research
11/25/2022

Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images

Line Chart Data Extraction is a natural extension of Optical Character R...
research
10/24/2019

Animal Detection in Man-made Environments

Automatic detection of animals that have strayed into human inhabited ar...

Please sign up or login with your details

Forgot password? Click here to reset