End-to-End Diffusion Latent Optimization Improves Classifier Guidance

03/23/2023
by   Bram Wallace, et al.
0

Classifier guidance – using the gradients of an image classifier to steer the generations of a diffusion model – has the potential to dramatically expand the creative control over image generation and editing. However, currently classifier guidance requires either training new noise-aware models to obtain accurate gradients or using a one-step denoising approximation of the final generation, which leads to misaligned gradients and sub-optimal control. We highlight this approximation's shortcomings and propose a novel guidance method: Direct Optimization of Diffusion Latents (DOODL), which enables plug-and-play guidance by optimizing diffusion latents w.r.t. the gradients of a pre-trained classifier on the true generated pixels, using an invertible diffusion process to achieve memory-efficient backpropagation. Showcasing the potential of more precise guidance, DOODL outperforms one-step classifier guidance on computational and human evaluation metrics across different forms of guidance: using CLIP guidance to improve generations of complex prompts from DrawBench, using fine-grained visual classifiers to expand the vocabulary of Stable Diffusion, enabling image-conditioned generation with a CLIP visual encoder, and improving image aesthetics using an aesthetic scoring network.

READ FULL TEXT

page 20

page 21

page 22

page 23

page 24

page 25

page 26

page 27

research
12/20/2021

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Diffusion models have recently been shown to generate high-quality synth...
research
04/25/2023

Exploring Compositional Visual Generation with Latent Classifier Guidance

Diffusion probabilistic models have achieved enormous success in the fie...
research
06/16/2023

Drag-guided diffusion models for vehicle image generation

Denoising diffusion models trained at web-scale have revolutionized imag...
research
06/23/2022

Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation

Denoising Diffusion Probabilistic Model (DDPM) is able to make flexible ...
research
08/31/2023

Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance

Sketch-based terrain generation seeks to create realistic landscapes for...
research
08/18/2022

Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance

Denoising diffusion probabilistic models (DDPMs) are a recent family of ...
research
02/22/2023

Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC

Since their introduction, diffusion models have quickly become the preva...

Please sign up or login with your details

Forgot password? Click here to reset