StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators

08/02/2021
by   Rinon Gal, et al.
13

Can a generative model be trained to produce images from a specific domain, guided by a text prompt only, without seeing any image? In other words: can an image generator be trained blindly? Leveraging the semantic power of large scale Contrastive-Language-Image-Pre-training (CLIP) models, we present a text-driven method that allows shifting a generative model to new domains, without having to collect even a single image from those domains. We show that through natural language prompts and a few minutes of training, our method can adapt a generator across a multitude of domains characterized by diverse styles and shapes. Notably, many of these modifications would be difficult or outright impossible to reach with existing methods. We conduct an extensive set of experiments and comparisons across a wide range of domains. These demonstrate the effectiveness of our approach and show that our shifted models maintain the latent-space properties that make generative models appealing for downstream tasks.

READ FULL TEXT

page 6

page 7

page 8

page 11

page 15

page 16

page 17

page 18

research
11/29/2022

DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model

Recent 3D generative models have achieved remarkable performance in synt...
research
12/08/2022

Diffusion Guided Domain Adaptation of Image Generators

Can a text-to-image diffusion model be used as a training objective for ...
research
04/04/2023

PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion

Recently, significant advancements have been made in 3D generative model...
research
01/12/2023

Domain Expansion of Image Generators

Can one inject new concepts into an already trained generative model, wh...
research
10/01/2015

A Generative Model of Words and Relationships from Multiple Sources

Neural language models are a powerful tool to embed words into semantic ...
research
03/19/2021

Paint by Word

We investigate the problem of zero-shot semantic image painting. Instead...
research
05/25/2017

Latent Geometry and Memorization in Generative Models

It can be difficult to tell whether a trained generative model has learn...

Please sign up or login with your details

Forgot password? Click here to reset