Face0: Instantaneously Conditioning a Text-to-Image Model on a Face

06/11/2023
by   Dani Valevski, et al.
1

We present Face0, a novel way to instantaneously condition a text-to-image generation model on a face, in sample time, without any optimization procedures such as fine-tuning or inversions. We augment a dataset of annotated images with embeddings of the included faces and train an image generation model, on the augmented dataset. Once trained, our system is practically identical at inference time to the underlying base model, and is therefore able to generate images, given a user-supplied face image and a prompt, in just a couple of seconds. Our method achieves pleasing results, is remarkably simple, extremely fast, and equips the underlying model with new capabilities, like controlling the generated images both via text or via direct manipulation of the input face embeddings. In addition, when using a fixed random vector instead of a face embedding from a user supplied image, our method essentially solves the problem of consistent character generation across images. Finally, while requiring further research, we hope that our method, which decouples the model's textual biases from its biases on faces, might be a step towards some mitigation of biases in future text-to-image models.

READ FULL TEXT

page 1

page 4

page 5

page 6

page 7

page 9

page 10

research
08/31/2023

Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images

Generating 3D faces from textual descriptions has a multitude of applica...
research
09/13/2023

Unbiased Face Synthesis With Diffusion Models: Are We There Yet?

Text-to-image diffusion models have achieved widespread popularity due t...
research
07/01/2023

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation

While large-scale pre-trained text-to-image models can synthesize divers...
research
05/25/2023

ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Personalizing generative models offers a way to guide image generation w...
research
09/19/2022

The Biased Artist: Exploiting Cultural Biases via Homoglyphs in Text-Guided Image Generation Models

Text-guided image generation models, such as DALL-E 2 and Stable Diffusi...
research
04/17/2022

StyleT2F: Generating Human Faces from Textual Description Using StyleGAN2

AI-driven image generation has improved significantly in recent years. G...
research
03/10/2023

New Benchmarks for Accountable Text-based Visual Re-creation

Given a command, humans can directly execute the action after thinking o...

Please sign up or login with your details

Forgot password? Click here to reset