Zero-Shot Image Harmonization with Generative Model Prior

07/17/2023
by Jianqi Chen, et al.

Recent image harmonization methods have demonstrated promising results. However, because they rely heavily on large numbers of composite images, these methods are expensive to train and often fail to generalize to unseen images. In this paper, we draw lessons from human behavior and propose a zero-shot image harmonization method. Specifically, when harmonizing an image, a person mainly relies on a long-term prior of what harmonious images look like and adjusts the composite image toward that prior. To imitate this, we use pretrained generative models as the prior of natural images. To guide the harmonization direction, we propose an Attention-Constraint Text, which is optimized to describe the image environment accurately. Further designs are introduced to preserve the content structure of the foreground. The resulting framework, which is highly consistent with human behavior, achieves harmonious results without burdensome training. Extensive experiments demonstrate the effectiveness of our approach, and we also explore some interesting applications.
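For readers new to the task, the sketch below illustrates only the problem setup, not the method of the paper: a composite image is a foreground pasted onto a background through a binary mask, and harmonization aims to reduce the appearance mismatch between the two regions. The function names `make_composite` and `mean_color_gap` are hypothetical, and the mean color gap is an assumed, deliberately crude stand-in for "how inharmonious the composite looks".

```python
import numpy as np


def make_composite(foreground, background, mask):
    """Paste `foreground` onto `background` where `mask` is 1.

    foreground, background: float arrays of shape (H, W, 3)
    mask: array of shape (H, W) with values in {0, 1}
    """
    m = mask[..., None].astype(np.float32)  # broadcast over color channels
    return m * foreground + (1.0 - m) * background


def mean_color_gap(image, mask):
    """Per-channel mean color difference between foreground and background.

    A crude illustrative statistic (an assumption of this sketch): a large
    gap hints that the pasted region does not match its surroundings.
    """
    fg_mean = image[mask > 0].mean(axis=0)   # mean over foreground pixels
    bg_mean = image[mask == 0].mean(axis=0)  # mean over background pixels
    return fg_mean - bg_mean


if __name__ == "__main__":
    h, w = 4, 4
    foreground = np.full((h, w, 3), 200.0, dtype=np.float32)  # bright object
    background = np.full((h, w, 3), 50.0, dtype=np.float32)   # dark scene
    mask = np.zeros((h, w), dtype=np.uint8)
    mask[1:3, 1:3] = 1  # object occupies the central 2x2 block

    composite = make_composite(foreground, background, mask)
    print(mean_color_gap(composite, mask))
```

A harmonization method, whether trained or zero-shot, would take `composite` and `mask` as input and return an image whose foreground region better matches the background while keeping the foreground content recognizable.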

research
07/31/2018

A Zero-Shot Framework for Sketch-based Image Retrieval

Sketch-based image retrieval (SBIR) is the task of retrieving images fro...
research
11/10/2022

Zero-shot Visual Commonsense Immorality Prediction

Artificial intelligence is currently powering diverse real-world applica...
research
12/27/2021

A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision

Using natural language as a supervision for training visual recognition ...
research
05/25/2023

ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image

Recent advancements in text-to-image generation have enabled significant...
research
04/07/2023

Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior

Anatomically consistent field-of-view (FOV) completion to recover trunca...
research
04/11/2022

No Token Left Behind: Explainability-Aided Image Classification and Generation

The application of zero-shot learning in computer vision has been revolu...
research
02/02/2021

Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

In this research work we present CLIP-GLaSS, a novel zero-shot framework...