Semantic Image Synthesis via Class-Adaptive Cross-Attention

08/30/2023
by   Tomaso Fontanini, et al.
0

In semantic image synthesis, the state of the art is dominated by methods that use spatially-adaptive normalization layers, which allow for excellent visual generation quality and editing versatility. Granted their efficacy, recent research efforts have focused toward finer-grained local style control and multi-modal generation. By construction though, such layers tend to overlook global image statistics leading to unconvincing local style editing and causing global inconsistencies such as color or illumination distribution shifts. Also, the semantic layout is required for mapping styles in the generator, putting a strict alignment constraint over the features. In response, we designed a novel architecture where cross-attention layers are used in place of de-normalization ones for conditioning the image generation. Our model inherits the advantages of both solutions, retaining state-of-the-art reconstruction quality, as well as improved global and local style transfer. Code and models available at https://github.com/TFonta/CA2SIS.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

research
03/18/2019

Semantic Image Synthesis with Spatially-Adaptive Normalization

We propose spatially-adaptive normalization, a simple but effective laye...
research
11/28/2019

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization

We propose semantic region-adaptive normalization (SEAN), a simple but e...
research
03/30/2023

Masked and Adaptive Transformer for Exemplar Based Image Translation

We present a novel framework for exemplar based image translation. Recen...
research
08/27/2022

AesUST: Towards Aesthetic-Enhanced Universal Style Transfer

Recent studies have shown remarkable success in universal style transfer...
research
08/20/2019

Image Synthesis From Reconfigurable Layout and Style

Despite remarkable recent progress on both unconditional and conditional...
research
12/08/2020

Efficient Semantic Image Synthesis via Class-Adaptive Normalization

Spatially-adaptive normalization (SPADE) is remarkably successful recent...
research
07/13/2022

Context-Consistent Semantic Image Editing with Style-Preserved Modulation

Semantic image editing utilizes local semantic label maps to generate th...

Please sign up or login with your details

Forgot password? Click here to reset