Hierarchical Semantic Regularization of Latent Spaces in StyleGANs

08/07/2022
by   Tejan Karmali, et al.

Progress in GANs has enabled the generation of high-resolution photorealistic images of astonishing quality. StyleGANs allow for compelling attribute modification on such images via mathematical operations on the latent style vectors in the W/W+ space that effectively modulate the rich hierarchical representations of the generator. Such operations have recently been generalized beyond the mere attribute swapping of the original StyleGAN paper to include interpolations. Despite many significant improvements, StyleGANs are still seen to generate unnatural images. The quality of the generated images is predicated on two assumptions: (a) the richness of the hierarchical representations learnt by the generator, and (b) the linearity and smoothness of the style spaces. In this work, we propose a Hierarchical Semantic Regularizer (HSR) which aligns the hierarchical representations learnt by the generator to corresponding powerful features learnt by pretrained networks on large amounts of data. HSR is shown to improve not only the generator representations but also the linearity and smoothness of the latent style spaces, leading to the generation of more natural-looking style-edited images. To demonstrate improved linearity, we propose a novel metric, the Attribute Linearity Score (ALS). A significant reduction in the generation of unnatural images is corroborated by an improvement of 16.19% in the Perceptual Path Length (PPL) metric, while simultaneously improving the linearity of attribute change in attribute editing tasks.
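
For readers unfamiliar with PPL, the following is a minimal sketch of how a path-length estimate over linearly interpolated latents can be computed. It is not the evaluation code used in this work. The names `lerp`, `path_length_term` and `estimate_ppl` are hypothetical, the generator is a toy stand-in, and squared Euclidean distance is assumed in place of the perceptual distance normally used for PPL.

```python
import numpy as np


def lerp(w0, w1, t):
    """Linear interpolation between two latent codes."""
    return w0 + t * (w1 - w0)


def path_length_term(generator, distance, w0, w1, t, eps=1e-4):
    """Finite-difference path-length term for one latent pair at position t.

    Compares the outputs at t and t + eps, scaled by 1 / eps**2.
    """
    img_a = generator(lerp(w0, w1, t))
    img_b = generator(lerp(w0, w1, t + eps))
    return distance(img_a, img_b) / eps**2


def estimate_ppl(generator, distance, sample_w, n_pairs=1000, eps=1e-4, seed=0):
    """Monte Carlo average of the path-length term over random latent pairs."""
    rng = np.random.default_rng(seed)
    total = 0.0
    for _ in range(n_pairs):
        w0, w1 = sample_w(rng), sample_w(rng)
        t = rng.uniform(0.0, 1.0)
        total += path_length_term(generator, distance, w0, w1, t, eps)
    return total / n_pairs


if __name__ == "__main__":
    # Toy stand-ins (assumptions): a fixed linear "generator" and squared L2
    # distance instead of a learned perceptual distance.
    mixing = np.random.default_rng(1).standard_normal((8, 4))
    toy_generator = lambda w: mixing @ w
    squared_l2 = lambda a, b: float(np.sum((a - b) ** 2))
    sample_w = lambda rng: rng.standard_normal(4)
    print(estimate_ppl(toy_generator, squared_l2, sample_w, n_pairs=200))
```

Lower values indicate that small steps along the interpolation path produce small changes in the output, which is the smoothness property the abstract refers to. With a real StyleGAN, `generator` would be the synthesis network and `distance` a perceptual metric.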


