The effect of loss function on conditional generative adversarial networks

by   Omar S. Al-Kadi, et al.

Conditional Generative Adversarial Network (cGAN) is a general purpose approach for many image-to-image translation tasks, which aims to translate images from one form to another resulting in high-quality translated images. In this paper, the loss function of the cGAN model is modified by combining the adversarial loss of state-of-the-art Generative Adversarial Network (GAN) models with a new combination of non-adversarial loss functions to enhance model performance and generate more realistic images. Specifically, the effect of the Wasserstein GAN (WGAN), the WGAN with Gradient Penalty (WGAN-GP), and least Squared GAN (lsGAN) adversarial loss functions are explored. Several comparisons are performed to select an optimized combination of L1 with structure, gradient, content-based, Kullback-Leibler divergence, and softmax non-adversarial loss functions. For experimentation purposes, the Facades dataset is used in case of image-to-image translation task. Peak-signal-to-noise-ratio (PSNR), Structural Similarity Index (SSIM), Universal Quality Index (UQI), and Visual Information Fidelity (VIF) are used to quantitatively evaluate the translated images. Based on our experimental results, the best combination of the loss functions for image-to-image translation on facade dataset is (WGAN) adversarial loss with (L1 and content) non-adversarial loss functions. The model generates fine structure images, and captures both high and low frequency details of translated images. Image in-painting and lesion segmentation is investigated to demonstrate practicality of proposed work.


page 1

page 2

page 3

page 4

page 5

page 6

page 10

page 12


In2I : Unsupervised Multi-Image-to-Image Translation Using Generative Adversarial Networks

In unsupervised image-to-image translation, the goal is to learn the map...

Flow-based Deformation Guidance for Unpaired Multi-Contrast MRI Image-to-Image Translation

Image synthesis from corrupted contrasts increases the diversity of diag...

Image-to-Image Translation with Conditional Adversarial Networks

We investigate conditional adversarial networks as a general-purpose sol...

MFIF-GAN: A New Generative Adversarial Network for Multi-Focus Image Fusion

Multi-Focus Image Fusion (MFIF) is one of the promising techniques to ob...

Desmoking laparoscopy surgery images using an image-to-image translation guided by an embedded dark channel

In laparoscopic surgery, the visibility in the image can be severely deg...

Sampling Using Neural Networks for colorizing the grayscale images

The main idea of this paper is to explore the possibilities of generatin...

Evolving GAN Formulations for Higher Quality Image Synthesis

Generative Adversarial Networks (GANs) have extended deep learning to co...

Please sign up or login with your details

Forgot password? Click here to reset