Mathematical analysis of loss function of GAN and its loss function variants
Journal: International Journal of Advanced Technology and Engineering Exploration (IJATEE) (Vol.9, No. 94)Publication Date: 2022-10-01
Authors : Rayeesa Mehmood Rumaan Bashir; Kaiser J. Giri;
Page : 1327-1348
Keywords : Generative adversarial networks; Divergence minimization; Loss functions; Stable training; Mode collapse; Non-convergence.;
Abstract
Generative adversarial networks (GANs) have turned up as the most widely used approaches for creating realistic samples. They're the effective latent variable models for learning complex real distributions. However, despite their enormous success and popularity, the process of training GANs remains challenging and suffers from a number of failures. These failures include mode collapse where the generator generates the same set of output for different inputs which finally leads to loss of diversity; non-convergence because of the diverging and oscillatory behaviors of both generator and discriminator while being trained; and vanishing or exploding gradients due to which either no learning or extremely slow learning takes place. In the past years, a variety of strategies for stabilizing GAN training have been explored which includes modified architectures, loss functions, and other methods. The choice of loss function has been found to be the most crucial part of the GAN model because it influences the vanishing gradient and model collapse directly. Viewing these loss functions as divergence minimization has provided a rich avenue of development. All of these factors make GAN training inherently unstable, and this instability is difficult to comprehend mathematically. This paper intends to provide a thorough mathematical explanation of these divergence minimization functions. It illustrates in great detail the two variants of the loss functions of the original GAN, their optimization to Kullback-Leibler (KL) divergence and Jensen-Shannon (JS) divergence along with their shortcomings. It also describes the loss functions of the different loss function GAN variants that have been proposed to mitigate these shortcomings as well as their minimization. The original GAN and its loss function variants have also been implemented using the standard MNIST, Fashion-MNIST, and CIFAR-10 datasets.
Other Latest Articles
- Hybrid beam-forming techniques for multi-cell massive MIMO
- Seismic analysis of reinforced concrete tubular structures with lateral load resisting systems in high rise buildings
- Thermal and acoustic characterization of musa-coir-epoxy based novel hybrid composites for railway coach interior applications
- BIFACIAL (N+/P/P+) SILICON SOLAR CELL BASE THICKNESS OPTIMIZATION, WHILE ILLUMINATED BY THE REAR FACE WITH MONOCHROMATIC LIGHT OF SHORTWAVELENGTHS
- Impact on slopes with development of shear band
Last modified: 2022-10-18 19:27:29