

  1. Generative Adversarial Networks. Stefano Ermon, Aditya Grover. Stanford University. Lecture 10.

  2. Selected GANs. The GAN Zoo (https://github.com/hindupuravinash/the-gan-zoo) is a list of all named GANs. Today: a rich class of likelihood-free objectives via f-GANs; inferring latent representations via BiGAN; application: image-to-image translation via CycleGANs.

  3. Beyond KL and Jensen-Shannon Divergence. What choices do we have for d(·)? KL divergence: autoregressive models, flow models. (Scaled and shifted) Jensen-Shannon divergence: the original GAN objective.

  4. f-divergences. Given two densities p and q, the f-divergence is given by
        D_f(p, q) = E_{x∼q}[ f( p(x)/q(x) ) ],
     where f is any convex, lower-semicontinuous function with f(1) = 0.
     Convex: the line joining any two points lies above the function.
     Lower-semicontinuous: the function value at any point x_0 is close to f(x_0) or greater than f(x_0).
     Example: KL divergence with f(u) = u log u.
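
     As a quick check (not on the original slide), plugging f(u) = u log u into the definition recovers the familiar KL divergence:
        D_f(p, q) = E_{x∼q}[ (p(x)/q(x)) log( p(x)/q(x) ) ] = ∫ p(x) log( p(x)/q(x) ) dx = D_KL(p ‖ q).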

  5. f-divergences. Many more f-divergences!
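
     A few standard examples and their generator functions f(u) (listed here for reference, not reproduced from the slide's table): total variation, f(u) = |u − 1| / 2; reverse KL, f(u) = −log u; Pearson χ², f(u) = (u − 1)²; squared Hellinger, f(u) = (√u − 1)².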

  6. f-GAN: Variational Divergence Minimization. To use f-divergences as a two-sample test objective for likelihood-free learning, we need to be able to estimate them via samples alone.
     Fenchel conjugate: for any function f(·), its convex conjugate is defined as
        f*(t) = sup_{u ∈ dom_f} ( ut − f(u) ).
     Duality: f** = f. When f(·) is convex and lower-semicontinuous, so is f*(·), and
        f(u) = sup_{t ∈ dom_{f*}} ( tu − f*(t) ).
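
     A worked example (not on the slide): for the KL generator f(u) = u log u, the conjugate is
        f*(t) = sup_u ( ut − u log u );
     setting the derivative to zero, t − log u − 1 = 0, gives u = e^{t−1}, so
        f*(t) = t e^{t−1} − e^{t−1}(t − 1) = e^{t−1}.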

  7. f-GAN: Variational Divergence Minimization. We can obtain a lower bound on any f-divergence via its Fenchel conjugate:
        D_f(p, q) = E_{x∼q}[ f( p(x)/q(x) ) ]
                  = E_{x∼q}[ sup_{t ∈ dom_{f*}} ( t · p(x)/q(x) − f*(t) ) ]
                  ≥ sup_{T ∈ 𝒯} E_{x∼q}[ T(x) · p(x)/q(x) − f*(T(x)) ]
                  = sup_{T ∈ 𝒯} ∫_X ( T(x) p(x) − f*(T(x)) q(x) ) dx
                  = sup_{T ∈ 𝒯} ( E_{x∼p}[T(x)] − E_{x∼q}[f*(T(x))] ),
     where 𝒯 is an arbitrary class of functions T : X → ℝ.
     Note: the lower bound is likelihood-free w.r.t. p and q; a numerical sanity check follows below.
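
     A small numerical sanity check (not from the slides, assuming NumPy): for p = N(0, 1) and q = N(1, 1), the true KL(p ‖ q) is 0.5, and the bound E_p[T(x)] − E_q[f*(T(x))] with f*(t) = e^{t−1} stays below 0.5 for a suboptimal T and reaches it for the optimal T*(x) = 1 + log(p(x)/q(x)) = 1.5 − x.

        # Monte Carlo estimate of the variational lower bound on KL(p || q)
        import numpy as np

        rng = np.random.default_rng(0)
        xp = rng.normal(0.0, 1.0, size=1_000_000)   # samples from p = N(0, 1)
        xq = rng.normal(1.0, 1.0, size=1_000_000)   # samples from q = N(1, 1)

        def bound(T):
            # E_p[T(x)] - E_q[f*(T(x))] with f*(t) = exp(t - 1)
            return T(xp).mean() - np.exp(T(xq) - 1.0).mean()

        print(bound(lambda x: 1.0 - 0.5 * x))   # suboptimal T: about 0.31 (< 0.5)
        print(bound(lambda x: 1.5 - x))         # optimal T*: about 0.5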

  8. f-GAN: Variational Divergence Minimization. Variational lower bound:
        D_f(p, q) ≥ sup_{T ∈ 𝒯} ( E_{x∼p}[T(x)] − E_{x∼q}[f*(T(x))] ).
     Choose any f-divergence. Let p = p_data and q = p_G. Parameterize T by φ and G by θ. Consider the following f-GAN objective:
        min_θ max_φ F(θ, φ) = E_{x∼p_data}[T_φ(x)] − E_{x∼p_{G_θ}}[ f*(T_φ(x)) ].
     The generator G_θ tries to minimize the divergence estimate, and the discriminator T_φ tries to tighten the lower bound; a minimal training sketch follows below.
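
     A minimal, illustrative f-GAN training loop (assuming PyTorch; the toy 1-D data distribution, network sizes, and learning rates are arbitrary choices, not from the lecture):

        import torch
        import torch.nn as nn

        def mlp(in_dim, out_dim):
            return nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, out_dim))

        G = mlp(1, 1)                                  # generator G_theta : z -> x
        T = mlp(1, 1)                                  # discriminator T_phi : x -> R
        opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
        opt_t = torch.optim.Adam(T.parameters(), lr=1e-3)
        f_star = lambda t: torch.exp(t - 1.0)          # conjugate of f(u) = u log u (KL)

        for step in range(2000):
            x_real = 2.0 + 0.5 * torch.randn(256, 1)   # toy p_data = N(2, 0.5^2)
            z = torch.randn(256, 1)                    # prior p(z) = N(0, 1)

            # Discriminator step: tighten the bound, max_phi E_p[T] - E_q[f*(T)]
            bound = T(x_real).mean() - f_star(T(G(z).detach())).mean()
            opt_t.zero_grad(); (-bound).backward(); opt_t.step()

            # Generator step: min_theta F(theta, phi); only the E_q term depends on theta
            gen_loss = -f_star(T(G(z))).mean()
            opt_g.zero_grad(); gen_loss.backward(); opt_g.step()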

  9. Inferring latent representations in GANs. The generator of a GAN is typically a directed, latent-variable model with latent variables z and observed variables x. How can we infer the latent feature representations in a GAN?
     Unlike a normalizing flow model, the mapping G : z → x need not be invertible.
     Unlike a variational autoencoder, there is no inference network q(·) which can learn a variational posterior over latent variables.
     Solution 1: for any point x, use the activations of the prefinal layer of a discriminator as a feature representation (a sketch follows below).
     Intuition: similar to supervised deep neural networks, the discriminator would have learned useful representations for x while distinguishing real and fake x.
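
     A hypothetical sketch of Solution 1 (assuming PyTorch; the discriminator architecture and dimensions are made up for illustration):

        import torch
        import torch.nn as nn

        class Discriminator(nn.Module):
            def __init__(self, x_dim=784, h_dim=256):
                super().__init__()
                self.features = nn.Sequential(          # everything up to the prefinal layer
                    nn.Linear(x_dim, h_dim), nn.ReLU(),
                    nn.Linear(h_dim, h_dim), nn.ReLU(),
                )
                self.head = nn.Linear(h_dim, 1)         # final real-vs-fake score

            def forward(self, x):
                return self.head(self.features(x))

        D = Discriminator()                             # assume D has already been trained in a GAN
        x = torch.randn(32, 784)                        # a batch of data points
        with torch.no_grad():
            phi_x = D.features(x)                       # (32, 256) feature representation of x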

  10. Inferring latent representations in GANs. If we want to directly infer the latent variables z of the generator, we need a different learning algorithm.
      A regular GAN optimizes a two-sample test objective that compares samples of x from the generator and from the data distribution.
      Solution 2: to infer latent representations, we will compare samples of (x, z) from the joint distribution of observed and latent variables under the model and under the data distribution.
      For any x generated via the model, we have access to z (sampled from the simple prior p(z)). For any x from the data distribution, however, z is unobserved (latent).

  11. Bidirectional Generative Adversarial Networks (BiGAN). In a BiGAN, we have an encoder network E in addition to the generator network G.
      The encoder network only observes x ∼ p_data(x) during training, to learn a mapping E : x → z.
      As before, the generator network only observes samples from the prior z ∼ p(z) during training, to learn a mapping G : z → x.

  12. Bidirectional Generative Adversarial Networks (BiGAN). The discriminator D observes samples from the generative model, (z, G(z)), and from the encoding distribution, (E(x), x).
      The goal of the discriminator is to maximize the two-sample test objective between (z, G(z)) and (E(x), x); a minimal training sketch follows below.
      After training is complete, new samples are generated via G and latent representations are inferred via E.
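
      A minimal BiGAN-style sketch (assuming PyTorch; the networks, dimensions, and the flipped-label losses are illustrative choices, not the authors' code):

        import torch
        import torch.nn as nn

        x_dim, z_dim = 784, 32
        G = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(), nn.Linear(256, x_dim))   # G : z -> x
        E = nn.Sequential(nn.Linear(x_dim, 256), nn.ReLU(), nn.Linear(256, z_dim))   # E : x -> z
        D = nn.Sequential(nn.Linear(x_dim + z_dim, 256), nn.ReLU(), nn.Linear(256, 1))

        bce = nn.BCEWithLogitsLoss()
        opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
        opt_ge = torch.optim.Adam(list(G.parameters()) + list(E.parameters()), lr=2e-4)

        x_real = torch.rand(128, x_dim)                    # stand-in for a real data batch
        z = torch.randn(128, z_dim)                        # samples from the prior p(z)

        pair_model = torch.cat([G(z), z], dim=1)           # (G(z), z) from the generative model
        pair_data = torch.cat([x_real, E(x_real)], dim=1)  # (x, E(x)) from the encoding distribution

        # Discriminator step: tell the two joint distributions apart
        d_loss = (bce(D(pair_data.detach()), torch.ones(128, 1))
                  + bce(D(pair_model.detach()), torch.zeros(128, 1)))
        opt_d.zero_grad(); d_loss.backward(); opt_d.step()

        # Generator and encoder step: fool the discriminator (labels flipped)
        ge_loss = (bce(D(pair_data), torch.zeros(128, 1))
                   + bce(D(pair_model), torch.ones(128, 1)))
        opt_ge.zero_grad(); ge_loss.backward(); opt_ge.step()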

  13. Translating across domains. Image-to-image translation: we are given images from two domains, X and Y, as paired or unpaired examples. Paired examples can be expensive to obtain. Can we translate between X and Y in an unsupervised manner?

  14. CycleGAN: Adversarial training across two domains. To match the two distributions, we learn two parameterized conditional generative models, G : X → Y and F : Y → X.
      G maps an element of X to an element of Y. A discriminator D_Y compares the observed dataset Y and the generated samples Ŷ = G(X).
      Similarly, F maps an element of Y to an element of X. A discriminator D_X compares the observed dataset X and the generated samples X̂ = F(Y).

  15. CycleGAN: Cycle consistency across domains. Cycle consistency: if we can go from X to Ŷ via G, then it should also be possible to go from Ŷ back to X via F, i.e. F(G(X)) ≈ X. Similarly, vice versa: G(F(Y)) ≈ Y.
      Overall loss function (the λ term enforces cycle consistency; a sketch of it follows below):
        min_{F, G, D_X, D_Y}  L_GAN(G, D_Y, X, Y) + L_GAN(F, D_X, X, Y) + λ ( E_X[ ‖F(G(X)) − X‖_1 ] + E_Y[ ‖G(F(Y)) − Y‖_1 ] )
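
      A hypothetical sketch of the cycle-consistency term (assuming PyTorch; the placeholder generators and the value of λ are illustrative, not from the lecture):

        import torch
        import torch.nn as nn

        def cycle_consistency_loss(G, F, x, y, lam=10.0):
            # lam * ( E_X ||F(G(x)) - x||_1  +  E_Y ||G(F(y)) - y||_1 )
            l1 = nn.L1Loss()
            return lam * (l1(F(G(x)), x) + l1(G(F(y)), y))

        # Toy usage with stand-in convolutional generators on 1x64x64 "images"
        G = nn.Conv2d(1, 1, kernel_size=3, padding=1)   # placeholder for G : X -> Y
        F = nn.Conv2d(1, 1, kernel_size=3, padding=1)   # placeholder for F : Y -> X
        x = torch.rand(8, 1, 64, 64)
        y = torch.rand(8, 1, 64, 64)
        loss = cycle_consistency_loss(G, F, x, y)       # added to the two GAN losses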

  16. CycleGAN in practice.

  17. Summary of Generative Adversarial Networks. Key observation: samples and likelihoods are not correlated in practice. Two-sample test objectives allow learning generative models via samples alone (likelihood-free). There is a wide range of two-sample test objectives covering f-divergences (and more). Latent representations can be inferred via BiGAN. Interesting applications include CycleGAN.
