SLIDE 1
Ian Goodfellow, Staff Research Scientist, Google Brain
NIPS 2017 Workshop: Deep Learning: Bridging Theory and Practice
Long Beach, 2017-12-09

Bridging Theory and Practice of GANs

A sampling of named GAN variants: 3D-GAN AC-GAN AdaGAN AffGAN AL-CGAN ALI AMGAN AnoGAN ArtGAN b-GAN Bayesian GAN BEGAN BiGAN B-GAN CGAN CCGAN CatGAN CoGAN Context-RNN-GAN C-RNN-GAN C-VAE-GAN CycleGAN DTN DCGAN DiscoGAN DR-GAN DualGAN EBGAN f-GAN FF-GAN GAWWN GoGAN GP-GAN IAN iGAN IcGAN ID-CGAN InfoGAN LAPGAN LR-GAN LS-GAN LSGAN MGAN MAGAN MAD-GAN MalGAN MARTA-GAN McGAN MedGAN MIX+GAN MPM-GAN GMAN alpha-GAN WGAN-GP DRAGAN Progressive GAN SN-GAN

SLIDE 2

(Goodfellow 2017)

Generative Modeling

  • Density estimation
  • Sample generation

(Figure: training examples alongside model samples.)

SLIDE 3

Adversarial Nets Framework

(Goodfellow et al., 2014)

x sampled from data → differentiable function D → D(x) tries to be near 1.
Input noise z → differentiable function G → x sampled from model → D.
D tries to make D(G(z)) near 0; G tries to make D(G(z)) near 1.
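The value function from Goodfellow et al. (2014), $V(G, D) = \mathbb{E}_x[\log D(x)] + \mathbb{E}_z[\log(1 - D(G(z)))]$, can be estimated by Monte Carlo. A minimal sketch: the logistic discriminator, the shift generator, and the N(2, 1) "data" below are toy assumptions of mine, not anything from the talk.

```python
import numpy as np

rng = np.random.default_rng(0)

def D(x, a):
    # Toy discriminator: a logistic function of x with slope a.
    return 1.0 / (1.0 + np.exp(-a * x))

def G(z, b):
    # Toy generator: shifts the input noise by b.
    return z + b

def value(a, b, n=100_000):
    # Monte Carlo estimate of V(G, D) = E_x[log D(x)] + E_z[log(1 - D(G(z)))].
    x = rng.normal(2.0, 1.0, n)   # "data": N(2, 1)
    z = rng.normal(0.0, 1.0, n)   # input noise: N(0, 1)
    return np.log(D(x, a)).mean() + np.log(1.0 - D(G(z, b), a)).mean()

# D tries to drive V up; G tries to drive V down.
v_bad_G = value(a=1.0, b=-2.0)   # generator far from the data
v_good_G = value(a=1.0, b=2.0)   # generator matches the data distribution
```

With b = 2 the generator's samples are distributed exactly like the data, so this fixed D can no longer separate them and the value drops, which is what G wants.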

SLIDE 4

How long until GANs can do this?

(Figure: training examples alongside model samples.)

SLIDE 5

Progressive GANs

(Karras et al., 2017)

SLIDE 6

Spectrally Normalized GANs

(Miyato et al., 2017) (Figure: class-conditional samples, e.g. Welsh springer spaniel, palace, pizza.)

SLIDE 7

Building a bridge from simple to complex theoretical models

GANs in pdf space → GANs in generator function space → parameterized GANs → finite-sized GANs → limited-precision GANs

SLIDE 8

Building a bridge from intuition to theory

Basic idea of GANs → Is there an equilibrium? → Is it in the right place? → Do we converge to it? → How quickly?

SLIDE 9

Building the bridge

GANs in pdf space → GANs in generator function space → parameterized GANs → finite-sized GANs → limited-precision GANs

SLIDE 10

Optimizing over densities

(Figure: a generator function maps z to x; shown with data samples, D(x), and the generator density. Goodfellow et al., 2014.)
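When optimizing over densities, Goodfellow et al. (2014) show that for a fixed generator density the optimal discriminator is $D^*(x) = p_{\text{data}}(x) / (p_{\text{data}}(x) + p_g(x))$. A small sketch with two hypothetical Gaussian densities standing in for the data and generator distributions:

```python
import numpy as np

def gauss_pdf(x, mu, sigma):
    # Density of N(mu, sigma^2) evaluated pointwise.
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

x = np.linspace(-5.0, 5.0, 1001)
p_data = gauss_pdf(x, mu=1.0, sigma=1.0)   # toy data density
p_g = gauss_pdf(x, mu=-1.0, sigma=1.0)     # toy generator density

# Optimal discriminator in pdf space (Goodfellow et al., 2014):
d_star = p_data / (p_data + p_g)
```

Where the two densities cross (x = 0 in this toy setup), D* outputs exactly 1/2; deep in either tail it saturates toward 1 or 0.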

SLIDE 11

Tips and Tricks

  • A good strategy to simplify a model for theoretical purposes is to work in function space.
  • Binary or linear models are often too different from neural net models to provide useful theory.
  • Use convex analysis in this function space.
SLIDE 12

Results

  • Goodfellow et al., 2014:
    • A Nash equilibrium exists
    • The Nash equilibrium corresponds to recovering the data-generating distribution
    • Nested optimization converges
  • Kodali et al., 2017: simultaneous SGD converges
SLIDE 13

Building a bridge from simple to complex theoretical models

GANs in pdf space → GANs in generator function space → parameterized GANs → finite-sized GANs → limited-precision GANs

SLIDE 14

Non-Equilibrium Mode Collapse

  • D in the inner loop: convergence to the correct distribution
  • G in the inner loop: G places all its mass on the single most likely point

$$\min_G \max_D V(G, D) \neq \max_D \min_G V(G, D)$$

(Metz et al., 2016)
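The max-min side of this inequality can be seen directly: with D held fixed, the generator's term of V is linear in G's probabilities, so it is minimized by a point mass on the single point D rates most realistic. A toy sketch with hypothetical discriminator scores over four candidate points (the scores are made up for illustration):

```python
import numpy as np

# Hypothetical discrete support: a fixed discriminator score for each of 4 points.
d_scores = np.array([0.10, 0.30, 0.95, 0.40])

def generator_term(g_probs):
    # The generator's piece of V(G, D): E_{x ~ G}[log(1 - D(x))].
    return float(np.sum(g_probs * np.log(1.0 - d_scores)))

uniform = np.full(4, 0.25)                  # a spread-out generator
collapsed = np.zeros(4)
collapsed[np.argmax(d_scores)] = 1.0        # all mass on D's favorite point

# With D fixed, the inner-loop G prefers to collapse onto one mode:
# generator_term(collapsed) is lower than generator_term(uniform).
```

Since the objective is linear over the probability simplex, its minimum is always at a vertex, which is exactly mode collapse.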

SLIDE 15

Equilibrium Mode Collapse

(Figure: two generator mappings from z to x, illustrating mode collapse.) Neighbors in generator function space are worse (Appendix A1 of Unterthiner et al., 2017).

SLIDE 16

Building a bridge from simple to complex theoretical models

GANs in pdf space → GANs in generator function space → parameterized GANs → finite-sized GANs → limited-precision GANs

SLIDE 17

Simple Non-convergence Example

  • For scalar x and y, consider the value function $V(x, y) = xy$.
  • Does this game have an equilibrium? Where is it?
  • Consider the learning dynamics of simultaneous gradient descent with an infinitesimal learning rate (continuous time), and solve for the trajectory followed by these dynamics:

$$\frac{\partial x}{\partial t} = -\frac{\partial}{\partial x} V(x(t), y(t)), \qquad \frac{\partial y}{\partial t} = \frac{\partial}{\partial y} V(x(t), y(t))$$

SLIDE 18

Solution

This is the canonical example of a saddle point. There is an equilibrium, at x = 0, y = 0.

SLIDE 19

Solution

  • The dynamics are a circular orbit:

$$x(t) = x(0)\cos(t) - y(0)\sin(t), \qquad y(t) = x(0)\sin(t) + y(0)\cos(t)$$

  • Discrete-time gradient descent can spiral outward for large step sizes.
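Both claims are easy to check numerically: the continuous-time solution stays on a circle of constant radius, while each discrete simultaneous gradient step multiplies the radius by $\sqrt{1 + \eta^2} > 1$ and so spirals outward. A minimal sketch (the step size and iteration count are arbitrary choices of mine):

```python
import numpy as np

def simultaneous_gd(x, y, lr, steps):
    # V(x, y) = x * y: x descends its gradient (dV/dx = y),
    # y ascends its gradient (dV/dy = x), updated simultaneously.
    for _ in range(steps):
        x, y = x - lr * y, y + lr * x
    return x, y

x0, y0 = 1.0, 0.0
r0 = np.hypot(x0, y0)

# Discrete steps: the radius grows by a factor sqrt(1 + lr**2) per step.
x1, y1 = simultaneous_gd(x0, y0, lr=0.1, steps=200)

# Continuous-time closed form: a rotation, so the radius is conserved.
t = 20.0
xc = x0 * np.cos(t) - y0 * np.sin(t)
yc = x0 * np.sin(t) + y0 * np.cos(t)
```

After 200 discrete steps at lr = 0.1 the radius has grown by roughly $(1.01)^{100} \approx 2.7$, while the continuous trajectory is still exactly on the unit circle.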

SLIDE 20

Tips and Tricks

  • Use nonlinear dynamical systems theory to study the behavior of optimization algorithms.
  • Demonstrated and advocated especially by Nagarajan and Kolter, 2017.

SLIDE 21

Results

  • The good equilibrium is a stable fixed point (Nagarajan and Kolter, 2017)
  • Two-timescale updates converge (Heusel et al., 2017)
    • Their recommendation: use a different learning rate for G and D
    • My recommendation: decay your learning rate for G
  • Convergence is very inefficient (Mescheder et al., 2017)
SLIDE 22

Intuition for the Jacobian

With $g^{(i)}$ denoting player $i$'s gradient and $H^{(i)}$ its Hessian, the game Jacobian is

$$\begin{pmatrix} H^{(1)} & \nabla_{\theta^{(1)}} g^{(2)} \\ \nabla_{\theta^{(2)}} g^{(1)} & H^{(2)} \end{pmatrix}$$

  • $H^{(1)}$: how firmly does player 1 want to stay in place?
  • $H^{(2)}$: how firmly does player 2 want to stay in place?
  • $\nabla_{\theta^{(2)}} g^{(1)}$: how much can player 2 dislodge player 1?
  • $\nabla_{\theta^{(1)}} g^{(2)}$: how much can player 1 dislodge player 2?
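For the scalar game $V(x, y) = xy$ from the earlier slides, this Jacobian can be written down exactly: both diagonal (Hessian) blocks are zero and only the cross terms survive, giving purely imaginary eigenvalues, which is why the continuous dynamics rotate around the equilibrium instead of converging to it. A quick check (this example is mine, not from the slides):

```python
import numpy as np

# Simultaneous gradient play on V(x, y) = x * y gives the vector field
#   dx/dt = -dV/dx = -y,   dy/dt = +dV/dy = x.
# Its Jacobian at the equilibrium (0, 0):
J = np.array([[0.0, -1.0],
              [1.0,  0.0]])

# Purely imaginary eigenvalues (+i, -i): a rotation, with no attraction.
eigvals = np.linalg.eigvals(J)
```

A stable fixed point would need eigenvalues with negative real parts; here the real parts are exactly zero, matching the circular orbits seen earlier.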

SLIDE 23

What happens for GANs?

The same Jacobian with D and G as the two players: the generator's diagonal block is all zeros, because the optimal discriminator is constant. Locally, the generator does not have any "retaining force".

SLIDE 24

Building a bridge from simple to complex theoretical models

GANs in pdf space → GANs in generator function space → parameterized GANs → finite-sized GANs → limited-precision GANs

SLIDE 25

Does a Nash equilibrium exist, in the right place?

  • PDF space: yes
  • Generator function space: yes, but there can also be bad equilibria
  • What about for neural nets with a finite number of finite-precision parameters?
  • Arora et al., 2017: yes… for mixtures
    • Infinite mixture
    • Approximate an infinite mixture with a finite mixture
SLIDE 26

Open Challenges

  • Design an algorithm that avoids bad equilibria in generator function space, OR reparameterize the problem so that it does not have bad equilibria
  • Design an algorithm that converges rapidly to the equilibrium
  • Study the global convergence properties of the existing algorithms