Generative Adversarial Networks (GANs) Ian Goodfellow, OpenAI - - PowerPoint PPT Presentation

generative adversarial networks gans
SMART_READER_LITE
LIVE PREVIEW

Generative Adversarial Networks (GANs) Ian Goodfellow, OpenAI - - PowerPoint PPT Presentation

Generative Adversarial Networks (GANs) Ian Goodfellow, OpenAI Research Scientist Re-Work Deep Learning Summit San Francisco, 2017-01-26 Generative Modeling Density estimation Sample generation Training examples Model samples


slide-1
SLIDE 1

Generative Adversarial Networks (GANs)

Ian Goodfellow, OpenAI Research Scientist Re-Work Deep Learning Summit San Francisco, 2017-01-26

slide-2
SLIDE 2

(Goodfellow 2016)

Generative Modeling

  • Density estimation
  • Sample generation

Training examples Model samples

slide-3
SLIDE 3

(Goodfellow 2016)

Next Video Frame Prediction

Ground Truth MSE Adversarial

(Lotter et al 2016)

slide-4
SLIDE 4

(Goodfellow 2016)

iGAN

youtube (Zhu et al 2016) youtube (Brock et al 2016)

IAN

slide-5
SLIDE 5

(Goodfellow 2016)

Image to Image Translation

Input Ground truth Output

(Isola et al 2016)

Aerial to Map Labels to Street Scene

input

  • utput

input

  • utput
slide-6
SLIDE 6

(Goodfellow 2016)

Fully Visible Belief Nets

  • Explicit formula based on chain

rule:

  • Disadvantages:
  • O(n) sample generation cost
  • Generation not controlled by a

latent code

pmodel(x) = pmodel(x1)

n

Y

i=2

pmodel(xi | x1, . . . , xi−1)

(Frey et al, 1996) PixelCNN elephants (van den Ord et al 2016)

slide-7
SLIDE 7

(Goodfellow 2016)

WaveNet

Amazing quality Sample generation slow Two minutes to synthesize

  • ne second of audio
slide-8
SLIDE 8

(Goodfellow 2016)

Adversarial Nets Framework

x sampled from data Differentiable function D D(x) tries to be near 1 Input noise z Differentiable function G x sampled from model D D tries to make D(G(z)) near 0, G tries to make D(G(z)) near 1

slide-9
SLIDE 9

(Goodfellow 2016)

Vector Space Arithmetic

  • +

=

Man with glasses Man Woman Woman with Glasses (Radford et al, 2015)

slide-10
SLIDE 10

(Goodfellow 2016)

3D GAN

(Wu et al, 2016)

slide-11
SLIDE 11

(Goodfellow 2016)

OpenAI GAN-created images

slide-12
SLIDE 12

(Goodfellow 2016)

Problems with Counting

slide-13
SLIDE 13

(Goodfellow 2016)

Problems with Perspective

slide-14
SLIDE 14

(Goodfellow 2016)

Problems with Global Structure

slide-15
SLIDE 15

(Goodfellow 2016)

This one is real

slide-16
SLIDE 16

(Goodfellow 2016)

Semi-Supervised Classification

(Salimans et al 2016)

Model Test error rate for a given number of labeled samples 1000 2000 4000 8000 Ladder network [24] 20.40±0.47 CatGAN [14] 19.58±0.46 Our model 21.83±2.01 19.61±2.09 18.63±2.32 17.72±1.82 Ensemble of 10 of our models 19.22±0.54 17.25±0.66 15.59±0.47 14.87±0.89

Model Percentage of incorrectly predicted test examples for a given number of labeled samples 500 1000 2000 DGN [21] 36.02±0.10 Virtual Adversarial [22] 24.63 Auxiliary Deep Generative Model [23] 22.86 Skip Deep Generative Model [23] 16.61±0.24 Our model 18.44 ± 4.8 8.11 ± 1.3 6.16 ± 0.58 Ensemble of 10 of our models 5.88 ± 1.0

CIFAR-10 SVHN

slide-17
SLIDE 17

(Goodfellow 2016)

Learning interpretable latent codes / controlling the generation process

InfoGAN (Chen et al 2016)

slide-18
SLIDE 18

(Goodfellow 2016)

Plug and Play Generative Networks

(Nguyen et al 2016)

slide-19
SLIDE 19

(Goodfellow 2016)

PPGN for caption to image

(Nguyen et al 2016)

slide-20
SLIDE 20

(Goodfellow 2016)

GAN loss is a key ingredient

Raw data Reconstruction by PPGN Reconstruction by PPGN without GAN Images from Nguyen et al 2016 First observed by Dosovitskiy et al 2016

slide-21
SLIDE 21

(Goodfellow 2016)

StackGANs

This small blue bird has a short pointy beak and brown on its wings This bird is completely red with black wings and pointy beak A small sized bird that has a cream belly and a short pointed bill A small bird with a black head and wings and features grey wings

(Zhang et al 2016)

slide-22
SLIDE 22

(Goodfellow 2016)

Conclusion

  • GANs produce rich, realistic imagery
  • GANs learn to draw samples from a probability

distribution

  • Applications include learning from very few labeled

examples, interactive artwork generation, and differential privacy