pixelgan autoencoders
play

PixelGAN Autoencoders Alireza Makhzani, Brendan Frey Machine - PowerPoint PPT Presentation

PixelGAN Autoencoders Alireza Makhzani, Brendan Frey Machine learning Group University of Toronto CIFAR Deep Learning Summer School Montreal, Canada June 29 th , 2017 Alireza Makhzani PixelGAN Autoencoders 1 / 27 Outline 1. Background


  1. PixelGAN Autoencoders Alireza Makhzani, Brendan Frey Machine learning Group University of Toronto CIFAR Deep Learning Summer School Montreal, Canada June 29 th , 2017 Alireza Makhzani PixelGAN Autoencoders 1 / 27

  2. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 2 / 27

  3. PixelCNNs ✦ Learn the image statistics directly at the pixel level. ✦ Good at modelling low-level pixel statistics. ✦ Samples lack global structure. ✦ Lacking latent representation. ✦ Conditional PixelCNNs can learn conditional densities. van den Oord et al., 2016 Alireza Makhzani PixelGAN Autoencoders 3 / 27

  4. Variational Autoencoders log p ( x ) > � E q ( z | x ) [ � log( p ( x | z )] � KL( q ( z | x ) k p ( z )) ✦ Good at capturing the global structure, but samples are blurry. ✦ Learn hierarchical representations useful for down-stream tasks. ✦ Attempts at combining PixelCNN with VAEs: • PixelVAE ( Gulrajani et al., 2016 ) • VLAE ( Chen et al., 2017 ) Kingma et al., 2013 Alireza Makhzani PixelGAN Autoencoders 4 4 / 27

  5. Adversarial Autoencoders B D Code Space of MNIST: Gaussian Prior Mixture of Gaussians Makhzani et al., 2015 Alireza Makhzani PixelGAN Autoencoders 5 / 27

  6. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 6 / 27

  7. Limitations of Variational/Adversarial Autoencoders ✦ All the image statistics are captured by the single latent vector. VAE label, style p ( z ) Latent Variable z global and local Deterministic p ( x | z ) None (factorized Gaussians) x Alireza Makhzani PixelGAN Autoencoders 7 / 27

  8. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. p ( z ) Latent Variable z p ( x | z ) PixelCNN x Alireza Makhzani PixelGAN Autoencoders 8 / 27

  9. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. p ( z ) Latent Variable z GAN p ( x | z ) PixelCNN x Alireza Makhzani PixelGAN Autoencoders 9 / 27

  10. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. PixelGAN PixelGAN (Gaussian) (Categorical) Global Discrete p ( z ) Latent Variable z (low-frequency) (label) GAN Local Continuous p ( x | z ) PixelCNN (high-frequency) (Style) x Alireza Makhzani PixelGAN Autoencoders 10 / 27

  11. PixelGAN Autoencoders ✦ The image statistics are captured jointly by the latent vector and the autoregressive decoder. PixelGAN PixelGAN (Gaussian) (Categorical) Global Discrete p ( z ) Latent Variable z (low-frequency) (label) GAN Local Continuous p ( x | z ) PixelCNN (high-frequency) (Style) x Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 11 / 27

  12. PixelGAN Autoencoders Cost function of PixelGAN = Reconstruction + Adversarial Cost Alireza Makhzani PixelGAN Autoencoders 12 / 27

  13. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors Clustering Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 13 / 27

  14. Global vs. Local Decomposition Alireza Makhzani PixelGAN Autoencoders 14 / 27

  15. Code Space Code Space of MNIST: Alireza Makhzani PixelGAN Autoencoders 15 / 27

  16. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 16 / 27

  17. PixelGAN Autoencoders with Categorical Priors Alireza Makhzani PixelGAN Autoencoders 17 / 27

  18. Discrete vs. Continuous Decomposition (Clustering) Alireza Makhzani PixelGAN Autoencoders 18 / 27

  19. Discrete vs. Continuous Decomposition (Clustering) 0.3% Error rate Alireza Makhzani PixelGAN Autoencoders 19 / 27

  20. Unsupervised Clustering Alireza Makhzani PixelGAN Autoencoders 20 / 27

  21. Unsupervised Clustering Alireza Makhzani PixelGAN Autoencoders 21 / 27

  22. Unsupervised Clustering 5% Error rate Alireza Makhzani PixelGAN Autoencoders 22 / 27

  23. Outline 1. Background • PixelCNNs • Variational Autoencoders • Adversarial Autoencoders 2. PixelGAN Autoencoders • Gaussian Priors • Categorical Priors ✦ Clustering ✦ Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 23 / 27

  24. Semi-supervised Learning Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 24 / 27

  25. Semi-supervised Learning Semi-supervised Learning Alireza Makhzani PixelGAN Autoencoders 25 / 27

  26. Semi-supervised Classification Alireza Makhzani PixelGAN Autoencoders 26 / 27

  27. Thank you! Alireza Makhzani PixelGAN Autoencoders 27 / 27

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend