Pixel Recurrent Neural Networks
Aaron van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu Google Deepmind ICML'16
188 citations
Pixel Recurrent Neural Networks Aaron van den Oord, Nal - - PowerPoint PPT Presentation
Pixel Recurrent Neural Networks Aaron van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu Google Deepmind ICML'16 188 citations Pixel Recurrent Neural Networks 1. What is the task? 2. Other models: GAN, VAE, 3. PixelCNN model 4. Results
Aaron van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu Google Deepmind ICML'16
188 citations
for any image x
Higher p(x) is better.
denoising: input corrupted image,
density separation)
manifold of natural images)
variations in data
GAN Variational Autoencoder (VAE) Pixel CNN (This talk) Invertible models Real NVP Compute exact likelihood p(x)
Has latent variable z
Compute latent variable z (inference)
Stable training? (No mode collapse)
Sharp images?
𝑞 𝑦 = 𝑞 𝑦4 𝑦3, 𝑦2, 𝑦1 𝑞 𝑦3 𝑦2, 𝑦1 𝑞 𝑦2 𝑦1 𝑞 𝑦1
1. Order pixels 2. Imagine already generated pixels 1-6, want to predict pixel 7 3. Mask pixels 7-16 (set to 0) 4. CNN outputs normalized histogram for pixel 7 given pixel values 1-6 (maksed input)
Image from trainset 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Masked image 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 CNN INPUT OUTPUT
CIFAR-10. NLL = Negative log likelihood in bits per dimension (lower is better)
Lampert
https://www.cs.toronto.edu/~duvenaud/courses/csc2541/index.html Good course on Deep Generative Models (GAN, VAE, pixelCNN, Real NVP,…)