CMP784 DEEP LEARNING
Lecture #11: Variational Autoencoders
Aykut Erdem // Hacettepe University // Spring 2020


slide-1
SLIDE 1

Lecture #11 – Variational Autoencoders

Aykut Erdem // Hacettepe University // Spring 2020

CMP784

DEEP LEARNING

“latent” by Tom White

slide-2
SLIDE 2

Previously on CMP784

  • Supervised vs. Unsupervised

Representation Learning

  • Sparse Coding
  • Autoencoders
  • Autoregressive Generative Models

2

Artificial faces synthesized by StyleGAN (Nvidia)

slide-3
SLIDE 3

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)
  • Mechanics of VAEs
  • Separability of VAEs
  • Training of VAEs
  • Evaluating representations
  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

3

slide-4
SLIDE 4

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)

  • Mechanics of VAEs
  • Separability of VAEs
  • Training of VAEs
  • Evaluating representations
  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

4

slide-5
SLIDE 5

Recap: Autoencoders

  • Details of what goes inside the encoder and decoder matter!
  • Need constraints to avoid learning an identity.

5

Encoder Decoder

Input Image → Feature Representation

  • Feed-forward, bottom-up path (encoder)
  • Feed-back, generative, top-down path (decoder)

slide-6
SLIDE 6

Parameter space of autoencoder

  • Let’s examine the latent space of an AE.
  • Is there any separation of the different classes? If the AE learned the “essence” of the MNIST images, similar images should be close to each other.
  • Plot the latent space and examine the separation.
  • Here we plot the 2 PCA components of the latent space.

6 Image taken from A. Glassner, Deep Learning, Vol. 2: From Basics to Practice

slide-7
SLIDE 7

Traversing the latent space

  • We start at the start of the arrows in latent space and then move to the end of the arrows in 7 steps.
  • For each value of z we use the already trained decoder to produce an image.

7 Image taken from A. Glassner, Deep Learning, Vol. 2: From Basics to Practice
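The traversal described on this slide can be sketched in a few lines of NumPy. The decoder below is a hypothetical stand-in for the trained network (a fixed linear map followed by tanh), purely for illustration:

```python
import numpy as np

def traverse(z_start, z_end, decoder, steps=7):
    """Linearly interpolate between two latent points and decode each step."""
    ts = np.linspace(0.0, 1.0, steps)
    zs = [(1 - t) * z_start + t * z_end for t in ts]
    return [decoder(z) for z in zs]

# Toy "decoder" (an assumption, not the lecture's model):
# maps a 2-D latent point to a 4-pixel "image".
toy_decoder = lambda z: np.tanh(np.array([[1.0, 0.0], [0.0, 1.0],
                                          [1.0, 1.0], [1.0, -1.0]]) @ z)

images = traverse(np.array([-2.0, 0.0]), np.array([2.0, 0.0]), toy_decoder)
```

With a real trained decoder, each element of `images` would be one frame of the 7-step traversal shown on the slide.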

slide-8
SLIDE 8

Problems with Autoencoders

  • Gaps in the latent space
  • Discrete latent space
  • Separability in the latent space

8

slide-9
SLIDE 9

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)
  • Mechanics of VAEs

  • Separability of VAEs
  • Training of VAEs
  • Evaluating representations
  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

9

slide-10
SLIDE 10

Generative models

  • Imagine we want to generate data from a distribution, e.g.:

x ∼ p(x)    e.g. x ∼ N(µ, σ)

slide-11
SLIDE 11

Generative models

  • But how do we generate such samples?

z ∼ Unif(0, 1)

slide-12
SLIDE 12

Generative models

  • But how do we generate such samples?

z ∼ Unif(0, 1) x = ln z

slide-13
SLIDE 13

Generative models

  • In other words, if we choose z ∼ Uniform, then there is a mapping

    x = f(z)    such that    x ∼ p(x),    z ∼ q(z)

    where in general f is some complicated function.

  • We already know that neural networks are great at learning complex functions.

slide-14
SLIDE 14

Traditional Autoencoders

  • In traditional autoencoders, we can think of encoder and decoders as

some function mapping.

14

Encoder Decoder

z

! " = $(&) & = ℎ(")

slide-15
SLIDE 15

Variational Autoencoders

  • To go to variational autoencoders, we need to first add some

stochasticity and think of it as a probabilistic modeling.

15

Encoder Decoder

z

slide-16
SLIDE 16

Variational Autoencoders

16

Decoder

p(x̂ | z)

z

Sample from g(z) e.g. Standard Gaussian

z ∼ g(z)    x̂ = f(z)    x̂ ∼ p(x̂ | z)

slide-17
SLIDE 17

Variational Autoencoders

17

Encoder

z

Encoder

!" !#

Consider this to be the mean

  • f a normal $

Consider this to be the std of a normal % Randomly chosen value Latent value, z

Tr Tradit ditiona ional A l AE E Decode Va Variational AE

slide-18
SLIDE 18

Variational Autoencoders

18

slide-19
SLIDE 19

Variational Autoencoders

19

slide-20
SLIDE 20

Variational Autoencoders

20

512 neurons ReLU → 512 neurons ReLU → 256 neurons ReLU → 20 neurons ReLU → 256 neurons ReLU → 784 neurons ReLU

Centers, Spreads, Random Variable

slide-21
SLIDE 21

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)
  • Mechanics of VAEs
  • Separability of VAEs

  • Training of VAEs
  • Evaluating representations
  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

21

slide-22
SLIDE 22

Separability in Variational Autoencoders

  • Separability is not only between classes; we also want similar items in the same class to be near each other.
  • For example, there are different ways of writing “2”; we want similar styles to end up near each other.
  • Let’s examine the VAE: there is something magical happening once we add stochasticity in the latent space.

22

slide-23
SLIDE 23

Separability in Variational Autoencoders

23

Latent Space

Mean µ

SD σ

ENCODER DECODER

Encode the first sample (a “2”) and find μ1, σ1

slide-24
SLIDE 24

Separability in Variational Autoencoders

24

DECODER ENCODER

Latent Space

Mean µ

SD σ

Sample z" ∼ $(&", (")

slide-25
SLIDE 25

Blending Latent Variables

25

DECODER ENCODER

Latent Space

Mean µ

SD σ

Decode to ! "#

slide-26
SLIDE 26

Separability in Variational Autoencoders

26

Latent Space

Mean µ

SD σ

DECODER ENCODER

Encode the second sample (a “3”) and find μ2, σ2. Sample z2 ∼ N(μ2, σ2)

slide-27
SLIDE 27

Separability in Variational Autoencoders

27

Latent Space

Mean µ

SD σ

DECODER ENCODER

Decode to ! "#

slide-28
SLIDE 28

Separability in Variational Autoencoders

28

Latent Space

Mean µ

SD σ

DECODER ENCODER

Train with the first sample (a “2”) again and find μ1, σ1. However, z1 ∼ N(μ1, σ1) will not be the same. It can happen to be close to the “3” in latent space.
slide-29
SLIDE 29

Separability in Variational Autoencoders

29

Latent Space

Mean µ

SD σ

DECODER ENCODER

Decode to ! "#. Since the decoder only knows how to map from latent space to ! " space, it will return a “3”.

slide-30
SLIDE 30

Latent space starts to re-organize

Separability in Variational Autoencoders

30

Latent Space

Mean µ

SD σ

Train with 1st sample again

DECODER ENCODER

slide-31
SLIDE 31

Separability in Variational Autoencoders

31

Latent Space

Mean µ

SD σ

And again…

3 is pushed away

DECODER ENCODER

slide-32
SLIDE 32

Separability in Variational Autoencoders

32

Mean µ

SD σ

Many times…

DECODER ENCODER

Latent Space

slide-33
SLIDE 33

Separability in Variational Autoencoders

33

Mean µ

SD σ

Now lets test again

DECODER ENCODER

Latent Space

slide-34
SLIDE 34

Separability in Variational Autoencoders

34

Mean µ

SD σ

Training on 3’s again

DECODER ENCODER

Latent Space

slide-35
SLIDE 35

Separability in Variational Autoencoders

35

Latent Space

Mean µ

SD σ

Many times…

DECODER ENCODER

slide-36
SLIDE 36

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)
  • Mechanics of VAEs
  • Separability of VAEs
  • Training of VAEs

  • Evaluating representations
  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

36

slide-37
SLIDE 37

Training

37

x → Encoder → (μ, σ) → z → Decoder → x̂

Training means learning the encoder and decoder parameters.

  • Define a loss function ℒ
  • Use stochastic gradient descent (or Adam) to minimize ℒ

The loss function:

  • Reconstruction error: ℒR = ½ Σi ‖xi − x̂i‖²
  • Similarity between the probability of z given x, q(z|x), and some predefined probability distribution p(z), which can be computed with the Kullback–Leibler (KL) divergence: KL(q(z|x) ‖ p(z))

slide-38
SLIDE 38

Bayesian AE

38

x → Encoder → (μ, σ) → z → Decoder → x̂

Parameters of the model: the latent variable z.

Bayes rule: p(z | x) ∝ p(x | z) p(z)

The posterior for our parameters z is: p(z | x, x̂) ∝ p(x̂ | z, x) p(z)

Posterior predictive, the probability to see x̂ given x; this is INFERENCE:

    p(x̂ | x) = ∫ p(x̂ | z, x) p(z | x) dz

(Here p(x̂ | z, x) is the decoder NN, and p(z | x) is the posterior.)

slide-39
SLIDE 39

Bayesian AE

39

The posterior, ! " #, % # , can be sampled with MCMC, i.e. no minimization of Loss function. How?

  • 1. Set the priors, & "
  • 2. Define the likelihood, ! %

# ", #

  • 3. Propose a new z* and:
  • a. check if ! "∗ #, %

# /! " #, % # >1: accept, "∗

  • b. If ! "∗ #, %

# /! " #, % # <1 throw a random coin and accept/reject "∗

  • 4. This will converge to true ! " #, %

# !

  • 5. Calculate ! %

# # = ∫ ! % # ", # ! " # +" (Note: this is easily done with sample from z and re-weight given the likelihood)
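The accept/reject steps above are the Metropolis algorithm. A minimal NumPy sketch on a toy 1-D target (the log-posterior here is a hypothetical standard normal, not the AE's actual posterior):

```python
import numpy as np

def metropolis(log_post, z0, steps=20_000, prop_std=1.0, seed=0):
    """Random-walk Metropolis: accept z* with prob min(1, p(z*)/p(z))."""
    rng = np.random.default_rng(seed)
    z, samples = z0, []
    for _ in range(steps):
        z_star = z + prop_std * rng.normal()
        # Steps 3a/3b above: if the ratio > 1 this always accepts
        # (log u < 0); otherwise it accepts with prob p(z*)/p(z).
        if np.log(rng.uniform()) < log_post(z_star) - log_post(z):
            z = z_star
        samples.append(z)
    return np.array(samples)

# Toy target: a standard-normal "posterior" over z.
samples = metropolis(lambda z: -0.5 * z**2, z0=0.0)
```

After a burn-in period, the chain's samples follow the target distribution.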

slide-40
SLIDE 40

Variational AE

40

Problem: z has the dimensionality of your latent space, which can be too large. In other words, the integral ∫ p(x̂ | z, x) p(z | x) dz becomes intractable.

Instead we turn this into a minimization problem (variational calculus): find a q(z | x) that is similar to p(z | x) by minimizing their difference. After some math:

    −E_{z ∼ qφ(z|x)} [ log pθ(x | z) ] + KL( qφ(z|x) ‖ pθ(z) )

The first term is the reconstruction loss; the KL term says the proposal distribution should resemble a Gaussian.

Evidence Lower BOund (ELBO)
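For a diagonal-Gaussian encoder and a standard-normal prior, the KL term of the objective above has a closed form. A NumPy sketch (using squared error as the reconstruction term, which corresponds to one common choice of Gaussian decoder):

```python
import numpy as np

def kl_to_std_normal(mu, log_var):
    """Closed-form KL( N(mu, diag(exp(log_var))) || N(0, I) ), per sample."""
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=-1)

def neg_elbo(x, x_hat, mu, log_var):
    """Negative ELBO: squared-error reconstruction term plus KL regularizer."""
    recon = np.sum((x - x_hat) ** 2, axis=-1)
    return recon + kl_to_std_normal(mu, log_var)
```

When the encoder outputs exactly the prior (mu = 0, log_var = 0), the KL term vanishes, and a perfect reconstruction drives the first term to zero.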

slide-41
SLIDE 41

Training VAE

  • Apply stochastic gradient descent (SGD)

Problem:

  • Sampling step not differentiable
  • Use a re-parameterization trick
    – Move sampling to the input layer, so that the sampling step is independent of the model

41

slide-42
SLIDE 42

Reparametrization Trick

42

Encoder Decoder

z ! "

slide-43
SLIDE 43

Reparametrization Trick

43

Encoder Decoder

z ! "

# = ! + & ∘ "

slide-44
SLIDE 44

Reparametrization Trick

44

Encoder Decoder

z ! "

ε ∼ N(0, I)    z = μ + ε ∘ σ
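The trick above is a one-liner: the randomness lives in ε, an input to the network, so z is a deterministic, differentiable function of the encoder outputs μ and σ. A NumPy sketch:

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_latent(mu, sigma):
    """Reparameterization trick: z = mu + eps * sigma, with eps ~ N(0, I).

    Gradients w.r.t. mu and sigma flow through this expression; only eps
    is sampled, and it does not depend on the model parameters.
    """
    eps = rng.standard_normal(np.shape(mu))
    return mu + eps * sigma

mu, sigma = np.array([1.0, -1.0]), np.array([0.5, 0.5])
z = np.stack([sample_latent(mu, sigma) for _ in range(50_000)])
```

Averaged over many draws, z has mean μ and standard deviation σ, exactly as if it had been sampled from N(μ, σ²) directly.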

slide-45
SLIDE 45

Training VAE

45

Traditional AE: Input Image, Output Images, Difference. Variational AE: Input Image, Output Images.

slide-46
SLIDE 46

Latent space of VAE

  • More separable than AE
  • Because of the prior N(0, 1), everything is centered at (0, 0) with a spread of approximately 1.

46

slide-47
SLIDE 47

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)
  • Mechanics of VAEs
  • Separability of VAEs
  • Training of VAEs
  • Evaluating representations

  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

47

slide-48
SLIDE 48

Desiderata for representations

What do we want out of a representation? There are many possible answers. First, a few uncontroversial desiderata:

  • Interpretability: if the derived features are semantically meaningful and interpretable by a human, they can be easily evaluated (e.g. noisy-OR: “features” are diseases a patient has).

    Sparsity of a representation is an important subcase: the “explanatory” features for a sample can be examined if there are a small number of them.

  • Downstream usability: the features are “useful” for downstream tasks. An example, improving label efficiency: if, for a task, a linear (or otherwise “simple”) classifier can be trained on the features and works well, a smaller number of labeled samples is needed.

48

slide-49
SLIDE 49

Desiderata for representations

  • Obvious issue: interpretability and “usefulness” are not easily expressed mathematically. We need some “proxies” that induce such properties.

This is a lot more controversial – here we survey some general desiderata, proposed as early as Bengio-Courville-Vincent ’14:

  • Hierarchy/compositionality: video/images/text are expected to have hierarchical structure – depth helps induce such structure.

  • Semantic clusterability: features of the same ”semantic class” (e.g. images in the

same category) are clustered.

  • Linear interpolation: in representation space, linear interpolations produce

meaningful data points (i.e. ”latent space is convex”). Sometimes called manifold flattening.

  • Disentangling: features capture “independent factors of variation” of data. (Bengio-

Courville-Vincent ’14). Has been very popular in modern unsupervised learning, though many potential issues with it.

49

slide-50
SLIDE 50

Semantic clustering

  • Semantic clusterability: features of the same “semantic class”

(e.g. images in the same category) are clustered together.

50

The intuition: if semantic classes are linearly (or via some other simple function) separable, and labels on downstream tasks depend linearly on semantic classes, we can afford to learn a simple classifier!

t-SNE projection of VAE-learned features of the 10 MNIST classes. Image from https://pyro.ai/examples/vae.html

slide-51
SLIDE 51

Semantic clustering

  • Semantic clusterability: features of the same “semantic class”

(e.g. images in the same category) are clustered together.

51

t-SNE projection of word embeddings for artists (clustered by genre). Image from https://medium.com/free-code- camp/learn-tensorflow-the- word2vec-model-and-the-tsne-algorithm-using-rock-bands-97c99b5dcb3a

slide-52
SLIDE 52

Linear interpolation

  • Linear interpolation: in representation space, linear interpolations

produce meaningful data points. (i.e. “latent space is convex”)

52

The intuition: the data manifold is complicated/curved. The latent variable manifold is a convex set – moving in straight lines keeps us on it.

Interpolations for a VAE trained on MNIST.

slide-53
SLIDE 53

Linear interpolation

  • Linear interpolation: in representation space, linear interpolations

produce meaningful data points. (i.e. “latent space is convex”)

53

Interpolations for a BigGAN, image from https://thegradient.pub/bigganex-a-dive-into- the-latent-space-of-biggan/

slide-54
SLIDE 54

Disentangled representations

  • Disentangling: features capture “independent factors of variation” of data (Bengio-Courville-Vincent ’14).
  • For concreteness, let’s assume that we have a latent variable model for data with latent variables z, observables x, and joint distribution pθ(z, x).
  • There are (at least) two ways to formalize this:
    – Prior disentangling: pθ(z) is a product distribution. Classical example: ICA (independent component analysis).
    – Posterior disentangling: fit a variational posterior qφ(z|x) s.t. it is (on average over x) a product distribution. In other words, the average Ex[qφ(z|x)] – usually called the aggregate posterior – is close to a product distribution.

54

slide-55
SLIDE 55

Disentangled representations

  • Posterior disentangling in β-VAE. To produce the plots, infer the latent variables for an image, then change a single latent coordinate gradually.

55

Irina Higgins et al. β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. ICLR 2017.

slide-56
SLIDE 56

Prior disentangling

  • Prior disentangling: p(z) is a product distribution, i.e. p(z) = Πi p(zi). Classical example: ICA (independent component analysis), also called the “cocktail party problem”. Assume data is generated as x = Az with a mixing matrix A.

56

If z has an independent, non-Gaussian prior, the model is identifiable and efficiently learnable. (See, e.g., Frieze-Jerrum-Kannan ’96, Anandkumar et al. ’12.) Other examples: noisy-OR networks (diseases are independent), general Bayesian nets (viewing top variables as z’s), GANs, …

slide-57
SLIDE 57

Posterior disentanglement in VAEs

  • Recall the “regularization” view of the VAEs objective (a ”reconstruction” error term plus a ”regularization towards prior” term):

    E_{qφ(z|x)} [ log pθ(x|z) ] − KL( qφ(z|x) ‖ p(z) )

  • Consider a prior which is a product distribution (e.g. a standard Gaussian): the KL term implicitly penalizes distributions for which the KL to the prior is large – i.e. the aggregated posterior is far from a product distribution.

57

slide-58
SLIDE 58

Posterior disentanglement in VAEs

  • Recall the “regularization” view of the VAEs objective: the KL term regularizes the posterior towards the prior.
  • The idea of Higgins et al ’17: introduce a “weighting” factor β to put more weight on reconstruction or on disentanglement.

β-VAE objective:

    E_{qφ(z|x)} [ log pθ(x|z) ] − β · KL( qφ(z|x) ‖ p(z) )

58

slide-59
SLIDE 59

Posterior disentanglement in VAEs

59

Irina Higgins et al. β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. ICLR 2017.

slide-60
SLIDE 60

Posterior disentanglement in VAEs

60

Irina Higgins et al. β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. ICLR 2017.

slide-61
SLIDE 61

Posterior disentanglement in VAEs

61

Irina Higgins et al. β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. ICLR 2017.

slide-62
SLIDE 62

Measuring disentanglement

  • Metrics are typically defined assuming access to a dataset with K “ground-truth”

variation factors.

62

Generate a training set of samples as follows:
  1. Sample a batch of B samples as follows:
     a. Pick a ground-truth variation factor k uniformly at random from [K].
     b. Generate two sets of “ground truth” latent factors, v1, v2 ∈ R^K, s.t. (v1)k = (v2)k, and the other coordinates are independently, randomly sampled.
     c. Generate images x1, x2 from v1, v2.
     d. Infer latent variables z1, z2 using the model we are evaluating (e.g. the encoder in a VAE).
     e. Calculate the batch average z_avg of |z1 − z2|, and add (z_avg, k) to the training set.
  2. Train a linear predictor of k from z_avg on this training set, and evaluate its test performance.

BetaVAE metric: based on "linear separability" of factors
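The batch-generation step of this metric can be sketched in NumPy. The `encode` and `generate` functions are assumptions here (in a real evaluation they are the model's encoder and the ground-truth simulator); the toy world below makes latents equal the factors exactly, i.e. perfect disentanglement:

```python
import numpy as np

rng = np.random.default_rng(0)

def betavae_batch(encode, generate, K, batch=64):
    """One (z_avg, k) training example for the BetaVAE metric (a sketch)."""
    k = int(rng.integers(K))            # factor held fixed across each pair
    diffs = []
    for _ in range(batch):
        v1, v2 = rng.uniform(size=K), rng.uniform(size=K)
        v2[k] = v1[k]                   # enforce (v1)_k == (v2)_k
        diffs.append(np.abs(encode(generate(v1)) - encode(generate(v2))))
    return np.mean(diffs, axis=0), k    # z_avg is small in coordinate k

# Toy world: images ARE the factors, and the encoder is the identity.
z_avg, k = betavae_batch(encode=lambda x: x, generate=lambda v: v, K=5)
```

In this idealized case the coordinate for the fixed factor is exactly zero, so even a trivial linear classifier recovers k, which is why the metric reads high for disentangled models.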

slide-63
SLIDE 63

Measuring disentanglement

  • Intuition: since factor k is held fixed, the coordinate of z_avg corresponding to k should be smaller than the others, so the linear classifier can “focus” on k.

  • Many variants of this exist. (e.g. FactorVAE, mutual information gap, etc.)

63

Generate a training set of samples as follows:
  1. Sample a batch of B samples as follows:
     a. Pick a ground-truth variation factor k uniformly at random from [K].
     b. Generate two sets of “ground truth” latent factors, v1, v2 ∈ R^K, s.t. (v1)k = (v2)k, and the other coordinates are independently, randomly sampled.
     c. Generate images x1, x2 from v1, v2.
     d. Infer latent variables z1, z2 using the model we are evaluating (e.g. the encoder in a VAE).
     e. Calculate the batch average z_avg of |z1 − z2|, and add (z_avg, k) to the training set.
  2. Train a linear predictor of k from z_avg on this training set, and evaluate its test performance.

BetaVAE metric: based on "linear separability" of factors

slide-64
SLIDE 64

Measuring disentanglement

  • Locatello et al ’19, “Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations” (Best Paper Award at ICML ’19): a large-scale study of disentanglement measures, as well as generative models.

64

slide-65
SLIDE 65

Usefulness of disentanglement?

  • Downstream classification task: predict true ground-truth factors

(w/ multiclass logistic regression)

  • Be careful not to extrapolate too much – the task/setup is a little contrived.

65

Locatello et al. Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations. ICML 2019.

slide-66
SLIDE 66

Usefulness of disentanglement?

  • Statistical efficiency measure: average accuracy based on 100 samples

divided by the average accuracy based on 10,000 samples

66

Locatello et al. Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations. ICML 2019.

slide-67
SLIDE 67

Issue of ill-posedness?

  • Similar issues plague disentangling as plague “flat minima”: a model can be re-parametrized s.t. the distribution over the data is unchanged, but the representation is arbitrarily more “entangled”.
  • Thus, some kind of inductive bias, both on the model class and the data, seems necessary.
  • As a simple example: consider z ∼ N(0, I) and let z′ = Uz for any non-identity orthogonal matrix U; the distribution of z′ is unchanged.
  • Then, under any “intuitive” understanding of entangling, z′ seems entangled with z – small changes of coordinates of z cause global changes in z′.

67

Locatello et al. Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations. ICML 2019.

slide-68
SLIDE 68

Lecture overview

  • Motivation for Variational Autoencoders (VAEs)
  • Mechanics of VAEs
  • Separability of VAEs
  • Training of VAEs
  • Evaluating representations
  • Vector Quantized Variational Autoencoders (VQ-VAEs)

Disclaimer: Much of the material and slides for this lecture were borrowed from

—Pavlos Protopapas, Mark Glickman and Chris Tanner's Harvard CS109B class
—Andrej Risteski's CMU 10707 class
—David McAllester's TTIC 31230 class

68

slide-69
SLIDE 69

Gaussian VAEs 2013

69

Sample z ∼ N(0, I) and compute yΦ(z).

[Alec Radford]

slide-70
SLIDE 70

Vector Quantized VAEs (VQ-VAE) 2019

70

VQ-VAE-2, Razavi et al., NeurIPS 2019

slide-71
SLIDE 71

Vector Quantized VAEs (VQ-VAE) 2019

71

VQ-VAE-2, Razavi et al., NeurIPS 2019

slide-72
SLIDE 72

Vector Quantized VAEs (VQ-VAE)

  • VQ-VAEs effectively perform k-means on vectors in the model so as

to represent vectors by discrete cluster centers.

  • For concreteness we will consider VQ-VAEs on images with a single

layer of quantization.

  • We use x and y for spatial image coordinates and use s (for signal) to

denote images.

72

slide-73
SLIDE 73

VQ-VAE Encoder-Decoder

  • We train a dictionary C[K, I], where C[k, I] is the center vector of cluster k.
  • The “symbolic image” z[X, Y] is the latent variable.

73

L[X, Y, I] = EncΦ(s)
z[x, y] = argmin_k ‖L[x, y, I] − C[k, I]‖
L̂[x, y, I] = C[z[x, y], I]
ŝ = DecΦ(L̂[X, Y, I])
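The nearest-codebook lookup above is the whole quantization step. A small NumPy sketch (with a tiny hand-made codebook and a 1×2 "image" of feature vectors, both purely illustrative):

```python
import numpy as np

def quantize(L, C):
    """VQ lookup: z[x, y] = argmin_k ||L[x, y] - C[k]||.

    L: (X, Y, I) encoder output; C: (K, I) codebook of cluster centers.
    Returns the symbolic image z and the quantized field L_hat = C[z].
    """
    d = np.linalg.norm(L[:, :, None, :] - C[None, None, :, :], axis=-1)
    z = d.argmin(axis=-1)          # (X, Y) discrete latent
    return z, C[z]                 # L_hat[x, y] = C[z[x, y]]

C = np.array([[0.0, 0.0], [1.0, 1.0]])       # two codes in R^2
L = np.array([[[0.1, -0.1], [0.9, 1.2]]])    # a 1x2 "image" of vectors
z, L_hat = quantize(L, C)
```

The decoder then sees only L_hat, i.e. each spatial position is replaced by its nearest codebook center.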

slide-74
SLIDE 74

VQ-VAE Training Loss

  • We preserve information about the image s by minimizing the distortion between L[X, Y, I] and its reconstruction L̂[X, Y, I] (and between s and ŝ):

74

Φ* = argmin_Φ E_s [ β ‖L[X, Y, I] − L̂[X, Y, I]‖² + ‖s − ŝ‖² ]

slide-75
SLIDE 75

Parameter-Specific Learning Rates

  • For the gradient of ‖L[X, Y, I] − L̂[X, Y, I]‖² = Σ_{x,y} ‖L[x, y, I] − C[z[x, y], I]‖² they use:

    for x, y:  L[x, y, I].grad += 2β (L[x, y, I] − C[z[x, y], I])
    for x, y:  C[z[x, y], I].grad += 2 (C[z[x, y], I] − L[x, y, I])

  • This gives a parameter-specific learning rate for the dictionary C[K, I].
  • Parameter-specific learning rates do not change the stationary points (the points where the gradients are zero).

75

slide-76
SLIDE 76

The Relationship to K-means

  • At a stationary point of the update

for x, y: C[z[x, y], I].grad += 2(C[z[x, y], I] − L[x, y, I])

we get that C[k, I] is the mean of the set of vectors L[x, y, I] with z[x, y] = k (as in K-means).

76
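A quick numerical check of this stationary-point claim, with made-up vectors and assignments: setting each center to the mean of its assigned vectors makes the codebook gradient from the previous slide vanish.

```python
import numpy as np

rng = np.random.default_rng(1)
L = rng.normal(size=(6, 3))          # six latent vectors (stand-ins)
z = np.array([0, 0, 1, 1, 1, 0])     # assumed cluster assignments
# set each center C[k] to the mean of its assigned vectors
C = np.stack([L[z == k].mean(axis=0) for k in range(2)])

# gradient of sum_i ||C[z_i] - L_i||^2 w.r.t. C[k], as on the slide:
# 2 * sum over assigned i of (C[k] - L_i)
grad = np.stack([2.0 * (C[k] - L[z == k]).sum(axis=0) for k in range(2)])
print(np.allclose(grad, 0.0))        # True: cluster means are stationary
```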

slide-77
SLIDE 77

Straight Through Gradients

  • The latent variables are discrete so some approximation to SGD must be used.
  • The authors use “straight-through” gradients:

for x, y: L[x, y, I].grad += L̂[x, y, I].grad

  • This assumes low distortion between L[X, Y, I] and L̂[X, Y, I].

77
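A minimal manual-backprop sketch of the straight-through trick: the gradient computed at the quantized L̂ is copied unchanged to the encoder output L, as if the (non-differentiable) argmin were the identity. The quadratic loss and all shapes are placeholders.

```python
import numpy as np

rng = np.random.default_rng(2)
L = rng.normal(size=(4, 4, 8))       # encoder output (stand-in)
C = rng.normal(size=(16, 8))         # codebook (stand-in)
z = np.linalg.norm(L[:, :, None] - C[None, None], axis=-1).argmin(-1)
L_hat = C[z]                         # quantized latents

target = rng.normal(size=L.shape)    # pretend downstream loss ||L_hat - target||^2
L_hat_grad = 2.0 * (L_hat - target)  # exact gradient w.r.t. L_hat

# straight-through: the quantizer blocks gradients, so pass them
# through unchanged to the encoder output
L_grad = L_hat_grad.copy()
```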

slide-78
SLIDE 78

Training Phase II

  • Once the model is trained we can sample images s and compute the “symbolic image” z[X, Y].
  • Given samples of symbolic images we can learn an autoregressive model of these symbolic images using a pixel-CNN.
  • This yields a prior probability distribution PΦ(z[X, Y]) which provides a tighter upper bound on the rate.
  • We can then measure compression and distortion for test images. This is something GANs cannot do.

78
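Under any prior over symbolic images, the rate in bits is −log₂ of the probability the prior assigns. In this toy sketch a unigram prior stands in for the pixel-CNN; a conditional model that exploits context would assign higher probability and hence a tighter (lower) rate.

```python
import numpy as np

z = np.array([0, 1, 1, 2, 1, 0, 1])      # a toy flattened symbolic image
# fit a unigram prior over symbols (a pixel-CNN would condition on
# previously generated symbols instead)
counts = np.bincount(z, minlength=3)
p = counts / counts.sum()

rate_bits = -np.log2(p[z]).sum()         # code length of z under the prior
```

Since the prior matches the empirical symbol frequencies, this rate is below the log₂(3) bits per symbol a uniform code would need.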

slide-79
SLIDE 79

Multi-Layer Vector Quantized VAEs

79

slide-80
SLIDE 80

Quantitative Evaluation

  • The VQ-VAE2 paper reports a classification accuracy score (CAS) for class-conditional image generation.
  • We generate image-class pairs from the generative model trained on the ImageNet training data.
  • We then train an image classifier from the generated pairs and measure its accuracy on the ImageNet test set.
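The CAS protocol can be sketched end-to-end on toy data. Everything here is a stand-in: two well-separated Gaussians play the role of ImageNet classes, "generated" pairs play the role of samples from the generative model, and a nearest-centroid rule plays the role of the image classifier.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_pairs(n, shift=4.0):
    """Two well-separated Gaussian 'classes' standing in for image-class pairs."""
    X = np.concatenate([rng.normal(0.0, 1.0, (n, 2)),
                        rng.normal(shift, 1.0, (n, 2))])
    y = np.repeat([0, 1], n)
    return X, y

X_gen, y_gen = sample_pairs(200)     # pairs from the generative model
X_test, y_test = sample_pairs(200)   # real held-out test set

# train a classifier on the GENERATED pairs only...
centroids = np.stack([X_gen[y_gen == k].mean(axis=0) for k in (0, 1)])
# ...and measure its accuracy on the REAL test set
pred = np.linalg.norm(X_test[:, None] - centroids[None], axis=-1).argmin(-1)
cas = (pred == y_test).mean()        # classification accuracy score
```

If the generative model captures the class-conditional distributions well, a classifier trained on its samples transfers to real data and CAS is high.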

80

slide-81
SLIDE 81

Direct Rate-Distortion Evaluation

  • Rate-distortion metrics for image compression to discrete representations support unambiguous rate-distortion evaluation.
  • Rate-distortion metrics also allow one to explore the rate-distortion trade-off.

81

slide-82
SLIDE 82

Image Compression

82

slide-83
SLIDE 83

Vector Quantization (Emergent Symbols)

  • Vector quantization represents a distribution (or density) on vectors with a discrete set of embedded symbols.
  • Vector quantization optimizes a rate-distortion tradeoff for vector compression.
  • The VQ-VAE uses vector quantization to construct a discrete representation of images and hence a measurable image compression rate-distortion trade-off.

83

slide-84
SLIDE 84

Symbols: A Better Learning Bias

  • Do the objects of reality fall into categories?
  • If so, shouldn’t a learning architecture be designed to categorize?
  • Whole image symbols would yield emergent whole image classification.

84

slide-85
SLIDE 85

Symbols: Improved Interpretability

  • Vector quantization shifts interpretation from linear threshold units to the emergent symbols.

  • This seems related to the use of t-SNE as a tool in interpretation.

85

slide-86
SLIDE 86

Symbols: Unifying Vision and Language

  • Modern language models use word vectors.
  • Word vectors are embedded symbols.
  • Vector quantization also results in models based on embedded symbols.

86

slide-87
SLIDE 87

Symbols: Addressing the “Forgetting” Problem

  • When we learn to ski we do not forget how to ride a bicycle.
  • However, when a model is trained on a first task, retraining on a second task degrades performance on the first (the model “forgets”).
  • But embedded symbols can be task specific.
  • The embedding of a task-specific symbol will not change when training on a different task.

87

slide-88
SLIDE 88

Symbols: Improved Transfer Learning

  • Embedded symbols can be domain specific.
  • Separating domain-general parameters from domain-specific parameters may improve transfer between domains.

88

slide-89
SLIDE 89

89

Next lecture: Self-Supervised Learning