SLIDE 1

Variational Autoencoders

Tom Fletcher March 25, 2019

SLIDE 2

Talking about this paper:

Diederik Kingma and Max Welling, Auto-Encoding Variational Bayes, In International Conference on Learning Representation (ICLR), 2014.

SLIDE 3

Autoencoders

Diagram: input x ∈ ℝ^D → latent space z ∈ ℝ^d → output x′ ∈ ℝ^D, where d ≪ D.

SLIDE 4

Autoencoders

◮ Linear activation functions give you PCA
◮ Training (sketched in code below):

  1. Given data x, feedforward to the output x′
  2. Compute a loss, e.g., L(x, x′) = ‖x − x′‖²
  3. Backpropagate the loss gradient to update the weights

◮ Not a generative model!
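
A minimal sketch of this training loop. PyTorch and the layer sizes are my own illustrative choices; the slides don't name a framework:

    import torch
    import torch.nn as nn

    D, d = 784, 32  # illustrative input/latent dimensions

    encoder = nn.Sequential(nn.Linear(D, 256), nn.ReLU(), nn.Linear(256, d))
    decoder = nn.Sequential(nn.Linear(d, 256), nn.ReLU(), nn.Linear(256, D))
    opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()))

    x = torch.randn(64, D)                       # stand-in for a data batch

    x_prime = decoder(encoder(x))                # 1. feedforward to x'
    loss = ((x - x_prime) ** 2).sum(1).mean()    # 2. L(x, x') = ||x - x'||^2
    opt.zero_grad()
    loss.backward()                              # 3. backpropagate the loss gradient
    opt.step()                                   #    and update the weights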

SLIDE 5

Variational Autoencoders

Diagram: input x ∈ ℝ^D → encoder outputs (µ, σ²) → latent z ∼ N(µ, σ²) → output x′ ∈ ℝ^D.

SLIDE 6

Generative Models

[Graphical model: parameters θ and latent z generate the observed x]

Sample a new x in two steps:

  • Prior: p(z)
  • Generator: pθ(x | z)

Now the analogy to the “encoder” is the posterior: p(z | x)
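
In code, two-step sampling might look like the sketch below; the standard normal prior matches the paper, while the decoder architecture and Gaussian observation noise are illustrative assumptions:

    import torch
    import torch.nn as nn

    d, D = 32, 784
    decoder = nn.Sequential(nn.Linear(d, 256), nn.ReLU(), nn.Linear(256, D))

    z = torch.randn(1, d)                     # Prior: z ~ p(z) = N(0, I)
    x = decoder(z) + 0.1 * torch.randn(1, D)  # Generator: x ~ p_theta(x | z),
                                              # here Gaussian with mean decoder(z)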

SLIDE 7

Posterior Inference

Posterior via Bayes’ Rule:

p(z | x) = pθ(x | z) p(z) / ∫ pθ(x | z) p(z) dz

Integral in denominator is (usually) intractable! Could use Monte Carlo to approximate, but it’s expensive
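
The expensive Monte Carlo route would estimate the denominator p(x) = ∫ pθ(x | z) p(z) dz by averaging over prior samples. A rough sketch, assuming the illustrative Gaussian generator from the previous slide:

    import torch
    import torch.nn as nn

    d, D, sigma = 32, 784, 0.1
    decoder = nn.Sequential(nn.Linear(d, 256), nn.ReLU(), nn.Linear(256, D))

    def mc_log_evidence(x, S=10_000):
        # log p(x) ~= log (1/S) sum_s p_theta(x | z^(s)), z^(s) ~ p(z) = N(0, I).
        # Needs a huge S: most prior samples z explain x poorly.
        z = torch.randn(S, d)
        log_px_given_z = (-0.5 * ((x - decoder(z)) / sigma) ** 2).sum(-1) \
            - D * torch.log(torch.tensor(sigma * (2 * torch.pi) ** 0.5))
        return torch.logsumexp(log_px_given_z, 0) - torch.log(torch.tensor(float(S)))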

SLIDE 8

Kullback-Leibler Divergence

DKL(q ‖ p) = −∫ q(z) log [ p(z) / q(z) ] dz = Eq[ −log(p/q) ]

The average information gained in moving from p to q.
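
A quick numeric sanity check of this definition, comparing PyTorch's closed-form KL between two (arbitrarily chosen) univariate Gaussians against the Monte Carlo average Eq[−log(p/q)]:

    import torch
    from torch.distributions import Normal, kl_divergence

    q = Normal(loc=1.0, scale=0.5)
    p = Normal(loc=0.0, scale=1.0)

    print(kl_divergence(q, p))                     # closed form, ~0.818
    z = q.sample((100_000,))                       # z ~ q
    print((q.log_prob(z) - p.log_prob(z)).mean())  # E_q[log q - log p], ~0.818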
SLIDE 9

Variational Inference

Approximate the intractable posterior p(z | x) with a manageable distribution q(z).

Minimize the KL divergence: DKL(q(z) ‖ p(z | x))

SLIDE 10

Evidence Lower Bound (ELBO)

DKL(q(z) ‖ p(z | x)) = Eq[ −log ( p(z | x) / q(z) ) ]
= Eq[ −log ( p(z, x) / (q(z) p(x)) ) ]
= Eq[ −log p(z, x) + log q(z) + log p(x) ]
= −Eq[log p(z, x)] + Eq[log q(z)] + log p(x)

Rearranging: log p(x) = DKL(q(z) ‖ p(z | x)) + L[q(z)]

ELBO: L[q(z)] = Eq[log p(z, x)] − Eq[log q(z)]

Since DKL ≥ 0, the ELBO is a lower bound on the log evidence: L[q(z)] ≤ log p(x).

SLIDE 11

Variational Autoencoder

Encoder network: qφ(z | x)

Decoder network: pθ(x | z)

Maximize ELBO:

L(θ, φ, x) = Eqφ[log pθ(x, z) − log qφ(z | x)]
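
A sketch of the two networks, assuming PyTorch with fully connected layers (both illustrative choices). Following the next slides, the encoder outputs the parameters (µ, log σ²) of qφ(z | x) rather than a code z directly:

    import torch
    import torch.nn as nn

    D, d, H = 784, 32, 256  # illustrative sizes

    class Encoder(nn.Module):               # q_phi(z | x)
        def __init__(self):
            super().__init__()
            self.hidden = nn.Sequential(nn.Linear(D, H), nn.ReLU())
            self.mu = nn.Linear(H, d)       # mean of q_phi(z | x)
            self.log_var = nn.Linear(H, d)  # log sigma^2 of q_phi(z | x)
        def forward(self, x):
            h = self.hidden(x)
            return self.mu(h), self.log_var(h)

    class Decoder(nn.Module):               # p_theta(x | z)
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(nn.Linear(d, H), nn.ReLU(), nn.Linear(H, D))
        def forward(self, z):
            return self.net(z)              # mean of p_theta(x | z)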

SLIDE 12

VAE ELBO

L(θ, φ, x) = Eqφ[log pθ(x, z) − log qφ(z | x)]
= Eqφ[log pθ(z) + log pθ(x | z) − log qφ(z | x)]
= Eqφ[ log ( pθ(z) / qφ(z | x) ) + log pθ(x | z) ]
= −DKL(qφ(z | x) ‖ pθ(z)) + Eqφ[log pθ(x | z)]

Problem: the gradient ∇φEqφ[log pθ(x | z)] is intractable! Use a Monte Carlo (score-function) approximation, sampling z^(s) ∼ qφ(z | x):

∇φEqφ[log pθ(x | z)] ≈ (1/S) Σ_{s=1}^S log pθ(x | z^(s)) ∇φ log qφ(z^(s) | x)
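
A rough sketch of this score-function (REINFORCE) estimator, reusing the Encoder/Decoder sketched above and an illustrative Gaussian likelihood; in practice this estimator has very high variance, which motivates the reparameterization trick on the next slides:

    import torch

    def score_function_surrogate(x, enc, dec, S=10):
        # backward() on this surrogate yields the Monte Carlo gradient
        # (1/S) sum_s log p_theta(x | z^(s)) * grad_phi log q_phi(z^(s) | x)
        mu, log_var = enc(x)
        std = torch.exp(0.5 * log_var)
        surrogate = 0.0
        for _ in range(S):
            z = torch.normal(mu, std).detach()  # z^(s) ~ q_phi(z | x), no grad path
            log_q = (-0.5 * ((z - mu) / std) ** 2 - torch.log(std)).sum()
            log_p = -0.5 * ((x - dec(z)) ** 2).sum()  # Gaussian log-lik, up to constants
            surrogate = surrogate + log_p.detach() * log_q
        return surrogate / S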

SLIDE 13

Reparameterization Trick

What about the other term?

−DKL(qφ(z | x) ‖ pθ(z))

This says the encoder qφ(z | x) should make the code z look like the prior distribution. Instead of encoding z directly, encode the parameters of a normal distribution, N(µ, σ²); sampling z = µ + σ ⊙ ε with ε ∼ N(0, I) then keeps z differentiable in µ and σ, which is the reparameterization trick.
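
A minimal sketch of that reparameterized sample; the dimension and parameter values are placeholders:

    import torch

    mu = torch.zeros(32, requires_grad=True)       # encoder output (placeholder)
    log_var = torch.zeros(32, requires_grad=True)  # encoder output (placeholder)

    eps = torch.randn(32)                     # eps ~ N(0, I): all the randomness
    z = mu + torch.exp(0.5 * log_var) * eps   # z ~ N(mu, sigma^2), differentiable
    z.sum().backward()                        # gradients reach mu and log_var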

SLIDE 14

Reparameterization Trick

qφ(z_j | x^(i)) = N(µ_j^(i), (σ_j^(i))²)     pθ(z) = N(0, I)

The KL divergence between these two is:

DKL(qφ(z | x^(i)) ‖ pθ(z)) = −(1/2) Σ_{j=1}^d [ 1 + log((σ_j^(i))²) − (µ_j^(i))² − (σ_j^(i))² ]
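
In code, this closed-form term combines with a reparameterized reconstruction term to give the full (negative) ELBO. A sketch assuming the Encoder/Decoder above and an illustrative Gaussian reconstruction loss:

    import torch

    def kl_to_standard_normal(mu, log_var):
        # D_KL(q_phi(z | x) || N(0, I))
        #   = -(1/2) * sum_j [1 + log(sigma_j^2) - mu_j^2 - sigma_j^2]
        return -0.5 * (1 + log_var - mu ** 2 - log_var.exp()).sum(-1)

    def negative_elbo(x, enc, dec):
        mu, log_var = enc(x)
        eps = torch.randn_like(mu)
        z = mu + torch.exp(0.5 * log_var) * eps   # reparameterized sample
        recon = 0.5 * ((x - dec(z)) ** 2).sum(-1) # -log p_theta(x | z), up to constants
        return (recon + kl_to_standard_normal(mu, log_var)).mean()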

SLIDE 15

Results from Kingma & Welling