SLIDE 1
Lecture 19: Variational Autoencoders
Scribes: Ankur Bambhanoliya, Donald Hamnett
SLIDE 2 Motivation: Inferring Latent Variables from Images
Dataset: MNIST; 60k images of handwritten digits.
Goal: Infer two variables:
1. Digit labels $y_n \in \{0, \ldots, 9\}$
2. Style variables $z_n \in \mathbb{R}^D$
SLIDE 3 Deep Generative Models
Idea 1: Use a neural network to define a generative model.

Assume all digits are equally frequent.

Generative model:
  Digit (can supervise):  $y_n \sim \mathrm{Discrete}(0.1, \ldots, 0.1)$
  Style:                  $z_n \sim \mathrm{Normal}(0, I)$
  Image:                  $x_n \sim \mathrm{Bernoulli}(\mu(y_n, z_n; \theta))$
where $\mu$ is a neural network with weights $\theta$.

Questions:
1. How do we train this? (no supervision for $z_n$)
2. How do we do inference?
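As a concrete sketch, ancestral sampling from this generative model can be written in a few lines of numpy. The one-layer network standing in for $\mu(y_n, z_n; \theta)$ is a hypothetical placeholder (random weights, illustrative sizes), not the lecture's actual architecture:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 2        # style dimension
P = 784      # pixels in a 28x28 MNIST image

# Hypothetical decoder parameters theta: a single linear layer + sigmoid.
theta = {"W": rng.normal(0, 0.01, size=(P, 10 + D)),
         "b": np.zeros(P)}

def mu(y, z, theta):
    """Bernoulli means mu(y_n, z_n; theta) for one digit/style pair."""
    onehot = np.zeros(10)
    onehot[y] = 1.0
    logits = theta["W"] @ np.concatenate([onehot, z]) + theta["b"]
    return 1.0 / (1.0 + np.exp(-logits))   # sigmoid -> values in (0, 1)

# Ancestral sampling: y_n ~ Discrete(0.1, ..., 0.1), z_n ~ Normal(0, I),
# x_n ~ Bernoulli(mu(y_n, z_n; theta)).
y_n = rng.integers(0, 10)                  # uniform over the 10 digits
z_n = rng.normal(size=D)
x_n = rng.binomial(1, mu(y_n, z_n, theta)) # one binary image
```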
SLIDE 4 Training Deep Generative Models
Idea 2: Use stochastic gradient ascent on a lower bound to approximate maximum likelihood estimation.

Minimizing the KL divergence between the true data distribution and the generative model with network weights $\theta$ is equivalent to maximizing the expected log likelihood:

$\mathrm{KL}(p_{\mathrm{data}}(x) \,\|\, p(x; \theta)) = -\mathbb{E}_{p_{\mathrm{data}}}[\log p(x; \theta)] + \mathbb{E}_{p_{\mathrm{data}}}[\log p_{\mathrm{data}}(x)]$

The second term does not depend on $\theta$, so we ascend the Monte Carlo estimate

$\nabla_\theta\, \mathbb{E}_{p_{\mathrm{data}}}[\log p(x; \theta)] \simeq \frac{1}{N} \sum_{n=1}^{N} \nabla_\theta \log p(x_n; \theta), \qquad x_n \sim p_{\mathrm{data}}(x),$

where the $x_n$ are samples from an unknown data distribution.
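A toy instance of this recipe, as a sketch: here the "generative model" is just a single Bernoulli with mean $\mathrm{sigmoid}(\theta)$ (a stand-in for the deep model, chosen so the gradient is one line), but the update is exactly stochastic gradient ascent on $\log p(x_n; \theta)$ over samples from the data distribution:

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.binomial(1, 0.8, size=5000)    # x_n ~ p_data (unknown to the model)

theta, lr = 0.0, 0.1
for step in range(2000):
    x = data[rng.integers(len(data))]     # draw one data point
    p = 1.0 / (1.0 + np.exp(-theta))      # model mean sigmoid(theta)
    grad = x - p                          # d/dtheta log Bernoulli(x; sigmoid(theta))
    theta += lr * grad                    # stochastic gradient ascent

p_hat = 1.0 / (1.0 + np.exp(-theta))      # approaches the data mean
```

The maximum likelihood solution for this toy model is the empirical mean of the data, so `p_hat` should land close to 0.8.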
SLIDE 5 Training Deep Generative Models
Idea 2 (continued): Use stochastic gradient ascent on a lower bound to approximate maximum likelihood estimation.

Introduce a proposal / variational distribution $q(y, z \mid x; \phi)$ that depends on $x$, and define the lower bound

$\mathcal{L}(\theta, \phi) = \mathbb{E}_{p_{\mathrm{data}}}\!\left[ \mathbb{E}_{q(y, z \mid x; \phi)}\!\left[ \log \frac{p(x, y, z; \theta)}{q(y, z \mid x; \phi)} \right] \right] = \mathbb{E}_{p_{\mathrm{data}}}\!\left[ \log p(x; \theta) \right] - \mathbb{E}_{p_{\mathrm{data}}}\!\left[ \mathrm{KL}(q(y, z \mid x; \phi) \,\|\, p(y, z \mid x; \theta)) \right]$

The bound is tight, $\mathcal{L}(\theta, \phi) = \mathbb{E}_{p_{\mathrm{data}}}[\log p(x; \theta)]$, when $q(y, z \mid x; \phi) = p(y, z \mid x; \theta)$.

$\nabla_\theta \mathcal{L}(\theta, \phi) = \mathbb{E}_{p_{\mathrm{data}}}\!\left[ \mathbb{E}_{q(y, z \mid x; \phi)}\!\left[ \nabla_\theta \log p(x, y, z; \theta) \right] \right] \simeq \frac{1}{N} \sum_{n=1}^{N} \nabla_\theta \log p(x_n, y_n, z_n; \theta), \qquad y_n, z_n \sim q(y, z \mid x_n; \phi)$
SLIDE 6 Training Deep Generative Models
Idea 3: Use stochastic gradient ascent to perform variational inference.

Combining 1 + 2: Perform gradient ascent on both $\theta$ and $\phi$:
1. Max Likelihood: $\nabla_\theta \mathcal{L}(\theta, \phi)$, so that $\theta^* = \arg\max_\theta \log p_\theta(x)$ (learn the generative model)
2. Variational Inference: $\nabla_\phi \mathcal{L}(\theta, \phi)$, so that $\phi^* = \arg\min_\phi \mathrm{KL}(q(y, z \mid x; \phi) \,\|\, p(y, z \mid x; \theta))$ (approximate the posterior)
SLIDE 7 Training Deep Generative Models
Idea 4: Use a neural network to define the inference model (a.k.a. the variational distribution).

Inference model (mapping from image $x_n$ back to digit and style):
  Digit:  $y_n \sim \mathrm{Discrete}(\pi(x_n; \phi))$
  Style:  $z_n \sim \mathrm{Normal}(\mu(x_n, y_n; \phi), \sigma^2(x_n, y_n; \phi))$
where $\pi$, $\mu$, and $\sigma$ are neural networks.

Variational distribution: $q(y, z \mid x; \phi) = \prod_n q(y_n, z_n \mid x_n; \phi)$
SLIDE 8 Variational Autoencoders
Objective: Learn a deep generative model and a corresponding inference model by maximizing

$\mathcal{L}(\theta, \phi) = \mathbb{E}_{p_{\mathrm{data}}}\!\left[ \mathbb{E}_{q(y, z \mid x; \phi)}\!\left[ \log \frac{p(x, y, z; \theta)}{q(y, z \mid x; \phi)} \right] \right]$
SLIDE 9 Intermezzo: Autoencoders
$x_n \to z_n$ (all continuous)
SLIDE 10 Encoder: Mapping from image $x$ to latent code $z$
Layer sizes: $784 \to 256 \to 2$

Multi-layer Perceptron:
  Hidden units:    $h_{n,i} = \sigma\!\left( \sum_j W^h_{ij} x_{n,j} + b^h_i \right)$
  Latent outputs:  $z_{n,i} = \sum_j W^z_{ij} h_{n,j} + b^z_i$  (linear map to the outputs)
where $\sigma(\cdot)$ is the activation function.
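The two encoder equations can be sketched directly in numpy. The layer sizes follow the slide; the weights are random placeholders rather than trained values, and tanh is an assumed choice of activation:

```python
import numpy as np

rng = np.random.default_rng(0)
sigma = np.tanh                      # activation function sigma(.)

# Random placeholder weights for a 784 -> 256 -> 2 encoder.
W_h, b_h = rng.normal(0, 0.05, (256, 784)), np.zeros(256)
W_z, b_z = rng.normal(0, 0.05, (2, 256)), np.zeros(2)

def encode(x):
    h = sigma(W_h @ x + b_h)         # h_{n,i} = sigma(sum_j W^h_{ij} x_{n,j} + b^h_i)
    return W_z @ h + b_z             # z_{n,i} = sum_j W^z_{ij} h_{n,j} + b^z_i (linear)

x_n = rng.binomial(1, 0.5, size=784).astype(float)   # a fake binary image
z_n = encode(x_n)                    # 2-dimensional latent code
```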
SLIDE 11 Decoder: Mapping from code $z$ to image $x$
Layer sizes: $2 \to 256 \to 784$

Multi-layer Perceptron:
  Hidden units:   $h_{n,i} = \sigma\!\left( \sum_j W^h_{ij} z_{n,j} + b^h_i \right)$
  Pixel outputs:  $\hat{x}_{n,i} = \sigma\!\left( \sum_j W^x_{ij} h_{n,j} + b^x_i \right)$

Loss: Binary Cross-Entropy

$\mathcal{L} = -\sum_n \sum_i \left[ x_{n,i} \log \hat{x}_{n,i} + (1 - x_{n,i}) \log (1 - \hat{x}_{n,i}) \right]$

(Minimize with SGD)
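A matching sketch of the decoder and the binary cross-entropy loss, mirroring the encoder's sizes in reverse ($2 \to 256 \to 784$); again the weights are random placeholders, so the loss value is only a sanity check, not a trained result:

```python
import numpy as np

rng = np.random.default_rng(1)
sigma = lambda a: 1.0 / (1.0 + np.exp(-a))   # logistic activation, outputs in (0, 1)

# Random placeholder weights for a 2 -> 256 -> 784 decoder.
W_h, b_h = rng.normal(0, 0.05, (256, 2)), np.zeros(256)
W_x, b_x = rng.normal(0, 0.05, (784, 256)), np.zeros(784)

def decode(z):
    h = sigma(W_h @ z + b_h)                 # hidden units
    return sigma(W_x @ h + b_x)              # xhat_{n,i} in (0, 1)

def bce(x, xhat):
    # L = - sum_i [ x_i log xhat_i + (1 - x_i) log(1 - xhat_i) ]
    return -np.sum(x * np.log(xhat) + (1 - x) * np.log(1 - xhat))

x_n = rng.binomial(1, 0.5, size=784).astype(float)
loss = bce(x_n, decode(rng.normal(size=2)))  # scalar to minimize with SGD
```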
SLIDE 12 Autoencoder: Learned Latent Codes
SLIDE 13 Variational Autoencoder: Treat $z$ as a latent variable
SLIDE 14 Variational Autoencoder: Treat $z$ as a latent variable

Inference Model (Encoder): $q(z_n \mid x_n; \phi)$
  $h_n = \sigma(W^h x_n + b^h)$
  $\mu_n = W^\mu h_n + b^\mu$
  $\sigma_n = \exp(W^\sigma h_n + b^\sigma)$
  $z_n \sim \mathrm{Normal}(\mu_n, \sigma_n^2)$

Generative Model (Decoder): $p(x_n, z_n; \theta)$
  $z_n \sim \mathrm{Normal}(0, I)$
  $h_n = \sigma(W^h z_n + b^h)$
  $\mu_n = \sigma(W^\mu h_n + b^\mu)$
  $x_n \sim \mathrm{Bernoulli}(\mu_n)$

(Here $\sigma(\cdot)$ applied to a vector denotes the activation function; $\sigma_n$ is the standard deviation.)

Objective: $\mathcal{L}(\theta, \phi) = \mathbb{E}_{q(z \mid x; \phi)}\!\left[ \log \frac{p(x, z; \theta)}{q(z \mid x; \phi)} \right]$
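The Gaussian encoder above can be sketched as follows; the weights are random placeholders and tanh is an assumed activation. The `exp` on the last layer is what keeps the standard deviation positive:

```python
import numpy as np

rng = np.random.default_rng(0)
act = np.tanh                        # assumed activation function

# Random placeholder weights for a 784 -> 256 -> 2 Gaussian encoder.
W_h, b_h = rng.normal(0, 0.05, (256, 784)), np.zeros(256)
W_mu, b_mu = rng.normal(0, 0.05, (2, 256)), np.zeros(2)
W_s, b_s = rng.normal(0, 0.05, (2, 256)), np.zeros(2)

def encode(x):
    h = act(W_h @ x + b_h)           # h_n = act(W^h x_n + b^h)
    mu = W_mu @ h + b_mu             # mu_n = W^mu h_n + b^mu
    s = np.exp(W_s @ h + b_s)        # sigma_n = exp(W^sigma h_n + b^sigma) > 0
    return mu, s

x_n = rng.binomial(1, 0.5, size=784).astype(float)
mu_n, s_n = encode(x_n)
z_n = rng.normal(mu_n, s_n)          # z_n ~ Normal(mu_n, sigma_n^2)
```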
SLIDE 15 Variational Autoencoder: Learned Latent Codes
SLIDE 16 Autoencoder vs Variational Autoencoder
SLIDE 17 Training: The Reparameterization Trick
Goal: Compute the gradient on a batch of images

$x^b \sim \mathrm{Uniform}(\{x_1, \ldots, x_N\})$

$\nabla_\phi \mathcal{L}(\theta, \phi) \simeq \frac{1}{B} \sum_{b=1}^{B} \nabla_\phi\, \mathbb{E}_{q(z \mid x^b; \phi)}\!\left[ \log \frac{p(x^b, z; \theta)}{q(z \mid x^b; \phi)} \right]$

Analogue of BBVI: the REINFORCE estimator

$\nabla_\phi \mathcal{L}(\theta, \phi) \simeq \frac{1}{B} \frac{1}{S} \sum_{b=1}^{B} \sum_{s=1}^{S} \nabla_\phi \log q(z^{b,s} \mid x^b; \phi) \, \log \frac{p(x^b, z^{b,s}; \theta)}{q(z^{b,s} \mid x^b; \phi)}, \qquad z^{b,s} \sim q(z \mid x^b; \phi)$

Problem: this estimator will have very high variance.
SLIDE 18 Training: The Reparameterization Trick
Idea: Sample $z^{b,s}$ using a reparameterized distribution

$\epsilon^{b,s} \sim \mathrm{Normal}(0, I)$
$z^{b,s} = \mu(x^b; \phi) + \sigma(x^b; \phi)\, \epsilon^{b,s} \quad \Rightarrow \quad z^{b,s} \sim q(z \mid x^b; \phi)$

where $\mu$ and $\sigma$ are neural networks.

Result: Reparameterized estimator

$\nabla_\phi \mathcal{L}(\theta, \phi) \simeq \frac{1}{B} \frac{1}{S} \sum_{b=1}^{B} \sum_{s=1}^{S} \nabla_\phi \log \frac{p(x^b, z(\epsilon^{b,s}; \phi); \theta)}{q(z(\epsilon^{b,s}; \phi) \mid x^b; \phi)}$

In practice: $S = 1$ is enough.
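The key identity above, that $\mu + \sigma \epsilon$ with $\epsilon \sim \mathrm{Normal}(0, I)$ has exactly the distribution $q(z \mid x; \phi) = \mathrm{Normal}(\mu, \sigma^2)$, can be checked empirically. Here $\mu$ and $\sigma$ are fixed placeholder constants standing in for the encoder's outputs:

```python
import numpy as np

rng = np.random.default_rng(0)
mu = np.array([0.5, -1.0])           # stand-in for mu(x^b; phi)
s = np.array([2.0, 0.3])             # stand-in for sigma(x^b; phi)

S = 100_000
eps = rng.normal(size=(S, 2))        # eps^{b,s} ~ Normal(0, I)
z = mu + s * eps                     # z^{b,s} = mu + sigma * eps

# Empirically z matches Normal(mu, sigma^2): its sample mean approaches mu
# and its sample standard deviation approaches sigma. The randomness now
# lives in eps, so the path from phi to z stays differentiable.
mean_err = np.abs(z.mean(axis=0) - mu).max()
std_err = np.abs(z.std(axis=0) - s).max()
```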
SLIDE 19 Variational Autoencoders
Objective: Learn a deep generative model and a corresponding inference model by maximizing

$\mathcal{L}(\theta, \phi) = \mathbb{E}_{p_{\mathrm{data}}}\!\left[ \mathbb{E}_{q(y, z \mid x; \phi)}\!\left[ \log \frac{p(x, y, z; \theta)}{q(y, z \mid x; \phi)} \right] \right]$
SLIDE 20
Continuous: $p_\theta(x_n, z_n)$, $q_\phi(z_n \mid x_n)$  ($z$ encodes both style & digit)
Continuous + Discrete: $p_\theta(x_n, y_n, z_n)$, $q_\phi(y_n, z_n \mid x_n)$  ($z$ encodes style)
SLIDE 21 Disentangled Representations
Goal: Learn Interpretable Features
[Figure: latent traversals, varying a single dimension $z_{n,i} \in \{-3, 0, 3\}$]