Slide 1

MIWAE: Deep Generative Modelling and Imputation of Incomplete Data Sets

Pierre-Alexandre Mattei

IT University of Copenhagen
http://pamattei.github.io/ · @pamattei
ICML 2019

Joint work with Jes Frellsen (ITU Copenhagen)

Slide 2

How to handle missing data with deep generative models?

Let (x_i, z_i)_{i ≤ n} be i.i.d. random variables driven by a deep generative model:

  • z ∼ p(z)  (prior)
  • x ∼ pθ(x | z)  (observation model)

[Graphical model: a plate over the n samples, with latent variable z, observed variable x, and decoder parameters θ]

Assume that some of the training data are missing at random (MAR). We can then split each sample i ∈ {1, …, n} into

  • the observed features x_i^o, and
  • the missing features x_i^m.

1. Can we train pθ in a VAE fashion in spite of the missingness?
2. Can we impute the missing values?
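To ground the notation, here is a minimal PyTorch sketch of such a model, assuming a standard Gaussian prior and a fully factorised Gaussian observation model, with missingness carried by a binary mask. The dimensions, layer sizes, and the 50% MCAR mask are illustrative assumptions, not the exact architecture from the paper.

```python
import torch
import torch.nn as nn
import torch.distributions as td

d, p = 2, 10  # latent and data dimensions (illustrative)

# z ~ N(0, I_d): the prior
prior = td.Independent(td.Normal(torch.zeros(d), torch.ones(d)), 1)

# p_theta(x | z): a factorised Gaussian whose mean and scale come
# from a small decoder network (sizes are illustrative)
decoder = nn.Sequential(
    nn.Linear(d, 128), nn.Tanh(),
    nn.Linear(128, 2 * p),  # first p outputs: mean, last p: log-scale
)

def gaussian_params(out):
    """Split a network output into the mean and scale of the Gaussian."""
    return out[..., :p], out[..., p:].exp()

# Missingness is encoded by a binary mask: 1 = observed (x^o), 0 = missing (x^m)
x = torch.randn(5, p)                            # toy batch of 5 samples
mask = torch.bernoulli(0.5 * torch.ones(5, p))   # 50% MCAR, as in the experiments
x = x * mask                                     # zero out the missing entries
```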
Slide 3

1. Can we train pθ in a VAE fashion in spite of the missingness?

Under the MAR assumption, the relevant quantity to maximise is the likelihood of the observed data:

  ℓ^o(θ) = Σ_{i=1}^n log pθ(x_i^o) = Σ_{i=1}^n log ∫ pθ(x_i^o | z) p(z) dz.

Building on the importance weighted autoencoder (IWAE) of Burda et al. (2016), we derive a tractable stochastic lower bound of ℓ^o(θ), the missing IWAE (MIWAE) bound:

  L_K(θ, γ) = Σ_{i=1}^n E_{z_{i1}, …, z_{iK} ∼ qγ(z | x_i^o)} [ log (1/K) Σ_{k=1}^K pθ(x_i^o | z_{ik}) p(z_{ik}) / qγ(z_{ik} | x_i^o) ] ≤ ℓ^o(θ).

As for the IWAE, the MIWAE bound gets tighter as the number K of importance weights grows.
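As a concrete illustration, here is how the MIWAE bound could be estimated for one mini-batch, continuing the sketch from Slide 2 (it reuses `decoder`, `prior`, and `gaussian_params`). The inference network, its sizes, and the choice of encoding the zero-imputed input are illustrative assumptions; the paper's exact setup may differ.

```python
import math

# q_gamma(z | x^o): an inference network over the zero-imputed input
# (one simple way to handle variable missingness patterns)
encoder = nn.Sequential(
    nn.Linear(p, 128), nn.Tanh(),
    nn.Linear(128, 2 * d),  # mean and log-scale of the Gaussian posterior
)

def miwae_bound(x, mask, K=20):
    """Monte Carlo estimate of the MIWAE bound L_K for one batch.

    x    : (B, p) data with missing entries set to zero
    mask : (B, p) binary, 1 = observed, 0 = missing
    """
    out = encoder(x)
    q = td.Independent(td.Normal(out[..., :d], out[..., d:].exp()), 1)

    z = q.rsample((K,))                        # K importance samples, (K, B, d)
    mu_x, sigma_x = gaussian_params(decoder(z))
    px_given_z = td.Normal(mu_x, sigma_x)

    # log p_theta(x^o | z): only observed coordinates contribute
    log_px = (px_given_z.log_prob(x) * mask).sum(-1)     # (K, B)
    log_w = log_px + prior.log_prob(z) - q.log_prob(z)   # log importance weights

    # log (1/K) sum_k w_k, averaged over the batch
    return (torch.logsumexp(log_w, dim=0) - math.log(K)).mean()
```

Training then amounts to stochastic gradient ascent on this bound (e.g. minimising `-miwae_bound(x, mask)`); with K = 1 the objective reduces to a standard ELBO on the observed data.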


Slide 4

2. Can we impute the missing values?

For the single imputation problem, we use self-normalised importance sampling to approximate E[x^m | x^o]:

  E[x^m | x^o] ≈ Σ_{l=1}^L w_l x^m_{(l)},

where (x^m_{(1)}, z_{(1)}), …, (x^m_{(L)}, z_{(L)}) are i.i.d. samples from pθ(x^m | x^o, z) qγ(z | x^o) and

  w_l = r_l / (r_1 + … + r_L),  with  r_l = pθ(x^o | z_{(l)}) p(z_{(l)}) / qγ(z_{(l)} | x^o).

Multiple imputation, i.e. sampling from pθ(x^m | x^o), can be done using sampling importance resampling according to the weights w_l, for large L.
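In code, the same self-normalised scheme might look as follows, continuing the illustrative model above; the value of `L` and the helper names are assumptions.

```python
def impute(x, mask, L=1000):
    """Single imputation: approximate E[x^m | x^o] and fill in the gaps.

    Returns x with each missing entry replaced by its estimated
    conditional mean under the illustrative model above.
    """
    out = encoder(x)
    q = td.Independent(td.Normal(out[..., :d], out[..., d:].exp()), 1)
    z = q.rsample((L,))                        # z_(1), ..., z_(L), (L, B, d)
    mu_x, sigma_x = gaussian_params(decoder(z))
    px_given_z = td.Normal(mu_x, sigma_x)

    # r_l = p_theta(x^o | z_(l)) p(z_(l)) / q_gamma(z_(l) | x^o)
    log_px = (px_given_z.log_prob(x) * mask).sum(-1)    # (L, B)
    log_r = log_px + prior.log_prob(z) - q.log_prob(z)
    w = torch.softmax(log_r, dim=0)            # self-normalised weights, (L, B)

    # x^m_(l) ~ p_theta(x^m | x^o, z_(l)), which factorises given z
    xm = px_given_z.sample()                   # (L, B, p)
    xm_hat = (w.unsqueeze(-1) * xm).sum(0)     # sum_l w_l x^m_(l), (B, p)
    return x * mask + xm_hat * (1 - mask)
```

For multiple imputation, one could instead resample the x^m_{(l)} according to the same weights (sampling importance resampling), e.g. `idx = torch.multinomial(w.T, num_samples=20, replacement=True)`.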

Slide 5

Single imputation of UCI data sets (50% MCAR)

            Banknote       Breast         Concrete       Red            White          Yeast
MIWAE       0.446 (0.038)  0.280 (0.021)  0.501 (0.040)  0.643 (0.026)  0.735 (0.033)  0.964 (0.057)
MVAE        0.593 (0.059)  0.318 (0.018)  0.587 (0.026)  0.686 (0.120)  0.782 (0.018)  0.997 (0.064)
missForest  0.676 (0.040)  0.291 (0.026)  0.510 (0.11)   0.697 (0.050)  0.798 (0.019)  1.41 (0.02)
PCA         0.682 (0.016)  0.729 (0.068)  0.938 (0.033)  0.890 (0.033)  0.865 (0.024)  1.05 (0.061)
kNN         0.744 (0.033)  0.831 (0.029)  0.962 (0.034)  0.981 (0.037)  0.929 (0.025)  1.17 (0.048)
Mean        1.02 (0.032)   1.00 (0.04)    1.01 (0.035)   1.00 (0.03)    1.00 (0.02)    1.06 (0.052)

Mean-squared error for single imputation for various continuous UCI data sets.
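As an aside, an imputation metric like this one would be computed over the missing entries only; here is a minimal sketch under the mask convention used above (the helper name `imputation_mse` is ours).

```python
def imputation_mse(x_true, x_imputed, mask):
    """Mean-squared error restricted to the missing entries (mask == 0)."""
    miss = 1.0 - mask
    return ((x_true - x_imputed) ** 2 * miss).sum() / miss.sum()
```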

Slide 6

Imputation of incomplete versions of binary MNIST

Single imputations: [image grid]
Multiple imputations: [image grid]

Slide 7

Classification of binary MNIST (50% MCAR pixels)

                            Test accuracy     Test cross-entropy
Zero imputation             0.9739 (0.0018)   0.1003 (0.0092)
missForest imputation       0.9805 (0.0018)   0.0645 (0.0066)
MIWAE single imputation     0.9847 (0.0009)   0.0510 (0.0035)
MIWAE multiple imputation   0.9870 (0.0003)   0.0396 (0.0003)
Complete data               0.9866 (0.0007)   0.0464 (0.0026)

Learn more about MIWAE at poster 9 in the Pacific Ballroom at 6:30! Thanks for your attention :)