

SLIDE 1

Towards Disentangled Representations via Variational Sparse Coding

LatinX in AI Research Workshop - ICML 2019

Robert Aduviri and Alfredo De La Fuente

Pontifical Catholic University of Peru, Skolkovo Institute of Science and Technology

SLIDE 2

Table of contents

  • 1. Motivation
  • 2. Research Problem
  • 3. Technical Contribution
  • 4. Current Results
  • 5. Next Steps

SLIDE 3

Motivation

SLIDE 4

Representation Learning

  • Simple machine learning algorithms depend heavily on the representation of the data they are given.
  • The process of designing the right representation for a specific task is commonly known as feature engineering.
  • An alternative to hand-designing these representations is to learn them automatically.

SLIDE 5

Representation Learning

  • Lower dimensional representation of raw data (images, text, etc.).
  • Efficiently sample from a high-dimensional data distribution.
  • Latent space with meaningful properties.

Generative Models to the rescue!

SLIDE 7

Variational AutoEncoders (VAE)

Proposed by Kingma & Welling (2013) and Rezende et al. (2014):

$$\mathcal{L}(\theta, \phi) = \mathbb{E}_{q_\phi(z|x)}\left[\log p_\theta(x|z)\right] - \mathrm{KL}\left(q_\phi(z|x) \,\|\, p(z)\right) \quad (1)$$

How expressive can a Gaussian latent prior distribution be?
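As a sketch of Eq. 1, both terms can be written in closed form for the usual modeling choices: a diagonal Gaussian posterior against a standard normal prior, and Bernoulli pixel likelihoods. The function names below are illustrative, not from the paper:

```python
import numpy as np

def gaussian_kl(mu, log_var):
    """KL(q(z|x) || p(z)) in closed form for a diagonal Gaussian
    posterior against a standard normal prior (second term of Eq. 1)."""
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=-1)

def bernoulli_recon_log_lik(x, x_logits):
    """log p(x|z) for binary pixels, evaluated at one sampled z
    (first term of Eq. 1); x_logits are decoder outputs before sigmoid."""
    return np.sum(x * x_logits - np.logaddexp(0.0, x_logits), axis=-1)

def neg_elbo(x, x_logits, mu, log_var):
    # The VAE is trained by minimizing the negative of Eq. 1.
    return gaussian_kl(mu, log_var) - bernoulli_recon_log_lik(x, x_logits)
```

When the posterior equals the prior (μ = 0, log σ² = 0), the KL term vanishes, which is one way to see how the prior regularizes the latent space.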

SLIDE 8

VAE vs AE

Both the reconstruction loss and the KL divergence are necessary to produce a smooth latent representation of the data.

SLIDE 9

VAE latent codes distribution

SLIDE 10

Disentanglement

Disentanglement is the complex task of learning representations that separate the underlying structure of the world into disjoint parts of the representation.

Scheme from the paper “Towards a Definition of Disentangled Representations” by Higgins et al. (2018)

SLIDE 11

β-VAE

Proposed by Higgins et al. (2017) as a constrained version of the VAE to discover disentangled latent factors:

$$\mathcal{L}_\beta(\theta, \phi) = \mathbb{E}_{q_\phi(z|x)}\left[\log p_\theta(x|z)\right] - \beta\, \mathrm{KL}\left(q_\phi(z|x) \,\|\, p(z)\right) \quad (2)$$

Azimuth (orientation) traversal comparison.
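Eq. 2 differs from Eq. 1 only in the β weight on the KL term, so a loss function can expose it as a single parameter. A minimal sketch (the function name and the default β value are illustrative, not the paper's):

```python
import numpy as np

def beta_vae_objective(recon_log_lik, mu, log_var, beta=4.0):
    # Closed-form KL(q||p) for a diagonal Gaussian posterior vs. N(0, I).
    kl = 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=-1)
    # Eq. 2: the KL term is scaled by beta. beta = 1 recovers the plain
    # VAE (Eq. 1); beta > 1 trades reconstruction quality for a posterior
    # closer to the isotropic prior, encouraging more factorized codes.
    return recon_log_lik - beta * kl
```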

SLIDE 12

dSprites Dataset

Created by Matthey et al. (2017) as a way to assess the disentanglement properties of unsupervised learning methods. These 2D shapes were procedurally generated from 6 ground truth independent latent factors: color, shape, scale, rotation, x and y positions of a sprite.

SLIDE 13

Research Problem

SLIDE 14

Learning Disentangled Representations

We aim to tackle the following challenges:

  • Meaningful low-dimensional representations of images.
  • Interpretable and disentangled features in the latent space.
  • Quantitative and qualitative evaluation of disentanglement.

SLIDE 17

Technical Contribution

SLIDE 18

Variational Sparse Coding (VSC)

Tonolini et al. (2019) suggest the use of a Spike-and-Slab prior p(z):

$$p_s(z) = \prod_{j=1}^{J} \left( \alpha\, \mathcal{N}(z_j; 0, 1) + (1 - \alpha)\, \delta(z_j) \right) \quad (3)$$

which leads to a recognition function in the form of a discrete mixture model:

$$q_\phi(z|x_i) = \prod_{j=1}^{J} \left( \gamma_{i,j}\, \mathcal{N}(z_{i,j}; \mu_{z,i,j}, \sigma^2_{z,i,j}) + (1 - \gamma_{i,j})\, \delta(z_{i,j}) \right) \quad (4)$$

The model captures subjectively understandable sources of variation.
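Ancestral sampling from the prior in Eq. 3 is straightforward: draw a Bernoulli(α) mask per dimension and fill the active dimensions with standard normal draws, leaving the rest exactly zero. A minimal numpy sketch (function and argument names are illustrative):

```python
import numpy as np

def sample_spike_and_slab(alpha, J, n_samples, rng=None):
    """Draw z ~ p_s(z) from Eq. 3: each of the J dimensions is an
    independent N(0, 1) "slab" with probability alpha, and exactly
    zero (the delta "spike") with probability 1 - alpha."""
    rng = np.random.default_rng(rng)
    spike_mask = rng.random((n_samples, J)) < alpha   # Bernoulli(alpha) mask
    slab = rng.standard_normal((n_samples, J))
    return np.where(spike_mask, slab, 0.0)
```

With a small α, most latent dimensions are exactly zero in any given sample, which is the sparsity the VSC model exploits.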

SLIDE 19

Convolutional encoder/decoder

A convolutional architecture was used for the encoder/decoder of the VAE and VSC for comparison, based on the configuration used by Higgins et al. (2017).

Figure 1: Convolutional architecture used for VAE and VSC

SLIDE 20

Current Results

SLIDE 21

Latent Codes Comparison

Figure 2: Reconstruction and latent codes of Convolutional VSC (left) (α = 0.01, β = 2) and Convolutional VAE (right) (β = 2) models with the dSprites dataset.

SLIDE 22

Latent Space Traversal via VSC

Figure 3: Latent traversals on MNIST (left) and Fashion-MNIST (right).
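The traversals shown in these figures follow a generic recipe: fix a base code, sweep one latent dimension over a range of values, and decode each variant. A sketch with a stand-in linear "decoder" (all names here are illustrative, not the trained models from the experiments):

```python
import numpy as np

def latent_traversal(decode, z_base, dim, values):
    """Decode copies of z_base in which a single latent dimension
    `dim` is swept over `values`, holding all other dimensions fixed.
    If that dimension encodes a disentangled factor, only that factor
    should change across the decoded images."""
    z = np.tile(z_base, (len(values), 1))
    z[:, dim] = values
    return np.stack([decode(zi) for zi in z])

# Example with a stand-in "decoder": a fixed linear map into 4x4 "images".
rng = np.random.default_rng(0)
W = rng.standard_normal((16, 8))
decode = lambda z: (W @ z).reshape(4, 4)
frames = latent_traversal(decode, np.zeros(8), dim=3, values=np.linspace(-3, 3, 7))
# frames holds one decoded image per traversal step.
```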

SLIDE 23

Latent Space Traversal via VSC

Figure 4: Latent traversals on CelebA (left) and dSprites (right).

SLIDE 24

Latent Space Traversal Comparison

Figure 5: Latent traversals using the Convolutional VSC (left) and Convolutional VAE (right) models with the dSprites and CelebA datasets.

SLIDE 25

Next Steps

SLIDE 26

Disentanglement Metrics and Models

The quantitative evaluation of disentanglement is a recent area of research, with new metrics constantly being proposed, in addition to new models and datasets:

  • Metrics: BetaVAE score, FactorVAE score, Mutual Information Gap, SAP score, DCI, MCE, IRS
  • Models: BetaVAE, FactorVAE, BetaTCVAE, DIP-VAE, InfoGAN
  • Datasets: dSprites, Color/Noisy/Scream-dSprites, SmallNORB, Cars3D, Shapes3D
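As an example of the metrics listed above, the Mutual Information Gap (MIG) can be computed from discretized latent codes and ground-truth factors: for each factor, take the gap between the largest and second-largest mutual information with any latent dimension, normalized by that factor's entropy, then average over factors. A numpy sketch (function names and the histogram-based MI estimate are our own illustrative choices):

```python
import numpy as np

def discrete_mutual_info(a, b):
    """I(a; b) in nats from the empirical joint distribution of two
    non-negative integer (discretized) arrays."""
    joint = np.zeros((a.max() + 1, b.max() + 1))
    np.add.at(joint, (a, b), 1.0)
    p = joint / joint.sum()
    pa, pb = p.sum(1, keepdims=True), p.sum(0, keepdims=True)
    nz = p > 0
    return float(np.sum(p[nz] * np.log(p[nz] / (pa @ pb)[nz])))

def entropy(a):
    _, counts = np.unique(a, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log(p)))

def mutual_information_gap(factors, codes):
    """MIG over integer arrays of shape (n_samples, n_factors) and
    (n_samples, n_dims): per factor, the normalized gap between the
    top two I(factor; z_j) values, averaged over factors."""
    gaps = []
    for k in range(factors.shape[1]):
        mis = sorted((discrete_mutual_info(factors[:, k], codes[:, j])
                      for j in range(codes.shape[1])), reverse=True)
        gaps.append((mis[0] - mis[1]) / entropy(factors[:, k]))
    return float(np.mean(gaps))
```

A code where exactly one latent dimension copies the factor and the rest are noise scores near 1; a code where two dimensions share the factor scores near 0, which is the axis-alignment property MIG rewards.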

SLIDE 27

Next Steps

  • Perform quantitative disentanglement evaluation with previously proposed metrics.
  • Extend the comparison with recent models also proposed for disentanglement, both VAE-based and GAN-based.
  • Perform ablation studies for key features of the model, such as the sparse prior, β-VAE regularization and the encoder/decoder used.


SLIDE 30

Thank you!

Our source code and experiments are available at:
github.com/Alfo5123/Variational-Sparse-Coding

See you at the poster session!

robert.aduviri@pucp.edu.pe
alfredo.delafuente@skoltech.ru
