Learning Hierarchical Priors in VAEs Alexej Klushyn, Nutan Chen, - - PowerPoint PPT Presentation

learning hierarchical priors in vaes
SMART_READER_LITE
LIVE PREVIEW

Learning Hierarchical Priors in VAEs Alexej Klushyn, Nutan Chen, - - PowerPoint PPT Presentation

Learning Hierarchical Priors in VAEs Alexej Klushyn, Nutan Chen, Richard Kurle, Botond Cseke, Patrick van der Smagt Machine Learning Research Lab, Volkswagen Group VAEs as a Constrained Optimisation Problem In the context of VAEs, it is a


slide-1
SLIDE 1

Learning Hierarchical Priors in VAEs

Alexej Klushyn, Nutan Chen, Richard Kurle, Botond Cseke, Patrick van der Smagt

Machine Learning Research Lab, Volkswagen Group

slide-2
SLIDE 2

VAEs as a Constrained Optimisation Problem

  • In the context of VAEs, it is a desired ability to being able to control the

reconstruction quality

  • Therefore, Rezende & Viola (2018) proposed to formulate the learning problem as

min

φ

EpD(x)

  • KL
  • qφ(z|x) p(z)
  • ptimisation objective

s.t. EpD(x) Eqφ(z|x)

  • Cθ(x, z)
  • ≤ κ2
  • inequality constraint
  • Cθ(x, z) is defined as the reconstruction-error-related term in − log pθ(x|z)
slide-3
SLIDE 3

Hierarchical Priors for Learning Informative Latent Representations

  • The optimal empirical Bayes prior is the aggregated posterior distribution

p∗(z) = EpD(x)

  • qφ(z|x)
  • In order to express p∗(z), we use a hierarchical model

p(z) =

  • pΘ(z|ζ) p(ζ) dζ
  • The parameters are learned by applying an importance-weighted lower bound

Ep∗(z)

  • log p(z)
  • ≥ EpD(x) Eqφ(z|x) Eζ1:K∼qΦ(ζ|z)
  • log 1

K

K

  • k=1

pΘ(z, ζk) qΦ(ζk|z)

  • ≡ F(Θ,Φ;z)
slide-4
SLIDE 4

Lagrangian & Optimisation Problem

  • The corresponding Lagrangian is

L(θ, φ, Θ, Φ; λ) = EpD(x) Eqφ(z|x)

  • log qφ(z|x) − F(Θ, Φ; z) + λ
  • Cθ(x, z) − κ2
  • As a result, we arrive to the optimisation problem

min

Θ,Φ

  • empirical

Bayes M-step

  • min

θ

max

λ

min

φ

  • E-step

L(θ, φ, Θ, Φ; λ) s.t. λ ≥ 0

  • minθ L and maxλ minφ L can be interpreted as the corresponding steps of the
  • riginal EM algorithm for training VAEs
slide-5
SLIDE 5

CMU Human Motion Data

VHP-VAE IWAE

slide-6
SLIDE 6

3D Faces

VHP-VAE IWAE

Poster #153

Today, 05:30–07:30 PM