Particle Flow Bayes Rule Xinshi Chen 1* , Hanjun Dai 1* , Le - - PowerPoint PPT Presentation

β–Ά
particle flow bayes rule
SMART_READER_LITE
LIVE PREVIEW

Particle Flow Bayes Rule Xinshi Chen 1* , Hanjun Dai 1* , Le - - PowerPoint PPT Presentation

Particle Flow Bayes Rule Xinshi Chen 1* , Hanjun Dai 1* , Le Song 1,2 1 Georgia Tech, 2 Ant Financial (*equal contribution) ICML 2019 Sequential Bayesian Inference 1. Prior distribution () # $ %


slide-1
SLIDE 1

Particle Flow Bayes’ Rule

Xinshi Chen1*, Hanjun Dai1*, Le Song1,2

1Georgia Tech, 2Ant Financial

(*equal contribution) ICML 2019

slide-2
SLIDE 2

Sequential Bayesian Inference

𝑦 𝑝# 𝑝$ 𝑝%

π‘ž 𝑦 𝑝# π‘ž 𝑦 𝑝#:$ π‘ž 𝑦 𝑝#:% … …… …… prior 𝜌(𝑦) 𝑝# 𝑝$ 𝑝%

π‘ž 𝑦 𝑝#:% ∝ π‘ž 𝑦 𝑝#:%,# π‘ž 𝑝% 𝑦

updated posterior current posterior Likelihood Or prior

1. Prior distribution 𝜌(𝑦) 2. Likelihood function π‘ž(𝑝|𝑦) 3. Observations 𝑝#, 𝑝$, … , 𝑝% arrive sequentially Need efficient online update! Sequential Monte Carlo:

  • 𝑂 particles 𝒴2 = 𝑦2

#, … , 𝑦2 4 from prior 𝜌(𝑦)

  • Reweight the particles using likelihood
  • Particle degeneracy problem

(Doucet et al., 2001)

slide-3
SLIDE 3

Our Approach: Particle Flow

  • 𝑢 particles 𝒴2 = {𝑦2

#, … , 𝑦2 4}, from prior 𝝆(π’š)

  • Move particles through an ordinary differential equation (ODE)

⟹ solution 𝑦#

; = 𝑦(π‘ˆ)

𝑒𝑦 𝑒𝑒 = 𝑔 𝒴2, 𝑝#, 𝑦 𝑒 , 𝑒 𝑦 0 = 𝑦2

;

and

𝒴2 = {𝑦2

#, … , 𝑦2 4}

𝒴# = {𝑦#

#, … , 𝑦# 4}

𝒴$ = {𝑦$

#, … , 𝑦$ 4}

A

2 B

𝑔 𝒴2, 𝑝#, 𝑦(𝑒) 𝑒𝑒 A

2 B

𝑔 𝒴#, 𝑝$, 𝑦(𝑒) 𝑒𝑒 A

2 B

𝑔 𝒴$, 𝑝C, 𝑦(𝑒) 𝑒𝑒

𝑦 𝑝# 𝑝$ 𝑝%

π‘ž 𝑦 𝑝# π‘ž 𝑦 𝑝#:$ π‘ž 𝑦 𝑝#:% … …… prior 𝜌(𝑦) 𝑝# 𝑝$ 𝑝% Does a unified flow velocity 𝑔 exist? Does Particle Flow Bayes’ Rule (PFBR) exist?

slide-4
SLIDE 4

Our Approach: Particle Flow

  • Move particles to next posterior through an ordinary differential equation (ODE)

⟹ solution 𝑦#

; = 𝑦(π‘ˆ)

𝑒𝑦 𝑒𝑒 = 𝑔 𝒴2, 𝑝#, 𝑦 𝑒 , 𝑒 𝑦 0 = 𝑦2

;

and

𝒴2 = {𝑦2

#, … , 𝑦2 4}

𝒴# = {𝑦#

#, … , 𝑦# 4}

𝒴$ = {𝑦$

#, … , 𝑦$ 4}

A

2 B

𝑔 𝒴2, 𝑝#, 𝑦(𝑒) 𝑒𝑒 A

2 B

𝑔 𝒴#, 𝑝$, 𝑦(𝑒) 𝑒𝑒 A

2 B

𝑔 𝒴$, 𝑝C, 𝑦(𝑒) 𝑒𝑒

𝑦 𝑝# 𝑝$ 𝑝%

π‘ž 𝑦 𝑝# π‘ž 𝑦 𝑝#:$ π‘ž 𝑦 𝑝#:% … …… prior 𝜌(𝑦) 𝑝# 𝑝$ 𝑝% Does a unified flow velocity 𝑔 exist? Does Particle Flow Bayes’ Rule (PFBR) exist?

Yes!!! 𝑔: = 𝛼

E log 𝜌 𝑦 π‘ž(𝑝|𝑦) βˆ’ π‘₯βˆ— 𝜌 𝑦 , π‘ž 𝑝|𝑦 , 𝑦, 𝑒

slide-5
SLIDE 5

Existence of Particle Flow Bayes’ Rule

𝑒𝑦 𝑒 = 𝛼

E log 𝜌 𝑦 π‘ž(𝑝|𝑦) 𝑒𝑒 +

2 𝒆𝒙 𝒖

Langevin dynamics

βœ“ density 𝒓 π’š, 𝒖 converges to posterior 𝒒 π’š|𝒑 ✘ stochastic flow

Fokker-Planck Equation + Continuity Equation deterministic, closed-loop

𝑒𝑦 𝑒 = 𝛼

E log 𝜌 𝑦 π‘ž(𝑝|𝑦) βˆ’ βˆ‡E log 𝒓(π’š, 𝒖) 𝑒𝑒

βœ“ density 𝒓 π’š, 𝒖 converges to posterior 𝒒 π’š|𝒑 βœ“ deterministic flow ✘ closed-loop flow: depends on 𝒓(π’š, 𝒖)

Optimal control theory: closed-loop to open loop deterministic, open-loop

𝑒𝑦 𝑒 = 𝛼

E log 𝜌 𝑦 π‘ž(𝑝|𝑦) βˆ’ π‘₯βˆ— 𝜌 𝑦 , π‘ž 𝑝|𝑦 , 𝑦, 𝑒 𝑒𝑒

βœ“ density 𝒓 π’š, 𝒖 converges to posterior 𝒒 π’š|𝒑 βœ“ deterministic flow βœ“ open-loop flow

slide-6
SLIDE 6

Parameterization

The unified flow velocity is in form of:

π’ˆ(𝝆 π’š , 𝒒 𝒑 π’š , π’š, 𝒖): = 𝛼

E log 𝜌 𝑦 π‘ž(𝑝|𝑦) βˆ’ π‘₯βˆ— 𝜌 𝑦 , π‘ž 𝑝|𝑦 , 𝑦, 𝑒

π’†π’š 𝒆𝒖 = π’ˆ 𝓨, 𝒑, π’š 𝒖 , 𝒖 ≔ π’Š 𝟐 𝑢 Z

𝒐\𝟐 𝑢

𝝔 π’šπ’ , 𝒑, π’š 𝒖 , 𝒖

Deep set π’Š neural networks

slide-7
SLIDE 7

Experiment 1: Multimodal Posterior

Gaussian Mixture Model

  • prior 𝑦#, 𝑦$ ∼ π’ͺ 0,1
  • bservations o|𝑦#, 𝑦$ ∼ #

$ π’ͺ 𝑦#, 1 + # $ π’ͺ(𝑦# + 𝑦$, 1)

  • With 𝑦#, 𝑦$ = (1, βˆ’2), the resulting posterior 𝒒(π’š|π’‘πŸ, … , 𝒑𝒏) will have two modes:
βˆ’2 βˆ’1.5 βˆ’1 βˆ’0.5 0.5 1 1.5 2 βˆ’3 βˆ’2 βˆ’1 1 2 3 0.002 0.004 0.006 0.008 0.01 0.012 0.014 0.016 0.018 βˆ’2 βˆ’1.5 βˆ’1 βˆ’0.5 0.5 1 1.5 2 βˆ’3 βˆ’2 βˆ’1 1 2 3 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 x 10 βˆ’3 βˆ’2 βˆ’1.5 βˆ’1 βˆ’0.5 0.5 1 1.5 2 βˆ’5 βˆ’4 βˆ’3 βˆ’2 βˆ’1 1 2 3 4 5 2 4 6 8 10 12 14 x 10 βˆ’3 βˆ’2 βˆ’1.5 βˆ’1 βˆ’0.5 0.5 1 1.5 2 βˆ’3 βˆ’2 βˆ’1 1 2 3 1 2 3 4 5 6 7 8 9 10 x 10 βˆ’3

(a) True posterior (b) Stochastic Variational Inference (c) Stochastic Gradient Langevin Dynamics (d) Gibbs Sampling (e) One-pass SMC

βˆ’2 βˆ’1.5 βˆ’1 βˆ’0.5 0.5 1 1.5 2 βˆ’3 βˆ’2 βˆ’1 1 2 3 1 2 3 x 10 βˆ’4
slide-8
SLIDE 8

Experiment 1: Multimodal Posterior

PFBR vs one-pass SMC

Visualization of the evolution of posterior density from left to right.

slide-9
SLIDE 9

Experiment 2: Efficiency in #Particles

Comparison to SMC and ASMC (Autoencoding SMC, Filtering Variational Objectives, and Variational SMC) (Le et al., 2018; Maddison et al., 2017; Naesseth et al., 2018). Our Approach

slide-10
SLIDE 10

Thanks! Poster: Pacific Ballroom #218, Tue, 06:30 PM Contact: xinshi.chen@gatech.edu