New Langevin based algorithms for MCMC in high dimensions
Alain Durmus Joint work with Gareth O. Roberts, Gilles Vilmart and Konstantinos Zygalakis.
Département TSI, Telecom ParisTech Sixièmes rencontres des jeunes statisticiens
New Langevin based algorithms for MCMC in high dimensions Alain - - PowerPoint PPT Presentation
New Langevin based algorithms for MCMC in high dimensions Alain Durmus Joint work with Gareth O. Roberts, Gilles Vilmart and Konstantinos Zygalakis. Dpartement TSI, Telecom ParisTech Siximes rencontres des jeunes statisticiens Main themes
Département TSI, Telecom ParisTech Sixièmes rencontres des jeunes statisticiens
Main themes of this talk
page 2
New Langevin based algorithms for MCMC in high dimensions
Outlines
page 3
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results
Outlines
page 4
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Introduction
Outlines
page 5
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Introduction
Motivation
n
page 6
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Introduction
Markov chain theory
A q(x, y)dy is a Markov kernel on Rd with density q.
page 7
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Introduction
Markov chain theory
page 8
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Introduction
Markov chain theory
n
π-a.s.
page 9
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Introduction
MCMC : rationale
page 10
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
Outlines
page 11
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
The Metropolis-Hastings algorithm (I)
page 12
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
The RWM and MALA
db(Xk)/2 + σdZk+1
db(x))/σd)
db(Xk)/2 + σdZk) .
page 13
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
Scaling problems and diffusion limits
page 14
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
Efficiency of HM algorithms
n
a.s.
n→+∞
n
n→+∞ N(0, σ2(F, P)) ,
n→+∞ n Varπ
n
page 15
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
Efficiency of MH algorithms
page 16
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ The Metropolis-Hastings algorithm
Expected Square Jump Distance
page 17
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Speed of Langevin diffusions
Outlines
page 18
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Speed of Langevin diffusions
Langevin diffusion
a.s.
t→+∞
t→+∞ N(0, σ2(F, Y)) ,
t→+∞ t Varπ
page 19
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Speed of Langevin diffusions
scaled Langevin equation
t = (cb(Yt)/2)dt +
ct)t≥0 :
ct = Y 1 0 +
s )/2)ds + Bct s=cu
0 +
cu)/2)ds +
page 20
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Speed of Langevin diffusions
Efficiency of Langevin solutions
t )t≥0) ?
t )t≥0) = lim t→+∞ t Varπ
cs)ds
t→+∞ ct Varπ
s )ds
t )t≥0) = c−1σ2(F, (Y 1 t )t≥0) ,
page 21
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Review of the scaling results for the RWM and MALA
Outlines
page 22
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Review of the scaling results for the RWM and MALA
Scaling of the RWM
i=1 π(xi) and {X d,RWM k
t
⌊td⌋,1 .
t
page 23
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Review of the scaling results for the RWM and MALA
Scaling of the MALA
db(Xk)/2 + σdZk+1
db(x))/σd)
db(Xk)/2 + σdZk) .
i=1 π(xi) and {X d,E k
t
t
page 24
New Langevin based algorithms for MCMC in high dimensions
Brief review of scaling results ◮ Review of the scaling results for the RWM and MALA
Consequences on the tuning of the two algorithms
page 25
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5
Outlines
page 26
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
Outlines
page 27
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
Framework
d
d
0 ∼ πd and the proposal is some Gaussian kernel :
k+1,i
k,i, σd) + S(X d k,i, σd)Z d k+1,i
k+1
k + ✶{U≤α(Xd
k ,Y d k+1)}(Y d
k+1 − X d k ) .
page 28
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
Form of the proposal
k+1,i = ψ(X d k,i, γ, Z d k+1,i) depends on :
k .
k+1 ∼ N(0, Idd).
page 29
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
choice of γ
d→∞ E[α(X d 0 , Y d 1 )] > 0} .
0 ∼ πd.
page 30
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
Formal derivation
d
k
d
page 31
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
Formal derivation
k
d
d
d
0,i, Z d 0,i) ∗
d→+∞ N(0, σ∗) ,
page 32
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Formal derivation
A proposal with a new scaling
d
d
d
page 33
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Main results
Outlines
page 34
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Main results
Assumptions
page 35
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Main results
Limiting acceptance probability
k
d
d
d→+∞
d
d
−∞ e−t2/2dt, for
page 36
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Main results
Scaling of mMALA
k
t
t
page 37
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Maximization of the speed of the diffusion
Outlines
page 38
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Maximization of the speed of the diffusion
Maximizing h(ℓ)
−1(afMALA(ℓ)/2)
−1(afMALA(ℓ)/2)
page 39
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Maximization of the speed of the diffusion
Back to ESJD
page 40
New Langevin based algorithms for MCMC in high dimensions
A new MH algorithm with a scaling in d1/5 ◮ Maximization of the speed of the diffusion
The End
page 41
New Langevin based algorithms for MCMC in high dimensions