Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
Gersende FORT
LTCI CNRS - TELECOM ParisTech
Stability of Markov Chains based on fluid limit techniques. - - PowerPoint PPT Presentation
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC Stability of Markov Chains based on fluid limit techniques. Applications to MCMC Gersende FORT LTCI CNRS - TELECOM ParisTech In collaboration with Sean MEYN
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
LTCI CNRS - TELECOM ParisTech
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
◮ a transformation of the Markov Chain −
◮ such that the stability of this process, is related to the ergodicity of
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
◮ a transformation of the Markov Chain −
◮ such that the stability of this process, is related to the ergodicity of
◮ are iterative algorithms that draw path of a Markov chain with given
◮ the performances of which are related (among other factors) to some
◮ ⇒ find the role of the parameters in the definition of the fluid limit
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-a General presentation
◮ to explore the target density π. ◮ to approximate quantities of the form Eπ[g(Φ)] as soon as a LLN
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Choose a selection probability : ω = {ωi, i ∈ {1, · · · , d}} ◮ Choose a family of transition kernels on R, qi(x, y)
ex. qi(x, y) = N (x, σ2 i )[y]
◮ Repeat :
π(Φn) qI(Y,Φn,I) qI(Φn,I,Y )
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π)
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose , Accepted
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose , Accepted , Propose
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose , Accepted , Propose , Rejected
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose , Accepted , Propose , Rejected , Propose
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose , Accepted , Propose , Rejected , Propose , Accepted
−3 −2 −1 1 2 3 −3 −2 −1 1 2 3
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-b Metropolis-within-Gibbs samplers
◮ Explore on R2 a Gaussian distribution π with diagonal dispersion
◮ and in each direction, the move is Gaussian.
Initial value (and level curves of π), Propose , Accepted , Propose , Rejected , Propose , Accepted , After 10000 iterations.
−10 −5 5 10 −5 −4 −3 −2 −1 1 2 3 4 5
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-d Design parameters for the MwG
−15 −10 −5 5 10 15 −3 −2 −1 1 2 3 −15 −10 −5 5 10 15 −3 −2 −1 1 2 3
(left) ω1 = ω2, σ1 = σ2 (right) ω1 = ω2, σ1 >> σ2.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
I-d Design parameters for the MwG
◮ Optimal value of the design parameters. ◮ Adaptive methods : modify “on line” these parameters based on the
◮ characterization of the role of these parameters on the dynamic of
◮ guidelines to fix / adapt the value of these parameters.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))
One initial value
−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))
One initial value, different initial values (r = 100)
−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))
One initial value, different initial values (r = 100), different scaling factors r (r = 1000)
−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))
One initial value, different initial values (r = 100), different scaling factors r (r = 1000) (r = 5000)
−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-a Definition
1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))
One initial value, different initial values (r = 100), different scaling factors r (r = 1000) (r = 5000) (Fluid Limit)
−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-b Existence
martingale-increment
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-b Existence
martingale-increment
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-c Stability
[0,T ] |η(t)| ≤ ρ
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-c Stability
(Fort et al, 2007)
{f,|f|≤1+|x|p−q}
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-c Stability
(Fort et al, 2007)
{f,|f|≤1+|x|p−q}
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ ∆(r x).
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ ∆(r x).
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ sup x∈H
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ sup x∈H
◮ In the easiest cases ( ? = X), fluid limits are Dirac mass at a
◮ Otherwise, more technical results, no general conditions.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ sup x∈H
·
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ sup x∈H
·
π(x1, x2) ∝ (1 + x2 1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))
−10 −5 5 10 −10 −8 −6 −4 −2 2 4 6 8 10 −4 −2 2 4 6 8 10 12 14 −10 −8 −6 −4 −2 2 4 6 8 10
−0.2 0.2 0.4 0.6 0.8 1 1.2 −0.2 0.2 0.4 0.6 0.8 1 1.2
Level curves of π and fields ∆, h and draws of the fluid limit
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ sup x∈H
·
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
r→+∞ sup x∈H
·
−15 −10 −5 5 10 15 −15 −10 −5 5 10 15 Level curves of the target density 2 2.5 3 3.5 4 4.5 5 5.5 6 2 2.5 3 3.5 4 4.5 5 5.5 6 −0.2 0.2 0.4 0.6 0.8 1 1.2 −0.2 0.2 0.4 0.6 0.8 1 1.2
Level curves of π and fields ∆, h and realizations of the fluid limits
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
α=1 Oα ∪ b β=1{x, f ′ βx = 0}.
r→+∞ sup x∈H
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-d Characterization of the fluid limits
α=1 Oα ∪ b β=1{x, f ′ βx = 0}.
r→+∞ sup x∈H
−5 −4 −3 −2 −1 1 2 3 4 5 −5 −4 −3 −2 −1 1 2 3 4 5 −1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Level curves of π and fluid limits when ω1 = 0.25 and fluid limits when ω1 = 0.5
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-e Conclusion
◮ By renormalization of the chain,
◮ the fluid model characterizes the behavior of the chain started “far in
◮ the deterministic ’hidden’ behavior is obtained by removing the
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-e Conclusion
◮ By renormalization of the chain,
◮ the fluid model characterizes the behavior of the chain started “far in
◮ the deterministic ’hidden’ behavior is obtained by removing the
◮ Ergodicity of the initial chain is related to the stability of the fluid
◮ Fluid model characterized (almost everywhere) by an ODE.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-e Conclusion
◮ By renormalization of the chain,
◮ the fluid model characterizes the behavior of the chain started “far in
◮ the deterministic ’hidden’ behavior is obtained by removing the
◮ Ergodicity of the initial chain is related to the stability of the fluid
◮ Fluid model characterized (almost everywhere) by an ODE. ◮ The fluid limit gives informations on the dynamic of the chain in the
◮ But- in some cases - with quite cumbersome and fastidious
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-f Other results
◮ When supx∈X |x|β|∆(x)| < +∞ for some 0 ≤ β < 1.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-f Other results
◮ When supx∈X |x|β|∆(x)| < +∞ for some 0 ≤ β < 1.
◮ the chain has a slower dynamic. ◮ trivial fluid limit : Qx = δµ with µ(t) = x. ◮ modify the definition of the normalized process
◮ weaker ergodicity.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
II-f Other results
◮ When supx∈X |x|β|∆(x)| < +∞ for some 0 ≤ β < 1.
◮ the chain has a slower dynamic. ◮ trivial fluid limit : Qx = δµ with µ(t) = x. ◮ modify the definition of the normalized process
◮ weaker ergodicity.
◮ State space : not necessarily X = Rd.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
i when
i = c.
i and fix the probability
i and ωi, i ≤ d.
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-a Expression of ∆
i )
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-a Expression of ∆
i )
r→+∞ ∆(r x)
◮ the limit of the rejection area {y ∈ R, π(r x + yei) < π(r x)} when
◮ the behavior of the gradient ∇ ln π(r x)
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-b Expression of the field h
∇ ln π(rx) |∇ ln π(rx)| = ℓ(x)
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-b Expression of the field h
∇ ln π(rx) |∇ ln π(rx)| = ℓ(x)
r→+∞
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-b Expression of the field h
r→+∞
◮ h (and thus, the fluid limit) depends upon π through the
◮ The fluid limit depends upon the design parameters through the
◮ The field h is constant (and thus, the ODE is linear) on the sets
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-c Characterization of the fluid limit
|Γ−1x|.
−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 }
γα = [−1 1] γα = [1 −1] γα = [1 1] γα = [−1 −1]
−1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-c Characterization of the fluid limit
|Γ−1x|.
−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 }
γα = [−1 1] γα = [1 −1] γα = [1 1] γα = [−1 −1]
−1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
◮ linear till the first time it enters one of the sets {x, ℓi(x) = 0}, i ≤ d
◮ then, the behavior depends on the field h in a neighborhood of these
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-c Characterization of the fluid limit
−1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 } −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Controlled Markov chain
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Controlled Markov chain
·
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Controlled Markov chain
·
r
r
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Controlled Markov chain
−1.5 −1 −0.5 0.5 1 1.5 −1.5 −1 −0.5 0.5 1 1.5 M O N −1.5 −1 −0.5 0.5 1 1.5 −1.5 −1 −0.5 0.5 1 1.5
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Comparison of the strategies
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Comparison of the strategies
Non-Adaptive, ω1 = 0.25 Non-Adaptive, ω1 = 0.5
1 2 3 4 5 6 0.5 1 1.5 2 2.5 3 3.5 4 4.5 Polar coordinate (radius 1) Time 1 2 3 4 5 6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 Polar coordinate (radius 1) Time
π ∼ N2(0, Γ2) Γ2 non-diagonal π ∼ N2(0, Γ1) + N2(0, Γ2)
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Comparison of the strategies
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Comparison of the strategies
100 200 300 400 500 600 700 50 100 150 200 250 300 350 400 Strategy 1 Strategy 2
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
III-d Comparison of the strategies
100 200 300 400 500 600 700 800 50 100 150 200 250 300 350 400 450 500 RWMG algorithm Adaptive algorithm
Stability of Markov Chains based on fluid limit techniques. Applications to MCMC
◮ Hist. : fluid limits are common tools in queuing theory
◮ Fluid limits or drift conditions to prove the ergodicity of the chain ? ◮ provide an analysis of the chain in its transient phase (before