Stability of Markov Chains based on fluid limit techniques. - - PowerPoint PPT Presentation

stability of markov chains based on fluid limit
SMART_READER_LITE
LIVE PREVIEW

Stability of Markov Chains based on fluid limit techniques. - - PowerPoint PPT Presentation

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC Stability of Markov Chains based on fluid limit techniques. Applications to MCMC Gersende FORT LTCI CNRS - TELECOM ParisTech In collaboration with Sean MEYN


slide-1
SLIDE 1

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

Gersende FORT

LTCI CNRS - TELECOM ParisTech

In collaboration with Sean MEYN (Univ. Illinois), Eric MOULINES (TELECOM ParisTech) and Pierre PRIOURET (Univ. Paris 6).

slide-2
SLIDE 2

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

We introduce

◮ a transformation of the Markov Chain −

→ family of time-continuous processes − → a limiting time-continuous process

◮ such that the stability of this process, is related to the ergodicity of

the Markov chain. ⇒ characterization of the ergodicity ; ⇒ identification of the factors that play a role in the dynamic of the Markov chain.

slide-3
SLIDE 3

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

We introduce

◮ a transformation of the Markov Chain −

→ family of time-continuous processes − → a limiting time-continuous process

◮ such that the stability of this process, is related to the ergodicity of

the Markov chain. ⇒ characterization of the ergodicity ; ⇒ identification of the factors that play a role in the dynamic of the Markov chain. The Markov Chain Monte Carlo (MCMC) algorithms

◮ are iterative algorithms that draw path of a Markov chain with given

stationary distribution ;

◮ the performances of which are related (among other factors) to some

parameters of implementation (design parameters).

◮ ⇒ find the role of the parameters in the definition of the fluid limit

and propose an “optimal choice” of these parameters.

slide-4
SLIDE 4

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

֒ → Outline of the talk

  • I. A MCMC sampler : the Metropolis-within-Gibbs (MwG), and its

design parameters.

  • II. Fluid limits.
  • III. Applications : guidelines on the choice of the design parameters for

the MwG.

slide-5
SLIDE 5

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-a General presentation

MCMC samplers :

Given a probability π, sample a Markov chain {Φn, n ≥ 0} with unique stationary distribution π. ֒ → Allow

◮ to explore the target density π. ◮ to approximate quantities of the form Eπ[g(Φ)] as soon as a LLN

exists (and other limit theorems). ֒ → Algorithms : Hastings-Metropolis, Gibbs, Metropolis-within-Gibbs, · · ·

slide-6
SLIDE 6

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Metropolis-within-Gibbs samplers in Rd

◮ Choose a selection probability : ω = {ωi, i ∈ {1, · · · , d}} ◮ Choose a family of transition kernels on R, qi(x, y)

ex. qi(x, y) = N (x, σ2 i )[y]

◮ Repeat :

  • select a direction I with prob. P(I = k) = ωk.
  • draw a candidate Y ∼ qI(Φn,I, ·).
  • accept or reject the candidate : all the components are

unchanged except the I-th Φn+1,I =

  • Y

with proba α(Φn, Y ) = 1 ∧ π(Y,Φn,−I)

π(Φn) qI(Y,Φn,I) qI(Φn,I,Y )

Φn,I

  • therwise.
slide-7
SLIDE 7

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π)

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-8
SLIDE 8

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-9
SLIDE 9

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose , Accepted

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-10
SLIDE 10

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose , Accepted , Propose

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-11
SLIDE 11

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose , Accepted , Propose , Rejected

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-12
SLIDE 12

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose , Accepted , Propose , Rejected , Propose

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-13
SLIDE 13

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose , Accepted , Propose , Rejected , Propose , Accepted

−3 −2 −1 1 2 3 −3 −2 −1 1 2 3

slide-14
SLIDE 14

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-b Metropolis-within-Gibbs samplers

Example : Metropolis-within-Gibbs (MwG)

◮ Explore on R2 a Gaussian distribution π with diagonal dispersion

matrix

◮ and in each direction, the move is Gaussian.

Initial value (and level curves of π), Propose , Accepted , Propose , Rejected , Propose , Accepted , After 10000 iterations.

−10 −5 5 10 −5 −4 −3 −2 −1 1 2 3 4 5

slide-15
SLIDE 15

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-d Design parameters for the MwG

Design parameters for the MwG

· Selection {ωi, i ≤ d}, · Gaussian proposal distributions in each direction, with std σi. ֒ → Efficiency of the algorithm π ∼ N2(0, ∆) with diagonal dispersion

matrix ∆ such that ∆1,1 >> ∆2,2,

−15 −10 −5 5 10 15 −3 −2 −1 1 2 3 −15 −10 −5 5 10 15 −3 −2 −1 1 2 3

(left) ω1 = ω2, σ1 = σ2 (right) ω1 = ω2, σ1 >> σ2.

slide-16
SLIDE 16

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • I. MCMC samplers

I-d Design parameters for the MwG

֒ → Questions

◮ Optimal value of the design parameters. ◮ Adaptive methods : modify “on line” these parameters based on the

past behavior of the algorithm. ֒ → Hereafter,

◮ characterization of the role of these parameters on the dynamic of

the chain.

◮ guidelines to fix / adapt the value of these parameters.

slide-17
SLIDE 17

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits
  • II. Fluid Limits
slide-18
SLIDE 18

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Normalized processes

Let {Φk, k ≥ 0} be a Markov chain on X (X = Rd). A set of transformations : normalized process ηr, for r > 0 (i) in the initial value : ηr(0; x) = 1 r Φ0 = x ∈ Rd, Φ0 = rx (ii) in time and space : ηr(t; x) = 1 r Φ⌊tr⌋.

slide-19
SLIDE 19

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Normalized processes

Let {Φk, k ≥ 0} be a Markov chain on X (X = Rd). A set of transformations : normalized process ηr, for r > 0 (i) in the initial value : ηr(0; x) = 1 r Φ0 = x ∈ Rd, Φ0 = rx (ii) in time and space : ηr(t; x) = 1 r Φ⌊tr⌋. Hence ηr(·; x) = 1 r Φk

  • n the time interval

k r ; (k + 1) r

  • .

By definition, cad-lag paths.

slide-20
SLIDE 20

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Definition

֒ → Distributions · Px : law of the canonical chain {Φk, k ≥ 0} with initial value δx. · Qr;x : distribution image of Prx by ηr(·; x), distribution on D(R+, X) of cadlag functions R+ → X

slide-21
SLIDE 21

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Definition

֒ → Distributions · Px : law of the canonical chain {Φk, k ≥ 0} with initial value δx. · Qr;x : distribution image of Prx by ηr(·; x), distribution on D(R+, X) of cadlag functions R+ → X ֒ → Definition Fluid Limit. Q distribution on D(R+, X) is a fluid limit is there exists a family of scaling factors rn → +∞ such that Qrn;x = ⇒ Q. Denoted by Qx hereafter.

slide-22
SLIDE 22

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Definition

֒ → Distributions · Px : law of the canonical chain {Φk, k ≥ 0} with initial value δx. · Qr;x : distribution image of Prx by ηr(·; x), distribution on D(R+, X) of cadlag functions R+ → X ֒ → Definition Fluid Limit. Q distribution on D(R+, X) is a fluid limit is there exists a family of scaling factors rn → +∞ such that Qrn;x = ⇒ Q. Denoted by Qx hereafter. ֒ → Rmk : fluid limit ↔ limr Qr,x and Qr,x image of Prx ↔ behavior of the chains when started in the tails of π.

slide-23
SLIDE 23

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Example

{Φn, n ≥ 0} Hastings-Metropolis chain with target distribution on R2 given by π(x1, x2) ∝ (1 + x2

1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))

and Gaussian proposal distribution 4 N2(x, I).

Figures : Different draws of the normalized process ηr(·, x) on [0, T] ; for different initial values x ; and different scaling factors r.

One initial value

−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

slide-24
SLIDE 24

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Example

{Φn, n ≥ 0} Hastings-Metropolis chain with target distribution on R2 given by π(x1, x2) ∝ (1 + x2

1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))

and Gaussian proposal distribution 4 N2(x, I).

Figures : Different draws of the normalized process ηr(·, x) on [0, T] ; for different initial values x ; and different scaling factors r.

One initial value, different initial values (r = 100)

−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

slide-25
SLIDE 25

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Example

{Φn, n ≥ 0} Hastings-Metropolis chain with target distribution on R2 given by π(x1, x2) ∝ (1 + x2

1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))

and Gaussian proposal distribution 4 N2(x, I).

Figures : Different draws of the normalized process ηr(·, x) on [0, T] ; for different initial values x ; and different scaling factors r.

One initial value, different initial values (r = 100), different scaling factors r (r = 1000)

−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

slide-26
SLIDE 26

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Example

{Φn, n ≥ 0} Hastings-Metropolis chain with target distribution on R2 given by π(x1, x2) ∝ (1 + x2

1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))

and Gaussian proposal distribution 4 N2(x, I).

Figures : Different draws of the normalized process ηr(·, x) on [0, T] ; for different initial values x ; and different scaling factors r.

One initial value, different initial values (r = 100), different scaling factors r (r = 1000) (r = 5000)

−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

slide-27
SLIDE 27

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-a Definition

Example

{Φn, n ≥ 0} Hastings-Metropolis chain with target distribution on R2 given by π(x1, x2) ∝ (1 + x2

1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))

and Gaussian proposal distribution 4 N2(x, I).

Figures : Different draws of the normalized process ηr(·, x) on [0, T] ; for different initial values x ; and different scaling factors r.

One initial value, different initial values (r = 100), different scaling factors r (r = 1000) (r = 5000) (Fluid Limit)

−0.2 0.2 0.4 0.6 0.8 1 1.2 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

slide-28
SLIDE 28

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-b Existence

Suff Cond for existence

Φk+1 = Φk + E [Φk+1|Fk] − Φk + Φk+1 − E [Φk+1|Fk] = Φk + Ex [Φk+1 − Φk|Fk]

  • ∆(Φk)

+ (Φk+1 − Ex [Φk+1|Fk])

  • ǫk+1

martingale-increment

.

slide-29
SLIDE 29

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-b Existence

Suff Cond for existence

Φk+1 = Φk + E [Φk+1|Fk] − Φk + Φk+1 − E [Φk+1|Fk] = Φk + Ex [Φk+1 − Φk|Fk]

  • ∆(Φk)

+ (Φk+1 − Ex [Φk+1|Fk])

  • ǫk+1

martingale-increment

. ◮ Theorem (Fort et al, 2007) If · ∃p > 1, limK→+∞ supx∈X Ex

  • |ǫ1|p1

I|ǫ1|>K

  • → 0.

· 0 < supx∈X |∆(x)| < ∞. Then ∀x · ∀rn → +∞, ∃ sub-sequence {rnj, j ≥ 1} such that Qrnj ;x ⇒ Qx · Qx prob. on the space of the continuous functions from R+ to X.

slide-30
SLIDE 30

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-c Stability

Stability of the fluid limits

֒ → Definition Stable Fluid model : there exist T > 0 and ρ < 1 such that for any x on the unit sphere, Qx

  • η ∈ D(R+, X), inf

[0,T ] |η(t)| ≤ ρ

  • = 1.
slide-31
SLIDE 31

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-c Stability

Theorem (⋆ ⋆ ⋆ ⋆)

(Fort et al, 2007)

If

  • {Φk, k ≥ 0} is phi-irreducible, aperiodic ; and compact

sets are petite.

  • the fluid model exists and is stable.

Then the Markov chain is (f, r)-ergodic (n+1)q−1 sup

{f,|f|≤1+|x|p−q}

|Ex[f(Φn)] − π(f)| − →n→+∞ 0, 1 ≤ q ≤ p.

slide-32
SLIDE 32

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-c Stability

Theorem (⋆ ⋆ ⋆ ⋆)

(Fort et al, 2007)

If

  • {Φk, k ≥ 0} is phi-irreducible, aperiodic ; and compact

sets are petite.

  • the fluid model exists and is stable.

Then the Markov chain is (f, r)-ergodic (n+1)q−1 sup

{f,|f|≤1+|x|p−q}

|Ex[f(Φn)] − π(f)| − →n→+∞ 0, 1 ≤ q ≤ p. p : control of the martingale increment in the decomposition Φn+1 − Φn = ∆(Φn) + martingale-increment. The hitting-time T of the ball of radius ρ by the fluid model plays a role in the control of convergence of P n(x, ·) to π. (control of the returns to the “center”)

slide-33
SLIDE 33

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Fluid Limit = Skeleton of the chain

Φk+1 = Φk + (Ex [Φk+1|Fk] − Φk)

  • ∆(Φk)

+ (Φk+1 − Ex [Φk+1|Fk])

  • ǫk+1martingale-increment

◮ For the normalized process (piecewise constant, jumps at time k/r) : ηr k + 1 r , x

  • = 1

r Φk+1 = ηr k r , x

  • + 1

r ∆

  • r ηr

k r , x

  • + 1

r ǫk+1 = ηr k r , x

  • + 1

r h

  • ηr

k r , x

  • + 1

r (ξk + ǫk+1) where we set h (x) = lim

r→+∞ ∆(r x).

slide-34
SLIDE 34

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Fluid Limit = Skeleton of the chain

Φk+1 = Φk + (Ex [Φk+1|Fk] − Φk)

  • ∆(Φk)

+ (Φk+1 − Ex [Φk+1|Fk])

  • ǫk+1martingale-increment

◮ For the normalized process (piecewise constant, jumps at time k/r) : ηr k + 1 r , x

  • = 1

r Φk+1 = ηr k r , x

  • + 1

r ∆

  • r ηr

k r , x

  • + 1

r ǫk+1 = ηr k r , x

  • + 1

r h

  • ηr

k r , x

  • + 1

r (ξk + ǫk+1) where we set h (x) = lim

r→+∞ ∆(r x).

◮ Hence, noisy ’observation’ of µ k + 1 r

  • = µ

k r

  • + 1

r h

  • µ

k r

→ ODE : ˙ µ(t) = h(µ(t))

slide-35
SLIDE 35

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

To be more precise, fluid limit are characterized by lim

r→+∞ sup x∈H

|∆(rx) − h(x)| = 0, for any compact H ⊂ ?

slide-36
SLIDE 36

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

To be more precise, fluid limit are characterized by lim

r→+∞ sup x∈H

|∆(rx) − h(x)| = 0, for any compact H ⊂ ?

◮ In the easiest cases ( ? = X), fluid limits are Dirac mass at a

function µ that solves the ODE ˙ µ = h(µ). Stability of the fluid model ← → Stability of the ODE.

◮ Otherwise, more technical results, no general conditions.

slide-37
SLIDE 37

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Characterization : case 1

◮ If · ∃ h continuous such that H X \ {0}, lim

r→+∞ sup x∈H

|∆(rx) − h(x)| = 0. · the ODE

·

µ= h(µ) is stable for any initial value x. Then the fluid model is stable.

slide-38
SLIDE 38

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Characterization : case 1

◮ If · ∃ h continuous such that H X \ {0}, lim

r→+∞ sup x∈H

|∆(rx) − h(x)| = 0. · the ODE

·

µ= h(µ) is stable for any initial value x. Then the fluid model is stable. ◮ Example : Hastings-Metropolis

π(x1, x2) ∝ (1 + x2 1 + x2 2 + x8 1x2 2) exp(−(x2 1 + x2 2))

−10 −5 5 10 −10 −8 −6 −4 −2 2 4 6 8 10 −4 −2 2 4 6 8 10 12 14 −10 −8 −6 −4 −2 2 4 6 8 10

−0.2 0.2 0.4 0.6 0.8 1 1.2 −0.2 0.2 0.4 0.6 0.8 1 1.2

Level curves of π and fields ∆, h and draws of the fluid limit

slide-39
SLIDE 39

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Characterization : case 2

◮ If · ∃ h continuous such that for any compact H in a cone of X \ {0}, lim

r→+∞ sup x∈H

|∆(rx) − h(x)| = 0. · the ODE

·

µ= h(µ) started from a point in the cone are stable · the cone is “ attractive”. Then the fluid model is stable.

slide-40
SLIDE 40

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Characterization : case 2

◮ If · ∃ h continuous such that for any compact H in a cone of X \ {0}, lim

r→+∞ sup x∈H

|∆(rx) − h(x)| = 0. · the ODE

·

µ= h(µ) started from a point in the cone are stable · the cone is “ attractive”. Then the fluid model is stable. ◮ Example : Hastings-Metropolis. π mixture of Gaussian distributions

−15 −10 −5 5 10 15 −15 −10 −5 5 10 15 Level curves of the target density 2 2.5 3 3.5 4 4.5 5 5.5 6 2 2.5 3 3.5 4 4.5 5 5.5 6 −0.2 0.2 0.4 0.6 0.8 1 1.2 −0.2 0.2 0.4 0.6 0.8 1 1.2

Level curves of π and fields ∆, h and realizations of the fluid limits

slide-41
SLIDE 41

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Characterization : case 3 (X = R2)

◮ If · X = a

α=1 Oα ∪ b β=1{x, f ′ βx = 0}.

· ∃ Σα such that for any compact H of Oα, lim

r→+∞ sup x∈H

|∆(rx) − Σα| = 0. · hyperplanes are “attractive” and “stable”. Then the fluid model is stable.

slide-42
SLIDE 42

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-d Characterization of the fluid limits

Characterization : case 3 (X = R2)

◮ If · X = a

α=1 Oα ∪ b β=1{x, f ′ βx = 0}.

· ∃ Σα such that for any compact H of Oα, lim

r→+∞ sup x∈H

|∆(rx) − Σα| = 0. · hyperplanes are “attractive” and “stable”. Then the fluid model is stable. ◮ Example : Metropolis within Gibbs

−5 −4 −3 −2 −1 1 2 3 4 5 −5 −4 −3 −2 −1 1 2 3 4 5 −1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

Level curves of π and fluid limits when ω1 = 0.25 and fluid limits when ω1 = 0.5

slide-43
SLIDE 43

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-e Conclusion

Conclusion (II)

◮ By renormalization of the chain,

◮ the fluid model characterizes the behavior of the chain started “far in

the tails” Φ0 = rx and r → +∞.

◮ the deterministic ’hidden’ behavior is obtained by removing the

stochastic perturbations.

slide-44
SLIDE 44

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-e Conclusion

Conclusion (II)

◮ By renormalization of the chain,

◮ the fluid model characterizes the behavior of the chain started “far in

the tails” Φ0 = rx and r → +∞.

◮ the deterministic ’hidden’ behavior is obtained by removing the

stochastic perturbations.

◮ Ergodicity of the initial chain is related to the stability of the fluid

model.

◮ Fluid model characterized (almost everywhere) by an ODE.

slide-45
SLIDE 45

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-e Conclusion

Conclusion (II)

◮ By renormalization of the chain,

◮ the fluid model characterizes the behavior of the chain started “far in

the tails” Φ0 = rx and r → +∞.

◮ the deterministic ’hidden’ behavior is obtained by removing the

stochastic perturbations.

◮ Ergodicity of the initial chain is related to the stability of the fluid

model.

◮ Fluid model characterized (almost everywhere) by an ODE. ◮ The fluid limit gives informations on the dynamic of the chain in the

transient phase (i.e. before the stationary phase).

◮ But- in some cases - with quite cumbersome and fastidious

computations in order to obtain an explicit characterization by an ODE.

slide-46
SLIDE 46

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-f Other results

Other results not discussed here

◮ When supx∈X |x|β|∆(x)| < +∞ for some 0 ≤ β < 1.

slide-47
SLIDE 47

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-f Other results

Other results not discussed here

◮ When supx∈X |x|β|∆(x)| < +∞ for some 0 ≤ β < 1.

◮ the chain has a slower dynamic. ◮ trivial fluid limit : Qx = δµ with µ(t) = x. ◮ modify the definition of the normalized process

ηr(t, x) = 1 r Φ⌈tr1+β⌉ Φ0 = rx.

◮ weaker ergodicity.

slide-48
SLIDE 48

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • II. Fluid limits

II-f Other results

Other results not discussed here

◮ When supx∈X |x|β|∆(x)| < +∞ for some 0 ≤ β < 1.

◮ the chain has a slower dynamic. ◮ trivial fluid limit : Qx = δµ with µ(t) = x. ◮ modify the definition of the normalized process

ηr(t, x) = 1 r Φ⌈tr1+β⌉ Φ0 = rx.

◮ weaker ergodicity.

◮ State space : not necessarily X = Rd.

slide-49
SLIDE 49

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler
  • III. Metropolis-within-Gibbs

֒ → Design parameters (a) the selection probability ω = {ωi, i ≤ d}. (b) the size of the moves in each direction (e.g. the variances σ2

i when

the proposal is Gaussian in each direction i). ֒ → Which approach ? (a) try to optimize the choice of ω and fix the variances σ2

i = c.

(b) try to optimize the choice of the variances σ2

i and fix the probability

ωi = 1/d. (c) try to optimize both σ2

i and ωi, i ≤ d.

slide-50
SLIDE 50

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-a Expression of ∆

Expression of ∆(x) = Ex[Φ1 − Φ0]

For any i ∈ {1, · · · , d}, qi = N(0, σ2

i )

∆i(x) = ωi

  • {y∈R,π(x+yei)<π(x)}

y π(x + yei) π(x) − 1

  • qi(y) dy.

where {ei, i ≤ d} is the canonical basis.

slide-51
SLIDE 51

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-a Expression of ∆

Expression of ∆(x) = Ex[Φ1 − Φ0]

For any i ∈ {1, · · · , d}, qi = N(0, σ2

i )

∆i(x) = ωi

  • {y∈R,π(x+yei)<π(x)}

y π(x + yei) π(x) − 1

  • qi(y) dy.

where {ei, i ≤ d} is the canonical basis. In order to characterize fluid limit, the radial limit h(x) = lim

r→+∞ ∆(r x)

is required. To that goal, assumptions on

◮ the limit of the rejection area {y ∈ R, π(r x + yei) < π(r x)} when

r → +∞,

◮ the behavior of the gradient ∇ ln π(r x)

are needed.

slide-52
SLIDE 52

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-b Expression of the field h

֒ → For any target density π such that · limr→+∞ |∇ ln π(rx)| = +∞. · ℓ given by limr→+∞

∇ ln π(rx) |∇ ln π(rx)| = ℓ(x)

is continuous(−).

slide-53
SLIDE 53

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-b Expression of the field h

֒ → For any target density π such that · limr→+∞ |∇ ln π(rx)| = +∞. · ℓ given by limr→+∞

∇ ln π(rx) |∇ ln π(rx)| = ℓ(x)

is continuous(−). ֒ → the field h is given by hi(x) = sign(ℓi(x)) ωi σi √ 2π ℓ(x) := lim

r→+∞

∇ ln π(rx) |∇ ln π(rx)|.

slide-54
SLIDE 54

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-b Expression of the field h

hi(x) = sign(ℓi(x)) ωi σi √ 2π ℓ(x) := lim

r→+∞

∇ ln π(rx) |∇ ln π(rx)|. ֒ → This implies that

◮ h (and thus, the fluid limit) depends upon π through the

“normalized limiting gradient”.

◮ The fluid limit depends upon the design parameters through the

products {ωiσi, i ≤ d}.

◮ The field h is constant (and thus, the ODE is linear) on the sets

Oα = {x, sign(ℓ(x)) = γα} where γα ∈ {−1, 1}d.

slide-55
SLIDE 55

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-c Characterization of the fluid limit

Piecewise linear fluid limits

֒ → Example : MwG, π ∼ N2(0, Γ) = ⇒ ℓ(x) = − Γ−1x

|Γ−1x|.

−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 }

γα = [−1 1] γα = [1 −1] γα = [1 1] γα = [−1 −1]

−1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

slide-56
SLIDE 56

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-c Characterization of the fluid limit

Piecewise linear fluid limits

֒ → Example : MwG, π ∼ N2(0, Γ) = ⇒ ℓ(x) = − Γ−1x

|Γ−1x|.

−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 }

γα = [−1 1] γα = [1 −1] γα = [1 1] γα = [−1 −1]

−1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

֒ → The fluid limit is

◮ linear till the first time it enters one of the sets {x, ℓi(x) = 0}, i ≤ d

  • which in the above example - are the hyperplanes in green.

◮ then, the behavior depends on the field h in a neighborhood of these

sets.

slide-57
SLIDE 57

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-c Characterization of the fluid limit

In any cases, · there exists at least one “absorbing” set. · this set is “stable” i.e. the fluid limits - when trapped in these sets - move towards the origin. Two situations, obtained with different values of the design parameters

−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 }

−1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

−0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 −1.5 −1 −0.5 0.5 1 1.5 {x, l1(x) = 0 } {x,l2(x) = 0 } −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1
slide-58
SLIDE 58

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Controlled Markov chain

Strategy :

◮ Since the fluid limit depends on the design parameters through the product ωiσi, Strategy 1. Fix ωi = 1/d and choose the std of the form σi(x). Strategy 2. Fix σi = c and choose the selection of the form ωi(x).

slide-59
SLIDE 59

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Controlled Markov chain

Strategy :

◮ Since the fluid limit depends on the design parameters through the product ωiσi, Strategy 1. Fix ωi = 1/d and choose the std of the form σi(x). Strategy 2. Fix σi = c and choose the selection of the form ωi(x). then, the fluid limit ← → solves the ODE

·

µ= h(µ) with h(x) = 1 √ 2π sign(ℓi(x)) [ωi σi] (x)

slide-60
SLIDE 60

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Controlled Markov chain

Strategy :

◮ Since the fluid limit depends on the design parameters through the product ωiσi, Strategy 1. Fix ωi = 1/d and choose the std of the form σi(x). Strategy 2. Fix σi = c and choose the selection of the form ωi(x). then, the fluid limit ← → solves the ODE

·

µ= h(µ) with h(x) = 1 √ 2π sign(ℓi(x)) [ωi σi] (x) ◮ We propose [ωiσi](x) = c

  • lim

r

∇i ln π(rx) |∇ ln π(rx)|

  • so that

h(x) = c √ 2π

  • lim

r

∇ ln π(rx) |∇ ln π(rx)|

  • A gradient algorithm so that the chain - started far in the tails - is

attracted towards the mode of π (i.e. the “center” of the space)

slide-61
SLIDE 61

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Controlled Markov chain

  • Ex. : Fluid limits of the MwG

[left] non-adaptive [right] adaptive

◮ When π ∼ N2(0, Γ1) Γ1 diagonal

−1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 −1 −0.5 0.5 1 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1

◮ When π ∼ N2(0, Γ1) + N2(0, Γ2)

−1.5 −1 −0.5 0.5 1 1.5 −1.5 −1 −0.5 0.5 1 1.5 M O N −1.5 −1 −0.5 0.5 1 1.5 −1.5 −1 −0.5 0.5 1 1.5

slide-62
SLIDE 62

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Comparison of the strategies

Comparison of the strategies

֒ → Criterion 1 : Based on the Fluid Limit and on its hitting-time of a sphere of radius ρ ∈]0, 1[ when initialized on the unit sphere.

slide-63
SLIDE 63

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Comparison of the strategies

Comparison of the strategies

֒ → Criterion 1 : Based on the Fluid Limit and on its hitting-time of a sphere of radius ρ ∈]0, 1[ when initialized on the unit sphere.

x-axes : polar coordinate of the initial value. y-axes : hitting-time. for the three algorithms Adaptive strategy

Non-Adaptive, ω1 = 0.25 Non-Adaptive, ω1 = 0.5

1 2 3 4 5 6 0.5 1 1.5 2 2.5 3 3.5 4 4.5 Polar coordinate (radius 1) Time 1 2 3 4 5 6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 Polar coordinate (radius 1) Time

π ∼ N2(0, Γ2) Γ2 non-diagonal π ∼ N2(0, Γ1) + N2(0, Γ2)

slide-64
SLIDE 64

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Comparison of the strategies

֒ → Criterion 2 : Based on the Markov chain and its hitting-time of the “center of the space ” when chain started “far” from the center.

slide-65
SLIDE 65

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Comparison of the strategies

֒ → Criterion 2 : Based on the Markov chain and its hitting-time of the “center of the space ” when chain started “far” from the center. ◮ Example : comparison of the two adaptive procedures

π ∼ N8(0, Γ) d = 8 Γ : diagonal, with Γi,i ∼ E(1). 5000 adaptive chains, started from x ∈ {z′Γ−1z = d}. x-axes : hitting-time of the ball of radius √ d for Strat 1 (adapt σ ) y-axes : hitting-time of the ball of radius √ d for Strat 2 (adapt ω)

100 200 300 400 500 600 700 50 100 150 200 250 300 350 400 Strategy 1 Strategy 2

slide-66
SLIDE 66

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • III. Adaptive Metropolis-within-Gibbs sampler

III-d Comparison of the strategies

◮ Example : adaptive vs non-adaptive

π ∼ N8(0, Γ) d = 8 Γ : diagonal, with Γi,i ∼ E(1). 5000 adaptive chains, started from x ∈ {z′Γ−1z = d} x-axes : hitting-time of the ball of radius √ d for the classical algorithm y-axes : hitting-time of the ball of radius √ d for the Strat 2 (adapt ω)

100 200 300 400 500 600 700 800 50 100 150 200 250 300 350 400 450 500 RWMG algorithm Adaptive algorithm

slide-67
SLIDE 67

Stability of Markov Chains based on fluid limit techniques. Applications to MCMC

  • IV. Conclusion

To conclude,

◮ Hist. : fluid limits are common tools in queuing theory

(continuous-time Markov process) We provided an extension of this theory to the study of some (discrete-time) Markov chains.

◮ Fluid limits or drift conditions to prove the ergodicity of the chain ? ◮ provide an analysis of the chain in its transient phase (before

“stationnarity”) Available results · G. Fort, S. Meyn, E. Moulines and P. Priouret. The ODE method for the stability of skip-free Markov Chains with applications to MCMC. To be published, Ann. Appl.

  • Probab. (2008)

· G. Fort. Fluid limit-based tuning of some hybrid MCMC

  • samplers. Submitted (2007).