Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
Parallel tempering and Interacting MCMC algorithms
Gersende FORT / Eric MOULINES
Telecom Paris Tech CNRS - LTCI
Parallel tempering and Interacting MCMC algorithms Gersende FORT / - - PowerPoint PPT Presentation
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler Parallel tempering and Interacting MCMC algorithms Gersende FORT / Eric MOULINES Telecom Paris Tech CNRS - LTCI Parallel tempering and Interacting MCMC algorithms
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
Telecom Paris Tech CNRS - LTCI
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ The Equi Energy sampler Kou et al (2006) is an example of Interacting
◮ The idea is to replace an instantaneous swap by an interaction
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ The Equi Energy sampler Kou et al (2006) is an example of Interacting
◮ The idea is to replace an instantaneous swap by an interaction
◮ Will define X(t) = {X(t) n , n ≥ 0} with X(1) (hot temperature), · · · ,
◮ Algorithm: given the previous level X(k−1) 1:n−1 and the current point
n−1, define X(k) n
◮ (MCMC step / local moves) with probability ǫ,
n
n−1, ·)
◮ (Interaction step / global moves) otherwise,
1:n−1} with the same
n−1
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
1 2 3 4 5 6 7 8 9 10 −2 2 4 6 8 10 Target density : mixture of 2−dim Gaussian draws means of the components
◮ target density : π = 20 i=1 N2(µi, Σi) ◮ K processes with target distribution π1/Tk
−2 2 4 6 8 10 12 −4 −2 2 4 6 8 10 12 14 Target density at temperature 1 draws means of the components −2 2 4 6 8 10 12 −2 2 4 6 8 10 12 Target density at temperature 2 draws means of the components 1 2 3 4 5 6 7 8 9 10 −2 2 4 6 8 10 12 Target density at temperature 3 draws means of the components 1 2 3 4 5 6 7 8 9 10 −2 2 4 6 8 10 12 Target density at temperature 4 draws means of the components 1 2 3 4 5 6 7 8 9 −2 2 4 6 8 10 Target density at temperature 5 draws means of the components 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 10 Hastings−Metropolis draws means of the components
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ ergodicity:
n
◮ law of large numbers: 1 n
j=1 h(X(K) j
◮ CLT:
j=1{h(X(K) j
see e.g. Kou, Zhou, Wong (2006); Atchad´ e (2010); Andrieu, Jasra, Doucet, Del Moral (2011); Fort, Moulines, Priouret (2012); Fort, Moulines, Priouret, Vandekerkhove (2012) these problems are still open.
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ In the original EE: energy rings = strata in the range of the energy
◮ Our contribution: tune adaptively the boundaries of the strata
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ Target distribution on R6
◮ We compare Hastings-Metropolis (HM); and the EE sampler and the
◮ The last plot is for the 2-d projection
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
1 2 3 4 5 6 x 10
50.5 1 1.5 2 2.5 L1 error when estimating the means MH EES SA−AEES 1 2 3 4 5 6 x 10
530 40 50 60 70 80 90 100 Time spent in the left mode ees aees true
[Top] Error when estimating the means 1 6
6
n
n
X(K)
j,i
− Eπ[Xi]
where the path is initialized.
[Bottom R] Probability
being in some ellipsoids, for the first mode (line) and the second one (dashed line)
1 2 3 4 5 6 x 10
50.02 0.04 0.06 0.08 0.1 0.12 Probability of being in some area (true prob. is 0.05) mode 1 ees mode 2 ees mode 1 aees mode 2 aees true mode 1 true mode 2
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
1000 5000 25 k 100 k 250 k 400 k 550 k 0.5 1 1.5 2 L1 error for HM (red), EES (black) and AEES (blue) 1000 5000 25 k 100 k 250 k 400 k 550 k 10 20 30 40 50 60 70 80 90 100 Percent of the time spent in the first component, for EES (black) and AEES (blue)
[Top] Error when estimating the means 1 6
6
n
n
X(K)
j,i
− Eπ[Xi]
where the path is initialized.
[Bottom R] Probability
being in some ellipsoids for the first mode
1000 5000 25 k 100 k 250 k 400 k 550 k 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 Proba of being in the left ellipsoid, for EES (black) and AEES (blue)
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ Level 1 (Hot level)
◮ Sample X(1) with target π1/T1 (MCMC). ◮ at each time n, update the boundaries H(1)
n,1, · · · , H(1) n,L computed
1:n
◮ Level 2
◮ Sample X(2) (MCMC step and interaction step) with target π1/T2 .
◮ at each time n, update the boundaries H(2)
n,1, · · · , H(2) n,L computed
1:n
◮ Repeat until Level K.
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ we assume the process X(k−1) ”converges”. ◮ we prove that the process X(k) has the same convergence properties. ◮ Repeat from level 1 to K.
◮ the conditional distribution L(X(k) n |past(1:k) n−1 ) is P (k) θn−1(X(k) n−1, ·)
P (k) θn (x, dy) = ǫP (k)(x, dy) + (1 − ǫ)K(k) θn (x, dy) K(k) θn (x, A) =
α(k) θn (x, y) gθn (x, y)θn(dy)
+ δx(A)
θn (x, y)} gθn (x, y)θn(dy)
θn = 1 n n
δ X(k−1) j α(k) θn (x, y) = 1 ∧ π1/Tk−1/Tk−1 (y) π1/Tk−1/Tk−1 (x)
gθn (x, y) = ”x and y are in the same energy ring with boundaries defined by H(k−1) n,• ” (ex.) =
1 if otherwise
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ we assume the process X(k−1) ”converges”. ◮ we prove that the process X(k) has the same convergence properties. ◮ Repeat from level 1 to K.
◮ the conditional distribution L(X(k) n |past(1:k) n−1 ) is P (k) θn−1(X(k) n−1, ·) ◮ containment and diminishing adaptation conditions extensions from the pioneering
work by (Roberts, Rosenthal (2005)) + Poisson equation + Limit Theorems for
◮ condition on the adapted boundaries
n,• − H(k) n−1,•
n,• → H(k) ∞,• w.p.1 when n → ∞.
x
∞ (x, y)π1/Tk(dy) > 0
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
i
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
n,i ) for 1 ≤ i ≤ L (computed from X(k) 1:n) as an estimator
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
n,i ) for 1 ≤ i ≤ L (computed from X(k) 1:n) as an estimator
n ◮ determine the ring such that
n−1) ≤ Hi ◮ choose (at random) one point among X(k−1) 1
n−1
◮ When convergence:
n
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
n (h) = 1
n
j
)≤h
n+1,• = H(k) n,• + γn+1 Ξ
n+1, H(k) n,•
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
−40 −20 20 40 −30 −20 −10 10 20 30 −40 −20 20 40 −30 −20 −10 10 20 30 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22
1 2 3 4 5 6 7 8 9 10
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ a background sequence, with a Markovian transition (known) ◮ motifs, of known length, with independent multinomial transition
Parallel tempering and Interacting MCMC algorithms Adaptive Equi-Energy sampler
◮ EE depends on many design parameters that all play a role on the
◮ Convergence results are established ∗ when the quantiles are
◮ Work in progress: convergence when the quantiles are estimated by
◮ First convergence results on EE with selection of the auxiliary point
∗Submitted, available at http://perso.telecom-paristech.fr/ schreck