SLIDE 1 Advanced Algorithms (XIV)
Shanghai Jiao Tong University
Chihao Zhang
June 8, 2020
SLIDE 2
Mixing Time via Coupling
SLIDE 3
Mixing Time via Coupling
The state space Ω
SLIDE 4
Mixing Time via Coupling
The state space Ω Transition matrix P ∈ ℝΩ×Ω
SLIDE 5
Mixing Time via Coupling
The state space Ω Two chains and
(X0, X1, …) (Y0, Y1, …)
Transition matrix P ∈ ℝΩ×Ω
SLIDE 6
Mixing Time via Coupling
The state space Ω Two chains and
(X0, X1, …) (Y0, Y1, …)
Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0
SLIDE 7
Mixing Time via Coupling
The state space Ω Two chains and
(X0, X1, …) (Y0, Y1, …)
Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0 Two chains are “coupled” so that:
SLIDE 8
Mixing Time via Coupling
The state space Ω Two chains and
(X0, X1, …) (Y0, Y1, …)
Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0 Two chains are “coupled” so that: E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − α) ⋅ d(Xt, Yt)
SLIDE 9
SLIDE 10
In other words, is a super martingale
{d(Xt, Yt)}t≥0
SLIDE 11
In other words, is a super martingale
{d(Xt, Yt)}t≥0
Recall the mixing time
SLIDE 12
In other words, is a super martingale
{d(Xt, Yt)}t≥0
Recall the mixing time
SLIDE 13
In other words, is a super martingale
{d(Xt, Yt)}t≥0
Recall the mixing time By coupling lemma
SLIDE 14
In other words, is a super martingale
{d(Xt, Yt)}t≥0
Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0]
SLIDE 15 In other words, is a super martingale
{d(Xt, Yt)}t≥0
Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0] For finite , we assume WLOG that
Ω min
x,y∈Ω:x≠y d(x, y) = 1
SLIDE 16 In other words, is a super martingale
{d(Xt, Yt)}t≥0
Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0] For finite , we assume WLOG that
Ω min
x,y∈Ω:x≠y d(x, y) = 1
Pr[d(Xt, Yt) > 0] = Pr[d(Xt, Yt) ≥ 1] ≤ E[d(Xt, Yt)] ≤ (1 − α)t ⋅ d(X0, Y0)
SLIDE 17
Sampling Proper Colorings
SLIDE 18
Sampling Proper Colorings
SLIDE 19
Sampling Proper Colorings
SLIDE 20
Sampling Proper Colorings
SLIDE 21
Sampling Proper Colorings
SLIDE 22 Sampling Proper Colorings
- the number of proper colorings
q
SLIDE 23 Sampling Proper Colorings
- the number of proper colorings
q
- a graph of maximum degree
G Δ
SLIDE 24 Sampling Proper Colorings
- the number of proper colorings
q
- a graph of maximum degree
G Δ
Is colorable using colors?
G q
SLIDE 25
SLIDE 26
The problem is NP-hard in general
SLIDE 27
The problem is NP-hard in general We consider the case when q > Δ
SLIDE 28
The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”
SLIDE 29 The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”
and u.a.r.
v ∈ V c ∈ [q] v c
SLIDE 30 The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”
and u.a.r.
v ∈ V c ∈ [q] v c
The chain is irreducible when q ≥ Δ + 2
SLIDE 31
The Coupling
SLIDE 32
The Coupling
Two chains choose the same and
v c
SLIDE 33 The Coupling
Two chains choose the same and
v c
v v c =
Good Move
Xt Yt
SLIDE 34 The Coupling
Two chains choose the same and
v c
v v c =
Good Move
Xt Yt
d(Xt+1, Yt+1) = d(Xt, Yt) − 1
SLIDE 35 The Coupling
Two chains choose the same and
v c
Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q
v v c =
Good Move
Xt Yt
d(Xt+1, Yt+1) = d(Xt, Yt) − 1
SLIDE 36 The Coupling
Two chains choose the same and
v c
Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q
v v c =
Good Move
Xt Yt v v c =
Bad Move
Xt Yt
d(Xt+1, Yt+1) = d(Xt, Yt) − 1
SLIDE 37 The Coupling
Two chains choose the same and
v c
Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q
v v c =
Good Move
Xt Yt v v c =
Bad Move
Xt Yt
d(Xt+1, Yt+1) = d(Xt, Yt) − 1 d(Xt+1, Yt+1) = d(Xt, Yt) + 1
SLIDE 38 The Coupling
Two chains choose the same and
v c
Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q
v v c =
Good Move
Xt Yt v v c =
Bad Move
Xt Yt
Pr[ ⋅ ] ≤ 2d(Xt, Yt)Δ Nq d(Xt+1, Yt+1) = d(Xt, Yt) − 1 d(Xt+1, Yt+1) = d(Xt, Yt) + 1
SLIDE 39
SLIDE 40 E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )
SLIDE 41 E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )
So if , we have
q ≥ 4Δ − 1
SLIDE 42 E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )
So if , we have
q ≥ 4Δ − 1
E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)
SLIDE 43 E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )
So if , we have
q ≥ 4Δ − 1
E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)
SLIDE 44 E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )
So if , we have
q ≥ 4Δ − 1
E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)
dTV(Xt, Yt) ≤ (1 − 1 qN )
t
⋅ N ≤ ε
SLIDE 45 E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )
So if , we have
q ≥ 4Δ − 1
E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)
dTV(Xt, Yt) ≤ (1 − 1 qN )
t
⋅ N ≤ ε ⟹ τmix(ε) ≤ qN (log N + log ε−1)
SLIDE 46
Geometric View of Mixing
SLIDE 47
Geometric View of Mixing
A Markov chain is a random walk on the state space
SLIDE 48
Geometric View of Mixing
A Markov chain is a random walk on the state space
SLIDE 49
Geometric View of Mixing
A Markov chain is a random walk on the state space Which random walk mixes faster?
SLIDE 50
Geometric View of Mixing
A Markov chain is a random walk on the state space Which random walk mixes faster? We will develop tools to formalize the intuition
SLIDE 51
Back to Graph Spectrum