Advanced Algorithms (XIV) Shanghai Jiao Tong University Chihao - - PowerPoint PPT Presentation

▶

Jul 23, 2023 478 likes •1.01k views

Advanced Algorithms (XIV) Shanghai Jiao Tong University Chihao Zhang June 8, 2020 Mixing Time via Coupling Mixing Time via Coupling The state space Mixing Time via Coupling The state space Transition matrix P Mixing Time

SLIDE 1

Advanced Algorithms (XIV)

Shanghai Jiao Tong University

Chihao Zhang

June 8, 2020

SLIDE 2

Mixing Time via Coupling

SLIDE 3

Mixing Time via Coupling

The state space Ω

SLIDE 4

Mixing Time via Coupling

The state space Ω Transition matrix P ∈ ℝΩ×Ω

SLIDE 5

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω

SLIDE 6

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0

SLIDE 7

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0 Two chains are “coupled” so that:

SLIDE 8

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0 Two chains are “coupled” so that: E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − α) ⋅ d(Xt, Yt)

SLIDE 9

SLIDE 10

In other words, is a super martingale

{d(Xt, Yt)}t≥0

SLIDE 11

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time

SLIDE 12

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time

SLIDE 13

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma

SLIDE 14

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0]

SLIDE 15

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0] For finite , we assume WLOG that

Ω min

x,y∈Ω:x≠y d(x, y) = 1

SLIDE 16

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0] For finite , we assume WLOG that

Ω min

x,y∈Ω:x≠y d(x, y) = 1

Pr[d(Xt, Yt) > 0] = Pr[d(Xt, Yt) ≥ 1] ≤ E[d(Xt, Yt)] ≤ (1 − α)t ⋅ d(X0, Y0)

SLIDE 17

Sampling Proper Colorings

SLIDE 18

Sampling Proper Colorings

SLIDE 19

Sampling Proper Colorings

SLIDE 20

Sampling Proper Colorings

SLIDE 21

Sampling Proper Colorings

SLIDE 22

Sampling Proper Colorings

the number of proper colorings

q

SLIDE 23

Sampling Proper Colorings

the number of proper colorings

q

a graph of maximum degree

G Δ

SLIDE 24

Sampling Proper Colorings

the number of proper colorings

q

a graph of maximum degree

G Δ

Is colorable using colors?

G q

SLIDE 25

SLIDE 26

The problem is NP-hard in general

SLIDE 27

The problem is NP-hard in general We consider the case when q > Δ

SLIDE 28

The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”

SLIDE 29

The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”

Pick

and u.a.r.

Recolor with if possible

v ∈ V c ∈ [q] v c

SLIDE 30

The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”

Pick

and u.a.r.

Recolor with if possible

v ∈ V c ∈ [q] v c

The chain is irreducible when q ≥ Δ + 2

SLIDE 31

The Coupling

SLIDE 32

The Coupling

Two chains choose the same and

v c

SLIDE 33

The Coupling

Two chains choose the same and

v c

v v c =

Good Move

Xt Yt

SLIDE 34

The Coupling

Two chains choose the same and

v c

v v c =

Good Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1

SLIDE 35

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1

SLIDE 36

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt v v c =

Bad Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1

SLIDE 37

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt v v c =

Bad Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1 d(Xt+1, Yt+1) = d(Xt, Yt) + 1

SLIDE 38

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt v v c =

Bad Move

Xt Yt

Pr[ ⋅ ] ≤ 2d(Xt, Yt)Δ Nq d(Xt+1, Yt+1) = d(Xt, Yt) − 1 d(Xt+1, Yt+1) = d(Xt, Yt) + 1

SLIDE 39

SLIDE 40

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

SLIDE 41

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

SLIDE 42

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

SLIDE 43

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

SLIDE 44

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

dTV(Xt, Yt) ≤ (1 − 1 qN )

⋅ N ≤ ε

SLIDE 45

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

dTV(Xt, Yt) ≤ (1 − 1 qN )

⋅ N ≤ ε ⟹ τmix(ε) ≤ qN (log N + log ε−1)

SLIDE 46

Geometric View of Mixing

SLIDE 47

Geometric View of Mixing

A Markov chain is a random walk on the state space

SLIDE 48

Geometric View of Mixing

A Markov chain is a random walk on the state space

SLIDE 49

Geometric View of Mixing

A Markov chain is a random walk on the state space Which random walk mixes faster?

SLIDE 50

Geometric View of Mixing

A Markov chain is a random walk on the state space Which random walk mixes faster? We will develop tools to formalize the intuition

SLIDE 51