Advanced Algorithms (XIV) Shanghai Jiao Tong University Chihao - - PowerPoint PPT Presentation

advanced algorithms xiv
SMART_READER_LITE
LIVE PREVIEW

Advanced Algorithms (XIV) Shanghai Jiao Tong University Chihao - - PowerPoint PPT Presentation

Advanced Algorithms (XIV) Shanghai Jiao Tong University Chihao Zhang June 8, 2020 Mixing Time via Coupling Mixing Time via Coupling The state space Mixing Time via Coupling The state space Transition matrix P Mixing Time


slide-1
SLIDE 1

Advanced Algorithms (XIV)

Shanghai Jiao Tong University

Chihao Zhang

June 8, 2020

slide-2
SLIDE 2

Mixing Time via Coupling

slide-3
SLIDE 3

Mixing Time via Coupling

The state space Ω

slide-4
SLIDE 4

Mixing Time via Coupling

The state space Ω Transition matrix P ∈ ℝΩ×Ω

slide-5
SLIDE 5

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω

slide-6
SLIDE 6

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0

slide-7
SLIDE 7

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0 Two chains are “coupled” so that:

slide-8
SLIDE 8

Mixing Time via Coupling

The state space Ω Two chains and

(X0, X1, …) (Y0, Y1, …)

Transition matrix P ∈ ℝΩ×Ω A distance d : Ω × Ω → ℝ≥0 Two chains are “coupled” so that: E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − α) ⋅ d(Xt, Yt)

slide-9
SLIDE 9
slide-10
SLIDE 10

In other words, is a super martingale

{d(Xt, Yt)}t≥0

slide-11
SLIDE 11

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time

slide-12
SLIDE 12

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time

slide-13
SLIDE 13

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma

slide-14
SLIDE 14

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0]

slide-15
SLIDE 15

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0] For finite , we assume WLOG that

Ω min

x,y∈Ω:x≠y d(x, y) = 1

slide-16
SLIDE 16

In other words, is a super martingale

{d(Xt, Yt)}t≥0

Recall the mixing time By coupling lemma dTV(Xt, Yt) ≤ Pr[Xt ≠ Yt] = Pr[d(Xt, Yt) > 0] For finite , we assume WLOG that

Ω min

x,y∈Ω:x≠y d(x, y) = 1

Pr[d(Xt, Yt) > 0] = Pr[d(Xt, Yt) ≥ 1] ≤ E[d(Xt, Yt)] ≤ (1 − α)t ⋅ d(X0, Y0)

slide-17
SLIDE 17

Sampling Proper Colorings

slide-18
SLIDE 18

Sampling Proper Colorings

slide-19
SLIDE 19

Sampling Proper Colorings

slide-20
SLIDE 20

Sampling Proper Colorings

slide-21
SLIDE 21

Sampling Proper Colorings

slide-22
SLIDE 22

Sampling Proper Colorings

  • the number of proper colorings

q

slide-23
SLIDE 23

Sampling Proper Colorings

  • the number of proper colorings

q

  • a graph of maximum degree

G Δ

slide-24
SLIDE 24

Sampling Proper Colorings

  • the number of proper colorings

q

  • a graph of maximum degree

G Δ

Is colorable using colors?

G q

slide-25
SLIDE 25
slide-26
SLIDE 26

The problem is NP-hard in general

slide-27
SLIDE 27

The problem is NP-hard in general We consider the case when q > Δ

slide-28
SLIDE 28

The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”

slide-29
SLIDE 29

The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”

  • Pick

and u.a.r.

  • Recolor with if possible

v ∈ V c ∈ [q] v c

slide-30
SLIDE 30

The problem is NP-hard in general We consider the case when q > Δ Consider the chain obtained via the “Metropolis Rule”

  • Pick

and u.a.r.

  • Recolor with if possible

v ∈ V c ∈ [q] v c

The chain is irreducible when q ≥ Δ + 2

slide-31
SLIDE 31

The Coupling

slide-32
SLIDE 32

The Coupling

Two chains choose the same and

v c

slide-33
SLIDE 33

The Coupling

Two chains choose the same and

v c

v v c =

Good Move

Xt Yt

slide-34
SLIDE 34

The Coupling

Two chains choose the same and

v c

v v c =

Good Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1

slide-35
SLIDE 35

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1

slide-36
SLIDE 36

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt v v c =

Bad Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1

slide-37
SLIDE 37

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt v v c =

Bad Move

Xt Yt

d(Xt+1, Yt+1) = d(Xt, Yt) − 1 d(Xt+1, Yt+1) = d(Xt, Yt) + 1

slide-38
SLIDE 38

The Coupling

Two chains choose the same and

v c

Pr[ ⋅ ] ≥ d(Xt, Yt) N ⋅ q − 2(Δ − 1) q

v v c =

Good Move

Xt Yt v v c =

Bad Move

Xt Yt

Pr[ ⋅ ] ≤ 2d(Xt, Yt)Δ Nq d(Xt+1, Yt+1) = d(Xt, Yt) − 1 d(Xt+1, Yt+1) = d(Xt, Yt) + 1

slide-39
SLIDE 39
slide-40
SLIDE 40

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

slide-41
SLIDE 41

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

slide-42
SLIDE 42

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

slide-43
SLIDE 43

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

slide-44
SLIDE 44

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

dTV(Xt, Yt) ≤ (1 − 1 qN )

t

⋅ N ≤ ε

slide-45
SLIDE 45

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ d(Xt, Yt) ⋅ (1 + 2Δ − (q − 2Δ + 2)) qN ) = d(Xt, Yt) ⋅ (1 − q − 4Δ + 2 qN )

So if , we have

q ≥ 4Δ − 1

E[d(Xt+1, Yt+1) ∣ (Xt, Yt)] ≤ (1 − 1 qN) d(Xt, Yt)

dTV(Xt, Yt) ≤ (1 − 1 qN )

t

⋅ N ≤ ε ⟹ τmix(ε) ≤ qN (log N + log ε−1)

slide-46
SLIDE 46

Geometric View of Mixing

slide-47
SLIDE 47

Geometric View of Mixing

A Markov chain is a random walk on the state space

slide-48
SLIDE 48

Geometric View of Mixing

A Markov chain is a random walk on the state space

slide-49
SLIDE 49

Geometric View of Mixing

A Markov chain is a random walk on the state space Which random walk mixes faster?

slide-50
SLIDE 50

Geometric View of Mixing

A Markov chain is a random walk on the state space Which random walk mixes faster? We will develop tools to formalize the intuition

slide-51
SLIDE 51

Back to Graph Spectrum