Rapid mixing and Markov bases Tobias Windisch OvGU Magdeburg AMS - - PowerPoint PPT Presentation

rapid mixing and markov bases
SMART_READER_LITE
LIVE PREVIEW

Rapid mixing and Markov bases Tobias Windisch OvGU Magdeburg AMS - - PowerPoint PPT Presentation

Rapid mixing and Markov bases Tobias Windisch OvGU Magdeburg AMS Sectional Meeting, Chicago, USA October 4, 2015 1 / 11 Integer points of polytopes Fibers Let A Z m d and b Z m , the b-fiber of A is F A , b := { u N d : A u


slide-1
SLIDE 1

Rapid mixing and Markov bases

Tobias Windisch

OvGU Magdeburg

AMS Sectional Meeting, Chicago, USA October 4, 2015

1 / 11

slide-2
SLIDE 2

Integer points of polytopes

Fibers

Let A ∈ Zm×d and b ∈ Zm, the b-fiber of A is

FA,b := {u ∈ Nd : A · u = b}.

2 / 11

slide-3
SLIDE 3

Integer points of polytopes

Fibers

Let A ∈ Zm×d and b ∈ Zm, the b-fiber of A is

FA,b := {u ∈ Nd : A · u = b}. Fiber graphs

Let M ⊆ ker(A) ∩ Zd. The graph FA,b(M) has vertices FA,b and u, v ∈ FA,b are adjacent if u − v ∈ ±M.

Markov bases M is a Markov basis if FA,b(M) is connected for all b ∈ Zd.

2 / 11

slide-4
SLIDE 4

Integer points of polytopes

Fibers

Let A ∈ Zm×d and b ∈ Zm, the b-fiber of A is

FA,b := {u ∈ Nd : A · u = b}. Fiber graphs

Let M ⊆ ker(A) ∩ Zd. The graph FA,b(M) has vertices FA,b and u, v ∈ FA,b are adjacent if u − v ∈ ±M.

Markov bases M is a Markov basis if FA,b(M) is connected for all b ∈ Zd.

2 / 11

slide-5
SLIDE 5

Integer points of polytopes

Fibers

Let A ∈ Zm×d and b ∈ Zm, the b-fiber of A is

FA,b := {u ∈ Nd : A · u = b}. Fiber graphs

Let M ⊆ ker(A) ∩ Zd. The graph FA,b(M) has vertices FA,b and u, v ∈ FA,b are adjacent if u − v ∈ ±M.

Markov bases M is a Markov basis if FA,b(M) is connected for all b ∈ Zd. Theorem (Diaconis, Sturmfels, 1996) M is a Markov basis of A if and only if IM = IA.

2 / 11

slide-6
SLIDE 6

Walking randomly on fibers

Algorithm (Simple Walk) Input: M Markov basis, u0 ∈ FA,b, r ∈ N Output: Uniform sample from FA,b

◮ FOR i = 1..r

◮ Sample m ∈ ±M uniformly ◮ IF ui + m ∈ Nd ◮ THEN ui = ui−1 + m ◮ ELSE ui = ui−1

◮ RETURN ur

→ This gives an irreducible and aperiodic Markov chain. Convergence Theorem of Markov chains

◮ After r = O

  • −1

log(|λ|)

  • steps, ur can be regarded as an uniform sample

from FA,b

◮ −1 < λ < 1: SLEM of transition matrix of the random walk.

3 / 11

slide-7
SLIDE 7

Analysing the speed

What is fast? (Gn)n∈N sequence of graphs with simple walk

◮ Expanders: number of steps needed is independent of n ◮ Rapid mixing: number of steps needed is polynomial in log |V(Gn)|

4 / 11

slide-8
SLIDE 8

Analysing the speed

What is fast? (Gn)n∈N sequence of graphs with simple walk

◮ Expanders: number of steps needed is independent of n ◮ Rapid mixing: number of steps needed is polynomial in log |V(Gn)|

→ Consider (n · b)n∈N ⊆ NA

4 / 11

slide-9
SLIDE 9

Analysing the speed

What is fast? (Gn)n∈N sequence of graphs with simple walk

◮ Expanders: number of steps needed is independent of n ◮ Rapid mixing: number of steps needed is polynomial in log |V(Gn)|

→ Consider (n · b)n∈N ⊆ NA Expander and rapid mixing

For any n ∈ N, let λn be the SLEM of the simple walk on FA,n·b(Mn)

◮ (FA,n·b(Mn))n∈N is an expander if λn ≤ 1 − ǫ for all n ∈ N ◮ (FA,n·b(Mn))n∈N is rapidly mixing if λn ≤ 1 − 1 p(log n) for all n ∈ N

4 / 11

slide-10
SLIDE 10

Analysing the speed

What is fast? (Gn)n∈N sequence of graphs with simple walk

◮ Expanders: number of steps needed is independent of n ◮ Rapid mixing: number of steps needed is polynomial in log |V(Gn)|

→ Consider (n · b)n∈N ⊆ NA Expander and rapid mixing

For any n ∈ N, let λn be the SLEM of the simple walk on FA,n·b(Mn)

◮ (FA,n·b(Mn))n∈N is an expander if λn ≤ 1 − ǫ for all n ∈ N ◮ (FA,n·b(Mn))n∈N is rapidly mixing if λn ≤ 1 − 1 p(log n) for all n ∈ N

Algebraic statistics

Use the same moves for every right-hand side: Mn := M

4 / 11

slide-11
SLIDE 11

Fiber walks can be slow!

No expander

◮ H :=

 

I3 I3

−13

I3 I3

−13

1 1

 

◮ Ray {n · e7 : n ∈ N}

2 4 6 8 10 0.4 0.6 0.8 1 n SLEM Graver Gr¨

  • bner

5 / 11

slide-12
SLIDE 12

Fiber walks can be slow!

No expander

◮ H :=

 

I3 I3

−13

I3 I3

−13

1 1

 

◮ Hemmecke-Ray {n · e7 : n ∈ N}

2 4 6 8 10 0.4 0.6 0.8 1 n SLEM Graver Gr¨

  • bner

5 / 11

slide-13
SLIDE 13

Fiber walks can be slow!

No expander

◮ H :=

 

I3 I3

−13

I3 I3

−13

1 1

 

◮ Hemmecke-Ray {n · e7 : n ∈ N} ◮ limn→∞ λ(Gr¨

  • bner) = 1

◮ limn→∞ λ(Graver) = 1

2 4 6 8 10 0.4 0.6 0.8 1 n SLEM Graver Gr¨

  • bner

5 / 11

slide-14
SLIDE 14

Fiber walks can be slow!

No expander

◮ H :=

 

I3 I3

−13

I3 I3

−13

1 1

 

◮ Hemmecke-Ray {n · e7 : n ∈ N} ◮ limn→∞ λ(Gr¨

  • bner) = 1

◮ limn→∞ λ(Graver) = 1

2 4 6 8 10 0.4 0.6 0.8 1 n SLEM Graver Gr¨

  • bner

Not rapidly mixing

◮ A := (1, 1) ∈ Z1×2, M := {(1, −1)t}, b = 1 ◮ FA,n(M) ∼

=

. . .

⇒ SLEM of FA,n(M) ≥ 1 −

1 n+1

5 / 11

slide-15
SLIDE 15

Fiber walks can be slow!

No expander

◮ H :=

 

I3 I3

−13

I3 I3

−13

1 1

 

◮ Hemmecke-Ray {n · e7 : n ∈ N} ◮ limn→∞ λ(Gr¨

  • bner) = 1

◮ limn→∞ λ(Graver) = 1

2 4 6 8 10 0.4 0.6 0.8 1 n SLEM Graver Gr¨

  • bner

Not rapidly mixing

◮ A := (1, 1) ∈ Z1×2, M := {(1, −1)t}, b = 1 ◮ FA,n(M) ∼

=

. . .

⇒ SLEM of FA,n(M) ≥ 1 −

1 n+1 ◮ Is this (asymptotic) behaviour typical for fiber walks?

5 / 11

slide-16
SLIDE 16

Edge-expansion

Edge-expansion, Cheeger constant

h(G) := min|U|≤ 1

2|V|

#(edges leaving U) |U|

Expander Mixing Lemma

Simple walk on d-regular graph G: 1 − h(G)

d

≤ |λ| ≤ 1 − h(G)2

d2

Why is this convenient for fibers? FA,n1·b(M) FA,n2·b(M)

6 / 11

slide-17
SLIDE 17

How fast are fiber walks?

Theorem (W.; 2015)

Let M be a Markov basis of A and let b ∈ NA with dim(FA,b) > 0. Then:

◮ limn→∞ h(FA,n·b(M)) = 0

(no expander)

7 / 11

slide-18
SLIDE 18

How fast are fiber walks?

Theorem (W.; 2015)

Let M be a Markov basis of A and let b ∈ NA with dim(FA,b) > 0. Then:

◮ limn→∞ h(FA,n·b(M)) = 0

(no expander)

◮ h(FA,n·b(M)) ∈ O( 1 n)

(not rapidly mixing)

7 / 11

slide-19
SLIDE 19

How fast are fiber walks?

Theorem (W.; 2015)

Let M be a Markov basis of A and let b ∈ NA with dim(FA,b) > 0. Then:

◮ limn→∞ h(FA,n·b(M)) = 0

(no expander)

◮ h(FA,n·b(M)) ∈ O( 1 n)

(not rapidly mixing)

Graver vs. Gr¨

  • bner vs. Markov

Using more structural moves does not improve mixing asymptotically.

7 / 11

slide-20
SLIDE 20

How fast are fiber walks?

Theorem (W.; 2015)

Let M be a Markov basis of A and let b ∈ NA with dim(FA,b) > 0. Then:

◮ limn→∞ h(FA,n·b(M)) = 0

(no expander)

◮ h(FA,n·b(M)) ∈ O( 1 n)

(not rapidly mixing)

Graver vs. Gr¨

  • bner vs. Markov

Using more structural moves does not improve mixing asymptotically.

A way out?

Use an “adapted” Markov basis Mb

n for

every FA,n·b and control h(FA,n·b(Mb

n))

|Mb

n|

.

7 / 11

slide-21
SLIDE 21

How fast are fiber walks?

Theorem (W.; 2015)

Let M be a Markov basis of A and let b ∈ NA with dim(FA,b) > 0. Then:

◮ limn→∞ h(FA,n·b(M)) = 0

(no expander)

◮ h(FA,n·b(M)) ∈ O( 1 n)

(not rapidly mixing)

Graver vs. Gr¨

  • bner vs. Markov

Using more structural moves does not improve mixing asymptotically.

A way out?

Use an “adapted” Markov basis Mb

n for

every FA,n·b and control h(FA,n·b(Mb

n))

|Mb

n|

.

◮ In general, |Mb n| = O(log n) does not suffice to obtain rapid mixing.

7 / 11

slide-22
SLIDE 22

Can we construct expanders on fibers?

Adaption

Let M = {m1, . . . , mr} ⊆ Zd be a Markov basis , n ∈ N, and b ∈ NA.

Mb

n :=

  

r

  • j=1

λjmj : λ1, . . . , λr ∈ Z,

r

  • j=1

|λj| ≤ Diam(FA,n·b(M))    .

SLEM: 1 − |FA,n·b|

|Mb

n| 8 / 11

slide-23
SLIDE 23

Can we construct expanders on fibers?

Adaption

Let M = {m1, . . . , mr} ⊆ Zd be a Markov basis , n ∈ N, and b ∈ NA.

Mb

n :=

  

r

  • j=1

λjmj : λ1, . . . , λr ∈ Z,

r

  • j=1

|λj| ≤ Diam(FA,n·b(M))    .

SLEM: 1 − |FA,n·b|

|Mb

n|

Theorem (W.; 2015)

Diam(FA,nb(M)) ∈ O(n) ⇒ (FA,njb(Mb

nj))j∈N is an expander.

8 / 11

slide-24
SLIDE 24

How to interpret this?

0.8 0.9 1 n SLEM conventional adapted

9 / 11

slide-25
SLIDE 25

Computational results

Sampling from Mb

i ◮ Sample coefficients λi ∈ [li, ui] and use the move r i=1 λimi.

Hemmecke-Ray FH3,n·e7(Gr¨

  • bnere7

n )

n = 103: 3.551.720 n = 106: 4.058.733 n = 1050: 4.059.281

3 × 3 independence model FA33,n·16(M16

i )

n = 1010: 21.062.343 n = 10100: 37.255.074 n = 101000: 37.255.074

Adapted vs. conventional

◮ A = (1, 1, 1) ◮ M = {e1 − e2, e1 − e3} ◮ bn = n · 1

5 10 15 20 0.8 0.9 1 n SLEM M M1

2n 10 / 11

slide-26
SLIDE 26

Wrap up

This talk was about

◮ Fiber walks are (asymptotically) slow. ◮ Adaption of Markov basis can lead to expanders.

Open problems

◮ How can mixing behaviour be improved for single fibers? ◮ What happens with the SLEM when using the Metropolis-Hastings

Algorithm? arXiv:1505.03018

11 / 11

slide-27
SLIDE 27

Wrap up

This talk was about

◮ Fiber walks are (asymptotically) slow. ◮ Adaption of Markov basis can lead to expanders.

Open problems

◮ How can mixing behaviour be improved for single fibers? ◮ What happens with the SLEM when using the Metropolis-Hastings

Algorithm? arXiv:1505.03018 Thanks!

11 / 11