Improved FPTAS for Multi - spin Systems Pinyan Lu Yitong Yin - - PowerPoint PPT Presentation

improved fptas for multi spin systems
SMART_READER_LITE
LIVE PREVIEW

Improved FPTAS for Multi - spin Systems Pinyan Lu Yitong Yin - - PowerPoint PPT Presentation

Improved FPTAS for Multi - spin Systems Pinyan Lu Yitong Yin Microsoft Research Asia Nanjing University presented by: Sangxia Huang KTH Colorings instance : undirected G(V,E) with max-degree q colors: goal : counting the number of


slide-1
SLIDE 1

Improved FPTAS

for

Multi-spin Systems

presented by: Sangxia Huang KTH Yitong Yin Nanjing University Pinyan Lu Microsoft Research Asia

slide-2
SLIDE 2

Colorings

instance: undirected G(V,E) with max-degree ≤Δ goal: counting the number of proper q-colorings for G q colors:

  • exact counting is #P-hard
  • when q<Δ , decision of existence is NP-hard
slide-3
SLIDE 3

Colorings

instance: undirected G(V,E) with max-degree ≤Δ goal: counting the number of proper q-colorings for G q colors:

  • exact counting is #P-hard
  • when q<Δ , decision of existence is NP-hard

approximately counting the number of proper q-colorings for G when q ≥αΔ+β

equivalent to sampling an almost uniform random q-coloring

slide-4
SLIDE 4

Spin System

(pairwise Markov random field)

  • undirected G(V,E);
  • q states: [q];
  • each edge e∈E associated with an activity:
  • each vertex v∈V associated with an external field:

Ae : [q] × [q] → R≥0

a symmetric nonnegative q×q matrix

Fv : [q] → R≥0

a nonnegative q-vector

instance: Z = X

x∈[q]V

Y

e=uv∈E

Ae(xu, xv) Y

v∈V

Fv(xv) goal: computing the partition function:

Aa Ab Ac Ad Ae Af Fw Fu Fv Fx Fy

slide-5
SLIDE 5

Spin System

Z = X

x∈[q]V

Y

e=uv∈E

Ae(xu, xv) Y

v∈V

Fv(xv)

Aa Ab Ac Ad Ae Af Fw Fu Fv Fx Fy enumerate all configurations binary constraints unary constraints

Ae : [q] × [q] → R≥0 Fv : [q] → R≥0 partition function: count the # of solutions to an CSP

slide-6
SLIDE 6

Spin System

Z = X

x∈[q]V

Y

e=uv∈E

Ae(xu, xv) Y

v∈V

Fv(xv)

Aa Ab Ac Ad Ae Af Fw Fu Fv Fx Fy enumerate all configurations binary constraints unary constraints

A =      ...     

1 1

F =      1 1 . . . 1     

Ae : [q] × [q] → R≥0 Fv : [q] → R≥0

coloring:

partition function: count the # of solutions to an CSP

slide-7
SLIDE 7
  • 2-spin: q=2
  • hardcore model (independent set), Ising model, etc.
  • multi-spin: general q
  • coloring:

Examples of Spin systems

A =      ...     

1 1

F =      1 1 . . . 1     

slide-8
SLIDE 8
  • 2-spin: q=2
  • hardcore model (independent set), Ising model, etc.
  • multi-spin: general q
  • coloring:
  • Potts model: inverse temperature β

Examples of Spin systems 1 1

A =      eβ eβ ... eβ     

arbitrary F when β=-∞ and F=(1,1,... ,1), it is coloring

A =      ...     

1 1

F =      1 1 . . . 1     

slide-9
SLIDE 9

Results

  • coloring: q ≥αΔ+β
  • randomized algorithms: by simulating a random walk

(the Glauber dynamics) over colorings

  • α=11/6 (Jerrum’95⇢Bubley-Dyer’97⇢Vigoda’99)
  • deterministic algorithms: by exploiting the

correlation decay (spatial mixing) property

  • α≈2.8432 (Gamarnik-Katz’07)
  • just correlation decay (no FPTAS): α≈1.763

(Goldberg-Martin-Paterson’05, Gamarnik-Katz-Misra’12)

sufficient conditions for FPTAS for classes of spin systems

this paper: deterministic FPTAS for α≈2.58071

slide-10
SLIDE 10

Results

  • general multi-spin system:
  • Gamarnik-Katz’07:
  • on Potts model (with inverse temperature β):

sufficient conditions for FPTAS for classes of spin systems

(c∆ − c−∆)∆q∆ < 1

in terms of it implies:

3∆(e|β| − 1) ≤ 1 c = max

e∈E w,x,y,z∈[q]

Ae(x, y) Ae(w, z)

this paper:

3∆(c∆ − 1) ≤ 1

an exponential improvement!

slide-11
SLIDE 11

Results

  • general multi-spin system:
  • Gamarnik-Katz’07:
  • on Potts model (with inverse temperature β):

sufficient conditions for FPTAS for classes of spin systems

(c∆ − c−∆)∆q∆ < 1

in terms of it implies:

3∆(e|β| − 1) ≤ 1 c = max

e∈E w,x,y,z∈[q]

Ae(x, y) Ae(w, z)

this paper:

3∆(c∆ − 1) ≤ 1

  • confirming the conjecture of in [GK’07]
  • asymptotically matching the inaproximability

bound for in [Galanis-Stefankovic-Vigoda’13] eβ < 1 − q

β < 0 |β| = O 1

  • an exponential

improvement!

slide-12
SLIDE 12

The standard first step:

reducing to the computing of marginal probability

Z = X

x∈[q]V

Y

e=uv∈E

Ae(xu, xv) Y

v∈V

Fv(xv) for any configuration x ∈ [q]V Gibbs measure: marginal probability:

P[X = x] = Q

e=uv∈E Ae(xu, xv) Q v∈V Fv(xv)

Z P[Xv = xv]

slide-13
SLIDE 13

The standard first step:

reducing to the computing of marginal probability

Z = X

x∈[q]V

Y

e=uv∈E

Ae(xu, xv) Y

v∈V

Fv(xv) for any configuration x ∈ [q]V Gibbs measure: marginal probability:

P[X = x] = Q

e=uv∈E Ae(xu, xv) Q v∈V Fv(xv)

Z P[Xv = xv]

for self-reducible class of spin-systems:

efficient approximation

  • f marginal probability

(with additive error)

FPTAS for Z

Jerrum-Valiant-Vazirani’86

slide-14
SLIDE 14

The standard first step:

  • self-reducible: general spin systems, Potts models
  • not self-reducible: coloring
  • self-reducible superclass of coloring: list-coloring

instance: undirected G(V,E)

each vertex v associated with a list Lv of colors allowed to use on v

{ } { } { } { } { }

coloring ⊂ list-coloring ⊂ Potts ⊂ multi-spin

| {z }

self-reducible

slide-15
SLIDE 15

The standard first step:

  • self-reducible: general spin systems, Potts models
  • not self-reducible: coloring
  • self-reducible superclass of coloring: list-coloring

for

multi-spin system Potts model list-coloring ( )

approximate the marginal (with additive error)

P[Xv = x]

new goal:

instance: undirected G(V,E)

each vertex v associated with a list Lv of colors allowed to use on v

{ } { } { } { } { }

coloring ⊂ list-coloring ⊂ Potts ⊂ multi-spin

| {z }

self-reducible

slide-16
SLIDE 16

The standard first step:

  • self-reducible: general spin systems, Potts models
  • not self-reducible: coloring
  • self-reducible superclass of coloring: list-coloring

for

multi-spin system Potts model list-coloring ( )

approximate the marginal (with additive error)

P[Xv = x]

new goal:

instance: undirected G(V,E)

each vertex v associated with a list Lv of colors allowed to use on v

{ } { } { } { } { }

coloring ⊂ list-coloring ⊂ Potts ⊂ multi-spin

| {z }

self-reducible classic way: random walk new way: correlation decay

slide-17
SLIDE 17

Recursion for List-Coloring

PΩ(Xv = x) = Qd

i=1 PΩi

v,x(Xvi 6= x)

P

y∈Lv

Qd

i=1 PΩi

v,x(Xvi 6= y)

= Qd

i=1

⇣ 1 PΩi

v,x(Xvi = x)

⌘ P

y∈Lv

Qd

i=1

⇣ 1 PΩi

v,x(Xvi = y)

⌘ Ω

list-coloring instance

Ωi

v,x

v v1 vi vd

G

v v1 vd

G

n vi−1

: delete v

∀j<i, delete x

from list

Lvj

Lvj − {x}

v’s neighbors: v1, v2, . . . , vd color: x

Gamarnik-Katz’07:

slide-18
SLIDE 18

Recursion for List-Coloring

PΩ(Xv = x) = Qd

i=1 PΩi

v,x(Xvi 6= x)

P

y∈Lv

Qd

i=1 PΩi

v,x(Xvi 6= y)

= Qd

i=1

⇣ 1 PΩi

v,x(Xvi = x)

⌘ P

y∈Lv

Qd

i=1

⇣ 1 PΩi

v,x(Xvi = y)

⌘ Ω

list-coloring instance

Ωi

v,x

v v1 vi vd

G

v v1 vd

G

n vi−1

: delete v

∀j<i, delete x

from list

Lvj

Lvj − {x}

v’s neighbors: v1, v2, . . . , vd color: x

Gamarnik-Katz’07:

telescopic products

slide-19
SLIDE 19

Recursion for general multi-spin system

multi-spin system

Ωi

v,x

v v1 vi vd

G

v v1 vd

G

n vi−1

: delete v

∀j<i, new external field v’s neighbors: v1, v2, . . . , vd state: x

F 0

vj(y) = Avvj(x, y)Fvj(y)

F 0

vj

augmented by edge activity

PΩ(Xv = x) =

Fv(x) Qd

i=1

✓ Avvi(x,x)−P

z6=x(Avvi(x,x)−Avvi(x,z))PΩi v,x(Xvi=z)

◆ P

y2[q] Fv(y) Qd i=1

✓ Avvi(y,y)−P

z6=y(Avvi(y,y)−Avvi(y,z))PΩi v,y (Xvi=z)

a natural generalization of list-coloring:

slide-20
SLIDE 20

Recursion for general multi-spin system

multi-spin system

Ωi

v,x

v v1 vi vd

G

v v1 vd

G

n vi−1

: delete v

∀j<i, new external field v’s neighbors: v1, v2, . . . , vd state: x

F 0

vj(y) = Avvj(x, y)Fvj(y)

F 0

vj

augmented by edge activity

PΩ(Xv = x) =

Fv(x) Qd

i=1

✓ Avvi(x,x)−P

z6=x(Avvi(x,x)−Avvi(x,z))PΩi v,x(Xvi=z)

◆ P

y2[q] Fv(y) Qd i=1

✓ Avvi(y,y)−P

z6=y(Avvi(y,y)−Avvi(y,z))PΩi v,y (Xvi=z)

a natural generalization of list-coloring: for list-coloring:

PΩ(Xv = x) = Qd

i=1

⇣ 1 − PΩi

v,x(Xvi = x)

⌘ P

y∈Lv

Qd

i=1

⇣ 1 − PΩi

v,x(Xvi = y)

special case

slide-21
SLIDE 21

Correlation Decay

f(p) =

Fv(x) Qd

i=1(Avvi(x,x)−P z6=x(Avvi(x,x)−Avvi(x,z))pi,x,z)

P

y2[q] Fv(y) Qd i=1(Avvi(y,y)−P z6=y(Avvi(y,y)−Avvi(y,z))pi,y,z)

vector

p = (pi,y,z)1id; y,z2[q]; y6=z

pi,y,z

PΩ[Xv = x] = f(p)

pi,y,z = PΩi

v,y(Xvi = z)

where recursion tree

  • an exponential-time exact

algorithm

slide-22
SLIDE 22

Correlation Decay

f(p) =

Fv(x) Qd

i=1(Avvi(x,x)−P z6=x(Avvi(x,x)−Avvi(x,z))pi,x,z)

P

y2[q] Fv(y) Qd i=1(Avvi(y,y)−P z6=y(Avvi(y,y)−Avvi(y,z))pi,y,z)

vector

p = (pi,y,z)1id; y,z2[q]; y6=z

pi,y,z

PΩ[Xv = x] = f(p)

pi,y,z = PΩi

v,y(Xvi = z)

where recursion tree t

  • an exponential-time exact

algorithm

error: ≤1 error: ε

correlation decay: error at root ε = exp(-Ω(t))

  • truncation:
  • compute up-to level t
  • use arbitrary estimation at level t
slide-23
SLIDE 23

Correlation Decay

pi,y,z

t

error at leaf: ≤1 error at root: ε correlation decay:

error at root ε = exp(-Ω(t))

f(p) if running the recursion up-to level t

a sufficient condition: (stepwise decay) then due to the Mean Value Theorem

pi,y,z

f(p)

at any step

✏ ≤ X

i,y,z

  • @f(p)

@pi,y,z

  • ✏i,y,z ≤  · max

i,y,z ✏i,y,z

κ , X

i,y,z

  • ∂f(p)

∂pi,y,z

  • < 1

induction!

slide-24
SLIDE 24

Correlation Decay

pi,y,z

t

error at leaf: ≤1 error at root: ε correlation decay:

error at root ε = exp(-Ω(t))

f(p) if running the recursion up-to level t

a sufficient condition: (stepwise decay) then due to the Mean Value Theorem

pi,y,z

f(p)

at any step

✏ ≤ X

i,y,z

  • @f(p)

@pi,y,z

  • ✏i,y,z ≤  · max

i,y,z ✏i,y,z

κ , X

i,y,z

  • ∂f(p)

∂pi,y,z

  • < 1

induction!

decay at every step!

slide-25
SLIDE 25

Correlation Decay

pi,y,z

t

error at leaf: ≤1 error at root: ε correlation decay:

error at root ε = exp(-Ω(t))

f(p) if running the recursion up-to level t

a sufficient condition: (stepwise decay) then due to the Mean Value Theorem

pi,y,z

f(p)

at any step

✏ ≤ X

i,y,z

  • @f(p)

@pi,y,z

  • ✏i,y,z ≤  · max

i,y,z ✏i,y,z

κ , X

i,y,z

  • ∂f(p)

∂pi,y,z

  • < 1

induction! amortized behavior correlation decay?

decay at every step!

error ε t

stepwise decay amortized decay

slide-26
SLIDE 26

The Potential Method

pi,y,z

p = f(~ p) ✏i,y,z

error error ✏ error error

ξi,y,z

δ

δi,y,z

⇠ = g(~ ⇠) φ ξ = φ(p)

ξi,y,z = φ(pi,y,z)

  • riginal:

potential:

slide-27
SLIDE 27

The Potential Method

pi,y,z

p = f(~ p) ✏i,y,z

error error ✏ error error

ξi,y,z

δ

δi,y,z

⇠ = g(~ ⇠) φ ξ = φ(p)

ξi,y,z = φ(pi,y,z)

⇠ = g(~ ⇠) = (f(−1(⇠i,y,z), ∀i, y, z)))

  • riginal:

potential:

g

new recursion

by Mean Value Thm:

Φ(x) = d φ(x) d x

let

δ ≤ X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z)δi,y,z

slide-28
SLIDE 28

The Potential Method

pi,y,z

p = f(~ p) ✏i,y,z

error error ✏ error error

ξi,y,z

δ

δi,y,z

⇠ = g(~ ⇠) φ ξ = φ(p)

ξi,y,z = φ(pi,y,z)

⇠ = g(~ ⇠) = (f(−1(⇠i,y,z), ∀i, y, z)))

  • riginal:

potential:

g

new recursion

with good choice of potential function φ :

error ε t error δ t

φ

  • riginal

world potential world by Mean Value Thm:

Φ(x) = d φ(x) d x

let

δ ≤ X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z)δi,y,z

slide-29
SLIDE 29

The Potential Method

error ε t error δ t

φ

  • riginal

world potential world

δ ≤ X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z)δi,y,z

  • The potential method has been used for analyzing the

correlation decay in 2-spin systems (Restrepo-Shin-Tetali- Vigoda-Yang’11, Sinclair-Srivastava-Thurley’12, Li-Lu-Yin’12, Li-Lu-Yin’13, Sinclair-Srivastava-Yin’13).

  • This is the first time it is used for multi-spin systems.
slide-30
SLIDE 30

Amortized Correlation Decay

amortized decay condition:

  • at any step, we have
  • the values of and are bounded over domain

∃ a positive-valued function , s.t.

X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z) < 1

Φ(p)

1 Φ(p)

Φ(p)

slide-31
SLIDE 31

Amortized Correlation Decay

amortized decay condition:

  • at any step, we have
  • the values of and are bounded over domain

∃ a positive-valued function , s.t.

X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z) < 1

control the costs of translating initially from and finally back to the original world

Φ(p)

1 Φ(p)

Φ(p)

slide-32
SLIDE 32

Amortized Correlation Decay

amortized decay condition:

  • at any step, we have
  • the values of and are bounded over domain

∃ a positive-valued function , s.t.

X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z) < 1

control the costs of translating initially from and finally back to the original world

by induction: amortized decay condition exponential correlation decay

for the considered classes of spin systems Φ(p)

1 Φ(p)

Φ(p)

slide-33
SLIDE 33

Amortized Correlation Decay

amortized decay condition:

  • at any step, we have
  • the values of and are bounded over domain

∃ a positive-valued function , s.t.

X

i,y,z

  • ∂f(p)

∂pi,y,z

  • Φ(f(p))

Φ(pi,y,z) < 1

control the costs of translating initially from and finally back to the original world

by induction: amortized decay condition exponential correlation decay

FPTAS

efficient approximation

  • f marginal probability

for the considered classes of spin systems Φ(p)

1 Φ(p)

Φ(p)

slide-34
SLIDE 34

Establishing the Decay

for general multi-spin systems: with max-degree Δ choose with small enough η > 0

c = max

e∈E w,x,y,z∈[q]

Ae(x, y) Ae(w, z)

denoted

3∆(c∆ − 1) ≤ 1

amortized decay condition for Potts model (with inverse temperature β): directly translated to 3∆(e|β| − 1) ≤ 1

(by easy calculation) Φ(p) = 1 p + η

slide-35
SLIDE 35

Establishing the Decay

for general multi-spin systems: with max-degree Δ choose with small enough η > 0

c = max

e∈E w,x,y,z∈[q]

Ae(x, y) Ae(w, z)

denoted

3∆(c∆ − 1) ≤ 1

amortized decay condition for Potts model (with inverse temperature β): directly translated to 3∆(e|β| − 1) ≤ 1

(by easy calculation) Φ(p) = 1 p + η

* Other potential functions may further improve the constant factor (but may be harder to analyze).

slide-36
SLIDE 36

Establishing the Decay

for list-coloring: with max-degree Δ, each vertex v with color list Lv choose

  • bserving that for list-coloring satisfying the condition,

marginals are always bounded away from both 0 and 1 Φ(p) = 1 (1 − p)√p

for coloring: replacing | Lv | with q amortized decay condition

(by more involved calculation) ∀v, |Lv| ≥ α∆ + 1

α≈2.58071

slide-37
SLIDE 37

Establishing the Decay

for list-coloring: with max-degree Δ, each vertex v with color list Lv choose

  • bserving that for list-coloring satisfying the condition,

marginals are always bounded away from both 0 and 1 Φ(p) = 1 (1 − p)√p

for coloring: replacing | Lv | with q

* The potential functions are chosen in an ad hoc way.

amortized decay condition

(by more involved calculation) ∀v, |Lv| ≥ α∆ + 1

α≈2.58071

slide-38
SLIDE 38

Open Problem

  • Find a more systematic way for designing

good potential functions.

  • Further improve the bounds for correlation

decay and FPTAS for multi-spin systems.

  • For coloring: α=2 is a barrier for the

approach due to the overheads caused by total differentiation. Overcome this barrier.