Improved FPTAS
for
Multi-spin Systems
presented by: Sangxia Huang KTH Yitong Yin Nanjing University Pinyan Lu Microsoft Research Asia
Improved FPTAS for Multi - spin Systems Pinyan Lu Yitong Yin - - PowerPoint PPT Presentation
Improved FPTAS for Multi - spin Systems Pinyan Lu Yitong Yin Microsoft Research Asia Nanjing University presented by: Sangxia Huang KTH Colorings instance : undirected G(V,E) with max-degree q colors: goal : counting the number of
presented by: Sangxia Huang KTH Yitong Yin Nanjing University Pinyan Lu Microsoft Research Asia
instance: undirected G(V,E) with max-degree ≤Δ goal: counting the number of proper q-colorings for G q colors:
instance: undirected G(V,E) with max-degree ≤Δ goal: counting the number of proper q-colorings for G q colors:
approximately counting the number of proper q-colorings for G when q ≥αΔ+β
equivalent to sampling an almost uniform random q-coloring
Ae : [q] × [q] → R≥0
a symmetric nonnegative q×q matrix
Fv : [q] → R≥0
a nonnegative q-vector
instance: Z = X
x∈[q]V
Y
e=uv∈E
Ae(xu, xv) Y
v∈V
Fv(xv) goal: computing the partition function:
Aa Ab Ac Ad Ae Af Fw Fu Fv Fx Fy
Z = X
x∈[q]V
Y
e=uv∈E
Ae(xu, xv) Y
v∈V
Fv(xv)
Aa Ab Ac Ad Ae Af Fw Fu Fv Fx Fy enumerate all configurations binary constraints unary constraints
Ae : [q] × [q] → R≥0 Fv : [q] → R≥0 partition function: count the # of solutions to an CSP
Z = X
x∈[q]V
Y
e=uv∈E
Ae(xu, xv) Y
v∈V
Fv(xv)
Aa Ab Ac Ad Ae Af Fw Fu Fv Fx Fy enumerate all configurations binary constraints unary constraints
A = ...
F = 1 1 . . . 1
Ae : [q] × [q] → R≥0 Fv : [q] → R≥0
partition function: count the # of solutions to an CSP
A = ...
1 1
F = 1 1 . . . 1
A = eβ eβ ... eβ
arbitrary F when β=-∞ and F=(1,1,... ,1), it is coloring
A = ...
1 1
F = 1 1 . . . 1
(the Glauber dynamics) over colorings
correlation decay (spatial mixing) property
(Goldberg-Martin-Paterson’05, Gamarnik-Katz-Misra’12)
sufficient conditions for FPTAS for classes of spin systems
this paper: deterministic FPTAS for α≈2.58071
sufficient conditions for FPTAS for classes of spin systems
(c∆ − c−∆)∆q∆ < 1
in terms of it implies:
3∆(e|β| − 1) ≤ 1 c = max
e∈E w,x,y,z∈[q]
Ae(x, y) Ae(w, z)
this paper:
3∆(c∆ − 1) ≤ 1
an exponential improvement!
sufficient conditions for FPTAS for classes of spin systems
(c∆ − c−∆)∆q∆ < 1
in terms of it implies:
3∆(e|β| − 1) ≤ 1 c = max
e∈E w,x,y,z∈[q]
Ae(x, y) Ae(w, z)
this paper:
3∆(c∆ − 1) ≤ 1
bound for in [Galanis-Stefankovic-Vigoda’13] eβ < 1 − q
∆
β < 0 |β| = O 1
∆
improvement!
reducing to the computing of marginal probability
Z = X
x∈[q]V
Y
e=uv∈E
Ae(xu, xv) Y
v∈V
Fv(xv) for any configuration x ∈ [q]V Gibbs measure: marginal probability:
P[X = x] = Q
e=uv∈E Ae(xu, xv) Q v∈V Fv(xv)
Z P[Xv = xv]
reducing to the computing of marginal probability
Z = X
x∈[q]V
Y
e=uv∈E
Ae(xu, xv) Y
v∈V
Fv(xv) for any configuration x ∈ [q]V Gibbs measure: marginal probability:
P[X = x] = Q
e=uv∈E Ae(xu, xv) Q v∈V Fv(xv)
Z P[Xv = xv]
for self-reducible class of spin-systems:
efficient approximation
(with additive error)
FPTAS for Z
Jerrum-Valiant-Vazirani’86
instance: undirected G(V,E)
each vertex v associated with a list Lv of colors allowed to use on v
{ } { } { } { } { }
coloring ⊂ list-coloring ⊂ Potts ⊂ multi-spin
| {z }
self-reducible
for
multi-spin system Potts model list-coloring ( )
approximate the marginal (with additive error)
P[Xv = x]
instance: undirected G(V,E)
each vertex v associated with a list Lv of colors allowed to use on v
{ } { } { } { } { }
coloring ⊂ list-coloring ⊂ Potts ⊂ multi-spin
| {z }
self-reducible
for
multi-spin system Potts model list-coloring ( )
approximate the marginal (with additive error)
P[Xv = x]
instance: undirected G(V,E)
each vertex v associated with a list Lv of colors allowed to use on v
{ } { } { } { } { }
coloring ⊂ list-coloring ⊂ Potts ⊂ multi-spin
| {z }
self-reducible classic way: random walk new way: correlation decay
PΩ(Xv = x) = Qd
i=1 PΩi
v,x(Xvi 6= x)
P
y∈Lv
Qd
i=1 PΩi
v,x(Xvi 6= y)
= Qd
i=1
⇣ 1 PΩi
v,x(Xvi = x)
⌘ P
y∈Lv
Qd
i=1
⇣ 1 PΩi
v,x(Xvi = y)
⌘ Ω
list-coloring instance
Ωi
v,x
v v1 vi vd
v v1 vd
n vi−1
∀j<i, delete x
from list
Lvj
Lvj − {x}
v’s neighbors: v1, v2, . . . , vd color: x
Gamarnik-Katz’07:
PΩ(Xv = x) = Qd
i=1 PΩi
v,x(Xvi 6= x)
P
y∈Lv
Qd
i=1 PΩi
v,x(Xvi 6= y)
= Qd
i=1
⇣ 1 PΩi
v,x(Xvi = x)
⌘ P
y∈Lv
Qd
i=1
⇣ 1 PΩi
v,x(Xvi = y)
⌘ Ω
list-coloring instance
Ωi
v,x
v v1 vi vd
v v1 vd
n vi−1
∀j<i, delete x
from list
Lvj
Lvj − {x}
v’s neighbors: v1, v2, . . . , vd color: x
Gamarnik-Katz’07:
telescopic products
Ω
multi-spin system
Ωi
v,x
v v1 vi vd
v v1 vd
n vi−1
∀j<i, new external field v’s neighbors: v1, v2, . . . , vd state: x
F 0
vj(y) = Avvj(x, y)Fvj(y)
F 0
vj
augmented by edge activity
PΩ(Xv = x) =
Fv(x) Qd
i=1
✓ Avvi(x,x)−P
z6=x(Avvi(x,x)−Avvi(x,z))PΩi v,x(Xvi=z)
◆ P
y2[q] Fv(y) Qd i=1
✓ Avvi(y,y)−P
z6=y(Avvi(y,y)−Avvi(y,z))PΩi v,y (Xvi=z)
◆
a natural generalization of list-coloring:
Ω
multi-spin system
Ωi
v,x
v v1 vi vd
v v1 vd
n vi−1
∀j<i, new external field v’s neighbors: v1, v2, . . . , vd state: x
F 0
vj(y) = Avvj(x, y)Fvj(y)
F 0
vj
augmented by edge activity
PΩ(Xv = x) =
Fv(x) Qd
i=1
✓ Avvi(x,x)−P
z6=x(Avvi(x,x)−Avvi(x,z))PΩi v,x(Xvi=z)
◆ P
y2[q] Fv(y) Qd i=1
✓ Avvi(y,y)−P
z6=y(Avvi(y,y)−Avvi(y,z))PΩi v,y (Xvi=z)
◆
a natural generalization of list-coloring: for list-coloring:
PΩ(Xv = x) = Qd
i=1
⇣ 1 − PΩi
v,x(Xvi = x)
⌘ P
y∈Lv
Qd
i=1
⇣ 1 − PΩi
v,x(Xvi = y)
⌘
special case
f(p) =
Fv(x) Qd
i=1(Avvi(x,x)−P z6=x(Avvi(x,x)−Avvi(x,z))pi,x,z)
P
y2[q] Fv(y) Qd i=1(Avvi(y,y)−P z6=y(Avvi(y,y)−Avvi(y,z))pi,y,z)
vector
p = (pi,y,z)1id; y,z2[q]; y6=z
pi,y,z
PΩ[Xv = x] = f(p)
pi,y,z = PΩi
v,y(Xvi = z)
where recursion tree
algorithm
f(p) =
Fv(x) Qd
i=1(Avvi(x,x)−P z6=x(Avvi(x,x)−Avvi(x,z))pi,x,z)
P
y2[q] Fv(y) Qd i=1(Avvi(y,y)−P z6=y(Avvi(y,y)−Avvi(y,z))pi,y,z)
vector
p = (pi,y,z)1id; y,z2[q]; y6=z
pi,y,z
PΩ[Xv = x] = f(p)
pi,y,z = PΩi
v,y(Xvi = z)
where recursion tree t
algorithm
error: ≤1 error: ε
correlation decay: error at root ε = exp(-Ω(t))
pi,y,z
t
error at leaf: ≤1 error at root: ε correlation decay:
error at root ε = exp(-Ω(t))
f(p) if running the recursion up-to level t
a sufficient condition: (stepwise decay) then due to the Mean Value Theorem
pi,y,z
f(p)
at any step
✏ ≤ X
i,y,z
@pi,y,z
i,y,z ✏i,y,z
κ , X
i,y,z
∂pi,y,z
induction!
pi,y,z
t
error at leaf: ≤1 error at root: ε correlation decay:
error at root ε = exp(-Ω(t))
f(p) if running the recursion up-to level t
a sufficient condition: (stepwise decay) then due to the Mean Value Theorem
pi,y,z
f(p)
at any step
✏ ≤ X
i,y,z
@pi,y,z
i,y,z ✏i,y,z
κ , X
i,y,z
∂pi,y,z
induction!
decay at every step!
pi,y,z
t
error at leaf: ≤1 error at root: ε correlation decay:
error at root ε = exp(-Ω(t))
f(p) if running the recursion up-to level t
a sufficient condition: (stepwise decay) then due to the Mean Value Theorem
pi,y,z
f(p)
at any step
✏ ≤ X
i,y,z
@pi,y,z
i,y,z ✏i,y,z
κ , X
i,y,z
∂pi,y,z
induction! amortized behavior correlation decay?
decay at every step!
error ε t
stepwise decay amortized decay
pi,y,z
p = f(~ p) ✏i,y,z
error error ✏ error error
ξi,y,z
δ
δi,y,z
⇠ = g(~ ⇠) φ ξ = φ(p)
ξi,y,z = φ(pi,y,z)
potential:
pi,y,z
p = f(~ p) ✏i,y,z
error error ✏ error error
ξi,y,z
δ
δi,y,z
⇠ = g(~ ⇠) φ ξ = φ(p)
ξi,y,z = φ(pi,y,z)
⇠ = g(~ ⇠) = (f(−1(⇠i,y,z), ∀i, y, z)))
potential:
g
new recursion
by Mean Value Thm:
Φ(x) = d φ(x) d x
let
δ ≤ X
i,y,z
∂pi,y,z
Φ(pi,y,z)δi,y,z
pi,y,z
p = f(~ p) ✏i,y,z
error error ✏ error error
ξi,y,z
δ
δi,y,z
⇠ = g(~ ⇠) φ ξ = φ(p)
ξi,y,z = φ(pi,y,z)
⇠ = g(~ ⇠) = (f(−1(⇠i,y,z), ∀i, y, z)))
potential:
g
new recursion
with good choice of potential function φ :
error ε t error δ t
φ
world potential world by Mean Value Thm:
Φ(x) = d φ(x) d x
let
δ ≤ X
i,y,z
∂pi,y,z
Φ(pi,y,z)δi,y,z
error ε t error δ t
φ
world potential world
δ ≤ X
i,y,z
∂pi,y,z
Φ(pi,y,z)δi,y,z
correlation decay in 2-spin systems (Restrepo-Shin-Tetali- Vigoda-Yang’11, Sinclair-Srivastava-Thurley’12, Li-Lu-Yin’12, Li-Lu-Yin’13, Sinclair-Srivastava-Yin’13).
∃ a positive-valued function , s.t.
X
i,y,z
∂pi,y,z
Φ(pi,y,z) < 1
Φ(p)
1 Φ(p)
Φ(p)
∃ a positive-valued function , s.t.
X
i,y,z
∂pi,y,z
Φ(pi,y,z) < 1
control the costs of translating initially from and finally back to the original world
Φ(p)
1 Φ(p)
Φ(p)
∃ a positive-valued function , s.t.
X
i,y,z
∂pi,y,z
Φ(pi,y,z) < 1
control the costs of translating initially from and finally back to the original world
by induction: amortized decay condition exponential correlation decay
for the considered classes of spin systems Φ(p)
1 Φ(p)
Φ(p)
∃ a positive-valued function , s.t.
X
i,y,z
∂pi,y,z
Φ(pi,y,z) < 1
control the costs of translating initially from and finally back to the original world
by induction: amortized decay condition exponential correlation decay
efficient approximation
for the considered classes of spin systems Φ(p)
1 Φ(p)
Φ(p)
for general multi-spin systems: with max-degree Δ choose with small enough η > 0
c = max
e∈E w,x,y,z∈[q]
Ae(x, y) Ae(w, z)
denoted
3∆(c∆ − 1) ≤ 1
amortized decay condition for Potts model (with inverse temperature β): directly translated to 3∆(e|β| − 1) ≤ 1
(by easy calculation) Φ(p) = 1 p + η
for general multi-spin systems: with max-degree Δ choose with small enough η > 0
c = max
e∈E w,x,y,z∈[q]
Ae(x, y) Ae(w, z)
denoted
3∆(c∆ − 1) ≤ 1
amortized decay condition for Potts model (with inverse temperature β): directly translated to 3∆(e|β| − 1) ≤ 1
(by easy calculation) Φ(p) = 1 p + η
* Other potential functions may further improve the constant factor (but may be harder to analyze).
for list-coloring: with max-degree Δ, each vertex v with color list Lv choose
marginals are always bounded away from both 0 and 1 Φ(p) = 1 (1 − p)√p
for coloring: replacing | Lv | with q amortized decay condition
(by more involved calculation) ∀v, |Lv| ≥ α∆ + 1
α≈2.58071
for list-coloring: with max-degree Δ, each vertex v with color list Lv choose
marginals are always bounded away from both 0 and 1 Φ(p) = 1 (1 − p)√p
for coloring: replacing | Lv | with q
* The potential functions are chosen in an ad hoc way.
amortized decay condition
(by more involved calculation) ∀v, |Lv| ≥ α∆ + 1
α≈2.58071