The condensation phase transition in random graph coloring Victor - - PowerPoint PPT Presentation

the condensation phase transition in random graph coloring
SMART_READER_LITE
LIVE PREVIEW

The condensation phase transition in random graph coloring Victor - - PowerPoint PPT Presentation

The condensation phase transition in random graph coloring Victor Bapst Goethe University, Frankfurt Joint work with Amin Coja-Oghlan, Samuel Hetterich, Felicia Rassmann and Dan Vilenchik arXiv:1404.5513 Tuesday, May 6, 2014 Outline


slide-1
SLIDE 1

The condensation phase transition in random graph coloring

Victor Bapst

Goethe University, Frankfurt

Joint work with Amin Coja-Oghlan, Samuel Hetterich, Felicia Rassmann and Dan Vilenchik arXiv:1404.5513

Tuesday, May 6, 2014

slide-2
SLIDE 2

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

2 / 31

slide-3
SLIDE 3

Overview

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

3 / 31

slide-4
SLIDE 4

Overview The model

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

4 / 31

slide-5
SLIDE 5

Overview The model

Random graph coloring

  • Draw a random graph on N vertices by connecting any two vertices with

probability d/N at random.

  • Is this graph k-colorable ?
  • How many k-colorings can we find ? For a given graph: Z(G).

In general: either zero or exponentially many.

  • Taking the average over the choice of the graph and N → ∞:
  • Average number of colorings: [EZ(G)]1/N → k(1 − 1/k)d/2.
  • Typical number of colorings: E

h Z(G)1/Ni

?

→ Φk(d) =??.

5 / 31

slide-6
SLIDE 6

Overview The model

Phase transitions

  • Phase transition (informel): discontinuity in some “macroscopic”

quantity describing a problem. For instance:

  • the size of the largest connected component for Erd˝
  • s-R´

enyi random graphs, upon increasing the average degree,

  • the density when freezing water,
  • the derivative of the magnetization when heating a magnet.
  • Phase transition (here): non analyticity of Φk(d).

For instance it is conjectured that there exists dcol(k) such that:

  • for d < dcol(k), Φk(d) > 0, and limdրdcol(k) Φk(d) > 0.
  • for d > dcol(k), Φk(d) = 0.
  • Here we look at another phase transition that happens for d < dcol(k).

6 / 31

slide-7
SLIDE 7

Overview Clustering and condensation

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

7 / 31

slide-8
SLIDE 8

Overview Clustering and condensation

The physics picture

  • A powerful tool to study random optimization problems: the cavity

method.

  • Introduced by M´

ezard and Parisi in 2000.

  • General overview for random optimization problems: Krzakala, Montanari,

Ricci-Tersenghi, Semerjian, Zdeborov´ a in PNAS 2007.

  • Application to coloring: Krzakala, Pagnani, Weigt, Zdeborov´

a ...

  • Upon increasing d, solutions tend to group into clusters.

C(G, σ) = {colorings τ that can be reached from σ by altering at most N/(k log k) vertices at a time}

(Proofs: [Achlioptas - Coja-Oghlan 2008, Molloy 2012])

dcol(k) d

8 / 31

slide-9
SLIDE 9

Overview Clustering and condensation

The physics picture [Zdeborov´

a - Krzakala 2007]

Compare the cluster size with the total number of colorings. Φk(d) |C(G, σ)|1/N dcond(k) d Number of clusters: Φk(d) − |C(G, σ)|1/N. What happens for d > dcond(k) ?

9 / 31

slide-10
SLIDE 10

Overview Clustering and condensation

Interlude: a broader view of condensation.

  • A similar phenomenon appears when cooling too fast some liquids.
  • This is the famous Kauzmann paradox:
  • .

10 / 31

slide-11
SLIDE 11

Overview Clustering and condensation

The physics picture [Zdeborov´

a - Krzakala 2007]

Physics prediction: dcond(k) marks a phase transition:

  • for d < dcond(k), |C(G, σ)|1/N < Φk(d) = k(1 − 1/k)d/2,
  • for d > dcond(k), |C(G, σ)|1/N = Φk(d) < k(1 − 1/k)d/2,
  • the second derivative of Φk(d) is discontinuous at dcond(k).

k(1 − 1/k)d/2 Φk(d) |C(G, σ)|1/N dcond(k) dcol(k) d

11 / 31

slide-12
SLIDE 12

Overview Clustering and condensation

The physics picture [Zdeborov´

a - Krzakala 2007]

Upon increasing d, the geometry of the set of solutions dramatically changes. C(G, σ) = {colorings τ that can be reached from σ by altering at most N/(k log k) vertices at a time} dcond(k) dcol(k) d Condensation: when the number of clusters becomes sub-exponential. ⇔ when the cluster size |C(G, σ)|1/N equals Φk(d).

12 / 31

slide-13
SLIDE 13

Overview Rigorous results

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

13 / 31

slide-14
SLIDE 14

Overview Rigorous results

A first transition: the satisfiability transition

The number of colorings is easily understood when d < 1. Φk(d) d 1 k(1 − 1/k)d/2

14 / 31

slide-15
SLIDE 15

Overview Rigorous results

Upper bounds

Upper bound on the typical number of colorings: first moment method. Can be improved from the naive result [Coja-Oghlan 2013]. Φk(d) d 1 dk − 1 k(1 − 1/k)d/2 dk = (2k − 1) ln k

15 / 31

slide-16
SLIDE 16

Overview Rigorous results

Lower bounds

Lower bound on the typical number of colorings: second moment method [Achlioptas - Naor 2005, Coja-Oghlan - Vilenchik 2010]. Φk(d) d 1 dk − 2 ln 2 dk − 1 k(1 − 1/k)d/2 dk = (2k − 1) ln k

16 / 31

slide-17
SLIDE 17

Overview Rigorous results

The condensation transition

Theorem (1/2): for k large enough there exists dcond(k) such that: – there is a phase transition at dcond(k), – for d < dcond(k) : Φk(d) = k(1 − 1/k)d/2, – for d > dcond(k) : Φk(d) < k(1 − 1/k)d/2 (or does not exist). Φk(d) d dk − 2 ln 2 dk − 1 k(1 − 1/k)d/2 dk = (2k − 1) ln k dcond(k)

17 / 31

slide-18
SLIDE 18

Overview Rigorous results

The condensation transition

Theorem (2/2): dcond(k) is given by the formula predicted by the cavity method [Zdeborov´ a - Krzakala 2007]. That is:

  • Ω = {probability distributions on {1, . . . , k}},
  • f :

γ≥0 Ωγ → Ω,

f (µ1, . . . , µγ)(i) = γ

j=1 1 − µj(i)

  • h∈[k]

γ

j=1 1 − µj(h).

  • P = {probability distributions on Ω},
  • Fk,d : P → P

Fk,d(π) =

X

γ=0

γd exp(−d) γ! · Zγ(π) Z

Ωγ

"

k

X

h=1 γ

Y

j=1

1 − µj(h) # · δf [µ1,...,µγ ]

γ

O

j=1

dπ(µj).

where Zγ(π) = k

h=1

  • 1 −
  • Ω µ(h)dπ(µ)

γ

  • Σk,d : P → R (“Complexity”). Σk,d(π) = ...
  • dcond(k) is the unique solution of Σk,d(π⋆

k,d) = 0 in [dk − 2, dk], where

π⋆

k,d is a particular fixed point of Fk,d.

18 / 31

slide-19
SLIDE 19

Overview Rigorous results

Conjectures: the satisfiability transition

Conjecture 1: Φk(d) exists for all d. ⇒ There exist a colorability threshold dcol(k). ⇒ There is a phase transition at dcol(k). Conjecture 2: dcond(k) < dcol(k). There are exactly two phase transitions. Φk(d) d dk − 2 ln 2 dk − 1 k(1 − 1/k)d/2 dcond(k) dk = (2k − 1) ln k dcol(k)

19 / 31

slide-20
SLIDE 20

Outline of the proof

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

20 / 31

slide-21
SLIDE 21

Outline of the proof Using the planted model

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

21 / 31

slide-22
SLIDE 22

Outline of the proof Using the planted model

The planted model

  • The condensation corresponds to the point where the cluster size

|C(G, σ)|1/N equals (w.h.p.) k(1 − 1/k)d/2.

  • However it is hard to compute the cluster size:

given a random graph, how do we even find a coloring ?

  • Planting: first pick a configuration σ⋆ at random.

Then generate a graph G ⋆ by adding edges independently and uniformly at random such that: – G ⋆ has as many vertices as G (in average), – σ⋆ is a coloring of this graph. Generating the pair (G ⋆, σ⋆) is easy.

  • The cluster size |C(G ⋆, σ⋆)|1/N is also easier to compute.

22 / 31

slide-23
SLIDE 23

Outline of the proof Using the planted model

Condensation and clusters sizes

k(1 − 1/k)d/2 Φk(d) |C(G, σ)|1/N |C(G ⋆, σ⋆)|1/N dcond(k) d

  • Physics intuition: [Krzakala - Zdeborov´

a 2009] – if d < dcond(k), |C(G, σ)|1/N = |C(G ⋆, σ⋆)|1/N < Φk(d) = k(1 − 1/k)d/2, – if d > dcond(k), |C(G, σ)|1/N = Φk(d) < k(1 − 1/k)d/2 < |C(G ⋆, σ⋆)|1/N.

23 / 31

slide-24
SLIDE 24

Outline of the proof Using the planted model

Condensation and clusters sizes

k(1 − 1/k)d/2 |C(G ⋆, σ⋆)|1/N dcond(k) d

  • We use the following result: ∀ǫ > 0 [Coja-Oghlan - Vilenchik 2010]

– if |C(G ⋆, σ⋆)|1/N < k(1 − 1/k)d/2 − ǫ, then d < dcond(k), – if |C(G ⋆, σ⋆)|1/N > k(1 − 1/k)d/2 + ǫ, then d > dcond(k).

  • Therefore it is enough to understand the cluster size in the planted model.

24 / 31

slide-25
SLIDE 25

Outline of the proof Identifying the frozen vertices

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

25 / 31

slide-26
SLIDE 26

Outline of the proof Identifying the frozen vertices

Frozen vertices

  • We need to compute |C(G ⋆, σ⋆)|1/N.

Remember that we defined C(G ⋆, σ⋆) = {colorings τ that can be reached from σ⋆ by altering at most N/(k log k) vertices at a time}.

  • Close to dcond(k) most of the vertices are frozen : they take the same

value for all τ ∈ C(G ⋆, σ⋆). Most of : all but a fraction 1/k.

  • We need to identify the frozen vertices.

26 / 31

slide-27
SLIDE 27

Outline of the proof Identifying the frozen vertices

Frozen vertices

  • Close to dcond(k) most of the vertices are frozen: they take the same

value for all τ ∈ C(G ⋆, σ⋆).

  • Intuition for that: a vertex v typically has many neighbors of each color.

v

27 / 31

slide-28
SLIDE 28

Outline of the proof Identifying the frozen vertices

Frozen vertices

  • Close to dcond(k) most of the vertices are frozen: they take the same

value for all τ ∈ C(G ⋆, σ⋆).

  • Intuition for that: a vertex v typically has many neighbors of each color.

v

  • If in addition to that, most of the neighbors of v are frozen, then so is v.
  • Technically: “Warning Propagation” + existence a priori of a large set of

frozen vertices + convergence of local neighborhoods to trees.

27 / 31

slide-29
SLIDE 29

Outline of the proof The remaining: a problem over finite trees

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

28 / 31

slide-30
SLIDE 30

Outline of the proof The remaining: a problem over finite trees

  • Outcome of the previous analysis: coloring a graph where the color of

some vertices is fixed v w

29 / 31

slide-31
SLIDE 31

Outline of the proof The remaining: a problem over finite trees

  • Outcome of the previous analysis: coloring a graph where each vertex v

can take colors in Lv ⊂ [k] fixed. v w

29 / 31

slide-32
SLIDE 32

Outline of the proof The remaining: a problem over finite trees

  • Outcome of the previous analysis: coloring a graph where each vertex v

can take colors in Lv ⊂ [k] fixed. v w

29 / 31

slide-33
SLIDE 33

Outline of the proof The remaining: a problem over finite trees

  • Outcome of the previous analysis: coloring a graph where each vertex v

can take colors in Lv ⊂ [k] fixed. v w

  • If |Lv| = 1, we can remove v.

29 / 31

slide-34
SLIDE 34

Outline of the proof The remaining: a problem over finite trees

  • Outcome of the previous analysis: coloring a graph where each vertex v

can take colors in Lv ⊂ [k] fixed. v w

  • If |Lv| = 1, we can remove v.
  • If Lv ∩ Lw = {∅}, we can disconnect v and w

→ problem over finite trees: easy.

29 / 31

slide-35
SLIDE 35

Conclusions

Outline

Overview The model Clustering and condensation Rigorous results Outline of the proof Using the planted model Identifying the frozen vertices The remaining: a problem over finite trees Conclusions

30 / 31

slide-36
SLIDE 36

Conclusions

Conclusions

  • Rigorous proof of the existence of the condensation transition for

(Erd˝

  • s-R´

enyi) random graphs coloring. The transition point is a number (does not depend on N).

  • Confirms the prediction of the cavity method.

Condensation for a model with fluctuating degrees.

  • Some directions for future work: what about
  • the colorability threshold ?
  • finite temperature ?
  • models where the non-condensed phase is non-trivial ?

31 / 31

slide-37
SLIDE 37

Conclusions

Warning propagation

Consider the following process:

  • associate to each pair of vertices (v, w) connected by an edge a sequence

µv→w(i ∈ [k], t ≥ 0) ∈ {0, 1} defined by:

  • µv→w(i, t = 0) = 1 iff v has color i under σ⋆,

µa→v µb→v µc→v µv→w a b c v w

32 / 31

slide-38
SLIDE 38

Conclusions

Warning propagation

Consider the following process:

  • associate to each pair of vertices (v, w) connected by an edge a sequence

µv→w(i ∈ [k], t ≥ 0) ∈ {0, 1} defined by:

  • µv→w(i, t = 0) = 1 iff v has color i under σ⋆,

µa→v µb→v µc→v µv→w a b c v w 1 1 1 1 t = 0

33 / 31

slide-39
SLIDE 39

Conclusions

Warning propagation

Consider the following process:

  • associate to each pair of vertices (v, w) connected by an edge a sequence

µv→w(i ∈ [k], t ≥ 0) ∈ {0, 1} defined by:

  • µv→w(i, t = 0) = 1 iff v has color i under σ⋆,
  • µv→w(i, t + 1) = 1 iff for all j = i, there is u ∈ ∂v \ {w} such that

µu→v(j, t) = 1 (“u warns v that it cannot take color j”).

µa→v µb→v µc→v µv→w a b c v w 1 1 1 t = 1

34 / 31

slide-40
SLIDE 40

Conclusions

Warning propagation

Consider the following process:

  • associate to each pair of vertices (v, w) connected by an edge a sequence

µv→w(i ∈ [k], t ≥ 0) ∈ {0, 1} defined by:

  • µv→w(i, t = 0) = 1 iff v has color i under σ⋆,
  • µv→w(i, t + 1) = 1 iff for all j = i, there is u ∈ ∂v \ {w} such that

µu→v(j, t) = 1 (“u warns v that it cannot take color j”).

µa→v µb→v µc→v µv→w a b c v w 1 1 t = 2

35 / 31

slide-41
SLIDE 41

Conclusions

Warning propagation

  • The process is decreasing and converges.
  • Define L(v) = {i ∈ [k], ∀u∈ ∂v, µu→v(i, t = ∞) = 0}

(“colors that v is allowed to take”).

  • Let Z(G ⋆, σ⋆) be the number of colorings of G ⋆ such that σ(v) ∈ L(v).

Then |C(G ⋆, σ⋆)|1/N = Z(G ⋆, σ⋆)1/N w.h.p.

36 / 31

slide-42
SLIDE 42

Conclusions

Warning propagation (2d version)

Consider the following process:

  • associate to each pair of vertices (v, w) connected by an edge a sequence

µv→w(i ∈ [k], t ≥ 0) ∈ {0, 1} defined by:

  • µv→w(i, t = 0) = 1 iff v has color i under σ⋆ and v is in the core,
  • µv→w(i, t + 1) = 1 iff for all j = i, there is u ∈ ∂v \ {w} such that

µu→v(j, t) = 1 (“u warns v that it cannot take color j”).

  • The process is increasing and converges.

Define L2(v) = {i ∈ [k], ∀u∈ ∂v, µu→v(i, t = ∞) = 0} (“colors that v is allowed to take”).

  • Let Z2(G ⋆, σ⋆) be the number of colorings of G ⋆ such that σ(v) ∈ L2(v).

Then Z2(G ⋆, σ⋆) is an upper bound on the cluster size (w.h.p). W.h.p. ln Z1(G ⋆, σ⋆) = ln Z2(G ⋆, σ⋆) + o(N).

37 / 31