Directed Random Graphs with Given Degree Distributions Mariana - - PowerPoint PPT Presentation

directed random graphs with given degree distributions
SMART_READER_LITE
LIVE PREVIEW

Directed Random Graphs with Given Degree Distributions Mariana - - PowerPoint PPT Presentation

Directed Random Graphs with Given Degree Distributions Mariana Olvera-Cravioto Columbia University molvera@ieor.columbia.edu Joint work with Ningyuan Chen July 23th, 2012 ECT, Trento, Italy Directed Random Graphs with Given Degree


slide-1
SLIDE 1

Directed Random Graphs with Given Degree Distributions

Mariana Olvera-Cravioto

Columbia University molvera@ieor.columbia.edu

Joint work with Ningyuan Chen July 23th, 2012

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 1/25

slide-2
SLIDE 2

The motivating example: WWW

Opte project. Part of the MoMA permanent collection ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 2/25

slide-3
SLIDE 3

The World Wide Web

◮ WWW seen as a directed graph (webpages = nodes, links = edges). ◮ Empirical observations:

fraction pages > k in-links ∝ k−α, α = 1.1 fraction pages > k out-links ∝ k−β, β = 1.72

◮ We want a directed random graph model that matches the degree

distributions.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 3/25

slide-4
SLIDE 4

Degree distributions

◮ Directed graph on n nodes V = {v1, . . . , vn}. ◮ In-degree and out-degree:

◮ mi = in-degree of node vi = number of edges pointing to vi. ◮ di = out-degree of node vi = number of edges pointing from vi.

◮ (m, d) = ({mi}, {di}) is called a bi-degree-sequence. ◮ Target distributions:

F = (fk : k = 0, 1, 2, . . . ), and G = (gk : k = 0, 1, 2, . . . ).

◮ We want the bi-degree-sequence to satisfy:

1 n

n

  • i=1

1(mi = k) ≈ fk and 1 n

n

  • i=1

1(di = k) ≈ gk.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 4/25

slide-5
SLIDE 5

Simple graphs

◮ Definition: We say that a directed graph is simple if it has no self-loops

and at most one edge in each direction between any two nodes.

◮ Definition: We say that (m, d) is graphical if there exists a simple

directed graph having (m, d) as its bi-degree-sequence.

◮ Goal: Choose a graph uniformly at random from all simple graphs having

bi-degree-sequence (m, d), where (m, d) has approximately the target distributions F and G.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 5/25

slide-6
SLIDE 6

Simple graphs

◮ Definition: We say that a directed graph is simple if it has no self-loops

and at most one edge in each direction between any two nodes.

◮ Definition: We say that (m, d) is graphical if there exists a simple

directed graph having (m, d) as its bi-degree-sequence.

◮ Goal: Choose a graph uniformly at random from all simple graphs having

bi-degree-sequence (m, d), where (m, d) has approximately the target distributions F and G.

◮ Two problems:

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 5/25

slide-7
SLIDE 7

Simple graphs

◮ Definition: We say that a directed graph is simple if it has no self-loops

and at most one edge in each direction between any two nodes.

◮ Definition: We say that (m, d) is graphical if there exists a simple

directed graph having (m, d) as its bi-degree-sequence.

◮ Goal: Choose a graph uniformly at random from all simple graphs having

bi-degree-sequence (m, d), where (m, d) has approximately the target distributions F and G.

◮ Two problems:

◮ Construct an appropriate bi-degree-sequence that with high probability will

be graphical.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 5/25

slide-8
SLIDE 8

Simple graphs

◮ Definition: We say that a directed graph is simple if it has no self-loops

and at most one edge in each direction between any two nodes.

◮ Definition: We say that (m, d) is graphical if there exists a simple

directed graph having (m, d) as its bi-degree-sequence.

◮ Goal: Choose a graph uniformly at random from all simple graphs having

bi-degree-sequence (m, d), where (m, d) has approximately the target distributions F and G.

◮ Two problems:

◮ Construct an appropriate bi-degree-sequence that with high probability will

be graphical.

◮ Choose uniformly at random a simple graph from such bi-degree-sequence. ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 5/25

slide-9
SLIDE 9

The configuration model (Wormald ’78, Bollobas ’80)

◮ For undirected graphs, given a degree sequence d = (d1, . . . , dn):

◮ assign to each node vi a number di of stubs or half-edges; ◮ for the first half-edge of node v1 choose uniformly at random from all

  • ther half-edges, and if the selected half-edge belongs to, say, node vj,

draw an edge between node v1 and vj;

◮ proceed in the same way for all remaining unpaired half-edges, i.e., choose

uniformly from the set of unpaired half-edges and draw an edge between the current node and the node to which the selected half-edge belongs.

◮ The result is a multigraph (e.g., with self-loops and multiple edges) on

nodes {v1, . . . , vn}.

◮ If we discard any realization that is not simple, we obtain a uniformly

chosen simple graph.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 6/25

slide-10
SLIDE 10

The directed configuration model

◮ For directed graphs, given a bi-degree-sequence (m, d):

◮ assign to each node vi a number mi of inbound stubs and a number di of

  • utbound stubs;

◮ pair outbound stubs to inbound stubs to form directed edges by matching

to each inbound stub an outbound stub chosen uniformly at random from the set of unpaired outbound stubs.

◮ The result is again a multigraph, but if we discard realizations that have

self-loops or multiple edges we obtain a uniformly chosen simple graph.

◮ Questions:

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 7/25

slide-11
SLIDE 11

The directed configuration model

◮ For directed graphs, given a bi-degree-sequence (m, d):

◮ assign to each node vi a number mi of inbound stubs and a number di of

  • utbound stubs;

◮ pair outbound stubs to inbound stubs to form directed edges by matching

to each inbound stub an outbound stub chosen uniformly at random from the set of unpaired outbound stubs.

◮ The result is again a multigraph, but if we discard realizations that have

self-loops or multiple edges we obtain a uniformly chosen simple graph.

◮ Questions:

◮ What is the probability of the resulting graph being simple? ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 7/25

slide-12
SLIDE 12

The directed configuration model

◮ For directed graphs, given a bi-degree-sequence (m, d):

◮ assign to each node vi a number mi of inbound stubs and a number di of

  • utbound stubs;

◮ pair outbound stubs to inbound stubs to form directed edges by matching

to each inbound stub an outbound stub chosen uniformly at random from the set of unpaired outbound stubs.

◮ The result is again a multigraph, but if we discard realizations that have

self-loops or multiple edges we obtain a uniformly chosen simple graph.

◮ Questions:

◮ What is the probability of the resulting graph being simple? ◮ Under what conditions is it bounded away from zero as n → ∞? ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 7/25

slide-13
SLIDE 13

Probability of graph being simple

◮ For the undirected configuration model it is known that if d satisfies

certain regularity conditions, the number of self-loops, Sn, and the number of multiple edges, Mn, satisfy (Sn, Mn) ⇒ (S, M) n → ∞, where S and M are independent Poisson r.v.s. (Janson ’09, Van der Hofstad ’08-’12).

◮ Then,

lim

n→∞ P(graph is simple) = P(S = 0, M = 0) > 0. ◮ The same should be true for the directed version.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 8/25

slide-14
SLIDE 14

Regularity conditions

Given {(mn, dn)}n∈N satisfying n

i=1 mni = n i=1 dni for all n, let

P((N [n], D[n]) = (i, j)) = 1 n

n

  • k=1

1(mnk = i, dnk = j).

  • 1. Weak convergence. For some γ, ξ with E[γ] = E[ξ] > 0,

(N [n], D[n]) ⇒ (γ, ξ), n → ∞.

  • 2. Convergence of the first moments.

lim

n→∞ E[N [n]] = E[γ]

and lim

n→∞ E[D[n]] = E[ξ].

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 9/25

slide-15
SLIDE 15

Regularity conditions... continued

  • 3. Convergence of the covariance.

lim

n→∞ E[N [n]D[n]] = E[γξ].

  • 4. Convergence of the second moments.

lim

n→∞ E[(N [n])2] = E[γ2]

and lim

n→∞ E[(D[n])2] = E[ξ2]. ◮ Note: (N [n], D[n]) denote the in-degree and out-degree of a randomly

chosen node.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 10/25

slide-16
SLIDE 16

Poisson Limit for Self-Loops and Multiple Edges

◮ Proposition: (Chen, O-C ’12) If {(mn, dn)}n∈N satisfies the regularity

conditions with E[γ] = E[ξ] = µ > 0, then (Sn, Mn) ⇒ (S, M) as n → ∞, where S and M are independent Poisson r.v.s with means λ1 = E[γξ] µ and λ2 = E[γ(γ − 1)]E[ξ(ξ − 1)] 2µ2 .

◮ Proof adapted from the undirected case (Van der Hofstad ’08 -’12). ◮ Theorem: Under the same assumptions,

lim

n→∞ P(graph obtained from (mn, dn) is simple) = e−λ1−λ2 > 0.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 11/25

slide-17
SLIDE 17

The repeated and erased models

◮ Repeated model: If all four regularity conditions are satisfied, then

repeat the random pairing until a simple graph is obtained.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 12/25

slide-18
SLIDE 18

The repeated and erased models

◮ Repeated model: If all four regularity conditions are satisfied, then

repeat the random pairing until a simple graph is obtained.

◮ If condition (4) is not satisfied, the probability of obtaining a simple graph

converges to zero as n → ∞.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 12/25

slide-19
SLIDE 19

The repeated and erased models

◮ Repeated model: If all four regularity conditions are satisfied, then

repeat the random pairing until a simple graph is obtained.

◮ If condition (4) is not satisfied, the probability of obtaining a simple graph

converges to zero as n → ∞.

◮ Erased model: Simply erase the self-loops and merge multiple edges in

the same direction into a single edge.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 12/25

slide-20
SLIDE 20

The repeated and erased models

◮ Repeated model: If all four regularity conditions are satisfied, then

repeat the random pairing until a simple graph is obtained.

◮ If condition (4) is not satisfied, the probability of obtaining a simple graph

converges to zero as n → ∞.

◮ Erased model: Simply erase the self-loops and merge multiple edges in

the same direction into a single edge.

◮ Question: (yet to answer) What are the in-degree and out-degree

distributions of the resulting simple graphs?

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 12/25

slide-21
SLIDE 21

Degree sequences

◮ How to construct an appropriate bi-degree-sequence?

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 13/25

slide-22
SLIDE 22

Degree sequences

◮ How to construct an appropriate bi-degree-sequence? ◮ For the undirected case one can take Dn = {D1, . . . , Dn}, where the

{Di} are i.i.d. r.v.s.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 13/25

slide-23
SLIDE 23

Degree sequences

◮ How to construct an appropriate bi-degree-sequence? ◮ For the undirected case one can take Dn = {D1, . . . , Dn}, where the

{Di} are i.i.d. r.v.s.

◮ Question: When is an i.i.d. sequence graphical?

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 13/25

slide-24
SLIDE 24

Degree sequences

◮ How to construct an appropriate bi-degree-sequence? ◮ For the undirected case one can take Dn = {D1, . . . , Dn}, where the

{Di} are i.i.d. r.v.s.

◮ Question: When is an i.i.d. sequence graphical? ◮ Answer: (Arratia and Ligget ’ 05) Provided E[D1] < ∞,

lim

n→∞ P(Dn is graphical) =

  • 1/2,

if P(D1 = odd) > 0, 1, if P(D1 = odd) = 0.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 13/25

slide-25
SLIDE 25

Degree sequences

◮ How to construct an appropriate bi-degree-sequence? ◮ For the undirected case one can take Dn = {D1, . . . , Dn}, where the

{Di} are i.i.d. r.v.s.

◮ Question: When is an i.i.d. sequence graphical? ◮ Answer: (Arratia and Ligget ’ 05) Provided E[D1] < ∞,

lim

n→∞ P(Dn is graphical) =

  • 1/2,

if P(D1 = odd) > 0, 1, if P(D1 = odd) = 0.

◮ Easy fix: either resample Dn until its sum is even, or simply add 1 to the

last node if the sum is odd.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 13/25

slide-26
SLIDE 26

Bi-degree-sequence

◮ We want a bi-degree-sequence (N, D)n such that the {Ni} and the {Di}

are close to being independent sequences of i.i.d. r.v.s from distributions F and G, resp.

◮ The sequences must satisfy n

  • i=1

Ni =

n

  • i=1

Di for all n.

◮ Problem: In general, if {γi} and {ξi} are independent i.i.d. sequences

with E[γ1] = E[ξ1], lim

n→∞ P

n

  • i=1

γi =

n

  • i=1

ξi

  • = 0.

◮ The “easy fix”: add a few in-degrees or out-degrees to match the sums.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 14/25

slide-27
SLIDE 27

Assumptions on the target distributions

◮ In-degree target distribution F. ◮ Out-degree target distribution G. ◮ Assume F and G have support on {0, 1, 2, . . . } and have common mean

µ > 0.

◮ Suppose further that for some α, β > 1,

F(x) =

  • k>x

fk ≤ x−αLF (x) and G(x) =

  • k>x

gk ≤ x−βLG(x), for all x ≥ 0, where LF (·) and LG(·) are slowly varying.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 15/25

slide-28
SLIDE 28

The Algorithm

  • 1. Fix 0 < δ0 < 1 − θ, θ = max{α−1, β−1, 1/2}.
  • 2. Sample {γ1, . . . , γn} i.i.d. from F; let Γn = n

i=1 γi.

  • 3. Sample {ξ1, . . . , ξn} i.i.d. from G; let Ξn = n

i=1 ξi.

  • 4. Let ∆n = Γn − Ξn. If |∆n| ≤ nθ+δ0 go to step 5; otherwise go to step 2.
  • 5. Choose randomly |∆n| nodes S = {i1, i2, . . . , i|∆n|} without

remplacement and let Ni = γi + τi, Di = ξi + χi, i = 1, 2, . . . , n, where χi =

  • 1

if ∆n ≥ 0 and i ∈ S,

  • therwise,

and τi =

  • 1

if ∆n < 0 and i ∈ S,

  • therwise.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 16/25

slide-29
SLIDE 29

Some basic properties

◮ The parameter θ is chosen so that

lim

n→∞ P(|∆n| ≤ nθ+δ0) = 1. ◮ Proposition: (Nn, Dn) satisfies for any fixed r, s ∈ N,

(Ni1, . . . , Nir, Dj1, . . . , Djs) ⇒ (γ1, . . . , γr, ξ1, . . . , ξs) as n → ∞, where {γi} and {ξi} are independent sequences of i.i.d. random variables having distributions F and G, respectively.

◮ (Nn, Dn) is an approximate equivalent of the i.i.d. degree sequence for

the undirected case.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 17/25

slide-30
SLIDE 30

Is the bi-degree-sequence graphical?

◮ Theorem: (Chen, O-C ’12) The bi-degree-sequence (Nn, Dn) satisfies

lim

n→∞ P ((Nn, Dn) is graphical) = 1.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 18/25

slide-31
SLIDE 31

Is the bi-degree-sequence graphical?

◮ Theorem: (Chen, O-C ’12) The bi-degree-sequence (Nn, Dn) satisfies

lim

n→∞ P ((Nn, Dn) is graphical) = 1. ◮ The proof uses a graphicality criterion from Berge ’76.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 18/25

slide-32
SLIDE 32

Random pairing with (Nn, Dn)

◮ Does (Nn, Dn) satisfy the regularity conditions?

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 19/25

slide-33
SLIDE 33

Random pairing with (Nn, Dn)

◮ Does (Nn, Dn) satisfy the regularity conditions? ◮ It can be shown that

1 n

n

  • k=1

1(Nk = i, Dk = j)

P

− → figj, for all i, j ∈ N ∪ {0}, 1 n

n

  • i=1

Ni

P

− → E[γ1], 1 n

n

  • i=1

Di

P

− → E[ξ1], and 1 n

n

  • i=1

NiDi

P

− → E[γ1ξ1],

and provided E[γ2

1 + ξ2 1] < ∞,

1 n

n

  • i=1

N 2

i P

− → E[γ2

1],

and 1 n

n

  • i=1

D2

i P

− → E[ξ2

1].

◮ Therefore, if E[γ2 1 + ξ2 1] < ∞, the directed configuration model will

produce a simple graph with probability bounded away from zero.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 19/25

slide-34
SLIDE 34

Repeated directed configuration model

◮ Let N (r) k

and D(r)

k

be the in-degree and out-degree of node k in the resulting simple graph.

◮ Define for i, j = 0, 1, 2, . . . ,

h(n)(i, j) = 1 n

n

  • k=1

P(N (r)

k

= i, D(r)

k

= j),

  • fi

(n) = 1

n

n

  • k=1

1(N (r)

k

= i) and

  • gj

(n) = 1

n

n

  • k=1

1(D(r)

k

= j).

◮ Proposition: For the repeated directed configuration model with

bi-degree-sequence (Nn, Dn) we have for all i, j = 0, 1, 2, . . . ,

1. h(n)(i, j) → figj as n → ∞, and 2.

  • fi

(n) P

− → fi and

  • gj

(n) P

− → gj, n → ∞.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 20/25

slide-35
SLIDE 35

10 10

1

10

2

10

−7

10

−6

10

−5

10

−4

10

−3

10

−2

10

−1

10

Repeated Model, Out-degree distribution, Empirical vs. Target (Paretos α = 5, β = 5) k P(D > k) n = 100 n = 1,000 n = 10,000 n = 100,000 n = 1,000,000 Target distr.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 21/25

slide-36
SLIDE 36

If the regularity conditions fail...

◮ If E[γ2 1 + ξ2 1] = ∞ but parts (1)-(3) of the regularity condition hold:

“erase all the self-lops and merge multiple edges in the same direction into a single edge.”

◮ Note: The resulting simple graph no longer has (Nn, Dn) as its

bi-degree-sequence.

◮ Question: Do the in-degrees and out-degrees still follow the target

distributions F and G?

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 22/25

slide-37
SLIDE 37

Erased directed configuration model

◮ Let N (e) k

and D(e)

k

be the in-degree and out-degree of node k in the resulting simple graph.

◮ Define for i, j = 0, 1, 2, . . . ,

h(n)(i, j) = 1 n

n

  • k=1

P(N (e)

k

= i, D(e)

k

= j),

  • fi

(n) = 1

n

n

  • k=1

1(N (e)

k

= i) and

  • gj

(n) = 1

n

n

  • k=1

1(D(e)

k

= j).

◮ Proposition: For the erased directed configuration model with

bi-degree-sequence (Nn, Dn) we have for all i, j = 0, 1, 2, . . . ,

1. h(n)(i, j) → figj as n → ∞, and 2.

  • fi

(n) P

− → fi and

  • gj

(n) P

− → gj, n → ∞.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 23/25

slide-38
SLIDE 38

10 10

1

10

2

10

3

10

4

10

−4

10

−3

10

−2

10

−1

10

Erased Model, In-degree distribution, Empirical vs. Target (Paretos α = 1.1, β = 1.5) k P(N > k) n = 100 n = 1,000 n = 10,000 n = 100,000 n = 1,000,000 Target distr.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 24/25

slide-39
SLIDE 39

Thank you for your attention.

ECT, Trento, Italy Directed Random Graphs with Given Degree Distributions 25/25