Navigability of Small World Networks Pierre Fraigniaud CNRS and - - PowerPoint PPT Presentation

navigability of small world networks
SMART_READER_LITE
LIVE PREVIEW

Navigability of Small World Networks Pierre Fraigniaud CNRS and - - PowerPoint PPT Presentation

Navigability of Small World Networks Pierre Fraigniaud CNRS and University Paris Sud http://www.lri.fr/~pierre Introduction Interaction Networks Communication networks Internet Ad hoc and sensor networks Societal networks


slide-1
SLIDE 1

Navigability of Small World Networks

Pierre Fraigniaud CNRS and University Paris Sud

http://www.lri.fr/~pierre

slide-2
SLIDE 2

Introduction

slide-3
SLIDE 3
  • Dec. 19, 2006

HiPC'06 3

Interaction Networks

  • Communication networks

– Internet – Ad hoc and sensor networks

  • Societal networks

– The Web – P2P networks (the unstructured ones)

  • Social network

– Acquaintance – Mail exchanges

  • Biology (Interactome network), linguistics, etc.
slide-4
SLIDE 4
  • Dec. 19, 2006

HiPC'06 4

Common statistical properties

  • Low density
  • “Small world” properties:

– Average distance between two nodes is small, typically O(log n) – The probability p that two distinct neighbors u1 and u2 of a same node v are neighbors is large. p = clustering coefficient

  • “Scale free” properties:

– Heavy tailed probability distributions (e.g., of the degrees)

slide-5
SLIDE 5
  • Dec. 19, 2006

HiPC'06 5

Gaussian vs. Heavy tail

µ Example : human sizes Example : salaries

slide-6
SLIDE 6
  • Dec. 19, 2006

HiPC'06 6

Power law

prob{ X=k } prob{ X=k } ≈ ≈ k k-

  • α

α

log log p pk

k

log k log k

slide-7
SLIDE 7
  • Dec. 19, 2006

HiPC'06 7

Random graphs vs. Interaction networks

  • Random graphs: prob{e exists} ≈ log(n)/n

– low clustering coefficient – Gaussian distribution of the degrees

  • Interaction networks

– High clustering coefficient – Heavy tailed distribution of the degrees

slide-8
SLIDE 8
  • Dec. 19, 2006

HiPC'06 8

New problematic

  • Why these networks share these

properties?

  • What model for

– Performance analysis of these networks – Algorithm design for these networks

  • Impact of the measures?
  • This lecture addresses navigability
slide-9
SLIDE 9

Navigability

slide-10
SLIDE 10
  • Dec. 19, 2006

HiPC'06 10

Milgram Experiment

  • Source person s (e.g., in Wichita)
  • Target person t (e.g., in Cambridge)

– Name, professional occupation, city of living, etc.

  • Letter transmitted via a chain of

individuals related on a personal basis

  • Result: “six degrees of separation”
slide-11
SLIDE 11
  • Dec. 19, 2006

HiPC'06 11

Navigability

  • Jon Kleinberg (2000)

– Why should there exist short chains of acquaintances linking together arbitrary pairs of strangers? – Why should arbitrary pairs of strangers be able to find short chains of acquaintances that link them together?

  • In other words: how to navigate in a

small worlds?

slide-12
SLIDE 12
  • Dec. 19, 2006

HiPC'06 12

Nevanlinna Price

  • Price rewarding a major contribution in

Mathematics for its impact in computer science.

  • Laureats

– 1982 - Robert Tarjan – 1986 - Leslie Valiant – 1990 - A.A. Razborov – 1994 - Avi Wigderson – 1998 - Peter Shor – 2002 - Madhu Sudan – 2006 - Jon Kleinberg

slide-13
SLIDE 13
  • Dec. 19, 2006

HiPC'06 13

Augmented graphs H=G+D

  • Individuals as nodes of a graph G

– Edges of G model relations between individuals deducible from their societal positions

  • A number k of “long links” are added to G at

random, according to the probability distribution D

– Long links model relations between individuals that cannot be deduced from their societal positions

slide-14
SLIDE 14
  • Dec. 19, 2006

HiPC'06 14

Greedy Routing in augmented graphs

  • Source s ∈ V(G)
  • Target t ∈ V(G)
  • Current node x selects among its degG(x)+k

neighbors the closest to t in G, that is according to the distance function distG().

Greedy routing in augmented graphs aims at modeling the routing process performed by social entities in Milgram’s experiment.

slide-15
SLIDE 15
  • Dec. 19, 2006

HiPC'06 15

Augmented meshes

Kleinberg [STOC 2000] d-dimensional n-node meshes augmented with d-harmonic links u u v v prob(u prob(u→ →v) v) ≈ ≈ 1 1/

/(

(log(n)*dist(u,v)

log(n)*dist(u,v)d

d)

)

slide-16
SLIDE 16
  • Dec. 19, 2006

HiPC'06 16

Harmonic distribution

  • d-dimensional mesh
  • B(x,r) = ball centered at x of radius r
  • S(x,r) = sphere centered at x of radius r
  • In d-dimensional meshes:

|B(x,r)| ≈ rd |S(x,r )| ≈ rd-1

Σv≠u(1/dist(u,v)d) = Σr |S(u,r)|/rd ≈ Σr 1/r ≈ log n

slide-17
SLIDE 17
  • Dec. 19, 2006

HiPC'06 17

Performances

dist(x,t)=r x t z y Expected #steps to enter B(t,r/2) B(t,r/2) is is O(log n) O(log n) B(t,r/2) For a current node For a current node x x at distance at distance r r from from t t, , prob{x → B(t,r/2)} is at least Ω(1/log n)

slide-18
SLIDE 18
  • Dec. 19, 2006

HiPC'06 18

Kleinberg’s theorems

  • Greedy routing performs in O(log2n / k)

expected #steps in d-dimensional meshes augmented with k links per node, chosen according to the d-harmonic distribution.

– Note: k = log n ⇒ O(log n) expect. #steps

  • Greedy routing in d-dimensional meshes

augmented with a h-harmonic distribution, h≠d, performs in Ω(nε) expected #steps.

slide-19
SLIDE 19
  • Dec. 19, 2006

HiPC'06 19

Extensions

  • Two-step greedy routing: O(log n / loglog n)

– Coppersmith, Gamarnik, Sviridenko (2002)

  • Percolation theory

– Manku, Naor, Wieder (2004)

  • NoN routing
  • Routing with partial knowledge: O(log1+1/d n)

– Martel, Nguyen (2004)

  • Non-oblivious routing

– Fraigniaud, Gavoille, Paul (2004)

  • Oblivious routing
  • Decentralized routing: O(log n * log2log n)

– Lebhar, Schabanel (2004)

  • O(log2n) expected #steps to find the route
slide-20
SLIDE 20

polylog navigable networks

slide-21
SLIDE 21
  • Dec. 19, 2006

HiPC'06 21

Navigable graphs

  • Let f : N → R be a function
  • An n-node graph G is f-navigable if

there exists an augmentation D for G such that greedy routing in G+D performs in at most f(n) expected #steps.

  • I.e., for any two nodes u,v we have

ED(#stepsu→v) ≤ f(n)

slide-22
SLIDE 22
  • Dec. 19, 2006

HiPC'06 22

polylog(n)-navigable graphs

  • Bounded growth graphs

– Definition: |B(x,2r)| ≤ ρ |B(x,r)| – Duchon, Hanusse, Lebhar, Schabanel (2005,2006)

  • Bounded doubling dimension

– Definition: DD d if every B(x,2r) can be covered by at most 2d balls of radius r – Slivkins (2005)

  • Graphs of bounded treewidth

– Fraigniaud (2005)

  • Graphs excluding a fixed minor

– Abraham, Gavoille (2006)

slide-23
SLIDE 23
  • Dec. 19, 2006

HiPC'06 23

Doubling dimension

slide-24
SLIDE 24
  • Dec. 19, 2006

HiPC'06 24

Slivkins’ theorem

  • Theorem: Any family of graphs with

doubling dimension O(loglog n) is polylog(n)-navigable.

  • Proof: Graphs are augmented with

– distG(u,v) = r – prob(u → v) ≈ 1/|B(v,r)|

x t

slide-25
SLIDE 25
  • Dec. 19, 2006

HiPC'06 25

Question

Are all graphs polylog(n)-navigable?

slide-26
SLIDE 26
  • Dec. 19, 2006

HiPC'06 26

Impossibility result

Theorem Let d such that limn→+∞ loglog n / d(n) = 0 There exists an infinite family of n-node graphs with doubling dimension at most d(n) that are not polylog(n)-navigable. Consequences:

  • 1. Slivkins’ result is tight
  • 2. Not all graphs are polylog(n)-navigable
slide-27
SLIDE 27
  • Dec. 19, 2006

HiPC'06 27

Proof of non-navigability

The graphs Hd with n=pd nodes

x = x x = x1

1 x

x2

2 ... x

... xd

d

is connected to all nodes y = y y = y1

1 y

y2

2 ... y

... yd

d

such that y yi

i = x

= xi

i + a

+ ai

i where

a ai

i

∈ ∈ {-1,0,+1} {-1,0,+1} H Hd

d has doubling dimension d

d

slide-28
SLIDE 28
  • Dec. 19, 2006

HiPC'06 28

Intuitive approach

  • Large doubling dimension d

⇒ every nodes x ∈ Hd has choices over exponentially many directions

  • The underlying metric of Hd is L∞
slide-29
SLIDE 29
  • Dec. 19, 2006

HiPC'06 29

Directions

+1,+1 +1,0 +1,-1

  • 1,+1
  • 1,0
  • 1,-1

0,+1 0,-1

δ = (δ1, ..., δd) where δi ∈ ∈ {-1,0,+1} {-1,0,+1} Dirδ(u)={v / vi =ui + xi δi where xi = 1...p/2}

slide-30
SLIDE 30
  • Dec. 19, 2006

HiPC'06 30

Case of symmetric distribution

Source s Target t

Disadvantaged Disadvantaged direction direction At every step: probability ≤ ≤ 1/2 1/2d

d

to go in the right direction

slide-31
SLIDE 31
  • Dec. 19, 2006

HiPC'06 31

  • - General case --

Diagonals

+1,+1 +1,0 +1,-1

  • 1,+1
  • 1,0
  • 1,-1

0,+1 0,-1

slide-32
SLIDE 32
  • Dec. 19, 2006

HiPC'06 32

Lines

p p lines in each direction lines in each direction p p p p

slide-33
SLIDE 33
  • Dec. 19, 2006

HiPC'06 33

Intervals

J J

slide-34
SLIDE 34
  • Dec. 19, 2006

HiPC'06 34

Certificates

J J v v v v is a certificate for is a certificate for J J

slide-35
SLIDE 35
  • Dec. 19, 2006

HiPC'06 35

Counting argument

  • 2d directions
  • Lines are split in intervals of length L
  • n/L × 2d intervals in total
  • Every node belongs to many intervals, but

can be the certificate of at most one interval

  • If L<2d there is one interval J0 without

certificate

slide-36
SLIDE 36
  • Dec. 19, 2006

HiPC'06 36

L-1 steps from s to t

J J0

source s target t

slide-37
SLIDE 37
  • Dec. 19, 2006

HiPC'06 37

In expectation...

  • n/L × 2d - n intervals without certificate
  • L = 2d-1 ⇒ n of the 2n intervals are without certificate
  • This is true for any trial of the long links
  • Hence Ε = ED(#interval without certificate) ≥ n
  • On the other hand:

Ε = ∑J Pr(J has no certificate)

  • Hence there is an interval J0=[s,t] such that

Pr(J0 has no certificate) ≥1/2

  • Hence ED(#stepss→t) ≥ (L-1)/2 QED

Remark: The proof still holds even if the long links are not set pairwise independently.

slide-38
SLIDE 38

Hierarchical Models

slide-39
SLIDE 39
  • Dec. 19, 2006

HiPC'06 39

Kleinberg’s Hierarchical Model

Θ(log n) long links per node Prob(x→y) ≈ height of their lowest common ancestor

slide-40
SLIDE 40
  • Dec. 19, 2006

HiPC'06 40

Interleaved Hierarchies

  • Many hierarchies:

– place of living – professional activity – recreative activity – etc.

  • Can we extract a “global” hierarchy reflecting

all these interleaved hierarchies?

slide-41
SLIDE 41
  • Dec. 19, 2006

HiPC'06 41

Graph classes

Bounded doubling dimension Bounded treewidth Meshes Paths Trees

slide-42
SLIDE 42
  • Dec. 19, 2006

HiPC'06 42

Tree-Decomposition

Definition: A tree-decomposition of a graph G is a pair (T,X) where T is a tree of node set I and X is a collection {Xi ⊆ V(G), i ∈ I} such that

– ∪i∈I Xi = V(G) – ∀ e={x,y} ∈ E(G), ∃ i ∈ I / {x,y} ⊆ Xi – if k ∈ I in on the path between i and j in T, then Xi ∩ Xj ⊆ Xk

slide-43
SLIDE 43
  • Dec. 19, 2006

HiPC'06 43

Recursive Separators

slide-44
SLIDE 44
  • Dec. 19, 2006

HiPC'06 44

Treewidth

  • The width of a tree-decomposition (T,X)

is: width(T,X) = maxi∈I |Xi|-1

  • The treewith of a graph G is the

minimum width of any tree- decomposition (T,X) of G: tw(G) = min(T,X) width(T,X)

slide-45
SLIDE 45
  • Dec. 19, 2006

HiPC'06 45

Centroid

A centroid of an n-node tree T is a vertex v such that T-{v} is a forest of trees, each of at most n/2 vertices.

slide-46
SLIDE 46
  • Dec. 19, 2006

HiPC'06 46

Tree-Decomposition Based Distribution

(1) (3) (2)

x (4)

slide-47
SLIDE 47
  • Dec. 19, 2006

HiPC'06 47

Theorem

  • For any n-node graph G of treewidth k, there

exists a tree-decomposition based distribution D such that greedy routing in G+D performs in O(k log2n) expected number of steps.

  • Application: graphs of bounded treewidth.
slide-48
SLIDE 48
  • Dec. 19, 2006

HiPC'06 48

Proof Sketch

  • Let c be the centroid separating the current

node x and t.

  • It takes O(log n) expect. #steps to reach a

node in c.

  • The centroid c cannot be visited more than

tw(G)+1 times

  • There are ≤ log n levels of centroids
slide-49
SLIDE 49

Ressource Finding in P2P Networks

slide-50
SLIDE 50
  • Dec. 19, 2006

HiPC'06 50

Peer-to-Peer (P2P)

Key space Users Ressources Hashing (DHT) Supports:

  • Publish
  • Search
  • Join/leave
slide-51
SLIDE 51
  • Dec. 19, 2006

HiPC'06 51

Routing in Key Space A Case Study: Chord

1 Lookups and publish in O(log n) steps [Stoica, Morris, Karger, Kaashoek, Balakrishnan]

slide-52
SLIDE 52
  • Dec. 19, 2006

HiPC'06 52

Small World (Symphony)

1 Lookups and publish in O(log2n) steps 1/distance

slide-53
SLIDE 53

Challenges

slide-54
SLIDE 54
  • Dec. 19, 2006

HiPC'06 54

Research Directions

  • Augmenting arbitrary graphs
  • Models

– social networks – emerging properties and structures

  • Applications:

– P2P networks – Grid computing

slide-55
SLIDE 55
  • Dec. 19, 2006

HiPC'06 55

Open Problem

  • Input: an n-node graph G
  • Output: a collection of probability

distributions D={pu, u∈V(G)} for aumenting G, where Pr{u→v} = pu(v)

  • Measure: T(n) = maxG of order n TG where

TG = maxs,t∈V(G) ED(GR from s to t)

  • T(n) = O(√n) (in fact, O(n1/3))
  • T(n) = Ω(n1/√(log n))
slide-56
SLIDE 56

THANK YOU !