Rumor Spreading and Conductance Flavio Chierichetti Silvio Lattanzi - - PowerPoint PPT Presentation

rumor spreading and conductance
SMART_READER_LITE
LIVE PREVIEW

Rumor Spreading and Conductance Flavio Chierichetti Silvio Lattanzi - - PowerPoint PPT Presentation

Rumor Spreading and Conductance Flavio Chierichetti Silvio Lattanzi Alessandro Panconesi Sapienza University of Rome Why is rumor spreading fast in social networks? How to answer this question? How to define rumor spreading? What


slide-1
SLIDE 1

Rumor Spreading and Conductance

Flavio Chierichetti Silvio Lattanzi Alessandro Panconesi

Sapienza University of Rome

slide-2
SLIDE 2

Why is rumor spreading fast in social networks?

  • How to answer this question?
  • How to define rumor spreading?
  • What are social networks?
slide-3
SLIDE 3

Rumor spreading

slide-4
SLIDE 4

Rumor spreading

  • Push, Pull and Push-Pull

Introduced in the contest of distributed database.

Demers, Greene, Hauser, Irish, Larson, Shenker, Sturgis, Swinehart, Terry, PODC 1987

  • Basic mechanisms for information

dissemination in networks.

slide-5
SLIDE 5

Push

slide-6
SLIDE 6

Push

slide-7
SLIDE 7

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-8
SLIDE 8

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-9
SLIDE 9

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-10
SLIDE 10

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-11
SLIDE 11

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-12
SLIDE 12

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-13
SLIDE 13

In each round, each informed node “pushes” the information to a neighbor chosen UAR.

Push

slide-14
SLIDE 14

Pull

slide-15
SLIDE 15

In each round, each uninformed node tries to “pull” the information from a neighbor chosen UAR.

Pull

slide-16
SLIDE 16

In each round, each uninformed node tries to “pull” the information from a neighbor chosen UAR.

Pull

slide-17
SLIDE 17

In each round, each uninformed node tries to “pull” the information from a neighbor chosen UAR.

Pull

slide-18
SLIDE 18

In each round, each uninformed node tries to “pull” the information from a neighbor chosen UAR.

Pull

slide-19
SLIDE 19

Push-Pull

slide-20
SLIDE 20

Each node performs both actions on neighbor chosen UAR.

Push-Pull

slide-21
SLIDE 21

Each node performs both actions on neighbor chosen UAR.

Push-Pull

slide-22
SLIDE 22

Push-Pull

Each node performs both actions on neighbor chosen UAR.

slide-23
SLIDE 23

Each node performs both actions on neighbor chosen UAR.

Push-Pull

slide-24
SLIDE 24

Each node performs both actions on neighbor chosen UAR.

Push-Pull

slide-25
SLIDE 25

Push Pull

slide-26
SLIDE 26
  • What are the completion times TPUSH, TPULL,

TPUSH-PULL?

  • How many rounds will it take for each node to

know the information with probability 1 – o(1), assuming a worst-case source?

Performance

slide-27
SLIDE 27
  • TPUSH, TPULL = Θ(log n) if G = Kn

Frieze, Grimmet, Algorithms 1985

Previous work

slide-28
SLIDE 28
  • TPUSH, TPULL = Θ(log n) if G = Kn

Frieze, Grimmet, Algorithms 1985

  • TPUSH ≤ O(n log n)

TPUSH ≤ O(Δ(G) ( diam(G) + log n )) TPUSH ≤ O(log n) in Hypercubes and G(n,p) Graphs

Feige, Peleg, Raghavan, Upfal, Algorithms 1990

Previous work

slide-29
SLIDE 29
  • TPUSH, TPULL = Θ(log n) if G = Kn

Frieze, Grimmet, Algorithms 1985

  • TPUSH ≤ O(n log n)

TPUSH ≤ O(Δ(G) ( diam(G) + log n )) TPUSH ≤ O(log n) in Hypercubes and G(n,p) Graphs

Feige, Peleg, Raghavan, Upfal, Algorithms 1990

Previous work

Huge in Social Networks!

slide-30
SLIDE 30
  • TPUSH O(log n) in “quasi-regular” expanders

Berenbrink, Elsässer, Friedetzky, PODC 2008 Doerr, Friedrich, Sauerwald, ICALP 2009 Sauerwald, ISAAC 2007

  • TPUSH-PULL O(log2n) in PA graphs

TPUSH-PULL O(poly(Φ-1 log n)) if conductance = Φ

Chierichetti, Lattanzi, Panconesi, ICALP 2009, SODA 2010

  • Non uniform rumor spreading and conductance

Boyd, Ghosh, Prabhakar, Shah, IEEE Transaction on Information Theory 2006 Mosk-Aoyama, Shah, IEEE Transaction on Information Theory 2008

Previous work

slide-31
SLIDE 31
  • TPUSH O(log n) in “quasi-regular” expanders

Berenbrink, Elsässer, Friedetzky, PODC 2008 Doerr, Friedrich, Sauerwald, ICALP 2009 Sauerwald, ISAAC 2007

  • TPUSH-PULL O(log2n) in PA graphs

TPUSH-PULL O(poly(Φ-1 log n)) if conductance = Φ

Chierichetti, Lattanzi, Panconesi, ICALP 2009, SODA 2010

  • Non uniform rumor spreading and conductance

Boyd, Ghosh, Prabhakar, Shah, IEEE Transaction on Information Theory 2006 Mosk-Aoyama, Shah, IEEE Transaction on Information Theory 2008

Previous work

Social Networks are highly irregular!

slide-32
SLIDE 32
  • TPUSH O(log n) in “quasi-regular” expanders

Berenbrink, Elsässer, Friedetzky, PODC 2008 Doerr, Friedrich, Sauerwald, ICALP 2009 Sauerwald, ISAAC 2007

  • TPUSH-PULL O(log2n) in PA graphs

TPUSH-PULL O(poly(Φ-1 log n)) if conductance = Φ

Chierichetti, Lattanzi, Panconesi, ICALP 2009, SODA 2010

  • Non uniform rumor spreading and conductance

Boyd, Ghosh, Prabhakar, Shah, IEEE Transaction on Information Theory 2006 Mosk-Aoyama, Shah, IEEE Transaction on Information Theory 2008

Previous work

Connections to Spielman-Teng sparsification theory

slide-33
SLIDE 33

Social networks

slide-34
SLIDE 34

Empirical evidence

  • Leskovec, Lang, Dasgupta and Mahoney give

empirical evidence that social networks have conductance

  • Can we relate the performance of rumor

spreading algorithms with the conductance of the graph?

  • 1

log n

slide-35
SLIDE 35

Conductance

S C(S,V-S)

φ(G) = min

S⊆V vol(S)≤|E|

|C(S, V − S)| min(vol(S), vol(V − S))

slide-36
SLIDE 36

Social networks high conductance

slide-37
SLIDE 37

Social networks high conductance

Is rumor spreading fast on high conductance graphs?

slide-38
SLIDE 38

TPUSH

Is the PUSH strategy fast?

slide-39
SLIDE 39

TPUSH

Is the PUSH strategy fast?

slide-40
SLIDE 40

TPUSH

Is the PUSH strategy fast? The star has constant conductance.

slide-41
SLIDE 41

TPUSH

Is the PUSH strategy fast? The star has constant conductance.

slide-42
SLIDE 42

TPUSH

Is the PUSH strategy fast? The star has constant conductance.

slide-43
SLIDE 43

TPUSH

Is the PUSH strategy fast? The star has constant conductance.

slide-44
SLIDE 44

TPUSH

Is the PUSH strategy fast? Coupon Collector!

slide-45
SLIDE 45

TPUSH

Is the PUSH strategy fast? NO Coupon Collector!

slide-46
SLIDE 46

TPULL

Is the PULL strategy fast?

slide-47
SLIDE 47

TPULL

Is the PULL strategy fast?

slide-48
SLIDE 48

TPULL

Is the PULL strategy fast? The central node has to PULL the information from the right node.

slide-49
SLIDE 49

TPULL

The central node has to PULL the information from the right node. Is the PULL strategy fast? NO

slide-50
SLIDE 50

TPUSH and TPULL in social networks

Social networks do not look like a star...

slide-51
SLIDE 51

TPUSH and TPULL in social networks

Social networks do not look like a star... ... but there are low degree nodes connected only to high degree nodes.

slide-52
SLIDE 52

TPUSH and TPULL in social networks

Social networks do not look like a star... ... but there are low degree nodes connected only to high degree nodes. SAME ISSUES!!!

slide-53
SLIDE 53

TPUSH-PULL?

slide-54
SLIDE 54

Theorem

Let G be a graph with conductance Φ, then w.h.p.

slide-55
SLIDE 55

Theorem

Let G be a graph with conductance Φ, then w.h.p.

slide-56
SLIDE 56

Lower bound

  • Take any 3-regular graph of constant vertex

expansion of order Θ(nΦ) and diameter Θ(log n)

slide-57
SLIDE 57

Lower bound

  • Take any 3-regular graph of constant vertex

expansion of order Θ(nΦ) and diameter Θ(log n)

  • Replace each edge with a path of length Θ(Φ-1)
slide-58
SLIDE 58

Lower bound

  • Take any 3-regular graph of constant vertex

expansion of order Θ(nΦ) and diameter Θ(log n)

  • Replace each edge with a path of length Θ(Φ-1)
  • The resulting graph will have order Θ(n), diameter

Θ(Φ-1 log n) and conductance Θ(Φ).

slide-59
SLIDE 59

Upper bound

Let G be a graph with conductance Φ, then w.h.p.

slide-60
SLIDE 60

Proof strategy

Key lemma

S

slide-61
SLIDE 61

Proof strategy

S

We consider the process for O(Φ-1) steps. Key lemma

slide-62
SLIDE 62

Proof strategy

S’

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

Key lemma

S

slide-63
SLIDE 63

Proof strategy

S

We consider macro-phases composed by O(Φ-1)

steps

slide-64
SLIDE 64

Proof strategy

We say that a macro-phase is successful if the volume increases by a factor of (1+ Φ).

S

We consider macro-phases composed by O(Φ-1)

steps

slide-65
SLIDE 65

Proof strategy

After 1 successful macro-phase, we have:

Vol(S’) ≥ (1+ Ω(Φ)) Vol(S) S’

We consider macro-phases composed by O(Φ-1)

steps

slide-66
SLIDE 66

Proof strategy

After 2 successful macro-phases, we have:

Vol(S’’) ≥ (1+ Ω(Φ))2 Vol(S) S’’

We consider macro-phases composed by O(Φ-1)

steps

slide-67
SLIDE 67

Proof strategy

After Θ(Φ-1 log n) successful macro-phases, we have:

Vol(INFORMED) > Vol(G)/2

INFORMED

We consider macro-phases composed by O(Φ-1)

steps

slide-68
SLIDE 68

Proof strategy

  • A macro-phase is successful with constant

probability.

slide-69
SLIDE 69

Proof strategy

  • A macro-phase is successful with constant

probability.

  • After O(Φ-1 log n) successful macro-phases, we

have Vol(INFORMED) ≥ Vol(G)/2

slide-70
SLIDE 70

Proof strategy

  • A macro-phase is successful with constant

probability.

  • After O(Φ-1 log n) successful macro-phases, we

have Vol(INFORMED) ≥ Vol(G)/2

  • Using the Chernoff bound after O(Φ-1 log n)

macro-phases, we have O(Φ-1 log n) successful macro-phases.

slide-71
SLIDE 71

Proof strategy

After O(Φ-2 log n) steps we have Vol(INFORMED) > Vol(G)/2 w.h.p.

slide-72
SLIDE 72

Proof strategy

After O(Φ-2 log n) steps we have Vol(INFORMED) > Vol(G)/2 w.h.p. E1

slide-73
SLIDE 73

Proof strategy

E1 After O(Φ-2 log n) steps we have Vol(INFORMED) > Vol(G)/2 w.h.p.

slide-74
SLIDE 74

Proof strategy

E1 E2 After O(Φ-2 log n) steps we have Vol(INFORMED) > Vol(G)/2 w.h.p.

slide-75
SLIDE 75

Proof strategy

E1 E2 After O(Φ-2 log n) steps we have Vol(INFORMED) > Vol(G)/2 w.h.p.

slide-76
SLIDE 76

Proof strategy

E1 E2 P(E1) = P(E2) After O(Φ-2 log n) steps we have Vol(INFORMED) > Vol(G)/2 w.h.p.

slide-77
SLIDE 77

Proof strategy

  • After O(Φ-2 log n) steps we have

Vol(INFORMED) > Vol(G)/2 w.h.p.

  • After O(Φ-2 log n) steps each node pulls the

information from a set of nodes of Vol(G)/2

w.h.p.

  • After O(Φ-2 log n) steps all the nodes have

the info w.h.p.

slide-78
SLIDE 78

Key lemma

S’

After O(Φ-1) steps with constant probability, we

have that for the new set of informed nodes S’ Vol(S’) ≥ (1+ Ω(Φ)) Vol(S) S

slide-79
SLIDE 79

Sketch of the proof

Idea: analyze what happens to each node in S in a macro-phase

S

slide-80
SLIDE 80

Sketch of the proof

S

v0

slide-81
SLIDE 81

Sketch of the proof

S

v0

slide-82
SLIDE 82

Sketch of the proof

S

v0 G(v0)

slide-83
SLIDE 83

Sketch of the proof

S

v1 G(v0) v0

slide-84
SLIDE 84

Sketch of the proof

S

v1 G(v0) v0

slide-85
SLIDE 85

Sketch of the proof

S

v1 G(v0) v0

slide-86
SLIDE 86

Sketch of the proof

S

v1 G(v0) v0

slide-87
SLIDE 87

Sketch of the proof

S

v1 G(v0) v0 G(v1)

slide-88
SLIDE 88

Sketch of the proof

S

We define the following sets:

slide-89
SLIDE 89

Sketch of the proof

We define the following sets:

  • A⊆S, informed nodes that we still have to consider

S A

slide-90
SLIDE 90

Sketch of the proof

We define the following sets:

  • A⊆S, informed nodes that we still have to consider

S A

slide-91
SLIDE 91

Sketch of the proof

We define the following sets:

  • A⊆S, informed nodes that we still have to consider
  • B⊇S, informed nodes at the current phase

S A B

slide-92
SLIDE 92

Sketch of the proof

We define the following sets:

  • A⊆S, informed nodes that we still have to consider
  • B⊇S, informed nodes at the current phase

S A B

slide-93
SLIDE 93

Sketch of the proof

We define the following sets:

  • A⊆S, informed nodes that we still have to consider
  • B⊇S, informed nodes at the current phase

S A B

slide-94
SLIDE 94

Sketch of the proof

We define the following sets:

  • A⊆S, informed nodes that we still have to consider
  • B⊇S, informed nodes at the current phase

S A B

slide-95
SLIDE 95

Sketch of the proof

S A B U

We define the following sets:

  • A⊆S, informed nodes that we still have to consider
  • B⊇S, informed nodes at the current phase
  • U useful nodes in A
slide-96
SLIDE 96

Sketch of the proof

S A B U

We define the following sets:

  • A⊆S, informed nodes that we still have to consider
  • B⊇S, informed nodes at the current phase
  • U useful nodes in A
slide-97
SLIDE 97

Sketch of the proof

The set of useful nodes U is

S A B U

U = UB(A) =

  • v ∈ A | deg+

B(v)

deg(v) ≥ φ 2

slide-98
SLIDE 98

Sketch of the proof

The set of useful nodes U is

S A B U

  • 1. The cut (U, V - B) is a large part of the cut

(S, V - S), which has size at least Φ Vol(S).

U = UB(A) =

  • v ∈ A | deg+

B(v)

deg(v) ≥ φ 2

slide-99
SLIDE 99

Sketch of the proof

The set of useful nodes U is

S A B U

  • 1. The cut (U, V - B) is a large part of the cut

(S, V - S), which has size at least Φ Vol(S).

  • 2. And, furthermore, each node in U will have

constant probability of gaining a constant fraction of its edges in the cut.

U = UB(A) =

  • v ∈ A | deg+

B(v)

deg(v) ≥ φ 2

slide-100
SLIDE 100

Sketch of the proof

In order to get the key lemma we prove that for every macro- phase, and every v in U Pr

  • G(v) ≥ 1

20 · deg+

B(v)

  • ≥ 1 − e−1
slide-101
SLIDE 101

U

Sketch of the proof

v

Bi

d+(v)

slide-102
SLIDE 102

U

Sketch of the proof

v

Bi H

d+(v)

L

slide-103
SLIDE 103

U

Sketch of the proof

v

By applying Chebyshev inequality with some arithmetic manipulation, we get that, in the PULL regime:

Bi

Pr

  • g(v) > 1

20 · d+(v)

  • ≥ 1

10

L

≥ 1/2 · d+(v)

slide-104
SLIDE 104

U

Sketch of the proof

v

Bi

In the PUSH regime, we had:

Pr

  • g(v) > 1

20 · d+(v)

  • ≥ φ

10

H

≥ 1/2 · d+(v)

slide-105
SLIDE 105

U

Sketch of the proof

v

Bi

So, in general,

Pr

  • g(v) > 1

20 · d+(v)

  • ≥ φ

10

d+(v)

slide-106
SLIDE 106

Sketch of the proof

Since we go on for Φ-1 steps, Pr

  • G(v) ≥ 1

20 · deg+

B(v)

  • ≥ 1 − e−1
slide-107
SLIDE 107

Upper bound

Let G be a graph with conductance Φ, then w.h.p.

slide-108
SLIDE 108

The tighter bound

Let G be a graph with conductance Φ, then w.h.p.

slide-109
SLIDE 109

Can the key lemma be improved?

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

slide-110
SLIDE 110

Can the key lemma be improved?

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

slide-111
SLIDE 111

Can the key lemma be improved?

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

slide-112
SLIDE 112

Can the key lemma be improved?

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

∀v deg(v) ≃ Φ−1

slide-113
SLIDE 113

Can the key lemma be improved?

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

deg = 2 deg ≃ Φ−1

slide-114
SLIDE 114

Can the key lemma be improved?

After Θ(Φ-1) steps with Θ(1) probability, if S’

is the new set of informed nodes, then Vol(S’) ≥ (1+ Ω(Φ)) Vol(S)

No?

slide-115
SLIDE 115

Stronger key lemma

S

For some p ≥ Φ, after O(1/p) steps with constant probability,

we have that for the new set of informed nodes S’

vol(S′) ≥

  • 1 + Ω
  • φ

p log2 φ−1

  • · vol(S)
slide-116
SLIDE 116

Conclusion

  • We studied the rumor spreading problem in graph
  • f conductance Φ.
  • We showed that the PUSH and the PULL strategies are

not fast,

  • and that the PUSH-PULL strategy is fast, and we gave an

almost tight bound for its performance.

slide-117
SLIDE 117

Conclusion

  • We studied the rumor spreading problem in graph
  • f conductance Φ.
  • We showed that the PUSH and the PULL strategies are

not fast, (fast if some kind of “degree uniformity” exists)

  • and that the PUSH-PULL strategy is fast, and we gave an

almost tight bound for its performance.

slide-118
SLIDE 118

Open problems

  • Find a tight bound for the PUSH-PULL strategy.
  • Study the relationship between rumor spreading and

vertex expansion.

  • Can the PUSH strategy inform efficiently a large part of

a social network?

slide-119
SLIDE 119

Thank you! Questions?