http://cs224w.stanford.edu Epidemic Model based on Random Trees (a - - PowerPoint PPT Presentation

โ–ถ
http cs224w stanford edu epidemic model based on random
SMART_READER_LITE
LIVE PREVIEW

http://cs224w.stanford.edu Epidemic Model based on Random Trees (a - - PowerPoint PPT Presentation

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Epidemic Model based on Random Trees (a variant of branching processes) Root node, patient 0 A patient meets d other


slide-1
SLIDE 1

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

http://cs224w.stanford.edu

slide-2
SLIDE 2

๏‚ก Epidemic Model based on Random Trees

  • (a variant of branching processes)
  • A patient meets d other people
  • With probability q>0 infects each
  • f them

๏‚ก Q: For which values of d and q

does the epidemic run forever?

  • Run forever: lim๐‘œโ†’โˆž๐‘„ ๐‘—๐‘—๐‘—๐‘—๐‘—๐‘—๐‘—๐‘— ๐‘—๐‘œ๐‘—๐‘—

๐‘๐‘— ๐‘—๐‘—๐‘’๐‘—๐‘’ ๐‘— > 0

  • Die out: -- || -- = 0

10/23/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 2

Root node, โ€œpatient 0โ€ Start of epidemic d subtrees

slide-3
SLIDE 3

๏‚ก pn = prob. there is an infected node at depth n ๏‚ก We need: lim๐‘œโ†’โˆž ๐‘’๐‘œ =? (based on q and d) ๏‚ก Need recurrence for pn

๐‘’๐‘œ = 1 โˆ’ 1 โˆ’ ๐‘Ÿ๐‘’๐‘œโˆ’1 ๐‘’

๏‚ก lim

๐‘œโ†’โˆž ๐‘’๐‘œ = result of iterating

f x = 1 โˆ’ 1 โˆ’ ๐‘Ÿ๐‘ฆ ๐‘’

  • Starting at x=1 (since p1=1)

10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 3

No infected node at depth n

slide-4
SLIDE 4

10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 4

x f(x) 1 y=x Going to first fixed point

f 0 = 0, f 1 = 1 f 1 = 1 โˆ’ 1 โˆ’ q d < 1 f โ€ฒ x = qd 1 โˆ’ qx dโˆ’1

y = f x

f โ€ฒ 0 = qd โˆถ f โ€ฒ(x) is monotone decreasing on [0,1]

When is this going to 0?

What do we know about f(x)?

slide-5
SLIDE 5

10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 5

x f(x) 1 y=x y = f x

We need f(x) to be bellow y=x! f โ€ฒ 0 < 1

lim

๐‘œโ†’โˆž ๐‘’๐‘œ = 0 ? to ๐‘Ÿ๐‘— < 1

qd = expected # of people at we infect

slide-6
SLIDE 6

๏‚ก In this model nodes only go from

inactive โ†’ active

๏‚ก Can generalize to allow nodes to alternate

between active and inactive state by:

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 6

slide-7
SLIDE 7
slide-8
SLIDE 8

๏‚ก Generalizing to model to Virus Propagation

2 Parameters:

๏‚ก (Virus) birth rate ฮฒ:

  • probability than an infected neighbor attacks

๏‚ก (Virus) death rate ฮด:

  • probability that an infected node heals

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 8

Infected Healthy N N1 N3 N2

  • Prob. ฮฒ
  • Prob. ฮด
slide-9
SLIDE 9

๏‚ก General scheme for epidemic models:

  • Each node can go through phases:
  • Transition probs. are governed by model parameters

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

Sโ€ฆsusceptible Eโ€ฆexposed Iโ€ฆinfected Rโ€ฆrecovered Zโ€ฆimmune

9

slide-10
SLIDE 10

๏‚ก Node goes through phases

  • Models chickenpox or plague:
  • Once you heal, you can never get infected again

๏‚ก Assuming perfect mixing

  • network is a complete graph

the model dynamics is

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 10

Susceptible Infected Recovered time Number of nodes

slide-11
SLIDE 11

๏‚ก Susceptible-Infective-Susceptible (SIS) model ๏‚ก Cured nodes immediately become susceptible ๏‚ก Virus โ€œstrengthโ€: s = ฮฒ / ฮด ๏‚ก Node state transition diagram:

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 11

Susceptible Infective

Infected by neighbor with prob. ฮฒ Cured internally with prob. ฮด

slide-12
SLIDE 12

๏‚ก Models flu:

  • Susceptible node

becomes infected

  • The node then heals

and become susceptible again

๏‚ก Assuming perfect

mixing (complete graph):

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

Susceptible Infected

I SI dt dI ฮด ฮฒ โˆ’ =

I SI dt dS ฮด ฮฒ + โˆ’ =

time Number of nodes

12

I(t) S(t)

slide-13
SLIDE 13

๏‚ก SIS Model ๏‚ก Epidemic threshold of a graph G is a

value of t, such that:

  • If virus strength s = ฮฒ / ฮด < t

the epidemic can not happen (it eventually dies out)

๏‚ก Given a graph what is its epidemic

threshold?

Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 13 10/13/2009

slide-14
SLIDE 14

๏‚ก We have no epidemic if:

Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

ฮฒ/ฮด < ฯ„ = 1/ ฮป1,A

โ–บ ฮป1,A alone captures the property of the graph!

(Virus) Birth rate (Virus) Death rate Epidemic threshold largest eigenvalue

  • f adj. matrix A

[Wang et al. 2003]

10/13/2009 14

slide-15
SLIDE 15

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 15

100 200 300 400 500 250 500 750 1000

Time Number of Infected Nodes

ฮด: 0.05 0.06 0.07 Oregon ฮฒ = 0.001

ฮฒ/ฮด > ฯ„ (above threshold) ฮฒ/ฮด = ฯ„ (at the threshold) ฮฒ/ฮด < ฯ„ (below threshold)

10,900 nodes and 31,180 edges

[Wang et al. 2003]

slide-16
SLIDE 16

๏‚ก Does it matter how many people are

initially infected?

Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 10/13/2009 16

slide-17
SLIDE 17

10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 17

slide-18
SLIDE 18

๏‚ก Blogs โ€“ Information epidemics

  • Which are the influential/infectious blogs?
  • Which blogs create big cascades?

๏‚ก Viral marketing

  • Who are the influencers?
  • Where should I advertise?

๏‚ก Disease spreading

  • Where to place monitoring

stations to detect epidemics?

18 10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

vs.

slide-19
SLIDE 19

๏‚ก Independent Cascade Model

  • Directed finite G=(V,E)
  • Set S starts out with new behavior
  • Say nodes with this behavior are โ€œactiveโ€
  • Each edge (v,w) has a probability pvw
  • If node v is active, it gets one chance to

make w active, with probability pvw

  • Each edge fires at most once

๏‚ก Does scheduling matter? No

  • E.g., u,v both active, doesnโ€™t matter which fires first
  • But the time moves in discrete steps

10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 19

slide-20
SLIDE 20

๏‚ก Initially some nodes S are active ๏‚ก Each edge (v,w) has probability (weight) pvw ๏‚ก When node v becomes active:

  • It activates each out-neighbor w with prob. pvw

๏‚ก Activations spread through the network

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 20

0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.4 0.3 0.3 0.3 0.3 0.3 0.3 0.2

e g f c b a d h i f g e

slide-21
SLIDE 21

๏‚ก S: is initial active set ๏‚ก f(S): the expected size of final active set ๏‚ก Set S is more influential if f(S) is larger

f({a,b} < f({a,c}) < f({a,d})

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 21

graph G

a b d c โ€ฆ influence set

  • f a node
slide-22
SLIDE 22

Problem:

๏‚ก Most influential set of

size k: set S of k nodes producing largest expected cascade size f(S) if activated

[Domingos-Richardson โ€˜01]

๏‚ก Optimization problem:

10/20/2010 22 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

) ( max

k size

  • f

S

S f

0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.4 0.3 0.3 0.3 0.3 0.3 0.3 0.2 c b e a d g f h i Influence set of b

slide-23
SLIDE 23

๏‚ก Most influential set of k nodes: set S on k

nodes producing largest expected cascade size f(S) if activated

๏‚ก The optimization problem: ๏‚ก How hard is this problem?

  • NP-HARD!
  • Show that finding most influential

set is at least as hard as a vertex cover

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 23

) ( max

k size

  • f

S

S f

slide-24
SLIDE 24

๏‚ก Vertex cover problem:

  • Given universe of elements U={u1,โ€ฆ,un}

and sets S1,โ€ฆ, Sm โІ U

  • Are there k sets among S1,โ€ฆ, Sm such that

their union is U?

๏‚ก Goal:

Encode vertex cover as an instance of

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 24

) ( max

k size

  • f

S

S f

U S1

S2 S3

S4

slide-25
SLIDE 25

๏‚ก Given a vertex cover instance with sets S1,โ€ฆ, Sm ๏‚ก Build a bipartite โ€œS-to-Uโ€ graph: ๏‚ก There exists a set S of size k with f(S)=k+n

iff there exists a size k set cover

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 25

Construction:

  • Create edge

(Si,u) โˆ€Si โˆ€ uโˆˆSi

  • - directed edge

from sets to their elements

  • Put weight 1 on

each edge

u1 u2 u3 un e.g.: S1={u1, u2, u3}

1 1 1

S1 S2 S3 Sm Note: Optimal solution is always a set of Si This is hard in general, could be special cases that are easier

slide-26
SLIDE 26

๏‚ก Bad news:

  • Influence maximization is NP-hard

๏‚ก Next, good news:

  • There exists an approximation algorithm!

๏‚ก Consider the Hill Climbing algorithm to find S:

  • Input: Influence set of each node u = {v1, v2, โ€ฆ }
  • If we activate u, nodes {v1, v2, โ€ฆ } will eventually get active
  • Algorithm: At each step take the node u that gives

best marginal gain: max f(Si-1โˆช{u})

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 26

slide-27
SLIDE 27

Algorithm:

๏‚ก Start with S0={} ๏‚ก For i=1โ€ฆk

  • Take node v that max f(Si-1โˆช{v})
  • Let Si = Si-1โˆช{v}

๏‚ก Example:

  • Eval f({a}),โ€ฆ f({d}), pick max
  • Eval f({a,b}),โ€ฆ f({a,d}), pick max
  • Eval f(a,b,c},โ€ฆ f({a,b,d}, pick โ€ฆ

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

a b c a b c d d f(Si-1โˆช{v}) e e

27

slide-28
SLIDE 28

๏‚ก Hill climbing produces a solution S

where: f(S) โ‰ฅ(1-1/e)*OPT (f(S)>0.63*OPT)

[Nemhauser, Fisher, Wolsey โ€™78, Kempe, Kleinberg, Tardos โ€˜03]

๏‚ก Claim holds for functions f() with 2 properties:

  • f is monotone: (activating more nodes doesnโ€™t hurt)

if S โІ T then f(S) โ‰ค f(T) and f({})=0

  • f is submodular: (activating each additional node helps less)

adding an element to a set gives less improvement than adding it to one of its subsets: โˆ€S โІ T

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 28

Gain of adding a node to a small set Gain of adding a node to a large set

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

slide-29
SLIDE 29

๏‚ก Diminishing returns:

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 29

f(X) Solution size, |X|

Gain of adding a node to a small set Gain of adding a node to a large set

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

f(S) f(S โˆช{u}) f(T โˆช{u})

โˆ€S โІ T

f(T) Adding u to T helps less!

slide-30
SLIDE 30
slide-31
SLIDE 31

๏‚ก We must show our f() is submodular: ๏‚ก โˆ€S โІ T ๏‚ก Basic fact 1:

  • If f1(x), โ€ฆ,fk(x) are submodular, and c1,โ€ฆ,ck โ‰ฅ 0

then F x = โˆ‘ ๐‘—๐‘— โˆ™ ๐‘—

๐‘— ๐‘ฆ ๐‘—

is also submodular

(Linear combination of submodular functions is a submodular function)

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 31

Gain of adding a node to a small set Gain of adding a node to a large set

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

(trivially uโˆ‰T)

slide-32
SLIDE 32

๏‚ก โˆ€S โІ T: ๏‚ก Basic fact 2: A simple submodular function

  • Sets A1, โ€ฆ, Am
  • ๐‘— ๐‘‡ = โ‹ƒ

๐ต๐‘—

๐‘—โˆˆ๐‘‡

(size of the union of sets Ai, iโˆˆS)

  • Claim: f(S) is submodular!

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 32

S T u

Gain of adding u to a small set Gain of adding u to a large set

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

S โІ T

The more sets you already have the less new area a new set will cover

slide-33
SLIDE 33

๏‚ก Principle of deferred decision:

  • Flip all the coins at the

beginning and record which edges fire successfully.

  • Now we have a

deterministic graph!

  • Edges which succeed are live

๏‚ก For the i-th realization of coin flips

  • fi(S) = size of the set reachable by

live-edge paths from nodes in S

  • fi(S={a,b}) = {a,f,c,g,b}
  • fi(S={a,d}) = {a,f,c,g,d,e,h}

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 33

c b e g f h i

a d

Influence sets: fi(a) = {a,f,c,g} fi(d) = {d,e,h} fi(b) = {b,c}, โ€ฆ

slide-34
SLIDE 34

๏‚ก Fix outcome i of coin flips ๏‚ก fi(v) = set of nodes

reachable from v on live-edge paths

๏‚ก fi(S) = size of cascades

from S given coin flips i

๏‚ก ๐‘—

๐‘— ๐‘‡ = โ‹ƒ

๐‘—

๐‘—(๐‘ค) ๐‘คโˆˆ๐‘‡

โ‡’ fi(S) is submodular

  • fi(v) are sets and fi(S) is the size of the union

๏‚ก Expected influence set size:

๐‘— ๐‘‡ = โˆ‘ ๐‘—

๐‘—(๐‘‡) ๐‘—

โ‡’ f(S) is submodular!

  • f(S) is linear combination of submodular functions

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 34

c b e g f h i

a d

slide-35
SLIDE 35
slide-36
SLIDE 36

Claim: If f(S) is monotone and submodular. Hill climbing produces a solution S where: f(S) โ‰ฅ(1-1/e)*OPT (f(S)>0.63*OPT)

๏‚ก Setting

  • Keep adding nodes that give the largest gain
  • Start with S0={}, produce sets S1, S2,โ€ฆ,Sk
  • Add elements one by one
  • Marginal gain: ฮดi = f(Si) - f(Si-1)
  • Let T={t1โ€ฆtk} be the optimal set of size k

๏‚ก We need to show: f(S) โ‰ฅ (1-1/e) f(T)

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 36

slide-37
SLIDE 37

๐‘— ๐ต โˆช ๐ถ1 โˆ’ ๐‘— ๐ต โˆช ๐ถ0 + ๐‘— ๐ต โˆช ๐ถ2 โˆ’ ๐‘— ๐ต โˆช ๐ถ1 + ๐‘— ๐ต โˆช ๐ถ3 โˆ’ โ‹ฏ + ๐‘— ๐ต โˆช ๐ถ๐‘™ โˆ’ ๐‘—(๐ต โˆช ๐ถ๐‘™โˆ’1)

๏‚ก ๐‘—(๐ต โˆช ๐ถ) โˆ’ ๐‘—(๐ต) โ‰ค โˆ‘

[๐‘—(๐ต โˆช {๐‘

๐‘˜} ๐‘™ ๐‘˜=1

) โˆ’ ๐‘—(๐ต)]

  • where: B = {b1,โ€ฆ,bk} and f is submodular,

๏‚ก Proof:

  • Let Bi = {b1,โ€ฆbi}, so we have B1, B2, โ€ฆ, Bk=B
  • ๐‘— ๐ต โˆช B โˆ’ ๐‘— ๐ต = โˆ‘

๐‘— ๐ต โˆช ๐ถ๐‘— โˆ’ ๐‘— ๐ต โˆช ๐ถ๐‘—โˆ’1

๐‘™ ๐‘—=1

  • = โˆ‘

๐‘— ๐ต โˆช ๐ถ๐‘—โˆ’1 โˆช ๐‘๐‘— โˆ’ ๐‘— ๐ต โˆช ๐ถ๐‘—โˆ’1

๐‘™ ๐‘—=1

  • โ‰ค โˆ‘

๐‘— ๐ต โˆช {๐‘๐‘—} โˆ’ ๐‘— ๐ต

๐‘™ ๐‘—=1

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 37

Work out the sum. Everything but 1st and last term cancels out. By submodularity since AโˆชX โˆช{b} โЇ Aโˆช{b}

slide-38
SLIDE 38

๏‚ก ๐‘— ๐‘ˆ โ‰ค ๐‘— ๐‘‡๐‘— โˆช ๐‘ˆ ๏‚ก = ๐‘— ๐‘‡๐‘— โˆช ๐‘ˆ โˆ’ ๐‘— ๐‘‡๐‘— + ๐‘— ๐‘‡๐‘— ๏‚ก โ‰ค โˆ‘

๐‘— ๐‘‡๐‘— โˆช {๐‘—๐‘˜} โˆ’ ๐‘— ๐‘‡๐‘— + ๐‘—(๐‘‡๐‘—)

๐‘™ ๐‘˜=1

๏‚ก โ‰ค โˆ‘

๐œ€๐‘—+1

๐‘™ ๐‘˜=1

+ ๐‘— ๐‘‡๐‘— = ๐‘— ๐‘‡๐‘— + ๐‘™ ๐œ€๐‘—+1

๏‚ก Thus: ๐‘— ๐‘ˆ โ‰ค ๐‘— ๐‘‡๐‘— + ๐‘™ ๐œ€๐‘—+1 ๏‚ก โ‡’ ๐œ€๐‘—+1 โ‰ฅ

1 ๐‘™ [๐‘— ๐‘ˆ โˆ’ ๐‘—(๐‘‡๐‘—)]

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 38

(by monotonicity) (by prev. slide) T = {t1, โ€ฆ tk} tj is one choice

  • f a next

element, and we greedily choose the best one, for a gain of ฮดi+1

slide-39
SLIDE 39

๏‚ก We just showed: ๐œ€๐‘—+1 โ‰ฅ 1

๐‘™ [๐‘— ๐‘ˆ โˆ’ ๐‘—(๐‘‡๐‘—)]

๏‚ก What is f(Si+1)?

  • ๐‘— ๐‘‡๐‘—+1 = ๐‘— ๐‘‡๐‘— + ๐œ€๐‘—+1
  • โ‰ฅ ๐‘— ๐‘‡๐‘— +

1 ๐‘™ ๐‘— ๐‘ˆ โˆ’ ๐‘— ๐‘‡๐‘—

  • = 1 โˆ’

1 ๐‘™ ๐‘— ๐‘‡๐‘— + 1 ๐‘™ ๐‘—(๐‘ˆ)

๏‚ก What is f(Sk)?

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 39

slide-40
SLIDE 40

๏‚ก Claim:

Proof by induction:

๏‚ก ๐‘— = 0:

  • ๐‘— ๐‘‡0 = ๐‘—({}) = 0
  • 1 โˆ’ 1 โˆ’

1 ๐‘™

๐‘— ๐‘ˆ = 0

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 40

) ( 1 1 1 ) ( T f k S f

i i

๏ฃบ ๏ฃบ ๏ฃป ๏ฃน ๏ฃฏ ๏ฃฏ ๏ฃฐ ๏ฃฎ ๏ฃท ๏ฃธ ๏ฃถ ๏ฃฌ ๏ฃญ ๏ฃซ โˆ’ โˆ’ โ‰ฅ

slide-41
SLIDE 41

๏‚ก Claim:

Proof by induction:

๏‚ก At ๐‘— + 1:

  • ๐‘— ๐‘‡๐‘—+1 โ‰ฅ 1 โˆ’

1 ๐‘™ ๐‘— ๐‘‡๐‘— + 1 ๐‘™ ๐‘— ๐‘ˆ

  • โ‰ฅ 1 โˆ’

1 ๐‘™

1 โˆ’ 1 โˆ’

1 ๐‘™ ๐‘—

๐‘— ๐‘ˆ +

1 ๐‘™ ๐‘— ๐‘ˆ

  • = 1 โˆ’ 1 โˆ’

1 ๐‘™ ๐‘—+1

๐‘—(๐‘ˆ)

10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 41

) ( 1 1 1 ) ( T f k S f

i i

๏ฃบ ๏ฃบ ๏ฃป ๏ฃน ๏ฃฏ ๏ฃฏ ๏ฃฐ ๏ฃฎ ๏ฃท ๏ฃธ ๏ฃถ ๏ฃฌ ๏ฃญ ๏ฃซ โˆ’ โˆ’ โ‰ฅ

slide-42
SLIDE 42

๏‚ก Thus:

๐‘— ๐‘‡ = ๐‘— ๐‘‡๐‘™ โ‰ฅ 1 โˆ’ 1 โˆ’ 1 ๐‘™

๐‘™

๐‘— ๐‘ˆ

๏‚ก Then:

๐‘— ๐‘‡๐‘™ โ‰ฅ 1 โˆ’ 1 ๐‘— ๐‘—(๐‘ˆ)

10/25/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 42

โ‰ค ๐Ÿ ๐’‡ qed

slide-43
SLIDE 43

We just proved:

๏‚ก Hill climbing finds solution S which

f(S) โ‰ฅ (1-1/e)*OPT

  • this is a data independent bound
  • This is a worst case bound
  • No matter what is the input data (influence sets) we

know that Hill Climbing wonโ€™t do worse than 0.63*OPT

Data dependent bound:

๏‚ก We want a bound whose value depends on

the input data

If the data is โ€œeasyโ€, we are likely doing better than 63% of OPT

10/25/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 43

slide-44
SLIDE 44

๏‚ก Suppose S is some solution to

argmaxS f(S) s.t. |S| โ‰ค k f() is monotone & submodular and let T = {t1,โ€ฆ,tk} be the OPT solution

๏‚ก CLAIM:

For each u โˆ‰ S let ฮดu = f(Sโˆช{u})-f(S) Order ฮดu so that ฮด1 โ‰ฅ ฮด2 โ‰ฅ โ€ฆ โ‰ฅ ฮดn Then: f(T) โ‰ค f(S) + โˆ‘i=1

k ฮดi

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 44

slide-45
SLIDE 45

๏‚ก For each u โˆ‰ S let ฮดu = f(Sโˆช{u})-f(S)

Order ฮดu so that ฮด1 โ‰ฅ ฮด2 โ‰ฅ โ€ฆ โ‰ฅ ฮดn Then: f(T) โ‰ค f(S) + โˆ‘i=1

k ฮดi

๏‚ก Proof:

  • ๐‘— ๐‘ˆ โ‰ค ๐‘— ๐‘ˆ โˆช ๐‘‡ =

๐‘— ๐‘‡ + โˆ‘ ๐‘— ๐‘‡ โˆช ๐‘—1 โ€ฆ ๐‘—๐‘— โˆ’ ๐‘— ๐‘‡ โˆช ๐‘—1 โ€ฆ ๐‘—๐‘—โˆ’1

๐‘™ ๐‘—=1

  • โ‰ค ๐‘— ๐‘‡ + โˆ‘

๐‘— ๐‘‡ โˆช ๐‘—๐‘— โˆ’ ๐‘— ๐‘‡

๐‘™ ๐‘—=1

  • = ๐‘— ๐‘‡ + โˆ‘

๐œ€๐‘ข๐‘—

๐‘™ ๐‘—=1

  • โ‰ค ๐‘— ๐‘‡ + โˆ‘

๐œ€๐‘—

๐‘™ ๐‘—=1

โ‡’ ๐‘— ๐‘ˆ โ‰ค ๐‘— ๐‘‡ + โˆ‘ ๐œ€๐‘—

๐‘™ ๐‘—=1

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 45

slide-46
SLIDE 46

10/25/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 46

slide-47
SLIDE 47

What do we know about

  • ptimizing submodular

functions?

๏‚ก A hill-climbing is near optimal

(1-1/e (~63%) of OPT)

๏‚ก But

  • Hill-climbing algorithm is slow
  • At each iteration we need to re-

evaluate marginal gains

  • It scales as O(n k)

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 47

a b c d reward e

Hill-climbing

Add node with highest marginal gain

slide-48
SLIDE 48

๏‚ก In round i+1: So far we picked Si = {s1,โ€ฆ,si}

  • Now pick si+1 = argmaxu F(Si โˆช {u}) - F(Si)
  • maximize the โ€œmarginal benefitโ€ ฮดu(Si) = F(Si โˆช {u}) - F(Si)

๏‚ก By submodularity property:

๐‘— ๐‘‡๐‘— โˆช ๐‘ฃ โˆ’ ๐‘— ๐‘‡๐‘— โ‰ฅ ๐‘— ๐‘‡

๐‘˜ โˆช ๐‘ฃ

โˆ’ ๐‘— ๐‘‡

๐‘˜ for i<j

๏‚ก Observation: Submodularity implies

i โ‰ค j โ‡’ ฮดx(Si) โ‰ฅ ฮดx(Sj) since SiโІSj Marginal benefits ฮดx only shrink!

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 48

u ฮดu(Si) โ‰ฅ ฮดu(Si+1)

[Leskovec et al., KDD โ€™07]

Activating node u in step i helps more than activating it at step j (j>i)

slide-49
SLIDE 49

๏‚ก Idea:

  • Use ฮดi as upper-bound on ฮดj (j>i)

๏‚ก Lazy hill-climbing:

  • Keep an ordered list of marginal

benefits ฮดi from previous iteration

  • Re-evaluate ฮดi only for top node
  • Re-sort and prune

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 49

a b c d Marginal gain e

[Leskovec et al., KDD โ€™07]

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

S โІ T S1={a}

slide-50
SLIDE 50

๏‚ก Idea:

  • Use ฮดi as upper-bound on ฮดj (j>i)

๏‚ก Lazy hill-climbing:

  • Keep an ordered list of marginal

benefits ฮดi from previous iteration

  • Re-evaluate ฮดi only for top node
  • Re-sort and prune

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 50

a d b c e Marginal gain

[Leskovec et al., KDD โ€™07]

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

S โІ T S1={a}

slide-51
SLIDE 51

๏‚ก Idea:

  • Use ฮดi as upper-bound on ฮดj (j>i)

๏‚ก Lazy hill-climbing:

  • Keep an ordered list of marginal

benefits ฮดi from previous iteration

  • Re-evaluate ฮดi only for top node
  • Re-sort and prune

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 51

a c d b e Marginal gain

[Leskovec et al., KDD โ€™07]

f(S โˆช {u}) โ€“ f(S) โ‰ฅ f(T โˆช {u}) โ€“ f(T)

S โІ T S1={a} S2={a,b}

slide-52
SLIDE 52
slide-53
SLIDE 53

๏‚ก Given a real city water

distribution network

๏‚ก And data on how

contaminants spread in the network

๏‚ก Detect the

contaminant as quickly as possible

๏‚ก Problem posed by the

US Environmental Protection Agency

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 53

S S

[Leskovec et al., KDD โ€™07]

slide-54
SLIDE 54

๏‚ก Given a graph G(V,E) ๏‚ก Data on how outbreaks spread over the

network:

  • for each outbreak i we know the

time T(i,u) when outbreak i contaminated node u

๏‚ก Select a subset of nodes A that maximize

the expected reward:

๏‚ก Reward: Save the most people

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 54

Reward for detecting

  • utbreak i

[Leskovec et al., KDD โ€™07]

slide-55
SLIDE 55

๏‚ก Observation: Diminishing returns

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 55

S1 S2

Placement A={s1, s2}

Sโ€™

New sensor: Adding sโ€™ helps a lot

S2 S4 S1 S3

Placement Aโ€™={s1, s2, s3, s4}

sโ€™

Adding sโ€™ helps very little

[Leskovec et al., KDD โ€™07]

slide-56
SLIDE 56

๏‚ก Claim:

  • The reward function is submodular

๏‚ก Consider outbreak i:

  • Ri(uk) = set of nodes saved from uk
  • Ri(A) = size of union Ri(uk), ukโˆˆA

โ‡’Ri is submodular

๏‚ก Global optimization:

  • R(A) = โˆ‘i Prob(i) Ri(A)

โ‡’ R(A) is submodular

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 56

u1 fi(u1)

  • utbreak i

u2 fi(u2)

[Leskovec et al., KDD โ€™07]

slide-57
SLIDE 57

๏‚ก Real metropolitan area

water network

  • V = 21,000 nodes
  • E = 25,000 pipes

๏‚ก Use a cluster of 50 machines for a month ๏‚ก Simulate 3.6 million epidemic scenarios

(152 GB of epidemic data)

๏‚ก By exploiting sparsity we fit it into main

memory (16GB)

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 57

[Leskovec et al., KDD โ€™07]

slide-58
SLIDE 58

Submodularity gives data-dependent bounds on the performance of any algorithm

58

Solution quality F(A) Higher is better

5 10 15 20 0.2 0.4 0.6 0.8 1 1.2 1.4

โ€œOfflineโ€

the (1-1/e) bound

Data-dependent bound Hill Climbing

Number of sensors placed

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

slide-59
SLIDE 59

๏‚ก Placement heuristics perform much worse

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 59

[Leskovec et al., KDD โ€™07]

slide-60
SLIDE 60

= I have 10 minutes. Which blogs should I read to be most up to date? = Who are the most influential bloggers?

60

?

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

slide-61
SLIDE 61

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 61

Detect all stories but late.

Want to read things before others do.

Detect blue & yellow soon but miss red.

slide-62
SLIDE 62

๏‚ก Online bound is much tighter:

  • 13% instead of 37%

(1-1/e) bound Data dependent bound Hill Climbing

10/20/2010 62 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

[Leskovec et al., KDD โ€™07]

slide-63
SLIDE 63

๏‚ก Heuristics perform much worse

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 63

[Leskovec et al., KDD โ€™07]

slide-64
SLIDE 64

๏‚ก Lazy evaluation

runs 700 times faster than naรฏve Hill Climbing algorithm

10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 64

[Leskovec et al., KDD โ€™07]

Naรฏve hill climbing Lazy hill climbing