http://cs224w.stanford.edu Epidemic Model based on Random Trees (a - - PowerPoint PPT Presentation
http://cs224w.stanford.edu Epidemic Model based on Random Trees (a - - PowerPoint PPT Presentation
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Epidemic Model based on Random Trees (a variant of branching processes) Root node, patient 0 A patient meets d other
๏ก Epidemic Model based on Random Trees
- (a variant of branching processes)
- A patient meets d other people
- With probability q>0 infects each
- f them
๏ก Q: For which values of d and q
does the epidemic run forever?
- Run forever: lim๐โโ๐ ๐๐๐๐๐๐๐๐ ๐๐๐๐
๐๐ ๐๐๐๐๐ ๐ > 0
- Die out: -- || -- = 0
10/23/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 2
Root node, โpatient 0โ Start of epidemic d subtrees
๏ก pn = prob. there is an infected node at depth n ๏ก We need: lim๐โโ ๐๐ =? (based on q and d) ๏ก Need recurrence for pn
๐๐ = 1 โ 1 โ ๐๐๐โ1 ๐
๏ก lim
๐โโ ๐๐ = result of iterating
f x = 1 โ 1 โ ๐๐ฆ ๐
- Starting at x=1 (since p1=1)
10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 3
No infected node at depth n
10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 4
x f(x) 1 y=x Going to first fixed point
f 0 = 0, f 1 = 1 f 1 = 1 โ 1 โ q d < 1 f โฒ x = qd 1 โ qx dโ1
y = f x
f โฒ 0 = qd โถ f โฒ(x) is monotone decreasing on [0,1]
When is this going to 0?
What do we know about f(x)?
10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 5
x f(x) 1 y=x y = f x
We need f(x) to be bellow y=x! f โฒ 0 < 1
lim
๐โโ ๐๐ = 0 ? to ๐๐ < 1
qd = expected # of people at we infect
๏ก In this model nodes only go from
inactive โ active
๏ก Can generalize to allow nodes to alternate
between active and inactive state by:
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 6
๏ก Generalizing to model to Virus Propagation
2 Parameters:
๏ก (Virus) birth rate ฮฒ:
- probability than an infected neighbor attacks
๏ก (Virus) death rate ฮด:
- probability that an infected node heals
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 8
Infected Healthy N N1 N3 N2
- Prob. ฮฒ
- Prob. ฮด
๏ก General scheme for epidemic models:
- Each node can go through phases:
- Transition probs. are governed by model parameters
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
Sโฆsusceptible Eโฆexposed Iโฆinfected Rโฆrecovered Zโฆimmune
9
๏ก Node goes through phases
- Models chickenpox or plague:
- Once you heal, you can never get infected again
๏ก Assuming perfect mixing
- network is a complete graph
the model dynamics is
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 10
Susceptible Infected Recovered time Number of nodes
๏ก Susceptible-Infective-Susceptible (SIS) model ๏ก Cured nodes immediately become susceptible ๏ก Virus โstrengthโ: s = ฮฒ / ฮด ๏ก Node state transition diagram:
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 11
Susceptible Infective
Infected by neighbor with prob. ฮฒ Cured internally with prob. ฮด
๏ก Models flu:
- Susceptible node
becomes infected
- The node then heals
and become susceptible again
๏ก Assuming perfect
mixing (complete graph):
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
Susceptible Infected
I SI dt dI ฮด ฮฒ โ =
I SI dt dS ฮด ฮฒ + โ =
time Number of nodes
12
I(t) S(t)
๏ก SIS Model ๏ก Epidemic threshold of a graph G is a
value of t, such that:
- If virus strength s = ฮฒ / ฮด < t
the epidemic can not happen (it eventually dies out)
๏ก Given a graph what is its epidemic
threshold?
Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 13 10/13/2009
๏ก We have no epidemic if:
Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
ฮฒ/ฮด < ฯ = 1/ ฮป1,A
โบ ฮป1,A alone captures the property of the graph!
(Virus) Birth rate (Virus) Death rate Epidemic threshold largest eigenvalue
- f adj. matrix A
[Wang et al. 2003]
10/13/2009 14
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 15
100 200 300 400 500 250 500 750 1000
Time Number of Infected Nodes
ฮด: 0.05 0.06 0.07 Oregon ฮฒ = 0.001
ฮฒ/ฮด > ฯ (above threshold) ฮฒ/ฮด = ฯ (at the threshold) ฮฒ/ฮด < ฯ (below threshold)
10,900 nodes and 31,180 edges
[Wang et al. 2003]
๏ก Does it matter how many people are
initially infected?
Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 10/13/2009 16
10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 17
๏ก Blogs โ Information epidemics
- Which are the influential/infectious blogs?
- Which blogs create big cascades?
๏ก Viral marketing
- Who are the influencers?
- Where should I advertise?
๏ก Disease spreading
- Where to place monitoring
stations to detect epidemics?
18 10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
vs.
๏ก Independent Cascade Model
- Directed finite G=(V,E)
- Set S starts out with new behavior
- Say nodes with this behavior are โactiveโ
- Each edge (v,w) has a probability pvw
- If node v is active, it gets one chance to
make w active, with probability pvw
- Each edge fires at most once
๏ก Does scheduling matter? No
- E.g., u,v both active, doesnโt matter which fires first
- But the time moves in discrete steps
10/24/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 19
๏ก Initially some nodes S are active ๏ก Each edge (v,w) has probability (weight) pvw ๏ก When node v becomes active:
- It activates each out-neighbor w with prob. pvw
๏ก Activations spread through the network
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 20
0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.4 0.3 0.3 0.3 0.3 0.3 0.3 0.2
e g f c b a d h i f g e
๏ก S: is initial active set ๏ก f(S): the expected size of final active set ๏ก Set S is more influential if f(S) is larger
f({a,b} < f({a,c}) < f({a,d})
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 21
graph G
a b d c โฆ influence set
- f a node
Problem:
๏ก Most influential set of
size k: set S of k nodes producing largest expected cascade size f(S) if activated
[Domingos-Richardson โ01]
๏ก Optimization problem:
10/20/2010 22 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
) ( max
k size
- f
S
S f
0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.4 0.3 0.3 0.3 0.3 0.3 0.3 0.2 c b e a d g f h i Influence set of b
๏ก Most influential set of k nodes: set S on k
nodes producing largest expected cascade size f(S) if activated
๏ก The optimization problem: ๏ก How hard is this problem?
- NP-HARD!
- Show that finding most influential
set is at least as hard as a vertex cover
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 23
) ( max
k size
- f
S
S f
๏ก Vertex cover problem:
- Given universe of elements U={u1,โฆ,un}
and sets S1,โฆ, Sm โ U
- Are there k sets among S1,โฆ, Sm such that
their union is U?
๏ก Goal:
Encode vertex cover as an instance of
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 24
) ( max
k size
- f
S
S f
U S1
S2 S3
S4
๏ก Given a vertex cover instance with sets S1,โฆ, Sm ๏ก Build a bipartite โS-to-Uโ graph: ๏ก There exists a set S of size k with f(S)=k+n
iff there exists a size k set cover
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 25
Construction:
- Create edge
(Si,u) โSi โ uโSi
- - directed edge
from sets to their elements
- Put weight 1 on
each edge
u1 u2 u3 un e.g.: S1={u1, u2, u3}
1 1 1
S1 S2 S3 Sm Note: Optimal solution is always a set of Si This is hard in general, could be special cases that are easier
๏ก Bad news:
- Influence maximization is NP-hard
๏ก Next, good news:
- There exists an approximation algorithm!
๏ก Consider the Hill Climbing algorithm to find S:
- Input: Influence set of each node u = {v1, v2, โฆ }
- If we activate u, nodes {v1, v2, โฆ } will eventually get active
- Algorithm: At each step take the node u that gives
best marginal gain: max f(Si-1โช{u})
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 26
Algorithm:
๏ก Start with S0={} ๏ก For i=1โฆk
- Take node v that max f(Si-1โช{v})
- Let Si = Si-1โช{v}
๏ก Example:
- Eval f({a}),โฆ f({d}), pick max
- Eval f({a,b}),โฆ f({a,d}), pick max
- Eval f(a,b,c},โฆ f({a,b,d}, pick โฆ
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
a b c a b c d d f(Si-1โช{v}) e e
27
๏ก Hill climbing produces a solution S
where: f(S) โฅ(1-1/e)*OPT (f(S)>0.63*OPT)
[Nemhauser, Fisher, Wolsey โ78, Kempe, Kleinberg, Tardos โ03]
๏ก Claim holds for functions f() with 2 properties:
- f is monotone: (activating more nodes doesnโt hurt)
if S โ T then f(S) โค f(T) and f({})=0
- f is submodular: (activating each additional node helps less)
adding an element to a set gives less improvement than adding it to one of its subsets: โS โ T
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 28
Gain of adding a node to a small set Gain of adding a node to a large set
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
๏ก Diminishing returns:
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 29
f(X) Solution size, |X|
Gain of adding a node to a small set Gain of adding a node to a large set
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
f(S) f(S โช{u}) f(T โช{u})
โS โ T
f(T) Adding u to T helps less!
๏ก We must show our f() is submodular: ๏ก โS โ T ๏ก Basic fact 1:
- If f1(x), โฆ,fk(x) are submodular, and c1,โฆ,ck โฅ 0
then F x = โ ๐๐ โ ๐
๐ ๐ฆ ๐
is also submodular
(Linear combination of submodular functions is a submodular function)
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 31
Gain of adding a node to a small set Gain of adding a node to a large set
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
(trivially uโT)
๏ก โS โ T: ๏ก Basic fact 2: A simple submodular function
- Sets A1, โฆ, Am
- ๐ ๐ = โ
๐ต๐
๐โ๐
(size of the union of sets Ai, iโS)
- Claim: f(S) is submodular!
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 32
S T u
Gain of adding u to a small set Gain of adding u to a large set
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
S โ T
The more sets you already have the less new area a new set will cover
๏ก Principle of deferred decision:
- Flip all the coins at the
beginning and record which edges fire successfully.
- Now we have a
deterministic graph!
- Edges which succeed are live
๏ก For the i-th realization of coin flips
- fi(S) = size of the set reachable by
live-edge paths from nodes in S
- fi(S={a,b}) = {a,f,c,g,b}
- fi(S={a,d}) = {a,f,c,g,d,e,h}
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 33
c b e g f h i
a d
Influence sets: fi(a) = {a,f,c,g} fi(d) = {d,e,h} fi(b) = {b,c}, โฆ
๏ก Fix outcome i of coin flips ๏ก fi(v) = set of nodes
reachable from v on live-edge paths
๏ก fi(S) = size of cascades
from S given coin flips i
๏ก ๐
๐ ๐ = โ
๐
๐(๐ค) ๐คโ๐
โ fi(S) is submodular
- fi(v) are sets and fi(S) is the size of the union
๏ก Expected influence set size:
๐ ๐ = โ ๐
๐(๐) ๐
โ f(S) is submodular!
- f(S) is linear combination of submodular functions
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 34
c b e g f h i
a d
Claim: If f(S) is monotone and submodular. Hill climbing produces a solution S where: f(S) โฅ(1-1/e)*OPT (f(S)>0.63*OPT)
๏ก Setting
- Keep adding nodes that give the largest gain
- Start with S0={}, produce sets S1, S2,โฆ,Sk
- Add elements one by one
- Marginal gain: ฮดi = f(Si) - f(Si-1)
- Let T={t1โฆtk} be the optimal set of size k
๏ก We need to show: f(S) โฅ (1-1/e) f(T)
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 36
๐ ๐ต โช ๐ถ1 โ ๐ ๐ต โช ๐ถ0 + ๐ ๐ต โช ๐ถ2 โ ๐ ๐ต โช ๐ถ1 + ๐ ๐ต โช ๐ถ3 โ โฏ + ๐ ๐ต โช ๐ถ๐ โ ๐(๐ต โช ๐ถ๐โ1)
๏ก ๐(๐ต โช ๐ถ) โ ๐(๐ต) โค โ
[๐(๐ต โช {๐
๐} ๐ ๐=1
) โ ๐(๐ต)]
- where: B = {b1,โฆ,bk} and f is submodular,
๏ก Proof:
- Let Bi = {b1,โฆbi}, so we have B1, B2, โฆ, Bk=B
- ๐ ๐ต โช B โ ๐ ๐ต = โ
๐ ๐ต โช ๐ถ๐ โ ๐ ๐ต โช ๐ถ๐โ1
๐ ๐=1
- = โ
๐ ๐ต โช ๐ถ๐โ1 โช ๐๐ โ ๐ ๐ต โช ๐ถ๐โ1
๐ ๐=1
- โค โ
๐ ๐ต โช {๐๐} โ ๐ ๐ต
๐ ๐=1
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 37
Work out the sum. Everything but 1st and last term cancels out. By submodularity since AโชX โช{b} โ Aโช{b}
๏ก ๐ ๐ โค ๐ ๐๐ โช ๐ ๏ก = ๐ ๐๐ โช ๐ โ ๐ ๐๐ + ๐ ๐๐ ๏ก โค โ
๐ ๐๐ โช {๐๐} โ ๐ ๐๐ + ๐(๐๐)
๐ ๐=1
๏ก โค โ
๐๐+1
๐ ๐=1
+ ๐ ๐๐ = ๐ ๐๐ + ๐ ๐๐+1
๏ก Thus: ๐ ๐ โค ๐ ๐๐ + ๐ ๐๐+1 ๏ก โ ๐๐+1 โฅ
1 ๐ [๐ ๐ โ ๐(๐๐)]
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 38
(by monotonicity) (by prev. slide) T = {t1, โฆ tk} tj is one choice
- f a next
element, and we greedily choose the best one, for a gain of ฮดi+1
๏ก We just showed: ๐๐+1 โฅ 1
๐ [๐ ๐ โ ๐(๐๐)]
๏ก What is f(Si+1)?
- ๐ ๐๐+1 = ๐ ๐๐ + ๐๐+1
- โฅ ๐ ๐๐ +
1 ๐ ๐ ๐ โ ๐ ๐๐
- = 1 โ
1 ๐ ๐ ๐๐ + 1 ๐ ๐(๐)
๏ก What is f(Sk)?
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 39
๏ก Claim:
Proof by induction:
๏ก ๐ = 0:
- ๐ ๐0 = ๐({}) = 0
- 1 โ 1 โ
1 ๐
๐ ๐ = 0
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 40
) ( 1 1 1 ) ( T f k S f
i i
๏ฃบ ๏ฃบ ๏ฃป ๏ฃน ๏ฃฏ ๏ฃฏ ๏ฃฐ ๏ฃฎ ๏ฃท ๏ฃธ ๏ฃถ ๏ฃฌ ๏ฃญ ๏ฃซ โ โ โฅ
๏ก Claim:
Proof by induction:
๏ก At ๐ + 1:
- ๐ ๐๐+1 โฅ 1 โ
1 ๐ ๐ ๐๐ + 1 ๐ ๐ ๐
- โฅ 1 โ
1 ๐
1 โ 1 โ
1 ๐ ๐
๐ ๐ +
1 ๐ ๐ ๐
- = 1 โ 1 โ
1 ๐ ๐+1
๐(๐)
10/13/2009 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 41
) ( 1 1 1 ) ( T f k S f
i i
๏ฃบ ๏ฃบ ๏ฃป ๏ฃน ๏ฃฏ ๏ฃฏ ๏ฃฐ ๏ฃฎ ๏ฃท ๏ฃธ ๏ฃถ ๏ฃฌ ๏ฃญ ๏ฃซ โ โ โฅ
๏ก Thus:
๐ ๐ = ๐ ๐๐ โฅ 1 โ 1 โ 1 ๐
๐
๐ ๐
๏ก Then:
๐ ๐๐ โฅ 1 โ 1 ๐ ๐(๐)
10/25/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 42
โค ๐ ๐ qed
We just proved:
๏ก Hill climbing finds solution S which
f(S) โฅ (1-1/e)*OPT
- this is a data independent bound
- This is a worst case bound
- No matter what is the input data (influence sets) we
know that Hill Climbing wonโt do worse than 0.63*OPT
Data dependent bound:
๏ก We want a bound whose value depends on
the input data
If the data is โeasyโ, we are likely doing better than 63% of OPT
10/25/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 43
๏ก Suppose S is some solution to
argmaxS f(S) s.t. |S| โค k f() is monotone & submodular and let T = {t1,โฆ,tk} be the OPT solution
๏ก CLAIM:
For each u โ S let ฮดu = f(Sโช{u})-f(S) Order ฮดu so that ฮด1 โฅ ฮด2 โฅ โฆ โฅ ฮดn Then: f(T) โค f(S) + โi=1
k ฮดi
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 44
๏ก For each u โ S let ฮดu = f(Sโช{u})-f(S)
Order ฮดu so that ฮด1 โฅ ฮด2 โฅ โฆ โฅ ฮดn Then: f(T) โค f(S) + โi=1
k ฮดi
๏ก Proof:
- ๐ ๐ โค ๐ ๐ โช ๐ =
๐ ๐ + โ ๐ ๐ โช ๐1 โฆ ๐๐ โ ๐ ๐ โช ๐1 โฆ ๐๐โ1
๐ ๐=1
- โค ๐ ๐ + โ
๐ ๐ โช ๐๐ โ ๐ ๐
๐ ๐=1
- = ๐ ๐ + โ
๐๐ข๐
๐ ๐=1
- โค ๐ ๐ + โ
๐๐
๐ ๐=1
โ ๐ ๐ โค ๐ ๐ + โ ๐๐
๐ ๐=1
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 45
10/25/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 46
What do we know about
- ptimizing submodular
functions?
๏ก A hill-climbing is near optimal
(1-1/e (~63%) of OPT)
๏ก But
- Hill-climbing algorithm is slow
- At each iteration we need to re-
evaluate marginal gains
- It scales as O(n k)
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 47
a b c d reward e
Hill-climbing
Add node with highest marginal gain
๏ก In round i+1: So far we picked Si = {s1,โฆ,si}
- Now pick si+1 = argmaxu F(Si โช {u}) - F(Si)
- maximize the โmarginal benefitโ ฮดu(Si) = F(Si โช {u}) - F(Si)
๏ก By submodularity property:
๐ ๐๐ โช ๐ฃ โ ๐ ๐๐ โฅ ๐ ๐
๐ โช ๐ฃ
โ ๐ ๐
๐ for i<j
๏ก Observation: Submodularity implies
i โค j โ ฮดx(Si) โฅ ฮดx(Sj) since SiโSj Marginal benefits ฮดx only shrink!
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 48
u ฮดu(Si) โฅ ฮดu(Si+1)
[Leskovec et al., KDD โ07]
Activating node u in step i helps more than activating it at step j (j>i)
๏ก Idea:
- Use ฮดi as upper-bound on ฮดj (j>i)
๏ก Lazy hill-climbing:
- Keep an ordered list of marginal
benefits ฮดi from previous iteration
- Re-evaluate ฮดi only for top node
- Re-sort and prune
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 49
a b c d Marginal gain e
[Leskovec et al., KDD โ07]
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
S โ T S1={a}
๏ก Idea:
- Use ฮดi as upper-bound on ฮดj (j>i)
๏ก Lazy hill-climbing:
- Keep an ordered list of marginal
benefits ฮดi from previous iteration
- Re-evaluate ฮดi only for top node
- Re-sort and prune
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 50
a d b c e Marginal gain
[Leskovec et al., KDD โ07]
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
S โ T S1={a}
๏ก Idea:
- Use ฮดi as upper-bound on ฮดj (j>i)
๏ก Lazy hill-climbing:
- Keep an ordered list of marginal
benefits ฮดi from previous iteration
- Re-evaluate ฮดi only for top node
- Re-sort and prune
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 51
a c d b e Marginal gain
[Leskovec et al., KDD โ07]
f(S โช {u}) โ f(S) โฅ f(T โช {u}) โ f(T)
S โ T S1={a} S2={a,b}
๏ก Given a real city water
distribution network
๏ก And data on how
contaminants spread in the network
๏ก Detect the
contaminant as quickly as possible
๏ก Problem posed by the
US Environmental Protection Agency
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 53
S S
[Leskovec et al., KDD โ07]
๏ก Given a graph G(V,E) ๏ก Data on how outbreaks spread over the
network:
- for each outbreak i we know the
time T(i,u) when outbreak i contaminated node u
๏ก Select a subset of nodes A that maximize
the expected reward:
๏ก Reward: Save the most people
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 54
Reward for detecting
- utbreak i
[Leskovec et al., KDD โ07]
๏ก Observation: Diminishing returns
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 55
S1 S2
Placement A={s1, s2}
Sโ
New sensor: Adding sโ helps a lot
S2 S4 S1 S3
Placement Aโ={s1, s2, s3, s4}
sโ
Adding sโ helps very little
[Leskovec et al., KDD โ07]
๏ก Claim:
- The reward function is submodular
๏ก Consider outbreak i:
- Ri(uk) = set of nodes saved from uk
- Ri(A) = size of union Ri(uk), ukโA
โRi is submodular
๏ก Global optimization:
- R(A) = โi Prob(i) Ri(A)
โ R(A) is submodular
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 56
u1 fi(u1)
- utbreak i
u2 fi(u2)
[Leskovec et al., KDD โ07]
๏ก Real metropolitan area
water network
- V = 21,000 nodes
- E = 25,000 pipes
๏ก Use a cluster of 50 machines for a month ๏ก Simulate 3.6 million epidemic scenarios
(152 GB of epidemic data)
๏ก By exploiting sparsity we fit it into main
memory (16GB)
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 57
[Leskovec et al., KDD โ07]
Submodularity gives data-dependent bounds on the performance of any algorithm
58
Solution quality F(A) Higher is better
5 10 15 20 0.2 0.4 0.6 0.8 1 1.2 1.4
โOfflineโ
the (1-1/e) bound
Data-dependent bound Hill Climbing
Number of sensors placed
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
๏ก Placement heuristics perform much worse
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 59
[Leskovec et al., KDD โ07]
= I have 10 minutes. Which blogs should I read to be most up to date? = Who are the most influential bloggers?
60
?
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 61
Detect all stories but late.
Want to read things before others do.
Detect blue & yellow soon but miss red.
๏ก Online bound is much tighter:
- 13% instead of 37%
(1-1/e) bound Data dependent bound Hill Climbing
10/20/2010 62 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
[Leskovec et al., KDD โ07]
๏ก Heuristics perform much worse
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 63
[Leskovec et al., KDD โ07]
๏ก Lazy evaluation
runs 700 times faster than naรฏve Hill Climbing algorithm
10/20/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 64
[Leskovec et al., KDD โ07]
Naรฏve hill climbing Lazy hill climbing