http://cs224w.stanford.edu Spreading through networks: Spreading - - PowerPoint PPT Presentation

http cs224w stanford edu spreading through networks
SMART_READER_LITE
LIVE PREVIEW

http://cs224w.stanford.edu Spreading through networks: Spreading - - PowerPoint PPT Presentation

CS224W: Social and Information Network Analysis Jure Leskovec Stanford University Jure Leskovec, Stanford University http://cs224w.stanford.edu Spreading through networks: Spreading through networks: Cascading behavior Diffusion of


slide-1
SLIDE 1

CS224W: Social and Information Network Analysis Jure Leskovec Stanford University Jure Leskovec, Stanford University

http://cs224w.stanford.edu

slide-2
SLIDE 2

 Spreading through networks:

Spreading through networks:

  • Cascading behavior
  • Diffusion of innovations
  • Epidemics

 Examples:

  • Biological:

Biological:

  • Diseases via contagion
  • Technological:
  • Cascading failures
  • Cascading failures
  • Spread of information
  • Social:
  • Rumors, news, new technology
  • Viral marketing

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 2

slide-3
SLIDE 3

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 3

slide-4
SLIDE 4

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 4

slide-5
SLIDE 5

c1

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 5

slide-6
SLIDE 6

 One of the networks is a spread of a disease  One of the networks is a spread of a disease,

the other one is product recommendations

 Which is which?   Which is which? 

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 6

slide-7
SLIDE 7

 A fundamental process in social networks:

A fundamental process in social networks: Behaviors that cascade from node to node like an epidemic

N i i f d b l d

  • News, opinions, rumors, fads, urban legends, …
  • Word‐of‐mouth effects in marketing: rise of new websites,

free web based services

  • Virus, disease propagation
  • Change in social priorities: smoking, recycling

S t ti t i diff i bl

  • Saturation news coverage: topic diffusion among bloggers
  • Internet‐energized political campaigns
  • Cascading failures in financial markets

g

  • Localized effects: riots, people walking out of a lecture

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 7

slide-8
SLIDE 8

 Experimental studies of diffusion:

Experimental studies of diffusion:

  • Spread of new agricultural practices [Ryan‐Gross 1943]
  • Adoption of a new hybrid‐corn between the 259 farmers in

Iowa

  • Classical study of diffusion
  • Interpersonal network plays important role in adoption

p p y p p  Diffusion is a social process

  • Spread of new medical practices [Coleman et al. 1966]
  • Studied the adoption of a new drug between doctors in Illinois
  • Studied the adoption of a new drug between doctors in Illinois
  • Clinical studies and scientific evaluations were not sufficient to

convince the doctors It th i l f th t l d t d ti

  • It was the social power of peers that led to adoption

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 8

slide-9
SLIDE 9

Diffusion is a social process

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 9

slide-10
SLIDE 10

 Senders and followers of recommendations  Senders and followers of recommendations

receive discounts on products

10% credit 10% off

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 10

slide-11
SLIDE 11

 Diffusion has many (very interesting)  Diffusion has many (very interesting)

flavors:

  • The contagion of obesity [Christakis et al 2007]
  • The contagion of obesity [Christakis et al. 2007]
  • If you have an overweight friend your chances of

becoming obese increases by 57% g y

  • Psychological effects of
  • thers’ opinions, e.g.:

Which line is closest in length to A? [Asch 1958]

A B C D C D

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 11

slide-12
SLIDE 12

 Basis for models:

  • Probability of adopting new behavior depends on the

number of friends who have adopted [Bass ‘69, Granovetter ‘78, Shelling ’78]

Wh ’ h d d ?

 What’s the dependence?

  • n
  • n
  • f adoptio
  • f adoptio

k = number of friends adopting

  • Prob. o

k = number of friends adopting

  • Prob. o

k = number of friends adopting k = number of friends adopting

Diminishing returns? Critical mass?

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 12

slide-13
SLIDE 13
  • n
  • n
  • f adoptio
  • f adoptio

k = number of friends adopting

  • Prob. o

k = number of friends adopting

  • Prob. o

k number of friends adopting k number of friends adopting

Diminishing returns? Critical mass?

 Key issue: qualitative shape of diffusion curves

  • Diminishing returns? Critical mass?
  • Distinction has consequences for models of diffusion
  • Distinction has consequences for models of diffusion

at population level

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 13

slide-14
SLIDE 14

 Probabilistic models:  Probabilistic models:

  • Example:
  • “catch” a disease with some prob
  • catch a disease with some prob.

from neighbors in the network

 Decision based models:

  • Example:
  • Adopt new behaviors if k of your friends do

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 14

slide-15
SLIDE 15

 Two flavors two types of questions:

Two flavors, two types of questions:

  • A) Probabilistic models: Virus Propagation
  • SIS: Susceptible–Infective–Susceptible (e.g., flu)
  • SIR: Susceptible–Infective–Recovered (e.g., chicken‐pox)
  • Question: Will the virus take over the network?
  • Independent contagion model

Independent contagion model

  • B) Decision based models: Diffusion of Innovation
  • Threshold model
  • Herding behavior
  • Questions:
  • Finding influential nodes
  • Detecting cascades

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 15

slide-16
SLIDE 16

[Banerjee ‘92]

 Influence of actions of others  Influence of actions of others

  • Model where everyone sees everyone else’s behavior

 Sequential decision making

Sequential decision making

  • Picking a restaurant:
  • Consider you are choosing a restaurant in an unfamiliar town

y g

  • Based on Yelp reviews you intend to go to restaurant A
  • But then you arrive there is no one eating at A but the next

door restaurant B is nearly full door restaurant B is nearly full

  • What will you do?
  • Information that you can infer from other’s choices may be

Information that you can infer from other s choices may be more powerful than your own

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 16

slide-17
SLIDE 17

 Herding:  Herding:

  • There is a decision to be made
  • P

l k th d i i ti ll

  • People make the decision sequentially
  • Each person has some private information that

helps guide the decision helps guide the decision

  • You can’t directly observe private info of others

but can see what they do but can see what they do

  • Can make inferences about their private information

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 17

slide-18
SLIDE 18

 Consider an urn with 3 marbles. It can be either:

  • Majority‐blue: 2 blue, 1 red, or
  • Majority‐red: 1 blue, 2 red

 Each person wants to best guess whether the

urn is majority‐blue or majority‐red E i t O b h

 Experiment: One by one each person:

  • Draws a marble
  • Privately looks are the color and puts the marble back

y p

  • Publicly guesses whether the urn is majority‐red
  • r majority‐blue

 You see all the guesses beforehand

g

 How should you guess?

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 18

slide-19
SLIDE 19

[Banerjee ‘92]

 What happens:  What happens:

  • 1st person: Guess the color you draw from the urn
  • 2nd person: Guess the color you draw from the urn

if l 1st th ith it

  • if same color as 1st, then go with it
  • If different, break the tie by doing with your own color
  • 3rd person:

If th t b f d diff t

  • If the two before made different guesses,

go with your color

  • Else, just go with their guess

(regardless of the color you see)

  • 4th person:
  • If the first two guesses were the same, go with it
  • 3rd person’s guess conveys no information

C d l hi f i i h B l

 Can model this type of reasoning using the Bayes rule

  • see chapter 16 of Easley‐Kleinberg

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 19

slide-20
SLIDE 20

 Cascade begins when the difference between  Cascade begins when the difference between

the number of blue and red guesses reaches 2

sses #red gues #blue – #

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 20

slide-21
SLIDE 21

 Easy to occur given right structural conditions

Easy to occur given right structural conditions

  • Can lead to bizarre patterns of decisions

 Non‐optimal outcomes

Wi h b ⅓ ⅓ ⅟ fi h l

  • With prob. ⅓⅓=⅟9 first two see the wrong color,

from then on the whole population guesses wrong

 Can be very fragile

  • Suppose first two guess blue
  • People 100 and 101 draw red and cheat by

showing their marbles showing their marbles

  • Person 102 now has 4 pieces of information,

she guesses based on her own color

  • C

d i b k

  • Cascade is broken

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 21

slide-22
SLIDE 22

[Granovetter ‘78]

 Collective action [Granovetter ‘78]  Collective action [Granovetter, 78]

  • Model where everyone sees everyone

else’s behavior else s behavior

  • Examples:
  • Clapping or getting up and leaving in a theater
  • Clapping or getting up and leaving in a theater
  • Keeping your money or not in a stock market
  • Neighborhoods in cities changing ethnic composition

g g g p

  • Riots, protests, strikes

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 22

slide-23
SLIDE 23

 n people – everyone observes all actions  n people – everyone observes all actions  Each person i has a threshold ti

  • Node i will adopt the behavior iff at
  • Node i will adopt the behavior iff at

least ti other people are adopters:

  • Small ti: early adopter

Small ti: early adopter

  • Large ti: late adopter

 The population is described by {t1,…,tn}

  • F( )

f ti f l ith th h ld t 

  • F(x) … fraction of people with threshold ti  x

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 23

slide-24
SLIDE 24

 Think of the step‐by‐step change in number of

p y p g people adopting the behavior:

  • F(x) … fraction of people with threshold  x
  • s(t) … number of participants at time t

y=x

( ) p p

 Easy to simulate:

  • s(0) = 0

(1) (0)

y=F(x) y=x

  • s(1) = F(0)
  • s(2) = F(s(1)) = F(F(0))
  • s(t+1) = F(s(t)) = Ft+1(0)

 F(x)=x – stable point

  • There could be other fixed

points but starting from 0 points but starting from 0 we never reach them

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

x

24

slide-25
SLIDE 25

 What if we start the process somewhere else?

What if we start the process somewhere else?

  • We move up/down to the next fixed point
  • How is market going to change?

y=x

g g g

y=F(x) y=x

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

x

25

slide-26
SLIDE 26

y=x y=F(x)

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

x

26

slide-27
SLIDE 27

 Each threshold t is drawn independently from  Each threshold ti is drawn independently from

some distribution F(x) = Pr[thresh  x]

  • Suppose: Normal with  n/2 variance 
  • Suppose: Normal with =n/2, variance 

Small : Large :

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 27

slide-28
SLIDE 28

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 28

Bigger variance let’s you build a bridge from early adopters to mainstream

slide-29
SLIDE 29

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 29

But if we increase the variance even more we move the higher fixed point lover

slide-30
SLIDE 30

 It does not take into account:  It does not take into account:

  • No notion of social network – more influential

users

  • It matters who the early adopters are, not just

how many

  • Models people’s awareness of size of participation

not just actual number of people participating

  • M d li

th h ld

  • Modeling thresholds
  • Richer distributions
  • Deriving thresholds from mode basic assumptions

Deriving thresholds from mode basic assumptions

  • game theoretic models

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 30

slide-31
SLIDE 31

 It does not take into account:  It does not take into account:

  • Modeling perceptions of who is adopting the

behavior/ who you believe is adopting behavior/ who you believe is adopting

  • Non monotone behavior – dropping out if too

many people adopt many people adopt

  • Similarity – thresholds not based only on numbers
  • People get “locked in” to certain choice over a
  • People get locked in to certain choice over a

period of time

 Network matters! (next slide)

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 31

slide-32
SLIDE 32

 You live in an oppressive society  You live in an oppressive society  You know of a demonstration against the

government planned for tomorrow

 If a lot of people show up the government  If a lot of people show up, the government

will fall

 If only a few people show up, the demonstrators

will be arrested and it would have been better had everyone stayed at home

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 32

slide-33
SLIDE 33

 You should do something if you believe you are

You should do something if you believe you are in the majority!

 Dictator tip: Pluralistic ignorance – erroneous

estimates about the prevalence of certain

  • pinions in the population
  • pinions in the population
  • Survey conducted in the U.S. in 1970 showed that

Survey conducted in the U.S. in 1970 showed that while a clear minority of white Americans at that point favored racial segregation, significantly more than 50% believed that it was favored by a majority of white believed that it was favored by a majority of white Americans in their region of the country

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 33

slide-34
SLIDE 34

 Personal threshold k: “I will show up to the  Personal threshold k: I will show up to the

protest if I am sure at least k people in total (including myself) will show up” (including myself) will show up

 Each node in the network knows the  Each node in the network knows the

thresholds of all their friends

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 34

slide-35
SLIDE 35

 Will uprising occur?  Will uprising occur?

No!

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 35

slide-36
SLIDE 36

 Will uprising occur?  Will uprising occur?

No!

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 36

slide-37
SLIDE 37

 Will uprising occur?  Will uprising occur?

Yes!

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 37

slide-38
SLIDE 38

[Morris 2000]

 Based on 2 player coordination game  Based on 2 player coordination game  2 players – each chooses technology A or B

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 38

slide-39
SLIDE 39

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 39

slide-40
SLIDE 40

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 40

slide-41
SLIDE 41

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 41

slide-42
SLIDE 42
  • 1. Each person can only adopt one “behavior”
  • 2. You gain more if your friends have

adopted the same behavior as you adopted the same behavior as you

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 42

slide-43
SLIDE 43

 If both v and w adopt

p behavior A, they each get payoff a>0

 If v and w adopt

behavior B, they reach get payoff b>0

 If v and w adopt the

  • pposite behaviors
  • pposite behaviors,

they each get 0

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 43

slide-44
SLIDE 44

 In some large network:

  • Each node v is playing a copy of this game with

each of its neighbors

  • Payoff = sum of payoffs per game

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 44

slide-45
SLIDE 45

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 45

slide-46
SLIDE 46

 Let v have d neighbors  Let v have d neighbors  If a fraction p of its neighbors adopt A,

then: then:

 Payoff

= a∙p∙d if v chooses A

 Payoffv = a∙p∙d

if v chooses A = b∙(1‐p)∙d if v chooses B

 v chooses A if: a∙p∙d > b∙(1‐p)∙d

b

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

q b a b p   

46

slide-47
SLIDE 47

 Scenario:  Scenario:

Graph where everyone starts with B. Small set S of early adopters of A Small set S of early adopters of A

  • hard wire S – they keep using A no matter

what payoffs tell them to do what payoffs tell them to do

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 47

slide-48
SLIDE 48

If more than f 50% of my friends are red I’ll be red

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

I ll be red

48

slide-49
SLIDE 49

If more than f 50% of my friends are red I’ll be red

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

I ll be red

49

slide-50
SLIDE 50

If more than f 50% of my friends are red I’ll be red

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

I ll be red

50

slide-51
SLIDE 51

If more than f 50% of my friends are red I’ll be red

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

I ll be red

51

slide-52
SLIDE 52

If more than f 50% of my friends are red I’ll be red

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

I ll be red

52

slide-53
SLIDE 53

If more than f 50% of my friends are red I’ll be red

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

I ll be red

53

slide-54
SLIDE 54

 Observation:

Observation:

  • The use of A spreads monotonically

(nodes only switch from B to A, and never back to B)

 Why?

  • Induction on time

S d it h f A B id

  • Suppose a node switches from AB, consider

the first node v to do this at time t

  • Earlier at some time t’ (t’<t) node v switched BA

( )

  • So at time t’ v was above threshold for A
  • Up to time t no node switches back, so node v can only

have more neighbors who use A at t compared to t’

10/10/2010 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 54