General Threshold Model for Social Cascades Jie Gao, Golnaz - - PowerPoint PPT Presentation

general threshold model for social cascades
SMART_READER_LITE
LIVE PREVIEW

General Threshold Model for Social Cascades Jie Gao, Golnaz - - PowerPoint PPT Presentation

General Threshold Model for Social Cascades Jie Gao, Golnaz Ghasemiesfeh, Grant Schoenebeck, Fang-Yi Yu Contagions, diffusion, cascade Ideas, beliefs, behaviors, and technology adoption spread through network Why do we need to study


slide-1
SLIDE 1

General Threshold Model for Social Cascades

Jie Gao, Golnaz Ghasemiesfeh, Grant Schoenebeck, Fang-Yi Yu

slide-2
SLIDE 2

Contagions, diffusion, cascade…

  • Ideas, beliefs, behaviors, and

technology adoption spread through network

  • Why do we need to study this

phenomena?

– Better Understanding – Promoting good behaviors/beliefs – Stopping bad behavior

slide-3
SLIDE 3

Outline

  • Cascade Model
  • Empirical Results: Testing Network Models

– Real Data – Synthetic Models

  • Theoretical Results

– Directed case – Undirected case

slide-4
SLIDE 4

Outline

  • Cascade Model
  • Empirical Results: Testing Network Models

– Real Data – Synthetic Models

  • Theoretical Results

– Directed case – Undirected case

slide-5
SLIDE 5

Social Contagion

  • Contagion is a chain reaction that starts with early adopters

and spreads through the social network

slide-6
SLIDE 6

Social Contagion

  • Contagion is a chain reaction that starts with early adopters

and spreads through the social network

slide-7
SLIDE 7

Social Contagion

  • Contagion is a chain reaction that starts with early adopters

and spreads through the social network

slide-8
SLIDE 8

Social Contagion

  • Contagion is a chain reaction that starts with early adopters

and spreads through the social network

slide-9
SLIDE 9

General Threshold Contagion

  • General Threshold Contagion GTC(G,D,S) [G 1973; MR 2010]

– Social network: Graph, G – Reaction: Threshold distribution, 𝐸 = 𝑉Δ – Early adopters: Seeded nodes, 𝑇 = {𝑣}

u v w x y z 1 2 2 1 4 2 2 1 3 2

slide-10
SLIDE 10

How general is this model?

  • Captures many models as special cases

– Independent cascade – Linear threshold model – k-complex contagion

slide-11
SLIDE 11

Outline

  • Cascade Model
  • Empirical Results: Testing Network Models

– Real Data – Synthetic Models

  • Theoretical Results

– Directed case – Undirected case

slide-12
SLIDE 12

Experiment Setups

  • G: graph

– DBLP co-authorship network with 317,080 nodes – Stanford web graph with 281,903 nodes

  • D: threshold ~ Poisson distribution with different mean 𝜇
  • S: The ‘earliest’ 25 nodes
slide-13
SLIDE 13

Contagion on DBLP Database

  • G: DBLP co-authorship network

– 317,080 nodes 1,049,866 edges – 3.3 average degree

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10

slide-14
SLIDE 14

Outline

  • Cascade Model
  • Empirical Results: Testing Network Models

– Real Data – Synthetic Models

  • Configuration Model
  • Stochastic Attachment Model
  • Theoretical Results

– Directed case – Undirected case

slide-15
SLIDE 15

Social Networks

  • Can we generate synthetic but “realistic” graphs?

– Configuration models – Preferential attachment networks – …

slide-16
SLIDE 16

Configuration Model

Original Graph (Karate Club) Configuration model

slide-17
SLIDE 17

Real Network and Configuration Model

  • Graph

– DBLP – Configuration Model

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 INFECTION OF THE NETWORK (FRACTION) Λ

CONTAGION ON DBLP

DBLP Dataset

  • Config. Model
slide-18
SLIDE 18

Having better model for DBLP

  • Time evolving graphs?

– A growing network in which newcomers connect to old nodes.

slide-19
SLIDE 19

Having better model for DBLP

  • Preferential attachment network

– Add a new node, create m out-links to old nodes – Connect old nodes with attachment rule 𝔹

  • Preferentially with probability 𝛽
  • Uniformly random otherwise
  • How can we model DBLP by PA?
slide-20
SLIDE 20

Having better model for DBLP

  • Preferential attachment network

– Add a new node, create m out-links to old nodes – Connect old nodes with attachment rule 𝔹

  • Preferentially with probability 𝛽
  • Uniformly random otherwise
  • How can we model DBLP by PA?
slide-21
SLIDE 21

Stochastic Attachment Model (SA)

  • Model

– Add a new node, create m out-links from distribution M to the old nodes – Connect old nodes with attachment rule 𝔹

  • Preferentially with probability 𝛽
  • Uniformly random otherwise
slide-22
SLIDE 22

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-23
SLIDE 23

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-24
SLIDE 24

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-25
SLIDE 25

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-26
SLIDE 26

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-27
SLIDE 27

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-28
SLIDE 28

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node

slide-29
SLIDE 29

Parameters for the SA

  • Learn parameters from real social network

– Learn M by iteratively remove the minimal degree node – Try different 𝛽: 0, 0.25, 0.5, 0.75, 1

slide-30
SLIDE 30

Stochastic Attachment and Contagions

  • Graph:

– DBLP – Configuration Model – Stochastic Attachment Network

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes
  • 0.1

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10

slide-31
SLIDE 31

Stochastic Attachment and Contagions

  • Graph:

– DBLP – Configuration Model – Stochastic Attachment Network

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes
  • 0.1

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10

slide-32
SLIDE 32

Contagion on Stanford Web Graph

  • Graph: Stanford Web Graph

– 281,903 nodes 2,312,497 edges – 7.3 average degree

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

slide-33
SLIDE 33

Contagion on Real Network

  • Graph

– Stanford Web Graph – Configuration Model

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

slide-34
SLIDE 34

Contagion on Real Network

  • Graph

– Stanford Web Graph – Configuration Model – Stochastic Attachment Network

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

slide-35
SLIDE 35

Contagion on Real Network

  • Graph

– Stanford Web Graph – Configuration Model – Stochastic Attachment Network

  • D: Poisson distribution
  • S: The ‘earliest’ 25 nodes

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

slide-36
SLIDE 36

Outline

  • Cascade Model
  • Empirical Results: Testing Network Models

– Real Data – Synthetic Models

  • Configuration Model
  • Stochastic Attachment Model
  • Theoretical Results

– Directed case – Undirected case

slide-37
SLIDE 37

How would contagion spread on directed PA?

A) B)

slide-38
SLIDE 38

Theorem in Directed Case

  • The fraction of infection would converge to the stable fixed

points of “feedback function” 𝑔 𝑦

slide-39
SLIDE 39

Observations

slide-40
SLIDE 40

Observations

1 2 1 3 1 2 2

slide-41
SLIDE 41

Observations

1 2 1 3 1 2 2

slide-42
SLIDE 42

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 𝑍

3 = 1

slide-43
SLIDE 43

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 3 𝑍

3 = 1

slide-44
SLIDE 44

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 3 𝑍

4 = 0.75

slide-45
SLIDE 45

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 3 1 𝑍

4 = 0.75

slide-46
SLIDE 46

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 3 1 𝑍

5 = 0.8

slide-47
SLIDE 47

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 3 1 2 2 𝑍

6 = 0.83

slide-48
SLIDE 48

Observations

  • Time evolving property

– Reveal both the edges and thresholds sequentially

1 2 1 3 1 2 2 𝑍

7 = 0.86

slide-49
SLIDE 49

Feedback Function

  • The probability of a newcomer

get infected

– Distribution of threshold – M out-links

Stable point Stable fixed points

slide-50
SLIDE 50

Feedback Function

Stable point Stable fixed points

slide-51
SLIDE 51

Outline

  • Background and Motivation
  • Model and Experimental Results

– General Threshold Contagion – Experiment on Real Network – Stochastic Attachment Network

  • Theoretical results

– Directed cases – Undirected cases (please see the paper)

slide-52
SLIDE 52

Future Work

  • Better graph models to approximate contagions on real

networks

  • Unclear when the contagions can die out in undirected case

with 0 as a fixed point