Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
1
CSE 6240: Web Search and Text Mining. Spring 2020
Cascades and Contagion
- Prof. Srijan Kumar
Cascades and Contagion Prof. Srijan Kumar - - PowerPoint PPT Presentation
CSE 6240: Web Search and Text Mining. Spring 2020 Cascades and Contagion Prof. Srijan Kumar http://cc.gatech.edu/~srijan 1 Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining Todays Lecture Introduction
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
2
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
3
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
4
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
5
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
6
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
7
S…susceptible E…exposed I…infected R…recovered Z…immune
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
8
Susceptible Infected Recovered time Number of nodes
I(t) S(t) R(t) 𝛾 𝜀
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
9
Infected by neighbor with prob. β Cured with
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
Susceptible Infected
time Number of nodes
10
I(t) S(t)
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
11
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
12
[Wang et al. 2003]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
13
100 200 300 400 500 250 500 750 1000
Time Number of Infected Nodes
δ: 0.05 0.06 0.07 Oregon β = 0.001
10,900 nodes and 31,180 edges
[Wang et al. 2003]
Autonomous Systems Graph
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
14
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
15
[Gomes et al., 2014]
[Gomes et al., Assessing the International Spreading Risk Associated with the 2014 West African Ebola Outbreak, PLOS Current Outbreaks, ‘14]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
16
S: susceptible individuals, E: exposed individuals, I: infectious cases in the community, H: hospitalized cases, F: dead but not yet buried, R: individuals no longer transmitting the disease
[Gomes et al., Assessing the International Spreading Risk Associated with the 2014 West African Ebola Outbreak, PLOS Current Outbreaks, ‘14]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
17
References: 1. Epidemiological Modeling of News and Rumors on Twitter. Jin et al. SNAKDD 2013 2. False Information on Web and Social Media: A survey. Kumar et al., arXiv :1804.08559
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
18
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
19
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
20
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
21
REAL EVENTS RUMORS
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
22
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
23
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
24
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
25
Notation: S = Susceptible I = Infected E = Exposed Z = Skeptics
All parameters learned by model fitting to real data (from previous slides)
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
26
Rumors
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
27
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
28
, active neighbor of v w v w v
, neighbor of
v w w v
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
29
Inactive Node Active Node Threshold Active neighbors
0.5 0.3 0.2 0.5 0.1 0.4 0.3 0.2 0.6 0.2
U X
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
30
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
31
0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.4 0.3 0.3 0.3 0.3 0.3 0.3 0.2
e g f c b a d h i f g e
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
32
0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.4 0.3 0.3 0.3 0.3 0.3 0.3 0.2
e g f c b a d h i f g e
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
33
[KDD ‘12]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
34
k = number of friends adopting
k = number of friends adopting
“Probabilistic” spreading: Viruses, Information Critical mass: Decision making … adopters
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
35
Prob(Infection) # exposures Probability of infection ever increases Nodes build resistance [KDD ‘12]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
36
10% credit 10% off
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
37
0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 10 20 30 40
[Leskovec et al., TWEB ’07]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
38
[Backstrom et al. KDD ‘06]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
39
[Backstrom et al., KDD ’06]
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
40