Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
1
CSE 6240: Web Search and Text Mining. Spring 2020
Message Passing and Node Classification
- Prof. Srijan Kumar
Message Passing and Node Classification Prof. Srijan Kumar 1 - - PowerPoint PPT Presentation
CSE 6240: Web Search and Text Mining. Spring 2020 Message Passing and Node Classification Prof. Srijan Kumar 1 Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining Outline Main question today: Given a network with
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
1
CSE 6240: Web Search and Text Mining. Spring 2020
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
2
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
3
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
4
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
5
(Easley and Kleinberg, 2010)
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
6
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
7
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
8
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
9
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
10
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
11
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
12
Local Classifier
Relational Classifier
correlations between nodes Collective Inference
correlations through network
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
13
Collective Inference
correlations through network
Local Classifier
label
Relational Classifier
correlations between nodes
its neighbors to label one node
labels is minimized
prediction
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
14
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
15
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
16
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
17
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
18
P(Y = 1) = 0 P(Y = 1) = 0 P(Y=1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 1 P(Y = 1) = 1
P(Y=1) = 0.5
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
19
P(Y = 1) = 0 P(Y = 1) = 0 P(Y=1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 1 P(Y = 1) = 1
P(Y=1|N3) = 1/3 (0 + 0 + 0.5) = 0.17
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
20
P(Y = 1) = 0 P(Y = 1) = 0 P(Y=1|N4)= ¼(0+ 0.17+0.5+1) = 0.42
P(Y=1) = 0.17
P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 1 P(Y = 1) = 1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
21
P(Y = 1) = 0 P(Y = 1) = 0 P(Y=1|N4)= 0.42
P(Y=1) = 0.17
P(Y=1|N5) = ¼ (0.42+1+1+0.5) = 0.73 P(Y = 1) = 0.5 P(Y = 1) = 0.5 P(Y = 1) = 1 P(Y = 1) = 1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
22
P(Y = 1) = 0 P(Y = 1) = 0 P(Y = 1) = 0.17 P(Y = 1) = 0.42 P(Y = 1) = 0.73 P(Y = 1) = 0.91 P(Y = 1) = 1.00
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
23
P(Y = 1) = 0 P(Y = 1) = 0 P(Y = 1) = 0.14 P(Y = 1) = 0.47 P(Y = 1) = 0.85 P(Y = 1) = 0.95 P(Y = 1) = 1.00
All neighbors values are
value can not change.
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
24
P(Y = 1) = 0 P(Y = 1) = 0 P(Y = 1) = 0.16 P(Y = 1) = 0.50 P(Y = 1) = 0.86 P(Y = 1) = 0.95 P(Y = 1) = 1.00
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
25
P(Y = 1) = 0 P(Y = 1) = 0 P(Y = 1) = 0.16 P(Y = 1) = 0.51 P(Y = 1) = 0.86 P(Y = 1) = 0.95 P(Y = 1) = 1.00
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
26
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
27
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
28
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
29
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
30
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
31
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
32
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
33
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
34
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
35
Each product has a ‘goodness’ score G 𝑞 ∈ −1,1 Each user has a ‘fairness’ score 𝐺 𝑣 ∈ 0,1 Each rating has a ‘reliability’ score R 𝑣, 𝑞 ∈ 0,1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
36
Each product has a ‘goodness’ score G 𝑞 ∈ −1,1 Each user has a ‘fairness’ score 𝐺 𝑣 ∈ 0,1 Each rating has a ‘reliability’ score R 𝑣, 𝑞 ∈ 0,1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
37
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
38
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
39
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
40
G(p) = 1 G(p) = 1 G(p) = 1 F(u) = 1 F(u) = 1 F(u) = 1 R(u,p) = 1 R(u,p) = 1 R(u,p) = 1 R(u,p) = 1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
41
F(u) = 1 F(u) = 1 F(u) = 1 F(u) = 1 F(u) = 1
R(r) = 1
R(r) = 1 G(p) = 0.67 G(p) = 0.67 G(p) = -0.67 R(r) = 1 R(r) = 1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
42
F(u) = 1 F(u) = 1 F(u) = 1 F(u) = 1 F(u) = 1 F(u) = 1 R(r) = 0.92 R(r) = 0.92 R(r) = 0.92 R(r) = 0.58 R(r) = 0.58 G(p) = 0.67 G(p) = 0.67 G(p) = -0.67 Both gamma values are set to 1
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
43
F(u) = 0.92 F(u) = 0.92 F(u) = 0.58 F(u) = 0.92 F(u) = 0.92 F(u) = 0.92 R(r) = 0.92 R(r) = 0.92 R(r) = 0.58 R(r) = 0.58 R(r) = 0.92 G(p) = 0.67 G(p) = 0.67 G(p) = -0.67
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
44
F(u) = 0.83 F(u) = 0.83 F(u) = 0.17 F(u) = 0.83 F(u) = 0.83 F(u) = 0.83 R(r) = 0.83 R(r) = .83 R(r) = 0.83 R(r) = 0.17 R(r) = 0.83 R(r) = 0.17 G(p) = 0.67 G(p) = 0.67 G(p) = -0.67
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
45
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
46
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
47
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
48
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
49
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
50
adapted from MacKay (2003) textbook
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
1 before you
2 before you there's 1 of me 3 before you 4 before you 5 before you
51
1 after you 2 after you 3 after you 4 after you 5 after you 6 after you
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
3 behind you
2 before you
there's 1 of me Belief: Must be 2 + 1 + 3 = 6
my incoming messages
52
2 before you
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
4 behind you 1 before you there's 1 of me
my incoming messages
53
Belief: Must be 2 + 1 + 3 = 6
Belief: Must be 1 + 1 + 4 = 6
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
7 here 3 here 11 here (= 7+3+1) 1 of me
54
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
3 here 3 here 7 here (= 3+3+1)
55
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
7 here 3 here 11 here (= 7+3+1)
56
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
7 here 3 here 3 here Belief: Must be 14 of us
57
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
7 here 3 here 3 here Belief: Must be 14 of us
58
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
59
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
60
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
61
61
Label-label potential Prior All messages from neighbors Sum over all states
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
62
Prior All messages from neighbors
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
63
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
64
T 2 F 1 T 2 F 1 T 2 F 1 T 2 F 1 T 2 F 1 T 4 F 1 T 4 F 1
independent parts of the graph.
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
65
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
66
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
67
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
68
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
69
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
70
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
71
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
72
Initialize all nodes as unbiased
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
73
Initialize all nodes as unbiased At each iteration, for each node, compute messages to its neighbors
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
74
Initialize all nodes as unbiased Continue till convergence At each iteration, for each node, compute messages to its neighbors
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
75
P(fraudster) P(associate) P(honest)
Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
76