1
Confluence: Conformity Influence in Large Social Networks
Jie Tang*, Sen Wu*, and Jimeng Sun+
*Tsinghua University +IBM TJ Watson Research Center
Confluence: Conformity Influence in Large Social Networks Jie Tang * - - PowerPoint PPT Presentation
Confluence: Conformity Influence in Large Social Networks Jie Tang * , Sen Wu * , and Jimeng Sun + * Tsinghua University + IBM TJ Watson Research Center 1 Conformity Conformity is the act of matching attitudes, opinions, and behaviors to
1
*Tsinghua University +IBM TJ Watson Research Center
2
[1] R.B. Cialdini, & N.J. Goldstein. Social influence: Compliance and conformity. Annual Review of Psych., 2004, 55, 591–621. [2] H.C. Kelman. Compliance, Identification, and Internalization: Three Processes of Attitude Change. Journal of Conflict Resolution, 1958, 2 (1): 51–60.
3
I love Obama Obama is great! Obama is fantastic I hate Obama, the worst president ever He cannot be the next president! No Obama in 2012! Positive Negative
4
I love Obama Obama is great! Obama is fantastic Positive Negative
conformity
conformity
5
6
– Influence and correlation [Anagnostopoulos-et-al 2008] Distinguish influence and homophily [Aral-et-al 2009, La Fond-Nevill 2010] – Topic-based influence measure [Tang-Sun-Wang-Yang 2009, Liu-et-al 2012] Learning influence probability [Goyal-Bonchi-Lakshmanan 2010]
– Linear threshold and cascaded model [Kempe-Kleinberg-Tardos 2003] – Efficient algorithm [Chen-Wang-Yang 2009]
Ada Frank Eve David Carol Bob George
Input: coauthor network
Ada Frank Eve David Carol George
Social influence anlaysis
θi1=.5 θi2=.5 Topic distribution
g(v1,y1,z)
θi1 θi2 Topic distribution Node factor function
f (yi,yj, z)
Edge factor function
rz az Output: topic-based social influences
Topic 1: Data mining Topic 2: Database Topics: Bob Output Ada Frank Eve Bob George Topic 1: Data mining Ada Frank Eve David George Topic 2: Database
. . .
2 1 1 4 2 2 3 3
7
8
9
Network #Nodes #Edges Behavior #Actions
All the datasets are publicly available for research.
10
Alice’s friend Other users Alice Legend
11
Time t-1, t-2…
12
13
All actions by user v A specific action performed by user v at time t Exists a friend v′ who performed the same action at time t’′
14
All actions by user v′ A specific action performed by user v′ at time t′ User v follows v′ to perform the action a at time t
15
All τ-group actions performed by users in the group Ck A specific τ-group action User v conforms to the group to perform the action a at time t τ-group action: an action performed by more than a percentage τ
16
0.0005 0.001 0.0015 0.002 0.0025 0.003 2000 2005 2010
Peer Conformity
Peer Random 0.0005 0.001 0.0015 0.002 0.0025 Clustering Influence Recommendation Topic Model
Group Conformity
KDD ICDM CIKM 0.005 0.01 0.015 0.02 0.025 KDD ICDM CIKM
Individual Conformity
KDD
17
18
g(v1, icf (v1)) Users
Confluence model
v2 v3
y1=a Input Network
v4 v5 v7
Group 1: C1 Group 2: C2
g(y1, y’3, pcf (v1, v3))
g(y1, gcf (v1, C1))
v6 v1
Group 3: C3
Group conformity factor function Peer conformity factor function Random variable y: Action Individual conformity factor function
19
Individual conformity factor function Group conformity factor function Peer conformity factor function
20
[1] X. Song, Y. Chi, K. Hino, and B. L. Tseng. Identifying opinion leaders in the blogosphere. In CIKM’06, pages 971–974, 2007. [2] T. Lou and J Tang. Mining Structural Hole Spanners Through Information Diffusion in Social Networks. In WWW'13. pp.
837-848.
[3] M. Granovetter. The strength of weak ties. American Journal of Sociology, 78(6):1360–1380, 1973. [4] D. Easley and J. Kleinberg. Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge
University Press, 2010.
21
(1) Master (3) Master (2) Slave
Unknown parameters to estimate
22
Slave Compute local gradient via random sampling Master Global update
23
24
Network #Nodes #Edges Behavior #Actions
Weibo 1,776,950 308,489,739 Post a tweet 6,761,186 Flickr 1,991,509 208,118,719 Add comment 3,531,801 Gowalla 196,591 950,327 Check-in 6,442,890 ArnetMiner 737,690 2,416,472 Publish paper 1,974,466
25
t-test, p<<0.01
26
Confluencebase stands for the Confluence method without any social based features Confluencebase+I stands for the Confluencebase method plus only individual conformity features Confluencebase+P stands for the Confluencebase method plus only peer conformity features Confluencebase+G stands for the Confluencebase method plus only group conformity
27
Achieve ∼ 9×speedup with 16 cores
28
29
30
*Tsinghua University +IBM TJ Watson Research Center
31
32
I love Obama Positive Negative
1
Conformity
Conformity
33
I love Obama Obama is great! Positive Negative
conformity
2
Conformity
34
I love Obama Obama is great! Positive Negative
conformity
conformity
3
35
I love Obama Obama is great! Obama is fantastic Positive Negative
conformity
conformity
4