Modeling Information Diffusion in Implicit Networks. Jaewon Yang - PowerPoint PPT Presentation

Modeling Information Diffusion in Implicit Networks. Jaewon Yang ， Jure Leskovec IEEE International Conference On Data Mining (ICDM), 2010 Presenter: SHI, Conglei(clshi@cse.ust.hk)

PROBLEM ¤ There are some limitations for parameter estimation: ¤ Need complete network data: FACT: Commonly , we only observe nodes got “infected”. ¤ Contagion can only spread over the edges: FACT: The diffusion is not just depend on the social network.

METHODS ¤ Focusing on modeling the global influence a node has on the rate of diffusion through the implicit network. ¤ Ignore the knowledge of the network ¤ Also model how the diffusion unfold over time. ¤ Proposed Linear Influence Model(LIM) ¤ Base Assumption: number of newly infected nodes depends on which other nodes got infected in the past.

LINEAR INFLUENCE MODEL ¤ V(t) : The number of nodes that mention the info at t ¤ I : The Influence of the node u at time t ¤ How to model ?

MODELING INFLUENCE FUNCTION ¤ Parametric approach: ¤ Too simplistic, assuming all the nodes follow the same form ¤ Non-parametric approach: ¤ Do not make any assumption about the shape of function ¤ Represent the function as a non-negative vector of length L ¤ Can study how the function varies for different types.

ESTIMATING FUNCTIONS ¤ Consider a set of N nodes, K contagions. ¤ Design an indicator function . If node u got infected by contagion k at time t , . ¤ : The number of nodes that got infected by k at time t .

ESTIMATING FUNCTIONS

ESTIMATING FUNCTIONS ¤ This problem is called Non-negative Least Squares(NNLS) problem ¤ Minimize ¤ The Matrix M is sparse in nature ¤ Using Reflective Newton Method is ¤ Subject to very effective. ¤ Tikhonov regularization is also applied to smooth the estimates.

EXTENSIONS ¤ Accounting for novelty: ¤ One node’s influence is related to the time it appears. ¤ Introduce a multiplicative factor . ¤ The equation is convex both and , which means we can use a coordinate descent procedure.

EXTENSIONS ¤ Accounting for imitation ¤ Some information diffusion is the effect of imitation. ¤ Introduce to model the latent volume. ¤ Also linear.

EXPERIMENTS ¤ First datasets ¤ Memetracker data: Extracting 343 million short textual phrases from 172 million news article and blog post. ¤ Time period: Sep.1 2008 to Aug. 31 2009 ¤ Choosing 1000 phrases with highest volume in a 5 day window around their peak volume

EXPERIMENTS ¤ Second datasets ¤ Twitter data: Identifying 6 million different hashtags from a stream of 580 million Twitter posts. ¤ Time period: Jun. 2009 to Feb. 2010 ¤ Choosing 1000 hashtags with highest volume in a 5 day window around their peak volume ¤ Grouping users into groups of 100 users.

EXPERIMENTS ¤ Evaluate LIM model on a time series prediction task. ¤ Employ 10-fold cross validation. ¤ Calculate ¤ Relative error is what we want.

RESULT 23.00% 21.00% 19.00% 17.00% AR 15.00% ARMA 13.00% LIM 11.00% B-LIM 9.00% α -LIM 7.00% 5.00% 1 2 3 4 5 6 7 Yang, J., & Leskovec, J. Patterns of temporal variation in online media. (WSDM '11)

RESULT AR 13.00% 8.00% ARMA 3.00% LIM -2.00% 1 2 3 4 5 6 7 -7.00% B-LIM -12.00% α -LIM -17.00% -22.00% AR+LIM -27.00%

RESULT

CONCLUSION ¤ Proposed the Linear Influence Model. ¤ Considered some other factors to enhance the model. ¤ Used large scale of data to justify the effectiveness of the model. ¤ Opened up a new framework for the analysis of diffusion. ¤ Future work: extend the linear model to non-linear model.

THANKS FOR YOUR ATTENTION!

Modeling Information Diffusion in Implicit Networks. Jaewon Yang - PowerPoint PPT Presentation

Modeling Information Diffusion in Implicit Networks. Jaewon Yang Jure Leskovec IEEE International Conference On Data Mining (ICDM), 2010 Presenter: SHI, Conglei(clshi@cse.ust.hk) PROBLEM There are some limitations for parameter

c + = Diffusion Diffusion 2 6.82 10 -6 v c D c 10 -1 Equation

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Modeling Information Diffusion Modeling Information Diffusion in Multi in Multi-Sensitive

Implicit Surfaces Implicit Surfaces An implicit surface is simply an iso-contour CIS 781 of a

PLS Advanced Diffusion Model New Advanced Diffusion Model for Dopants in Silicon Advanced Dopant

Information Diffusion on Social Networks SMART Summer School 2017 Sylvain Lamprier LIP6 - UPMC

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

NON-SYMMETRIC FRACTIONAL DIFFUSION NON-SYMMETRIC FRACTIONAL DIFFUSION AS A SPECIAL CASE OF AS A

Implicit Bias: Transcript Inclusive Teaching Series: Implicit Bias Welcome to the third module of

Implicit Extremes and Implicit MaxStable Laws Stilian Stoev ( sstoev@umich.edu ) University of

Multi-core Programming: Implicit Parallelism Tuukka Haapasalo April 16, 2009 Tuukka Haapasalo

Implicit Surfaces CPSC 599.86 / 601.86 Sonny Chan University of Calgary (some board work happened

COMP522 - Project Presentation Modelling Information Diffusion over Networks using DEVS By: Hiu

Dynamic Mathematical Modeling of Information Diffusion in Online Social Networks Feng Wang,

12 Implicit Spatial Discretization for Advection-Diffusion-Reaction Equation Kundan Kumar

A Bloch Torrey Equation for Diffusion in a Deforming Media Damien Rohmer November 21, 2006 A

Applications of Diffusion Models in Telecommunications Nigel Meade 2 Introduction Recent

Help Me Grow, the Diffusion of Innovation , and Early Childhood System Building Paul H. Dworkin,

Managing a Large Dilute Plume Impacted by Matrix Diffusion: MEW Case Study Presented at Federal

New Efficient methods to Characterize Effects of Framework Flexibility on CH 4 Diffusion in 8MR

2000, 2008 and 2012 Bond Programs San Francisco Clean and Safe Parks GO BONDS 2008 BOND Program

2000 Neighborhood Parks Bond CGOBOC September 25, 2014 2000 Neighborhood Parks General Obligation

2012 CLEAN & SAFE NEIGHBORHOOD PARKS BOND Request 3 rd 2012 Sale Capital Planning Committee

Physics Beyond the y y Standard Model Standard Model David Toback David Toback Texas A&M

Sambuz

Useful Links

Newsletter

Mail Us

Modeling Information Diffusion in Implicit Networks. Jaewon Yang - PowerPoint PPT Presentation

Modeling Information Diffusion in Implicit Networks. Jaewon Yang Jure Leskovec IEEE International Conference On Data Mining (ICDM), 2010 Presenter: SHI, Conglei(clshi@cse.ust.hk) PROBLEM There are some limitations for parameter

c + = Diffusion Diffusion 2 6.82 10 -6 v c D c 10 -1 Equation

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Modeling Information Diffusion Modeling Information Diffusion in Multi in Multi-Sensitive

Implicit Surfaces Implicit Surfaces An implicit surface is simply an iso-contour CIS 781 of a

PLS Advanced Diffusion Model New Advanced Diffusion Model for Dopants in Silicon Advanced Dopant

Information Diffusion on Social Networks SMART Summer School 2017 Sylvain Lamprier LIP6 - UPMC

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

NON-SYMMETRIC FRACTIONAL DIFFUSION NON-SYMMETRIC FRACTIONAL DIFFUSION AS A SPECIAL CASE OF AS A

Implicit Bias: Transcript Inclusive Teaching Series: Implicit Bias Welcome to the third module of

Implicit Extremes and Implicit MaxStable Laws Stilian Stoev ( sstoev@umich.edu ) University of

Multi-core Programming: Implicit Parallelism Tuukka Haapasalo April 16, 2009 Tuukka Haapasalo

Implicit Surfaces CPSC 599.86 / 601.86 Sonny Chan University of Calgary (some board work happened

COMP522 - Project Presentation Modelling Information Diffusion over Networks using DEVS By: Hiu

Dynamic Mathematical Modeling of Information Diffusion in Online Social Networks Feng Wang,

12 Implicit Spatial Discretization for Advection-Diffusion-Reaction Equation Kundan Kumar

A Bloch Torrey Equation for Diffusion in a Deforming Media Damien Rohmer November 21, 2006 A

Applications of Diffusion Models in Telecommunications Nigel Meade 2 Introduction Recent

Help Me Grow, the Diffusion of Innovation , and Early Childhood System Building Paul H. Dworkin,

Managing a Large Dilute Plume Impacted by Matrix Diffusion: MEW Case Study Presented at Federal

New Efficient methods to Characterize Effects of Framework Flexibility on CH 4 Diffusion in 8MR

2000, 2008 and 2012 Bond Programs San Francisco Clean and Safe Parks GO BONDS 2008 BOND Program

2000 Neighborhood Parks Bond CGOBOC September 25, 2014 2000 Neighborhood Parks General Obligation

2012 CLEAN &amp; SAFE NEIGHBORHOOD PARKS BOND Request 3 rd 2012 Sale Capital Planning Committee

Physics Beyond the y y Standard Model Standard Model David Toback David Toback Texas A&amp;M

Sambuz

Useful Links

Newsletter

Mail Us

2012 CLEAN & SAFE NEIGHBORHOOD PARKS BOND Request 3 rd 2012 Sale Capital Planning Committee

Physics Beyond the y y Standard Model Standard Model David Toback David Toback Texas A&M