Final Report: Interest-aware Information Diffusion in Dynamic Social Network

Final Report: Interest-aware Information Diffusion in Dynamic Social Network. Zhenhao Cao, Ru Wang. Mobile Internet, 2018.6. Outline: Introduction, Related Work, Challenge & Motivation, Proposed Model, Experiments, References.


  1. Final Report: Interest-aware Information Diffusion in Dynamic Social Network. Zhenhao Cao, Ru Wang. Mobile Internet, 2018.6

  2. Outline • Introduction • Related Work • Challenge & Motivation • Proposed Model • Experiments • References EE447 2018.6 Final Report – Zhenhao Cao, Ru Wang 1/42

  3. Introduction • Social Network

  4. Introduction – A Taxonomy • An earlier survey: a taxonomy for information cascade prediction √ • Collaborative Filtering methods • Leverage homophily: insightful • Get rid of troublesome feature engineering

  5. Introduction – Why CF? • Key idea behind CF: Homophily • Transplantable to information diffusion modeling [Figure: analogy between commodity adoption (adopt / not adopt) and information adoption, i.e. retweeting a post (adopt / not adopt)]

  6. Related Work – Extant CF-based Studies • CRPM & IRPM [1] (CIKM2015)

  7. Related Work – Extant CF-based Studies • GPOP [2] (WWW2017)

  8. Related Work – Extant CF-based Studies • A Collaborative Filtering Model for Personalized Retweeting Prediction [3] (DASFAA2015)

  9. Challenge & Motivation • Make fuller use of social network information • Adapt better to information diffusion modeling • Novel insights into user retweet behavior

  10. Challenge & Motivation • Make fuller use of social network information - A flat “snapshot” of users’ historical behaviors - Information loss: permutation? Sequence? Diffusion topologies? [Figure: a diffusion topology compressed into a single 0/1 column of the Retweet Matrix R]

  11. Challenge & Motivation • Make fuller use of social network information • Adapt better to information diffusion modeling - Leverage diffusion topologies * Essence of information diffusion * A main difference from recommender system problems

  12. Challenge & Motivation • Make fuller use of social network information • Adapt better to information diffusion modeling • Novel insights into user retweet behavior [Figure: factors behind the decision "Retweet or not?": Interest, Post Attraction, Resistance, Others’ Influence]

  13. Our Work

  14. Our Work - ReTrend • A novel framework for information diffusion [Figure: ReTrend framework with an Interest-extraction Component, a Resistance-extraction Component and a Prediction Component; node labels S, C, X, Y, A, R, Dif, Z, T]

  15. ReTrend – Observable Data • Four matrices carrying observable data - Subscription Matrix (S) - Contagion Matrix (C) - Resistance Matrix (T) - Retweet Matrix (R) [Figure: ReTrend framework diagram]

  16. ReTrend – Learning Latent Feature • Four factor matrices carrying latent feature vectors - User Interest Matrix (X) - User Influence Matrix (Y) - User Resistance Matrix (Z) - Item Attraction Matrix (A) [Figure: ReTrend framework diagram]

  17. ReTrend – Learning Latent Feature • Four factor matrices carrying latent feature vectors - User Interest Matrix (X) - User Influence Matrix (Y) - User Resistance Matrix (Z) - Item Attraction Matrix (A) • We assume that this inherent attribute, ‘resistance’, varies across the latent dimensions but stays fixed for a given user [Figure: ReTrend framework diagram, with Z highlighted]

  18. ReTrend – Logic Explanation • Take Contagion Matrix for example • Contagion Matrix: |user| × |post| • Entry $C_{ui}$: count of retweet behaviors triggered by user $u$ w.r.t. post $i$ • $C_{ui}$ reflects two facts: - to what degree a user can trigger his friends to retweet the post - how attractive the post is [Figure: ReTrend framework diagram]

  19. ReTrend – Logic Explanation • Take Contagion Matrix for example [Figure: Contagion Matrix C (sparse counts such as 0, 1, 2) ≈ User Influence Matrix Y × Item Attraction Matrix A, with shared latent dimension $k$] • Assume a Gaussian observation noise
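
The factorization on this slide can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the matrix layout (one row of `Y` per user, one column of `A` per post), the toy sizes, the toy counts and the noise scale `sigma` are all hypothetical, since the slides do not preserve them.

```python
import numpy as np

rng = np.random.default_rng(0)
n_users, n_posts, k = 8, 3, 4  # hypothetical sizes; k is the latent dimension

# Assumed layout: one row of Y per user, one column of A per post.
Y = rng.normal(scale=0.1, size=(n_users, k))  # user influence factors
A = rng.normal(scale=0.1, size=(k, n_posts))  # item attraction factors

# Toy contagion counts (mostly zero, like the sparse matrix on the slide).
C = np.zeros((n_users, n_posts))
C[2, 0] = 2
C[4, 0] = 1


def gaussian_log_likelihood(C, Y, A, sigma=1.0):
    """Log-likelihood of C under C_ui ~ Normal((Y @ A)_ui, sigma^2)."""
    residual = C - Y @ A
    n = C.size
    return (-0.5 * np.sum(residual ** 2) / sigma ** 2
            - 0.5 * n * np.log(2.0 * np.pi * sigma ** 2))


print(gaussian_log_likelihood(C, Y, A))
```

Under Gaussian noise, maximizing this log-likelihood is the same as minimizing the squared reconstruction error between `C` and `Y @ A`.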

  20. ReTrend – Logic Explanation • For Retweet Matrix • Retweet behavior can be determined by user interest, resistance, parent influence and post attraction [equation not preserved] [Figure: ReTrend framework diagram]

  21. ReTrend – Entire Model • Conditional distribution over all observed data [equation not preserved] • Place zero-mean spherical Gaussian priors on latent feature vectors

  22. ReTrend – Entire Model • By modifying the log-likelihood, we obtain the loss function [equation not preserved] • SGD for optimization
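
The full ReTrend loss is not preserved in this transcript. As a rough illustration of the general recipe only (Gaussian observation noise plus zero-mean Gaussian priors yields a squared error with L2 penalties, minimized by SGD), here is a sketch for a single factorization C ≈ Y × A. The function names, learning rate `lr`, penalty weight `lam` and matrix layout are assumptions, not the authors' settings.

```python
import numpy as np


def pmf_loss(C, Y, A, lam):
    """Squared reconstruction error plus L2 penalties (from Gaussian priors)."""
    return (0.5 * np.sum((C - Y @ A) ** 2)
            + 0.5 * lam * (np.sum(Y ** 2) + np.sum(A ** 2)))


def sgd_factorize(C, k=2, lr=0.05, lam=0.01, epochs=500, seed=0):
    """Fit C ~= Y @ A by stochastic gradient descent over single entries."""
    rng = np.random.default_rng(seed)
    n_users, n_posts = C.shape
    Y = rng.normal(scale=0.1, size=(n_users, k))  # one row per user
    A = rng.normal(scale=0.1, size=(k, n_posts))  # one column per post
    entries = [(u, i) for u in range(n_users) for i in range(n_posts)]
    for _ in range(epochs):
        for idx in rng.permutation(len(entries)):
            u, i = entries[idx]
            err = C[u, i] - Y[u] @ A[:, i]
            y_old = Y[u].copy()  # use the pre-update value for A's gradient
            # The penalty is applied per sampled entry, a common simplification.
            Y[u] += lr * (err * A[:, i] - lam * Y[u])
            A[:, i] += lr * (err * y_old - lam * A[:, i])
    return Y, A


C = np.array([[0.0, 2.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 0.0]])
Y, A = sgd_factorize(C)
print(pmf_loss(C, Y, A, lam=0.01))
```

In the model described on the slides, several observed matrices share latent factors, so the actual objective would sum terms of this kind rather than fit one matrix in isolation.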

  23. ReTrend – Retweet-tree Encoding • How does ReTrend make better use of information? • Tree-structured essence of an information cascade: the retweet-tree [Figure: a retweet-tree]

  24. ReTrend – Retweet-tree Encoding • Subscription Matrix S (excerpt):
      0 1 1 0 1 0 1 0
      1 0 1 1 0 0 1 0
      0 0 0 1 1 0 0 1
      0 1 0 0 0 1 1 0
      1 0 0 1 0 0 0 1
      0 0 1 0 0 1 0 0
      0 1 0 0 1 0 1 0
      0 1 1 1 0 0 1 1

  25. ReTrend – Retweet-tree Encoding • Retweet Matrix [Figure: the retweet-tree encoded as one column of the Retweet Matrix R, with entries 0, 1, 1, 0, 1, 0, 0, 1]

  26. ReTrend – Retweet-tree Encoding • Contagion Matrix [Figure: the retweet-tree encoded as one column of the Contagion Matrix C, with entries 0, 0, 2, 0, 1, 0, 0, 0]
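
The encoding of a retweet-tree into the Retweet and Contagion matrices can be sketched as follows. The tree itself is not preserved in the transcript, so the edge list below is a hypothetical tree chosen to be consistent with the two columns shown on the slides. The sketch also assumes that a contagion entry counts direct children only, and that the original author (user 8 here) sits outside the eight users shown.

```python
import numpy as np


def encode_retweet_tree(edges, post, R, C):
    """Write one post's retweet-tree into column `post` of R and C.

    edges: iterable of (child, parent) user indices, meaning that
    `child` retweeted the post from `parent`.
    """
    for child, parent in edges:
        R[child, post] = 1    # child retweeted the post
        C[parent, post] += 1  # parent triggered one more retweet


n_users, n_posts = 9, 1
R = np.zeros((n_users, n_posts), dtype=int)
C = np.zeros((n_users, n_posts), dtype=int)

# Hypothetical tree: user 8 is the author; user 2 retweets from 8,
# users 1 and 4 retweet from 2, and user 7 retweets from 4.
edges = [(2, 8), (1, 2), (4, 2), (7, 4)]
encode_retweet_tree(edges, post=0, R=R, C=C)

print(R[:8, 0])  # retweet column for the first eight users
print(C[:8, 0])  # contagion column for the first eight users
```

For the first eight users this gives the retweet column 0, 1, 1, 0, 1, 0, 0, 1 and the contagion column 0, 0, 2, 0, 1, 0, 0, 0. Unlike the flat retweet column, the contagion column retains who triggered whose retweet.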

  27. ReTrend – Training • Dynamic inference on the most likely retweet-tree structure

  28. ReTrend – Training • Moreover, it is post-transcending

  29. Modification

  30. Matrix Factorization – Drawbacks • Simple, fixed inner product: low non-linearity [4] • Complex inference in low-dimensional latent space • Too many constraints • Pure linear operation: empirically low performance [Figure: ReTrend framework diagram]

  31. MLP Module – Optimization for MF • Replace multiplication with a simple MLP module • Increase non-linearity [Figure: left, Matrix A × Matrix B → Result; right, Matrix A and Matrix B → MLP Module → Result]

  32. MLP Module – Detail
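
The details of the MLP module are not preserved in this transcript. One common way to replace an inner product with an MLP is to concatenate the two latent vectors and pass them through a small feed-forward network. The sketch below assumes that design; the layer sizes, ReLU activation, sigmoid output and all names are hypothetical, not the authors' architecture.

```python
import numpy as np


def init_mlp(k, hidden, seed=0):
    """Random parameters for a one-hidden-layer MLP over two k-dim vectors."""
    rng = np.random.default_rng(seed)
    return {
        "W1": rng.normal(scale=0.1, size=(2 * k, hidden)),
        "b1": np.zeros(hidden),
        "W2": rng.normal(scale=0.1, size=(hidden, 1)),
        "b2": np.zeros(1),
    }


def mlp_interaction(p, q, params):
    """Score pairs of latent vectors with an MLP instead of p . q."""
    h = np.concatenate([p, q], axis=-1)                   # (..., 2k)
    h = np.maximum(0.0, h @ params["W1"] + params["b1"])  # ReLU hidden layer
    logit = h @ params["W2"] + params["b2"]               # (..., 1)
    return 1.0 / (1.0 + np.exp(-logit[..., 0]))           # sigmoid score


params = init_mlp(k=3, hidden=8)
p = np.ones((4, 3))  # four hypothetical user vectors
q = np.ones((4, 3))  # four hypothetical post vectors
print(mlp_interaction(p, q, params))
```

The hidden layer lets the score depend non-linearly on the two vectors, which a fixed inner product cannot do.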

  33. Experiments – Dataset • Real-world dataset from Twitter • More than 90,000 users and 99,696,204 related tweets [1][2] • 440,000+ subscriptions • 2,370,000+ retweet behaviors • 18,210,000+ non-retweet behaviors • 18,210,000+ resistance tuples • 2,170,000+ contagion tuples [1] https://www.aminer.cn/data-sna#Twitter-Dynamic-Net [2] https://www.aminer.cn/data-sna#Twitter-Dynamic-Action
