hierarchically clustering time directed graphs and the
play

Hierarchically clustering time-directed graphs and the effects of - PowerPoint PPT Presentation

Hierarchically clustering time-directed graphs and the effects of teleportation and memory Jevin West, Information School, University of Washington Network Clustering Graph Partitioning Community Detection Block Models Module Detection


  1. Hierarchically clustering time-directed graphs and the effects of teleportation and memory Jevin West, Information School, University of Washington

  2. Network Clustering Graph Partitioning Community Detection Block Models Module Detection

  3. http://www.iloveaba.com/2015/07/no-one-size-does-not-fit-all.html

  4. No one size fits all • No canonical solution or one generalizable method for all data and all problems (i.e. there is no method that works best on all networks in all situations) • Need to know the context for why the user is interested in clustering • We don’t even have a definition of a community • Umbrella term for many facets Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  5. No one size fits all Cu Cut-bas based: d: community detection as minimization of some form of constraint violation Da Data clus ustering ng: community detection framed as a discretized analogue of data clustering, in which densely knit groups of nodes are to be found Sto Stochas asti tic equival valence: community detection aiming to identify structurally equivalent nodes in a network, leading to notions such as stochastic block models Dy Dyna namics perspective: community detection looking for simplified descriptions of the dynamical flows occurring on the network, that is, some form of dynamical model reduction Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  6. Hierarchical Herd Immunity

  7. Community Detection Perspectives Circuit layout Data Clustering Social Networks System behavior, processes Minimizing cuts Maximizing node density Connectivity Profiles Non-adjacency focused Load balancing unknown k, unbalanced Stochastic equivalence Airline network Eigenvectors Conductance SBMs, LFR Markovian diffusion process Spectral methods Local, global p-values, hypothesis testing Undirected, Directed Image segmentation Modularity Bipartite treatment InfoMap Predict missing links Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  8. Community Detection Perspectives Circuit layout Data Clustering Social Networks System behavior, processes Minimizing cuts Maximizing node density Connectivity Profiles Non-adjacency focused Load balancing unknown k, unbalanced Stochastic equivalence Airline network Eigenvectors Conductance SBMs, LFR Markovian diffusion process Spectral methods Local, global p-values, hypothesis testing Undirected, Directed Image segmentation Modularity Bipartite treatment InfoMap Predict missing links Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  9. Community Detection Perspectives Circuit layout Data Clustering Social Networks System behavior, processes Minimizing cuts Maximizing node density Connectivity Profiles Non-adjacency focused Load balancing unknown k, unbalanced Stochastic equivalence Airline network Eigenvectors Conductance SBMs, LFR Markovian diffusion process Spectral methods Local, global p-values, hypothesis testing Undirected, Directed Image segmentation Modularity Bipartite treatment InfoMap Predict missing links Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  10. Community Detection Perspectives Circuit layout Data Clustering Social Networks System behavior, processes Minimizing cuts Maximizing node density Connectivity Profiles Non-adjacency focused Load balancing unknown k, unbalanced Stochastic equivalence Airline network Eigenvectors Conductance SBMs, LFR Markovian diffusion process Spectral methods Local, global p-values, hypothesis testing Undirected, Directed Image segmentation Modularity Bipartite treatment InfoMap Predict missing links Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  11. Higher Resolution Maps Rosvall et al. (2014) Memory in network flows and its effects on spreading dynamics and community detection. Nature Communications

  12. In the spirit of clustering context…

  13. The Scholarly Graph

  14. Tens of millions articles, patents, books Billions of citation links Years: 1600 – 2016 1. Mapping Knowledge Domains 2. Science of Science 3. Hierarchical Navigation 4. Recommendation

  15. 1 Mapping Knowledge Domains Rosvall, Martin, and Carl T. Bergstrom. "Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems." PloS one 6.4 (2011): e18209.

  16. 2 The Role of Gender in Science West, J.D. (2012) The Role of Gender in Scholarly Authorship. PLoS One

  17. 3 Hierarchical Navigation

  18. Recommendation 4 Expert Classic West, Wesley-Smith, Bergstrom (2016) A recommendation system based on hierarchical clustering of an article-level citation network. IEEE, Transactions on Big Data (in press)

  19. Community Detection Perspectives Circuit layout Data Clustering Social Networks System behavior, processes Minimizing cuts Maximizing node density Connectivity Profiles Non-adjacency focused Load balancing unknown k, unbalanced Stochastic equivalence Airline network Eigenvectors Conductance SBMs, LFR Markovian diffusion process Spectral methods Local, global p-values, hypothesis testing Undirected, Directed Image segmentation Modularity Bipartite treatment InfoMap Predict missing links Schaub, M.T. et al. (2017) The many facets of community detection in complex networks . Applied Network Science

  20. Finding regularities in citation networks Rosvall and Bergstrom (2008) PNAS

  21. The Emergence of Neuroscience Rosvall and Bergstrom (2010) PLoS One

  22. Data Compressing Finding patterns If we can find a good code for describing flow on a network, we will have solved the dual problem of finding the important structures with respect to that flow.

  23. The map equation frequency of inter-module movements frequency of movements within module i code length of module names code length of node names in module i Rosvall and Bergstrom (2008) PNAS

  24. Mapequation.org, Daniel Edler

  25. The relationship between ranking and clustering Clustering Ranking Dynamics Structure

  26. Step Length, Teleportation and Memory ..and their effects on ranking and clustering

  27. Memory: capturing higher order dynamics Rosvall et al. (2014) Memory in network flows and its effects on spreading dynamics and community detection. Nature Communications

  28. Memory: capturing higher order dynamics Rosvall et al. (2014) Memory in network flows and its effects on spreading dynamics and community detection. Nature Communications

  29. Higher Resolution Maps Rosvall et al. (2014) Memory in network flows and its effects on spreading dynamics and community detection. Nature Communications

  30. Higher Order Dynamics Rosvall et al. (2014) Memory in network flows and its effects on spreading dynamics and community detection. Nature Communications

  31. Citation Networks Types Journal-Level Networks (Memory) Article-level Networks Time-Directed (Acyclic) Graphs

  32. PageRank Variants (EigenFactor) + (1 − α ) a.e T P = α H Matrix representing the Probability of teleporting Probability of random walk over citations to completely new journal not teleporting weighted by the number Cross-citation Matrix of articles in that journal dictating the structure of the citation network Leading eigenvector H π of the random walk EF = 100 matrix P. ∑ [ H π ] i i Normalization West, JD et al. (2010) College of Research Libraries

  33. PageRank Pitfalls Maslov, S. & Redner, S. (2008) Promise and Pitfalls of Extending Google’s PageRank Algorithm to Citation Networks. The Journal of Neuroscience

  34. Teleportation Strategies ) ⍺ - 1 DIR-R ( PageRank ) ( d r o c e r S S don’t record E E D D DIR-UR ( EigenFactor ) O O N N in-degree teleport other in-out L L record other I I N N K K S S out-degree other d o n in-degree ’ t INDIR:DIR r e c o r d in-out UNDIR:DIR o u t - d e g r e e OUTDIR-DIR (Count Links)

  35. Smart Teleportation Lambiotte, R. & Rosvall, M. (2012) Ranking and clustering of nodes in networks with smart teleportation

  36. Smart Teleportation and Clustering Lambiotte, R. & Rosvall, M. (2012) Ranking and clustering of nodes in networks with smart teleportation

  37. Article-level Ranking and Mapping DIR-R ( PageRank ) UNDIR:DIR Smooths ranking ~ better clustering West et al. (2016) Ranking and mapping article-level citation networks. in prep.

  38. Teleportation Strategies ) α – 1 DIR-R ( PageRank ) ( d r o c e r S S don’t record E E D D DIR-UR ( EigenFactor ) O O N N in-degree teleport other total L L record other I I N N K K S S out-degree other d o n in-degree ’ t INDIR:DIR r e c o r d total UNDIR:DIR o u t - d e g r e e OUTDIR-DIR (Count Links)

  39. Article-level Eigenfactor

  40. Running Experiments

  41. Clustering on time-directed networks • Empirical exploration of hierarchical partitions with varying dynamics • The effects of changing recorded teleportation ranking and clustering Ranking Effects Clustering Effects

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend