introduction to network metrics
play

Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro - PowerPoint PPT Presentation

Outline Network metrics Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Universitat Polit` ecnica de Catalunya Version 0.4 Complex and Social Networks (20 20 -20 21 ) Master in Innovation and Research in Informatics


  1. Outline Network metrics Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Universitat Polit` ecnica de Catalunya Version 0.4 Complex and Social Networks (20 20 -20 21 ) Master in Innovation and Research in Informatics (MIRI) Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  2. Outline Network metrics Official website: www.cs.upc.edu/~csn/ Contact: ◮ Ramon Ferrer-i-Cancho, rferrericancho@cs.upc.edu, http://www.cs.upc.edu/~rferrericancho/ ◮ Argimiro Arratia, argimiro@cs.upc.edu, http://www.cs.upc.edu/~argimiro/ Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  3. Outline Network metrics Network metrics Distance metrics Clustering metrics Degree correlation metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  4. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Network analysis Two major approaches: visual and statistical analysis (e.g., large scale properties). (from Webopedia) Statistical analysis: compression of information (e.g., one value that summarizes some aspect of the network). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  5. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Perspectives Metrics as compression of an adjacency matrix. Three perspectives: ◮ Distance between nodes. ◮ Transitivity ◮ Mixing (properties of vertices making an edge). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  6. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Geodesic path ◮ Geodesic path between two vertices u and v = shortest path between u and v [Newman, 2010] ◮ d ij : length of a geodesic path from the i -th to the j -th vertex (network or topological distance between i and j ). ◮ d ij = 1 if i and j are connected. ◮ ◮ d ij = ∞ if i and j are in different connected components . ◮ Computed with a breadth-first search algorithm (in unweighted undirected networks). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  7. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Local distance measures l i : mean geodesic distance from vertex i ◮ Definitions: N l i = 1 � d ij or N j =1 N 1 � l i = as d ii = 0 d ij N − 1 j =1( i � = j ) C i : closeness centrality of vertex i . ◮ Definition (harmonic mean) N 1 1 � C i = , N − 1 d ij j =1( i � = j ) as d ii = 0. ◮ Better than C ′ i = 1 / l i . Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  8. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Global distance metrics ◮ Diameter: largest geodesic distance. ◮ Mean (geodesic distance): N l = 1 � l i N i =1 ◮ Problem: l might be ∞ . ◮ Solutions: focus on the largest connected component, mean over l within each connected component, ... ◮ Mean closeness centrality: N C = 1 � C i N i =1 Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  9. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Global distance metrics ◮ Closeness measures have rarely been used (for historical reasons). ◮ The closeness centrality of a vertex can be seen as measure of the importance of a vertex (alternative approaches: degree, PageRank,...). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  10. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Transitivity Zachary’s Karate Club ◮ A relation ◦ is transitive if a ◦ b and b ◦ c imply a ◦ c . ◮ Example: a ◦ b = a and b are friends. ◮ Edges as relations. ◮ Perfect transitivity: clique (complete graph) but real network are not cliques. ◮ Big question: how transitive are (social) networks? Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  11. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Clustering coefficient ◮ A path of length two uvw is closed if u and w are connected. C = number of closed paths of length 2 number of paths of length 2 A proportion of transitive triples ◮ C = 1 perfect transitivity / C = 0 no transitivity ( e.g.,: ? ). ◮ Algorithm: Consider each vertex as v in the path uvw , checking if u and w are connected (only vertices of degree ≥ 2 matter). ◮ Number of paths of length 2 = ?. ◮ Equivalently: number of triangles × 3 C = number of connected triples of vertices ◮ Key: triangle = set of three nodes forming a clique; number of connected triples = number of labelled trees of 3 vertices Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  12. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Alternative clustering coefficient Watts & Strogatz (WS) clustering coefficient [Watts and Strogatz, 1998] ◮ Local clustering: C i = number of pairs of neighbors of i that are connected number of pairs of neighbours of i ◮ Assuming undirected graph without loops: � j − 1 � N k =1 a ij a ik a jk j =1 C i = � k i � 2 ◮ Global clustering: N C WS = 1 � C i N i =1 Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  13. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Comments on clustering coefficients I ◮ Given a network, C and C WS can differ substantially. ◮ C WS has been used very often for historical reasons ( C WS was proposed first). ◮ C is can be dominated by the contribution of vertices of high degree (which have many adjancent nodes). ◮ C WS is can be dominated by the contribution of vertices of low degree (which are many in the majority of networks). ◮ C WS needs taking further decision on C i when k i < 2 ( C is more elegant from a mathematical point of view). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  14. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Comments on clustering coefficients II ◮ Conclusion 0: C and C WS meassure transitivity in different ways (different assumptions/goals). ◮ Conclusion 1: each measure has its strengths and weaknesses. ◮ Conclusion 2: explain your methods with precision! Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  15. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Comments on efficient computation ◮ Computational challenge: time consuming computation of metrics on large networks. ◮ Solution: Monte Carlo methods for computing. ◮ Instead of computing N C WS = 1 � C i N i =1 estimate C WS from a mean of C i over a small fraction of randomly selected vertices. ◮ High precision exploring a small fraction of nodes (e.g., 5%). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  16. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Degree correlations I What is the dependency between the degrees of vertices at both ends of an edge? ◮ Assortative mixing (by degree): high degree nodes tend to be connected to high degree nodes, typical of social networks (coauthorship in physics, film actor collaboration,...). ◮ Disassortative mixing (by degree): high degree nodes tend to be connected to low degree nodes, e.g., neural network ( C. Elegans ), ecological networks (trophic relations). ◮ No tendency (e.g., Erd¨ os-R´ enyi graph, Barab´ asi-Albert model). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  17. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Degree correlations II ◮ k i : degree of the i -th vertex. ◮ k ′ i = k i − 1: remaining degree of the i -th after discounting the edge i ∼ j . Correlation ◮ correlation between k i and k j for every edge i ∼ j . ◮ correlation between k ′ i and k ′ j for every edge i ∼ j . ◮ metric ρ : − 1 ≤ ρ ≤ 1. Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  18. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Interclass correlation Theoretical (interclass) correlation: COV ( X , Y ) ρ ( X , Y ) = σ X σ Y E [( X − E [ X ])( Y − E [ Y ])] = σ X σ Y E [ XY ] − E [ X ] E [ Y ] = σ X σ Y Symmetry: ρ ( X , Y ) = ρ ( Y , X ), ρ S ( X , Y ) = ρ S ( Y , X ). Empirical correlation: ◮ Paired mesurements: ( x 1 , y 1 ),...,( x i , y i ),...,( x n , y n ). ◮ Sample (interclass) correlation: � n i =1 ( x i − ¯ x )( y i − ¯ y ) ρ s ( X , Y ) = �� n x ) 2 �� n x ) 2 i =1 ( x i − ¯ i =1 ( y i − ¯ Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

  19. Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Intraclass correlation Theoretical intraclass correlation: ρ = COV intra ( X ) σ ( X ) 2 Empirical correlation: ◮ Paired measurements: ( x 1 , 1 , x 1 , 2 ),...,( x i , 1 , x i , 2 ),...,( x n , 1 , x n , 2 ) n 1 � ρ s = ( x i , 1 − ¯ x )( x i , 2 − ¯ x ) ( N − 1) σ 2 s i =1 n x = 1 � ¯ ( x i , 1 + x i , 2 ) 2 N i =1 n 1 x ) 2 + ( x i , 2 − ¯ � σ 2 x ) 2 � � s = ( x i , 1 − ¯ 2( N − 1) i =1 Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend