Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro - PowerPoint PPT Presentation

Outline Network metrics Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Universitat Polit` ecnica de Catalunya Version 0.4 Complex and Social Networks (20 20 -20 21 ) Master in Innovation and Research in Informatics (MIRI) Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Outline Network metrics Official website: www.cs.upc.edu/~csn/ Contact: ◮ Ramon Ferrer-i-Cancho, rferrericancho@cs.upc.edu, http://www.cs.upc.edu/~rferrericancho/ ◮ Argimiro Arratia, argimiro@cs.upc.edu, http://www.cs.upc.edu/~argimiro/ Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Outline Network metrics Network metrics Distance metrics Clustering metrics Degree correlation metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Network analysis Two major approaches: visual and statistical analysis (e.g., large scale properties). (from Webopedia) Statistical analysis: compression of information (e.g., one value that summarizes some aspect of the network). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Perspectives Metrics as compression of an adjacency matrix. Three perspectives: ◮ Distance between nodes. ◮ Transitivity ◮ Mixing (properties of vertices making an edge). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Geodesic path ◮ Geodesic path between two vertices u and v = shortest path between u and v [Newman, 2010] ◮ d ij : length of a geodesic path from the i -th to the j -th vertex (network or topological distance between i and j ). ◮ d ij = 1 if i and j are connected. ◮ ◮ d ij = ∞ if i and j are in different connected components . ◮ Computed with a breadth-first search algorithm (in unweighted undirected networks). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Local distance measures l i : mean geodesic distance from vertex i ◮ Definitions: N l i = 1 � d ij or N j =1 N 1 � l i = as d ii = 0 d ij N − 1 j =1( i � = j ) C i : closeness centrality of vertex i . ◮ Definition (harmonic mean) N 1 1 � C i = , N − 1 d ij j =1( i � = j ) as d ii = 0. ◮ Better than C ′ i = 1 / l i . Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Global distance metrics ◮ Diameter: largest geodesic distance. ◮ Mean (geodesic distance): N l = 1 � l i N i =1 ◮ Problem: l might be ∞ . ◮ Solutions: focus on the largest connected component, mean over l within each connected component, ... ◮ Mean closeness centrality: N C = 1 � C i N i =1 Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Global distance metrics ◮ Closeness measures have rarely been used (for historical reasons). ◮ The closeness centrality of a vertex can be seen as measure of the importance of a vertex (alternative approaches: degree, PageRank,...). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Transitivity Zachary’s Karate Club ◮ A relation ◦ is transitive if a ◦ b and b ◦ c imply a ◦ c . ◮ Example: a ◦ b = a and b are friends. ◮ Edges as relations. ◮ Perfect transitivity: clique (complete graph) but real network are not cliques. ◮ Big question: how transitive are (social) networks? Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Clustering coefficient ◮ A path of length two uvw is closed if u and w are connected. C = number of closed paths of length 2 number of paths of length 2 A proportion of transitive triples ◮ C = 1 perfect transitivity / C = 0 no transitivity ( e.g.,: ? ). ◮ Algorithm: Consider each vertex as v in the path uvw , checking if u and w are connected (only vertices of degree ≥ 2 matter). ◮ Number of paths of length 2 = ?. ◮ Equivalently: number of triangles × 3 C = number of connected triples of vertices ◮ Key: triangle = set of three nodes forming a clique; number of connected triples = number of labelled trees of 3 vertices Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Alternative clustering coefficient Watts & Strogatz (WS) clustering coefficient [Watts and Strogatz, 1998] ◮ Local clustering: C i = number of pairs of neighbors of i that are connected number of pairs of neighbours of i ◮ Assuming undirected graph without loops: � j − 1 � N k =1 a ij a ik a jk j =1 C i = � k i � 2 ◮ Global clustering: N C WS = 1 � C i N i =1 Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Comments on clustering coefficients I ◮ Given a network, C and C WS can differ substantially. ◮ C WS has been used very often for historical reasons ( C WS was proposed first). ◮ C is can be dominated by the contribution of vertices of high degree (which have many adjancent nodes). ◮ C WS is can be dominated by the contribution of vertices of low degree (which are many in the majority of networks). ◮ C WS needs taking further decision on C i when k i < 2 ( C is more elegant from a mathematical point of view). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Comments on clustering coefficients II ◮ Conclusion 0: C and C WS meassure transitivity in different ways (different assumptions/goals). ◮ Conclusion 1: each measure has its strengths and weaknesses. ◮ Conclusion 2: explain your methods with precision! Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Comments on efficient computation ◮ Computational challenge: time consuming computation of metrics on large networks. ◮ Solution: Monte Carlo methods for computing. ◮ Instead of computing N C WS = 1 � C i N i =1 estimate C WS from a mean of C i over a small fraction of randomly selected vertices. ◮ High precision exploring a small fraction of nodes (e.g., 5%). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Degree correlations I What is the dependency between the degrees of vertices at both ends of an edge? ◮ Assortative mixing (by degree): high degree nodes tend to be connected to high degree nodes, typical of social networks (coauthorship in physics, film actor collaboration,...). ◮ Disassortative mixing (by degree): high degree nodes tend to be connected to low degree nodes, e.g., neural network ( C. Elegans ), ecological networks (trophic relations). ◮ No tendency (e.g., Erd¨ os-R´ enyi graph, Barab´ asi-Albert model). Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Degree correlations II ◮ k i : degree of the i -th vertex. ◮ k ′ i = k i − 1: remaining degree of the i -th after discounting the edge i ∼ j . Correlation ◮ correlation between k i and k j for every edge i ∼ j . ◮ correlation between k ′ i and k ′ j for every edge i ∼ j . ◮ metric ρ : − 1 ≤ ρ ≤ 1. Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Interclass correlation Theoretical (interclass) correlation: COV ( X , Y ) ρ ( X , Y ) = σ X σ Y E [( X − E [ X ])( Y − E [ Y ])] = σ X σ Y E [ XY ] − E [ X ] E [ Y ] = σ X σ Y Symmetry: ρ ( X , Y ) = ρ ( Y , X ), ρ S ( X , Y ) = ρ S ( Y , X ). Empirical correlation: ◮ Paired mesurements: ( x 1 , y 1 ),...,( x i , y i ),...,( x n , y n ). ◮ Sample (interclass) correlation: � n i =1 ( x i − ¯ x )( y i − ¯ y ) ρ s ( X , Y ) = �� n x ) 2 �� n x ) 2 i =1 ( x i − ¯ i =1 ( y i − ¯ Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Distance metrics Outline Clustering metrics Network metrics Degree correlation metrics Intraclass correlation Theoretical intraclass correlation: ρ = COV intra ( X ) σ ( X ) 2 Empirical correlation: ◮ Paired measurements: ( x 1 , 1 , x 1 , 2 ),...,( x i , 1 , x i , 2 ),...,( x n , 1 , x n , 2 ) n 1 � ρ s = ( x i , 1 − ¯ x )( x i , 2 − ¯ x ) ( N − 1) σ 2 s i =1 n x = 1 � ¯ ( x i , 1 + x i , 2 ) 2 N i =1 n 1 x ) 2 + ( x i , 2 − ¯ � σ 2 x ) 2 � � s = ( x i , 1 − ¯ 2( N − 1) i =1 Ramon Ferrer-i-Cancho & Argimiro Arratia Introduction to network metrics

Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro - PowerPoint PPT Presentation

Outline Network metrics Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Universitat Polit` ecnica de Catalunya Version 0.4 Complex and Social Networks (20 20 -20 21 ) Master in Innovation and Research in Informatics

Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics

What we learned from Community Metrics Agenda Why are metrics used? How metrics are used

AGENCY OPERATIONS METRICS The Metrics of Me The Metrics of Me x 159 13,006 5 days old books

Proposal Metrics Dashboard What Gets Measured Gets Done Topics Why Keep Metrics? What

Software Metrics Alex Boughton Executive Summary What are software metrics? Why are

Astheno-Khler and strong KT General results metrics Bismut connection Definition of strong KT

NDCs and metrics Andrei Marcu , Director, ERCST 1 NDCs and metrics Main issues: - Which metrics

Metrics are Pivotal A NATIONAL FARM TO INSTITUTION METRICS COLLABORATIVE WEBINAR Local

Metrics and Estimation Rahul Premraj + Andreas Zeller 1 Metrics Quantitative measures that

Software Metrics And I gnominy Software Metrics And I gnominy Software Metrics And I gnominy

Software Metrics Chapter 4 1 SW Metrics SW process and product metrics are quantitative

Software Metrics Overview SE 350 Software Process & Product Quality Lecture Objectives

Hawkular Metrics Metric Storage & Alerting Stefan Negrea About Me Co-Creator of Hawkular

Opsim and MAF Metrics Lynne Jones Opsim and MAF metrics Call for white papers on LSST survey

CSA Z260 Pipeline Safety Metrics CSA Z260 - Pipeline Safety Metrics Provide a suite of

Open Metrics Journal Metrics Perspective from an Open Access Publisher Martin Fenner Technical

Interactions of light mesons with photons Stefan Leupold Uppsala University Meson 2014, Cracow,

Functional Parallel Don Syme Principal Researcher Microsoft Research, Cambridge Disclaimer

The Irresponsible Commercial Landscape: A Primer of What Journal Editors Need to Know! April 19,

Board Meeting The Falmouth Historical Society May 5, 2020 Agenda Local History

MAXIMUM MARGIN CLASSIFIERS MAXIMUM MARGIN CLASSIFIERS Matthieu R Bloch Tuesday, February 11,

Getting Lost or Getting Trapped: On the Effect of Moves to Incomparable Points in Multiobjective

Parents Briefing Secondary 2 Belief: Learn. Improve. Excel. GREENRIDGE SECONDARY SCHOOL

000000 000001 000010 000011 000100 000101 000110 000111 001000 000000 Reserve 2

Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro - PowerPoint PPT Presentation

Outline Network metrics Introduction to network metrics Ramon Ferrer-i-Cancho & Argimiro Arratia Universitat Polit` ecnica de Catalunya Version 0.4 Complex and Social Networks (20 20 -20 21 ) Master in Innovation and Research in Informatics

Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics

What we learned from Community Metrics Agenda Why are metrics used? How metrics are used

AGENCY OPERATIONS METRICS The Metrics of Me The Metrics of Me x 159 13,006 5 days old books

Proposal Metrics Dashboard What Gets Measured Gets Done Topics Why Keep Metrics? What

Software Metrics Alex Boughton Executive Summary What are software metrics? Why are

Astheno-Khler and strong KT General results metrics Bismut connection Definition of strong KT

NDCs and metrics Andrei Marcu , Director, ERCST 1 NDCs and metrics Main issues: - Which metrics

Metrics are Pivotal A NATIONAL FARM TO INSTITUTION METRICS COLLABORATIVE WEBINAR Local

Metrics and Estimation Rahul Premraj + Andreas Zeller 1 Metrics Quantitative measures that

Software Metrics And I gnominy Software Metrics And I gnominy Software Metrics And I gnominy

Software Metrics Chapter 4 1 SW Metrics SW process and product metrics are quantitative

Software Metrics Overview SE 350 Software Process &amp; Product Quality Lecture Objectives

Hawkular Metrics Metric Storage &amp; Alerting Stefan Negrea About Me Co-Creator of Hawkular

Opsim and MAF Metrics Lynne Jones Opsim and MAF metrics Call for white papers on LSST survey

CSA Z260 Pipeline Safety Metrics CSA Z260 - Pipeline Safety Metrics Provide a suite of

Open Metrics Journal Metrics Perspective from an Open Access Publisher Martin Fenner Technical

Interactions of light mesons with photons Stefan Leupold Uppsala University Meson 2014, Cracow,

Functional Parallel Don Syme Principal Researcher Microsoft Research, Cambridge Disclaimer

The Irresponsible Commercial Landscape: A Primer of What Journal Editors Need to Know! April 19,

Board Meeting The Falmouth Historical Society May 5, 2020 Agenda Local History

MAXIMUM MARGIN CLASSIFIERS MAXIMUM MARGIN CLASSIFIERS Matthieu R Bloch Tuesday, February 11,

Getting Lost or Getting Trapped: On the Effect of Moves to Incomparable Points in Multiobjective

Parents Briefing Secondary 2 Belief: Learn. Improve. Excel. GREENRIDGE SECONDARY SCHOOL

000000 000001 000010 000011 000100 000101 000110 000111 001000 000000 Reserve 2

Software Metrics Overview SE 350 Software Process & Product Quality Lecture Objectives

Hawkular Metrics Metric Storage & Alerting Stefan Negrea About Me Co-Creator of Hawkular