A Multi-Level Approach for Evaluating Internet Topology Generators - PowerPoint PPT Presentation

A Multi-Level Approach for Evaluating Internet Topology Generators Ryan Rossi 1 , Sonia Fahmy 1 , Nilothpal Talukder 1 , 2 1 Purdue University, IN 2 Rensselaer Polytechnic Institute, NY Email: { rrossi,fahmy } @cs.purdue.edu, talukn@cs.rpi.edu May 23, 2013 1 / 24

Motivation Why Topology Generators? ◮ Generate representative network topologies of different sizes ◮ Used for experiments to design protocols, predict performance, and understand robustness and scalability of the future Internet ◮ Unfortunately, many fail to capture static and evolutionary properties of today’s Internet, e.g., assume average path length and clustering coefficient constant Our goal: ◮ Determine how to quantitatively assess a generator through a multi-level hierarchy of graph, node and link measures ◮ Focus on 2 popular generators: Orbis [SIGCOMM06] and WIT [INFOCOM07] ◮ Validate using different views of the Internet: data (traceroute), control (BGP tables), and management (WHOIS) planes 2 / 24

Taxonomy of Topology Generators Generators Process Model Type Topology WIT Parametric AS Random-walks RSurfer Parametric N/A Orbis Data-driven AS & RL Optimization HOT Parametric RL Mod. HOT Parametric AS Parametric N/A AB Preferential BRITE Data-driven AS & RL Attachment Inet Parametric AS Parametric AS GLP Parametric AS & RL SWT Geometry GT-ITM Parametric AS & RL 3 / 24

Orbis Topology Generator [SIGCOMM 2006] ◮ Series of measures based on degree correlations ◮ The first few dK distributions are: ◮ 0 K (average degree) ◮ 1 K (degree distribution: P ( k ) = n ( k ) / n ) ◮ 2 K (joint degree distribution: P ( k 1 , k 2 ) = m ( k 1 , k 2 ) µ ( k 1 , k 2 ) / (2 m ), where µ ( k 1 , k 2 ) = 2 if k 1 = k 2 , otherwise 1) ◮ 3 K (wedges and triangles), etc. ◮ Fails to capture global characteristics ◮ d must be small in practice due to increasing complexity ◮ Relies on rescaling technique; inaccurate as topology becomes larger 4 / 24

WIT Topology Generator [INFOCOM 2007] ◮ Captures the “wealth” of ISPs over time ◮ Multiplicative stochastic process, u i ( t ) = λ i ( t ) u i ( t − 1), where u i ( t ) is the unscaled wealth and λ i ( t ) is an independent random variable ◮ w i ( t ) is the normalized wealth for node i , and z i ( t ) = C · d i ( t ) is the expense ◮ In each iteration, ◮ If w i ( t ) − z i ( t ) > C + T , place a link between the node i and an arbitrary node by randomly walking l -steps from i ◮ If w i ( t ) − z i ( t ) > − T , remove a random link of node i ◮ Threshold T is carefully chosen to avoid oscillation 5 / 24

Orbis versus WIT ◮ WIT attempts to model the evolution of the AS topology ◮ Fails when the underlying process and growth of the Internet change ◮ Orbis generates topologies that preserve a set of measures ◮ Fails if the set of characteristics is incomplete w.r.t. the actual AS topology ◮ What is the best representative set of local and global measures? 6 / 24

Network Properties Measure Importance in Computer Networks Degree Local Fault tolerance, local robustness Assortativity Clustering Path diversity, fault tolerance, local ro- coefficient bustness Distance Scalability, performance, protocol design Global Betweenness Traffic engineering, potential congestion points Eigenvector Network robustness, performance, clus- ters/hierarchy, traffic engineering 7 / 24

Measures Used The order of evaluation measures in terms of the difficulty of preservation Link ≥ Node ≥ Graph Graph Measures ◮ Traditional: Average degree, Assortativity coefficient, Average clustering and Average distance, etc. ◮ Additional: largest singular value ( λ 1 ), Network conductance ( λ 1 − λ 2 ), radius, and diameter, etc. Node Measures ◮ Traditional: Degree distribution, Clustering coefficient, distance, eccentricity, betweenness, etc. ◮ Additional: Network values, Scree Plots, K-walks, K-core, etc. 8 / 24

Measures Used (cont’d) Link Measures ◮ Order of the nodes with respect to the magnitude of their coordinates along the principal direction ◮ The closest k -approximation of the topology Community measures ◮ Louvain’s modularity 9 / 24

Measures Used (cont’d) Quantitative Measures ◮ Graph based: ◮ The normalized root-mean-square error (NRMSE) x − ˆ x ) 2 ] E [( � � x , ˆ D NRMSE ( � � x ) = . x , ˆ x , ˆ max( � � x ) − min( � � x ) ◮ Node based: ◮ Kolmogorov-Smirnov (KS): KS ( F 1 , F 2 ) = max x | F 1 ( x ) − F 2 ( x ) | . ◮ Kullback-Leibler (KL) divergence: P ( i ) ln P ( i ) � D KL ( P || Q ) = Q ( i ) . i 10 / 24

Learning Graph Measures [ReFeX, SIGKDD 2011] Instead of selecting a set of graph measures, we automatically learn a set of graph measures recursively. 1. Base set of measures. The process starts by computing degree (in/out/total edges) and egonet measures (in/out egonet). ◮ egonet includes the node, its neighbors, and any edges in the induced subgraph on these nodes. 2. Aggregate measures. The existing measures of a node are aggregated to create additional measures by taking the sum/mean of the neighbors (done in a recursive fashion). One simple measure is the mean value of the degree among all neighbors of a node. 3. Prune correlated measures. At each iteration, we test for redundant measures using a simple correlation test, and remove all measures that are highly correlated. 4. Stopping Criteria. Repeat steps 2-3 until no new measures are retained. 11 / 24

Evaluation Strategy 1. Given G ⋆ n of size n , generate same size graph G n s.t. M ( G n ) ≈ M ( G ⋆ n ) 2. Given G ⋆ n of size n , generate G m of size m where m ≥ n s.t. M ( G m ) ≈ M ( G ⋆ n ) 3. Given an ordered sequence G ⋆ t for t = 1 , 2 , ..., m , generate a corresponding sequence G t for t = 1 , 2 , ..., m s.t. G t is the same size as G ⋆ t and M ( G t ) ≈ M ( G ⋆ t ) 12 / 24

Datasets for Validation ◮ Skitter traceroute ◮ RouteViews’ BGP tables (RV) 1 ◮ RIPE’s WHOIS ◮ HOT ◮ RocketFuel 1 AS-level subgraphs for 2004-2012 13 / 24

Results: Graph Measures −0.18 0.26 4.35 −0.185 0.25 4.3 0.24 −0.19 4.25 0.23 −0.195 4.2 0.22 −0.2 4.15 0.21 −0.205 2005 2006 2007 2008 2009 2010 2011 2012 2005 2006 2007 2008 2009 2010 2011 2012 2005 2006 2007 2008 2009 2010 2011 2012 (a) Average Degree (b) Average Clustering (c) Assortativity 17 85 16 3.9 15 80 14 3.85 13 75 12 3.8 11 70 10 3.75 9 2005 2006 2007 2008 2009 2010 2011 2012 2005 2006 2007 2008 2009 2010 2011 2012 2005 2006 2007 2008 2009 2010 2011 2012 (d) Characteristic Path (e) Diameter (f) Largest Eigenvalue Length 14 / 24

Results: Node Measures 1 10 0 0.9 −1 10 0.8 0.7 RV RV −2 Orbis Orbis 10 0.6 WIT WIT CCDF CCDF 0.5 −3 10 0.4 0.3 −4 10 0.2 0.1 −5 10 0 0 1 2 3 4 10 10 10 10 10 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Degree Clustering Coefficient (a) Degree (b) Clustering Coefficient 1 0 10 0.9 0.8 0.7 RV −1 RV 10 Orbis 0.6 Orbis WIT WIT CCDF CCDF 0.5 0.4 10 −2 0.3 0.2 0.1 −3 0 10 0 5 10 15 0 5 10 15 20 25 30 Distance K−cores (c) Distance (d) K-cores 15 / 24

Results: Node Measures (a) Scree plot (b) Network values 16 / 24

Results: Link Measures (a) WHOIS (b) HOT (c) RocketFuel (d) Orbis (WHOIS) (e) Orbis (HOT) (f) Orbis (RocketFuel) 17 / 24

Results: Quantitative Measures Table : Quantitative Evaluation of Orbis using KS Distance. Deg. CC Ecc. Kcores PR EigDiff Net-Value Hot 0.009 0.000 0.000 0.078 0.067 0.588 0.131 RF 0.013 0.450 0.000 0.088 0.215 0.629 0.680 Whois 0.059 0.480 0.224 0.060 0.536 0.169 0.159 Skitter 0.010 0.211 0.029 0.009 0.342 0.096 0.182 18 / 24

Results: Community Measures Table : Evaluating the Community Structure of the Topologies. Communities Q Nodes Edges Degree CC RouteViews 24 0.65 3951 13360 3.38 0.45 957 2254 2.36 0.10 2004 Orbis 46 0.48 WIT 57 0.92 755 2653 3.51 0.64 RouteViews 34 0.68 6048 18496 3.06 0.22 2347 5640 2.40 0.12 2011 Orbis 60 0.48 WIT 66 0.94 2095 11727 5.60 0.45 Communities C-path Radius Diameter RouteViews 24 2.74 3 6 3.01 4 8 2004 Orbis 46 WIT 57 2.75 4 7 RouteViews 34 3.27 5 9 2.91 4 8 2011 Orbis 60 WIT 66 3.44 5 10 19 / 24

Results: Selected versus Learned Measures 0.45 Orbis (vs. RV) WIT (vs. RV) 0.4 Normalized RMSE 0.35 0.3 0.25 0.2 0.15 0.1 2005 2006 2007 2008 2009 2010 2011 (a) Selected Measures 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 2005 2006 2007 2008 2009 2010 2011 (b) Learned Measures 20 / 24

Results: Learned Graph Measures (a) RV (Internet) (b) Orbis (c) WIT 21 / 24

Conclusions ◮ We propose a multi-level framework for understanding Internet topologies, and evaluating generators (focus on Orbis, WIT) ◮ We leverage both macro measures (graph) and micro measures (node and link measures) to accurately compare topologies ◮ We show that the existing generators fail to capture static and evolutionary properties of the Internet AS topology ◮ Data-driven generators generate static topologies with little or no variance ◮ Parametric generators typically cannot accurately model Internet evolution 22 / 24

A Multi-Level Approach for Evaluating Internet Topology Generators - PowerPoint PPT Presentation

A Multi-Level Approach for Evaluating Internet Topology Generators Ryan Rossi 1 , Sonia Fahmy 1 , Nilothpal Talukder 1 , 2 1 Purdue University, IN 2 Rensselaer Polytechnic Institute, NY Email: { rrossi,fahmy } @cs.purdue.edu, talukn@cs.rpi.edu May

Evaluating Soul City Evaluating Soul City A multi level A multi level Communication for Social

Level Set Method Applied to Topology Optimization David Herrero P erez February 2012 Level

iToM: An Internet Topology Mapping Project Kamil Sarac (ksarac@utdallas.edu) Department of

NETWORK LAYER INTERNET TOPOLOGY CONSTRUCTION M. Engin Tozal & Kamil Sarac The University of

Topological data analysis and topology-based visualization Leila De Floriani Topology-based

Evaluating the fully automatic multi Evaluating the fully automatic multi- g g y y -language

Large-Scale Topology Discovery Benoit Donnet, Timur Friedman LIP6-CNRS laboratory, UPMC, Paris

Topology Discovery Correlating different network topology layers in heterogeneous environments

Combinatorics and topology of toric arrangements II. Topology of arrangements in the complex torus

Order Topology Definition Let ( X , < ) be an ordered set. Then the order topology on X is the

I2RS Service Topology Draft-hares-i2rs-service-topo-dm-05 I2RS Service Topology Model Why

Cheleby: Subnet Level Internet Topology Mehmet Hadi Gunes with Hakan Kardes and Mehmet B. Akgun

A Study on Traceroute Potentiality in Revealing the Internet AS-level Topology V. Luconi A.

PowerWizard Level 1.0 & Level 2.0 Control Systems Training Systems Comparison Level 2

CS 457 Networking and the Internet Fall 2016 The Global Internet (Then) The tree structure of

Multi-Source Adjustment of Multi-Layer Annotation: the Bits of Wisdom Approach Kilian Evang 20

Design of Robust CAN-FD Networks An automated Model based Design Flow Federico Pereira 1 Agenda

Bayesian Networks Youve heard about how Bayesian networks have revolutionized AI

Network/graph theory Dr. Chris Davis Postdoc, Faculty of TPM, TU Delft Network/graph theory

Ad Hoc Network Visualization Module Name: Tactical Network Visualization with VisualCyberVAN

Network Economics and Security Engineering Tyler Moore (joint with Ross Anderson and Shishir

Skydive An analyzer for network topology and traffic Sylvain Baubeau Sylvain Afchain Skydive

Prescriptive Topology Daemon Dinesh G Dutt, Pradosh Mohapatra Cumulus Networks Problem statement

Guide to Networking Essentials Fifth Edition Chapter 2 Network Design Essentials Objectives

A Multi-Level Approach for Evaluating Internet Topology Generators - PowerPoint PPT Presentation

A Multi-Level Approach for Evaluating Internet Topology Generators Ryan Rossi 1 , Sonia Fahmy 1 , Nilothpal Talukder 1 , 2 1 Purdue University, IN 2 Rensselaer Polytechnic Institute, NY Email: { rrossi,fahmy } @cs.purdue.edu, talukn@cs.rpi.edu May

Evaluating Soul City Evaluating Soul City A multi level A multi level Communication for Social

Level Set Method Applied to Topology Optimization David Herrero P erez February 2012 Level

iToM: An Internet Topology Mapping Project Kamil Sarac (ksarac@utdallas.edu) Department of

NETWORK LAYER INTERNET TOPOLOGY CONSTRUCTION M. Engin Tozal &amp; Kamil Sarac The University of

Topological data analysis and topology-based visualization Leila De Floriani Topology-based

Evaluating the fully automatic multi Evaluating the fully automatic multi- g g y y -language

Large-Scale Topology Discovery Benoit Donnet, Timur Friedman LIP6-CNRS laboratory, UPMC, Paris

Topology Discovery Correlating different network topology layers in heterogeneous environments

Combinatorics and topology of toric arrangements II. Topology of arrangements in the complex torus

Order Topology Definition Let ( X , &lt; ) be an ordered set. Then the order topology on X is the

I2RS Service Topology Draft-hares-i2rs-service-topo-dm-05 I2RS Service Topology Model Why

Cheleby: Subnet Level Internet Topology Mehmet Hadi Gunes with Hakan Kardes and Mehmet B. Akgun

A Study on Traceroute Potentiality in Revealing the Internet AS-level Topology V. Luconi A.

PowerWizard Level 1.0 &amp; Level 2.0 Control Systems Training Systems Comparison Level 2

CS 457 Networking and the Internet Fall 2016 The Global Internet (Then) The tree structure of

Multi-Source Adjustment of Multi-Layer Annotation: the Bits of Wisdom Approach Kilian Evang 20

Design of Robust CAN-FD Networks An automated Model based Design Flow Federico Pereira 1 Agenda

Bayesian Networks Youve heard about how Bayesian networks have revolutionized AI

Network/graph theory Dr. Chris Davis Postdoc, Faculty of TPM, TU Delft Network/graph theory

Ad Hoc Network Visualization Module Name: Tactical Network Visualization with VisualCyberVAN

Network Economics and Security Engineering Tyler Moore (joint with Ross Anderson and Shishir

Skydive An analyzer for network topology and traffic Sylvain Baubeau Sylvain Afchain Skydive

Prescriptive Topology Daemon Dinesh G Dutt, Pradosh Mohapatra Cumulus Networks Problem statement

Guide to Networking Essentials Fifth Edition Chapter 2 Network Design Essentials Objectives

NETWORK LAYER INTERNET TOPOLOGY CONSTRUCTION M. Engin Tozal & Kamil Sarac The University of

Order Topology Definition Let ( X , < ) be an ordered set. Then the order topology on X is the

PowerWizard Level 1.0 & Level 2.0 Control Systems Training Systems Comparison Level 2