Revisiting AS-Level Graph Reduction The Eighth IEEE International - PowerPoint PPT Presentation

Revisiting AS-Level Graph Reduction The Eighth IEEE International Workshop on Network Science for Communication Networks Erik C. Rye ∗ , Justin P. Rohrer + , Robert Beverly + ∗ US Naval Academy Annapolis, MD + Naval Postgraduate School Monterey, CA 11 April 2016 1 / 15

Outline Motivation and Prior Work 1 Motivation Prior Work Methodology 2 k -core Reductions Results 3 Conclusions 4 2 / 15

Motivation Long-standing need to model macroscopic behavior of the Internet e.g., at the Autonomous System (AS) level: ISPs as nodes and links as their (complex) interconnection ◮ Evaluate new routing protocol ◮ Understand provider filtering (BCP38, SBGP, etc) ◮ Active topology mapping (our particular motivation) But. . . ◮ Size of entire-Internet AS graph makes emulation infeasible and simulation difficult ◮ Thus, a need for smaller, “representative” Internet models exists ◮ But what is representative? ⋆ Degree distribution? Clustering? Avg. path len? ◮ And how? ⋆ Constructive – build graph from ground-up ⋆ Reductive – begin with AS graph, pare down 3 / 15

Our Contribution Our Contribution Re-evaluation of prior sampling (reductive) algorithm on multiple modern Internet graphs Development of new graph sampling algorithms that out perform existing techniques on modern Internet graphs 4 / 15

Prior Work Lots of prior work on Constructive Internet graph generators We focus on reduction: ◮ Cem et al. – Induced Random Vertex, Random Walk, Random Edge sampling on varied networks ◮ Vaquero et al. – Breadth-First Search to reduce backbone AS architecture for end-to-end delay estimation ◮ Krishnamurthy, Faloutsos 5 / 15

Prior Work Continued Krishnamurthy, Faloutsos, et al. . ◮ Sampling large Internet topologies for simulation purposes ◮ Start with May 2001 AS-level graphs of the Internet ◮ Data obtained passively, obtained from RouteViews Border Gateway Protocol (BGP) Router Information Base (RIB) dumps ◮ Reduce these graphs using 16 different methodologies to target reduction order – Jan 1998 Internet instance ◮ Compare fidelity of reduced graphs to Jan 1998 Internet graph metrics Use their methodology as a starting point. . . . . . but draw from chronologically newer data . . . expand data sources . . . and improve with new algorithm 6 / 15

Methodology We successfully replicate the results of Krishnamurthy et al. : Contraction ◮ Contract two endpoints of an edge together into new node ◮ New node retains all edges incident to original two nodes Deletion ◮ Delete randomly selected node or edge ◮ How we pick edges, in particular, affects resultant topology Exploration ◮ Use Breadth/Depth - First Search strategies We consider the same methods, and introduce two novel sampling strategies based on the graph’s k -core. 7 / 15

k -core Reduction Our approach Prior work shows k -cores of the AS-level Internet graph exhibits self-similarity to complete AS-level Internet graph (Alvarez-Hamelin et al. , Zhou et al. ) Implement reduction by computing successive k -cores (until ( k + 1) st -core contains too few vertices), then either: ◮ KDD: reduce by removing edge incident to random vertex, then delete random edges. ◮ KKD: or reduce by removing nodes with degree k to meet vertex count, then delete random edges. 8 / 15

Methodology Continued We compare the reduced graphs (of the Jan 1998 Internet order) to the actual Jan 1998 AS-level Internet graph in the following metrics: ◮ Average degree ◮ Clustering, using the 100 largest eigenvalues of normalized adjacency matrix (normalized graph spectra) ◮ Hop-plot (% of vertex pairs reachable within x hops along a geodesic) ◮ Degree distribution First three metrics studied in prior work; degree dist. added for more fine-grained degree comparison 9 / 15

Data Sets Dataset Source Construction Time Frame RV1 RouteViews Observed AS PATH 01/1998 - 05/2001 RV2 RouteViews Observed AS PATH 01/1998 - 12/2014 CAIDA1 CAIDA ITDK Traceroute 01/1998 - 05/2001 CAIDA2 CAIDA ITDK Traceroute 01/1998 - 12/2014 10 / 15

Summary of Results RV1 RV2 CAIDA1 CAIDA2 DHYB-0.7 DHYB-0.6 DHYB-0.6 DHYB-0.1 Avg. Deg DHYB-0.8 DHYB-0.6 DHYB-0.7 DHYB-0.2 KDD KDD DHYB-0.8 KDD Spectral DHYB-0.7 DHYB-0.6 KKD KKD EDFS DHYB-0.7 KDD DHYB-0.1 DHYB-0.7 EDFS KDD DHYB-0.3 DHYB-0.7 DHYB-0.7 DRV KDD Hop Plot DHYB-0.8 DHYB-0.6 DHYB-0.6 EDFS DHYB-0.6 DRV DHYB-0.4 KDD DHYB-0.1 KKD KKD KKD DHYB-0.7 KDD DHYB-0.5 DRE Deg. Dist. DHYB-0.6 DHYB-0.5 DHYB-0.4 DRV KDD DHYB-0.6 DHYB-0.6 KKD 11 / 15

Summary of Results RV1 RV2 CAIDA1 CAIDA2 DHYB-0.7 DHYB-0.6 DHYB-0.6 DHYB-0.1 Avg. Deg By construction, DHYB-0.8 DHYB-0.6 DHYB-0.7 DHYB-0.2 KDD and KKD KDD KDD DHYB-0.8 KDD Spectral match avg. DHYB-0.7 DHYB-0.6 KKD KKD degree exactly EDFS DHYB-0.7 KDD DHYB-0.1 While DHYB DHYB-0.7 EDFS KDD DHYB-0.3 does well, it is DHYB-0.7 DHYB-0.7 DRV KDD Hop Plot sensitive to DHYB-0.8 DHYB-0.6 DHYB-0.6 EDFS parameterization DHYB-0.6 DRV DHYB-0.4 KDD Our algorithms DHYB-0.1 KKD KKD KKD DHYB-0.7 KDD DHYB-0.5 perform well DRE Deg. Dist. w/o parameters DHYB-0.6 DHYB-0.5 DHYB-0.4 DRV KDD DHYB-0.6 DHYB-0.6 KKD 12 / 15

Reduced Graph Spectra 1.00 0.95 0.90 Eigenvalue 0.85 0.80 0.75 20 40 60 80 100 Order DHYB-0.6 DHYB-0.8 DRV EDFS KDD DHYB-0.7 DRE DRVE Internet KKD (See paper for full metrics comparison) Spectra of KDD closely matches target Internet instance What about other time periods and data sources? 13 / 15

Reduced Graph Spectra 1.00 1.00 0.95 0.95 0.90 0.90 Eigenvalue Eigenvalue 0.85 0.85 0.80 0.80 0.75 0.75 20 40 60 80 100 20 40 60 80 100 Order Order DHYB-0.6 DHYB-0.8 DRV EDFS KDD DHYB-0.6 DHYB-0.8 DRV EDFS KDD DHYB-0.7 DRE DRVE Internet KKD DHYB-0.7 DRE DRVE Internet KKD RV1 CAIDA1 1.00 1.00 0.95 0.95 0.90 0.90 Eigenvalue Eigenvalue 0.85 0.85 0.80 0.80 0.75 0.75 20 40 60 80 100 20 40 60 80 100 Order Order DHYB-0.5 DHYB-0.7 DRV EBFS KDD CRE DHYB-0.2 DRE DRVE KDD DHYB-0.6 DRE DRVE Internet KKD DHYB-0.1 DHYB-0.3 DRV Internet KKD RV2 CAIDA2 14 / 15

Conclusions Previous best reduction methods differ considerably across time periods and AS-graph inference methods ◮ DHYB often a good choice, but probability values fluctuate wildly Leveraging Internet AS graph properties more promising than random deletion methods ◮ k -core-based reduction algorithms consistently in top 4 reduction methods across data sources and time frames ◮ k -core reduction methods match average degree of target graph precisely Our implementation is publicly available at https://github.com/cmand/graphreduce Thanks! Questions? 15 / 15

Revisiting AS-Level Graph Reduction The Eighth IEEE International - PowerPoint PPT Presentation

Revisiting AS-Level Graph Reduction The Eighth IEEE International Workshop on Network Science for Communication Networks Erik C. Rye , Justin P. Rohrer + , Robert Beverly + US Naval Academy Annapolis, MD + Naval Postgraduate School

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Revisiting the Estim ation of the Revisiting the Estim ation of the Marginal Cost of Highw ay

Historical Spaces Historical Spaces Revisiting revolution memory and Revisiting revolution

Revisiting Magnetic Field Limits in Revisiting Magnetic Field Limits in Quadrupoles Arising From

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Graph Indexing: Tree + Delta Delta >= Graph >= Graph Graph Indexing: Tree + Peixian Zhao,

Graph Mining Marco Serafini COMPSCI 532 Lecture 11 Classes of Graph Systems Graph

PowerWizard Level 1.0 & Level 2.0 Control Systems Training Systems Comparison Level 2

Introduction to Harm Reduction Definition of Harm Reduction Harm reduction refers to policies,

B ohm Reduction in Infinitary Term Graph Rewriting Systems Patrick Bahr IT University of

Completed Rehab of Level 1 and Level 3 Completed Bypass Adit and Entry into Level 1

Split clique graph complexity L. Alcn and M. Gutierrez La Plata, Argentina L. Faria and C. M.

XL1C: Graph Times-Series Using Ratio Display 3/9/2017 V0D XL1C: V0D XL1C: V0D Graph by Time

XL1A: Graph Nominal Frequency Data Using Excel2013 3/10/2017 V0E XL1A: V0E XL1A: V0E Graph

Graph Sparsifiers Smaller graph that (approximately) preserves the values of some set of

Graph Data Processing M. Tamer Ozsu 1 / 75 Outline Introduction RDF Graph Querying

27 August 2014 Company Announcements Office Australian Securities Exchange Limited 20 Bridge

Harald Wilhelm Chief Financial Officer Safe Harbour Statement 2 DISCLAIMER This presentation

Equity Raising Presentation 21 November 2019 NOT FOR DISTRIBUTION OR RELEASE IN THE UNITED STATES

Ossen Innovation Co., Ltd. (NASDAQ: OSN) Investors Presentation October 2016 Safe Harbor

Summary of the Material Sector Business Briefing, held on September 8, 2016 Asahi Kasei Corp.

Pollutant Transfer Coefficient in Street Canyons of Different Aspect Ratios Tracy N.H. Chung

with Constrained Layer Damping Systems RAM 6 Workshop October 15 & 16, 2013 Paul Riehle

Turbulence, Heat and Turbulence, Heat and (Editors) Mass Transfer Mass Transfer 6 6 Edited

Revisiting AS-Level Graph Reduction The Eighth IEEE International - PowerPoint PPT Presentation

Revisiting AS-Level Graph Reduction The Eighth IEEE International Workshop on Network Science for Communication Networks Erik C. Rye , Justin P. Rohrer + , Robert Beverly + US Naval Academy Annapolis, MD + Naval Postgraduate School

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Revisiting the Estim ation of the Revisiting the Estim ation of the Marginal Cost of Highw ay

Historical Spaces Historical Spaces Revisiting revolution memory and Revisiting revolution

Revisiting Magnetic Field Limits in Revisiting Magnetic Field Limits in Quadrupoles Arising From

GRAPH MINING AND GRAPH KERNELS Part II: Graph Kernels Karsten Borgwardt^ and Xifeng Yan*

Graph Indexing: Tree + Delta Delta &gt;= Graph &gt;= Graph Graph Indexing: Tree + Peixian Zhao,

Graph Mining Marco Serafini COMPSCI 532 Lecture 11 Classes of Graph Systems Graph

PowerWizard Level 1.0 &amp; Level 2.0 Control Systems Training Systems Comparison Level 2

Introduction to Harm Reduction Definition of Harm Reduction Harm reduction refers to policies,

B ohm Reduction in Infinitary Term Graph Rewriting Systems Patrick Bahr IT University of

Completed Rehab of Level 1 and Level 3 Completed Bypass Adit and Entry into Level 1

Split clique graph complexity L. Alcn and M. Gutierrez La Plata, Argentina L. Faria and C. M.

XL1C: Graph Times-Series Using Ratio Display 3/9/2017 V0D XL1C: V0D XL1C: V0D Graph by Time

XL1A: Graph Nominal Frequency Data Using Excel2013 3/10/2017 V0E XL1A: V0E XL1A: V0E Graph

Graph Sparsifiers Smaller graph that (approximately) preserves the values of some set of

Graph Data Processing M. Tamer Ozsu 1 / 75 Outline Introduction RDF Graph Querying

27 August 2014 Company Announcements Office Australian Securities Exchange Limited 20 Bridge

Harald Wilhelm Chief Financial Officer Safe Harbour Statement 2 DISCLAIMER This presentation

Equity Raising Presentation 21 November 2019 NOT FOR DISTRIBUTION OR RELEASE IN THE UNITED STATES

Ossen Innovation Co., Ltd. (NASDAQ: OSN) Investors Presentation October 2016 Safe Harbor

Summary of the Material Sector Business Briefing, held on September 8, 2016 Asahi Kasei Corp.

Pollutant Transfer Coefficient in Street Canyons of Different Aspect Ratios Tracy N.H. Chung

with Constrained Layer Damping Systems RAM 6 Workshop October 15 &amp; 16, 2013 Paul Riehle

Turbulence, Heat and Turbulence, Heat and (Editors) Mass Transfer Mass Transfer 6 6 Edited

Graph Indexing: Tree + Delta Delta >= Graph >= Graph Graph Indexing: Tree + Peixian Zhao,

PowerWizard Level 1.0 & Level 2.0 Control Systems Training Systems Comparison Level 2

with Constrained Layer Damping Systems RAM 6 Workshop October 15 & 16, 2013 Paul Riehle