Degree correlations and topology generators Dmitri Krioukov - PowerPoint PPT Presentation

Degree correlations and topology generators Dmitri Krioukov dima@caida.org Priya Mahadevan and Bradley Huffaker 5 th CAIDA-WIDE Workshop

Outline 0K 1K 2K 3K . . . DK

What’s the problem? Veracious topology generators. Why? � New routing and other protocol design, development, and testing � Scalability � For example: new routing might offer X -time smaller routing tables for today but scale Y -time worse, with Y >> X � Network robustness, resilience under attack � Traffic engineering, capacity planning, network management � In general: “what if”

Veracious topology generators Reproducing closely as many topology characteristics as possible. Why “many”? � Better stay on the safe side: you reproduced characteristic X OK, but what if characteristic Y turns out to be also important later on and you fail to capture it? � Standard storyline in topology papers: all those before us could reproduce X , but we found they couldn’t reproduce Y . Look, we can do Y ! Emphasis on practically important characteristics

Important topology characteristics Distance (shortest path length) distribution � Performance parameters of most modern routing algorithms depend solely on distance distribution � Prevalence of short distances makes routing hard (one of the fundamental causes of BGP scalability concerns (86% of AS pairs are at distance 3 or 4 AS hops)) Betweenness distribution Spectrum

How to reproduce? Brute force doesn’t work � There is no way to produce graphs with a given form of any of important characteristics � Even more so for combinations of those More intelligent approach � What are the inter-dependencies between characteristics? � Can we, by reproducing most basic, simple, but not necessarily practically relevant characteristics, also reproduce (capture) all other characteristics, including practically important? � Is there the one(s) defining all other? We answer positively to these questions

Maximum entropy constructions Reproduce characteristic X ( 0K, 1K, etc.) but make sure that the graph is maximally random in all other respects Direct analogy with physics (maximum entropy principle)

Most basic characteristics: Connectivity Tag Name Correlations of Notation degrees of nodes at distance: 0K Average node degree None <k> 1K Node degree distribution 0 P(k) 2K Joint node degree distribution 1 P(k 1 ,k 2 ) or edge degree distribution 3K Joint edge degree distribution 2 P(k 1 ,k 2 ,k 3 ) … … … … DK Full degree distribution D = maximum P(k 1 ,k 2 ,…,k D ) distance (diameter)

0K Tells you � Average node degree (connectivity) in the graph <k> = 2m / n Maximum entropy construction ( 0K -random) � Connect every pair of nodes with probability p = <k> / n � Classical Erdös-Rényi random graphs � P(k) ~ e -<k> <k> k / k!

1K Tells you � Probability that a randomly selected node is of degree k P(k) = n(k) / n � Connectivity in 0 -hop neighborhood of a node Defines � <k> = S k k P(k)

1K Maximum entropy construction ( 1K -random) � 1. Assign n numbers q ’s (expected degrees) distributed according to P(k) to all the nodes; 2. Connect pairs of nodes of expected degrees q 1 and q 2 with probability p(q 1 ,q 2 ) = q 1 q 2 / (n<q>) � More care to reproduce P(k) exactly � Power-law random graph (PLRG) generator � Inet generator

2K Tells you � Probability that a randomly selected edge connects nodes of degrees k 1 and k 2 P(k 1 ,k 2 ) = m(k 1 ,k 2 ) / m � Probability that a randomly selected node of degree k 1 is connected to a node of degree k 2 P(k 2 |k 1 ) = <k> P(k 1 ,k 2 ) / (k 1 P(k 1 )) � Connectivity in 1 -hop neighborhood of a node

2K Defines � <k> = [ S k1,k2 P(k 1 ,k 2 )/k 1 ] -1 � P(k) = <k> S k2 P(k,k 2 ) / k 2

2K Maximum entropy construction ( 2K -random) � 1. Assign n numbers q ’s (expected degrees) distributed according to P(k) to all the nodes; 2. Connect pairs of nodes of expected degrees q 1 and q 2 with probability p(q 1 ,q 2 ) = (<q> / n) P(q 1 ,q 2 ) / (P(q 1 )P(q 2 )) � Much more care to reproduce P(k 1 ,k 2 ) exactly � Have not been studied in the networking community

3K Tells you � Probability that a randomly selected pair of edges connect nodes of degrees k 1 , k 2 , and k 3 � Probability that a randomly selected triplet of nodes are of degrees k 1 , k 2 , and k 3 � Connectivity in 2 -hop neighborhood of a node Defines � <k> � P(k) � P(k 1 ,k 2 ) Maximum entropy construction ( 3K -random) � Unknown

0K, 1K, 2K, 3K, … What’s going on here? As d increases in dK , we get: � More information about local structure of the topology � More accurate description of node neighborhood � Description of wider neighborhoods Analogy with Taylor series � Connection between spectral theory of graphs and Riemannian manifolds Conjecture: DK -random versions of a graph are all isomorphic to the original graph � DK contains full information about the graph

DK ? Do we need to go all the way through to DK , or can we stop before at d << D ? Known fact #1 � 0K works bad Known fact #2 � 1K works much better, but far from perfect in many respects Let’s try 2K !

What we did Understood and formalized all this stuff Devised an algorithm to produce 2K - random graphs with exactly the same 2K distribution Checked its accuracy on Internet AS-level topologies extracted from different data sources (skitter, BGP, WHOIS)

What worked All characteristics that we care about exhibited perfect match

Example: distance in BGP 0.7 Random 2-k BGP tables 0.6 Inet 0.5 0.4 PDF 0.3 0.2 0.1 0 0 2 4 6 8 10 12 Distance (in hops)

Example: distance in skitter 0.7 0.6 0.5 0.4 PDF 0.3 0.2 0.1 Generated Skitter 0 1 2 3 4 5 6 7 Distance (in hops)

What did not work Clustering � Expected to be captured by 3K Router-level � Expected to be captured by dK , where d is a characteristic distance between high-degree nodes

Main contribution 0K 1K 2K 3K . . . DK

Future work Clustering in 3K -random graphs Given a class of graphs, find d such that dK - random graphs capture all you need Generalize maximum entropy construction algorithm for dK -random graphs with any d

More information “Comparative Analysis of the Internet AS- Level Topologies Extracted from Different Data Sources” http://www.caida.org/~dima/pub/as-topo-comparisons.pdf 2-3 more papers upcoming

Degree correlations and topology generators Dmitri Krioukov - PowerPoint PPT Presentation

Degree correlations and topology generators Dmitri Krioukov dima@caida.org Priya Mahadevan and Bradley Huffaker 5 th CAIDA-WIDE Workshop Outline 0K 1K 2K 3K . . . DK Whats the problem? Veracious topology generators. Why? New

Degree-degree correlations in directed networks with heavy-tailed degrees Pim van der Hoorn

Micro Power Generators Sung Park Kelvin Yuk ECS 203 Overview Why Micro Power Generators are

Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P .

How do financial correlations grow? How do financial correlations grow? C. Borghesi Borghesi

Statistically-Significant Correlations 11 Oct, 2014 0F 2014 NNN4 Statistically-Significant

Measurements of BB Angular Correlations Measurements of BB Angular Correlations Measurements of

Albert-Lszl Barabsi With Emma K. Towlson and Sean P. Cornelius www.BarabasiLab.com

National Generators Forum NGF Sponsored Rule changes Information provisions Alex Cruickshank

Documentation Generators Steven J Zeil March 3, 2013 Documentation Generators Outline

Topological data analysis and topology-based visualization Leila De Floriani Topology-based

Control AIRS clear-sky radiances AIRS cloudy retrievals Anomaly Correlations computed from 90S

Targeted Pseudorandom Generators, Simulation Advice Generators, and Derandomizing Logspace William

Generators of quantum Markov Semigroups Matt Ziemke University of South Carolina Virginia

AEMC Presentation AEMC Presentation Southern Generators proposal Southern Generators

ECEN 5022 Cryptography Pseudo Random Number Generators Peter Mathys University of Colorado

When the catenary degree meets the tame degree in embedding dimension three numerical semigroups

Undecidability in group theory, topology, and F.p. groups Word problem Markov properties

Stabilization of Branching Queueing Networks Tom Brzdil 1 Stefan Kiefer 2 1 Masaryk

First order optimization. Last time. Other scenarios. min f ( x ) Gradient Descent: Dont you

Semimartingale methods for Markov chains, interacting particle systems and random growth models

Virtual Topologies Virtual Topologies Convenient process naming. Naming scheme to fit the

V-topologies, t-Henselian Fields and Definable Valuations Katharina Dupont Department of

Topological Autoencoders 13. November, 2019 Machine Learning and Computational Biology Group, ETH

Space-time structure may be topological and not geometrical Gabriele Carcassi and Christine