Directed Probing for Efficient and Accurate Active Measurements - PowerPoint PPT Presentation

Directed Probing for Efficient and Accurate Active Measurements Arthur Berger 1 Robert Beverly Naval Postgraduate School 1 MIT CSAIL rbeverly@nps.edu, awberger@csail.mit.edu February 8, 2010 AIMS-2 - Workshop on Active Internet Measurements R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 1 / 43

The Problem Outline The Problem 1 Deconstructing Probing Cycle 2 Methodology 3 Directed Probing 4 5 Open Questions R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 2 / 43

The Problem Motivation Internet Topology Measurement The Internet is: Large, and complex 1 Poorly instrumented 2 ⇒ Poorly understood topology Internet Topology – why do we care? Critical infrastructure protection Network modeling, routing research, protocol validation, etc. Future Internet architectures, Internet evolution, etc. R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 3 / 43

The Problem Motivation State of the Art Infer structure... Measure from available vantage points... Monitor Monitor Internet Monitor Monitor R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 4 / 43

The Problem Motivation Problem Internet Topology Measurement What we have: Handful of monitoring points from which to run path probes Requires significant time and resources to probe all IPv4 destinations Attempt to balance load vs. measurement cycle time What we want: Many vantage points High frequency scanning But, with low-load Coordination between vantage points? R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 5 / 43

The Problem Motivation Problem Hypothesis: By leveraging network priors (knowledge of routing, structure, etc.) and adaptive sampling (progressively learned knowledge), we can: Significantly lower probing load Without sacrificing measurement fidelity (and perhaps increase fidelity) R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 6 / 43

The Problem Motivation Intuition Scaling: ∼ 2 32 − 1 possible destinations (2.9B from Jan 2010 routeviews) But, because of hierarchy and aggregation and classful history, practitioners often aggregate measurements into /24’s 2 24 − 1 destinations much more manageable – but, right granularity? Example: Necessary to probe all 2 16 /24’s in 18.0.0.0/8 to ascertain path characteristics or latency? This work investigates how we can use network priors to “intelligently” drive probing for more efficient and accurate topology measurements R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 7 / 43

The Problem Motivation Network Priors (xkcd insight...) Lots of information/structure at our disposal: Registry information (e.g. whois ) Geolocation databases (e.g. EdgeScape) BGP routing information Key insight – Adaptive Sampling: Learn as probing progresses R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 8 / 43

The Problem Archipelago Archipelago Investigate hypothesis using CAIDA’s Ark as case study: Distributed “team probing,” ∼ 41 monitors All routed addresses divided into /24’s; partitioned across monitors From each /24, a single address is selected at random to probe Probe == traceroute ++ ; record router interfaces on forward path Uses scamper (cf. Luckie) for constant load A “cycle” == traceroutes to all routed /24’s R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 9 / 43

The Problem Assumptions WIP Caveats Work in Progress – At this stage: Deconstruct probing process of Ark as case study Use BGP information from routeviews as decision prior Looking at router-level topology, not organization or AS Not yet incorporating any alias resolution Not making claims about topological correctness; investigate ability to reproduce baseline more efficiently R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 10 / 43

Deconstructing Probing Cycle Outline The Problem 1 Deconstructing Probing Cycle 2 Methodology 3 Directed Probing 4 5 Open Questions R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 11 / 43

Deconstructing Probing Cycle Descriptive Statistics Data Set First, let’s deconstruct Ark cycle: Before developing our new technique (next), understand data Start with a single vantage point, AMW-US Data from this node for a cycle on January 11, 2010 Represents: 263K traceroutes 55K distinct BGP prefixes ∼ 4.4M probe packets Q: What do we learn? R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 12 / 43

Deconstructing Probing Cycle Descriptive Statistics Edit Distance Meta-Question: What’s the information gain of successive traceroutes? Q1: How similar are traceroutes to the same destination BGP prefix? Use Levenshtein “edit” distance DP algorithm Determine the minimum number of edits (insert, delete, substitute) to transform one string into another e.g. “ robert ” → “ robber ” = 2 We use: Σ = { 0 , 1 , . . . , 2 32 − 1 } Each unsigned 32-bit IP address along traceroute paths ∈ Σ ED=2 129.186.6.251 129.186.254.131 192.245.179.52 4.53.34.13 129.186.6.251 192.245.179.52 4.69.145.12 R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 13 / 43

Deconstructing Probing Cycle Descriptive Statistics Edit Distance Q1: How similar are Information Gain of Multiply-targeted BGP Prefixes (262,956 Probes) traceroutes to the 1 same destination BGP 0.9 prefix? 0.8 Cumulative Fraction of Path Pairs ∼ 60% of traces to 0.7 destinations in 0.6 0.5 same BGP prefix 0.4 have ED ≤ 3 0.3 Fewer than 50% of 0.2 random traces 0.1 Intra-BGP Prefix have ED ≤ 10 Random Prefix Pair 0 0 5 10 15 20 25 Levenshtein Edit Distance R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 14 / 43

Deconstructing Probing Cycle Descriptive Statistics Edit Distance Q1: How similar are Information Gain of Multiply-targeted BGP Prefixes (262,956 Probes) traceroutes to the 1 same destination BGP 0.9 prefix? 0.8 Cumulative Fraction of Path Pairs ∼ 60% of traces to 0.7 destinations in 0.6 0.5 same BGP prefix 0.4 have ED ≤ 3 0.3 Fewer than 50% of 0.2 random traces 0.1 Intra-BGP Prefix have ED ≤ 10 Random Prefix Pair 0 0 5 10 15 20 25 Levenshtein Edit Distance Confirms our intuition R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 14 / 43

Deconstructing Probing Cycle Descriptive Statistics Edit Distance Q2: How much path variance is due to the last-hop AS? Intuitively, number of potential paths exponential in the depth More information gain at the end of the traceroute? Rtr Rtr Rtr Internet Rtr Rtr Monitor Rtr Rtr Rtr R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 15 / 43

Deconstructing Probing Cycle Descriptive Statistics Edit Distance Q2: How much path variance is due to the last-hop AS? Information Gain of Multiply-targeted BGP Prefixes (262,956 Probes) 1 Lob off last AS 0.9 Answer: lots! 0.8 Cumulative Fraction of Path Pairs For ∼ 70% of 0.7 probes to same 0.6 prefix, we get no 0.5 additional 0.4 information 0.3 beyond leaf AS 0.2 Intra-BGP Prefix Lob off Dest AS Intra-BGP Prefix Lob off Dest AS Rand Prefix 0.1 0 5 10 15 20 25 Levenshtein Edit Distance R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 16 / 43

Deconstructing Probing Cycle Descriptive Statistics Edit Distance Q2: How much path variance is due to the last-hop AS? Information Gain of Multiply-targeted BGP Prefixes (262,956 Probes) 1 Lob off last AS 0.9 Answer: lots! 0.8 Cumulative Fraction of Path Pairs For ∼ 70% of 0.7 probes to same 0.6 prefix, we get no 0.5 additional 0.4 information 0.3 beyond leaf AS 0.2 Intra-BGP Prefix Lob off Dest AS Intra-BGP Prefix Lob off Dest AS Rand Prefix 0.1 0 5 10 15 20 25 Conclusion 1: Levenshtein Edit Distance Significant packet savings possible R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 16 / 43

Deconstructing Probing Cycle Descriptive Statistics Multiple Vantage Points Q3: How much information gain do multiple vantage points yield? Intuitively, expect traceroute “tail” to be similar Majority of information gain in first half of trace? Monitor Monitor Rtr Monitor Rtr Rtr Internet Monitor Rtr Rtr Rtr Rtr Rtr Monitor Monitor Monitor R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 17 / 43

Deconstructing Probing Cycle Descriptive Statistics Multiple Vantage Points Q3: How much information gain do multiple vantage points yield? Information gain is at both tails Monitor D1 Monitor D2 Monitor D3 AS Ingress Monitor R. Beverly, A. Berger (NPS) Directed Active Probing AIMS 2010 18 / 43

Directed Probing for Efficient and Accurate Active Measurements - PowerPoint PPT Presentation

Directed Probing for Efficient and Accurate Active Measurements Arthur Berger 1 Robert Beverly Naval Postgraduate School 1 MIT CSAIL rbeverly@nps.edu, awberger@csail.mit.edu February 8, 2010 AIMS-2 - Workshop on Active Internet Measurements

Linear probing with constant independence Anna Pagh, Rasmus Pagh, and Milan Ru i IT

Finding Strongly Connected Components Directed Acyclic Graphs Directed Acyclic Graphs Directed

Random Probing Security Verification, Composition, Expansion and New Constructions Sonia Belad 1

The Active Card An Active Mind in an Active Body More people, More Active, More often! The

Active Adversary Lecture 7 CCA Security MAC Active Adversary Active Adversary An active

Timing and Bandwidth Issues in Active Measurement or Physical Constraints on Active Probing

TAKING DATA ON FORM TAKING DATA ON FORM- -WOUND WOUND MOTORS MOTORS By : Manuel Manny

Incidence Relations and Directed Cycles Hao Wu George Washington University Directed graphs and

3.5 Connectivity in Directed Graphs Directed Graphs Directed graph. G = (V, E) Edge (u, v)

6.1 Directed Acyclic Graphs Directed acyclic graphs , or DAGs are acyclic directed graphs where

5.1 Directed Acyclic Graphs Directed acyclic graphs , or DAGs are acyclic directed graphs where

CS 401 Greedy Algorithms Xiaorui Sun 1 Directed Acyclic Graphs (DAG) Def: A DAG is a directed

Probing Neutrino Masses and Mixings with Probing Neutrino Masses and Mixings with Accelerator and

Occultations for Probing for Probing Occultations Atmosphere and Climate: Atmosphere and

Probing Protein Mechanics with Probing Protein Mechanics with Molecular Dynamics Simulations and

Probing Nucleon Spin Structure Using Probing Nucleon Spin Structure Using Deep Inelastic

Why F Funct unctional nal Progr gramming M ng Matter ers John Hughes Mary Sheeran

Sail, RISC-V, and CHERI-RISC-V Prashanth Mundkur and Peter G. Neumann, SRI International (most of

Privacy in Ubiquitous Computing Systems Marc Langheinrich ETH Zurich, Switzerland

Reasoning about the C/C++ weak memory model Viktor Vafeiadis Max Planck Institute for Software

Real-Time Systems: Testing Your System in Linux Jiangnan Liu CSE 520S Spring 02/13/2020 With

Beyond the Asymmetric Turing Test Fintan Mallory Rethinking, Reworking and Revolutionising The

1 Illustration of measurements Boston Corpus: Structural hypothesis Right-headed vs. left-headed

The Most Important Thing How Mozilla Does Security and What You Can Steal Johnathan Nightingale