SLIDE 1

Operationalizing Yarrp: High-Speed Active Network Topology Mapping from AWS

https://yarrp.nps.tancad.net/ Justin P. Rohrer (jprohrer@nps.edu) Department of Computer Science US Naval Postgraduate School AIMS-KISMET, February 28, 2020

SLIDE 2

Alternate title: How we’ve collected hourly Internet topology snapshots for the last 6 months*

* Except for the month where AWS shut us down

SLIDE 3

Background

SLIDE 4

Background

  • Yarrp is a thing: https://www.cmand.org/yarrp/
    • Probing rates of ~1M PPS
  • Active Network Topology Mapping:
    • Send probes into the network from vantage points
    • Induce routers to send responses
    • Build a map of how the Internet is connected and how data is forwarded
  • Goal: create/collect Internet topology “snapshots”
    • E.g. probe all IPv4 /24s within 5 minutes
    • Compare snapshots over time
  • Vantage points with the CPU/bandwidth to support Yarrp are hard to find/maintain
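The probing model above can be sketched in a few lines of Python. This is an illustrative sketch only, not Yarrp’s implementation (Yarrp is a C++ tool that achieves statelessness by encoding probe state in the packets themselves); `probe_schedule` is a hypothetical name.

```python
# Illustrative sketch of Yarrp-style probing (not Yarrp's actual code):
# Yarrp randomly permutes the (target, TTL) space so probes toward any
# single path are spread out in time, and it matches responses statelessly
# by encoding the needed state (TTL, timestamp) in the probe itself.
import random

def probe_schedule(targets, max_ttl=16, seed=42):
    """Yield (target, ttl) pairs in a pseudo-random order."""
    space = [(t, ttl) for t in targets for ttl in range(1, max_ttl + 1)]
    random.Random(seed).shuffle(space)  # permutation spreads load across routers
    yield from space

# Example with three documentation addresses probed at TTLs 1..4:
sched = list(probe_schedule(
    ["192.0.2.1", "198.51.100.1", "203.0.113.1"], max_ttl=4))
```

Every (target, TTL) pair is emitted exactly once, so a full pass covers the space while no single router sees a burst of consecutive probes.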

SLIDE 5

Major Yarrp Milestones

Milestones from 2016 through 2020:
  • IMC: Yarrp publication
  • IMC: “IP of the Beholder” publication
  • yarrp-0.2: UDP, ICMP support
  • Yarrp on AWS
  • yarrp-0.5: fill mode, multi-instance
  • Multipath Yarrp
  • CAIDA full-Internet scan
  • yarrp-0.6: new features

SLIDE 6

Deploying Yarrp in the cloud

SLIDE 7

Distributed Yarrp (Freyr)

  • Running Yarrp from multiple locations:
    • Provides greater discovery
    • Allows for higher aggregate rates
  • Needs:
    • Deploy Yarrp at scale
    • Provide manageability and elasticity
    • Provide fault-tolerance and robustness
  • Plan:
    • Use AWS compute/bandwidth resources at geographically distributed vantage points
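Distributing targets across vantage points implies some assignment policy. A minimal sketch of one deterministic way to shard targets across regions follows; the slides do not describe Freyr’s actual assignment logic, and `assign_vp` is a hypothetical helper.

```python
# Hypothetical sketch: deterministically shard probe targets across vantage
# points (VPs) by hashing. Not the real Freyr orchestration logic.
import hashlib

def assign_vp(target, vps):
    """Map a target to one vantage point via a stable hash."""
    h = int.from_bytes(hashlib.sha256(target.encode()).digest()[:8], "big")
    return vps[h % len(vps)]

vps = ["us-west-2", "eu-west-1", "ap-northeast-1"]
shards = {vp: [] for vp in vps}
for t in ["192.0.2.0/24", "198.51.100.0/24", "203.0.113.0/24"]:
    shards[assign_vp(t, vps)].append(t)
```

Hash-based assignment is stable across runs (the same target always lands on the same VP), which makes longitudinal comparisons of per-VP results easier.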

SLIDE 8

Challenges

  • AWS is designed to do the same job many times in one place (AZ)
    • Most services don’t support cross-region operation
  • Undocumented behavior; easily overwhelmed middleboxes
    • E.g. a security policy allowing ICMP from ANY still drops 90% of inbound ICMP
  • All hosts are NATed, even when assigned public IPs
  • PTR record support is extremely limited (SMTP servers only)
  • IPv6 support is not on par with IPv4
  • No sysadmin to design/operate this
    • It needs to keep running with only sporadic attention from me
  • High-bandwidth/CPU instances are expensive
  • Getting data out of AWS is expensive

SLIDE 9

[World map of the AWS vantage points: TYO, SEL, SG, MUM, SYD, SFO, OR, OH, CA, VA, IRL, LDN, PAR, STK, FRA]

Yarrp AWS deployment scope

  • Deployed to vantage points (VPs) in 15 datacenters worldwide
  • Particular measurements may use a subset of the VPs or all of them:
    • Targets may be distributed across VPs
      • Automatic resilience – an unresponsive VP’s targets are reassigned to responsive VPs
    • Targets may be probed in parallel by multiple VPs
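The automatic-resilience behavior above can be sketched as a simple reassignment step. This is illustrative only; `reassign` is a hypothetical name and the real orchestration code is not shown in the slides.

```python
# Illustrative sketch: targets held by unresponsive VPs are redistributed
# round-robin among the VPs that are still responding.
def reassign(assignments, dead_vps):
    """Return a new VP->targets map with dead VPs' targets redistributed."""
    live = [vp for vp in assignments if vp not in dead_vps]
    out = {vp: list(ts) for vp, ts in assignments.items() if vp not in dead_vps}
    orphans = [t for vp in dead_vps for t in assignments.get(vp, [])]
    for i, t in enumerate(orphans):
        out[live[i % len(live)]].append(t)  # round-robin over live VPs
    return out

before = {"sfo": ["t1", "t2"], "fra": ["t3"], "syd": ["t4"]}
after = reassign(before, {"fra"})  # fra went dark; t3 moves to a live VP
```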

SLIDE 10

Yarrp AWS deployment architecture

  • Includes global (orchestration) infrastructure:
    • Processes & distributes targets to regions; collects & processes results
  • Per-region probing resources are replicated to all data centers

SLIDE 11

Operational Status

  • Probing Set 1:
    • A target address in each routed /24 of the IPv4 Internet
    • Once per hour
    • Distributed across 15 AWS regions
  • Probing Set 2:
    • A target address in each routed /16 of the IPv4 Internet
    • Once per hour
    • Probed redundantly by all AWS regions
  • Data is available on request; large downloads use the “requester pays” model
  • Currently running in continuous production; work proceeds to improve the user interface, add IPv6 support, etc.
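Generating one target per routed /24 is easy to sketch with Python’s `ipaddress` module. The host offset chosen inside each /24 is my assumption for illustration; the slides don’t specify which address is probed.

```python
# Sketch: emit one target per /24 inside a routed prefix. Using host .1 in
# each /24 is an arbitrary illustrative choice, not necessarily what the
# production target lists use.
import ipaddress

def one_per_slash24(routed_prefix, offset=1):
    net = ipaddress.ip_network(routed_prefix)
    for sub in net.subnets(new_prefix=24):
        yield str(sub.network_address + offset)

# A routed /22 contains four /24s, so it yields four targets:
targets = list(one_per_slash24("198.51.100.0/22"))
```

Running this over every routed prefix in a BGP table (rather than the full 2^24 /24s) is what keeps the hourly target list limited to the routed IPv4 Internet.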

SLIDE 12

Lessons Learned

SLIDE 13

AWS Policy Interactions

  • Traceroute is not a violation of the AWS Acceptable Use Policy
    • But it can still get your account shut down
  • Abuse reports go only to the root account
  • The security and abuse team will never interact with users directly
    • A user must have an AWS account manager to advocate for them
  • Each region has different limitations
    • E.g. don’t send packets with TTL=10 in region X

SLIDE 14

Topology Observations

  • There are 10-12 (region-dependent) hops between EC2 and the Internet
    • Mostly in 100.64.0.0/10 shared address space (RFC 6598)
  • Comparing snapshots is hard due to the prevalence of load balancing
  • Load-balancing analysis using MDA-Yarrp (shameless plug): https://rbeverly.net/research/papers/dminer-nsdi20.html
    • 65% of paths have load balancing
    • Significant load balancing between ASes
    • Observed diamonds with 100s of nodes and 1000s of edges
    • Flows are rebalanced periodically (on the order of hours)
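The first observation can be checked mechanically by classifying hop addresses against the RFC 6598 shared address block. A small sketch (the hop IPs here are made up for illustration):

```python
# Sketch: flag traceroute hops that fall in 100.64.0.0/10 (RFC 6598 shared
# address space), which dominates the first 10-12 hops out of EC2.
import ipaddress

SHARED = ipaddress.ip_network("100.64.0.0/10")

def is_shared(hop):
    return ipaddress.ip_address(hop) in SHARED

hops = ["100.65.0.1", "100.100.8.2", "52.95.1.1"]  # illustrative hop IPs
shared_hops = [h for h in hops if is_shared(h)]
```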

SLIDE 15

Collaboration Goals

  • Share the data
    • AWS S3 requester-pays model
  • Make Yarrp data queryable
    • Via AWS Athena (roughly a BigQuery equivalent)
  • Support multipath (the primitive type can’t be traceroute)
  • Feedback on the usefulness of hourly snapshots
    • Or: what is the “right” snapshot frequency?
  • Feedback on target set permutation and goals
    • Reuse for longitudinal analysis
    • Permutation for coverage
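“Permutation for coverage” means visiting every target exactly once in pseudo-random order, so an interrupted run still samples the space roughly uniformly. A toy sketch using an affine full-cycle permutation (not the generator Yarrp actually uses):

```python
# Toy sketch of a full-cycle permutation over target indices: an affine map
# i -> (a*i + c) mod n is a bijection whenever gcd(a, n) == 1, so every index
# is visited exactly once, in scrambled order. Not Yarrp's actual generator.
import math

def permuted_indices(n, a=1103515245, c=12345):
    assert math.gcd(a, n) == 1, "a must be coprime with n for a bijection"
    for i in range(n):
        yield (a * i + c) % n

idx = list(permuted_indices(256))  # e.g. the 256 /24s inside one /16
```

Because the map is a bijection, truncating the run after k probes still gives k distinct, scattered targets instead of a contiguous block of address space.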

SLIDE 16

End of slides
