Interactive Collector Engine Luca Sani luca.sani@iit.cnr.it - PowerPoint PPT Presentation

Interactive Collector Engine Luca Sani luca.sani@iit.cnr.it ITNOG-3, Bologna, 12th November 2017

BGP route collectors Route collectors (RCs) are devices which collects BGP routing data from co-operating ASes A route collector Maintains a routing table (RIB) with the best routes received Dumps the content of the RIB and received UPDATEs periodically 2/21

BGP route collector projects University of Oregon Route Views Project Route Views was originally conceived as a tool for Internet operators to obtain real-time information about the global routing system from the perspectives of several different backbones and locations around the Internet. It collects BGP packets since 1997, in MRT format since 1997 http://www.routeviews.org RIPE NCC Routing Information Service (RIS) The RIPE NCC collects and stores Internet routing data from several locations around the globe, using RIS. It collects BGP packets in MRT format since 1999 https://www.ripe.net/analyse/internet-measurements/routing-information-service-ris Packet Clearing House (PCH) PCH is the international organization responsible for providing operational support and security to critical Internet infrastructure, including Internet exchange points and the core of the domain name system. It operates route collectors at more than 100 IXPs around the world and its data is made available in MRT format since 2011 https://www.pch.net/resources/Raw Routing Data Isolario Isolario is a route collecting project which provides inter-domain real-time monitoring services to its participants. It collects BGP packets in MRT format since 2013 https://www.isolario.it 3/21

Real-time requirements Off-line analysis of data . . . Route collectors were originally conceived as a tool for network administrators to obtain information about the Internet inter-domain routing status . . . vs real-time monitoring Depending on the application, some information must be obtained on-the-fly, i.e. without the delay caused by storing the data Example Check the routes a feeder is using to reach a given portion of IP space Check MRT data? RIB snapshots are available every 2-8 hours Update messages are available every 5-15 minutes 4/21

Real-time requirements Is it possible to perform real-time queries on the RC RIB? Current situation Typically RCs run general-purpose routing software, e.g. Quagga Cons The collection process is affected by the queries because most RC software is single-threaded Overhead in terms of CPU and memory usage due to BGP specs (e.g. BGP decision process) 5/21

Example with Quagga 1.2x10 6 F2 1x10 6 End-of-RIB # of MRT records delay F1 800000 End-of-RIB 600000 F2 starts 400000 F1 200000 starts 0 0 20 40 60 80 100 120 time [s] t = 0 Feeder F 1 starts a RIB transfer (ends t = 35, ∼ 580k prefixes) t = 45 Feeder F 2 starts another RIB transfer t = 60 ten F 1 full table read operations are issued sequentially Data collection is delayed of about 20 seconds After the read requests, packets arrives with higher rate 6/21

ICE: an Interactive Collector Engine Each BGP session is handled by a dedicated set of threads Each service request is handled by a dedicated thread Readers and writers sync on the RIB according to the classic readers-writers paradigm 7/21

RIB implementation: Patricia Trie Each node represents a subnet Each subnet has associated a set of path attributes, one for each feeder that announced the subnet Readers and Writers Writer: feeder thread Reader: request thread Writers/Readers can W lock / R lock both the whole trie and single nodes 8/21

RIB implementation: Patricia Trie Writer 1 Checks if the node is present ( R lock RIB) 2 If not, inserts new node ( W lock RIB) 3 Inserts/updates the path attribute ( W lock node) Reader 1 Checks if the node is present ( R lock RIB) 2 If yes, reads the node ( R lock node) 9/21

Test: delay in storing BGP packets 1.4*10 6 1.2*10 6 # of MRT records Multiple readers 1.0*10 6 starts F1 ends 8.0*10 5 RIB transfer 6.0*10 5 4.0*10 5 F2 starts RIB transfer No readers 1 reader 2.0*10 5 2 readers 4 readers 8 readers 0.0*10 0 0 10 20 30 40 50 60 70 time [s] t = 0 Feeder F 1 starts a RIB transfer t ∼ 40 Feeder F 2 starts a RIB transfer t ∼ 41 Multiple F 1 full table read operations are issued simultaneously Thanks to the scheduler activity (and the multi-threaded design) ICE is able to write incoming packets while readers are reading 10/21

Test: delay in storing BGP packets Number of cores = 1 Number of cores = 8 1.4*10 6 1.4*10 6 1.2*10 6 1.2*10 6 # of MRT records # of MRT records Multiple readers 1.0*10 6 1.0*10 6 Multiple readers starts starts F1 ends F1 ends 8.0*10 5 8.0*10 5 RIB transfer RIB transfer 6.0*10 5 6.0*10 5 4.0*10 5 4.0*10 5 F2 starts F2 starts RIB transfer No readers No readers RIB transfer 1 reader 1 reader 2.0*10 5 2.0*10 5 2 readers 2 readers 4 readers 4 readers 8 readers 8 readers 0.0*10 0 0.0*10 0 0 10 20 30 40 50 60 70 0 10 20 30 40 50 60 70 time [s] time [s] t = 0 Feeder F 1 starts a RIB transfer t ∼ 40 Feeder F 2 starts a RIB transfer t ∼ 41 Multiple F 1 full table read operations are issued simultaneously Thanks to the scheduler activity (and the multi-threaded design) ICE is able to write incoming packets while readers are reading 10/21

Test: delay in reading the RIB Before F 2 RIB transfer During F 2 RIB transfer # of readers 1 core 8 cores 1 core 8 cores µ σ µ σ µ σ µ σ 1 2.12 0.07 2.09 0.07 2.21 0.07 2.18 0.08 2 2.15 0.10 2.14 0.10 2.19 0.09 2.16 0.08 4 2.32 0.15 2.20 0.11 2.28 0.13 2.17 0.11 8 3.66 0.06 2.15 0.14 3.73 0.09 2.13 0.15 Similar to the previous test, but from the reader side How much time a reader takes to retrieve the full routing table of F 1 before and during F 2 table transfer? µ and σ are respectively the average time and the standard deviation The time the second set of readers takes to retrieve F 1 table is very close to the time taken by the first set, confirming that multiple readers can proceeed in parallel with F 2 11/21

What about memory? 6GB 5GB Memory usage 4GB 3GB 2GB 1GB 0 B 1 2 4 8 16 32 64 # of IPv4 full routing tables Memory consumption ICE uses ∼ 82.4MB per feeder, ∼ 100 feeders on a standard machine This means that in scenarios where the feeders are near to a thousand (e.g. route collecting, route servers) at least ten machines are needed How to reduce memory consumption? 12/21

Repetitiveness of BGP data The memory usage is mainly caused by the PATH ATTRIBUTE s Is that possible to compress them? Are they repetitive? We analysed BGP data collected by Route Views, RIS and Isolario route collectors: March 2nd, 2016 February 2nd, 2016 March 2nd, 2015 March 2nd, 2006 (only Route Views and RIS) We consider only full feeders data We computed an uniqueness index u − index for each attribute Given a full feeder RIB and an attribute A , the u − index is the ratio between the different values assumed by A and the number of occurrences of A 13/21

Repetitiveness of BGP data: results IPv4 IPv6 1 1 AS path AS path Next hop Next hop MED MED 0.8 0.8 Aggregator Aggregator Communities Communities Avg. u-index Avg. u-index 0.6 0.6 0.4 0.4 0.2 0.2 0 0 Mar,2006 Mar,2015 Feb,2016 Mar,2016 Mar,2006 Mar,2015 Feb,2016 Mar,2016 Attributes are more repetitive in IPv4 scenarios than IPv6 Only about one every five ASes appears in IPv6 routes an AS announces on average two subnets vs ten in the IPv4 case NEXT HOP is the most repetitive (of course) 14/21

Compression algorithm Data can be compressed! Requirements for the compression algorithm Lossless Adaptive Random access at record level Compression algorithms categories Entropy encoding Dictionary encoding Recurring patterns are substituted with fixed-size indexes 15/21

Lampel-Ziv-Welch 31 41 59 One dictionary per feeder The dictionary is kept in memory The decompressor does not need to rebuild the dictionary (small compression variant applied here) 16/21

Results 6GB without LZW* with LZW* 5GB Memory usage 4GB 3GB 2GB 1GB 0 B 1 2 4 8 16 32 64 # of IPv4 full routing tables Compression saves up to 30% of memory, i.e. 57.5MB per feeder Route processing time increases from 12 µ s-20 µ s to 30 µ s-64 µ s Amount of time it takes a complete feeder table transfer 17/21

Isolario: an ICE use case https://www.isolario.it Isolario is a route collector project which offers real-time monitoring services to its participants Services obtain the real-time inter-domain status of a feeder by querying ICE Evolution is obtained with subsequent UPDATE messages 18/21

Interactive Collector Engine Luca Sani luca.sani@iit.cnr.it - PowerPoint PPT Presentation

Interactive Collector Engine Luca Sani luca.sani@iit.cnr.it ITNOG-3, Bologna, 12th November 2017 BGP route collectors Route collectors (RCs) are devices which collects BGP routing data from co-operating ASes A route collector Maintains a

A Semi Preemptive Garbage A Semi Preemptive Garbage Collector for Solid State Collector

Bipolar Junction Transistors Emitter p n p Collector Emitter n p n Collector Base Base

San Mateo County San Mateo County Treasurer- -Tax Collector Tax Collector Treasurer Lee

RDA Dissolution Update Oscar Valdez Auditor-Controller/Treasurer/Tax Collector September 13,

TOY FAIR 2010 CELEBRATION 2010 COLLECTOR EVENT COLLECTOR EVENT PRESENTATION PRESENTATION o

ESTABLISHING GOALS & PERFORMANCE MEASURES Jordan Kaufman, Kern County Treasurer-Tax Collector

CS 574: Randomized Algorithms Lecture 5. Coupon Collector Problems September 8, 2015 Lecture 5.

On the Biased Partial Word Collector Problem Philippe Duchon and Cyril Nicaud LIGM Universit e

Search Engine Optimization What is Search Engine Optimization Search Engine Optimization is the

Interactive Proofs Lecture 18 AM 1 Interactive Proofs 2 Interactive Proofs IP[k] 2

Whats New in Engine Research Whats New in Engine Research Mark Musculus Engine Combustion

1 Mapping Relational Data Model Patterns To The App Engine Datastore Max Ross November 19,

Google App Engine Guido van Rossum Stanford EE380 Colloquium, Nov 5, 2008 Google App Engine

IGBTs IXYS Corporation WWW.IXYS.COM Table Of Contents I. IXYS IGBT Technologies II. Product

H-3. East-West Minor Collector Harris County 58 Location 59 Request Add Calvary Hill Drive

Solar Controller Training Presentation AAV solar heated 1 hot water SOLAR COLLECTOR 3

High Resolution Model Analyses of Climate Change and Air Quality Donald J. Wuebbles, Zach Zobel,

PINGU, MicroBooNE & LAr1-ND Justin Evans University of Manchester 11th May 2012

an update Introduction Progress on the experiment Schedule DESY PRC Zeuthen Richard

Clean Energy: Thermoelectrics and Photovoltaics Akram Boukai Ph.D. Solar Energy Use

The Noise Collector for sparse recovery in high dimensions Alexei Novikov Department of

SACM Terminology https://sacmwg.github.io/draft-ietf-sacm-terminology/

Scalable performance analysis of large-scale parallel applications Brian Wylie & Markus

past, present, and future Dr Jose Bilbao, Associate Lecturer SPREE Systems & Policy Group

Interactive Collector Engine Luca Sani luca.sani@iit.cnr.it - PowerPoint PPT Presentation

Interactive Collector Engine Luca Sani luca.sani@iit.cnr.it ITNOG-3, Bologna, 12th November 2017 BGP route collectors Route collectors (RCs) are devices which collects BGP routing data from co-operating ASes A route collector Maintains a

A Semi Preemptive Garbage A Semi Preemptive Garbage Collector for Solid State Collector

Bipolar Junction Transistors Emitter p n p Collector Emitter n p n Collector Base Base

San Mateo County San Mateo County Treasurer- -Tax Collector Tax Collector Treasurer Lee

RDA Dissolution Update Oscar Valdez Auditor-Controller/Treasurer/Tax Collector September 13,

TOY FAIR 2010 CELEBRATION 2010 COLLECTOR EVENT COLLECTOR EVENT PRESENTATION PRESENTATION o

ESTABLISHING GOALS &amp; PERFORMANCE MEASURES Jordan Kaufman, Kern County Treasurer-Tax Collector

CS 574: Randomized Algorithms Lecture 5. Coupon Collector Problems September 8, 2015 Lecture 5.

On the Biased Partial Word Collector Problem Philippe Duchon and Cyril Nicaud LIGM Universit e

Search Engine Optimization What is Search Engine Optimization Search Engine Optimization is the

Interactive Proofs Lecture 18 AM 1 Interactive Proofs 2 Interactive Proofs IP[k] 2

Whats New in Engine Research Whats New in Engine Research Mark Musculus Engine Combustion

1 Mapping Relational Data Model Patterns To The App Engine Datastore Max Ross November 19,

Google App Engine Guido van Rossum Stanford EE380 Colloquium, Nov 5, 2008 Google App Engine

IGBTs IXYS Corporation WWW.IXYS.COM Table Of Contents I. IXYS IGBT Technologies II. Product

H-3. East-West Minor Collector Harris County 58 Location 59 Request Add Calvary Hill Drive

Solar Controller Training Presentation AAV solar heated 1 hot water SOLAR COLLECTOR 3

High Resolution Model Analyses of Climate Change and Air Quality Donald J. Wuebbles, Zach Zobel,

PINGU, MicroBooNE &amp; LAr1-ND Justin Evans University of Manchester 11th May 2012

an update Introduction Progress on the experiment Schedule DESY PRC Zeuthen Richard

Clean Energy: Thermoelectrics and Photovoltaics Akram Boukai Ph.D. Solar Energy Use

The Noise Collector for sparse recovery in high dimensions Alexei Novikov Department of

SACM Terminology https://sacmwg.github.io/draft-ietf-sacm-terminology/

Scalable performance analysis of large-scale parallel applications Brian Wylie &amp; Markus

past, present, and future Dr Jose Bilbao, Associate Lecturer SPREE Systems &amp; Policy Group

ESTABLISHING GOALS & PERFORMANCE MEASURES Jordan Kaufman, Kern County Treasurer-Tax Collector

PINGU, MicroBooNE & LAr1-ND Justin Evans University of Manchester 11th May 2012

Scalable performance analysis of large-scale parallel applications Brian Wylie & Markus

past, present, and future Dr Jose Bilbao, Associate Lecturer SPREE Systems & Policy Group