High-resolution Measurement of Data Center Microbursts Qiao Zhang - PowerPoint PPT Presentation

High-resolution Measurement of Data Center Microbursts Qiao Zhang (University of Washington) Vincent Liu (University of Pennsylvania) Hongyi Zeng (Facebook) Arvind Krishnamurthy (University of Washington)

Networks are Fast, Measurements are not... Data center networks are getting faster • 100Gbps, ~100 ns to process a packet, 10-100 μ s RTT But measurement frameworks are not keeping up • SNMP counters (e.g. bytes sent or drops) typically collected every couple minutes • Packet sampling (sFlow or iptables) typically at low sampling rate, e.g. 1/30k

Networks are Fast, Measurements are not... Data center networks are getting faster • 100Gbps, ~100 ns to process a packet, 10-100 μ s RTT But measurement frameworks are not keeping up • SNMP counters (e.g. bytes sent or drops) typically collected every couple minutes • Packet sampling (sFlow or iptables) typically at low sampling rate, e.g. 1/30k Too coarse-grained !

The Case for High Resolution • Packet drop correlates poorly with utilization at 4 minute granularity • 4 minute granularity hides short-term tra ffi c spikes • Need high-resolution to reveal finer-grained behaviors

The Case for High Resolution drop rate generally very low • Packet drop correlates poorly with utilization at 4 minute granularity • 4 minute granularity hides short-term tra ffi c spikes • Need high-resolution to reveal finer-grained behaviors

The Case for High Resolution unusual drop rates at both drop rate low and high generally utilization very low • Packet drop correlates poorly with utilization at 4 minute granularity • 4 minute granularity hides short-term tra ffi c spikes • Need high-resolution to reveal finer-grained behaviors

Roadmap Mechanism • It is possible to do high resolution measurements on today's switches Results • Many if not most tra ffi c bursts are very short-lived

High-resolution Counter Collection Framework We designed a high-resolution counter collection framework • Switch CPUs poll ASIC registers with microsecond level latency • Sample fast (~25 μ s) while keeping sampling loss below 1% We focus on three kinds of counters 1. Byte count : cumulative and used to compute utilization 2. Packet size : a histogram of packet sizes 3. Peak bu ff er occupancy : for single port and shared pool

Deployment • One of the largest data centers at Facebook with a 3-tier Clos network • Only collect from ToRs due to deployment constraints • 10Gbps server links and 4x40Gbps ToR uplinks

Workload and Methodology • Mostly single-role racks • Web: handle user request, lookup with cache • Cache: handle k-v lookups, respond to Web servers • Hadoop: handle batched processing • 30 racks in total: 10 racks for each app, over 24 hours • Sample a random 2-minute interval per hour, for 1TB+

Microburst Measurements Microburst: a period of short-term high utilization (e.g. >50%) • How long do they last and how often do they occur? • How much of congestion is caused by microbursts? • Does network behavior di ff er significantly inside a burst? • Are there synchronized behaviors during bursts?

Distribution of Link Utilization 25 μ s

Distribution of Link Utilization a lot of intervals with almost nothing happening 25 μ s

Distribution of Link Utilization a few intervals have ~100% utilization a lot of intervals with almost nothing happening 25 μ s

Distribution of Link Utilization a few intervals have ~100% utilization a lot of intervals with almost nothing insensitive to happening 50% threshold 25 μ s

Bursts are Short • Burst : an unbroken sequence of hot samples (> 50% util) 25 μ s

Bursts are Short • Burst : an unbroken sequence of hot samples (> 50% util) many bursts last at most 25 μ s 25 μ s

Bursts are Short • Burst : an unbroken sequence of hot samples (> 50% util) 90pct at 200 μ s many bursts last at most 25 μ s 25 μ s

Bursts are Short • Burst : an unbroken sequence of hot samples (> 50% util) 90pct at 200 μ s many bursts Almost all congestion last at most 25 μ s is short-lived 25 μ s

Time between Bursts 25 μ s

Time between Bursts For Web/ Hadoop, 50% < 1 RTT 25 μ s

Time between Bursts Even for cache, median is < 10x RTT For Web/ Hadoop, 50% < 1 RTT 25 μ s

Time between Bursts Even for cache, median is < 10x RTT For Web/ Hadoop, 50% < 1 RTT • Some predictability: a burst is likely to be followed by another relatively soon • Potential for re-balance between bursts 25 μ s

Packet Size Distribution Inside Burst Outside Burst 100 μ s

Packet Size Distribution Bigger packets inside bursts for Inside Burst Web/Cache Outside Burst 100 μ s

Packet Size Distribution Burst are correlated with Bigger packets app-level behaviors (e.g. inside bursts for Inside Burst sending bigger responses Web/Cache or scatter-gather/incast) Outside Burst 100 μ s

Directionality of Bursts 300 μ s

Directionality of Bursts More bursts towards servers due to high fan-in 300 μ s

Directionality of Bursts Cache see more bursts on uplinks as responses are typically bigger than requests More bursts towards servers due to high fan-in 300 μ s

Directionality of Bursts Cache see more bursts on uplinks as responses are typically bigger than requests Bursts are correlated More bursts towards with app behaviors servers due to high fan-in 300 μ s

Efficacy of Network Load Balancing • 4 ToR Uplinks: compute mean absolute deviation (MAD) for each polling interval • MAD = mean( |u - u ̅ | / u ̅ ), so MAD=0 means perfect load balancing 40 μ s

Efficacy of Network Load Balancing • 4 ToR Uplinks: compute mean absolute deviation (MAD) for each polling interval • MAD = mean( |u - u ̅ | / u ̅ ), so MAD=0 means perfect load balancing links well balanced at 1s scale 40 μ s

Efficacy of Network Load Balancing • 4 ToR Uplinks: compute mean absolute deviation (MAD) for each polling interval • MAD = mean( |u - u ̅ | / u ̅ ), so MAD=0 means perfect load balancing links are highly unbalanced at 40 μ s scale links well balanced at 1s scale 40 μ s

Efficacy of Network Load Balancing • 4 ToR Uplinks: compute mean absolute deviation (MAD) for each polling interval • MAD = mean( |u - u ̅ | / u ̅ ), so MAD=0 means perfect load balancing links are highly unbalanced at 40 μ s scale Implications for design of links well balanced network, e.g. for low at 1s scale 40 μ s latency and loss

Conclusions • Deployed a microsecond-scale measurement framework in production • Demonstrated it is possible to do high-resolution measurement on today's switches • Microbursts are real, short, correlated, and related to application behaviors • Future work to correlate with end-host measurements to better understand causes for microbursts

High-resolution Measurement of Data Center Microbursts Qiao Zhang - PowerPoint PPT Presentation

High-resolution Measurement of Data Center Microbursts Qiao Zhang (University of Washington) Vincent Liu (University of Pennsylvania) Hongyi Zeng (Facebook) Arvind Krishnamurthy (University of Washington) Networks are Fast, Measurements are

SIGBI Limited General Meeting 2019 Resolutions 1-6 Resolution 1 Resolution 2 Resolution 3

Patagonia Gold Plc 2009 Patagonia Gold VOTING ORDINARY SPECIAL Resolution 1 Resolution 2

Interactions, and Electron Precipitation (Microbursts) Bruce T. Tsurutani 1 , Gurbax S. Lakhina 2

The Single Resolution Mechanism Elke Knig Chair of the Single Resolution Board FDIC Systemic

CHAPTER 2 MEASUREMENT OF HIGH VOLTAGE AND CURRENTS 2.1 MEASUREMENT OF HIGH DIRECT VOLTAGES

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

New series of high resolution and efficiency VPH gratings MSE High Resolution Survey MSE will

CCP Resolution: proposal for an EU Regulation and FSB Guidance on CCP Resolution 2ND EUROPEAN

Patagonia Gold Plc g 2010 Cap-Oeste updated June 2010 Patagonia Gold AGM VOTING 2010 g

Presentation to Ontario Smart Grid Working Group Who is Measurement Canada? Measurement: A part

Bridging social and physical measurement: measurement is not scale construction; measurement is

5. High Time Resolution Astrophysics (HTRA) PhD Course, University of Padua Page 1 High Energy

Security Council Resolution 1540 Security Council Resolution 1540 Security Council Resolution

How to Make Cross-border Resolution Effective Dr. Axel Kunde, Head of Unit Resolution Planning and

Resolution 830 Public Consultation Process and Revised Draft Resolution ARHA Redevelopment Work

Hyper-Resolution AUTOMATED REASONING Hyper-resolution generalises ``bottom- (electron) up

Getting To Know Your Data Road Map 1. Data Objects and Attribute Types 2. Descriptive Data

Second quarter 2017 earnings call Jeff Woodbury Vice President, Investor Relations and Secretary

Administrators Zach Hawkins July 30 th , 2019 Section 1 (1) It is the goal of the people to

Minnesota State and Minnesota Department of Education PSEO & Concurrent Enrollment COVID 19

Statistics and Probability in Middle and High Schools with Technology Take the survey at

30-MINUTE BALANCING GATE CLOSURE Market Advisory Committee Meeting 1 May 2017 PRESENTED BY MARTIN

Lecture #5: Multiple Linear Regression Data Science 1 CS 109A, STAT 121A, AC 209A, E-109A Pavlos

Brushing Moments in Interactive Visual Analysis Johannes Kehrer 1 , Peter Filzmoser 2 , and Helwig

High-resolution Measurement of Data Center Microbursts Qiao Zhang - PowerPoint PPT Presentation

High-resolution Measurement of Data Center Microbursts Qiao Zhang (University of Washington) Vincent Liu (University of Pennsylvania) Hongyi Zeng (Facebook) Arvind Krishnamurthy (University of Washington) Networks are Fast, Measurements are

SIGBI Limited General Meeting 2019 Resolutions 1-6 Resolution 1 Resolution 2 Resolution 3

Patagonia Gold Plc 2009 Patagonia Gold VOTING ORDINARY SPECIAL Resolution 1 Resolution 2

Interactions, and Electron Precipitation (Microbursts) Bruce T. Tsurutani 1 , Gurbax S. Lakhina 2

The Single Resolution Mechanism Elke Knig Chair of the Single Resolution Board FDIC Systemic

CHAPTER 2 MEASUREMENT OF HIGH VOLTAGE AND CURRENTS 2.1 MEASUREMENT OF HIGH DIRECT VOLTAGES

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

New series of high resolution and efficiency VPH gratings MSE High Resolution Survey MSE will

CCP Resolution: proposal for an EU Regulation and FSB Guidance on CCP Resolution 2ND EUROPEAN

Patagonia Gold Plc g 2010 Cap-Oeste updated June 2010 Patagonia Gold AGM VOTING 2010 g

Presentation to Ontario Smart Grid Working Group Who is Measurement Canada? Measurement: A part

Bridging social and physical measurement: measurement is not scale construction; measurement is

5. High Time Resolution Astrophysics (HTRA) PhD Course, University of Padua Page 1 High Energy

Security Council Resolution 1540 Security Council Resolution 1540 Security Council Resolution

How to Make Cross-border Resolution Effective Dr. Axel Kunde, Head of Unit Resolution Planning and

Resolution 830 Public Consultation Process and Revised Draft Resolution ARHA Redevelopment Work

Hyper-Resolution AUTOMATED REASONING Hyper-resolution generalises ``bottom- (electron) up

Getting To Know Your Data Road Map 1. Data Objects and Attribute Types 2. Descriptive Data

Second quarter 2017 earnings call Jeff Woodbury Vice President, Investor Relations and Secretary

Administrators Zach Hawkins July 30 th , 2019 Section 1 (1) It is the goal of the people to

Minnesota State and Minnesota Department of Education PSEO &amp; Concurrent Enrollment COVID 19

Statistics and Probability in Middle and High Schools with Technology Take the survey at

30-MINUTE BALANCING GATE CLOSURE Market Advisory Committee Meeting 1 May 2017 PRESENTED BY MARTIN

Lecture #5: Multiple Linear Regression Data Science 1 CS 109A, STAT 121A, AC 209A, E-109A Pavlos

Brushing Moments in Interactive Visual Analysis Johannes Kehrer 1 , Peter Filzmoser 2 , and Helwig

Minnesota State and Minnesota Department of Education PSEO & Concurrent Enrollment COVID 19