SQR: In-network packet loss recovery from link failures for high-reliability datacenter networks


SLIDE 1


SQR

In-network packet loss recovery from link failures for high-reliability datacenter networks

Ting Qu, Raj Joshi, Mun Choon Chan, Ben Leong, Deke Guo, Zhong Liu

SLIDE 2

Data centers around the world

[Images: Google’s worldwide DC map; Facebook DC interior; Microsoft’s DC in Dublin, Ireland; global Microsoft Azure DC footprint]

SLIDE 3

Low latency is a key requirement

Web search, e-commerce, database cache

Low latency for short messages → better app performance & user experience

SLIDE 4

Improve Flow Completion Time (FCT)

  • DCTCP (SIGCOMM’10)
  • D3 (SIGCOMM’11)
  • HULL (NSDI’12)
  • pFabric (SIGCOMM’13)
  • PASE (SIGCOMM’14)
  • TIMELY (SIGCOMM’15)
  • FUSO (ATC’16)
  • Homa (SIGCOMM’18)
  • HPCC (SIGCOMM’19)


But very little work specifically addresses how link failures impact FCT

SLIDE 5

Link failures are common

  • Gill et al. [1] reported:
    • Link failures are common and can cause the loss of a large number of small packets.
    • The 95th percentile value of link failures is 136 per day during their measurement period.

[1] Phillipa Gill, Navendu Jain, and Nachiappan Nagappan. 2011. Understanding network failures in data centers: measurement, analysis, and implications. In Proceedings of SIGCOMM.


SLIDE 6

Link failure management

Link failure management:
  • Link failure detection (e.g., F10)
  • Route recovery
    • Protection (e.g., Conga, Hula, SPIDER)
    • Restoration (e.g., ShareBackup)
  • Packet loss recovery
    • Host-based (e.g., TCP)

Host-based packet loss recovery can lead to a much longer flow completion time (FCT) for short flows


SLIDE 7

Link failure case


Link detection time: 30 µs (F10, NSDI’13) + route reconfiguration time: 730 µs (ShareBackup, SIGCOMM’18) = 760 µs route failure time

SLIDE 8

Long FCT under link failure

[Chart: FCT is hundreds of µs without failure, but tens of ms up to 1 s under link failure]


Host-based recovery is a major contributor to the large increase in FCT

SLIDE 9

Why does host-based recovery increase FCT significantly?

  • Packet losses in the TCP three-way handshake
    • Wait at least 1 s and retransmit
  • Packet losses in the middle of a cwnd
    • Fast retransmission: 1 RTT (100s of µs)
  • Packet losses at the tail of a cwnd
    • Retransmission timeout: several ms

[Diagrams: a lost SYN in the three-way handshake, a mid-cwnd loss recovered via duplicate ACKs, and a tail-of-cwnd loss recovered via timeout]
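To make the magnitudes concrete, here is a back-of-the-envelope sketch of how each loss scenario inflates the FCT of a short flow; the RTT, minimum RTO, and base FCT values are illustrative assumptions, not measurements from the talk.

```python
# Back-of-the-envelope FCT penalty of host-based loss recovery.
# RTT, minimum RTO, and base FCT are illustrative assumptions.

RTT_S = 100e-6       # ~100 us datacenter round-trip time (assumed)
MIN_RTO_S = 5e-3     # "several ms" retransmission timeout (assumed)
SYN_TIMEOUT_S = 1.0  # initial SYN retransmission timer (>= 1 s)

BASE_FCT_S = 3 * RTT_S  # assumed FCT of a short flow with no loss

penalties = {
    "Loss in 3-way handshake (SYN timer)": SYN_TIMEOUT_S,
    "Loss mid-cwnd (fast retransmit, 1 RTT)": RTT_S,
    "Loss at tail of cwnd (RTO)": MIN_RTO_S,
}

for case, extra in penalties.items():
    fct = BASE_FCT_S + extra
    print(f"{case}: {BASE_FCT_S * 1e6:.0f} us -> {fct * 1e6:.0f} us "
          f"({fct / BASE_FCT_S:.0f}x)")
```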

Can we keep FCT low under link failure for latency-sensitive flows?


SLIDE 10

Our solution: SQR

Link failure management:
  • Link failure detection (e.g., F10)
  • Route recovery
    • Protection (e.g., Conga, Hula, SPIDER)
    • Restoration (e.g., ShareBackup)
  • Packet loss recovery
    • Host-based (e.g., TCP)
    • In-network (SQR)


The network is the “right” place to perform packet loss recovery

SLIDE 11

How does SQR keep FCT low when there is a link failure?

Objective:
  • Mask the effect of packet loss from the end-points during the link failure detection time and route reconfiguration time (together, the route failure time).

Key idea:
  • Continuously cache recently sent packets in the switch for a duration equal to the route failure time.
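A minimal software-level sketch of this key idea, assuming a simple timestamped FIFO; SQRCache and its method names are hypothetical, and the real mechanism runs entirely in the switch dataplane (described in the following slides):

```python
from collections import deque
import time

ROUTE_FAILURE_TIME_S = 760e-6  # detection (30 us) + reconfiguration (730 us)

class SQRCache:
    """Conceptual model: keep copies of recently sent packets for one
    route-failure-time, aging out copies that are no longer needed."""

    def __init__(self, horizon_s=ROUTE_FAILURE_TIME_S):
        self.horizon_s = horizon_s
        self.cache = deque()  # (send_timestamp, packet), oldest first

    def on_send(self, pkt, now=None):
        now = time.monotonic() if now is None else now
        self.cache.append((now, pkt))
        self._age_out(now)

    def _age_out(self, now):
        # Copies older than the horizon cannot have been lost to a
        # still-undetected failure, so they are dropped.
        while self.cache and now - self.cache[0][0] > self.horizon_s:
            self.cache.popleft()

    def on_failover(self, now=None):
        """New route is ready: replay everything still cached."""
        now = time.monotonic() if now is None else now
        self._age_out(now)
        return [pkt for _, pkt in self.cache]
```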


SLIDE 12

Is it feasible to cache packets on the switch?

Route failure time:
  • PortLand (SIGCOMM’09): 65 ms
  • F10 (NSDI’13): 1 ms
  • ShareBackup (SIGCOMM’18): 760 µs

Switch buffer size:
  • Trident+ (’10): 9 MB
  • Trident2+ (’15): 16 MB
  • Tomahawk+ (’16): 22 MB
  • Tomahawk2 (’17): 42 MB

+ availability of dataplane programming (e.g., P4)
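As a rough feasibility check (illustrative arithmetic, not from the slides): caching one route-failure-time of traffic needs roughly line rate × route failure time of buffer.

```python
# Buffer needed to cache one route-failure-time of traffic:
# line rate x route failure time. The line rate is an assumption.

LINE_RATE_BPS = 100e9  # assumed 100 Gbps port

route_failure_time_s = {
    "PortLand (65 ms)": 65e-3,
    "F10 (1 ms)": 1e-3,
    "ShareBackup (760 us)": 760e-6,
}

for scheme, t in route_failure_time_s.items():
    needed_mb = LINE_RATE_BPS * t / 8 / 1e6
    print(f"{scheme}: ~{needed_mb:.1f} MB for one fully loaded 100G port")
```

At 100 Gbps, 760 µs of traffic is only ~9.5 MB, which fits in the buffers above, whereas PortLand’s 65 ms would need ~800 MB; fast detection and reconfiguration are what make in-network caching practical.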

SLIDE 13

Where and how to cache?

Challenges

  • In a switch dataplane, packets can only be stored in the packet buffer within the buffer & queuing engine (BQE).
  • The default FIFO queues send out packets as fast as possible.
  • No BQE today readily provides the queuing discipline required to cache packets for a fixed time.
  • The BQE does not support custom packet scheduling algorithms.


SLIDE 14

Solution

  • Keep recent copies of transmitted packets by cloning and then recirculating the cloned packets through the BQE.
  • Supported by the Portable Switch Architecture (PSA).
  • Packets are cached for durations sufficiently long to detect link failure and perform route recovery.
  • Resend cached packets on the new route when it becomes available.
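A minimal software model of this clone-and-recirculate loop, assuming simplified timing (a fixed per-pass queuing delay) and hypothetical function names; the real design does this per packet in the PSA dataplane:

```python
from collections import deque

RECIRC_DELAY_S = 50e-6  # assumed per-pass delay through the caching queue
CACHE_TIME_S = 760e-6   # route failure time the cache must cover

def egress_pass(pkt, is_original, age_s, tx, caching_queue):
    """One egress-pipeline pass over a packet or its cached copy."""
    if is_original:
        tx.append(pkt)  # the original packet is transmitted normally
    if age_s >= CACHE_TIME_S:
        return  # copy has been cached long enough: drop it
    # Clone and recirculate: the copy re-enters a caching queue in the BQE.
    caching_queue.append((pkt, age_s + RECIRC_DELAY_S))

def send_all(packets):
    tx, cache = [], deque()
    for p in packets:
        egress_pass(p, True, 0.0, tx, cache)
    while cache:  # copies keep cycling until they age out
        p, age = cache.popleft()
        egress_pass(p, False, age, tx, cache)
    return tx
```

Each copy keeps cycling through the caching queue until its total time in the loop covers the route failure time, at which point it is dropped.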


SLIDE 15

Challenges

  • “Aging” of packets
  • Load balancing of recirculating packets
  • Handling packet reordering


SLIDE 16

Delay timer

[Diagram: caching queue inside the BQE feeding the egress pipeline]

In the egress pipeline, check whether the copy has been delayed long enough: CurrentEgressTstamp − StartEgressTstamp. If not, make a copy and recirculate it.

  • The packet is transmitted if it is the first (original) packet.
  • A cached copy is dropped once it has been cached for longer than the link failure detection time.
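A sketch of just this per-pass decision, using the slide’s timestamp names; tx and recirculate are hypothetical stand-ins for the dataplane primitives:

```python
def delay_timer(pkt, current_egress_tstamp_s, start_egress_tstamp_s,
                is_original, cache_time_s, tx, recirculate):
    """Per-pass delay-timer decision, following the slide's names."""
    if is_original:
        tx(pkt)  # transmit the first/original packet
    cached_for = current_egress_tstamp_s - start_egress_tstamp_s
    if cached_for >= cache_time_s:
        return  # cached longer than the failure detection time: drop copy
    recirculate(pkt)  # not old enough: keep the copy in the caching queue
```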

SLIDE 17

Dynamic queue selection

[Diagram: egress pipeline with caching queues on multiple ports inside the BQE]

The switch tracks per-port utilization (e.g., Port 1: 50, Port 2: 80) and maintains the least-loaded port (LeastLoadedPort, LeastUtilization). Each cloned packet is mirrored to the caching queue of the least-loaded port; if the link is down, cached packets are sent out on the backup path instead.


Packets from the same flow can be cached in different queues
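A sketch of the dynamic queue selection, assuming port utilization is available as a simple mapping (in the real design this state lives in dataplane registers):

```python
def select_caching_queue(port_utilization, link_down, backup_port):
    """Choose where a cloned packet is mirrored for caching.

    port_utilization: {port_id: utilization} for candidate caching
    ports (illustrative structure, not the actual dataplane layout).
    """
    if link_down:
        return backup_port  # failover: drain cached packets to the backup path
    # Normal case: cache on the currently least-loaded port's queue.
    return min(port_utilization, key=port_utilization.get)

# Example from the slide: Port 1 at 50, Port 2 at 80 -> cache on Port 1.
assert select_caching_queue({1: 50, 2: 80}, False, backup_port=2) == 1
```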

SLIDE 18

Packet order logic

[Diagram: a per-port packet tag counter in the egress pipeline (counter at 6; the cached copy carries PktTag = 5), with caching queues in the BQE and a backup port]

Each packet is tagged with a value from a monotonically increasing counter (PktTag) when it is first transmitted, and its cached copy carries the same tag.

SLIDE 19

Packet order logic

[Diagram: on failover, each cached packet’s PktTag is compared with NextPktTag; here they are the same (8), so the packet is released to the backup port and NextPktTag advances to 9]

SLIDE 20

Packet order logic

[Diagram: a cached packet whose PktTag is larger than NextPktTag (8) is out of order, so it is recirculated back into the caching queue instead of being released to the backup port]
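Putting slides 18–20 together, a minimal sketch of the failover release logic; the handling of tags smaller than NextPktTag (already-released duplicates) is an assumption, since the slides only show the "same" and "larger" cases:

```python
def release_in_order(cached_copies, next_pkt_tag, send_to_backup, recirculate):
    """Failover release logic combining the packet-order slides.

    cached_copies yields (pkt, pkt_tag) in whatever order copies surface
    from the caching queues, which may differ from the send order.
    """
    for pkt, tag in cached_copies:
        if tag == next_pkt_tag:
            send_to_backup(pkt)    # in order: release to the backup port
            next_pkt_tag += 1
        elif tag > next_pkt_tag:
            recirculate(pkt, tag)  # too early: keep cycling until its turn
        # tag < next_pkt_tag: assumed to be an already-released duplicate,
        # so it is dropped (this case is not shown on the slides).
    return next_pkt_tag
```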

SLIDE 21

Why it works

  • No packet loss
    ✓ Cache a copy of sent packets for a duration at least equal to the route failure time
    ✓ Packets are sent to the backup port once the new route is ready
  • Packets stay in order
    ✓ Lost packets are recovered in order based on the packet tag
  • Minimal egress processing delay for other flows going through the switch
    ✓ Select the caching queue from multiple ports
    ✓ Dynamic least-loaded port selection
  • Complements existing methods of link failure detection and route reconfiguration


SLIDE 22

Evaluation

  • Hardware testbed
    • Barefoot Tofino switch
    • Intel Xeon servers equipped with Intel X710 NICs
  • Traces
    • Web search
    • Data mining
  • Schemes compared (SQR implemented in P4)
    • SB’ (simple ShareBackup, 760 µs route failure time)
    • SB’ + SQR
    • LRR (30 µs route failure time)
    • LRR + SQR


SLIDE 23

SQR masks link failures from the end-point transport


SLIDE 24

SQR achieves low FCT under link failure


SLIDE 25

Overhead: Buffer size


Steady-state packet buffer consumption with a 30 µs link failure detection time

SLIDE 26

Conclusion

  • We design SQR, an in-network packet loss recovery method that keeps FCT low for latency-sensitive flows when there is a link failure.
  • SQR eliminates packet loss during link failures and enables seamless hand-off of flows to alternative paths.
  • SQR can be implemented on any programmable ASIC based on the Portable Switch Architecture (PSA).


SLIDE 27

SLIDE 28

Impact of SQR Traffic

SLIDE 29

Overhead: Egress processing
