High Speed Networks Need Proactive Congestion Control Using Programmable Forwarding Planes!
Lavanya Jose, Steve Ibanez, Lisa Yan, Nick McKeown, Sachin Katti (Stanford University); Mohammad Alizadeh (MIT); George Varghese (Microsoft Research)
Outline
- At 100G speeds, we’ll need much faster congestion control schemes
- Letting network switches directly compute rates is a fast and scalable scheme
- We can realize such a scheme in 100G networks using programmable forwarding planes (stateful data planes)
The Congestion Control Problem
[Topology: Flow A crosses Link 0 (100G) and Link 1 (60G); Flow B crosses Links 1 and 2 (30G); Flow C crosses Links 2 and 3 (10G); Flow D crosses Links 3 and 4 (100G).]
Ask an oracle.
Flow paths and the oracle’s rates:
- Flow A: Links 0, 1 → 35G
- Flow B: Links 1, 2 → 25G
- Flow C: Links 2, 3 → 5G
- Flow D: Links 3, 4 → 5G
Link capacities: Link 0 = 100G, Link 1 = 60G, Link 2 = 30G, Link 3 = 10G, Link 4 = 100G
[Topology annotated with the oracle’s rates: Flow A = 35G, Flow B = 25G, Flow C = 5G, Flow D = 5G.]
Traditional congestion control
- No explicit information about the traffic matrix
- Measure congestion signals, then react by adjusting the rate after a measurement delay
- Gradual: they know the direction to adjust, but can’t jump to the right rates
- “Reactive Algorithms”
Control loop: Adjust Flow Rate → Measure Congestion → repeat
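As a concrete illustration (my generic sketch, not code from the talk), a reactive scheme such as TCP-style AIMD moves its rate one step per measurement round, so it can only approach the right rate gradually:

```python
def aimd_step(rate, congestion_seen, add=1.0, mult=0.5):
    """One reactive control-loop iteration (a generic AIMD sketch):
    additive increase while no congestion is measured,
    multiplicative decrease when a congestion signal arrives."""
    return rate * mult if congestion_seen else rate + add

# The sender can only move one step per feedback round (roughly one RTT),
# so reaching a far-away target rate takes many rounds.
rate = 1.0
for _ in range(10):            # 10 rounds with no congestion signal
    rate = aimd_step(rate, False)
```

The rate after those 10 uncongested rounds is only 11.0 Gbps, illustrating why closing a large gap takes many RTTs.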
[Animation frames: per-flow transmission rate (Gbps) vs. time for the topology above. RCP (dashed) converges only gradually toward the ideal rates (dotted): Flow A = 35G, Flow B = 25G, Flow C = 5G, Flow D = 5G.]
30 RTTs to Converge
Reactive schemes are slow for 100G
Convergence Times Are Long
At 100G, a typical flow in a search workload is < 7 RTTs long.
[Pie chart: fraction of total flows in the Bing workload, split 14% / 56% / 30% across Small (1-10KB), Medium (10KB-1MB), and Large (1MB-100MB) flows.]
1MB / 100 Gb/s = 80 µs
Reactive algorithms forgo explicit flow information at the cost of long convergence times
- Can we use explicit flow information and get shorter convergence times?
[Plot axis: time in # of RTTs, 1 RTT = 24 µs.]
Back to the oracle: how did she use the traffic matrix to compute the rates?
Waterfilling Algorithm
All links and flows start empty: Link 0 (0/100G), Link 1 (0/60G), Link 2 (0/30G), Link 3 (0/10G), Link 4 (0/100G); Flows A, B, C, D at 0G.
Waterfilling: the 10G link fills first
Raise all flows together; Link 3 (carrying C and D) saturates when every flow reaches 5G: Link 1 (10/60G), Link 2 (10/30G), Link 3 (10/10G), Link 0 (5/100G), Link 4 (5/100G). Flows A, B, C, D at 5G.
Waterfilling: the 30G link fills next
Freeze C and D at 5G; keep raising A and B until Link 2 (carrying B and C) saturates at B = 25G: Link 1 (50/60G), Link 2 (30/30G), Link 3 (10/10G), Link 0 (25/100G), Link 4 (5/100G). Flow A (25G), Flow B (25G), Flow C (5G), Flow D (5G).
Waterfilling: the 60G link fills last
Freeze B at 25G; raise A until Link 1 saturates at A = 35G: Link 1 (60/60G), Link 2 (30/30G), Link 3 (10/10G), Link 0 (35/100G), Link 4 (5/100G). Flow A (35G), Flow B (25G), Flow C (5G), Flow D (5G).
Fair Share of Bottlenecked Links
Each flow ends up at the fair share of its bottleneck link: Link 1 (60G) has fair share 35G (Flow A), Link 2 (30G) has fair share 25G (Flow B), Link 3 (10G) has fair share 5G (Flows C and D). Links 0 (35/100G) and 4 (5/100G) are not bottlenecks.
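The walkthrough above fits in a few lines of code. Here is a generic max-min water-filling routine (a sketch; variable names are mine, not from the talk), applied to the example topology:

```python
def waterfill(capacities, flow_links):
    """Max-min fair rates via water-filling.
    capacities: {link: capacity}; flow_links: {flow: set of links it crosses}."""
    rates = {}
    remaining = dict(capacities)
    active = {f: set(links) for f, links in flow_links.items()}
    while active:
        # Fair share each still-used link could offer its remaining active flows.
        share = {l: remaining[l] / sum(1 for ls in active.values() if l in ls)
                 for l in remaining
                 if any(l in ls for ls in active.values())}
        bottleneck = min(share, key=share.get)
        r = share[bottleneck]
        # Freeze every flow crossing the bottleneck at rate r.
        for f in [f for f, ls in active.items() if bottleneck in ls]:
            rates[f] = r
            for l in active[f]:
                remaining[l] -= r
            del active[f]
    return rates

# Example from the slides: A on Links 0,1; B on 1,2; C on 2,3; D on 3,4.
rates = waterfill(
    {0: 100, 1: 60, 2: 30, 3: 10, 4: 100},
    {"A": {0, 1}, "B": {1, 2}, "C": {2, 3}, "D": {3, 4}})
# rates == {"A": 35.0, "B": 25.0, "C": 5.0, "D": 5.0}
```

Each iteration saturates one bottleneck link (Link 3, then Link 2, then Link 1 here), mirroring the three water-filling steps above.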
A centralized water-filling scheme may not scale.
Can we let the network figure out rates in a distributed fashion?
Fair Share for a Single Link
Flow A: demand ∞; Flow B: demand ∞.
Capacity at Link 1: 30G, so the fair share rate is 30G/2 = 15G for each flow.
A second link introduces a dependency
Flow A: demand ∞; Flow B: demand 10G (Flow B is restricted by Link 2, a 10G link).
Capacity at Link 1: 30G. Demand of flows restricted at other links: 10G. Number of unrestricted flows: 1. So the fair share rate is (30G - 10G)/1 = 20G.
Proactive Explicit Rate Control (PERC)
The control packet for Flow B carries, for each link on its path, a demand field d and a fair-share field f, initialized to d = [∞, ∞], f = [?, ?].
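One way a switch might maintain this per-link state as control packets arrive (a hypothetical sketch, not the authors' exact P4 logic; class and field names are assumed):

```python
import math

class PercLink:
    """Per-link state for proactive rate computation. A flow's control
    packet carries its demand at this link; math.inf means the flow is
    not restricted by any other link on its path."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.demands = {}  # flow id -> demand from its latest control packet

    def on_control_packet(self, flow, demand):
        self.demands[flow] = demand
        return self.fair_share()  # stamped back into the control packet

    def fair_share(self):
        limited = [d for d in self.demands.values() if d < math.inf]
        unrestricted = len(self.demands) - len(limited)
        if unrestricted == 0:
            return self.capacity  # nothing to split; report full capacity
        return (self.capacity - sum(limited)) / unrestricted

link1 = PercLink(30)
link1.on_control_packet("A", math.inf)  # A unrestricted: share = 30/1 = 30
link1.on_control_packet("B", 10)        # B limited to 10G elsewhere
link1.fair_share()                      # (30 - 10)/1 = 20
```

After Flow B's packet reports its 10G restriction, the stamped fair share for the unrestricted Flow A drops to 20G, matching the two-link example.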
Constraints of Programmable Forwarding Planes at 100 Gb/s
- Limited compute: an action takes ~1 ns, typically simple primitives like add/compare
- Limited information that we can carry and modify per packet
- Limited area for state and look-up tables (~MB), much of which is needed for L2/L3
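Under such constraints a general-purpose divider is often unavailable. A common hardware trick (a generic sketch, not necessarily what the authors implemented) is to round the divisor up to a power of two so the division in the fair-share calculation becomes a right shift:

```python
def approx_div_pow2(value, divisor):
    """Approximate value // divisor by rounding the divisor up to a power
    of two, so hardware can use a right shift instead of a divider.
    This underestimates the quotient, which is safe for rate allocation."""
    shift = max(0, (divisor - 1).bit_length())  # ceil(log2(divisor))
    return value >> shift

approx_div_pow2(60, 3)   # divisor rounds up to 4 -> 60 >> 2 = 15 (exact: 20)
approx_div_pow2(100, 1)  # shift of 0 -> 100 (exact)
```

The approximation never over-allocates, at the cost of leaving some capacity unused when the flow count is not a power of two.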
[Switch pipeline: Parser, then a sequence of match-action stages (L2 table, IPv4 table, IPv6 table, ACL table, each a match table plus an action macro) interleaved with fixed actions, feeding the queues.]
PERC in P4 NetFPGA
Toolchain: P4 program → P4 front end → PX → Xilinx SDNet compilation → NetFPGA SUME switch
Division of compute between end host & switch
Flow A: demand ∞; Flow B: demand 10G. Capacity at Link 1: 30G; demand of flows restricted at other links: 10G; number of unrestricted flows: 1. So the fair share rate is (30G - 10G)/1 = 20G. Instead of computing this itself, the switch stamps the inputs to the fair share calculation into the packet: (30G, 10G, 1).
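With switches only stamping the inputs (capacity, restricted demand, unrestricted count) at each hop, the end host can finish the arithmetic and take the minimum across hops. A sketch under that assumed division of labor (function name is mine):

```python
def host_rate(stamps):
    """stamps: list of (capacity, restricted_demand, num_unrestricted)
    tuples, one per link on the flow's path. The host computes each
    link's fair share and sends at the most restrictive one."""
    return min((c - d) / n for c, d, n in stamps)

host_rate([(30, 10, 1)])               # single hop: (30G - 10G)/1 = 20G
host_rate([(30, 10, 1), (100, 0, 2)])  # min(20, 50) = 20G
```

This moves the division off the switch pipeline, where compute per action is the scarce resource, onto the end host.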
Interesting Questions
- Minimum time for a distributed scheme to converge
- Minimum amount of state for provable convergence
- How many active flows in a max-min fair network?
- Imprecise demands may still call for some reactive correction