Improving TCP Congestion Control with Machine Intelligence Yiming - PowerPoint PPT Presentation

Improving TCP Congestion Control with Machine Intelligence Yiming Kong *, Hui Zang † , and Xiaoli Ma* *School of ECE, Georgia Tech, USA † Futurewei Technologies, USA 1

TCP congestion control Sender 1 • A critical problem in TCP/IP networks • End-to-end or with in-net support • Adjust congestion window (cwnd) Sender 2 Network • Packet loss • Round trip time (RTT) • High throughput, low delay Sender K 2

TCP NewReno (2) Congestion avoidance, cwnd +=1/cwnd (1) Slow start Cwnd +=1 (3) Fast recovery (s) BW = 10Mbps, RTT min = 150ms, Single NewReno flow 3

Other TCP congestion control schemes Vegas Cubic Compound [Brakmo et al. 1995] [Ha et al. 2008] [Tan et al. 2006] • Mechanism-driven instead of objective-driven • Pre-defined operations in response to specific feedback signals • Do not learn and adapt from experience 4 *Figures from: Afanasyev et al. 2010. Host-to-Host Congestion Control for TCP. IEEE Commun. Surveys Tuts., Vol. 12, No. 3, 304 – 342.

RemyCC [Winstein et al. 2013] Traffic model Objective Prior assumptions function about network Remy RemyCC • Delay-throughput tradeoff as objective function • Offline training to generate lookup tables • Inflexible for the network & traffic model changes 5

Our contributions • Teach TCP to optimize its cwnd to minimize packet loss events • LP-TCP • Teach TCP to adaptively adjust cwnd according to an objective • RL-TCP • Improved throughput -- up to 29% over NewReno for LP-TCP • Reduced RTT -- up to 7% over NewReno for RL-TCP • Maintaining fairness 6

Loss prediction based TCP (LP-TCP) (during congestion avoidance) • When a new ACK is received, cwnd += 1/cwnd • Before sending a packet Feature vector, etc. • Sensing engine updates the feature vector ACKs Loss Sensing • Loss predictor outputs loss probability p Network predictor engine • If p < threshold, the actuator sends the packet Loss • Otherwise, the packet is not sent, and cwnd -= 1 probability • Set threshold to max The actuator Packets LP-TCP 7

Training the loss predictor • Collect training data through NewReno simulations on NS2 • Record the state right before the packet goes into transmission as a feature vector • If the packet is successfully delivered, this feature vector gets a label of 0 • Otherwise, the label is 1 (for loss) • Stop the collection when we have enough losses in the data • Train a random forest classifier offline Features cwnd, ewma of ACK intervals, ewma of sending intervals, minimum of sending intervals, minimum of ACK intervals, minimum of RTT, time series (TS) of ack intervals, TS of sending intervals, TS of RTT ratios, and etc. • Re-train LP upon network changes 8

Reinforcement learning based TCP (RL-TCP) • Q-TCP [Li et al. 2016] • Based on Q-learning • Designed with mostly a single flow in mind • Sufficient buffering available at the bottleneck 9

Reinforcement learning based TCP (RL-TCP) • Q-TCP [Li et al. 2016] Our RL-TCP • Based on Q-learning • Add variable to state • • Designed with mostly a single flow in mind Tailor action space to under-buffered bottleneck • • Sufficient buffering available at the bottleneck Propose a new temporal credit assignment of reward • Objective of RL-TCP • Learn to adjust cwnd to increase an utility function delay Packet loss rate throughput Bottleneck bandwidth 10

Map to RL • State s n • Reward r n+1 • ewma of the ACK inter-arrival time • ewma of packet inter-sending time • RTT ratio • slow start threshold • current cwnd size where • Action a n • cwnd += a n , where a n = -1, 0, +1, +3 11

Learning the Q-value • Learning the Q-value function: Q(s, a) • Q(s,a): the value of being at a particular state s and performing action a • Updated every RTT, using SARSA r n+1 = f(U n+1 – U n ) • This is the proposed temporal credit assignment of reward • Action selection: ɛ -greedy exploration & exploitation Randomly select an action from the action space, if rand() < ɛ a n+1 = , otherwise 12

Experimental setup in NS2 RTT min = 150 ms Buffer size L Sender 1 Receiver 1 Sender 2 Receiver 2 Router 1 Router 2 ... ... Bottleneck B = 10Mbps Sender K Receiver K • Bandwidth delay product = 150 packets • Throughput (tp ) = (total amount of bytes received)/(sender’s active duration) • Delay (d) = RTT - RTT min 13

Single sender: performance of RL based TCP Table: Performance of RL based TCP. Buffer size L is 50 packets. E(tp) V(tp) E(d) V(d) M e Q-TCP 6.176 0.267 16.26 4.662 1.541 Q-TCP ca 8.72*10 -3 9.597 20.31 3.690 1.960 Q a -TCP 9.658 0.019 14.80 2.818 1.998 Q a -TCP ca 8.10*10 -5 3.24*10 -2 9.857 3.74 2.156 9.30*10 -3 RL-TCP no-ca 9.723 13.87 3.152 2.011 RL-TCP 9.869 7.49*10 -4 3.86 3.24*10 -2 2.154 • Action space: Redesigning action space improves performance • Credit assignment: The proposed credit assignment scheme improves performance 14

Single sender: performance of LP-TCP • Buffer size L = 5 Network ceiling • LP-TCP predicts all packet losses (during congestion avoidance) & keeps the cwnd at the network ceiling 15

Single sender: varying buffer size RL-TCP LP-TCP L = 5 LP-TCP NewReno • L = 50 LP-TCP has the best M e LP-TCP NewReno when L = 5 L = 150 Q-TCP • RL-TCP has the best M e RL-TCP when L = 50, 150. NewReno Q-TCP • Performance of RL-TCP is less sensitive to the varying buffer size Q-TCP 16

Multiple senders • 4 senders, homogeneous, L = 50 • 3 NewReno, 1 LP-TCP or RL-TCP, L = 50 RL-TCP 0.592 LP-TCP 0.562 NewReno 0.545 Q-TCP 0.306 17

Conclusions • Propose two learning-based TCP congestion control schemes for wired networks • LP-TCP works the best with small buffers at the bottlenecks • RL-TCP achieves the best throughput-delay-tradeoff under various network configurations • Future work • Explore policy-based RL-TCP • Improve fairness for learning-based TCP congestion control schemes 18

Thank you! Q & A 19

Improving TCP Congestion Control with Machine Intelligence Yiming - PowerPoint PPT Presentation

Improving TCP Congestion Control with Machine Intelligence Yiming Kong , Hui Zang , and Xiaoli Ma *School of ECE, Georgia Tech, USA Futurewei Technologies, USA 1 TCP congestion control Sender 1 A critical problem in TCP/IP

TCP TCP Congestion Control Congestion Control Essential strategy :: The TCP host sends

Internet congestion control: TCP Internet congestion control: TCP 1988: "Congestion

TCP/IP Over Lossy Links - TCP SACK without Congestion Control Organization 1. The History of

TCP Congestion Avoidance Joshua Gancher November 10, 2016 Joshua Gancher TCP Congestion

TCP Pacing in Data Center Networks Monia Ghobadi, Yashar Ganjali Department of Computer

End-to-End Previously on CS5229 , Not everyone uses TCP Congestion Control TCP Congestion

The Present and Future of Congestion Control Mark Handley Outline Purpose of congestion

Attacks on TCP 1 Outline What is TCP protocol? How the TCP Protocol Works SYN

Congestion Control Mark Handley Outline Part 1: Traditional congestion control for bulk

TCP-CCC: single-path TCP congestion control coupling draft-welzl-tcp-ccc-00 Michael Welzl,

Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion 1

What do you mean, Congestion? some history Congestion Collapse

Lecture 8 Congestion Control EECS 122 University of California Berkeley TOC: Congestion

Congestion Control In The Congestion Control In The Internet Internet JY Le Boudec Fall 2009

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

1 ACK clocking ACK clocking ACK clocking spreads out bursts ACK clocking spreads out bursts

E2E Service Verification Architecture An architectural proposal for defining, engineering, and

May 6, 2020 AGENDA ITEM: A-1 TOPIC: Executive Directors Report Recommendation: Receive

First Steps Summer Third Birthdays Office of Special Education First Steps Program March 2018

Marti Motoyama, Brendan Meeder, Kirill Levchenko, Stefan Savage and Geoffrey M. Voelker OSN

Hybrid Control and Communication Karl Henrich Johansson KTH Stockholm, Sweden kallej@s3.kth.se

On the Design of Load Factor based Congestion Control Protocols for Next-Generation Networks

Swarm-based In Incast Congestion Control in in a Datacenter Serving Web Applications Haoyu Wang*

QoS Services with Dynamic Packet State Ion Stoica Carnegie Mellon University (joint work with

Improving TCP Congestion Control with Machine Intelligence Yiming - PowerPoint PPT Presentation

Improving TCP Congestion Control with Machine Intelligence Yiming Kong *, Hui Zang , and Xiaoli Ma* *School of ECE, Georgia Tech, USA Futurewei Technologies, USA 1 TCP congestion control Sender 1 A critical problem in TCP/IP

TCP TCP Congestion Control Congestion Control Essential strategy :: The TCP host sends

Internet congestion control: TCP Internet congestion control: TCP 1988: &quot;Congestion

TCP/IP Over Lossy Links - TCP SACK without Congestion Control Organization 1. The History of

TCP Congestion Avoidance Joshua Gancher November 10, 2016 Joshua Gancher TCP Congestion

TCP Pacing in Data Center Networks Monia Ghobadi, Yashar Ganjali Department of Computer

End-to-End Previously on CS5229 , Not everyone uses TCP Congestion Control TCP Congestion

The Present and Future of Congestion Control Mark Handley Outline Purpose of congestion

Attacks on TCP 1 Outline What is TCP protocol? How the TCP Protocol Works SYN

Congestion Control Mark Handley Outline Part 1: Traditional congestion control for bulk

TCP-CCC: single-path TCP congestion control coupling draft-welzl-tcp-ccc-00 Michael Welzl,

Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion 1

What do you mean, Congestion? some history Congestion Collapse

Lecture 8 Congestion Control EECS 122 University of California Berkeley TOC: Congestion

Congestion Control In The Congestion Control In The Internet Internet JY Le Boudec Fall 2009

Congestion Games with affine functions Maria Serna Fall 2016 AGT-MIRI, FIB-UPC Congestion Games

1 ACK clocking ACK clocking ACK clocking spreads out bursts ACK clocking spreads out bursts

E2E Service Verification Architecture An architectural proposal for defining, engineering, and

May 6, 2020 AGENDA ITEM: A-1 TOPIC: Executive Directors Report Recommendation: Receive

First Steps Summer Third Birthdays Office of Special Education First Steps Program March 2018

Marti Motoyama, Brendan Meeder, Kirill Levchenko, Stefan Savage and Geoffrey M. Voelker OSN

Hybrid Control and Communication Karl Henrich Johansson KTH Stockholm, Sweden kallej@s3.kth.se

On the Design of Load Factor based Congestion Control Protocols for Next-Generation Networks

Swarm-based In Incast Congestion Control in in a Datacenter Serving Web Applications Haoyu Wang*

QoS Services with Dynamic Packet State Ion Stoica Carnegie Mellon University (joint work with

Improving TCP Congestion Control with Machine Intelligence Yiming Kong , Hui Zang , and Xiaoli Ma *School of ECE, Georgia Tech, USA Futurewei Technologies, USA 1 TCP congestion control Sender 1 A critical problem in TCP/IP

Internet congestion control: TCP Internet congestion control: TCP 1988: "Congestion