Iroko: A Data Center Emulator for Reinforcement Learning - PowerPoint PPT Presentation



SLIDE 1

Iroko

A Data Center Emulator for Reinforcement Learning

Fabian Ruffy, Michael Przystupa, Ivan Beschastnikh University of British Columbia

https://github.com/dcgym/iroko

SLIDE 2

Reinforcement Learning and Networking


SLIDE 8

  • DC challenges are optimization problems: traffic control, resource management, routing
  • Operators have complete control, so automation is possible
  • Lots of data can be collected

The Data Center: A perfect use case

Cho, Inho, Keon Jang, and Dongsu Han. "Credit-scheduled delay-bounded congestion control for datacenters." SIGCOMM 2017

SLIDE 9

  • Typical reinforcement learning is not viable for data center operators: fragile stability, questionable reproducibility, unknown generalizability
  • Prototyping RL is complicated: operators cannot interfere with live production traffic, offline traces are limited in expressivity, and deployment is tedious and slow

Two problems…

SLIDE 10

  • Iroko: an open reinforcement learning gym for data center scenarios
  • Inspired by Pantheon* for WAN congestion control
  • Deployable on a local Linux machine
  • Scales to topologies with many hosts
  • Approximates real data center conditions
  • Allows arbitrary definitions of reward, state, and actions

Our work: A platform for RL in Data Centers

*Yan, Francis Y., et al. "Pantheon: the training ground for Internet congestion-control research." ATC 2018
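Iroko exposes its scenarios through the OpenAI Gym API, so an agent interacts with the emulated network via the usual reset()/step() loop. Below is a minimal sketch of that loop with a toy stand-in environment; the class, state, and reward are illustrative, not Iroko's actual implementation:

```python
import random

class ToyDCEnv:
    """Stand-in for a dc-gym environment with a Gym-style reset()/step().

    State: normalized queue length of a single bottleneck link.
    Action: a sending-rate fraction in [0, 1] for the hosts.
    Reward: utilization minus a queueing penalty (illustrative only).
    """

    def reset(self):
        self.queue = 0.0
        return [self.queue]

    def step(self, action):
        rate = min(max(action, 0.0), 1.0)  # clamp out-of-range actions
        # The queue grows when hosts send faster than the link drains.
        self.queue = min(max(self.queue + rate - 0.8, 0.0), 1.0)
        reward = rate - 2.0 * self.queue   # utilization vs. queueing delay
        done = False
        return [self.queue], reward, done, {}

env = ToyDCEnv()
obs = env.reset()
for _ in range(100):
    action = random.random()               # placeholder for an RL policy
    obs, reward, done, info = env.step(action)
```

The real environment would report collector statistics as state and apply actions as rate limits; only the shape of the interaction loop is the point here.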


SLIDE 17

Iroko in one slide

[Diagram: a Policy interacts with the emulated data center through the OpenAI Gym interface; configurable components: Topology (Fat-Tree, Rack, Dumbbell), Traffic Pattern, Action Model, State Model, Reward Model, Data Collectors]
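Since topology, traffic pattern, and the state/action/reward models are all swappable, an experiment amounts to composing a configuration. Here is a hedged sketch assuming a dict-based config; the keys, values, and the validate helper are hypothetical, not Iroko's real schema:

```python
# Illustrative composition of the components from the diagram; these keys
# and values are hypothetical, not Iroko's real configuration API.
dc_config = {
    "topology": "dumbbell",                # also: "fat_tree", "rack"
    "traffic_pattern": "all_to_all",       # traffic generator preset
    "state_model": ["backlog", "drops"],   # features the collectors report
    "action_model": "host_rates",          # what the agent controls
    "reward_model": "bw_minus_queue",      # scalar feedback definition
}

SUPPORTED_TOPOLOGIES = {"fat_tree", "rack", "dumbbell"}

def validate(config):
    """Basic sanity check before handing a config to an environment."""
    if config["topology"] not in SUPPORTED_TOPOLOGIES:
        raise ValueError("unknown topology: %s" % config["topology"])
    return config
```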

SLIDE 18

  • An ideal data center has low latency, high utilization, fairness, and no packet loss or queuing delay
  • Congestion control variants derive from reactive TCP: queueing latency dominates, frequent retransmits reduce goodput, and data center performance may be unstable

Use Case: Congestion Control
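These desiderata can be folded into a scalar reward that trades utilization against queuing. A toy version of such a reward follows; the weights and the functional form are illustrative choices, not the exact function Iroko uses:

```python
def congestion_reward(rates, queues, capacity):
    """Toy congestion-control reward: reward utilization, penalize queues.

    rates: per-host sending rates; queues: per-port backlog;
    capacity: bottleneck bandwidth. The 2.0 weight is an arbitrary choice.
    """
    utilization = sum(rates) / capacity
    queue_penalty = sum(q / capacity for q in queues)
    return utilization - 2.0 * queue_penalty
```

Full utilization with empty queues scores 1.0; any standing backlog pulls the reward down, which is the trade-off the slide describes.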

SLIDE 19

[Diagram: a Switch connects hosts driven by a Flow Pattern; Data Collection feeds a Policy that produces a Bandwidth Allocation; all links have capacity 10]

Predicting Networking Traffic


SLIDE 22

[Diagram: the Policy limits the three competing flows to 3.3, 3.3, and 3.4 of the bottleneck capacity of 10]

Predicting Networking Traffic
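The allocation shown above amounts to splitting the bottleneck capacity of 10 equally among three flows, with rounding that preserves the total. A small sketch of that arithmetic (purely illustrative):

```python
def equal_shares(capacity, n_flows, precision=1):
    """Split capacity equally, rounding so the shares still sum to capacity.

    The last flow absorbs the rounding remainder, mirroring the
    3.3 / 3.3 / 3.4 split in the diagram.
    """
    share = round(capacity / n_flows, precision)
    shares = [share] * (n_flows - 1)
    shares.append(round(capacity - share * (n_flows - 1), precision))
    return shares

# e.g. equal_shares(10, 3) -> [3.3, 3.3, 3.4]
```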


SLIDE 24

  • Two environments:
  • env_iroko: a centralized rate-limiting arbiter; the agent sets the sending rate of hosts (trained with PPO, DDPG, REINFORCE)
  • env_tcp: raw TCP; contains implementations of TCP algorithms (TCP Cubic, TCP New Vegas, DCTCP)
  • Goal: avoid congestion

Can we learn to allocate traffic fairly?
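In env_iroko the agent's continuous actions become per-host rate limits. Below is a hedged sketch of how a centralized arbiter might map normalized actions in [0, 1] onto rates, with a floor so no host is starved entirely; the function name and the floor value are illustrative assumptions, not Iroko's actual arbiter:

```python
def actions_to_rate_limits(actions, link_capacity, min_rate=0.1):
    """Map normalized agent actions in [0, 1] to per-host rate limits.

    A floor of min_rate * link_capacity keeps every host able to send,
    so the agent cannot silence a flow completely. Illustrative logic only.
    """
    limits = []
    for a in actions:
        a = min(max(a, 0.0), 1.0)                 # clamp out-of-range actions
        frac = min_rate + (1.0 - min_rate) * a    # rescale into [min_rate, 1]
        limits.append(frac * link_capacity)
    return limits
```

The floor is one way to keep exploration from deadlocking traffic early in training; the actual trade-off belongs to the action-model design.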

SLIDE 25

  • 50000 timesteps
  • Linux default UDP as base transport
  • 5 runs (~7 hours per run)
  • Bottleneck at central link

Experiment Setup

SLIDE 26

Results – Dumbbell UDP

SLIDE 27

  • A challenging real-time environment: noisy observations and a strong credit assignment problem
  • RL algorithms show the expected behavior in our gym
  • DDPG and PPO achieve near-optimal performance, better than TCP New Vegas
  • REINFORCE fails to learn a good policy; more robust algorithms are required

Results - Takeaways

SLIDE 28

  • Data center reinforcement learning is gaining traction

…but it is difficult to prototype and evaluate

  • Iroko is:
  • a platform to experiment with RL for data centers
  • intended to train on live traffic
  • early-stage work, but experiments are promising
  • available on GitHub:

https://github.com/dcgym/iroko

Contributions