Fast, Scalable, and Programmable Packet Scheduler in Hardware - PowerPoint PPT Presentation

Fast, Scalable, and Programmable Packet Scheduler in Hardware Vishal Shrivastav Cornell University

Packet Scheduling 101 express enforce at runtime Packet Scheduler * focus of this work * fairness / rate-limit / To pacing etc.. Wire output interface Scheduling Algorithm Packet Queues specifies when and what order to schedule packets onto the wire

Desirable Properties of a Packet Scheduler Programmability Scalability Express wide-range of Scale to 10s of thousands of flows packet scheduling algorithms [SENIC - NSDI’14] [Carousel - SIGCOMM’17] New transport protocols Time budget e.g., Fastpass, Ethernet TDMA Link for scheduling speed Circuit-Switched network designs decisions Performance e.g., Shoal, RotorNet Transmit packets at precise times e.g., 120 ns for MTU pkt @ 100Gbps e.g., at ns-precision in Shoal Make scheduling decisions within deterministic 10s of nanoseconds

Desirable Properties of a Packet Scheduler Trade-o ff Programmability Scalability Express wide-range of Scale to 10s of thousands of flows packet scheduling algorithms [SENIC - NSID’14] [Carousel - SIGCOMM’17] Trade-o ff ff o - e d a r T New transport protocols Time budget e.g., Fastpass, QJump, Ethernet TDMA Link for scheduling speed Circuit-Switched designs decisions Performance e.g., Shoal, Rotornet Transmit packets at precise times e.g., 120 ns for MTU pkt @ 100Gbps Challenging to achieve all three properties e.g. at ns-precision in Shoal (programmability, scalability, and performance) Make scheduling decisions in deterministic O(1) time, within 10s of nanoseconds simultaneously

State-of-the-art Packet Schedulers Performance Programmability Scalability express wide range of 10s of thousands decisions within deterministic scheduling algorithms of flows 10s of nanoseconds Software via specialization Performance Generality Hardware 1. FIFO 2. PIFO, UPS * (priority queue * has some limitations abstraction)

Can we design a packet scheduler that is simultaneously programmable, scalable, and high performance? We present PIEO (Push-In-Extract-Out) scheduler in hardware Abstraction Programmable more expressive than any state-of-the-art hardware packet scheduler Hardware Design Scalable easily scales to 10s of thousands of flows High Performance makes scheduling decisions in O(1) time [4 clock cycles]

Can we design a packet scheduler that is simultaneously programmable, scalable, and high performance? We present PIEO (Push-In-Extract-Out) scheduler in hardware Abstraction Programmable more expressive than any state-of-the-art hardware packet scheduler Scalable easily scales to 10s of thousands of flows High Performance makes scheduling decisions in O(1) time [4 clock cycles]

PIEO Scheduling Abstraction Scheduling Algorithms what order to schedule when an element becomes amongst eligible elements? eligible for scheduling? encode using a � value encode using a � value rank t eligible whenever the link is idle: among all elements satisfying the eligibility predicate � : t current ≥ t eligible schedule the smallest ranked element PIEO Abstraction — “ schedule the smallest ranked eligible element” strictly more expressive than a priority queue abstraction, e.g., PIFO, UPS

Push-In-Extract-Out Primitive element programmed ordered list based on 10 12 13 16 19 21 22 rank the choice of increasing rank value 16 9 4 13 6 2 15 t eligible scheduling algorithm 18 18 10 12 13 16 19 21 22 enqueue( ) “Push-In” 1 1 16 9 4 13 6 2 15 inserts element at position dictated by its rank value filter : � t current ≥ t eligible � t current = 7 dequeue( ) “Extract-Out” 13 10 12 13 16 19 21 22 4 16 9 4 13 6 2 15 returns “smallest ranked eligible” element 10 12 13 16 19 21 22 19 dequeue( ) 16 9 4 13 6 2 15 6 returns a specific element

Expressiveness of PIEO for each element: ๏ Work conserving calculate start_time and finish_time at time x, all elements s.t. virtual_time(x) >= start_time : • e.g., DRR, WFQ, � WF 2 Q schedule element with smallest finish_time ๏ Non-work conserving programming PIEO • e.g., Token Bucket, RCSP � rank = finish _ time ๏ Hierarchical scheduling � t eligible = start _ time • e.g., HPFQ Predicate for filtering at dequeue at time x: ๏ Asynchronous scheduling � ( virtual _ time ( x ) ≥ t eligible ) • e.g., Starvation avoidance, � D 3 ๏ Priority scheduling APP APP APP • e.g., SJF , SRTF , LSTF , EDF Rate Rate Rate limit limit limit ๏ Complex scheduling policies e.g. • mixture of shaping and ordering Priority — can not express accurately using PIFO

Can we design a packet scheduler that is simultaneously programmable, scalable, and high performance? We present PIEO (Push-In-Extract-Out) scheduler in hardware Programmable more expressive than any state-of-the-art hardware packet scheduler Hardware Design Scalable easily scales to 10s of thousands of flows High Performance makes scheduling decisions in O(1) time [4 clock cycles]

Hardware Design PIEO primitive relies on an ordered list datastructure Performance Scalability PIFO SRAM flip-flops O(N) comparators) (flip-flops & Hardware Resource > > > > > Array or Linked-list in memory O(1) Time Complexity O(1) O(N)

Is it fundamentally necessary to access and compare � elements in parallel to maintain O ( N ) an (exact) ordered list (of size N) in � time? O (1) We present a design that can maintain an (exact) ordered list in � time, but only needs to access O (1) and compare � elements in parallel. O ( N ) Key Insight “All problems in computer science can be solved by another level of indirection” —David Wheeler

Hardware Architecture 2 N 2 N SRAM N flip-flops N Points to sublists N sublist pointers ordered by increasing smallest rank value each sublist ordered by increasing rank within each sublist Q: How to zoom into the correct sublist in O(1) time? Q: How to read/update/write an entire sublist in O(1) time? Q: How to filter + extract-min in O(1) time? Q: What to do when enqueue into a full sublist? Detailed answers in the paper !!! enqueue(f), dequeue( ), dequeue(f) each execute in exactly 4 clock cycles ….. at the cost of 2 X memory overhead

Implementation • Implemented PIEO scheduler on a Stratix V FPGA - 234K logic modules (ALMs) - 6.5MB SRAM - 40Gbps interface bandwidth • ~1300 LOCs in System Verilog

Evaluation total SRAM = 6.5MB total SRAM = 6.5MB >30x 16 bit � and � fields rank t eligible 16 bit � and � fields rank t eligible 4 cycles per primitive op, i.e., 50ns @ 80MHz 4 cycles per primitive op, i.e., 50ns @ 80MHz - pipelining - pipelining - ASIC target, e.g., 4ns @ 1GHz - ASIC target, e.g., 4ns @ 1GHz But not as fast as PIFO — 1 cycle per primitive op But not as fast as PIFO — 1 cycle per primitive op

Beyond Packet Scheduling • PIEO as an O(1)-time generic Priority Queue • PIEO as an Abstract Dictionary Data Type • act as a (key, value) store, indexed by keys • search , insert , delete , and update in O(1) time • e ffi ciently do complex ops like range filtering over keys • ….. while also being reasonably scalable PIEO as a key basic building block in the era of hardware-accelerated computing

Conclusion Programmability PIFO Software (Hardware) PIEO (Hardware) High Scalability Performance FIFO (Hardware)

Conclusion Two Key Contributions: Programmability • A new programmable abstraction and primitive for packet scheduling - more expressive than any state-of-the-art hardware packet scheduler • A fast and scalable hardware design of the scheduler PIFO - makes scheduling decisions in 4 clock cycles Software - easily scales to 10s of thousands of flows (Hardware) PIEO High (Hardware) Scalability Performance FIFO (Hardware)

Thank you! FPGA code for the implementation of PIEO scheduler is available at: https://github.com/vishal1303/PIEO-Scheduler Email: vishal@cs.cornell.edu Webpage: http://www.cs.cornell.edu/~vishal/

Fast, Scalable, and Programmable Packet Scheduler in Hardware - PowerPoint PPT Presentation

Fast, Scalable, and Programmable Packet Scheduler in Hardware Vishal Shrivastav Cornell University Packet Scheduling 101 express enforce at runtime Packet Scheduler * focus of this work * fairness / rate-limit / To pacing etc.. Wire

Worm Detection ICMP Packet Analysis Ankur Agiwal 1 2 Packet Content Matching Packet

ROMs, PLAs and FPGAs October 5, 2006 Typeset by Foil T EX Why Programmable Logic?

Design and Implemention of a Plugin Scheduler for DIET March 11, 2005 Design and Implemention of

Preempting Scheduler Activations Scheduler activations are completely preemptable Deadlocks

WORK STEALING SCHEDULER 2 6/16/2010 Work Stealing Scheduler

Introduction to Packet Tracer What is Packet Tracer? Packet Tracer is a protocol simulator

Chapter 7 Packet-Switching Networks Routing in Packet Networks Shortest Path Routing Chapter 7

PROGRAMMABLE LOGIC CONTROLLER Control Systems Types Programmable Logic Controllers

Field Programmable Gate Arrays by Ketil Red Field Programmable Gate Array Integrated

Lab 1: Packet Sniffing and Wireshark Fengwei Zhang SUSTech CS 315 Computer Security 1 Packet

Packet Radio Lee Maddox, N4HOK What is Packet Radio? Packet radio is the connection of a computer

Programmable Switch Hardware ECE/CS598HPN Radhika Mittal Conventional SDN Programmable

Fast and Scalable Relational Division on Fast and Scalable Relational Division on Database

Fast Scalable Parallel Comparison Sort Fast, Scalable Parallel Comparison Sort On Hybrid Multicore

Cache Coherence in Scalable Machines Scalable Cache Coherent Systems Scalable, distributed

GNU Radio Advanced Scheduler Dude: Josh Blum - New scheduler features and stuff GRAS - Project

The Art, Science and Algorithms from each point of an object ! of Photography ! no distortion.

Argument-Level Interactions for Persuasion Comments Evaluation using Co-attention Lu Ji , Zhongyu

Robust Topology Control for Indoor Wireless Sensor Networks

Fundamental Propertjes of the GraphQL Language Olaf Hartjg @olafiartjg Joint work with Jorge

Numerical Error Analysis for Statistical N i l E A l i f St ti ti l www.simtec Software

Development of a Polarized 3 He Ion Source for RHIC/EIC

Honey, I Shrunk the Cube Matteo Golfarelli Stefano Rizzi University of Bologna - Italy Summary

objective caml Daniel Jackson MIT Lab for Computer Science 6898: Advanced Topics in Software