DRFQ: Multi-Resource Fair Queueing for Packet Processing


SLIDE 1

DRFQ: Multi-Resource Fair Queueing for Packet Processing

Ali Ghodsi1,3, Vyas Sekar2, Matei Zaharia1, Ion Stoica1

1UC Berkeley, 2Intel ISTC/Stony Brook, 3KTH

SLIDE 2

Increasing Network Complexity

  • Packet processing becoming ever more sophisticated

– Software Defined Networking (SDN)
– Middleboxes
– Software Routers (e.g. RouteBricks)
– Hardware Acceleration (e.g. SSLShader)

  • Data plane no longer merely forwarding

– WAN optimization
– Caching
– IDS
– VPN

SLIDE 3

Motivation

  • Flows increasingly have heterogeneous resource consumption

– Intrusion detection bottlenecks on the CPU
– Small packets bottleneck memory bandwidth
– Large packets that need no processing bottleneck link bandwidth

Scheduling based on a single resource is insufficient

SLIDE 4

Problem

How to schedule packets from different flows, when packets consume multiple resources?

SLIDE 5

Fair Queueing (FQ)

  • Packet scheduling well studied in FQ

– Goal to provide isolation and fairness to flows

  • Assumes single-resource consumption

– Packet size determines link bandwidth usage

How to generalize fair queueing to multiple resources?

SLIDE 6

Contribution


                         | Allocation in Space | Allocation in Time
Single-Resource Fairness | Max-Min Fairness    | Fair Queueing
Multi-Resource Fairness  | DRF                 | DRFQ

Generalize Virtual Time to Multiple Resources

SLIDE 7

Outline

  • Analysis of Natural Policies
  • DRF allocations in Space
  • DRFQ: DRF allocations in Time
  • Implementation/Evaluation

SLIDE 8

Desirable Multi-Resource Properties

  • Share guarantee:

– Each flow can get 1/n of at least one resource

  • Strategy-proofness:

– A flow shouldn’t be able to finish faster by increasing the resources required to process it.

SLIDE 9

Violation of Share Guarantee

  • Example using traditional FQ

– Two resources, CPU and NIC, used serially
– Two flows with profiles <2 μs, 1 μs> and <1 μs, 1 μs>
– FQ based on the NIC alternates one packet from each flow
– CPU bottlenecked due to higher aggregate demand

Share Guarantee Violated by Single Resource FQ

[Chart: Flow 1 gets 66% CPU and 33% NIC; Flow 2 gets 33% CPU and 33% NIC]
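The arithmetic behind this example can be checked with a short back-of-envelope script (a sketch we added; the flow names and per-packet times follow the example above):

```python
# (CPU us, NIC us) per packet; resources are used serially, CPU first.
flows = {"flow1": (2, 1), "flow2": (1, 1)}

# NIC-based FQ alternates one packet per flow, so one round carries one
# packet from each flow. The CPU needs 2 + 1 = 3 us per round and is never
# idle, so the round length is set by the CPU.
round_len = sum(cpu for cpu, nic in flows.values())   # 3 us

shares = {
    name: {"cpu": cpu / round_len, "nic": nic / round_len}
    for name, (cpu, nic) in flows.items()
}
# flow1: 2/3 CPU, 1/3 NIC; flow2: 1/3 CPU, 1/3 NIC. Flow 2 never reaches
# 1/2 of any resource, so the share guarantee is violated.
```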

SLIDE 10

Violation of Strategy-Proofness

  • Bottleneck fairness proposed in related work

– Determine which resource is bottlenecked
– Apply FQ to that resource

  • Example with Bottleneck Fairness

– 2 resources (CPU, NIC); 3 flows with demands <10,1>, <10,14>, <10,14>
– CPU bottlenecked and split equally
– Flow 1 changes its demand to <10,7>; NIC becomes bottlenecked and is split equally

Bottleneck Fairness Violates Strategy-Proofness

[Charts: with the CPU bottlenecked, each flow gets 33% of the CPU; after flow 1 inflates its demand, its share rises to 48%]

SLIDE 11

Is strategy-proofness important?

  • Lack of strategy-proofness encourages wastage

– Decreasing goodput of the system

  • Networking applications especially savvy

– Peer-to-peer apps manipulate to get more resources

  • Trivially guaranteed for single resource fairness

– But not for multi-resource fairness

SLIDE 12

Natural Policy

  • Per-Resource Fairness (PRF)

– Have a buffer between each resource
– Apply fair queueing to each resource

  • PRF abandoned in favor of DRFQ

– Not strategy-proof
– Requires per-resource buffers

SLIDE 13

Outline

  • Analysis of Natural Policies
  • DRF allocations in Space
  • DRFQ: DRF allocations in Time
  • Implementation/Evaluation

SLIDE 14

Dominant Resource Fairness

  • DRF originally in the cloud computing context

– Satisfies the share guarantee
– Satisfies strategy-proofness

SLIDE 15

DRF Allocations

  • The dominant resource of a user is the resource she is allocated most of

– Dominant share is the user's share of her dominant resource

  • DRF: apply max-min fairness to dominant shares

– "Equalize" the dominant shares of all users

Total resources: <16 CPUs, 16 GB mem>
User 1 demand: <3 CPU, 1 GB mem> (dominant resource: CPU)
User 2 demand: <1 CPU, 4 GB mem> (dominant resource: mem)


[Chart: User 1 gets 12 CPUs and 4 GB; User 2 gets 3 CPUs and 12 GB, equalizing their dominant shares]
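DRF's "equalize the dominant shares" rule can be sketched as progressive filling: repeatedly grant one task to the user with the smallest dominant share until the next task no longer fits. A minimal sketch of the slide's example (the helper name `dominant_share` is our own):

```python
from fractions import Fraction

capacity = {"cpu": 16, "mem": 16}
demands = {"user1": {"cpu": 3, "mem": 1}, "user2": {"cpu": 1, "mem": 4}}

alloc = {u: {r: 0 for r in capacity} for u in demands}
tasks = {u: 0 for u in demands}

def dominant_share(u):
    """A user's largest per-resource share: her share of her dominant resource."""
    return max(Fraction(alloc[u][r], capacity[r]) for r in capacity)

while True:
    u = min(demands, key=dominant_share)        # neediest user gets the next task
    used = {r: sum(alloc[v][r] for v in demands) for r in capacity}
    if any(used[r] + demands[u][r] > capacity[r] for r in capacity):
        break                                   # the next task no longer fits
    for r in capacity:
        alloc[u][r] += demands[u][r]
    tasks[u] += 1

# user1: 4 tasks -> <12 CPU, 4 GB>; user2: 3 tasks -> <3 CPU, 12 GB>;
# both end with equal dominant shares (12/16).
```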

SLIDE 16

Allocations in Space vs. Time

  • DRF provides allocations in space

– Given 1000 CPUs and 1 TB mem, how much to allocate to each user

  • DRFQ provides DRF allocations in time

– Multiplex packets to achieve DRF allocations over time

SLIDE 17

Outline

  • Analysis of Natural Policies
  • DRF allocations in Space
  • DRFQ: DRF allocations in Time
  • Implementation/Evaluation

SLIDE 18

Packet Resource Consumption

  • Link usage of packets trivial in FQ

– Packet size divided by throughput of link

  • Packet processing time a priori unknown with multiple resources

– Depends on the modules that process the packet

  • Leverage Start-time Fair Queueing (SFQ)

– Schedules based on the virtual start time of packets
– The start time of packet p is independent of p's resource consumption

SLIDE 19

Memoryless Requirement

  • Lesson from Virtual Clock

– Pioneered many concepts in FQ
– Simulated each flow being dedicated a predefined 1/n share

  • Problem

– During light load a flow might get more than 1/n
– A flow that received more than 1/n gets punished later

  • Requirement: memoryless scheduling

– A flow’s share of resources should be independent of its share in the past

SLIDE 20

Dove-tailing Requirement

  • Packet size should not affect the service received

– A flow with ten 1 kb packets gets the same service as one with five 2 kb packets

  • Use flow processing time, not packet processing time

– Example: give the same service to these flows:
Flow 1: p1 <1,2>, p2 <2,1>, p3 <1,2>, p4 <2,1>, …
Flow 2: p1 <3,3>, p2 <3,3>, p3 <3,3>, p4 <3,3>, …

  • Requirement: dove-tailing

– Packet processing times should be independent of how resource consumption is distributed in a flow

SLIDE 21

Tradeoff

  • Dovetailing and memoryless property at odds

– Dovetailing needs to remember past consumption

  • DRFQ developed in three steps

– Memoryless DRFQ: uses a single virtual time
– Dovetailing DRFQ: uses a virtual time per resource
– Δ-Bounded DRFQ: generalizes both

SLIDE 22

Memoryless DRFQ

  • Simulate a bit-by-bit DRFQ service

– Attach a virtual start and finish time to every packet

  • Computing virtual finish time

1. finish time = start time + packet-max-processing-time

  • Computing virtual start time

2. The start time of the first packet in a burst equals the start time of the packet currently serviced (zero if none)
3. For a backlogged flow, the start time of a packet equals the finish time of the previous packet

  • Service the packet with minimum virtual start time

SLIDE 23

Memoryless DRFQ example

  • Two flows become backlogged at time 0

– Flow 1 alternates <1,2> and <2,1> packet processing times
– Flow 2 uses <3,3> packet processing times

1. finish time = start time + packet-max-processing-time
2. Start time of the first packet in a burst equals the start time of the packet currently serviced (zero if none)
3. For backlogged flows, the start time is the finish time of the previous packet


Flow 1: P1 S:0 F:2 | P2 S:2 F:4 | P3 S:4 F:6 | P4 S:6 F:8 | P5 S:8 F:10
Flow 2: P1 S:0 F:3 | P2 S:3 F:6 | P3 S:6 F:9

Flow 1 gets worse service than Flow 2
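The tagging rules above can be replayed in a few lines (a sketch; `tag_flow` is our own helper, and the per-flow tags ignore cross-flow scheduling, which simply serves the packet with the minimum start tag):

```python
def tag_flow(packet_times):
    """(start, finish) virtual tags for one continuously backlogged flow.

    packet_times: list of (cpu, nic) processing times per packet."""
    tags, finish = [], 0
    for times in packet_times:
        start = finish                   # rule 3: backlogged, start = prev finish
        finish = start + max(times)      # rule 1: finish = start + max resource time
        tags.append((start, finish))
    return tags

flow1 = tag_flow([(1, 2), (2, 1), (1, 2), (2, 1), (1, 2)])
flow2 = tag_flow([(3, 3), (3, 3), (3, 3)])
# flow1 tags: (0,2) (2,4) (4,6) (6,8) (8,10); flow2: (0,3) (3,6) (6,9).
# Flow 1 is charged 2 units per packet even though it only needs <1,2> or
# <2,1>, so it falls behind Flow 2's <3,3> packets despite doing equal work.
```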

SLIDE 24

Dovetailing DRFQ

  • Keep track of start and finish time per resource

– Dovetail by keeping track of all resource usage
– For each packet, use the maximum start time

SLIDE 25

Dovetailing DRFQ example

  • Two flows become backlogged at time 0

– Flow 1 alternates <1,2> and <2,1> packet processing times
– Flow 2 uses <3,3> per packet

Flow 1: P1 S1:0 F1:1, S2:0 F2:2 | P2 S1:1 F1:3, S2:2 F2:3 | P3 S1:3 F1:4, S2:3 F2:5 | P4 S1:4 F1:6, S2:5 F2:6 | P5 S1:6 F1:7, S2:6 F2:8
Flow 2: P1 S1:0 F1:3, S2:0 F2:3 | P2 S1:3 F1:6, S2:3 F2:6 | P3 S1:6 F1:9, S2:6 F2:9

Dovetailing ensures both flows get the same service
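The per-resource tagging can be sketched the same way (our own helper `dovetail_tags`; the scheduler would then serve the packet whose largest per-resource start tag is smallest):

```python
def dovetail_tags(packets):
    """Per-resource (start, finish) tags for one backlogged flow.

    packets: list of (cpu_time, nic_time) per packet.
    Returns one ((S1, F1), (S2, F2)) tuple per packet."""
    finish = [0, 0]                      # previous finish tag per resource
    tags = []
    for packet in packets:
        per_resource = []
        for k, cost in enumerate(packet):
            start = finish[k]            # backlogged: start = previous finish on k
            finish[k] = start + cost
            per_resource.append((start, finish[k]))
        tags.append(tuple(per_resource))
    return tags

flow1 = dovetail_tags([(1, 2), (2, 1), (1, 2), (2, 1), (1, 2)])
flow2 = dovetail_tags([(3, 3), (3, 3), (3, 3)])
# Flow 1's <1,2>/<2,1> pairs dovetail into the same per-resource finish tags
# as Flow 2's <3,3> packets every 3 time units, so both get equal service.
```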

SLIDE 26

DRFQ algorithm

  • DRFQ bounds dovetailing to Δ processing time

– Dovetail up to Δ processing-time units
– Memoryless beyond Δ

  • DRFQ is a generalization

– When Δ=0, DRFQ = memoryless DRFQ
– When Δ=∞, DRFQ = dovetailing DRFQ

  • Set Δ to a few packets' worth of processing

SLIDE 27

Outline

  • Analysis of Natural Policies
  • DRF allocations in Space
  • DRFQ: DRF allocations in Time
  • Implementation/Evaluation

SLIDE 28

Isolation Experiment

  • DRFQ Implementation in Click

– 2 elephant flows: 40K packets/sec basic forwarding, 40K packets/sec IPSec
– 2 mice flows: 1 packet/sec basic, 0.5 packets/sec basic

Non-backlogged flows are isolated from backlogged flows

SLIDE 29

Simulating Bottleneck Fairness

  • 2 flows and 2 res. <CPU, NIC>

– Demands <1,6> and <7,1> → bottleneck unclear

  • Especially bad for TCP and video/audio traffic

SLIDE 30

Summary

  • Packet processing becoming ever more sophisticated

– Consume multiple resources

  • Natural policies not suitable

– Per-Resource Fairness (PRF) not strategy-proof
– Bottleneck Fairness doesn't provide isolation

  • Proposed Dominant Resource Fair Queueing (DRFQ)

– Generalization of FQ to multiple resources
– Generalizes virtual time to multiple resources
– Provides a tradeoff between memoryless and dovetailing
– Provides the share guarantee (isolation) and strategy-proofness

SLIDE 33

Measuring Processing Times

  • Challenging problem

– Real-time measurement with CPU counters is expensive

  • We use an offline linear estimation of the processing time of each module

SLIDE 34

Simulating Bottleneck Fairness

  • 2 flows and 2 res. <CPU, NIC>

– Demands <1,6> and <7,1> → bottleneck unclear
– CPU bottleneck: 7×<1,6> + <7,1> = <14,43>
– NIC bottleneck: <1,6> + 6×<7,1> = <43,12>
– The bottleneck periodically oscillates
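The oscillation arithmetic can be verified directly (a sketch with our own helper names):

```python
# Demands per packet as <CPU, NIC> processing-time units (from the slide).
f1, f2 = (1, 6), (7, 1)

def aggregate(n1, n2):
    """Total <CPU, NIC> demand when flow 1 sends n1 packets per n2 of flow 2."""
    return (n1 * f1[0] + n2 * f2[0], n1 * f1[1] + n2 * f2[1])

# Fair-queueing the CPU equalizes CPU usage: 7 flow-1 packets (7 CPU units)
# per flow-2 packet (7 CPU units).
cpu_fair = aggregate(7, 1)   # (14, 43): NIC demand now dominates
# Fair-queueing the NIC equalizes NIC usage: 1 flow-1 packet (6 NIC units)
# per 6 flow-2 packets (6 NIC units).
nic_fair = aggregate(1, 6)   # (43, 12): CPU demand now dominates
# Whichever resource is scheduled fairly, the other becomes the bottleneck,
# so bottleneck fairness keeps flipping between the two.
```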

SLIDE 35

TCP and oscillations

  • Implemented Bottleneck Fairness in Click

– 20 ms artificial link delay added to simulate a WAN
– Bottleneck determined every 300 ms
– 1 bandwidth-bound flow and 1 CPU-bound flow

Oscillations in the bottleneck degrade TCP performance

SLIDE 36

Memoryless Requirement

  • VirtualClock tags packets with virtual clock

– The tag is the time the packet would finish if the flow were always dedicated a predefined, fixed 1/n of the link
– Schedule packets in order of finish time

  • Problem: router may be faster than virtual clock

– e.g. a 10 kbps router configured to serve 10 flows at 1 kbps each
– Flow A with 1 kbit packets gets tags 1, 2, 3, 4, 5, …
– But if only A is active, the router is done with packet 100 at time 10
– A's next packet has virtual clock tag 101
– New flow B becomes active at time 10, with tags 11, 12, 13, …
– Flow A is punished until flow B catches up to tag 100
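The punishment scenario can be replayed numerically (a sketch; `vc_tags` is our own helper and assumes each 1 kbit packet advances a flow's virtual clock by one second at its reserved 1 kbps rate):

```python
def vc_tags(first_arrival, count, prev_tag=0.0):
    """VirtualClock finish tags for `count` back-to-back 1-kbit packets.

    At the reserved 1 kbps rate each packet takes 1 s, so each tag is the
    previous tag (or the flow's arrival time, if later) plus 1."""
    tags, clock = [], max(prev_tag, first_arrival)
    for _ in range(count):
        clock += 1.0
        tags.append(clock)
    return tags

a_tags = vc_tags(0.0, 101)   # flow A: tags 1, 2, ..., 101
b_tags = vc_tags(10.0, 90)   # flow B arrives at t=10: tags 11, 12, ..., 100

# The 10 kbps link alone serves A's first 100 packets by real time 10.
# Every queued B tag (11..100) is smaller than A's next tag (101), so A is
# locked out for 90 straight packets: the scheduler remembers A's past luck.
b_served_before_a = sum(1 for t in b_tags if t < a_tags[100])
```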

  • Requirement: memoryless scheduling

– A flow's share of resources should be independent of its share in the past

SLIDE 37

Start-time Fair Queueing

  • Based on the concept of virtual time V(t)

– The amount of service a flow receives during one real-time unit depends on the number of backlogged flows
– The amount of service a flow receives during one virtual-time unit is invariant
– V(t) progresses faster when fewer flows are active, and vice versa

  • Attach a virtual start and finish time to every packet

– The finish time of a packet is its start time plus the packet length
– For a backlogged flow, the start time of a packet equals the finish time of the previous packet
– The start time of the first packet in a burst equals the start time of the packet currently in service

SLIDE 38

Dovetailing DRFQ

  • Keep track of start and finish time per resource

– S(p, k) is the start time of packet p on resource k
– F(p, k) is the finish time of packet p on resource k

  • F(p_i^j, k) = S(p_i^j, k) + L(p_i^j, k)

  • S(p_i^j, k) = max( V(t, k), F(p_i^{j-1}, k) )

  • F(p_i^j) = max_k { F(p_i^j, k) }

  • S(p_i^j) = max_k { S(p_i^j, k) }

  • V(t, k) is the start time of the packet currently being processed at resource k

(p_i^j denotes the j-th packet of flow i; L is its processing time.)
