Faria Kalim, Le Xu, Sharanya Bathey, Richa Meherwal, Indranil Gupta - PowerPoint PPT Presentation

Henge: Intent-Driven   Multi-Tenant Stream Processing Faria Kalim, Le Xu, Sharanya Bathey, Richa Meherwal, Indranil Gupta Distributed Protocols Research Group Department of Computer Science University of Illinois at Urbana Champaign 1 http://dprg.cs.uiuc.edu/

Henge allows stream processing jobs to satisfy user-specified performance requirements while reducing costs by performing online resource reconfigurations in a multi-tenant environment. 2

A Typical Deployment Job 1 Job 2 Job 3 Job 4 Per-job clusters � overprovisioning 3

A Typical Deployment Low level metrics e.g., queue sizes, CPU load as performance indicators Job 1 Job 2 Job 3 Job 4 4

A Typical Deployment Low level metrics e.g., queue sizes, CPU load as performance indicators Job 1 Job 2 Job 3 Job 4 Manual scaling 4

Intent-Driven Multi-Tenancy 5

Intent-Driven Multi-Tenancy Efficient resource usage across multiple users ➔ Multi-tenancy ... 5

Intent-Driven Multi-Tenancy Efficient resource usage across multiple users ➔ Multi-tenancy Application-aware adaptation to user requirements ➔ Intent-driven Multi-tenancy 6

Intent-Driven Multi-Tenancy CPU Load, Queue Sizes … Efficient resource usage across multiple users ➔ Multi-tenancy Application-aware adaptation to user requirements ➔ Intent-driven Multi-tenancy Job Description Service Level ... Objective (SLO) 1 Finding ride price Latency < 5 s 2 Analyzing Throughput > earnings over 10K/hr. 6 time

Problem How can we achieve user-facing service level objectives for stream processing jobs on multi-tenant clusters? 7

Problem How can we achieve user-facing service level objectives for stream processing jobs on multi-tenant clusters? Latency, Throughput 7

Absolute Throughput SLOs are not Useful Rate (Tuples/s) Day 1 Day 2 Day 1 Day 2 Workload Variability 8

Absolute Throughput SLOs are not Useful Rate (Tuples/s) SLO? Day 1 Day 2 Day 1 Day 2 Workload Variability 8

Absolute Throughput SLOs are not Useful Rate (Tuples/s) SLO? Day 1 Day 2 Day 1 Day 2 Input Workload Variability Output 8

Absolute Throughput SLOs are not Useful Job Operations … Filter … 9

Absolute Throughput SLOs are not Useful Job Operations … Filter … Juice : fraction* of the input data processed by the job per unit time. 9

Jobs benefit even below SLO threshold Job Utility Functions 10

Jobs benefit even below SLO threshold Job Utility Functions Expected Utility Latency SLO Threshold Current Utility Utility function for a single job 10

Jobs benefit even below SLO threshold Job Utility Functions Expected Utility Latency Henge’s goal � Maximize the total utility of the SLO cluster Threshold Current Utility Utility function for a single job 10

Background: Stream Processing Topologies (Jobs) Spou t Splitte r Operators Coun t Logical DAG for a Word Count Job 11

Bolt Sink Spout Spout Bolt Sink Spout Bolt Sink Spout Sink Bolt Star Topology Diamond Topology 12

Background: Stream Processing Jobs [“So”] [“it”] [“So it goes…”] [“So”] [“goes”] Coun … t [“goes”] Spou Coun Splitter t t Coun Spou Splitte t Coun t r t [“it”] Executors (Threads) 13

Background: Stream Processing Jobs [“So”] [“it”] [“So it goes…”] [“So”] [“goes”] Coun … t [“goes”] Spou Coun Splitter t t Coun Spou Splitte t Coun t r t [“it”] Parallelism � 2 Executors (Threads) 13

Background: A Physical Deployment 14

Background: A Physical Deployment Spout Splitter Count Count Workers 14

Henge’s Cluster-Wide State Machine Not Converged Converged Total System Utility < Total Expected Utility 15

Henge’s Cluster-Wide State Machine Reversion or Reconfiguration Reconfiguration Not Converged Converged Reduction Total System Utility < Total Expected Utility 15

Reconfiguration De-congest operator by increasing parallelism level of executors 1) Reconfiguration 2) Reconfiguration Not Converged Converged 16

Reconfiguration De-congest operator by increasing parallelism level of executors 3) Black-list topologies that show less than Δ % improvement 1) Reconfiguration 2) Reconfiguration Not Converged Converged 16

Bottlenecks Spout Splitter Count Count Workers 17

Reconfiguration Bottlenecks Spout Splitter Reconfigs. Splitter Splitter Count Count Workers 18

Bottlenecks Spout Splitter Reconfigs. Splitter Splitter High Count Count Load Workers 19

Bottlenecks SLO-Satisfying Job Spout Splitter Reconfigs. Splitter Splitter High Count Count Load Workers 19

Bottlenecks Reconfigs. High Load Reduction 20

Reduction Bottlenecks Reconfigs. High Load Reduction 20

Reduction Reconfigurations � drop in utility Reduction Not Converged 21

Reduction Reconfigurations � drop in utility If high CPU load on majority of machines, reduce parallelism for operators that a) are in topologies that satisfy their SLO b) are not congested Reduction Not Converged 21

Reversion Reconfigurations � drop in utility and reduction is not possible Revert to a past configuration that provided best utility Reversion Not Converged Converged 22

Evaluation Real-world workloads: Yahoo! Twitter Web log traces Experimental Setup: 10-40 node Emulab cluster 23

Reducing cost and achieving high utilities 24

Reducing cost and achieving high utilities 93.5% utility at 40% resources 24

Reducing cost and achieving high utilities 93.5% utility at 40% resources 100% utility at 60% resources 24

Adapting to a Diurnal Pattern 25

Day 1 Day 2 25

Day 1 Day 2 Max. Utility Reconfigurations Day 1 Day 2 25

Fewer reconfigurations are required once a job has Day 1 Day 2 adjusted to max load Max. Utility Reconfigurations Day 1 Day 2 25

Can Henge do better than manual configuration? 26

Can Henge do better than manual configuration? Henge does better in the 15th to 45th percentile, and is comparable later. 26

Scaling Cluster Size 27

Scaling Cluster Size Limited resources entail more reconfigurations to reach max. utility 27

More Results Henge can: handle dynamic workloads abrupt e.g., spikes & natural fluctuations gradual e.g., diurnal patterns satisfy hybrid SLOs scale with number of jobs & cluster size gracefully handle failures 28

Summary • Henge allows users to specify performance intents for their jobs • Henge’s goal is to maximize cluster-wide utility • The scheduler performs fine-grained reconfigurations to allow stream processing jobs to meet user-specified intents 29

Faria Kalim, Le Xu, Sharanya Bathey, Richa Meherwal, Indranil Gupta - PowerPoint PPT Presentation

Henge: Intent-Driven Multi-Tenant Stream Processing Faria Kalim, Le Xu, Sharanya Bathey, Richa Meherwal, Indranil Gupta Distributed Protocols Research Group Department of Computer Science University of Illinois at Urbana Champaign 1

Kaizen: Building a Performant Blockchain System Verified for Consensus and Integrity Faria Kalim

vicente faria taste the moment www.vicentefaria.com VICENTE FARIA VINHOS | OUR STORY PAGE |

Rule ( ) entails fatalism Domingos Faria LanCog | FLUL June 28, 2019 Domingos Faria (LanCog

LIVING TRADITIONS: WOMENS NARRATIVES OF THE COMMONS AS PROCESS. Soma KP and Richa Audichaya, India

Music Portal for Last.fm and Myspace ontology (Sharanya Chinnusamy ) B636 Final Project - What

Malloci Approachable Virtual Reality Content Creation Michael Sharanya Masha Belyi Jennifer

Lecture 1 Getting Started with Python INSTRUCTOR Sharanya Jayaraman PhD Candidate in

Sharing Lessons on Mangrove Restoration- Gujarat, India By C. N. Pandey & Richa Pandey

AGA Academy of Educators Award Presentation Richa Shukla, MD Assistant Professor Baylor College

Number Systems Prof. Indranil Sen Gupta Dept. of Computer Science & Engg. Indian Institute

Search for rare shape-phase transitions in hot rotating heavy nuclei Indranil Mazumdar Tata

Competitive and Fair Medium Access despite Reactive Jamming ICDCS 2011 Andrea Richa (ASU)

Electronic Mail Prof. Indranil Sen Gupta Professor, Dept. of Computer Science & Engineering

Landslide hazard zonation mapping and debris flow modelling in Nainital, Uttarakhand Richa Vaid 1 ,

Programming and Data Structure Indranil Sen Gupta Department of Computer Science & Engg.

Multithreading Indranil Sen Gupta (odd section) and Mainack Mondal (even section) CS39002

Fitness Wearables The Internet of Things (IOT) Employer Wellness Programs During the most

Moral Permissibility of Action Plans Felix Lindner Robert Mattmller Bernhard Nebel June 25,

AI ethics Tae Wan Kim Associate Professor of Business Ethics Tepper School of Business 1.

Unequal challenges ahead Martin Ravallion Georgetown University Unequal challenges Two

A bisection method for generating random utility functions in SMAA Rudolf Vetschera 1 Luis Dias 2

CS490W Federated Text Search Luo Si Department of Computer Science Purdue University Abstract

A new functional analytic approach to robust utility maximization in the dominated case Julio

Differentiability of reflected BSDEs with quadratic growth joint work with S. Ankirchner and P.

Sambuz

Useful Links

Newsletter

Mail Us

Faria Kalim, Le Xu, Sharanya Bathey, Richa Meherwal, Indranil Gupta - PowerPoint PPT Presentation

Henge: Intent-Driven Multi-Tenant Stream Processing Faria Kalim, Le Xu, Sharanya Bathey, Richa Meherwal, Indranil Gupta Distributed Protocols Research Group Department of Computer Science University of Illinois at Urbana Champaign 1

Kaizen: Building a Performant Blockchain System Verified for Consensus and Integrity Faria Kalim

vicente faria taste the moment www.vicentefaria.com VICENTE FARIA VINHOS | OUR STORY PAGE |

Rule ( ) entails fatalism Domingos Faria LanCog | FLUL June 28, 2019 Domingos Faria (LanCog

LIVING TRADITIONS: WOMENS NARRATIVES OF THE COMMONS AS PROCESS. Soma KP and Richa Audichaya, India

Music Portal for Last.fm and Myspace ontology (Sharanya Chinnusamy ) B636 Final Project - What

Malloci Approachable Virtual Reality Content Creation Michael Sharanya Masha Belyi Jennifer

Lecture 1 Getting Started with Python INSTRUCTOR Sharanya Jayaraman PhD Candidate in

Sharing Lessons on Mangrove Restoration- Gujarat, India By C. N. Pandey &amp; Richa Pandey

AGA Academy of Educators Award Presentation Richa Shukla, MD Assistant Professor Baylor College

Number Systems Prof. Indranil Sen Gupta Dept. of Computer Science &amp; Engg. Indian Institute

Search for rare shape-phase transitions in hot rotating heavy nuclei Indranil Mazumdar Tata

Competitive and Fair Medium Access despite Reactive Jamming ICDCS 2011 Andrea Richa (ASU)

Electronic Mail Prof. Indranil Sen Gupta Professor, Dept. of Computer Science &amp; Engineering

Landslide hazard zonation mapping and debris flow modelling in Nainital, Uttarakhand Richa Vaid 1 ,

Programming and Data Structure Indranil Sen Gupta Department of Computer Science &amp; Engg.

Multithreading Indranil Sen Gupta (odd section) and Mainack Mondal (even section) CS39002

Fitness Wearables The Internet of Things (IOT) Employer Wellness Programs During the most

Moral Permissibility of Action Plans Felix Lindner Robert Mattmller Bernhard Nebel June 25,

AI ethics Tae Wan Kim Associate Professor of Business Ethics Tepper School of Business 1.

Unequal challenges ahead Martin Ravallion Georgetown University Unequal challenges Two

A bisection method for generating random utility functions in SMAA Rudolf Vetschera 1 Luis Dias 2

CS490W Federated Text Search Luo Si Department of Computer Science Purdue University Abstract

A new functional analytic approach to robust utility maximization in the dominated case Julio

Differentiability of reflected BSDEs with quadratic growth joint work with S. Ankirchner and P.

Sambuz

Useful Links

Newsletter

Mail Us

Sharing Lessons on Mangrove Restoration- Gujarat, India By C. N. Pandey & Richa Pandey

Number Systems Prof. Indranil Sen Gupta Dept. of Computer Science & Engg. Indian Institute

Electronic Mail Prof. Indranil Sen Gupta Professor, Dept. of Computer Science & Engineering

Programming and Data Structure Indranil Sen Gupta Department of Computer Science & Engg.