DRIZZLE: FAST AND ADAPTABLE STREAM PROCESSING AT SCALE
Shivaram Venkataraman, Aurojit Panda, Kay Ousterhout, Michael Armbrust, Ali Ghodsi, Michael Franklin, Benjamin Recht, Ion Stoica
STREAMING WORKLOADS
Streaming trend: low latency
Results power decisions made by machines
Credit card fraud -> disable the account
Suspicious user logins -> ask security questions
Slow video load -> direct the user to a new CDN
These systems disable stolen accounts, detect suspicious logins, and dynamically adjust application behavior
Streaming Requirements: High throughput
As many as tens of millions of updates per second, which calls for a distributed system
Distributed Execution Models
Execution models: CONTINUOUS OPERATORS
[Diagram: records flow continuously through long-running operators; the example job groups events by user and runs anomaly detection]
Mutable local state per operator; low-latency output
Systems: Google MillWheel, Naiad, streaming databases (Borealis, Flux, etc.)
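The continuous-operator model above can be sketched as a long-running process that holds mutable per-user state and emits a result for every record it sees. This is a minimal Python illustration, not MillWheel or Naiad code; the class name and threshold are invented for the example.

```python
from collections import defaultdict

class AnomalyDetector:
    """Toy continuous operator: keeps mutable per-user state and emits an
    alert as soon as a user's event count exceeds a threshold. Illustrative
    only -- not a real system's API."""

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.counts = defaultdict(int)  # mutable local state

    def process(self, event):
        """Process one record and return output immediately (low latency)."""
        user = event["user"]
        self.counts[user] += 1
        if self.counts[user] > self.threshold:
            return {"user": user, "alert": "suspicious"}
        return None

op = AnomalyDetector(threshold=2)
# Four events for the same user: the 3rd and 4th exceed the threshold.
alerts = [a for a in (op.process({"user": "u1"}) for _ in range(4)) if a]
```

Because the operator processes each record as it arrives, output latency is bounded by per-record work, not by any batch boundary.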
Execution models: MICRO-BATCH
[Diagram: the same group-by-user anomaly-detection job split into short tasks, one set per micro-batch]
Tasks output state on completion; output at task granularity
Dynamic task scheduling enables adaptability: straggler mitigation, elasticity, and fault tolerance
Systems: Microsoft Dryad, Google FlumeJava
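A minimal sketch of the micro-batch model: the stream is chopped into finite batches, each batch runs as a short job, and output appears only when the batch's tasks complete. The function names are illustrative, not any system's API.

```python
from collections import Counter
from itertools import islice

def micro_batches(stream, batch_size):
    """Split an (unbounded) event stream into finite micro-batches."""
    it = iter(stream)
    while batch := list(islice(it, batch_size)):
        yield batch

def run_batch(batch):
    """One micro-batch job: group events by user and count them.
    Output is produced only at batch completion (task granularity)."""
    return Counter(e["user"] for e in batch)

events = [{"user": u} for u in "aabba"]
results = [run_batch(b) for b in micro_batches(events, 3)]
```

The latency floor here is the batch interval: a record arriving at the start of a batch waits for the whole batch to finish before it appears in any output.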
Failure recovery
Failure recovery: CONTINUOUS OPERATORS
[Diagram: operators restored from checkpointed state after a failure]
Chandy-Lamport asynchronous checkpoints capture operator state; on failure, all machines replay from the last checkpoint
Failure recovery: MICRO-BATCH
Task output is periodically checkpointed, and task boundaries capture task interactions
On failure, only the tasks from the failed machine are replayed, and the replay is parallelized across the surviving machines
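The parallel-recovery idea can be illustrated as follows: given a checkpoint and per-task lineage, only the tasks that ran on the failed machine are recomputed, and their replays are spread across the surviving workers. A toy sketch with invented data structures, not a real system's API:

```python
def recover(checkpoint, lineage, failed_machine, workers):
    """Sketch of micro-batch parallel recovery.
    `lineage` maps task id -> (machine it ran on, its input records)."""
    # Only tasks that ran on the failed machine are lost.
    lost = [t for t, (m, _) in lineage.items() if m == failed_machine]
    # Spread the replays round-robin over the surviving workers.
    assignment = {t: workers[i % len(workers)] for i, t in enumerate(lost)}
    state = dict(checkpoint)  # start from the checkpointed task outputs
    for t in lost:
        _, inputs = lineage[t]
        state[t] = sum(inputs)  # replay the task's deterministic computation
    return state, assignment

ckpt = {"t0": 3}  # t0's output was already checkpointed
lineage = {"t0": ("m0", [1, 2]), "t1": ("m1", [4, 5]), "t2": ("m1", [6])}
state, assignment = recover(ckpt, lineage, failed_machine="m1",
                            workers=["m0", "m2"])
```

This is the contrast with continuous operators: recovery work is proportional to what the failed machine held, and it runs in parallel instead of stalling the whole pipeline.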
Execution models
Continuous operators: static scheduling; low latency, but inflexible and slow to fail over
Micro-batch: dynamic scheduling; adaptable (parallel recovery, straggler mitigation), but higher latency
The trade-off couples processing granularity to scheduling granularity
Execution models
Continuous operators: static scheduling, low latency
Micro-batch: dynamic scheduling (coarse granularity), higher latency (coarse-grained processing)
Drizzle: low latency (fine-grained processing) with dynamic scheduling (coarse granularity)
Inside the scheduler
[Diagram: a centralized scheduler dispatching tasks to worker machines]
(1) Decide how to assign tasks to machines, accounting for data locality and fair sharing
(2) Serialize and send the tasks to the workers
SCHEDULING OVERHEADS
Cluster: 4-core r3.xlarge machines. Workload: sum of 10k numbers per core.
[Plot: median task-time breakdown (Compute + Data Transfer, Task Fetch, Scheduler Delay) for 4 to 128 machines]
These scheduling steps repeat for every micro-batch. Key idea: reuse scheduling decisions!
DRIZZLE
[Diagram: micro-batches executing with the scheduler removed from the critical path]
(1) Pre-schedule reduce tasks
(2) Group-schedule micro-batches
Goal: remove frequent scheduler interaction
(1) Pre-schedule reduce tasks
Goal: remove scheduler involvement for reduce tasks
Coordinating shuffles: EXISTING SYSTEMS
[Diagram: shuffle coordinated through a centralized scheduler]
Metadata describes shuffle data locations; reducers fetch the data from remote machines
Coordinating shuffles: PRE-SCHEDULING
(1) Pre-schedule the reducers
(2) Mappers get the reducer metadata
(3) Mappers trigger the reducers directly, with no scheduler on the critical path
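Pre-scheduling, as described above, can be sketched like this: reduce tasks are launched first and sit waiting; mappers are handed the reducer locations up front and notify the reducers themselves when they finish, so no scheduler round trip occurs between the stages. The classes below are illustrative only, not Drizzle's actual implementation.

```python
class Reducer:
    """Pre-scheduled reduce task: launched before any mapper finishes,
    it waits for the expected number of map outputs."""

    def __init__(self, n_mappers):
        self.n_mappers = n_mappers
        self.inputs = []
        self.result = None

    def trigger(self, data):
        """Called directly by a mapper -- no scheduler round trip."""
        self.inputs.append(data)
        if len(self.inputs) == self.n_mappers:  # all map outputs arrived
            self.result = sum(self.inputs)

def run_mapper(partition, reducers):
    # (2) mappers hold the reducer metadata, and (3) on completion
    # they trigger the reducers themselves.
    for r in reducers:
        r.trigger(sum(partition))

# (1) reducers are scheduled before the map stage runs
reducers = [Reducer(n_mappers=2)]
for part in ([1, 2], [3, 4]):
    run_mapper(part, reducers)
```

The design choice here is to move the map-to-reduce handoff off the scheduler's critical path: the scheduler's only job is placing the reducers once, ahead of time.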
(2) Group-schedule micro-batches
Goal: delay returning to the scheduler until a group boundary
Group scheduling
[Diagram: micro-batches scheduled in groups of 2]
Schedule a group of micro-batches at once
Fault tolerance and scheduling decisions happen at group boundaries
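Group scheduling amortizes scheduler interactions over a group of micro-batches: one scheduling decision covers `group_size` batches, and the driver is consulted only at group boundaries. A toy sketch (the count of scheduler calls is the point, not the arithmetic; names are invented for the example):

```python
def group_schedule(batches, group_size):
    """Run micro-batches with one scheduler interaction per group of
    `group_size` batches, instead of one per batch."""
    scheduler_calls = 0
    outputs = []
    for i in range(0, len(batches), group_size):
        scheduler_calls += 1                  # one interaction per group
        group = batches[i:i + group_size]
        outputs.extend(sum(b) for b in group) # run the whole group
    return outputs, scheduler_calls

batches = [[1, 2], [3], [4, 5], [6]]
out_plain, calls_plain = group_schedule(batches, group_size=1)
out_group, calls_group = group_schedule(batches, group_size=2)
```

The outputs are identical in both runs; only the number of scheduler interactions changes, which is exactly the overhead the micro-benchmark on the next slide measures.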
Micro-benchmark: 2 stages
100 iterations; breakdown of pre-scheduling vs. group scheduling
[Plot: time per iteration (ms) for 4 to 128 machines, comparing Baseline, Only Pre-Scheduling, Drizzle-10, and Drizzle-100]
In the paper: group size auto-tuning
Evaluation
Continuous operators: static scheduling, low latency
Micro-batch: dynamic scheduling (coarse granularity), higher latency (coarse-grained processing)
Drizzle: low latency (fine-grained processing) with dynamic scheduling (coarse granularity)
Questions: 1. Latency? 2. Adaptability?
EVALUATION: Latency
Yahoo! Streaming Benchmark
Input: JSON ad-click events. Compute: number of clicks per campaign. Window: updated every 10s.
Comparing Spark 2.0, Flink 1.1.1, and Drizzle on 128 Amazon EC2 r3.xlarge instances
[Plot: CDF of event latency (ms) for Spark, Drizzle, and Flink]
STREAMING BENCHMARK: PERFORMANCE
Yahoo Streaming Benchmark: 20M JSON ad-events per second, 128 machines
Event latency: the difference between when a window ends and when its processing ends
[Plot: event latency over time for Spark, Flink, and Drizzle]
Adaptability: FAULT TOLERANCE
Yahoo Streaming Benchmark: 20M JSON ad-events per second, 128 machines; a machine failure is injected at 240 seconds
[Plot: event latency (ms) over time around the failure for Spark, Flink, and Drizzle]
Execution models
Continuous operators: static scheduling, low latency
Micro-batch: dynamic scheduling, higher latency, optimization of batches
Drizzle: low latency (fine-grained processing), dynamic scheduling (coarse granularity), optimization of batches
INTRA-BATCH QUERY OPTIMIZATION
Optimize the execution of each micro-batch by pushing down aggregation
Yahoo Streaming Benchmark: 20M JSON ad-events per second, 128 machines
[Plot: CDF of event latency (ms) for Spark, Drizzle, Flink, and Drizzle-Optimized]