SamzaSQL Scalable Fast Data Management with Streaming SQL Milinda - PowerPoint PPT Presentation

SamzaSQL Scalable Fast Data Management with Streaming SQL Milinda Pathirage (IU), Julian Hyde (Hortonworks), Yi Pan (LinkedIn), Beth Plale (IU) School of Informatics and Computing INDIANA UNIVERSITY

Fast Data Data has to be processed as it arrives, so that we can react immediately to changing conditions. BIG DATA ISN’T JUST BIG; IT’S ALSO FAST. Big data is often data that is generated at incredible speeds, such as click-stream data, financial ticker data, log aggregation, and sensor data. John Hugg, "Fast data: The next step after big data"

Applications � Real-time distributed tracing for website performance and efficiency optimizations � Calculating click-through rates � Data stream enrichment ◦ Count page views by group key where group key is retrieved from a key/value storage ◦ Enriching data streams related to use activities with user’s information such as location and company � At the time of writing LinkedIn uses 90 Kafka clusters deployed across 1500 nodes to process 150TB of input data daily

Lambda Architecture (LA) LA is a technology agnostic data processing architecture that attempts to balance latency, accuracy, throughput and fault-tolerance by providing a unified serving layer on top of batch and stream processing sub-systems. From: https://www.oreilly.com/ideas/questioning-the-lambda-architecture

Kappa Architecture (KA) Simplification of Lambda Architecture is KA that uses append-only immutable log as the canonical data store; batch processing is replaced by stream replay. From: https://www.oreilly.com/ideas/questioning-the-lambda-architecture

M OTIVATION

Programming APIs for LA and KA Summingbird is a well known abstraction for writing LA style applications. KA style applications are mainly written in a stateful stream processing APIs provided by frameworks such as Apache Samza. Limitations � Need to maintain two complex distributed systems � Users need to understand complex programming abstractions � Long turnaround times

Summingbird WORD COUNT def wordCount[ P <: Platform [ P ]] (source : Producer [ P , String ], store : P#Store [ String , Long ]) = source.flatMap { sentence => toWords(sentence).map( _ -> 1L) }.sumByKey(store) More examples at https://github.com/twitter/summingbird

Samza API WINDOW AGGREGATION public class WikipediaStatsStreamTask implements StreamTask, InitableTask, WindowableTask { ... private KeyValueStore<String, Integer> store; public void init(Config config, TaskContext context) { this .store = (KeyValueStore<String, Integer>) context.getStore("wikipedia-stats"); } @Override public void process(IncomingMessageEnvelope envelope, MessageCollector collector, TaskCoordinator coordinator) { Map<String, Object> edit = (Map<String, Object>) envelope.getMessage(); ... } @Override public void window(MessageCollector collector, TaskCoordinator coordinator) { ... collector.send( new OutgoingMessageEnvelope( new SystemStream("kafka", "wikipedia-stats"), counts)); ... }

SQL for Big Data There are several well known SQL-on-Hadoop solutions and most organizations that use Hadoop use one or more SQL-on-Hadoop solutions. � Apache Hive � Presto � Apache Drill � Apache Impala � Apache Kylin � Apache Tajo � Apache Phoenix

Motivating Research Questions � Can the same low barrier and the clear semantics of SQL be extended to queries that execute simultaneously over data streams (in movement) and tables (at rest)? � Can this be done with minimal and well-founded extensions to SQL? � And with minimal overhead over a non-SQL-based LA/KA?

SAMZASQL

Streaming SQL - Data Model � Stream: A stream S is a possibly indefinite partitioned sequence of temporally-defined elements where an element is a tuple belonging to the schema of S. � Partition: A partition is a time-ordered, immutable sequence of elements existing within a single stream. � Relation: Analogous to a relation/table in relational databases, a relation R is a bag of tuples belonging to the schema of R.

Streaming SQL - Continuous Queries SAMZASQL SELECT STREAM rowtime, productId, units FROM Orders WHERE units > 25 CQL SELECT ISTREAM rowtime, productId, units FROM Orders WHERE units > 25;

Streaming SQL - Window Aggregations SAMZASQL SELECT STREAM TUMBLE_END (rowtime, INTERVAL '1' HOUR) AS rowtime, productId, COUNT (*) AS c , SUM (units) AS units FROM Orders GROUP BY TUMBLE (rowtime, INTERVAL '1' HOUR), productId CQL SELECT ISTREAM ... AS rowtime, productId, COUNT (*) AS c , SUM (units) AS units FROM Orders[Range '1' HOUR, Slide '1' HOUR] GROUP BY productId;

Streaming SQL - Sliding Windows SAMZASQL SELECT STREAM rowtime, productId, units, SUM (units) OVER ( ORDER BY rowtime PARTITION BY productId RANGE INTERVAL '1' HOUR PRECEDING) unitsLastHour FROM Orders; CQL SELECT ISTREAM rowtime, productId, units, SUM (units) AS unitsLastHour FROM Orders[Range '1' HOUR] GROUP BY productId;

Streaming SQL - Window Joins SAMZASQL SELECT STREAM GREATEST(PacketsR1.rowtime, PacketsR2.rowtime) AS rowtime, PacketsR1.sourcetime, PacketsR1.packetId, PacketsR2.rowtime - PacketsR1.rowtime AS timeToTravel FROM PacketsR1 JOIN PacketsR2 ON PacketsR1.rowtime BETWEEN PacketsR2.rowtime - INTERVAL '2' SECOND AND PacketsR2.rowtime + INTERVAL '2' SECOND AND PacketsR1.packetId = PacketsR2.packetId

Streaming SQL - Stream-to-Relation Joins SAMZASQL SELECT STREAM * FROM Orders as o JOIN Products as p on o.productId = p.productId

SamzaSQL - Implementation � Uses Apache Calcite query planning framework � Utilizes Calcite’s code generation framework � Generates Samza jobs for streaming SQL queries � Uses Samza’s local storage to implement fault-tolerant window aggregations � Uses Samza’s bootstrap stream feature to cache the relation to perform stream-to-relation join queries � Uses Janino to compile operators generated during stream task initialization

SamzaSQL - Architecture SamzaSQL Shell Samza YARN Client Schema Calcite Model Zookeeper Registry SamzaSQL Job

SamzaSQL - Samza Job Node Manager YARN Resource Samza YARN Client Samza App Master Manager Node Manager Node Manager SamzaContainer SamzaContainer SamzaContainer [ s-p0 ] [ s-p1 ] [ s-p2 ] Kafka Cluster s-p2 s-p1 s-p0 Kafka Broker 2 Kafka Broker 1 Kafka Broker n

SamzaSQL - Kafka

SamzaSQL - Query Planner Apache Calcite Convert to Generic Parser Validator Logical Plan Optimizations SELECT STREAM Samza Job … Configuration* Conver to SamzaSQL SamzaSQL Optimizations Model Execution Plan

EVALUATION

Evaluation - Environment � 100 byte messages (based on previous Kafka benchmarks) � 3 node (EC2 r3.2xlarge) Kafka cluster � 3 node (EC2 r3.2xlarge) YARN cluster � Each r3.2xlarge instance has 8 vCPUs, 61GB of RAM and 160 GB SSD backed storage � Data model ◦ Stream - Orders (rowtime, productId, orderId, units) ◦ Table - Products (productId, name, supplierId)

Evaluation - Results � Per task throughput is around 550MB/m for simple queries (100 byte messages) � Throughput is around 200MB/m when local storage is used (100 byte messages) � 30-40% overhead for simple queries when compared with Samza jobs written in Java � Overheads are mainly due to message format transformations required in streaming SQL runtime � Overheads increase when local storage is used due to message serialization/deserialization

Evaluation - SamzaSQL Message Processing Flow MESSAGE PROCESSING FLOW Decode AvrotoArray Process ArraytoAvro Encode

Evaluation - Filter Throughput · 10 7 Throughput ( msg / m ) 6 4 2 SamzaSQL Native 2 4 8 16 Number of tasks SELECT STREAM * FROM Orders WHERE units > 50

Evaluation - Project Throughput · 10 7 Throughput ( msg / m ) 6 4 2 SamzaSQL Native 2 4 8 16 Number of tasks SELECT STREAM rowtime, productId, units FROM Orders

Evaluation - Stream-to-Relation Join Throughput · 10 7 Throughput ( msg / m ) 4 2 SamzaSQL Native 0 2 4 8 16 Number of tasks SELECT STREAM Orders.rowtime, Orders.orderId, Orders,productId, Orders.units, Products.supplierId FROM Orders JOIN ON Orders.productId = Products.productId

Evaluation - Sliding Window Throughput Number of tasks SamzaSQL Samza 1 0 0 . 2 0 . 4 0 . 6 0 . 8 1 · 10 6 Throughput ( msg / m ) SELECT STREAM rowtime, productId, units, SUM(units) OVER (PARTITION BY productId ORDER BY rowtime RANGE INTERVAL ’5’ MINUTE PRECEDING) unitsLastFiveMinutes FROM Orders Sliding window query throughput was measured in a iMac due to limitations in EC2 IO rates.

RELATED WORK

Related Work � Eerly work on streaming SQL - TelegraphCQ, Tribecca, GSQL � CQL � Streaming SQL for Apache Flink and Apache Storm based on our work in Apache Calcite

F UTURE WORK AND CONCLUSION

Future Work � Code generation to bring SamzaSQL generated physical plans closer to Samza Java API based queries � Local storage related improvements to reduce serialization/deserialization overheads � Streaming query optimizations for fast data management systems � Ordering guarantees in the presence of stream repartitioning � Stream-to-relation queries � Intra-query optimizations � Handling out-of-order arrivals

SamzaSQL Scalable Fast Data Management with Streaming SQL Milinda - PowerPoint PPT Presentation

SamzaSQL Scalable Fast Data Management with Streaming SQL Milinda Pathirage (IU), Julian Hyde (Hortonworks), Yi Pan (LinkedIn), Beth Plale (IU) School of Informatics and Computing INDIANA UNIVERSITY Fast Data Data has to be processed as it

IFRS 17 General Measurement Model 26 th April 2019 2 Disclaimer The views expressed in this

Delivering inclusive capitalism Sharing success with investors, customers and society LEGAL &

Lecture 19 Computing with Graphical Processing Units Announcements Evaluate your TAs 4

Neil Foster, Robert Stehrer, Marcel Timmer, Gaaitzen de Vries WIOD conference, 24-26 april 2012

Internet technology research Internet technology research with International collaboration with

Module 8: Evaluating Immune Correlates of Protection Instructors: Ivan Chan, Peter Gilbert, Paul

Central Representation of Touch John H. Martin, Ph.D. Center for Neurobiology & Behavior

Modeling Adult Visual Function Dr. James A. Bednar jbednar@inf.ed.ac.uk

ASIC accelerators 1 To read more This days papers: Reagan et al, Minerva: Enabling

Discovery of Latent Factors in High-dimensional Data Using Tensor Methods Furong Huang

Theoretical neuroscience: From single neuron to network dynamics Nicolas Brunel Outline

Logic, language and the brain: how autists reason with rules and exceptions Michiel van

Patient and Physician Reported Outcomes Karl Swedberg Senior professor of Medicine University

What Assessment Strategies Should Be Used in Determining Physician Competence? Bill McGaghie

On the Muskat problem Robert Strain (University of Pennsylvania) Collaborators: Peter Constantin

Hybrid Automata Lecturer: Tiziano Villa 1 1 Dipartimento dInformatica Universit` a di Verona

Reducing SSD Read Latency via NAND Flash Program and Erase Suspension Guanying Wu and Xubin He

Three ways to print

From Autotune to READEX Michael Gerndt Technische Universitt Mnchen Project overview

Closure relations for non-uniform suspensions Kengo Ichiki Andrea Prosperetti Department of

+ R N RX R 3 RN + X - time of reaction : 2 3 days solvent: methanol, acetonitrile R

Lesson Plan Template Lesson Segment Focus: Phases of Mitosis_ Lesson _of____

Graph cut, convex relaxation and continuous max-flow problem Egil Bae (UCLA) and Xue-Cheng Tai

Multiphase flows in complex geometries: a UQ perspective Matteo Icardi Stochastic Numerics group

SamzaSQL Scalable Fast Data Management with Streaming SQL Milinda - PowerPoint PPT Presentation

SamzaSQL Scalable Fast Data Management with Streaming SQL Milinda Pathirage (IU), Julian Hyde (Hortonworks), Yi Pan (LinkedIn), Beth Plale (IU) School of Informatics and Computing INDIANA UNIVERSITY Fast Data Data has to be processed as it

IFRS 17 General Measurement Model 26 th April 2019 2 Disclaimer The views expressed in this

Delivering inclusive capitalism Sharing success with investors, customers and society LEGAL &amp;

Lecture 19 Computing with Graphical Processing Units Announcements Evaluate your TAs 4

Neil Foster, Robert Stehrer, Marcel Timmer, Gaaitzen de Vries WIOD conference, 24-26 april 2012

Internet technology research Internet technology research with International collaboration with

Module 8: Evaluating Immune Correlates of Protection Instructors: Ivan Chan, Peter Gilbert, Paul

Central Representation of Touch John H. Martin, Ph.D. Center for Neurobiology &amp; Behavior

Modeling Adult Visual Function Dr. James A. Bednar jbednar@inf.ed.ac.uk

ASIC accelerators 1 To read more This days papers: Reagan et al, Minerva: Enabling

Discovery of Latent Factors in High-dimensional Data Using Tensor Methods Furong Huang

Theoretical neuroscience: From single neuron to network dynamics Nicolas Brunel Outline

Logic, language and the brain: how autists reason with rules and exceptions Michiel van

Patient and Physician Reported Outcomes Karl Swedberg Senior professor of Medicine University

What Assessment Strategies Should Be Used in Determining Physician Competence? Bill McGaghie

On the Muskat problem Robert Strain (University of Pennsylvania) Collaborators: Peter Constantin

Hybrid Automata Lecturer: Tiziano Villa 1 1 Dipartimento dInformatica Universit` a di Verona

Reducing SSD Read Latency via NAND Flash Program and Erase Suspension Guanying Wu and Xubin He

Three ways to print

From Autotune to READEX Michael Gerndt Technische Universitt Mnchen Project overview

Closure relations for non-uniform suspensions Kengo Ichiki Andrea Prosperetti Department of

+ R N RX R 3 RN + X - time of reaction : 2 3 days solvent: methanol, acetonitrile R

Lesson Plan Template Lesson Segment Focus: __Phases of Mitosis___ Lesson _________of____________

Graph cut, convex relaxation and continuous max-flow problem Egil Bae (UCLA) and Xue-Cheng Tai

Multiphase flows in complex geometries: a UQ perspective Matteo Icardi Stochastic Numerics group

Delivering inclusive capitalism Sharing success with investors, customers and society LEGAL &

Central Representation of Touch John H. Martin, Ph.D. Center for Neurobiology & Behavior

Lesson Plan Template Lesson Segment Focus: Phases of Mitosis_ Lesson _of____