From a Calculus to an Execution Environment for Stream Processing
DEBS 2012
1
Robert Soulé Martin Hirzel Buğra Gedik Robert Grimm
Cornell University IBM Research Bilkent University New York University
From a Calculus to an Execution Environment for Stream Processing - - PowerPoint PPT Presentation
From a Calculus to an Execution Environment for Stream Processing Robert Soul Martin Hirzel Bu ra Gedik Robert Grimm Cornell University IBM Research Bilkent University New York University DEBS 2012 1 to an Execution Environment
1
Cornell University IBM Research Bilkent University New York University
2
CQL (StreamSQL) StreamIt (SDF) Sawzall (MapReduce) River (execution environment) System S (platform) Fusion (merge ops) Fission (replicate ops) Placement (assign hosts)
3
4 ¡
5
6
7
8
9
10
11
logs : {origin : string; target : string} stream; hits : {origin : string; count : int} stream = select istream(origin, count(origin)) from logs[range 300] where origin != target Bag.filter (fun x -> #expr) Bag.filter (fun x ->
12
select istream(*) from quotes[now], history where quotes.ask<=history.low and quotes.ticker=history.ticker
Expression analyzer SQL analyzer CQL analyzer Symbol table is-a has-a has-a has-a
13
Source language River (execution environment) System S (platform) Optimizer
14
15
16
now proj ect istre am dup split ran ge join istre am aggre gate join se lect join ran ge parti tion proj ect dis tinct dup- split now proj ect aggre gate pro ject pro ject rstre am
17
4x speedup on 4 machines
2x speedup on 16 machines
18
19
20
Stream processing Execution environment Translators from languages to IL CQL Arasu et al. [VLDB J.’06] SVM Labonte et al. [PACT’04] P-Code Nelson [CC’79] This paper
21