A Worst-Case Opmal Mul-Round Algorithm for Parallel Computaon of - PowerPoint PPT Presentation

A Worst-Case Op�mal Mul�-Round Algorithm for Parallel Computa�on of Conjunc�ve Queries Bas Ketsman & Dan Suciu 1

Topic of the Talk How to compute mul�-joins (over graphs) ... ( x , y , z ) ← R ( x , y ) , S ( y , z ) , T ( z , x ) ... in a mul�-round shared nothing cluster se�ng ... input output ... with communica�on cost that is worst-case op�mal ? 2

Introduc�on Worst-case op�mality: ▶ Output size: AGM bound [Atserias, Grohe & Marx 08] query output = m ρ ∗ . Lower-bound on worst-case running-�me ▶ Op�mal sequen�al algorithms: (w.r.t running-�me) Leapfrog-trie-join, NPRR, Generic Join 3

Introduc�on (2) Worst-case op�mal communica�on cost: ▶ Load = maximal amount of messages received by any server in any communica�on round ▶ Lowerbound m load ≥ p 1/ ρ ∗ . [Koutris, Beame & Suciu 16] ▶ Op�mal parallel algorithms: (w.r.t communica�on cost) [Koutris, Beame & Suciu 16] Ad-hoc algorithms for chains, stars, simple cycles 4

Main Result A parallel algorithm exists for compu�ng join queries over graphs using only a constant number of rounds and load ≤ ˜ O ( m / p 1/ ρ ∗ ) . Query/schema restric�ons: ▶ Arity at most two ▶ No projec�ons ▶ No self-joins Essen�ally op�mal: ▶ Up to a poly-log factor ▶ Data-complexity 5

Outline The Model Lowerbound and Hypercube ( ρ ∗ and τ ∗ ) Main Result by Example Summary & Future Work 6

Massively Parallel Communica�on Model: [Koutris, Suciu 2011] Input Input Input fragment fragment fragment Synchronized Synchronized Synchronized communica�on communica�on communica�on Local Local Local computa�on computa�on computa�on Output Output Output fragment fragment fragment 7

Lower-Bound [Koutris, Beame, Suciu 2016] Input fragment For a constant-round algorithm to be correct for given query on every instance Synchronized worst-case load is communica�on m ≥ p 1/ ρ ∗ Local computa�on (assuming equi-sized rela�ons) Through AGM bound Output fragment 9

ρ ∗ = Frac�onal Edge Covering Number R 1 ( x , y ) , R 2 ( y , z ) , R 3 ( z , x ) , R 4 ( z , u ) , R 5 ( u , w ) , T 6 ( u , t ) , T 7 ( t , s ) , T 8 ( s , u ) Query Graph 1/2 1/2 0 1 ρ ∗ = 7/2 1/2 0 0 1 1 ▶ Objec�ve func�on: Assign a posi�ve weight to every edge ▶ Constraint: Every vertex incident to sum of weights ≥ 1 ▶ Op�miza�on goal: Minimize total sum of assigned weights 10

Hypercube (= shares algorithm) Input fragment = Single-round hash-join algorithm Introduced by [Afra�, Ullman, 2010] Synchronized If database has no skew , runs with load: communica�on m ≤ p 1/ τ ∗ Local computa�on (w.h.p. and ignoring poly-log factor) Output [Beame, Koutris, Suciu 2013] fragment 11

τ ∗ = Frac�onal Edge Packing Number 1/2 1/2 0 1 τ ∗ = 7/2 1/2 0 0 1 ▶ Objec�ve func�on: Assign a posi�ve weight to every edge ▶ Constraint: Every vertex incident to sum of weights ≤ 1 ▶ Op�miza�on goal: Maximize total sum of assigned weights 12

Rela�on between τ ∗ and ρ ∗ ? Solu�on is �ght if sa�sfies = rather than ≤ or ≥ . For general hypergraphs: No clear rela�on between τ ∗ and ρ ∗ ! For simple graphs: ▶ Op�mal half-integral frac�onal edge packings exist (using only weights 1 , 1/2 and 0 ) ≤ ρ ∗ (assign weights 1/2 to all ver�ces) ▶ τ ∗ ≤ | vars ( Q ) | 2 ▶ τ ∗ + ρ ∗ = | vars ( Q ) | 13

Heavy-Hi�er Configura�ons Example Query: ( x , y , z ) ← R 1 ( x , y ) , R 2 ( y , z ) , R 3 ( z , u ) Heavy-hi�er : value with degree > δ (in some direc�on) Skew : some heavy-hi�er exists 15

Break Skewed Instance in Understandable Pieces Heavy-hi�er configura�on ( δ, H ) : A skew threshold value δ + labeling of query variables with “heavy” (H) or “light” (others). 1. 2. Matching instance I ( δ, H ) = induced subinstance where heavy variables have only the heavy values, light variables only the light values. 16

Break Skewed Instance in Understandable Pieces Evalua�on strategy: Compute Q in parallel over all instances I ( δ, H ) using the same p servers. For Fixed δ : Claim: ∪ H ⊆ vars ( Q ) Q ( I | ( δ, H ) ) = Q ( I ) . As the number of configura�ons depends on Q , maximal load ≤ max H { maximal load to compute Q on I ( δ, H ) } . (ignoring constants) 17

The Algorithm in a Nutshell Preprocessing: ▶ Iden�fy where skew is Heavy-hi�ers and degrees of heavy-hi�ers. Algorithm: 1. Break skewed instance in understandable pieces 2. Divide and Conquer strategy to deal with skew 3. Solve remaining (skew-free) problem with Hypercube 18

The Algorithm by Example Example query Servers 0 1 1/2 1/2 1/2 ▶ τ ∗ = ρ ∗ = | vars ( Q ) | /2 19

The Algorithm by Example m Threshold value: δ = p 1/ | vars ( Q ) | Do computa�on for each heavy-hi�er configura�on in parallel “all light” “all heavy” “hybrid” 20

The Algorithm by Example: “All light” Use the Hypercube algorithm ▶ Due to �ghtness: τ ∗ = ρ ∗ = | vars ( Q ) | /2 m m ▶ non skewed means: degree ≤ δ = p 1/ | vars ( Q ) | = p 1/(2 τ ∗ ) ▶ Hypercube ensures load ≤ m m p 1/ ρ ∗ . p 1/ τ ∗ = 21

The Algorithm by Example: “Hybrid” Step 1: Broadcast heavy rela�on ▶ As before: load ≤ m p 1/ ρ ∗ due to m ≥ p 2 . Refocus: ▶ Solu�on can be easily extended. 23

Step 2: Assign group of servers to every heavy value ▶ Combina�on of outputs = complete output 24

▶ size of group p ′ = p ( | vars ( Q ) |− 1)/ | vars ( Q ) | (because ≤ p 1/ | vars ( Q ) | heavy values) Step 3: Semi-join reduce involved rela�ons ▶ reduc�ons are cheap: 2 rounds and load ≤ m m p ′ ≤ p 1/ ρ ∗ (because we have > 2 light variables) Refocus: ▶ Output for simpler query can be translated to output for original query by simply adding to every tuple the locally known heavy value 25

Main Result Every conjunc�ve query without self-joins, that is full, over rela�ons with ari�es at most two can be computed in 9 rounds m with load ≤ ˜ p 1/ ρ ∗ ) . O ( Essen�aly op�mal ρ ∗ seems the right way to express op�mality for the communica�on cost of distributed query evalua�on algorithms, at least when rela�on ari�es do not exceed two. 28

Future Work Does an algorithm exist with worst-case op�mal load m / p 1/ ρ ∗ for queries over rela�ons with arbitrary-ari�es? ▶ rela�on between edge cover / packing unclear in general ▶ half-integral edge cover/packing does not always exist ▶ queries exist where τ ∗ > ρ ∗ R 1 ( x 1 , y 1 , z 1 ) , R 2 ( x 2 , y 2 , z 2 ) , S 1 ( x 1 , x 2 ) , S 2 ( y 1 , y 2 ) , S 3 ( z 1 , z 2 ) . ⇒ Hypercube cannot be used even when there is no skew Is m / p 1/ ρ ∗ a �ght lowerbound for joins over arbitrary-arity rela�ons? 29

Future Work (2) Are the 9 rounds essen�al? What if queries have existen�al quan�fica�on (projec�ons)? What if the database has dependencies? 30

Thank you! 31

A Worst-Case Opmal Mul-Round Algorithm for Parallel Computaon of - PowerPoint PPT Presentation

A Worst-Case Opmal Mul-Round Algorithm for Parallel Computaon of Conjuncve Queries Bas Ketsman & Dan Suciu 1 Topic of the Talk How to compute mul-joins (over graphs) ... ( x , y , z ) R ( x , y ) , S ( y , z ) , T ( z , x

MAL MEIERS Chef, founder of Food for Thought MAL MEIERS Chef, founder of Food For Thought About

Toy Example Toy Example Toy Example Toy Example Toy Example D 1 weak classifiers = vertical or

Mul&lingualism @ ECUAD Debora O & Tara Wren

Mul$-Object Synchroniza$on Mul$-Object Programs What happens

Mul$-Object Synchroniza$on Mul$-Object Programs What happens

Introduction to Parallel Computing George Karypis Principles of Parallel Algorithm Design

Information Geometry in Mathematical Finance: Model Risk, Worst and Almost Worst Scenarios Imre

Worst-case Ethernet Network Latency for Shaped Sources Max Azarov, Standard Microsystems (SMSC)

Comparison of Efficiency Binary Binomial Procedure (worst- (worst- (amortized) case) case)

Typical versus Worst Case Design in Networking Nandita Dukkipati Yashar Ganjali, Rui Zhang-Shen

A RARE PRESENTATION OF UNUSUAL BENIGN MUL A RARE PRESENTATION OF UNUSUAL BENIGN MUL TICYSTIC

COMMUNITY CONTRIBUTION FUND (CCF) Round 9, September 2015 TA11 CCF Summary 2011 -2015 Round

Lattices that Admit Logarithmic Worst-Case to Average-Case Connection Factors Chris Peikert 1

quiz insertion sort: worst-case time complexity? best-case time complexity? in-place?

Mul,-way Joins J(a,b,c) :- R(a,b) S(b,c) T(a,c)

To Reserve or Not to Reserve: Op.mal Online Mul.-Instance

1 A Nachos Filesystem A Nachos FilesystemOn Disk On Disk A Typical Unix File Tree A Typical

Undecidability of propositional separation logic and its neighbours James Brotherston Computer

Schism: Fragmentation-Tolerant Real-Time Garbage Collection Fil Pizlo Luke Ziarek Peta Maj

Decidable fragments of first order logic R. Ramanujam The Institute of Mathematical Sciences,

Remote Procedure Call Client Server R e Blocked q u e s t Outline Protocol Stack

A Semantics-Free Tagalog Fragment Carl Pollard Department of Linguistics Ohio State University

Samsung R&D Institute India - Bangalore 1 Embedded Linux Conference, San Jose, CA, March-2015

Rank Polymorphism Viewed as a Constraint Problem Justin Slepak Panagiotis Manolios Olin

A Worst-Case Opmal Mul-Round Algorithm for Parallel Computaon of - PowerPoint PPT Presentation

A Worst-Case Opmal Mul-Round Algorithm for Parallel Computaon of Conjuncve Queries Bas Ketsman & Dan Suciu 1 Topic of the Talk How to compute mul-joins (over graphs) ... ( x , y , z ) R ( x , y ) , S ( y , z ) , T ( z , x

MAL MEIERS Chef, founder of Food for Thought MAL MEIERS Chef, founder of Food For Thought About

Toy Example Toy Example Toy Example Toy Example Toy Example D 1 weak classifiers = vertical or

Mul&amp;lingualism @ ECUAD Debora O &amp; Tara Wren

Mul$-Object Synchroniza$on Mul$-Object Programs What happens

Mul$-Object Synchroniza$on Mul$-Object Programs What happens

Introduction to Parallel Computing George Karypis Principles of Parallel Algorithm Design

Information Geometry in Mathematical Finance: Model Risk, Worst and Almost Worst Scenarios Imre

Worst-case Ethernet Network Latency for Shaped Sources Max Azarov, Standard Microsystems (SMSC)

Comparison of Efficiency Binary Binomial Procedure (worst- (worst- (amortized) case) case)

Typical versus Worst Case Design in Networking Nandita Dukkipati Yashar Ganjali, Rui Zhang-Shen

A RARE PRESENTATION OF UNUSUAL BENIGN MUL A RARE PRESENTATION OF UNUSUAL BENIGN MUL TICYSTIC

COMMUNITY CONTRIBUTION FUND (CCF) Round 9, September 2015 TA11 CCF Summary 2011 -2015 Round

Lattices that Admit Logarithmic Worst-Case to Average-Case Connection Factors Chris Peikert 1

quiz insertion sort: worst-case time complexity? best-case time complexity? in-place?

Mul,-way Joins J(a,b,c) :- R(a,b) S(b,c) T(a,c)

To Reserve or Not to Reserve: Op.mal Online Mul.-Instance

1 A Nachos Filesystem A Nachos FilesystemOn Disk On Disk A Typical Unix File Tree A Typical

Undecidability of propositional separation logic and its neighbours James Brotherston Computer

Schism: Fragmentation-Tolerant Real-Time Garbage Collection Fil Pizlo Luke Ziarek Peta Maj

Decidable fragments of first order logic R. Ramanujam The Institute of Mathematical Sciences,

Remote Procedure Call Client Server R e Blocked q u e s t Outline Protocol Stack

A Semantics-Free Tagalog Fragment Carl Pollard Department of Linguistics Ohio State University

Samsung R&amp;D Institute India - Bangalore 1 Embedded Linux Conference, San Jose, CA, March-2015

Rank Polymorphism Viewed as a Constraint Problem Justin Slepak Panagiotis Manolios Olin

Mul&lingualism @ ECUAD Debora O & Tara Wren

Samsung R&D Institute India - Bangalore 1 Embedded Linux Conference, San Jose, CA, March-2015