Counting Triangles under Updates in Worst-Case Optimal Time Ahmet - PowerPoint PPT Presentation

Counting Triangles under Updates in Worst-Case Optimal Time Ahmet Kara, Hung Q. Ngo, Milos Nikolic Dan Olteanu, and Haozhe Zhang fdbresearch.github.io Highlights 2018, Berlin Relational AI

Problem Setting Maintain the triangle count Q under single-tuple updates to R , S , and T ! A R T B C S Q counts the number of tuples in the join of R, S, and T . Q = � a , b , c R ( a , b ) · S ( b , c ) · T ( c , a )

The Maintenance Problem single-tuple single-tuple single-tuple update update update database D 0 D 1 D 2 auxiliary maintain maintain A 0 A 1 A 2 data structure maintain maintain triangle Q ( D 0 ) Q ( D 1 ) Q ( D 2 ) count Given a current database D and a single-tuple update, what are the time and space complexities for maintaining Q ( D )?

Much Ado about Triangles The Triangle Query Served as Milestone in Many Fields Worst-case optimal join algorithms [Algorithmica 1997, SIGMOD R. 2013] Parallel query evaluation [Found. & Trends DB 2018] Randomized approximation in static settings [FOCS 2015] Randomized approximation in data streams [SODA 2002, COCOON 2005, PODS 2006, PODS 2016, Theor. Comput. Sci. 2017] Intensive Investigation of Answering Queries under Updates Theoretical developments [PODS 2017, ICDT 2018] Systems developments [F. & T. DB 2012, VLDB J. 2014, SIGMOD 2017, 2018] Lower bounds [STOC 2015, ICM 2018] So far: No dynamic algorithm maintaining the exact triangle count in worst-case optimal time!

Na¨ ıve Maintenance “ Compute from scratch! ” δ R = { ( a ′ , b ′ ) �→ m } � � � R ( a , b ) + δ R ( a , b ) · S ( b , c ) · T ( c , a ) a , b , c � �� newR = � a , b , c newR ( a , b ) · S ( b , c ) · T ( c , a ) Maintenance Complexity Time: O ( | D | 1 . 5 ) using worst-case optimal join algorithms Space: O ( | D | ) to store input relations

Classical Incremental View Maintenance (IVM) “ Compute the difference! ” δ R = { ( a ′ , b ′ ) �→ m } � � � R ( a , b ) + δ R ( a , b ) · S ( b , c ) · T ( c , a ) a , b , c = � a , b , c R ( a , b ) · S ( b , c ) · T ( c , a ) + δ R ( a ′ , b ′ ) · � c S ( b ′ , c ) · T ( c , a ′ ) Maintenance Complexity Time: O ( | D | ) to intersect C -values from S and T Space: O ( | D | ) to store input relations

Factorized Incremental View Maintenance (F-IVM) “ Compute the difference by using pre-materialized views! ” δ R = { ( a ′ , b ′ ) �→ m } Pre-materialize V ST ( b , a ) = � c S ( b , c ) · T ( c , a )! � � � R ( a , b ) + δ R ( a , b ) · S ( b , c ) · T ( c , a ) a , b , c = � a , b , c R ( a , b ) · S ( b , c ) · T ( c , a ) + δ R ( a ′ , b ′ ) · V ST ( b ′ , a ′ ) Maintenance Complexity Time for updates to R : O (1) to look up in V ST Time for updates to S and T : O ( | D | ) to maintain V ST Space: O ( | D | 2 ) to store input relations and V ST

Closing the Complexity Gap Complexity bounds for the maintenance of the triangle count Known Upper Bound Maintenance Time: O ( | D | ) Space: O ( | D | ) Known Lower Bound Amortized maintenance time: not O ( | D | 0 . 5 − γ ) for any γ > 0 (under reasonable complexity theoretic assumptions)

Closing the Complexity Gap Complexity bounds for the maintenance of the triangle count Known Upper Bound Maintenance Time: O ( | D | ) Space: O ( | D | ) Can the triangle count be maintained in sublinear time? Known Lower Bound Amortized maintenance time: not O ( | D | 0 . 5 − γ ) for any γ > 0 (under reasonable complexity theoretic assumptions)

Closing the Complexity Gap Complexity bounds for the maintenance of the triangle count Known Upper Bound Maintenance Time: O ( | D | ) Space: O ( | D | ) Yes! We propose: IVM ε Can the triangle count Amortized maintenance time: be maintained in O ( | D | 0 . 5 ) sublinear time? This is worst-case optimal! Known Lower Bound Amortized maintenance time: not O ( | D | 0 . 5 − γ ) for any γ > 0 (under reasonable complexity theoretic assumptions)

IVM ε Exhibits a Time-Space Tradeoff Given ε ∈ [0 , 1], IVM ε maintains the triangle count with O ( | D | max { ε, 1 − ε } ) amortized time and O ( | D | 1+min { ε, 1 − ε } ) space. complexity O ( | D | 1 . 5 ) Space Amortized Time O ( | D | ) worst-case optimality O ( | D | 0 . 5 ) ε = 0 . 5 ε 0 0 . 5 1 Known maintenance approaches are recovered by IVM ε .

Main Ideas in IVM ε Compute the difference like in classical IVM! Materialize views like in Factorized IVM! New ingredient: Use adaptive processing based on data skew! = ⇒ Treat heavy values differently from light values!

Quo Vadis IVM ε ? Generalization of IVM ε IVM ε variants obtain sublinear maintenance time for counting versions of Loomis-Whitney, 4-cycle, and 4-path. Ongoing Work Characterization of the class of conjunctive count queries that admit sublinear maintenance time Implementation of IVM ε on top of DBToaster

Quo Vadis IVM ε ? Generalization of IVM ε IVM ε variants obtain sublinear maintenance time for counting versions of Loomis-Whitney, 4-cycle, and 4-path. Ongoing Work Characterization of the class of conjunctive count queries that admit sublinear maintenance time Implementation of IVM ε on top of DBToaster For details, see arxiv.org/abs/1804.02780

Quick Look inside IVM ε Partition R into a light part R L = { t ∈ R | | σ A = t . A | < | D | ε } , a heavy part R H = R \ R L ! R light part A B R L · · A B a b 1 . . . . . . . . n < | D | ε . . . . a b n · · heavy part · · R H a ′ b ′ 1 A B . . . . . . m ≥ | D | ε . . . . . . . . . . . . a ′ b ′ m · ·

Quick Look inside IVM ε Derived Bounds Partition R into for all A -values a : a light part | σ A = a R L | < | D | ε R L = { t ∈ R | | σ A = t . A | < | D | ε } , | π A R H | ≤ | D | 1 − ε a heavy part R H = R \ R L ! R light part A B R L · · A B a b 1 . . . . . . . . n < | D | ε . . . . a b n · · heavy part · · R H a ′ b ′ 1 A B . . . . . . m ≥ | D | ε . . . . . . . . . . . . a ′ b ′ m · ·

Quick Look inside IVM ε Derived Bounds Partition R into for all A -values a : a light part | σ A = a R L | < | D | ε R L = { t ∈ R | | σ A = t . A | < | D | ε } , | π A R H | ≤ | D | 1 − ε a heavy part R H = R \ R L ! R light part A B R L Likewise, partition · · A B S = S L ∪ S H based on B , and a b 1 . . . . . . . . n < | D | ε . . T = T L ∪ T H based on C ! . . a b n · · heavy part · · R H a ′ b ′ 1 A B . . . . . . m ≥ | D | ε . . . . . . . . . . . . a ′ b ′ m · ·

Quick Look inside IVM ε Derived Bounds Partition R into for all A -values a : a light part | σ A = a R L | < | D | ε R L = { t ∈ R | | σ A = t . A | < | D | ε } , | π A R H | ≤ | D | 1 − ε a heavy part R H = R \ R L ! R light part A B R L Likewise, partition · · A B S = S L ∪ S H based on B , and a b 1 . . . . . . . . n < | D | ε . . T = T L ∪ T H based on C ! . . a b n · · heavy part · · R H a ′ b ′ Q is the sum of skew-aware views 1 A B . . . . R U ( a , b ) · S V ( b , c ) · T W ( c , a ) . . m ≥ | D | ε . . . . . . . . . . with U , V , W ∈ { L , H } . . . a ′ b ′ m · ·

Adaptive Maintenance Strategy Given an update δ R ∗ = { ( a ′ , b ′ ) �→ m } , compute the difference for each skew-aware view using different strategies: Skew-aware View Evaluation from left to right Time � δ R ∗ ( a ′ , b ′ ) · � S L ( b ′ , c ) · T L ( c , a ′ ) R ∗ ( a , b ) · S L ( b , c ) · T L ( c , a ) O ( | D | ε ) a , b , c c

Adaptive Maintenance Strategy Given an update δ R ∗ = { ( a ′ , b ′ ) �→ m } , compute the difference for each skew-aware view using different strategies: Skew-aware View Evaluation from left to right Time � δ R ∗ ( a ′ , b ′ ) · � S L ( b ′ , c ) · T L ( c , a ′ ) R ∗ ( a , b ) · S L ( b , c ) · T L ( c , a ) O ( | D | ε ) a , b , c c � δ R ∗ ( a ′ , b ′ ) · � T H ( c , a ′ ) · S H ( b ′ , c ) O ( | D | 1 − ε ) R ∗ ( a , b ) · S H ( b , c ) · T H ( c , a ) a , b , c c

Adaptive Maintenance Strategy Given an update δ R ∗ = { ( a ′ , b ′ ) �→ m } , compute the difference for each skew-aware view using different strategies: Skew-aware View Evaluation from left to right Time � δ R ∗ ( a ′ , b ′ ) · � S L ( b ′ , c ) · T L ( c , a ′ ) R ∗ ( a , b ) · S L ( b , c ) · T L ( c , a ) O ( | D | ε ) a , b , c c � δ R ∗ ( a ′ , b ′ ) · � T H ( c , a ′ ) · S H ( b ′ , c ) O ( | D | 1 − ε ) R ∗ ( a , b ) · S H ( b , c ) · T H ( c , a ) a , b , c c δ R ∗ ( a ′ , b ′ ) · � S L ( b ′ , c ) · T H ( c , a ′ ) O ( | D | ε ) c � R ∗ ( a , b ) · S L ( b , c ) · T H ( c , a ) or a , b , c δ R ∗ ( a ′ , b ′ ) · � T H ( c , a ′ ) · S L ( b ′ , c ) O ( | D | 1 − ε ) c

Counting Triangles under Updates in Worst-Case Optimal Time Ahmet - PowerPoint PPT Presentation

Counting Triangles under Updates in Worst-Case Optimal Time Ahmet Kara, Hung Q. Ngo, Milos Nikolic Dan Olteanu, and Haozhe Zhang fdbresearch.github.io Highlights 2018, Berlin Relational AI Problem Setting Maintain the triangle count Q under

Geometry Triangles Triangles Return to Table of Contents www.njctl.org Slide 5 / 210 Slide 6

Trigonometric functions Step one: similar triangles Two similar triangles have the same set of

Rasterization May 1, 2006 Triangles Only We will discuss the rasterization of triangles

Counting Triangles and Modeling MapReduce Siddharth Suri Yahoo! Research Outline 2 Modeling

Worst-Case Optimal Redistribution of VCG Payments in Multi-Unit Auctions Mingyu Guo Joint work

Information Geometry in Mathematical Finance: Model Risk, Worst and Almost Worst Scenarios Imre

Worst-case Ethernet Network Latency for Shaped Sources Max Azarov, Standard Microsystems (SMSC)

Comparison of Efficiency Binary Binomial Procedure (worst- (worst- (amortized) case) case)

Typical versus Worst Case Design in Networking Nandita Dukkipati Yashar Ganjali, Rui Zhang-Shen

Law of Sines In Section 14-3 you studied techniques for solving right triangles. In this

JUST THE MATHS SLIDES NUMBER 3.4 TRIGONOMETRY 4 (Solution of triangles) by A.J.Hobson

Right Triangle Trigonometry Special Right Triangles Trigonometric Functions Inverse

CS 6958 LECTURE 7 TRIANGLES, BVH January 29, 2014 Triangles 2 Lets try to derive an

Optimal Portfolio Under Worst-Case Scenarios Carole Bernard (UW), Jit Seng Chen (UW) and Steven

Optimal Portfolio Under Worst-Case Scenarios Carole Bernard (UW), Jit Seng Chen (UW) and Steven

Optimal Portfolio Under Worst-Case Scenarios Carole Bernard (UW), Jit Seng Chen (UW) and Steven

DRIVOLUTION: RETHINKING THE DATABASE DRIVER LIFECYCLE Emmanuel Cecchet (UMass Amherst) Joint

Rethinking naturalness Francesco Sannino Can the Higgs be elementary ? Francesco Sannino Plan

The .NET Framework: What is it and How to Use it? Drs.ir. Vadim V. Zaytsev 19 October 2004

Rado Positional Games Christopher Kusch Juanjo Ru e Tibor Szab o Christoph Spiegel The

tI fm 7iowIeorsEiii iiie troiuigerror Tr modees i clossifcotion bing Shopire 1997 A C Sit

Charles Augustin Coulomb (1736-1806) France Coulomb's Law ~1875 Mini Mini- -Ph.D. Ph.D.

SL 2 ( C ) -holonomy invariants of links Calvin McPhail-Snyder June 1, 2020 UC Berkeley

Dynamic Zero Compression for Cache Energy Reduction