Tight Bounds for Single-Pass Streaming Complexity of the Set Cover - PowerPoint PPT Presentation

Tight Bounds for Single-Pass Streaming Complexity of the Set Cover Problem Sepehr Assadi University of Pennsylvania Joint work with Sanjeev Khanna (Penn) and Yang Li (Penn) Sepehr Assadi (Penn) Symposium on Theory of Computing

The Set Cover Problem Input: A collection of m sets S 1 , . . . , S m from a universe [ n ] . Goal: Choose a smallest subset C of the sets from S 1 , . . . , S m such that C covers [ n ] , i.e., � i ∈ C S i = [ n ] . Sepehr Assadi (Penn) Symposium on Theory of Computing

The Set Cover Problem Input: A collection of m sets S 1 , . . . , S m from a universe [ n ] . Goal: Choose a smallest subset C of the sets from S 1 , . . . , S m such that C covers [ n ] , i.e., � i ∈ C S i = [ n ] . The sets maybe weighted in general. We use OPT to denote the optimal solution size/weight. Sepehr Assadi (Penn) Symposium on Theory of Computing

The Set Cover Problem Input: A collection of m sets S 1 , . . . , S m from a universe [ n ] . Goal: Choose a smallest subset C of the sets from S 1 , . . . , S m such that C covers [ n ] , i.e., � i ∈ C S i = [ n ] . The sets maybe weighted in general. We use OPT to denote the optimal solution size/weight. Approximation vs Estimation: α -approximation: output a set cover of size at most α · OPT plus a certificate of coverage for each element e ∈ [ n ] . α -estimation: output an estimate for the size of minimum set cover in range [ OPT , α · OPT ] . Sepehr Assadi (Penn) Symposium on Theory of Computing

The Set Cover Problem A classic optimization problem with many applications. Sepehr Assadi (Penn) Symposium on Theory of Computing

The Set Cover Problem A classic optimization problem with many applications. A well-understood problem in the classical setting: ◮ Admits a poly-time greedy ln n -approximation algorithm. ◮ No poly-time (1 − ǫ ) · ln n -estimation algorithm unless P = NP. Sepehr Assadi (Penn) Symposium on Theory of Computing

The Set Cover Problem A classic optimization problem with many applications. A well-understood problem in the classical setting: ◮ Admits a poly-time greedy ln n -approximation algorithm. ◮ No poly-time (1 − ǫ ) · ln n -estimation algorithm unless P = NP. This talk: space complexity of approximating the set cover problem in the streaming model. Sepehr Assadi (Penn) Symposium on Theory of Computing

The Streaming Set Cover Problem Model: The input sets S 1 , . . . , S m are presented one by one in a stream. The streaming algorithm has a small space to maintain a summary of the input sets. At the end, the algorithm outputs an exact/approximate set cover using this summary. Sepehr Assadi (Penn) Symposium on Theory of Computing

The Streaming Set Cover Problem Model: The input sets S 1 , . . . , S m are presented one by one in a stream. The streaming algorithm has a small space to maintain a summary of the input sets. At the end, the algorithm outputs an exact/approximate set cover using this summary. Introduced originally by [SG09] and further studied in several recent works [ER14, DIMV14, IMV15, CW16, HPIMV16]. Sepehr Assadi (Penn) Symposium on Theory of Computing

The Streaming Set Cover Problem Model: The input sets S 1 , . . . , S m are presented one by one in a stream. The streaming algorithm has a small space to maintain a summary of the input sets. At the end, the algorithm outputs an exact/approximate set cover using this summary. Introduced originally by [SG09] and further studied in several recent works [ER14, DIMV14, IMV15, CW16, HPIMV16]. Remark. We are not concerned with poly-time computability in this model. Sepehr Assadi (Penn) Symposium on Theory of Computing

State of the Art for Single-Pass Algorithms Result Space Performance Ratio Exact O ( mn ) 1 [IMV15] Ω( mn ) 3 / 2 − ǫ O ( √ n ) [ER14] O ( n ) o ( √ n ) [ER14] Ω( m ) Sepehr Assadi (Penn) Symposium on Theory of Computing

State of the Art for Single-Pass Algorithms Result Space Performance Ratio Exact O ( mn ) 1 [IMV15] Ω( mn ) 3 / 2 − ǫ O ( √ n ) [ER14] O ( n ) o ( √ n ) [ER14] Ω( m ) Many known results for multi-pass algorithms as well: [SG09, IMV15, CW16] . . . Sepehr Assadi (Penn) Symposium on Theory of Computing

State of the Art for Single-Pass Algorithms Result Space Performance Ratio Exact O ( mn ) 1 [IMV15] Ω( mn ) 3 / 2 − ǫ O ( √ n ) [ER14] O ( n ) o ( √ n ) [ER14] Ω( m ) Single-pass Algorithms: o ( m ) space regime is settled by the results of [ER14]. However, sublinear space regime, that is, what can be done in o ( mn ) space is wide open. Sepehr Assadi (Penn) Symposium on Theory of Computing

State of the Art for Single-Pass Algorithms Result Space Performance Ratio Exact O ( mn ) 1 [IMV15] Ω( mn ) 3 / 2 − ǫ O ( √ n ) [ER14] O ( n ) o ( √ n ) [ER14] Ω( m ) Single-pass Algorithms: o ( m ) space regime is settled by the results of [ER14]. However, sublinear space regime, that is, what can be done in o ( mn ) space is wide open. ◮ For example, is O (1) approximation possible in o ( mn ) space? ◮ In general, what is the space-approximation tradeoff in this regime? Sepehr Assadi (Penn) Symposium on Theory of Computing

Our First Result A tight space-approximation tradeoff for single-pass streaming algorithms: Theorem For any α = o ( √ n ) , � Θ( mn/α ) space is both sufficient and necessary for α -approximating the set cover problem. Sepehr Assadi (Penn) Symposium on Theory of Computing

� α -Approximation in O ( mn/α ) space A simple algorithm for (weighted) set cover: Guess OPT and ignore sets with weight > OPT. 1 Sepehr Assadi (Penn) Symposium on Theory of Computing

� α -Approximation in O ( mn/α ) space A simple algorithm for (weighted) set cover: Guess OPT and ignore sets with weight > OPT. 1 Prune: Include a set if it covers more than n/α new elements 2 and remove these elements from the universe. (at most α sets would be included with total weight ≤ α · OPT) Sepehr Assadi (Penn) Symposium on Theory of Computing

� α -Approximation in O ( mn/α ) space A simple algorithm for (weighted) set cover: Guess OPT and ignore sets with weight > OPT. 1 Prune: Include a set if it covers more than n/α new elements 2 and remove these elements from the universe. (at most α sets would be included with total weight ≤ α · OPT) Store all remaining sets over the new universe. 3 (each remaining set contains < n/α elements and hence they can all be stored in O ( mn/α ) space) Sepehr Assadi (Penn) Symposium on Theory of Computing

� α -Approximation in O ( mn/α ) space A simple algorithm for (weighted) set cover: Guess OPT and ignore sets with weight > OPT. 1 Prune: Include a set if it covers more than n/α new elements 2 and remove these elements from the universe. (at most α sets would be included with total weight ≤ α · OPT) Store all remaining sets over the new universe. 3 (each remaining set contains < n/α elements and hence they can all be stored in O ( mn/α ) space) Solve the store set cover instance optimally to cover the elements 4 remained uncovered by the prune step. Sepehr Assadi (Penn) Symposium on Theory of Computing

� α -Approximation in O ( mn/α ) space A simple algorithm for (weighted) set cover: Guess OPT and ignore sets with weight > OPT. 1 Prune: Include a set if it covers more than n/α new elements 2 and remove these elements from the universe. (at most α sets would be included with total weight ≤ α · OPT) Store all remaining sets over the new universe. 3 (each remaining set contains < n/α elements and hence they can all be stored in O ( mn/α ) space) Solve the store set cover instance optimally to cover the elements 4 remained uncovered by the prune step. Our lower bound shows that this simple algorithm is essentially the best possible in terms of space requirement! Sepehr Assadi (Penn) Symposium on Theory of Computing

Approximation vs Estimation Previous upper bounds are for the approximation problem, while lower bounds are for estimation. Sepehr Assadi (Penn) Symposium on Theory of Computing

Approximation vs Estimation Previous upper bounds are for the approximation problem, while lower bounds are for estimation. However, our Ω( mn/α ) lower bound strongly relies on the fact that we are solving the approximation problem and not simply estimating the value of the optimal set cover. Sepehr Assadi (Penn) Symposium on Theory of Computing

Approximation vs Estimation Previous upper bounds are for the approximation problem, while lower bounds are for estimation. However, our Ω( mn/α ) lower bound strongly relies on the fact that we are solving the approximation problem and not simply estimating the value of the optimal set cover. Question: Can it be that estimation is strictly easier than approximation? Sepehr Assadi (Penn) Symposium on Theory of Computing

Our Second Result Estimation is indeed distinctly easier! Theorem For any α = o ( √ n ) , there exists a randomized α -estimation � O ( mn/α 2 ) space algorithm for the streaming set cover problem. Works in general for any covering integer program, and in particular for weighted set-cover or set multi-cover problem. Sepehr Assadi (Penn) Symposium on Theory of Computing

Tight Bounds for Single-Pass Streaming Complexity of the Set Cover - PowerPoint PPT Presentation

Tight Bounds for Single-Pass Streaming Complexity of the Set Cover Problem Sepehr Assadi University of Pennsylvania Joint work with Sanjeev Khanna (Penn) and Yang Li (Penn) Sepehr Assadi (Penn) Symposium on Theory of Computing The Set Cover

Chapter 2 Tight-frames An Introduction 1 Outline 1. Tight-frame 1. Tight-frame 2. Matrix

Amit Chakrabarti Dartmouth College WAPMDS, IIT Kanpur, Dec 2009 Amit Chakrabarti 1 Multi-Pass

50% pass developmental credit course course pass take pass developmental credit credit

Computing Tight Bounds for Insurance Payments with Nonlinear Risk Man Hong WONG 1 Shuzhong ZHANG 2

Tight Bounds for Learning a Mixture of Two Gaussians Moritz Hardt Eric Price Google Research

Circuit Lower-bounds Lecture 24 Weak circuits are indeed weak 1 Circuit Lower-bounds 2

U-Pass Program Executive Management Committee May 17, 2018 1 U-PASS The U-Pass Pilot

Tight Space-Approximation Tradeoff for the Multi-Pass Streaming Set Cover Problem Sepehr Assadi

Submodular Streaming in All Its Glory: Tight Approximation, Minimum Memory and Low Adaptive

Quadratically Tight Relations for Randomized Query Complexity Rahul Jain Hartmut Klauck Srijita

Tight Bounds for Cost-Sharing in Weighted Congestion Games Martin Gairing University of

Nearly Tight Bounds for Robust Proper Learning of Halfspaces with a Margin Ilias Diakonikolas

The Proposed Closure of Rollover Pass Texas General Land Office Jerry Patterson, Land

U-PASS IMPLEMENTATION 2015/2016 Why are we implementing the U-Pass? In 2014/2015, the

What is this talk about? What is this talk about? Deriving tight and safe task execution time

Training Presentation Web Streaming Introduction What is Web Streaming? Who is Streaming?

Exercise Sheet 5: Space Complexity David Carral December 5, 2019 Exercise 1 Let A LBA be the

Homework Homework 6 & 7 Computational Complexity Returned today Homework 8

Computational Social Choice: Spring 2015 Ulle Endriss Institute for Logic, Language and

Games solved: Now and in the future by H. J. van den Herik, J. W. H. M. Uiterwijk, and J. van

Space of Search Strategies CSE 573: Artificial Intelligence Blind Search DFS, BFS, IDS

Symmetry Characterization Theorems for Homogeneous Siegel Domains Takaaki Nomura Kyushu

Mod 3 Unit 7 Lesson 5 Three Dimensional Space Lecture Slides.notebook April 29, 2015 1 Mod 3

Fast Polyhedra Abstract Domain Gagandeep Singh Markus Pschel Martin Vechev Department of

Tight Bounds for Single-Pass Streaming Complexity of the Set Cover - PowerPoint PPT Presentation

Tight Bounds for Single-Pass Streaming Complexity of the Set Cover Problem Sepehr Assadi University of Pennsylvania Joint work with Sanjeev Khanna (Penn) and Yang Li (Penn) Sepehr Assadi (Penn) Symposium on Theory of Computing The Set Cover

Chapter 2 Tight-frames An Introduction 1 Outline 1. Tight-frame 1. Tight-frame 2. Matrix

Amit Chakrabarti Dartmouth College WAPMDS, IIT Kanpur, Dec 2009 Amit Chakrabarti 1 Multi-Pass

50% pass developmental credit course course pass take pass developmental credit credit

Computing Tight Bounds for Insurance Payments with Nonlinear Risk Man Hong WONG 1 Shuzhong ZHANG 2

Tight Bounds for Learning a Mixture of Two Gaussians Moritz Hardt Eric Price Google Research

Circuit Lower-bounds Lecture 24 Weak circuits are indeed weak 1 Circuit Lower-bounds 2

U-Pass Program Executive Management Committee May 17, 2018 1 U-PASS The U-Pass Pilot

Tight Space-Approximation Tradeoff for the Multi-Pass Streaming Set Cover Problem Sepehr Assadi

Submodular Streaming in All Its Glory: Tight Approximation, Minimum Memory and Low Adaptive

Quadratically Tight Relations for Randomized Query Complexity Rahul Jain Hartmut Klauck Srijita

Tight Bounds for Cost-Sharing in Weighted Congestion Games Martin Gairing University of

Nearly Tight Bounds for Robust Proper Learning of Halfspaces with a Margin Ilias Diakonikolas

The Proposed Closure of Rollover Pass Texas General Land Office Jerry Patterson, Land

U-PASS IMPLEMENTATION 2015/2016 Why are we implementing the U-Pass? In 2014/2015, the

What is this talk about? What is this talk about? Deriving tight and safe task execution time

Training Presentation Web Streaming Introduction What is Web Streaming? Who is Streaming?

Exercise Sheet 5: Space Complexity David Carral December 5, 2019 Exercise 1 Let A LBA be the

Homework Homework 6 &amp; 7 Computational Complexity Returned today Homework 8

Computational Social Choice: Spring 2015 Ulle Endriss Institute for Logic, Language and

Games solved: Now and in the future by H. J. van den Herik, J. W. H. M. Uiterwijk, and J. van

Space of Search Strategies CSE 573: Artificial Intelligence Blind Search DFS, BFS, IDS

Symmetry Characterization Theorems for Homogeneous Siegel Domains Takaaki Nomura Kyushu

Mod 3 Unit 7 Lesson 5 Three Dimensional Space Lecture Slides.notebook April 29, 2015 1 Mod 3

Fast Polyhedra Abstract Domain Gagandeep Singh Markus Pschel Martin Vechev Department of

Homework Homework 6 & 7 Computational Complexity Returned today Homework 8