 
              Set Cover in Sub-linear Time Piotr Indyk Sepideh Mahabadi Ronitt Rubinfeld MIT Columbia University MIT/TAU Ali Vakilian Anak Yodpinyanee MIT MIT
Set Cover Problem Input: Collection β± of sets π 1 , β¦ , π π , each a subset of π± = {1, β¦ , π} 4 2 5 1 3
Set Cover Problem Input: Collection β± of sets π 1 , β¦ , π π , each a subset of π± = {1, β¦ , π} 4 Output: a subset π of β± such that: β’ π covers π± 2 β’ | π | is minimized 5 1 3
Set Cover Problem Input: Collection β± of sets π 1 , β¦ , π π , each a subset of π± = {1, β¦ , π} 4 Output: a subset π of β± such that: β’ π covers π± 2 β’ | π | is minimized 5 1 Complexity : β’ NP-hard 3
Set Cover Problem Input: Collection β± of sets π 1 , β¦ , π π , each a subset of π± = {1, β¦ , π} 4 Output: a subset π of β± such that: β’ π covers π± 2 β’ | π | is minimized 5 1 Complexity : β’ NP-hard 3 β’ Greedy (ln π) -approximation algorithm
Set Cover Problem Input: Collection β± of sets π 1 , β¦ , π π , each a subset of π± = {1, β¦ , π} 4 Output: a subset π of β± such that: β’ π covers π± 2 β’ | π | is minimized 5 1 Complexity : β’ NP-hard 3 β’ Greedy (ln π) -approximation algorithm β’ Canβt do better unless P=NP [LY91][RS97][Fei98][AMS06][DS14]
Set Cover Problem Input: Collection β± of sets π 1 , β¦ , π π , each a subset of π± = {1, β¦ , π} 4 Output: a subset π of β± such that: β’ π covers π± 2 β’ | π | is minimized 5 1 Complexity : β’ NP-hard 3 β’ Greedy (ln π) -approximation algorithm β’ Canβt do better unless P=NP [LY91][RS97][Fei98][AMS06][DS14] βIs it possible to solve minimum set cover in sub-linear time ?β
Sub-linear Time Set Cover Data Access Model ?
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» SetOf (π, π) : π th set containing π
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» β’ No assumption on the order SetOf (π, π) : π th set containing π β’ Incidence list in (sub-linear) algorithms for graphs
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» β’ No assumption on the order SetOf (π, π) : π th set containing π β’ Incidence list in (sub-linear) algorithms for graphs β’ Sublinear in ππ
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» β’ No assumption on the order SetOf (π, π) : π th set containing π β’ Incidence list in (sub-linear) algorithms for graphs β’ Sublinear in ππ Prior Results
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» β’ No assumption on the order SetOf (π, π) : π th set containing π β’ Incidence list in (sub-linear) algorithms for graphs β’ Sublinear in ππ Prior Results ο± [Nguyen, Onak β08][Yoshida, Yamamoto, Itoβ12] ο§ Constant queries, if degree is constant
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» β’ No assumption on the order SetOf (π, π) : π th set containing π β’ Incidence list in (sub-linear) algorithms for graphs β’ Sublinear in ππ Prior Results ο± [Nguyen, Onak β08][Yoshida, Yamamoto, Itoβ12] ο§ Constant queries, if degree is constant ο± [Koufogiannakis , Youngβ14][ Grigoriadis , Kachiyanβ95]: ο§ Find (1 + π) -approximate fractional solution , then perform randomized rounding to achieve π(log π) -approximation
Sub-linear Time Set Cover Data Access Model [NOβ08,YYIβ12] EltOf (π», π) : π th element in π» β’ No assumption on the order SetOf (π, π) : π th set containing π β’ Incidence list in (sub-linear) algorithms for graphs β’ Sublinear in ππ Prior Results ο± [Nguyen, Onakβ08][Yoshida, Yamamoto, Itoβ12] ο§ Constant queries, if degree is constant ο± [Koufogiannakis , Youngβ14][ Grigoriadis , Kachiyanβ95]: ο§ Find (1 + π) -approximate fractional solution , then perform randomized rounding to achieve π(log π) -approximation ο§ π(ππ 2 + ππ 2 ) (can be improved to π(π + ππ) ) π = number of elements π = number of sets π = size of the optimal solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) Cover Verification : given a set system, verify whether a given sub-collection of sets covers the universe. π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) Cover Verification : given a set system, verify whether a given sub-collection of sets covers the universe. π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) Cover Verification : given a set system, verify whether a given sub-collection of sets covers the universe. π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) Cover Verification : given a set system, verify whether a given sub-collection of sets covers the universe. π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Results Problem Approximation Constraints Query Complexity 1 π π½β1 + ππ π½π + 1 π½ β₯ 2 π π π π ππ π + 1 β π Set Cover 1 1 π 4π½+1 π 2π½ π½ π β€ Ξ© π log π π Ξ© ππ π½ β€ 1.01 π½ π = π(π/ log π) π Cover Verification β π β€ π/2 Ξ©(ππ) Cover Verification : given a set system, verify whether a given sub-collection of sets covers the universe. π = approximation factor for offline Set Cover π = number of elements π = number of sets π = Size of the optimal Solution
Part one: upper bound Theorem: There exists an algorithm that with high probability π(ππ π/π· + ππ) finds an O(ππ½) -approximate cover which uses number of queries.
Part one: upper bound Theorem: There exists an algorithm that with high probability π(ππ π/π· + ππ) finds an O(ππ½) -approximate cover which uses number of queries. 1. Two simple components used for coverage problems in massive data models. β’ Set Sampling β’ Element Sampling 2. The algorithm overview
Component I: set sampling Set Sampling: After picking β sets uniformly at random, all m log π elements with degree at least are covered w.h.p. β β’ We only need to worry about low degree elements.
Recommend
More recommend