Composable Core-sets for Diversity and Coverage Maximization
Piotr Indyk (MIT) Sepideh Mahabadi (MIT) Mohammad Mahdian (Google) Vahab S. Mirrokni (Google)
Composable Core-sets for Diversity and Coverage Maximization Piotr - - PowerPoint PPT Presentation
Composable Core-sets for Diversity and Coverage Maximization Piotr Indyk (MIT) Sepideh Mahabadi (MIT) Mohammad Mahdian (Google) Vahab S. Mirrokni (Google) Core-Set Definition Setup Set of points in -dimensional
Piotr Indyk (MIT) Sepideh Mahabadi (MIT) Mohammad Mahdian (Google) Vahab S. Mirrokni (Google)
β Set of π points πΈ in π-dimensional space β Optimize a function π
β Set of π points πΈ in π-dimensional space β Optimize a function π
which suffices to π-approximate the optimal solution
π
πππ π
π
β€ π
πππ π β€ π πππ(π)
β Set of π points πΈ in π-dimensional space β Optimize a function π
which suffices to π-approximate the optimal solution
π
πππ π
π
β€ π
πππ π β€ π πππ(π)
β Optimization Function: Distance of the two farthest points
β Set of π points πΈ in π-dimensional space β Optimize a function π
which suffices to π-approximate the optimal solution
π
πππ π
π
β€ π
πππ π β€ π πππ(π)
β Optimization Function: Distance of the two farthest points β 1-Core-set: Points on the convex hull.
β πΈπ, πΈπ, β¦ , πΈπ are set of points in π-dimensional space β Optimize a function π over their union πΈ.
β πΈπ, πΈπ, β¦ , πΈπ are set of points in π-dimensional space β Optimize a function π over their union πΈ.
points S1 β π
1, S2 β π2, β¦ , Sm β π π
points such that the solution of the union
1 π π
πππ π 1 βͺ β― βͺ π π β€ π
πππ(π 1 βͺ β― βͺ π π)
β πΈπ, πΈπ, β¦ , πΈπ are set of points in π-dimensional space β Optimize a function π over their union πΈ.
points S1 β π
1, S2 β π2, β¦ , Sm β π π
points such that the solution of the union
1 π π
πππ π 1 βͺ β― βͺ π π β€ π
πππ(π 1 βͺ β― βͺ π π)
β πΈπ, πΈπ, β¦ , πΈπ are set of points in π-dimensional space β Optimize a function π over their union πΈ.
points S1 β π
1, S2 β π2, β¦ , Sm β π π
points such that the solution of the union
1 π π
πππ π 1 βͺ β― βͺ π π β€ π
πππ(π 1 βͺ β― βͺ π π)
β πΈπ, πΈπ, β¦ , πΈπ are set of points in π-dimensional space β Optimize a function π over their union πΈ.
points S1 β π
1, S2 β π2, β¦ , Sm β π π
points such that the solution of the union
1 π π
πππ π 1 βͺ β― βͺ π π β€ π
πππ(π 1 βͺ β― βͺ π π)
β Processing sequence of π data elements βon the flyβ β limited Storage
β Processing sequence of π data elements βon the flyβ β limited Storage
β Chunks of size ππ , thus number of chunks = π/π
β Processing sequence of π data elements βon the flyβ β limited Storage
β Chunks of size ππ , thus number of chunks = π/π β Core-set for each chunk β Total Space: π π/π + ππ = π( ππ) β Approximation Factor: π
β Each machine holds a block of data. β A composable core-set is computed and sent to the server
β Each machine holds a block of data. β A composable core-set is computed and sent to the server
WWWβ13] [Abbar, Amer-Yahia, Indyk, Mahabadi, Varadarajan, SoCGβ13]
WWWβ13] [Abbar, Amer-Yahia, Indyk, Mahabadi, Varadarajan, SoCGβ13]
k=4 n = 6
k=4 n = 6
k=4 n = 6
k=4 n = 6
Diversity function over a set π of π point Description Remote-edge Minimum Pairwise Distance: min
π,πβπ ππππ(π, π)
Remote-clique Sum of Pairwise Distances : β ππππ(π, π)
π,πβπ
Remote-tree Weight of Minimum Spanning Tree (MST) of the set π Remote-cycle Weight of minimum Traveling Salesman Tour (TSP) of the set π Remote-star Weight of minimum star: min
πβπ β
ππππ(π, π)
πβπ
Remote-Pseudoforest Sum of the distance of each point to its nearest neighbor β min
πβπ ππππ(π, π) πβπ
Remote-Matching Weight of minimum perfect Matching of the set π Max-Coverage How well the points cover each coordinate max
πβπ ππ π π=1
Diversity function Offline ApproxFactor Composable Coreset Approx factor [Our Results] Remote-edge Minimum Pairwise Distance π(1) [Tmair 91][White 91] [Ravi et al 94] π·(π) Remote-clique Sum of Pairwise Distances π(1) [Hassin et al 97] π·(π) Remote-tree Weight of MST π(1) [Halldorsson et al 99] π·(π) Remote-cycle Weight of minimum TSP π(1) [Halldorsson et al 99] π·(π) Remote-star Weight of minimum star π(1) [Chandra&Halldorsson 01] π·(π) Remote-Pseudoforest Sum of the distance of each point to its nearest neighbor π(log π) [Chandra&Halldorsson 01] π·(π¦π¦π¦ π) Remote-Matching Weight of minimum perfect Matching π(log π) [Chandra&Halldorsson 01] π·(π¦π¦π¦ π) Max-Coverage How well the points cover each coordinate max
πβπ ππ π π=1
π(1) [Feige 98] No Composable Coreset of Poly size in π with app. factor
π πππ π
the currently chosen points is maximized
the currently chosen points is maximized
π π
π π
π π
π π
π
π2 = π(π
π log π)
Let π
1, β― , π π be the set of points , π = βπ π
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c Let π be their maximum diversity , π = max
π πππ€ ππ , Note: divk π β₯ π
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c Let π be their maximum diversity , π = max
π πππ€ ππ , Note: divk π β₯ π
Case 1: one of ππ has diversity as good as the optimum: π β₯ π· πππ€ πππ
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c Let π be their maximum diversity , π = max
π πππ€ ππ , Note: divk π β₯ π
Case 1: one of ππ has diversity as good as the optimum: π β₯ π· πππ€ πππ Case 2: : π β€ π·(πππ€(πππ))
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c Let π be their maximum diversity , π = max
π πππ€ ππ , Note: divk π β₯ π
Case 1: one of ππ has diversity as good as the optimum: π β₯ π· πππ€ πππ Case 2: : π β€ π·(πππ€(πππ))
ππππ ππ , π ππ β€ π·(π )
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c Let π be their maximum diversity , π = max
π πππ€ ππ , Note: divk π β₯ π
Case 1: one of ππ has diversity as good as the optimum: π β₯ π· πππ€ πππ Case 2: : π β€ π·(πππ€(πππ))
ππππ ππ , π ππ β€ π·(π )
π ππ is approximately as good as πππ€ ππ
Let π
1, β― , π π be the set of points , π = βπ π
π1, β― , ππ be their core-sets, S = βππ Goal: πππ€π π β₯ πππ€π(π) / c Let πππ = π1, β― , ππ be the optimal solution Goal: πππ€π π β₯ πππ€(πππ) / c Let π be their maximum diversity , π = max
π πππ€ ππ , Note: divk π β₯ π
Case 1: one of ππ has diversity as good as the optimum: π β₯ π· πππ€ πππ Case 2: : π β€ π·(πππ€(πππ))
ππππ ππ , π ππ β€ π·(π )
π ππ is approximately as good as πππ€ ππ
coverage:
β cov S = β max
π‘βπ ππ π π=1
coverage:
β cov S = β max
π‘βπ ππ π π=1
maximized.
coverage:
β cov S = β max
π‘βπ ππ π π=1
maximized.
π log π and any constant πΎ > 1, there is
no π½-composable core-set of size ππΎ
Build a set of instances π
1, β― , ππ π
let π = 1, β― , π π4
Build a set of instances π
1, β― , ππ π
let π = {1, β― , π π4 }
π be subset of size π of π
Build a set of instances π
1, β― , ππ π
let π = {1, β― , π π4 }
π be subset of size π of π
π from π
π
Build a set of instances π
1, β― , ππ π
let π = {1, β― , π π4 }
π be subset of size π of π
π from π
π
π π
Build a set of instances π
1, β― , ππ π
let π = {1, β― , π π4 }
π be subset of size π of π
π from π
π
π π
We show there exists π
1, β― , π π π such that
β π
π β π 1 has size π
β π
π β π 1 and π π β π 1 are disjoint for π β π
Build a set of instances π
1, β― , ππ π
let π = {1, β― , π π4 }
π be subset of size π of π
π from π
π
π π
We show there exists π
1, β― , π π π such that
β π
π β π 1 has size π
β π
π β π 1 and π π β π 1 are disjoint for π β π
π can be covered,
that is π(π3/2) elements.
Build a set of instances π
1, β― , ππ π
let π = {1, β― , π π4 }
π be subset of size π of π
π from π
π
π π
We show there exists π
1, β― , π π π such that
β π
π β π 1 has size π
β π
π β π 1 and π π β π 1 are disjoint for π β π
π can be covered,
that is π(π3/2) elements.
1 + π log π = O(k log k )
can be covered
β Are there any other applications of composable core-sets?
β Are there any other applications of composable core-sets? β Is there a general characterization of measures for which composable core-sets exist?
β Are there any other applications of composable core-sets? β Is there a general characterization of measures for which composable core-sets exist? β Better approximation factors?