Approximation Schemes for Euclidean k-Medians and Related Problems - - PowerPoint PPT Presentation

approximation schemes for euclidean k medians and related
SMART_READER_LITE
LIVE PREVIEW

Approximation Schemes for Euclidean k-Medians and Related Problems - - PowerPoint PPT Presentation

Approximation Schemes for Euclidean k-Medians and Related Problems S. Arora, P. Raghavan, S. Rao STOC '98 A Nearly Linear-Time App. Scheme for the Euclidean k-median Problem ESA ' 99 S. Kolliopoulos, S. Rao k-medians Problem Given S={x i


slide-1
SLIDE 1

Approximation Schemes for Euclidean k-Medians and Related Problems

  • S. Arora, P. Raghavan, S. Rao

STOC '98

A Nearly Linear-Time App. Scheme for the Euclidean k-median Problem

  • S. Kolliopoulos, S. Rao

ESA '99

slide-2
SLIDE 2

k-medians Problem

Given

  • S={xi} be n points in metric space Rd
  • Positive integer k

Goal

  • Find M={mi} ∈ Rd which minimizes
slide-3
SLIDE 3

k-medians Problem

Paper 1 Paper 2

slide-4
SLIDE 4

k-medians Problem

Paper 1 Paper 2

slide-5
SLIDE 5

k-medians Problem

Paper 1 Paper 2

slide-6
SLIDE 6

Facility Location Problem

Given

  • S={xi} be n points in metric space Rd
  • Positive cost function c()

Goal

  • Find M={mi} ∈ S which minimizes
slide-7
SLIDE 7

Facility Location Problem

slide-8
SLIDE 8

Facility Location Problem

slide-9
SLIDE 9

Previous Results: Facility Location

O(logn) approx Hochbaum '82 3.16 approx Shmoys et al. '97 2.41 approx Guha and Khuller '98 1.74 approx Chudak '98 but no (1+ε) approx before this paper

slide-10
SLIDE 10

Previous Results: k-medians

(1+ε) approx using (1+1/ε)(1+lnn)k medians Lin and Vitter '92 2(1+ε) approx using (1+1/ε)k medians Lin and Vitter '92

slide-11
SLIDE 11

Results: Paper 1

In 2D, for the k-median problem, given any constant positive ε with probability 1-o(1) achieve solution at most (1+ε)OPT in time O(nO(1/ε)nk logn)

  • Proof of existence
  • Dynamic programming: bound table size
slide-12
SLIDE 12

Results: Paper 1

In 2D, for the facility problem, given any constant positive ε with probability 1-o(1) achieve solution at most (1+ε)OPT in time O(n1+O(1/ε)logn)

slide-13
SLIDE 13

Results: Paper 2

In 2D, for the k-median problem, given any constant positive ε with probability 1-o(1) achieve solution at most (1+ε)OPT in time O(2O(1+log(1/ε)/ε)nklogn)

  • Proof of existence
  • Dynamic programming: bound table size
slide-14
SLIDE 14

Approximation Schemes for Euclidean k-Medians and Related Problems

  • S. Arora, P. Raghavan, S. Rao

STOC '98

slide-15
SLIDE 15

Integer Point Coordinates

  • Given n points, minmax solution D is

2OPT for facility assignment

  • Optimal facility cost in [D/2,Dn]
  • Arora-TSP '98: Mapping each point to jD/n2

increases cost by maximum O(D/n)

  • Length of BBox(S)

– L=O(n4)

(<ε?)

slide-16
SLIDE 16

Terminology

  • Dissection vs quadtree
  • L = O(n4)
  • Number of nodes O(L2)
  • Minimum size of box = 1
  • Depth of tree log(L)
  • Dissection with (a,b) shift
  • Quadtree derived from (a,b)-shifted dissection
slide-17
SLIDE 17

A Result from Arora-TSP-'98

S={li} is a collection of line segments t(S,l) number of lines in S crossing l

slide-18
SLIDE 18

More Notations

  • BBox at level 0

its 4 children at level 1, …

  • Level of an edge

level of the corresponding box

  • Edge is in i level ⇒ in (i+1), (i+2), .., logL
  • maximal level
slide-19
SLIDE 19

Charging Scheme

R-charging For a maximal edge which is crossed g times by S, charge

slide-20
SLIDE 20

Charging Lemma

  • Expected total cost for top i-level edges is
  • Maximal level of grid line l be j

– Length L/2j – Charge t(S,l)/R L/2j – Probability 2j/L

slide-21
SLIDE 21

Charging Lemma

  • i ≤ logL
  • Choose R = logL/ε
  • Cost ≤ ε t(S,l)
  • Total ≤ O(ε cost(S))
slide-22
SLIDE 22

m-portal

  • m-regular portals for a shifted dissection
  • Total number of points ⇒ 4m
slide-23
SLIDE 23

m-portal

  • m-regular portals for a shifted dissection
  • Total number of points ⇒ 4m

3-portal

slide-24
SLIDE 24

m-light Solution

slide-25
SLIDE 25

m-light Solution

slide-26
SLIDE 26

Existence

  • m>1 and (a,b)∈U[0,L]
  • OPT ⇒ deflect to form m-portal

– for l sized square, cost of deflection O(l/m) – m-charging scheme

  • w.p. ½ or in expectation or w.h.p.

m-light solution exists for (a,b) shifted dissection with cost at most

slide-27
SLIDE 27

Existence

Solution is O( (1+logL/m)OPT )

  • L=O(n4)
  • Choose m to get O( (1+ε)OPT )
slide-28
SLIDE 28

Dynamic Programming: Sketch

  • Finds solution

within (1+1/4m) of m-light OPT

O((1+ε/4logn)(1+ε)OPT) = O((1+ε)OPT) Dynamic Programming

  • 1. Nearest facility within (1+1/4m)
  • 2. Nearest facilities similar for neighbors
slide-29
SLIDE 29

Dynamic Programming: Sketch

Given f and sub-boxes (children boxes Si-s) solve for current level

  • Table build for all choices f(≤k) and S
  • Table size O(nc)

⇒ worst case time for algorithm

slide-30
SLIDE 30

Summary

In 2D, for the k-median problem, given any constant positive ε with probability 1-o(1) achieve solution at most (1+ε)OPT in time O(nO(1/ε)nk logn)

  • Proof of existence
  • Dynamic programming: bound table size
slide-31
SLIDE 31

A Nearly Linear-Time App. Scheme for the Euclidean k-median Problem

  • S. Kolliopoulos, S. Rao

ESA '99

slide-32
SLIDE 32

k-medians

– Given

  • S={xi} be n points in metric space Rd
  • Positive integer k

– Goal

  • Find M={mi} ∈ S in metric space Rd which minimizes
slide-33
SLIDE 33

Paper 2

O(nO(1/ε)nk logn) reduced to O(O(1/ε)nk logn)

  • Reduce the number of portals from

O(logn/ε) ⇒ O(1+log(1/ε)/ε)

  • Different construction (no (a,b) shifting)
slide-34
SLIDE 34

Adaptive Dissection

Sub-rectangle

slide-35
SLIDE 35

Adaptive Dissection

Sub-rectangle

slide-36
SLIDE 36

Adaptive Dissection

Sub-rectangle

slide-37
SLIDE 37

Adaptive Dissection

Cut-rectangle (randomization)

slide-38
SLIDE 38

Adaptive Dissection

Cut-rectangle (randomization)

slide-39
SLIDE 39

Adaptive Dissection

Cut-rectangle (randomization)

slide-40
SLIDE 40

Lemma (just one of many)

  • Given two parallel cut-lines (due to cut-

rectangle) are L apart, the line segments has side length less than 3L.

slide-41
SLIDE 41

Structure Theorem

Error due to assignment using m-portal respecting paths is bounded by

Choose

slide-42
SLIDE 42

Extensions

  • d-dimension ⇒ md-1-portal
  • Facility location

– Same structure

  • Capacitated k-median

– Tweak dynamic programming

  • Few medians (small k)

– Guess position of facilities – Number of choices

slide-43
SLIDE 43

k-medians Problem

slide-44
SLIDE 44

k-centers Problem