On the cost of essentially fair clusterings Ioana Bercea, Martin - - PowerPoint PPT Presentation

on the cost of essentially fair clusterings
SMART_READER_LITE
LIVE PREVIEW

On the cost of essentially fair clusterings Ioana Bercea, Martin - - PowerPoint PPT Presentation

On the cost of essentially fair clusterings Ioana Bercea, Martin Gro, Samir Khuller, Aounon Kumar, Clemens Rsner, Daniel Schmidt, Melanie Schmidt Aussois, January 2019 1 Assigning students to schools Example problem Goals Minimize


slide-1
SLIDE 1

On the cost of essentially fair clusterings

Ioana Bercea, Martin Groß, Samir Khuller, Aounon Kumar, Clemens Rösner, Daniel Schmidt, Melanie Schmidt Aussois, January 2019

1

slide-2
SLIDE 2

Assigning students to schools

Example problem Goals

Minimize maximum distance to school Preserve color ratios

2

slide-3
SLIDE 3

Assigning students to schools

Example problem Goals

Minimize maximum distance to school Preserve color ratios

2

slide-4
SLIDE 4

Assigning students to schools

Example problem Goals

Minimize maximum distance to school Preserve color ratios

2

slide-5
SLIDE 5

What is a fair assignment?

Given

P point set colp color of each point p lh, uh (rational) lower/upper bound on ratio of color h c1, . . . , ck ⊆ P k locations (“centers”)

Assignment φ : P → {1, . . . , k} is fair if

lh ≤ |{p ∈ P | colp = h ∧ φ(p) = ℓ}| |{p ∈ P | φ(p) = ℓ}| ≤ uh for all colors h and all centers ℓ. [Chierichetti et al., NIPS 2017]

3

slide-6
SLIDE 6

Fair clustering

Find k centers and a fair assignment φ minimizing

max

p∈P dist(p, φ(p))

(k-center)

  • p∈P

dist(p, φ(p)) (k-median)

  • p∈P

dist2(p, φ(p)) (k-means) Also: Facility location, supplier variants

4

slide-7
SLIDE 7

State-of-the-Art I

Classical (colorblind) clustering

Approximation Hardness k-center 2 [G85,HS85] 2 [HN79] facility location 1.488 [L11] 1.463 [GK99] k-median 2.675+ε [BPRST15] ≈ 1.74 [HN79] k-means 6.357 [ANSW17] 1.0013 [ACKS15,LSW15]

[ACKS15] Awasthi, Charikar, Krishnaswamy, Sinop. SoCG 2015. [ANSW17] Ahmadian, Norouzi-Fard, Svensson, Ward. FOCS 2017. [BPRST15] Behsaz, Friggstad, Salavatipour, Sivakumar. ICALP 2015. [G85]

  • Gonzalez. Theoretical Computer Science 1985.

[GK99] Guha, Khuller. J. Algorithms 1999. [HN79] Hsu, Nemhauser. Discrete Applied Mathematics 1979. [HS85] Hochbaum, Shmoys. Mathematics of Operations Research 1985. [L11]

  • Li. ICALP 2011.

[LSW15] Lee, M. Schmidt, Wright. IPL 2017.

5

slide-8
SLIDE 8

State-of-the-Art II

Fair clustering

Approximation exact ratio fair essentially fair k-center 14 [RS18] 4∗ [CKLV17] facility location k-median Θ(t)∗ [CKLV17] k-means

∗at most two colors

[CKLV17] Chierichetti, Kumar, Lattanzi, Vassilvitskii. NIPS 2017. [RS18] Rösner, M. Schmidt. ICALP 2018.

6

slide-9
SLIDE 9

State-of-the-Art II

Fair clustering

Approximation exact ratio fair essentially fair k-center 14 5 4∗ [CKLV17] 3 facility location 3.488 k-median Θ(t)∗ [CKLV17] 4.675 k-means 62.856

∗at most two colors

[CKLV17] Chierichetti, Kumar, Lattanzi, Vassilvitskii. NIPS 2017. [RS18] Rösner, M. Schmidt. ICALP 2018.

6

slide-10
SLIDE 10

State-of-the-Art II

Fair clustering

Approximation exact ratio fair essentially fair k-center 14 5 4∗ [CKLV17] 3 ← today facility location 3.488 k-median Θ(t)∗ [CKLV17] 4.675 k-means 62.856

∗at most two colors

[CKLV17] Chierichetti, Kumar, Lattanzi, Vassilvitskii. NIPS 2017. [RS18] Rösner, M. Schmidt. ICALP 2018.

6

slide-11
SLIDE 11

Main result

Black-box approach:

Any clustering C A fair LP solution weakly supervised rounding An essentially fair clustering C′ with cost(C′) ∈ O(cost(C) + FAIR-OPT) Turns any colorblind clustering into an essentially fair clustering on the same set of centers with slightly increased cost.

7

slide-12
SLIDE 12

Main result

Black-box approach:

Any clustering C A fair LP solution weakly supervised rounding An essentially fair clustering C′ with cost(C′) ∈ O(cost(C) + FAIR-OPT) Turns any colorblind clustering into an essentially fair clustering on the same set of centers with slightly increased cost. For instance: Use a 2-approximate k-center solution.

7

slide-13
SLIDE 13

Weakly supervised LP-rounding

“Guess” optimum radius τ from O(|P|2) candidates

The classical k-center LP with fairness constraints

  • i∈P

xij = 1 for all j ∈ P xij ≤ yi for all i ∈ P

  • i∈P

yi ≤ k xij = 0 for all i, j with dist(i, j) > τ ℓh

  • j∈P

xij ≤

  • col(pj)=colh

xij ≤ uh

  • j∈P

xij for all i ∈ P and all colors h

8

slide-14
SLIDE 14

Step 1: Rounding y

1 3 1 3 1 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-15
SLIDE 15

Step 1: Rounding y

1 3 1 3 1 3 1 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-16
SLIDE 16

Step 1: Rounding y

1 3 1 3 1 3 1 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-17
SLIDE 17

Step 1: Rounding y

1 3 1 3 1 3 1 3 1 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-18
SLIDE 18

Step 1: Rounding y

1 3 1 3 1 3 1 3 1 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-19
SLIDE 19

Step 1: Rounding y

1 3 1 3 1 3 1 3 1 3 2 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-20
SLIDE 20

Step 1: Rounding y

1 3 1 3 1 3 1 3 2 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-21
SLIDE 21

Step 1: Rounding y

1 3 1 3 1 3

≤ τ

1 3 2 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-22
SLIDE 22

Step 1: Rounding y

1 3 1 3 1 3

≤ τ ≤ cost(C)

1 3 2 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-23
SLIDE 23

Step 1: Rounding y

1 3 1 3 1 3

≤ τ ≤ cost(C)

1 3

≤ FAIR-OPT + cost(C)

2 3

Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k-center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y, potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT+ cost(C)

9

slide-24
SLIDE 24

Step 2: Rounding x

Define: massh(x, C) :=

j∈C,col(j)=h xij

Claim: Can find integral ˆ x with

⌊massh(x, C)⌋ ≤ massh(ˆ x, C) ≤ ⌈massh(x, C)⌉ and ⌊mass(x, C)⌋ ≤ mass(ˆ x, C) ≤ ⌈mass(x, C)⌉ for all colors h and all clusters C without increasing cost.

Consequence: Error of at most one point per cluster.

ratio of color h in cluster C ∧ = massh(ˆ x, C) |C|

10

slide-25
SLIDE 25

The flow network for one color

1 1 . . . 1 all points with color h . . . all open centers “remainder” −⌊massh(x, C)⌋ arc (i, j) if xij > 0 Want: ⌊massh(x, C)⌋ ≤ massh(ˆ x, C) ≤ ⌈massh(x, C)⌉

11