on the cost of essentially fair clusterings
play

On the cost of essentially fair clusterings Ioana Bercea, Martin - PowerPoint PPT Presentation

On the cost of essentially fair clusterings Ioana Bercea, Martin Gro, Samir Khuller, Aounon Kumar, Clemens Rsner, Daniel Schmidt, Melanie Schmidt Aussois, January 2019 1 Assigning students to schools Example problem Goals Minimize


  1. On the cost of essentially fair clusterings Ioana Bercea, Martin Groß, Samir Khuller, Aounon Kumar, Clemens Rösner, Daniel Schmidt, Melanie Schmidt Aussois, January 2019 1

  2. Assigning students to schools Example problem Goals Minimize maximum distance to school Preserve color ratios 2

  3. Assigning students to schools Example problem Goals Minimize maximum distance to school Preserve color ratios 2

  4. Assigning students to schools Example problem Goals Minimize maximum distance to school Preserve color ratios 2

  5. What is a fair assignment? Given point set P col p color of each point p l h , u h (rational) lower/upper bound on ratio of color h c 1 , . . . , c k ⊆ P k locations (“centers”) Assignment φ : P → { 1 , . . . , k } is fair if l h ≤ |{ p ∈ P | col p = h ∧ φ ( p ) = ℓ }| ≤ u h |{ p ∈ P | φ ( p ) = ℓ }| for all colors h and all centers ℓ . [Chierichetti et al., NIPS 2017] 3

  6. Fair clustering Find k centers and a fair assignment φ minimizing max p ∈ P dist( p , φ ( p )) ( k -center) � dist( p , φ ( p )) ( k -median) p ∈ P � dist 2 ( p , φ ( p )) ( k -means) p ∈ P Also: Facility location, supplier variants 4

  7. State-of-the-Art I Classical (colorblind) clustering Approximation Hardness k -center 2 [G85,HS85] 2 [HN79] facility location 1.488 [L11] 1.463 [GK99] k -median 2.675+ ε [BPRST15] ≈ 1.74 [HN79] k -means 6.357 [ANSW17] 1.0013 [ACKS15,LSW15] [ACKS15] Awasthi, Charikar, Krishnaswamy, Sinop. SoCG 2015. [ANSW17] Ahmadian, Norouzi-Fard, Svensson, Ward. FOCS 2017. [BPRST15] Behsaz, Friggstad, Salavatipour, Sivakumar. ICALP 2015. [G85] Gonzalez. Theoretical Computer Science 1985. [GK99] Guha, Khuller. J. Algorithms 1999. [HN79] Hsu, Nemhauser. Discrete Applied Mathematics 1979. [HS85] Hochbaum, Shmoys. Mathematics of Operations Research 1985. [L11] Li. ICALP 2011. [LSW15] Lee, M. Schmidt, Wright. IPL 2017. 5

  8. State-of-the-Art II Fair clustering Approximation exact ratio fair essentially fair 4 ∗ [CKLV17] k -center 14 [RS18] facility location Θ( t ) ∗ [CKLV17] k -median k -means ∗ at most two colors [CKLV17] Chierichetti, Kumar, Lattanzi, Vassilvitskii. NIPS 2017. [RS18] Rösner, M. Schmidt. ICALP 2018. 6

  9. State-of-the-Art II Fair clustering Approximation exact ratio fair essentially fair 4 ∗ [CKLV17] k -center 14 5 3 facility location 3.488 Θ( t ) ∗ [CKLV17] k -median 4.675 k -means 62.856 ∗ at most two colors [CKLV17] Chierichetti, Kumar, Lattanzi, Vassilvitskii. NIPS 2017. [RS18] Rösner, M. Schmidt. ICALP 2018. 6

  10. State-of-the-Art II Fair clustering Approximation exact ratio fair essentially fair 4 ∗ [CKLV17] k -center 14 5 3 ← today facility location 3.488 Θ( t ) ∗ [CKLV17] k -median 4.675 k -means 62.856 ∗ at most two colors [CKLV17] Chierichetti, Kumar, Lattanzi, Vassilvitskii. NIPS 2017. [RS18] Rösner, M. Schmidt. ICALP 2018. 6

  11. Main result Black-box approach: Any clustering C An essentially fair weakly supervised rounding clustering C ′ with cost( C ′ ) ∈ O (cost( C ) + FAIR-OPT ) A fair LP solution Turns any colorblind clustering into an essentially fair clustering on the same set of centers with slightly increased cost. 7

  12. Main result Black-box approach: Any clustering C An essentially fair weakly supervised rounding clustering C ′ with cost( C ′ ) ∈ O (cost( C ) + FAIR-OPT ) A fair LP solution Turns any colorblind clustering into an essentially fair clustering on the same set of centers with slightly increased cost. For instance: Use a 2-approximate k -center solution. 7

  13. Weakly supervised LP-rounding “Guess” optimum radius τ from O ( | P | 2 ) candidates The classical k -center LP with fairness constraints � x ij = 1 for all j ∈ P i ∈ P x ij ≤ y i for all i ∈ P � y i ≤ k i ∈ P x ij = 0 for all i , j with dist( i , j ) > τ � � � x ij ≤ x ij ≤ u h for all i ∈ P and all colors h ℓ h x ij j ∈ P col( p j )=col h j ∈ P 8

  14. Step 1: Rounding y 1 3 1 3 1 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  15. Step 1: Rounding y 1 3 1 3 1 3 1 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  16. Step 1: Rounding y 1 3 1 3 1 3 1 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  17. Step 1: Rounding y 1 1 3 3 1 3 1 3 1 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  18. Step 1: Rounding y 1 1 3 3 1 3 1 3 1 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  19. Step 1: Rounding y 1 1 3 3 1 3 1 3 2 1 3 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  20. Step 1: Rounding y 1 1 3 3 1 3 1 3 2 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  21. Step 1: Rounding y 1 1 3 3 1 3 ≤ τ 1 3 2 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  22. Step 1: Rounding y 1 1 3 3 1 3 ≤ τ ≤ cost( C ) 1 3 2 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  23. Step 1: Rounding y 1 1 3 3 ≤ FAIR-OPT + cost( C ) 1 3 ≤ τ ≤ cost( C ) 1 3 2 3 Use centers ( ) from given C as integer ˆ y for instance: Use 2-approximation for classical k -center (Fractionally) assign each point ( ) to its center’s ( ) center Obtain integer ˆ y , potentially fractional fair ˆ x Maximum radius bounded by FAIR-OPT + cost( C ) 9

  24. Step 2: Rounding x Define: mass h ( x , C ) := � j ∈ C , col( j )= h x ij Claim: Can find integral ˆ x with ⌊ mass h ( x , C ) ⌋ ≤ mass h (ˆ x , C ) ≤ ⌈ mass h ( x , C ) ⌉ and ⌊ mass( x , C ) ⌋ ≤ mass(ˆ x , C ) ≤ ⌈ mass( x , C ) ⌉ for all colors h and all clusters C without increasing cost. Consequence: Error of at most one point per cluster. = mass h (ˆ x , C ) ratio of color h in cluster C ∧ | C | 10

  25. The flow network for one color 1 −⌊ mass h ( x , C ) ⌋ 1 “remainder” . . . . . . 1 all open centers all points with color h arc ( i , j ) if x ij > 0 Want: ⌊ mass h ( x , C ) ⌋ ≤ mass h (ˆ x , C ) ≤ ⌈ mass h ( x , C ) ⌉ 11

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend