Al Algo gori rithms thms fo for r in inst stan ance ce-st - - PowerPoint PPT Presentation

β–Ά
al algo gori rithms thms fo for r in inst stan ance ce st
SMART_READER_LITE
LIVE PREVIEW

Al Algo gori rithms thms fo for r in inst stan ance ce-st - - PowerPoint PPT Presentation

Al Algo gori rithms thms fo for r in inst stan ance ce-st stable able an and pe d pert rtur urbat bation ion-res resili ilient ent pr prob oblems ems Haris Angelidakis, TTIC Konstantin Makarychev, Northwestern Yury


slide-1
SLIDE 1

Al Algo gori rithms thms fo for r in inst stan ance ce-st stable able an and pe d pert rtur urbat bation ion-res resili ilient ent pr prob

  • blems

ems

Haris Angelidakis, TTIC Konstantin Makarychev, Northwestern Yury Makarychev, TTIC Aravindan Vijayaraghavan, Northwestern

QTW on Beyond Worst-Case Analysis Northwestern, May 24, 2017

slide-2
SLIDE 2

Motivation

  • Practice: Need to solve clustering and combinatorial
  • ptimization problems.
  • Theory:
  • Many problems are NP-hard. Cannot solve them

exactly.

  • Design approximation algorithms for worst case.

Can we get better algorithms for real-world instances than for worst-case instances?

slide-3
SLIDE 3

Motivation

  • Answer: Yes!

When we solve problems that arise in practice, we often get much better approximation than it is theoretically possible for worst case instances.

  • Want to design algorithms with provable

performance guarantees for solving real-world instances.

slide-4
SLIDE 4

Motivation

  • Need a model for real-world instances.
  • Many different models have been proposed.
  • It’s unrealistic that one model will capture all

instances that arise in different applications.

slide-5
SLIDE 5

This work

  • Assumption: instances are stable/perturbation-

resilient

  • Consider several problems:
  • 𝑙-means
  • 𝑙-median
  • Max Cut
  • Multiway Cut
  • Get exact polynomial-time algorithms
slide-6
SLIDE 6

𝑙-means and 𝑙-median

Given a set of points π‘Œ, distance 𝑒(β‹…,β‹…) on π‘Œ, and 𝑙 Partition π‘Œ into 𝑙 clusters 𝐷1, … , 𝐷𝑙 and find a β€œcenter” 𝑑𝑗 in each 𝐷𝑗 so as to minimize ෍

𝑗=1 𝑙

෍

π‘£βˆˆπ·π‘—

𝑒(𝑣, 𝑑𝑗) ෍

𝑗=1 𝑙

෍

π‘£βˆˆπ·π‘—

𝑒 𝑣, 𝑑𝑗 2 (𝑙-means) (𝑙-median)

slide-7
SLIDE 7

Multiway Cut

Given

  • a graph 𝐻 = (π‘Š, 𝐹, π‘₯)
  • a set of terminals 𝑒1, … , 𝑒𝑙

Find a partition of π‘Š into sets 𝑇1, … , 𝑇𝑙 that minimizes the weight of cut edges s.t. 𝑒𝑗 ∈ 𝑇𝑗.

𝑒1 𝑒2 𝑒3 𝑒4

slide-8
SLIDE 8

Instance-stability & perturbation- resilience

➒ Consider an instance ℐ of an optimization or clustering problem. ➒ ℐ′ is a 𝛿-perturbation of ℐ if it can be obtained from ℐ by β€œperturbing the parameters” β€” multiplying each parameter by a number from 1 to 𝛿.

  • π‘₯ 𝑓 ≀ π‘₯β€² 𝑓 ≀ 𝛿 β‹… π‘₯ 𝑓
  • 𝑒(𝑣, 𝑀) ≀ 𝑒′ 𝑣, 𝑀 ≀ 𝛿 β‹… 𝑒(𝑣, 𝑀)
slide-9
SLIDE 9

Instance-stability & perturbation- resilience

An instance ℐ of an optimization or clustering problem is perturbation-resilient/instance-stable if the optimal solution remains the same when we perturb the instance: every Ξ³-perturbation ℐ′ has the same optimal solution as ℐ

slide-10
SLIDE 10

Instance-stability & perturbation- resilience

Every Ξ³-perturbation ℐ′ has the same optimal solution as ℐ

  • In practice, we are interested in solving instances

where the optimal solution β€œstands out” among all solutions [Bilu, Linial]

  • Objective function is an approximation to the β€œtrue”
  • bjective function.
  • β€œPractically interesting instance” β‡’ it is stable
slide-11
SLIDE 11

Results

slide-12
SLIDE 12

History

Instance-stability & perturbation-resilience was introduced

in discrete optimization: by Bilu and Linial `10 in clustering: by Awasthi, Blum, and Sheffet `12

slide-13
SLIDE 13

Results (clustering)

𝛿 β‰₯ 3

𝒍-center, 𝒍-means, 𝒍-median

[Awasthi, Blum, Sheffet `12]

𝛿 β‰₯ 1 + 2

𝒍-center, 𝒍-means, 𝒍-median

[Balcan, Liang `13]

𝛿 β‰₯ 2

  • sym. /asym.

𝒍-center

[Balcan, Haghtalab, White `16]

𝛿 β‰₯ 2

𝒍-means, 𝒍-median

[AMM `17]

slide-14
SLIDE 14

Results (optimization)

𝛿 β‰₯ π‘‘π‘œ

Max Cut

[Bilu, Linial `10]

𝛿 β‰₯ 𝑑 π‘œ

Max Cut

[Bilu, Daniely, Linial, Saks `13]

𝛿 β‰₯ 𝑑 log π‘œ log log π‘œ

Max Cut

[MMV `13]

𝛿 β‰₯ 4

Multiway

[MMV `13]

𝛿 β‰₯ 2 βˆ’ 2/𝑙

Multiway

[AMM `17]

slide-15
SLIDE 15

Results (optimization)

Our algorithms are robust.

  • Find the optimal solution, if the instance is stable.
  • Find an optimal solution or detects that the instance is

not stable, otherwise.

  • Never output an incorrect answer.

Solve weakly stable instances.

Assume that when we perturb the instance

  • the optimal solution changes only slightly, or
  • there is a core that changes only slightly.
slide-16
SLIDE 16

[Balcan, Haghtalab, White `16] No polynomial-time algorithm for (2 βˆ’ 𝜁)-perturbation-resilient instances

  • f 𝑙-center (𝑂𝑄 β‰  𝑆𝑄).

[Ben-David, Reyzin `14] No polynomial-time algorithm for instances of 𝑙-means, 𝑙-median, 𝑙-center satisfying (2 βˆ’ 𝜁)-center proximity property (𝑄 β‰  𝑂𝑄).

Hardness results for center-based

  • bejctives
slide-17
SLIDE 17

Hardness results for optimization problems

Set Cover, Vertex Cover, Min 2-Horn Deletion

There is no robust algorithm for 𝑃(π‘œ1βˆ’πœ)-stable instances unless P = NP [AMM `17].

Provide evidence that [MMV `13, AMM `17]

  • No robust algorithm for Max Cut when

𝛿 < 𝑃 log π‘œ log log π‘œ

  • Multiway cut is hard when 𝛿 < 4

3 βˆ’ 𝑃 1 𝑙 .

slide-18
SLIDE 18

Algorithm for Clustering Problems

slide-19
SLIDE 19

Center proximity property

[Awasthi, Blum, Sheffet `12] A clustering 𝐷1, …, 𝐷𝑙 with centers 𝑑1, …, 𝑑𝑙 satisfies the center proximity property if for every π‘ž ∈ 𝐷𝑗: 𝑒 π‘ž, 𝑑

π‘˜

> 𝛿 𝑒 π‘ž, 𝑑𝑗 𝑑𝑗 𝐷

π‘˜

𝐷𝑗 𝑑

π‘˜

π‘ž

slide-20
SLIDE 20

i. 𝛿-perturbation resilience β‡’ 𝛿-center proximity ii. 2-center proximity β‡’ each cluster is a subtree of the MST

  • iii. use single-linkage + DP to find 𝐷1, … , 𝐷𝑙

Plan [AMM `17]

slide-21
SLIDE 21

Perturbation resilience: the optimal clustering doesn’t change when we perturb the distances. 𝑒 𝑣, 𝑀 /𝛿 ≀ 𝑒′ 𝑣, 𝑀 ≀ 𝑒(𝑣, 𝑀) [ABS `12] 𝑒′(β‹…,β‹…) doesn’t have to be a metric [AMM `17] 𝑒′(β‹…,β‹…) is a metric

Metric perturbation resilience is a more natural notion.

Perturbation resilience β‡’ center proximity

slide-22
SLIDE 22

Assume center proximity doesn’t hold. Then 𝑒 π‘ž, 𝑑

π‘˜ ≀ 𝛿 𝑒 π‘ž, 𝑑𝑗 .

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷

π‘˜

𝐷𝑗 𝑑

π‘˜

π‘ž

slide-23
SLIDE 23

Assume center proximity doesn’t hold.

  • Let 𝑒′ π‘ž, 𝑑

π‘˜ = 𝑒 π‘ž, 𝑑𝑗 β‰₯ π›Ώβˆ’1𝑒(π‘ž, 𝑑 π‘˜).

  • Don’t change other distances.
  • Consider the shortest-path closure.

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷𝑗 π‘ž 𝐷

π‘˜

𝑑

π‘˜

slide-24
SLIDE 24

Assume center proximity doesn’t hold.

  • Let 𝑒′ π‘ž, 𝑑

π‘˜ = 𝑒 π‘ž, 𝑑𝑗 β‰₯ π›Ώβˆ’1𝑒(π‘ž, 𝑑 π‘˜).

  • Don’t change other distances.
  • Consider the shortest-path closure.

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷𝑗 π‘ž 𝐷

π‘˜

𝑑

π‘˜

slide-25
SLIDE 25

Assume center proximity doesn’t hold.

  • Let 𝑒′ π‘ž, 𝑑

π‘˜ = 𝑒 π‘ž, 𝑑𝑗 β‰₯ π›Ώβˆ’1𝑒(π‘ž, 𝑑 π‘˜).

  • Don’t change other distances.
  • Consider the shortest-path closure.

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷𝑗 π‘ž 𝐷

π‘˜

𝑑

π‘˜

This is a 𝛿-perturbation.

slide-26
SLIDE 26

Distances inside clusters 𝑫𝒋 and π‘«π’Œ don’t change.

Consider 𝑣, 𝑀 ∈ 𝐷𝑗. 𝑒′ 𝑣, 𝑀 = min 𝑒 𝑣, 𝑀 , 𝑒 𝑣, π‘ž + 𝑒′ π‘ž, 𝑑

π‘˜ + 𝑒 𝑑 π‘˜, 𝑀

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷𝑗 π‘ž 𝐷

π‘˜

𝑑

π‘˜

𝑣 𝑀

slide-27
SLIDE 27

Distances inside clusters 𝑫𝒋 and π‘«π’Œ don’t change.

Consider 𝑣, 𝑀 ∈ 𝐷𝑗. 𝑒′ 𝑣, 𝑀 = min 𝑒 𝑣, 𝑀 , 𝑒 𝑣, π‘ž + 𝑒′ π‘ž, 𝑑

π‘˜ + 𝑒 𝑑 π‘˜, 𝑀

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷𝑗 π‘ž 𝐷

π‘˜

𝑑

π‘˜

𝑣 𝑀

slide-28
SLIDE 28

Since the instance is 𝛿-stable, 𝐷1, … , 𝐷𝑙 must be the unique

  • ptimal solution for distance 𝑒′.

Still, 𝑑𝑗 and 𝑑

π‘˜ are optimal centers for 𝐷𝑗 and 𝐷 π‘˜.

𝑒′ π‘ž, 𝑑𝑗 = 𝑒′ π‘ž, 𝑑

π‘˜ β‡’ can move π‘ž from 𝐷𝑗 to 𝐷 π‘˜

Perturbation resilience β‡’ center proximity [ABS `12, AMM `17]

𝑑𝑗 𝐷𝑗 π‘ž 𝐷

π‘˜

𝑑

π‘˜

slide-29
SLIDE 29

[ABS `12] 2-center proximity β‡’ every 𝑣 ∈ 𝐷𝑗 is closer to 𝑑𝑗 than to any 𝑀 βˆ‰ 𝐷𝑗 Assume the path from 𝑣 ∈ 𝐷𝑗 to 𝑑𝑗 in MST, leaves 𝐷𝑗.

Each cluster is a subtree of MST

𝑣 𝑑𝑗 𝑀

slide-30
SLIDE 30

[ABS `12] 2-center proximity β‡’ every 𝑣 ∈ 𝐷𝑗 is closer to 𝑑𝑗 than to any 𝑀 βˆ‰ 𝐷𝑗 Assume the path from 𝑣 ∈ 𝐷𝑗 to 𝑑𝑗 in MST, leaves 𝐷𝑗.

Each cluster is a subtree of MST

𝑣 𝑑𝑗 𝑀

slide-31
SLIDE 31

Root MST at some 𝑠. π‘ˆ 𝑣 is the subtree rooted at 𝑣. cost𝑣(π‘˜, 𝑑): the cost of the partitioning of π‘ˆ 𝑣

  • into π‘˜ clusters (subtrees)
  • so that 𝑑 is the center of the cluster containing 𝑣.

Dynamic programming algorithm

𝑣 𝑠 π‘ˆ(𝑣) 𝑑

slide-32
SLIDE 32

Fill out the DP table bottom-up. Example: 𝑙-median, 𝑣 has 2 children 𝑣1 and 𝑣2.

Dynamic programming algorithm

𝑣 π‘ˆ(𝑣) 𝑣2 𝑣1

slide-33
SLIDE 33

Fill out the DP table bottom-up. Example: 𝑙-median, 𝑣 has 2 children 𝑣1 and 𝑣2.

Dynamic programming algorithm

𝑣 π‘ˆ(𝑣)

slide-34
SLIDE 34

Fill out the DP table bottom-up. Example: 𝑙-median, 𝑣 has 2 children 𝑣1 and 𝑣2.

Dynamic programming algorithm

𝑣 π‘ˆ(𝑣)

slide-35
SLIDE 35

Fill out the DP table bottom-up. Example: 𝑙-median, 𝑣 has 2 children 𝑣1 and 𝑣2.

Dynamic programming algorithm

𝑣 π‘ˆ(𝑣)

slide-36
SLIDE 36

𝑣, 𝑣1, 𝑣2 lie in the same cluster cost𝑣 π‘˜, 𝑑 = 𝑒 𝑣, 𝑑 + cost𝑣1 π‘˜1, 𝑑 + cost𝑣2 π‘˜2, 𝑑 where π‘˜1 + π‘˜2 = π‘˜ + 1 𝑣, 𝑣1, 𝑣2 lie in different clusters cost𝑣 π‘˜, 𝑑 = 𝑒 𝑣, 𝑑 + cost𝑣1 π‘˜1, 𝑑1 + cost𝑣2 π‘˜2, 𝑑2 where π‘˜1 + π‘˜2 = π‘˜ βˆ’ 1, 𝑑1 ∈ π‘ˆ 𝑣1 , 𝑑2 ∈ π‘ˆ 𝑣2 𝑣, 𝑣1 lie in the same clusters, 𝑣2 in a different

cost𝑣 π‘˜, 𝑑 = 𝑒 𝑣, 𝑑 + cost𝑣1 π‘˜1, 𝑑 + cost𝑣2 π‘˜1, 𝑑2

where π‘˜1 + π‘˜2 = π‘˜, 𝑑2 ∈ π‘ˆ 𝑣2

Dynamic programming algorithm

slide-37
SLIDE 37

Algorithms for Max Cut and Multiway Cut

slide-38
SLIDE 38

Algorithms for Max Cut and Multiway Cut [MMV `13]

Write an SDP or LP relaxation for the problem. Show that it is integral if the instance is 𝛿-stable.

solve the relaxation if the SDP/LP solution is integral return the solution else return that the instance is not 𝛿-stable

The algorithm is robust: it never returns an incorrect answer.

slide-39
SLIDE 39

Multiway Cut

Write the relaxation for Multiway Cut by

Călinescu, Karloff, and Rabani [CKR `98]

To get an 𝛽-approximation, we would design a rounding scheme with

Pr 𝑣, 𝑀 is cut ≀ 𝛽 𝑒 𝑣, 𝑀

Then

𝔽 weight of cut edges ≀ 𝛽 ෍

𝑣,𝑀 ∈𝐹

π‘₯𝑣𝑀𝑒(𝑣, 𝑀)

slide-40
SLIDE 40

Multiway Cut: complementary objective

If we want to maximize the weight of uncut edges, we would we would design a rounding scheme with

Pr 𝑣, 𝑀 is not cut β‰₯ 𝛾 (1 βˆ’ 𝑒 𝑣, 𝑀 )

Then

𝔽 wt. of uncut edges β‰₯ 𝛾 ෍

𝑣,𝑀 ∈𝐹

π‘₯𝑣𝑀(1 βˆ’ 𝑒 𝑣, 𝑀 )

slide-41
SLIDE 41

Write an LP or SDP relaxation for the problem. Design a rounding procedure s.t.

Pr 𝑣, 𝑀 is cut ≀ 𝛽 𝑒 𝑣, 𝑀 Pr 𝑣, 𝑀 is not cut β‰₯ 𝛾 1 βˆ’ 𝑒 𝑣, 𝑀

  • r

Pr 𝑣, 𝑀 is cut β‰₯ 𝛾 𝑒 𝑣, 𝑀 Pr 𝑣, 𝑀 is not cut ≀ 𝛽 1 βˆ’ 𝑒 𝑣, 𝑀

Then the relaxation for 𝛿-stable instances is integral, when 𝛿 β‰₯ 𝛽/𝛾

General approach to solving stable instances of graph partitioning

minimization maximization

!

slide-42
SLIDE 42

Solving Max Cut [MMV `13]

Use the Goemans–Williamson SDP relaxation with β„“2

2-triangle inequalities.

Design a rounding procedure with 𝛽 𝛾 = 𝑃 log π‘œ log log π‘œ , which is a combination of two algorithms:

  • the algorithm for Sparsest Cut with Nonuniform Demands

by Arora, Lee, and Naor `08,

  • the algorithm for Min Uncut by Agarwal, Charikar,

Makarychev, M `05

slide-43
SLIDE 43

Solving Multiway Cut [AMM `17]

Rounding procedures for Multiway Cut by

  • Sharma and VondrΓ‘k `14
  • Buchbinder, Schwartz, and Weizman `17

are highly non-trivial. Show: need a rounding procedure only for LP solutions that are almost integral. Design a simple rounding procedure with

𝛽 𝛾 = 2 βˆ’ 2 𝑙 .

slide-44
SLIDE 44

Summary

  • Algorithms for 2-perturbation-resilient instances of

problems with a natural center based objective: 𝑙-means, 𝑙-median, facility location

  • Robust algorithms for 𝑃

log π‘œ log log π‘œ -stable instance of Max Cut and 2 βˆ’

2 𝑙 -stable instances of

Multiway Cut.

  • Negative results for stable instances of Max Cut,

Multiway Cut, Max 𝑙-Cut, Multi Cut, Set Cover, Vertex Cover, Min 2-Horn Deletion.