Approximate Constraint Satisfaction requires Large LP Relaxations - - PowerPoint PPT Presentation
Approximate Constraint Satisfaction requires Large LP Relaxations - - PowerPoint PPT Presentation
Approximate Constraint Satisfaction requires Large LP Relaxations David Steurer Cornell Siu On Chan James R. Lee Prasad Raghavendra MSR Washington Berkeley TCS+ Seminar, December 2013 Max Cut, Traveling Salesman, best-known
best-known (approximation) algorithms for many combinatorial optimization problems:
Max Cut, Traveling Salesman, Sparsest Cut, Steiner Tree, โฆ
common core = linear / semidefinite programming (LP/SDP) LP / SDP relaxations
particular kind of reduction from hard problem to LP/SDP running time: polynomial in size of relaxation
what guarantees are possible for approximation and running time?
example: basic LP relaxation for Max Cut
maximize 1 ๐น ๐๐๐
๐๐โ๐น
subject to ๐๐๐ โ 0,1 ๐๐๐ โ ๐๐๐ โ ๐๐๐ โค 0 ๐๐๐ + ๐๐๐ + ๐๐๐ โค 2 intended solution ๐๐ฆ ๐๐ = 1, if ๐ฆ๐ โ ๐ฆ๐, 0,
- therwise.
๐ ๐3 inequalities depend only on instances size (but not instance itself) approximation guarantee
- ptimal value of instance
vs.
- ptimal value of LP relaxation
๐๐๐ โ 0,1
integer linear program
Max Cut: Given a graph, find bipartition ๐ฆ โ ยฑ1 ๐ that cuts as many edges as possible
(relax integrality constraint) ๐ฆ๐ = 1 ๐ฆ๐ = โ1
challenges
many possible relaxations for same problem small difference syntactically ๏ big difference for guarantees goal: identify โrightโ polynomial-size relaxation hierarchies = systematic ways to generate relaxations best-known: Sherali-Adams (LP), sum-of-squares/Lasserre (SDP); best possible? goal: compare hierarchies and general LP relaxations
- ften: more complicated/larger relaxations ๏ better approximation
P โ NP predicts limits of this approach; can we confirm them? goal: understand computational power of relaxations Rule out that poly-size LP relaxations show ๐ = ๐๐?
hierarchies
great variety (sometimes different ways to apply same hierarchy) current champions: SheraliโAdams (LP) & sum-of-squares / Lasserre (SDP)
[LovรกszโSchrijver, SheraliโAdams, Parrilo / Lasserre]
connections to proof complexity (Nullstellensatz and Positivstellensatz refutations) lower bounds Sherali-Adams requires size 2๐ฮฉ 1 to beat ratio ยฝ for Max Cut sum-of-squares requires size 2ฮฉ ๐ to beat ratio 7 8 for Max 3-Sat
[MathieuโFernandez de la Vega CharikarโMakarychevโMakarychev] [Grigoriev, Schoenebeck]
upper bounds implicit: many algorithms (e.g., Max Cut and Sparsest Cut) explicit: Coloring, Unique Games, Max Bisection
[Chlamtac, Arora-Barak-S., Barak-Raghavendra-S., Raghavendra-Tan] [Goemans-Williamson, Arora-Rao-Vazirani]
lower bounds for general LP formulations (extended formulations)
characterization; symmetric formulations for TSP & matching [Yannakakisโ88]
general, exact formulations for TSP & Clique
[FioriniโMassarโPokutta โTiwaryโde Wolfโ12]
approximate formulations for Clique
[BraunโFioriniโPokuttaโS.โ12 BravermanโMoitraโ13]
general, exact formulation for maximum matching [Rothvoรโ13]
geometric idea: complicated polytopes can be
projections of simple polytopes
universality result for LP relaxations of Max CSPs general polynomial-size LP relaxations are no more powerful than polynomial-size Sherali-Adams relaxations
also holds for almost quasi-polynomial size
confirm non-trivial prediction of Pโ NP: poly-size LP relaxations cannot achieve 0.99 approximation for Max Cut, Max 3-Sat, or Max 2-Sat (NP-hard approximations) approximability and UGC: poly-size LP relaxation cannot refute Unique Games Conjecture
(cannot improve current Max CSP approximations)
concrete consequences
[this talk]
separation of LP relaxation and SDP relaxation: poly-size LP relaxations are strictly weaker than SDP relaxations for Max Cut and Max 2Sat
unconditional lower bound in powerful computational model
universality result for LP relaxations of Max CSPs general polynomial-size LP relaxations are no more powerful than polynomial-size Sherali-Adams relaxations
also holds for almost quasi-polynomial size [this talk]
for concreteness: focus on Max Cut compare: general ๐ 1โ๐ ๐-size LP relaxation for Max Cut๐
- vs. ๐๐-size Sherali-Adams relaxations for Max Cut๐
notation: cut๐ป ๐ฆ = fraction of edges that bipartition ๐ฆ cuts in ๐ป Max Cut๐ = Max Cut instances / graphs on ๐ vertices
general LP relaxation for ๐๐๐ฒ ๐๐ฏ๐ฎ๐จ
linearization ๐ป โฆ ๐๐ป: โ๐ โ โ linear such that ๐๐ป ๐x = cut๐ป ๐ฆ ๐ฆ โฆ ๐๐ฆ โ โ๐ polytope of size R ๐
๐ โ โ๐, at most ๐ facets,
๐๐ฆ ๐ฆโ ยฑ1 ๐ โ ๐
๐
example linearization
๐๐ป ๐ =
1 ๐น
๐๐๐
๐๐โ๐น
๐๐ฆ ๐๐ = 1, if ๐ฆ๐ โ ๐ฆ๐, 0,
- therwise.
๐
๐
โ๐
same polytope for all instances of size ๐ makes sense because solution space for Max Cut depends only on ๐
๐๐ฆ .
computing with size-๐บ LP relaxation ๐ input
graph G
- n n vertices
computation
maximize ๐๐ป ๐ subject to ๐ โ ๐
๐
- utput
value โ ๐ป = max
๐โ๐ ๐๐ป ๐
approximation ratio ๐ฝ ๐, ๐ก -approximation Opt ๐ป โค ๐ก โ โ ๐ป โค ๐ for all ๐ป โ Max Cut๐ โ ๐ป โค ๐ฝ โ Opt ๐ป for all ๐ป โ Max Cut๐
always upper-bounds Opt G how far in the worst-case?
general computational modelโhow to prove lower bounds?
poly(๐)-time computation
geometric characterization (ร la Yannakakisโ88)
every size-R LP relaxation ๐ for Max Cu๐ฎ๐ corresponds to nonnegative functions ๐๐, โฆ , ๐๐บ: ยฑ1 ๐ โ โโฅ0 such that โ ๐ป โค ๐ iff ๐ โ cut๐ป = ๐๐ ๐๐
๐
and ๐1, โฆ , ๐๐ โฅ 0 for all ๐ป โ Max Cut๐ certifies cut๐ป โค ๐ over ยฑ1 ๐ canonical linear program
- f size ๐
example 2๐ standard basis functions correspond to exact 2๐-size LP relaxation for Max Cut๐
geometric characterization (ร la Yannakakisโ88)
every size-R LP relaxation ๐ for Max Cu๐ฎ๐ corresponds to nonnegative functions ๐๐, โฆ , ๐๐บ: ยฑ1 ๐ โ โโฅ0 such that โ ๐ป โค ๐ iff ๐ โ cut๐ป = ๐๐ ๐๐
๐
and ๐1, โฆ , ๐๐ โฅ 0 for all ๐ป โ Max Cut๐ connection to Sherali-Adams hierarchy ๐๐-size Sherali-Adams relaxation for Max Cut๐ exactly corresponds to nonnegative combinations of nonnegative ๐-juntas on ยฑ1 ๐)
๐-junta = function on ยฑ1 ๐ depends on โค d coordinates intuition: all inequalities for functions on ยฑ1 n with local proofs
geometric characterization (ร la Yannakakisโ88)
every size-R LP relaxation ๐ for Max Cu๐ฎ๐ corresponds to nonnegative functions ๐๐, โฆ , ๐๐บ: ยฑ1 ๐ โ โโฅ0 such that โ ๐ป โค ๐ iff ๐ โ cut๐ป = ๐๐ ๐๐
๐
and ๐1, โฆ , ๐๐ โฅ 0 for all ๐ป โ Max Cut๐
to rule out (c,s)-approx. by size-R LP relaxation, show:
for every size-๐ nonnegative cone, exists ๐ป โ Max Cut๐ with Opt ๐ป โค ๐ก but ๐ โ cut๐ป outside of cone
๐ โ cut๐ป
cone ๐1, โฆ , ๐๐ = ๐๐ ๐๐
๐
๐๐ โฅ 0
lower-bound for SheraliโAdams relaxations of size ๐๐ lower-bound for general LP relaxations of size ๐ 1โ๐ ๐ ๐-juntas ๐๐-juntas non-spiky general lower-bounds for size-๐๐ nonneg. cones with restricted functions
from ๐-juntas to ๐๐ป-juntas
let ๐1, โฆ , ๐๐ be nonneg. ๐๐-juntas on ยฑ1 ๐ for ๐ = ๐ 1โ10๐ ๐ want: subset ๐ โ ๐ of size ๐ โ ๐๐ where functions behave like ๐-juntas claim: there exists subset ๐ โ [๐] of size ๐ = ๐๐ such that ๐พ๐ โฉ ๐ โค ๐ for all ๐ โ ๐ proof: choose ๐ at random โ ๐ โฉ ๐พ๐ > ๐ โค
๐ ๐ โ ๐พ๐ ๐
= ๐โ 1โ2๐ ๐
[n]
๐พ1 ๐พ2 ๐พ3 ๐พ4 ๐พ๐๐/2
S
let ๐พ1, โฆ , ๐พ๐ be junta-coordinates of ๐1, โฆ , ๐๐ ๏ can afford union bound over ๐ junta sets ๐พ1, โฆ , ๐พ๐
lower-bound for SheraliโAdams relaxations of size ๐๐ lower-bound for general LP relaxations of size ๐ 1โ๐ ๐ ๐-juntas ๐๐-juntas non-spiky general lower-bounds for size-๐๐ nonneg. cones with restricted functions
from ๐๐ป-juntas to non-spiky functions
let ๐ be a nonnegative function on ยฑ1 ๐with ๐ฝ๐ = 1 non-spiky: max ๐ โค 2๐ข junta structure lemma: can approximate ๐ by nonnegative ๐๐-junta ๐โฒ, error ๐ = ๐ โ ๐โฒ satisfies ๐ ๐ 2 โค ๐ข๐/๐๐ for ๐ < ๐ proof: nonnegative function ๐ ๏ probability distribution over ยฑ1 ๐, +1/-1 rand. variables ๐1, โฆ , ๐๐ (dependent)
small low-degree Fourier coefficients
non-spiky ๏ entropy ๐ผ ๐1, โฆ , ๐๐ โฅ ๐ โ ๐ข want: ๐พ โ [๐] of size ๐๐ such that โ๐ โ ๐พ. ๐๐ โฃ ๐๐พ โ uniform, that is, S โ ๐ผ ๐๐ ๐๐พ โค ๐พ for ๐พ =
๐ข๐ ๐๐
construction: start with ๐พ = โ ; as long as bad ๐ exists, update ๐พ โ ๐พ โช ๐ analysis: total entropy defect โค ๐ข ๏ stop after
๐ข ๐พ iterations ๏ ๐พ โค ๐๐ข ๐พ = ๐๐
( ๐ < ๐)
lower-bound for SheraliโAdams relaxations of size ๐๐ lower-bound for general LP relaxations of size ๐ 1โ๐ ๐ ๐-juntas ๐๐-juntas non-spiky general lower-bounds for size-๐๐ nonneg. cones with restricted functions
from non-spiky functions to general functions
let ๐1, โฆ , ๐๐ be general nonneg. functions on ยฑ1 ๐ for ๐ = ๐๐ claim: exists nonneg. ๐1
โฒ, โฆ , ๐๐ โฒ such that ๐๐ โฒ โค ๐2๐, ๐ฝ๐๐ โฒ = 1 and
cone ๐1, โฆ , ๐๐ โ cone(๐1
โฒ, โฆ , ๐๐ โฒ )
proof: truncate functions carefully intuition: ๐ โ cut๐ป is non-spiky. Thus, spiky ๐๐ donโt help!
non-spiky
lower-bound for general LP relaxations of size ๐ 1โ๐ ๐ lower-bound for SheraliโAdams relaxations of size ๐๐ ๐-juntas ๐๐-juntas non-spiky general lower-bounds for nonneg. cones of size ๐๐ with restricted functions
- pen problems
- 1. LP size ๐๐๐ป
- 2. beyond CSPs (e.g., TSP)
- 3. SDPs
Lower-bound for general LP relaxations of size ๐ 1โ๐ ๐ ๐-juntas ๐๐-juntas non-spiky general
- pen problems
- 1. LP size ๐๐๐ป
- 2. beyond CSPs (e.g., TSP)
- 3. SDPs