Constructive Discrepancy Minimization for Convex Sets Thomas - - PowerPoint PPT Presentation

constructive discrepancy minimization for convex sets
SMART_READER_LITE
LIVE PREVIEW

Constructive Discrepancy Minimization for Convex Sets Thomas - - PowerPoint PPT Presentation

Constructive Discrepancy Minimization for Convex Sets Thomas Rothvoss UW Seattle Discrepancy theory Set system S = { S 1 , . . . , S m } , S i [ n ] i S b b b Discrepancy theory 1 Set system S = { S 1 , . . . , S m } , S i


slide-1
SLIDE 1

Constructive Discrepancy Minimization for Convex Sets

Thomas Rothvoss

UW Seattle

slide-2
SLIDE 2

Discrepancy theory

◮ Set system S = {S1, . . . , Sm}, Si ⊆ [n]

i S

slide-3
SLIDE 3

Discrepancy theory

◮ Set system S = {S1, . . . , Sm}, Si ⊆ [n] ◮ Coloring χ : [n] → {−1, +1}

i S

b b

−1

b

+1 −1

slide-4
SLIDE 4

Discrepancy theory

◮ Set system S = {S1, . . . , Sm}, Si ⊆ [n] ◮ Coloring χ : [n] → {−1, +1} ◮ Discrepancy

disc(S) = min

χ:[n]→{±1} max S∈S

  • i∈S

χ(i)

  • .

i S

b b

−1

b

+1 −1

slide-5
SLIDE 5

Discrepancy theory

◮ Set system S = {S1, . . . , Sm}, Si ⊆ [n] ◮ Coloring χ : [n] → {−1, +1} ◮ Discrepancy

disc(S) = min

χ:[n]→{±1} max S∈S

  • i∈S

χ(i)

  • .

i S

b b

−1

b

+1 −1 Known results:

◮ n sets, n elements: disc(S) = O(√n) [Spencer ’85]

slide-6
SLIDE 6

Discrepancy theory

◮ Set system S = {S1, . . . , Sm}, Si ⊆ [n] ◮ Coloring χ : [n] → {−1, +1} ◮ Discrepancy

disc(S) = min

χ:[n]→{±1} max S∈S

  • i∈S

χ(i)

  • .

i S

b b

−1

b

+1 −1 Known results:

◮ n sets, n elements: disc(S) = O(√n) [Spencer ’85] ◮ Every element in ≤ t sets: disc(S) < 2t [Beck & Fiala ’81]

slide-7
SLIDE 7

Discrepancy theory

◮ Set system S = {S1, . . . , Sm}, Si ⊆ [n] ◮ Coloring χ : [n] → {−1, +1} ◮ Discrepancy

disc(S) = min

χ:[n]→{±1} max S∈S

  • i∈S

χ(i)

  • .

i S

b b

−1

b

+1 −1 Known results:

◮ n sets, n elements: disc(S) = O(√n) [Spencer ’85] ◮ Every element in ≤ t sets: disc(S) < 2t [Beck & Fiala ’81]

Main method: Find a partial coloring χ : [n] → {0, ±1}

◮ low discrepancy maxS∈S |χ(S)| ◮ |supp(χ)| ≥ Ω(n)

slide-8
SLIDE 8

Discrepancy theory (2)

Lemma (Spencer)

For m set on n ≤ m elements there is a partial coloring of discrepancy O(

  • n log 2m

n ). ◮ Run argument log n times ◮ Total discrepancy is

√n +

  • n/2 +
  • n/22 + . . . + 1 = O(√n)
slide-9
SLIDE 9

Thm of Spencer-Gluskin-Giannopolous

(−1, −1) (1, −1) (−1, 1) (1, 1)

slide-10
SLIDE 10

Thm of Spencer-Gluskin-Giannopolous

Goal: For K :=

  • x ∈ Rn : |

j∈Si xj| ≤ 100√n ∀i ∈ [n]

  • K
slide-11
SLIDE 11

Thm of Spencer-Gluskin-Giannopolous

Goal: For K :=

  • x ∈ Rn : |

j∈Si xj| ≤ 100√n ∀i ∈ [n]

  • find a point x ∈ {−1, 1}n ∩ K

K x

slide-12
SLIDE 12

Thm of Spencer-Gluskin-Giannopolous

Goal: For K :=

  • x ∈ Rn : |

j∈Si xj| ≤ 100√n ∀i ∈ [n]

  • find a point x ∈ {−1, 0, 1}n ∩ K with |supp(x)| ≥ Ω(n).

K x

slide-13
SLIDE 13

Thm of Spencer-Gluskin-Giannopolous

Goal: For K :=

  • x ∈ Rn : |

j∈Si xj| ≤ 100√n ∀i ∈ [n]

  • find a point x ∈ {−1, 0, 1}n ∩ K with |supp(x)| ≥ Ω(n).

K

≥ 100 ≥ 100

◮ K is intersection of n strips of width 100

slide-14
SLIDE 14

Thm of Spencer-Gluskin-Giannopolous

Goal: For K :=

  • x ∈ Rn : |

j∈Si xj| ≤ 100√n ∀i ∈ [n]

  • find a point x ∈ {−1, 0, 1}n ∩ K with |supp(x)| ≥ Ω(n).

K

≥ 100 ≥ 100

◮ K is intersection of n strips of width 100 ◮ Gaussian measure

γn(K) ≥ (γn(width 100 strip))n ≥ e−n/100

slide-15
SLIDE 15

Thm of Spencer-Gluskin-Giannopolous

Goal: For K :=

  • x ∈ Rn : |

j∈Si xj| ≤ 100√n ∀i ∈ [n]

  • find a point x ∈ {−1, 0, 1}n ∩ K with |supp(x)| ≥ Ω(n).

K x

◮ K is intersection of n strips of width 100 ◮ Gaussian measure

γn(K) ≥ (γn(width 100 strip))n ≥ e−n/100

◮ Counting argument: any such K admits partial coloring

slide-16
SLIDE 16

Gaussian measure

Lemma (Sidak-Kathri ’67)

For K convex and symmetric and strip S, γn(K ∩ S) ≥ γn(K) · γn(S) K S

slide-17
SLIDE 17

Gaussian measure

Lemma (Sidak-Kathri ’67)

For K convex and symmetric and strip S, γn(K ∩ S) ≥ γn(K) · γn(S) S K

slide-18
SLIDE 18

Partial coloring approaches

◮ Spencer ’85, Gluskin ’87, Giannopolous ’97:

◮ (+) very general ◮ (−) non-constructive

slide-19
SLIDE 19

Partial coloring approaches

◮ Spencer ’85, Gluskin ’87, Giannopolous ’97:

◮ (+) very general ◮ (−) non-constructive

◮ Bansal ’10: SDP-based random walk for Spencer’s Thm

◮ (+) poly-time ◮ (−) custom-tailored to Spencers setting

slide-20
SLIDE 20

Partial coloring approaches

◮ Spencer ’85, Gluskin ’87, Giannopolous ’97:

◮ (+) very general ◮ (−) non-constructive

◮ Bansal ’10: SDP-based random walk for Spencer’s Thm

◮ (+) poly-time ◮ (−) custom-tailored to Spencers setting

◮ Lovett-Meka ’12:

◮ (+) poly-time ◮ (+) simple and elegant ◮ (+/−) Works for any K = {x : |x, vi| ≤ λi∀i ∈ [m]}

with m

i=1 e−λ2

i /16 ≤ n

16

slide-21
SLIDE 21

Our main result

Theorem (R. 2014)

For a convex symmetric set K ⊆ Rn with γn(K) ≥ e−δn,

  • ne can find a y ∈ K ∩ [−1, 1]n with |{i : yi = ±1}| ≥ εn in

poly-time. K y∗ [−1, 1]n

slide-22
SLIDE 22

Our main result

Theorem (R. 2014)

For a convex symmetric set K ⊆ Rn with γn(K) ≥ e−δn,

  • ne can find a y ∈ K ∩ [−1, 1]n with |{i : yi = ±1}| ≥ εn in

poly-time. Algorithm: (1) take a random x∗ ∼ γn K x∗ [−1, 1]n

slide-23
SLIDE 23

Our main result

Theorem (R. 2014)

For a convex symmetric set K ⊆ Rn with γn(K) ≥ e−δn,

  • ne can find a y ∈ K ∩ [−1, 1]n with |{i : yi = ±1}| ≥ εn in

poly-time. Algorithm: (1) take a random x∗ ∼ γn (2) compute y∗ = argmin{x∗ − y2 | y ∈ K ∩ [−1, 1]n} K x∗ y∗ [−1, 1]n

slide-24
SLIDE 24

Isoperimetric inequality

◮ For set K

K

slide-25
SLIDE 25

Isoperimetric inequality

◮ For set K, let K∆ := {x : d(x, K) ≤ ∆}

∆ K∆ K

slide-26
SLIDE 26

Isoperimetric inequality

◮ For set K, let K∆ := {x : d(x, K) ≤ ∆}

∆ K∆ K

Lemma

γn(K) ≥ e−δn = ⇒ γn(K3

√ δn) ≥ 1 − e−δn

slide-27
SLIDE 27

Isoperimetric inequality

◮ For set K, let K∆ := {x : d(x, K) ≤ ∆}

∆ K∆ K

Lemma

γn(K) ≥ e−δn = ⇒ γn(K3

√ δn) ≥ 1 − e−δn ◮ Isoperimetric inequality: worst case are half planes!

z

1 √ 2πe−z2/2

γn = e−δn γn = e−δn

slide-28
SLIDE 28

Isoperimetric inequality

◮ For set K, let K∆ := {x : d(x, K) ≤ ∆}

∆ K∆ K

Lemma

γn(K) ≥ e−δn = ⇒ γn(K3

√ δn) ≥ 1 − e−δn ◮ Isoperimetric inequality: worst case are half planes!

z

1 √ 2πe−z2/2

γn = e−δn γn = e−δn ≤ 3 √ δn

slide-29
SLIDE 29

Analysis

K [−1, 1]n

slide-30
SLIDE 30

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n K x∗ y∗ [−1, 1]n

slide-31
SLIDE 31

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n)

K x∗ y∗ [−1, 1]n

slide-32
SLIDE 32

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n) ◮ Def. I∗ := {i : |y∗ i | = 1}

Suppose |I∗| ≤ εn K x∗ y∗ [−1, 1]n

slide-33
SLIDE 33

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n) ◮ Def. I∗ := {i : |y∗ i | = 1}

Suppose |I∗| ≤ εn

◮ K(I∗) := K ∩ {|xi| ≤ 1 : i ∈ I∗}

K(I∗) K x∗ y∗ [−1, 1]n

slide-34
SLIDE 34

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n) ◮ Def. I∗ := {i : |y∗ i | = 1}

Suppose |I∗| ≤ εn

◮ K(I∗) := K ∩ {|xi| ≤ 1 : i ∈ I∗} ◮ x∗ − y∗2 = d(y∗, K(I∗))

K(I∗) K x∗ y∗ [−1, 1]n

slide-35
SLIDE 35

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n) ◮ Def. I∗ := {i : |y∗ i | = 1}

Suppose |I∗| ≤ εn

◮ K(I∗) := K ∩ {|xi| ≤ 1 : i ∈ I∗} ◮ x∗ − y∗2 = d(y∗, K(I∗))

K(I∗) K x∗ y∗ [−1, 1]n

◮ K(I∗) still large:

γn(K(I∗)) ≥ γn(K) · (γn(strip of width 1))εn ≥ e−2δn

slide-36
SLIDE 36

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n) ◮ Def. I∗ := {i : |y∗ i | = 1}

Suppose |I∗| ≤ εn

◮ K(I∗) := K ∩ {|xi| ≤ 1 : i ∈ I∗} ◮ x∗ − y∗2 = d(y∗, K(I∗))

K(I∗) K x∗ y∗ [−1, 1]n

◮ K(I∗) still large:

γn(K(I∗)) ≥ γn(K) · (γn(strip of width 1))εn ≥ e−2δn

◮ W.h.p. d(x∗, K(I∗)) ≤

√ 3δ · √n ≪ 1

5

√n

slide-37
SLIDE 37

Analysis

◮ W.h.p. x∗ − y∗2 ≥ 1 5

√n

◮ For any Q with γn(Q) ≥ e−o(n):

Pr[d(x∗, Q) ≥ 1

5 · √n] ≤ e−Ω(n) ◮ Def. I∗ := {i : |y∗ i | = 1}

Suppose |I∗| ≤ εn

◮ K(I∗) := K ∩ {|xi| ≤ 1 : i ∈ I∗} ◮ x∗ − y∗2 = d(y∗, K(I∗))

K(I∗) K x∗ y∗ [−1, 1]n

◮ K(I∗) still large:

γn(K(I∗)) ≥ γn(K) · (γn(strip of width 1))εn ≥ e−2δn

◮ W.h.p. d(x∗, K(I∗)) ≤

√ 3δ · √n ≪ 1

5

√n

◮ Union bound over all |I| ≤ εn:

Pr

|I|≤εn

d(x∗, K(I)) < 1 5 √n

  • ≤ e−Ω(n)
slide-38
SLIDE 38

Application to approximation algorithms

◮ Classical technique: For

K = {x ∈ Rn | Ax = b} with n

2 constraints

→ basic solution x′ has ≤ n

2 fractional

variables.

b

x

slide-39
SLIDE 39

Application to approximation algorithms

◮ Classical technique: For

K = {x ∈ Rn | Ax = b} with n

2 constraints

→ basic solution x′ has ≤ n

2 fractional

variables. Ax = b

b

x

slide-40
SLIDE 40

Application to approximation algorithms

◮ Classical technique: For

K = {x ∈ Rn | Ax = b} with n

2 constraints

→ basic solution x′ has ≤ n

2 fractional

variables. Ax = b

b

x

b x′

slide-41
SLIDE 41

Application to approximation algorithms

◮ Classical technique: For

K = {x ∈ Rn | Ax = b} with n

2 constraints

→ basic solution x′ has ≤ n

2 fractional

variables. Ax = b

b

x

b x′

◮ Lovett-Meka rounding:

◮ x′ has n( 1

2 + ε) fractional variables

◮ Additional: For |vi, x − x′| ≤ λi for unit vectors

satisfying m

i=1 e−λ2

i /16 ≤ c(ε) · n

slide-42
SLIDE 42

Application to approximation algorithms

◮ Classical technique: For

K = {x ∈ Rn | Ax = b} with n

2 constraints

→ basic solution x′ has ≤ n

2 fractional

variables. Ax = b

b

x

b x′

◮ Lovett-Meka rounding:

◮ x′ has n( 1

2 + ε) fractional variables

◮ Additional: For |vi, x − x′| ≤ λi for unit vectors

satisfying m

i=1 e−λ2

i /16 ≤ c(ε) · n

Better approximation algorithms for

◮ Bin Packing [R. ’13] ◮ Broadcast scheduling

[Bansal, Charikar, Krishnaswamy, Li ’14]

slide-43
SLIDE 43

Open problems

Setting: Given a set system S1, . . . , Sm ⊆ [n] where each element is in at most t sets. Beck-Fiala Conjecture: disc ≤ O( √ t)

slide-44
SLIDE 44

Open problems

Setting: Given a set system S1, . . . , Sm ⊆ [n] where each element is in at most t sets. Beck-Fiala Conjecture: disc ≤ O( √ t) Known bounds:

◮ 2t (constructive [Beck-Fiala ’81]) ◮ O(

√ t log n) (constructive)

◮ O(√t · log n) (non-constructive [Banaszczyk ’97])

slide-45
SLIDE 45

Open problems

Setting: Given a set system S1, . . . , Sm ⊆ [n] where each element is in at most t sets. Beck-Fiala Conjecture: disc ≤ O( √ t) Known bounds:

◮ 2t (constructive [Beck-Fiala ’81]) ◮ O(

√ t log n) (constructive)

◮ O(√t · log n) (non-constructive [Banaszczyk ’97])

Open problem:

◮ Show that disc ≤ o(t)

slide-46
SLIDE 46

Open problems

Setting: Given a set system S1, . . . , Sm ⊆ [n] where each element is in at most t sets. Beck-Fiala Conjecture: disc ≤ O( √ t) Known bounds:

◮ 2t (constructive [Beck-Fiala ’81]) ◮ O(

√ t log n) (constructive)

◮ O(√t · log n) (non-constructive [Banaszczyk ’97])

Open problem:

◮ Show that disc ≤ o(t)

Thanks for your attention