Learning Algorithms from Natural Lower Bounds CCC 2016 Marco - - PowerPoint PPT Presentation

learning algorithms from natural lower bounds
SMART_READER_LITE
LIVE PREVIEW

Learning Algorithms from Natural Lower Bounds CCC 2016 Marco - - PowerPoint PPT Presentation

Learning Algorithms from Natural Lower Bounds CCC 2016 Marco Carmosino (UCSD) Russell Impagliazzo (UCSD) Valentine Kabanets (SFU) Antonina Kolokolova (MUN) June 11, 2016 1 / 75 Natural Proof of Circuit Lower Bounds for C Learning


slide-1
SLIDE 1

Learning Algorithms from Natural Lower Bounds

CCC 2016 Marco Carmosino (UCSD) Russell Impagliazzo (UCSD) Valentine Kabanets (SFU) Antonina Kolokolova (MUN) June 11, 2016

1 / 75
slide-2
SLIDE 2

Natural Proof of Circuit Lower Bounds for C

Learning Algorithm for C

2 / 75
slide-3
SLIDE 3

Natural Proof

?

= Barrier However: Natural Proof = Algorithm

3 / 75
slide-4
SLIDE 4

Not A Surprise As long as we use natural proofs we have to cope with a duality: any lower bound proof must implicitly argue a proportionately strong upper bound. – Razborov, Rudich 1997 With this duality in mind, it is no coincidence that the technical lemmas of [H˚ as87, Smo87, Raz87] yield much of the machinery for the learning result of [LMN93]. – Razborov, Rudich 1997

4 / 75
slide-5
SLIDE 5

Theorem (Main Application of this work)

There is a quasi-polynomial time membership-query learning algorithm for AC0[p] Open Since:

Theorem ([LMN93])

There is a quasi-polynomial time uniform-sample learning algorithm for AC0

5 / 75
slide-6
SLIDE 6

Algorithms Lower Bounds

6 / 75
slide-7
SLIDE 7

Reminder: Circuits

7 / 75
slide-8
SLIDE 8

Complexity Petting Zoo

An AC0 Circuit

8 / 75
slide-9
SLIDE 9

Circuit Zoo P/poly Unrestricted poly-size circuits ⊆ TC0 Add MAJORITY gates ⊆ ACC0 Add counting modulo any m ⊆ AC0[p] Add counting modulo prime p ⊆ AC0 AND, OR, NOT, constant-depth, poly-size

9 / 75
slide-10
SLIDE 10

Circuit Lower Bounds

The Final Frontier

P/poly ??!?!?!?!?!!?? ⊆ TC0 ???? ⊆ ACC0 ⊃ NEXP [Wil11] ⊆ AC0[p] ACC0 [Raz87, Smo87] ⊆ AC0 AC0[2] [FSS81]

10 / 75
slide-11
SLIDE 11

Reminder: Pseudorandom Generators

11 / 75
slide-12
SLIDE 12 12 / 75
slide-13
SLIDE 13 13 / 75
slide-14
SLIDE 14 14 / 75
slide-15
SLIDE 15 15 / 75
slide-16
SLIDE 16

Reminder: Natural Proofs

16 / 75
slide-17
SLIDE 17

Natural Lower Bounds Against C

Razborov Rudich 97

Natural Proofs embed an efficient algorithm for some problem R: input: (truth table of) f : {0, 1}n → {0, 1} task: Distinguish: ∀f of C-circuit complexity ≤ u(n), f ∈ R versus Pr[random f ∈ R] > 1/5

17 / 75
slide-18
SLIDE 18

= Function on n bits

18 / 75
slide-19
SLIDE 19

Prior Work

19 / 75
slide-20
SLIDE 20

Prior Work

C-Lower Bounds to Algorithms

C-Meta-Algorithms

(SAT, Learning, Compression)

◮ Formula-SAT [San10] ◮ AC0-SAT [IMP12] ◮ AC0-Learning [LMN93] ◮ AC0-Compression [CKK+15] ◮ AC0[p]-Compression [Sri15]

20 / 75
slide-21
SLIDE 21

Prior Work

C-Algorithms to Lower Bounds

◮ fast C-LEARN =

⇒ EXP C [FK09, HH11, KKO13]

◮ fast C-SAT =

⇒ NEXP C [Wil10]

◮ NEXP ACC

[Wil11]

◮ C-Compression =

⇒ NEXP C [CKK+15]

(SAT, Learning, Compression)

C-Lower Bounds

21 / 75
slide-22
SLIDE 22

Prior Work: The Pattern C-Meta-Algorithms

(SAT, Learning, Compression)

(SAT, Learning, Compression)

C-Lower Bounds

22 / 75
slide-23
SLIDE 23

C-Meta-Algorithms

(SAT? Learning? Compression?)

?⇑?

???

23 / 75
slide-24
SLIDE 24

Randomized C-Learning Algorithm

(for “powerful enough” circuit classes C)

(C closed under polysize AC0[p]-reductions)

24 / 75
slide-25
SLIDE 25

C-Learning f

Learning Model: Membership Queries: may query f (x) for any x Uniform PAC: find circuit H s.t. Prx[H(x) = f (x)] ≈ 1

25 / 75
slide-26
SLIDE 26

Razborov Rudich 97

Natural Lower Bound Against C

Inverting Algorithm for every g ∈ C

MESSAGE: No Natural Lower Bounds for C ⊇ TC0

26 / 75
slide-27
SLIDE 27

This Work

Natural Lower Bound Against C

Learning Algorithm for every g ∈ C

MESSAGE: Hmmm.

27 / 75
slide-28
SLIDE 28

28 / 75
slide-29
SLIDE 29

Natural Lower Bound for AC0[p] [RS]

Quasi-Polytime Learning Algorithm for AC0[p]

29 / 75
slide-30
SLIDE 30

Learning Algorithms

Compare & Contrast

[LMN] Algorithm for AC0

◮ Uniform PAC Model ◮ npoly(log n) runtime ◮ Switching Lemma

(an exp(n1/Ω(d)) lower bound for depth d)

Our Algorithm for AC0[p]

◮ Membership queries ◮ npoly(log n) runtime ◮ RS Lower Bound

(an exp(n1/Ω(d)) lower bound for depth d)

30 / 75
slide-31
SLIDE 31

The Proof

31 / 75
slide-32
SLIDE 32

Tools

◮ NW Generator ◮ XOR Lemma ◮ Natural Properties

32 / 75
slide-33
SLIDE 33

Notation

33 / 75
slide-34
SLIDE 34

Agree-o-Meter

Let f : {0, 1}n → {0, 1}. Define:

34 / 75
slide-35
SLIDE 35

Hardness

35 / 75
slide-36
SLIDE 36

Hardness

Very Hard Somewhat Hard

(not to scale)

36 / 75
slide-37
SLIDE 37

NW Generator

Definition

37 / 75
slide-38
SLIDE 38

NW Generator

Definition

(stretch)

(seed)

38 / 75
slide-39
SLIDE 39

NW Generator

Definition

39 / 75
slide-40
SLIDE 40

NW Generator

Definition

Almost Disjoint Subset Multiplexer

40 / 75
slide-41
SLIDE 41

NW Generator

Definition

Almost Disjoint Subset Multiplexer

41 / 75
slide-42
SLIDE 42

NW Generator

THEOREM: If h is a very hard function, then NWh(s) is a PRG. PROOF: Assume NWh(s) is NOT a PRG. Then ∃D:

42 / 75
slide-43
SLIDE 43

NW Generator

Using D:

43 / 75
slide-44
SLIDE 44

Yao’s XOR Lemma

h⊕k( x1, . . . , xk) = h( x1) ⊕ · · · ⊕ h( xk) THEOREM: If h is a somewhat hard function, then h⊕k is a very hard function. PROOF: Assume h⊕k is NOT very hard. Then ∃C:

44 / 75
slide-45
SLIDE 45

Yao’s XOR Lemma

Proof

Using C:

45 / 75
slide-46
SLIDE 46

Hardness to Randomness

Theorem: if h is a somewhat hard function, then NWh⊕k(s) is a PRG. Proof: Assume NWh⊕k(s) is NOT a PRG. Then compose the previous two proofs!

46 / 75
slide-47
SLIDE 47

Main Proof Idea: Play to Lose PRG: ⊥ = victory Learning: circuit = victory

47 / 75
slide-48
SLIDE 48

Learning Algorithm

Idea: if f ∈ C...

Figure 1: Composed arguments for easy f

48 / 75
slide-49
SLIDE 49

What’s Missing?

CONDITION: ∀f ∈ C ∃D a UNIFORM circuit such that...

Figure 2: The circuit we need to learn f

49 / 75
slide-50
SLIDE 50

NATURAL PROPERTY R vs C: ∀f ′ ∈ C[u]

50 / 75
slide-51
SLIDE 51

COMPARE

Figure 3: HAVE vs WANT

51 / 75
slide-52
SLIDE 52

. . . . . . . . . . . . . . . . . .

52 / 75
slide-53
SLIDE 53

Key Lemma f ∈ C[poly] = ⇒ ∀s gs ∈ C[exp(ℓ−1)] Requirement: gs ∈ R size(gs) ≤ usefulness(log(ℓ))

53 / 75
slide-54
SLIDE 54

Proof of Key Lemma:

Almost Disjoint Subset Multiplexer

54 / 75
slide-55
SLIDE 55

Runtime Analysis

55 / 75
slide-56
SLIDE 56 56 / 75
slide-57
SLIDE 57 57 / 75
slide-58
SLIDE 58 58 / 75
slide-59
SLIDE 59 59 / 75
slide-60
SLIDE 60 60 / 75
slide-61
SLIDE 61

Learning Runtime = poly(ℓ)

(Recall, ℓ is the stretch of our PRG)

61 / 75
slide-62
SLIDE 62

= Function on n bits

62 / 75
slide-63
SLIDE 63

???

63 / 75
slide-64
SLIDE 64

???

64 / 75
slide-65
SLIDE 65

???

65 / 75
slide-66
SLIDE 66

???

66 / 75
slide-67
SLIDE 67

???

67 / 75
slide-68
SLIDE 68 68 / 75
slide-69
SLIDE 69

Constraints

  • 1. Natural Property:

C-size(gs) ≤ u(log(ℓ))

recall: f ∈ C[poly] = ⇒ ∀s gs ∈ C[exp(ℓ−1)]

  • 2. Runtime (want to minimize):

poly(ℓ)

69 / 75
slide-70
SLIDE 70

Learning Algorithm Runtime

Usefulness vs. Size

Usefulness

◮ Exponential 2Ω(n) ◮ Subexponential 2nǫ ◮ Superpolynomial nω(1)

Learning Runtime

◮ Polynomial ◮ Quasipolynomial ◮ Subexponential

70 / 75
slide-71
SLIDE 71

Poly(n) lb

2𝑜 lb

Exp runtime

Poly runtime

71 / 75
slide-72
SLIDE 72

Randomized C-Learning Algorithm

(for “powerful enough” circuit classes C)

(C closed under polysize AC0[p]-reductions)

72 / 75
slide-73
SLIDE 73

Main Application

Learning AC0[p] In quasipolynomial time quasi-poly

73 / 75
slide-74
SLIDE 74

Summary

◮ Natural Proofs =

⇒ Learning (and Compression) Algorithms

◮ BPP-Natural Proofs for C ⇐

⇒ Randomized Compression for C

(answers open question of [CKKSZ15]

◮ Randomized quasi-polytime learning algorithm

for AC0[p] for any prime p

74 / 75
slide-75
SLIDE 75

Future Work

◮ Natural Proof of Williams’ ACC0 lower bound? ◮ Natural Proofs for C

?

= ⇒ C-SAT Algorithms

◮ Other learning models? ◮ Banish serendipity? ◮ Derandomization? ◮ Limits to “grey-box” algorithm design?

75 / 75
slide-76
SLIDE 76

Ruiwen Chen, Valentine Kabanets, Antonina Kolokolova, Ronen Shaltiel, and David Zuckerman. Mining circuit lower bound proofs for meta-algorithms. Computational Complexity, 24(2):333–392, 2015. Lance Fortnow and Adam R. Klivans. Efficient learning algorithms yield circuit lower bounds.

  • J. Comput. Syst. Sci., 75(1):27–36, 2009.

Merrick L. Furst, James B. Saxe, and Michael Sipser. Parity, circuits, and the polynomial-time hierarchy. In 22nd Annual Symposium on Foundations of Computer Science, Nashville, Tennessee, USA, 28-30 October 1981, pages 260–270. IEEE Computer Society, 1981. Johan H˚ astad. Computational Limitations of Small-depth Circuits. MIT Press, Cambridge, MA, USA, 1987. Ryan C. Harkins and John M. Hitchcock.

75 / 75
slide-77
SLIDE 77

Exact learning algorithms, betting games, and circuit lower bounds. In Luca Aceto, Monika Henzinger, and Jir´ ı Sgall, editors, Automata, Languages and Programming - 38th International Colloquium, ICALP 2011, Zurich, Switzerland, July 4-8, 2011, Proceedings, Part I, volume 6755 of Lecture Notes in Computer Science, pages 416–423. Springer, 2011. Russell Impagliazzo, William Matthews, and Ramamohan Paturi. A satisfiability algorithm for ac0. In Yuval Rabani, editor, Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2012, Kyoto, Japan, January 17-19, 2012, pages 961–972. SIAM, 2012. Adam Klivans, Pravesh Kothari, and Igor Carboni Oliveira. Constructing hard functions using learning algorithms.

75 / 75
slide-78
SLIDE 78

In Proceedings of the 28th Conference on Computational Complexity, CCC 2013, K.lo Alto, California, USA, 5-7 June, 2013, pages 86–97. IEEE, 2013. Nathan Linial, Yishay Mansour, and Noam Nisan. Constant depth circuits, fourier transform, and learnability.

  • J. ACM, 40(3):607–620, 1993.
  • A. A. Razborov.

Lower bounds on the size of bounded depth circuits over a complete basis with logical addition. Mathematical notes of the Academy of Sciences of the USSR, 41(4):333–338, 1987. Rahul Santhanam. Fighting perebor: New and improved algorithms for formula and QBF satisfiability. In 51th Annual IEEE Symposium on Foundations of Computer Science, FOCS 2010, October 23-26, 2010, Las Vegas, Nevada, USA, pages 183–192. IEEE Computer Society, 2010.

75 / 75
slide-79
SLIDE 79

Roman Smolensky. Algebraic methods in the theory of lower bounds for boolean circuit complexity. In Proceedings of the 19th Annual ACM Symposium on Theory of Computing, 1987, New York, New York, USA, pages 77–82, 1987. Srikanth Srinivasan. A compression algorithm for $acˆ0[\oplus]$ circuits using certifying polynomials. Electronic Colloquium on Computational Complexity (ECCC), 22:142, 2015. Ryan Williams. Improving exhaustive search implies superpolynomial lower bounds. In Leonard J. Schulman, editor, Proceedings of the 42nd ACM Symposium on Theory of Computing, STOC 2010, Cambridge, Massachusetts, USA, 5-8 June 2010, pages 231–240. ACM, 2010.

75 / 75
slide-80
SLIDE 80

Ryan Williams. Non-uniform ACC circuit lower bounds. In Proceedings of the 26th Annual IEEE Conference on Computational Complexity, CCC 2011, San Jose, California, June 8-10, 2011, pages 115–125. IEEE Computer Society, 2011.

75 / 75