learning algorithms from natural lower bounds
play

Learning Algorithms from Natural Lower Bounds CCC 2016 Marco - PowerPoint PPT Presentation

Learning Algorithms from Natural Lower Bounds CCC 2016 Marco Carmosino (UCSD) Russell Impagliazzo (UCSD) Valentine Kabanets (SFU) Antonina Kolokolova (MUN) June 11, 2016 1 / 75 Natural Proof of Circuit Lower Bounds for C Learning


  1. Learning Algorithms from Natural Lower Bounds CCC 2016 Marco Carmosino (UCSD) Russell Impagliazzo (UCSD) Valentine Kabanets (SFU) Antonina Kolokolova (MUN) June 11, 2016 1 / 75

  2. Natural Proof of Circuit Lower Bounds for C ⇓ Learning Algorithm for C 2 / 75

  3. ? Natural Proof = Barrier However: Natural Proof = Algorithm 3 / 75

  4. Not A Surprise As long as we use natural proofs we have to cope with a duality: any lower bound proof must implicitly argue a proportionately strong upper bound. – Razborov, Rudich 1997 With this duality in mind, it is no coincidence that the technical lemmas of [H˚ as87, Smo87, Raz87] yield much of the machinery for the learning result of [LMN93]. – Razborov, Rudich 1997 4 / 75

  5. Theorem (Main Application of this work) There is a quasi-polynomial time membership-query learning algorithm for AC 0 [ p ] Open Since: Theorem ([LMN93]) There is a quasi-polynomial time uniform-sample learning algorithm for AC 0 5 / 75

  6. Algorithms Lower Bounds 6 / 75

  7. Reminder: Circuits 7 / 75

  8. Complexity Petting Zoo An AC 0 Circuit 8 / 75

  9. Circuit Zoo P / poly Unrestricted poly-size circuits ⊆ TC 0 Add MAJORITY gates ⊆ ACC 0 Add counting modulo any m ⊆ AC 0 [ p ] Add counting modulo prime p ⊆ AC 0 AND, OR, NOT, constant-depth, poly-size 9 / 75

  10. Circuit Lower Bounds The Final Frontier P / poly ??!?!?!?!?!!?? ⊆ TC 0 ???? ⊆ ACC 0 �⊃ NEXP [Wil11] ⊆ AC 0 [ p ] � ACC 0 [Raz87 , Smo87] ⊆ AC 0 � AC 0 [2] [FSS81] 10 / 75

  11. Reminder: Pseudorandom Generators 11 / 75

  12. 12 / 75

  13. 13 / 75

  14. 14 / 75

  15. 15 / 75

  16. Reminder: Natural Proofs 16 / 75

  17. Natural Lower Bounds Against C Razborov Rudich 97 Natural Proofs embed an efficient algorithm for some problem R : input: (truth table of) f : { 0 , 1 } n → { 0 , 1 } task: Distinguish: ∀ f of C -circuit complexity ≤ u ( n ) , f �∈ R versus Pr[random f ∈ R ] > 1 / 5 17 / 75

  18. = Function on n bits 18 / 75

  19. Prior Work 19 / 75

  20. Prior Work C -Lower Bounds to Algorithms ◮ Formula-SAT [San10] C -Meta-Algorithms ◮ AC 0 -SAT [IMP12] (SAT, Learning, Compression) ◮ AC 0 -Learning [LMN93] ⇑ ◮ AC 0 -Compression [CKK + 15] ◮ AC 0 [ p ]-Compression [Sri15] 20 / 75

  21. Prior Work C -Algorithms to Lower Bounds ◮ fast C -LEARN = ⇒ EXP � C [FK09, HH11, KKO13] ◮ fast C -SAT = ⇒ NEXP � C (SAT, Learning, Compression) [Wil10] ⇓ ◮ NEXP � ACC [Wil11] ◮ C -Compression = ⇒ NEXP � C [CKK + 15] C -Lower Bounds 21 / 75

  22. Prior Work: The Pattern C -Meta-Algorithms (SAT, Learning, Compression) (SAT, Learning, Compression) ⇑ ⇓ C -Lower Bounds 22 / 75

  23. C -Meta-Algorithms (SAT? Learning? Compression?) ? ⇑ ? ??? 23 / 75

  24. Randomized C -Learning Algorithm ⇑ (for “powerful enough” circuit classes C ) ( C closed under polysize AC 0 [ p ]-reductions) 24 / 75

  25. C -Learning f Learning Model: Membership Queries: may query f ( x ) for any x Uniform PAC: find circuit H s.t. Pr x [ H ( x ) = f ( x )] ≈ 1 25 / 75

  26. Razborov Rudich 97 Natural Lower Bound Against C ⇓ Inverting Algorithm for every g ∈ C MESSAGE: No Natural Lower Bounds for C ⊇ TC 0 26 / 75

  27. This Work Natural Lower Bound Against C ⇓ Learning Algorithm for every g ∈ C MESSAGE: Hmmm. 27 / 75

  28. ⇓ 28 / 75

  29. Natural Lower Bound for AC 0 [ p ] [RS] ⇓ Quasi-Polytime Learning Algorithm for AC 0 [ p ] 29 / 75

  30. Learning Algorithms Compare & Contrast [LMN] Algorithm for AC 0 Our Algorithm for AC 0 [ p ] ◮ Uniform PAC Model ◮ Membership queries ◮ n poly (log n ) runtime ◮ n poly (log n ) runtime ◮ Switching Lemma ◮ RS Lower Bound (an exp( n 1 / Ω( d ) ) lower (an exp( n 1 / Ω( d ) ) lower bound for depth d ) bound for depth d ) 30 / 75

  31. The Proof 31 / 75

  32. Tools ◮ NW Generator ◮ XOR Lemma ◮ Natural Properties 32 / 75

  33. Notation 33 / 75

  34. Agree-o-Meter Let f : { 0 , 1 } n → { 0 , 1 } . Define: 34 / 75

  35. Hardness 35 / 75

  36. Hardness Somewhat Very Hard Hard (not to scale) 36 / 75

  37. NW Generator Definition 37 / 75

  38. NW Generator Definition (stretch) (seed) 38 / 75

  39. NW Generator Definition 39 / 75

  40. NW Generator Definition Almost Disjoint Subset Multiplexer 40 / 75

  41. NW Generator Definition Almost Disjoint Subset Multiplexer 41 / 75

  42. NW Generator THEOREM: If h is a very hard function, then NW h ( s ) is a PRG. PROOF: Assume NW h ( s ) is NOT a PRG. Then ∃D : 42 / 75

  43. NW Generator Using D : 43 / 75

  44. Yao’s XOR Lemma h ⊕ k ( � x 1 , . . . , � x k ) = h ( � x 1 ) ⊕ · · · ⊕ h ( � x k ) THEOREM: If h is a somewhat hard function, then h ⊕ k is a very hard function . PROOF: Assume h ⊕ k is NOT very hard. Then ∃ C : 44 / 75

  45. Yao’s XOR Lemma Proof Using C : 45 / 75

  46. Hardness to Randomness Theorem: if h is a somewhat hard function, then NW h ⊕ k ( s ) is a PRG. Proof: Assume NW h ⊕ k ( s ) is NOT a PRG. Then compose the previous two proofs! 46 / 75

  47. Main Proof Idea: Play to Lose PRG: ⊥ = victory Learning: circuit = victory 47 / 75

  48. Learning Algorithm Idea: if f ∈ C ... Figure 1: Composed arguments for easy f 48 / 75

  49. What’s Missing? CONDITION: ∀ f ∈ C ∃D a UNIFORM circuit such that... Figure 2: The circuit we need to learn f 49 / 75

  50. NATURAL PROPERTY R vs C : ∀ f ′ ∈ C [ u ] 50 / 75

  51. COMPARE Figure 3: HAVE vs WANT 51 / 75

  52. . . . . . . . . . . . . . . . . . . 52 / 75

  53. Key Lemma f ∈ C [ poly ] = ⇒ ∀ s g s ∈ C [exp( ℓ − 1 )] Requirement: g s �∈ R size( g s ) ≤ usefulness(log( ℓ )) 53 / 75

  54. Proof of Key Lemma: Almost Disjoint Subset Multiplexer 54 / 75

  55. Runtime Analysis 55 / 75

  56. 56 / 75

  57. 57 / 75

  58. 58 / 75

  59. 59 / 75

  60. 60 / 75

  61. Learning Runtime = poly( ℓ ) (Recall, ℓ is the stretch of our PRG) 61 / 75

  62. = Function on n bits 62 / 75

  63. ??? 63 / 75

  64. ??? 64 / 75

  65. ??? 65 / 75

  66. ??? 66 / 75

  67. ??? 67 / 75

  68. 68 / 75

  69. Constraints 1. Natural Property: C -size( g s ) ≤ u (log( ℓ )) ⇒ ∀ s g s ∈ C [exp( ℓ − 1 )] recall: f ∈ C [ poly ] = 2. Runtime (want to minimize ): poly( ℓ ) 69 / 75

  70. Learning Algorithm Runtime Usefulness vs. Size Usefulness Learning Runtime ◮ Exponential 2 Ω( n ) ◮ Polynomial ◮ Subexponential 2 n ǫ ◮ Quasipolynomial ◮ Superpolynomial n ω (1) ◮ Subexponential 70 / 75

  71. Exp runtime Poly(n) lb Poly runtime 2 𝑜 lb 71 / 75

  72. Randomized C -Learning Algorithm ⇑ (for “powerful enough” circuit classes C ) ( C closed under polysize AC 0 [ p ]-reductions) 72 / 75

  73. Main Application Learning AC 0 [ p ] In quasipolynomial time quasi-poly 73 / 75

  74. Summary ◮ Natural Proofs = ⇒ Learning (and Compression) Algorithms ◮ BPP -Natural Proofs for C ⇐ ⇒ Randomized Compression for C (answers open question of [CKKSZ15] ◮ Randomized quasi-polytime learning algorithm for AC 0 [ p ] for any prime p 74 / 75

  75. Future Work ◮ Natural Proof of Williams’ ACC 0 lower bound? ? ◮ Natural Proofs for C = ⇒ C -SAT Algorithms ◮ Other learning models? ◮ Banish serendipity? ◮ Derandomization? ◮ Limits to “grey-box” algorithm design? 75 / 75

  76. Ruiwen Chen, Valentine Kabanets, Antonina Kolokolova, Ronen Shaltiel, and David Zuckerman. Mining circuit lower bound proofs for meta-algorithms. Computational Complexity , 24(2):333–392, 2015. Lance Fortnow and Adam R. Klivans. Efficient learning algorithms yield circuit lower bounds. J. Comput. Syst. Sci. , 75(1):27–36, 2009. Merrick L. Furst, James B. Saxe, and Michael Sipser. Parity, circuits, and the polynomial-time hierarchy. In 22nd Annual Symposium on Foundations of Computer Science, Nashville, Tennessee, USA, 28-30 October 1981 , pages 260–270. IEEE Computer Society, 1981. Johan H˚ astad. Computational Limitations of Small-depth Circuits . MIT Press, Cambridge, MA, USA, 1987. Ryan C. Harkins and John M. Hitchcock. 75 / 75

  77. Exact learning algorithms, betting games, and circuit lower bounds. In Luca Aceto, Monika Henzinger, and Jir´ ı Sgall, editors, Automata, Languages and Programming - 38th International Colloquium, ICALP 2011, Zurich, Switzerland, July 4-8, 2011, Proceedings, Part I , volume 6755 of Lecture Notes in Computer Science , pages 416–423. Springer, 2011. Russell Impagliazzo, William Matthews, and Ramamohan Paturi. A satisfiability algorithm for ac0. In Yuval Rabani, editor, Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2012, Kyoto, Japan, January 17-19, 2012 , pages 961–972. SIAM, 2012. Adam Klivans, Pravesh Kothari, and Igor Carboni Oliveira. Constructing hard functions using learning algorithms. 75 / 75

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend