How to not prove A Brief Introduction to Natural Proofs & - PowerPoint PPT Presentation

How to not prove 𝑄 ≠ 𝑂𝑄 A Brief Introduction to Natural Proofs & Data Complexity Shubhang Kulkarni and Ryan Davis

Part I : Introduction to Natural Proofs Shubhang Kulkarni “The obstacle is, roughly, that a large class of approaches to circuit lower bounds must prove more” - R. Lipton

Why This Talk is Important Lowerbounds ≡ Computational Intractability of a problem Modern day e-commerce heavily relies on certain lower bounds being true. Its natural to ask what makes lower bound questions so difficult • 𝑄 ≠ 𝑂𝑄 ? • 𝑄 ≠ 𝑄𝑇𝑄𝐵𝐷𝐹 ? • 𝑄 ≠ 𝑂𝐷 ?

Algorithms vs Complexity Theory Algorithms Theory ∃ • When can we solve problems quickly? • What’s an efficient way to solve the problem? Complexity Theory • When can problems not be solved efficiently? ∀ • How can we prove that a problem is not easy? The algorithm designers and the complexity theorists have opposing goals.

Complexity Barriers It turns out we have some formal understanding of why lower bounds are so tough to prove (at the moment) Any lower bound proof must overcome complexity barriers These barriers are “meta-theorems” about proofs • Relativization • Natural Proofs Baker, Gill, Solovay • Algebrization ∃ Oracles A, B such that 𝑄 - = 𝑂𝑄 - but 𝑄 / ≠ 𝑂𝑄 /

Simple Notation • 0,1 3 ∶ n-bit binary strings (input x) Also denoted by 𝐶 3 3 : function 𝑔: 0,1 3 → {0,1} • 𝑔 • 𝐺 3 ∶ set of all functions 𝑔 3 • 𝐷 3 : combinatorial property of 𝑔 3

Combinatorial Properties I 𝐷 3 can be thought of as a subset of 𝐺 3 of the functions possessing the property • 𝑔 3 “satisfies” 𝐷 3 ↔ 𝑔 3 ∈ 𝐷 3 • Also denoted as: 𝐷 3 𝑔 3 = 1 if 𝑔 3 “satisfies” 𝐷 3 , = 0 otherwise

Combinatorial Properties II Actually for any subset of 𝐷 3 𝐷 3 is Natural if it satisfies (1) Constructivity There is a polynomial algorithm to determine whether 𝑔 3 ∈ 𝐷 3 (2) Largeness A random 𝑕 3 has a “non-negligible” chance of satisfying 𝐷 3 Formally, |𝐷 3 | ≥ 2 BC 3 ⋅ |𝐺 3 | Terminology A combinatorial property is useful against 𝑄 /FGHI if the circuit sizes of all functions satisfying 𝐷 3 are super-polynomial.

Natural Proof “A proof that some function does not have a polynomial size circuit is natural against 𝑄 /FGHI if the proof contains the definition of natural combinatorial properties useful against 𝑄 /FGHI .” Want to prove { 𝑕 3 } has no polynomial circuit • Identify property 𝐷 3 such that • The proof shows that ∀𝑔 3 ∈ 𝐷 3 , 𝑔 3 is “hard” for circuits to compute • 𝑕 3 ∈ 𝐷 3

The Naïve Approach to P ≠ NP • Define mathematically the notion of “discrepancy” or “scatter” of boolean function values i.e. Define a 𝐷 3 s.t. 𝐷 3 is true for functions of “high discrepancy” • Show (inductively) that poly circuits can only compute low discrepancy functions i.e. 𝐷 3 is useful as 𝑔 3 ∈ 𝐷 3 cannot be computed by poly circuit • Show that SAT has high discrepancy i.e. SAT ∈ 𝐷 3 • 𝑄 ⊂ 𝑄 /FGHI implies 𝑄 ≠ 𝑂𝑄

The Breakthrough Results • “Our main theorem … … gives evidence that no proof strategy along these lines can ever succeed” – Razborov, Rudich ’ 96 • “Any Large and Constructive 𝐷 3 useful against 𝑄 /FGHI provides a statistical test that can be used to break any polytime psuedo-random generator.” • Violates a widely held belief that psuedo-random generators of hardness 2 K (𝜗 > 0) exist.

Psuedo-Random Generators f(𝑦 3 ) 𝚾(𝒀 𝒍 ) Φ(𝑦 R ) 𝟏, 𝟐 𝒐 {0,1} 𝟏, 𝟐 R 𝑌 R 𝑌 3 • 𝑄 𝑔 = 1 : Probability that 𝑔(x \ ) = 1 • 𝑄 𝑔 ] = 1 : Probability that 𝑔 Φ x ^ = 1 𝑄 𝑔 = 1 | 𝑦 3 ∈ Φ(𝑌 R ) Φ is a M-hard psuedo-random generator if 𝑄 𝑔 ≈ 𝑄(𝑔 ] ) ≤ 𝑁 Bc i.e. 𝑄 𝑔 − 𝑄 𝑔 ]

The Entire Picture (so far…) Integer One Way Discrete Log Empirical Evidence Factorization Functions Exist Hardness Hardness Psuedo-Random No Natural Proofs Generators Exist 𝑸 ≠ 𝑶𝑸

Implications of no Natural Proofs Recall the definition of Natural Proofs Any property used by a non-natural lower bound proof must fall into one of: • Violates Largeness • Probability that a random function has the property is small • Property is shared by very few • Violates Constructability • Very complicated property

Takeaways Decoding the literature: Notation Description Size of Distribution Type 0,1 3 2 3 Input Set Set 2 f g Functions on 0,1 3 Set of Sets 𝐺 3 2 f hg 𝐷 3 Properties of 𝐺 Set of Sets of Sets 3 A random efficiently computable function is very hard to distinguish from a random function • Main Proof Idea [RR97] • The rubik's cube • 3 bit scramblers’ composition

End of Part I Shubhang Kulkarni “The general problem of mathematically proving computational lower bounds is a mystery” - Ryan Williams, Thinking Algorithmically About Impossibility

Part II : Introduction to Data Complexity Ryan Davis “[We] will have to develop new methods to make a serious dent in major lower bound problems.” - Ryan Williams, Thinking Algorithmically About Impossibility

Why use Data Complexity? Closely related to program verification (testing) Carefully chosen input/output pairs to determine correctness When does it suffice to use a small number of test cases? What if we know something about the program, such as its size?

Defining Data Complexity Decision Problem Potential Solution Assume a known function 𝑔: {0,1} ∗ → {0,1} Given a circuit 𝐷 of size 𝑡 , we wish to determine if 𝐷 computes 𝑔 Data complexity (w.r.t. 𝑡 ) – minimum number of input/output examples to determine if 𝐷 computes 𝑔 “Gray-box” testing where 𝑡 is side information

Overarching Question The data complexity for size 𝑡 circuits is trivially 2 C(k) (Include all input/output examples up to length 𝑡 ) We are interested to know: For what functions 𝑔 can the data complexity be much smaller?

Data Complexity and Circuit Complexity The data complexity of testing 𝑔 is “small” If and only if The circuit complexity of 𝑔 is “large” “The theory of circuits becomes interesting when we restrict the complexities of the circuits; The theory of test suites becomes similarly interesting when restricting the amount of necessary data.” - Chapman, B., Williams, R.

The Circuit-Input Game • Circuit player has a set 𝐷 of all circuits of size 𝑡 • Size of 𝐷 is 𝐷 = 2 C(k lmn k) 2 C(k lmn k) • Input player has a set 𝐽 of all inputs of length 𝑜 • Size of 𝐽 is |𝐽| = 2 3 • Payoff matrix 𝑁 with 2 3 rows and 2 C(k lmn k) columns 2 3 𝑁 • 𝑁 𝑑, 𝑦 = 0 if 𝑑 𝑦 = 𝑔(𝑦) • 𝑁 𝑑, 𝑦 = 1 if 𝑑 𝑦 ≠ 𝑔(𝑦) Payoff goes to the input player

Approximate Optimal Strategies Theorem 2.1 (roughly) Circuit player has a good strategy! 𝑙 ≥ 𝑑 u 𝑜 1. There exists a 𝑙 -size distribution 𝑞 (strategy) on 𝐷 such that for all 𝑦 ∈ 𝐽 , the circuit player has a good chance 𝑑 ∈ 𝑞 will satisfy 𝑑 𝑦 = 𝑔 𝑦 ℓ ≥ 𝑑 u 𝑡 log 𝑡 2. There exists a ℓ -size distribution 𝑞 (strategy) on 𝐽 such that for all 𝑑 ∈ 𝐷 , the input player has a good chance 𝑦 ∈ 𝑞 will satisfy 𝑑 𝑦 ≠ 𝑔 𝑦 Input player has a good strategy!

Data Complexity Consequence Theorem 2.2 (roughly) Let 𝑞 + 𝑟 ≤ 1 − 𝜗 𝑙 ≥ 𝑑 u 𝑜 1. There exists a 𝑙 -size set of circuits 𝑍 ⊆ 𝐷 such that for all 𝑦 ∈ 𝐽 , 𝑑 𝑦 = 𝑔 𝑦 for more than a 𝑞 -fraction of circuits 𝑑 ∈ 𝑍 . ℓ ≥ 𝑑 u 𝑡 log 𝑡 2. There exists a ℓ -size set of inputs 𝑌 ⊆ 𝐽 such that for all 𝑑 ∈ 𝐷 , 𝑑 𝑦 ≠ 𝑔 𝑦 for more than a 𝑟 -fraction of inputs 𝑦 ∈ 𝑌 .

Data Complexity and Circuit Complexity The data complexity of testing 𝑔 is “small” If and only if The circuit complexity of 𝑔 is “large”

Data Complexity and Circuit Complexity Theorem 1.2: Let function 𝑔: {0,1} ∗ → {0,1} and 𝑇(𝑜) ≥ 2𝑜 for all 𝑜 Hard to test! 1. If 𝑔 is in SIZE ( 𝑇(𝑜) ), the data complexity of testing size- 𝑡 circuits for 𝑔 is at least 2 }(~ •€ k ) Easy to test! 2. If 𝑔 is not in SIZE ( 𝑜 u 𝑇(𝑜) ), the data complexity of testing size- 𝑡 circuits for 𝑔 is at most 𝑃(2 ~ •€ k + 𝑇 Bc 𝑡 u 𝑡 f log 𝑡)

Proof Intuition Replace data complexity with time complexity When 𝑔 has large circuit complexity, we can quickly test circuits for 𝑔 Suppose 𝑇(𝑜) is a lower bound on circuit complexity of 𝑔 3 Given a size- 𝑡 circuit, we may have to try all 2 3 < 2 ~ •€ (k) inputs As circuit complexity 𝑇(𝑜) increases, time complexity 2 ~ •€ (k) decreases Time complexity with respect to circuit size 𝑡

Proof Idea Pt. 1 If f is in SIZE ( 𝑇(𝑜) ), the data complexity of testing size- 𝑡 circuits for 𝑔 is at least 2 }(~ •€ k ) 1. Suppose a circuit 𝑑 of size 𝑇(𝑜) that computes 𝑔 2. Construct a circuit 𝑑 ƒ of size 𝑇’(𝑜) = 𝑇(𝑜) + 𝑜 that agrees with 𝑑 on all inputs except 𝑦 3. Thus, for any 𝑜 -input circuit of size 𝑇’ 𝑜 we must include all inputs of length 𝑜 ≥ 𝑇 Bc 𝑡 in a test set for 𝑔

How to not prove A Brief Introduction to Natural Proofs & - PowerPoint PPT Presentation

How to not prove A Brief Introduction to Natural Proofs & Data Complexity Shubhang Kulkarni and Ryan Davis Part I : Introduction to Natural Proofs Shubhang Kulkarni The obstacle is, roughly, that a large class of

Vanilla Meta-interpreter prove ( G ) is true when base-level body G is a logical consequence of

CS70: Lecture 2. Outline. Today: Proofs!!! 1. By Example (or Counterexample). 2. Direct. (Prove P

Proofs about functions Function consuming A is related to proof about A Q: How to prove two

A very elementary introduction to proofs Part 2 Example: Prove a function is not 1:1 By Dr.

NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR

Why Im NOT Why Im NOT Why Im NOT Why Im NOT a Hindu Why Im NOT a Hindu

JOI NT USE STRATEGY 2013 San Diego Unified School District 1 . Prop Z Opportunity I m prove

Small Dealer Symposium Strategic Transactions Mario Frankovich, CEO Burgeonvest Bick Securities

Foundation of proofs Jim Hefferon http://joshua.smcvt.edu/proofs The need to prove In

CS 497 Program Analysis Ond rej Lhot ak November 21 and 26, 2007 Program Analysis Prove

Mathematical Induction Examples Strong Induction Induction hypothesis: n k P(n) To prove

Fast Fourier Transform 2 Announcements HW 3 posted tonight (after this) 3 Fast Fourier

Structural recursively defined set R, prove P(b) for each base case b R Induction

What Works? (What Does Not)? in the control of violent crime and how do you prove it? A

How (Not) to Prove Theorems About Algorithms (or; fun with inductive types! ) Jack Crawford

Example 1.73 I Lets apply pumping lemma to prove that B = { 0 n 1 n | n 0 } is not regular

Formal Methods for Knowledge Management in Science Theorema Group RISC, Johannes Kepler

Proving discrimination: The shift of the burden of proof and access to evidence Rachel Crasnow

Enterprise Data Management (EDM) and Enterprise Product Generation (EPG) Proving Ground in the

Debugging of Model Transformations and Contracts in SyVOLT Bentley James Oakes , Clark Verbrugge,

Whitneys First Embedding Theorem Brahim Abdenbi abrahim.montreal@gmail.com Topology of

Financial Results FY 2017 Presentation 3 May 2018 Disclaimer This presentation (

A New Deployment of AGA 6 AGA 6 Refresher In Situ Proving of Gas Meters Challenges Of AGA 6

EU Settlement scheme Student Immigration Team Examination Schools 3 March 2020 Jo Aldhouse and