Algrabraic Complexity Theory Manindra Agrawal IIT Kanpur Symposium - - PowerPoint PPT Presentation

algrabraic complexity theory
SMART_READER_LITE
LIVE PREVIEW

Algrabraic Complexity Theory Manindra Agrawal IIT Kanpur Symposium - - PowerPoint PPT Presentation

Algrabraic Complexity Theory Manindra Agrawal IIT Kanpur Symposium on Learning, Algorithms and Complexity, IISc Bangalore 2015 ../IITK-Logo.jpg Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 1 / 73 Overview 1 Computation Over


slide-1
SLIDE 1

../IITK-Logo.jpg

Algrabraic Complexity Theory

Manindra Agrawal

IIT Kanpur

Symposium on Learning, Algorithms and Complexity, IISc Bangalore 2015

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 1 / 73

slide-2
SLIDE 2

../IITK-Logo.jpg

Overview

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 2 / 73

slide-3
SLIDE 3

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 3 / 73

slide-4
SLIDE 4

../IITK-Logo.jpg

Computation without Bits

An algorithm, in general, can use individual bits of the input in very complex ways. In particular, making execution decisions based on the values of a bit. Certain algorithms, however, use the individual bits in a much simpler way. Example: matrix multiplication. For [cij] = [aij] · [bij], we have: cij =

n−1

  • k=0

aikbkj. If we assume operations + and ∗ as primitives, and the input being a sequence of numbers denoting entries of matrices, then the algorithm does not need to access bit values.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 4 / 73

slide-5
SLIDE 5

../IITK-Logo.jpg

Computation without Bits

An algorithm, in general, can use individual bits of the input in very complex ways. In particular, making execution decisions based on the values of a bit. Certain algorithms, however, use the individual bits in a much simpler way. Example: matrix multiplication. For [cij] = [aij] · [bij], we have: cij =

n−1

  • k=0

aikbkj. If we assume operations + and ∗ as primitives, and the input being a sequence of numbers denoting entries of matrices, then the algorithm does not need to access bit values.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 4 / 73

slide-6
SLIDE 6

../IITK-Logo.jpg

Computation without Bits

An algorithm, in general, can use individual bits of the input in very complex ways. In particular, making execution decisions based on the values of a bit. Certain algorithms, however, use the individual bits in a much simpler way. Example: matrix multiplication. For [cij] = [aij] · [bij], we have: cij =

n−1

  • k=0

aikbkj. If we assume operations + and ∗ as primitives, and the input being a sequence of numbers denoting entries of matrices, then the algorithm does not need to access bit values.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 4 / 73

slide-7
SLIDE 7

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 5 / 73

slide-8
SLIDE 8

../IITK-Logo.jpg

Computation without Bits

We can formalize such computations as follows:

◮ Let R be a ring with operations + and ∗. ◮ Let the input be variables x1, x2, . . ., xn. ◮ An algorithm applies a sequence of ring operations on the input

variables and constants from R.

◮ The output is a polynomial in R[x1, x2, . . . , xn].

This is called arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 6 / 73

slide-9
SLIDE 9

../IITK-Logo.jpg

Computation without Bits

We can formalize such computations as follows:

◮ Let R be a ring with operations + and ∗. ◮ Let the input be variables x1, x2, . . ., xn. ◮ An algorithm applies a sequence of ring operations on the input

variables and constants from R.

◮ The output is a polynomial in R[x1, x2, . . . , xn].

This is called arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 6 / 73

slide-10
SLIDE 10

../IITK-Logo.jpg

Computation without Bits

We can formalize such computations as follows:

◮ Let R be a ring with operations + and ∗. ◮ Let the input be variables x1, x2, . . ., xn. ◮ An algorithm applies a sequence of ring operations on the input

variables and constants from R.

◮ The output is a polynomial in R[x1, x2, . . . , xn].

This is called arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 6 / 73

slide-11
SLIDE 11

../IITK-Logo.jpg

Computation without Bits

We can formalize such computations as follows:

◮ Let R be a ring with operations + and ∗. ◮ Let the input be variables x1, x2, . . ., xn. ◮ An algorithm applies a sequence of ring operations on the input

variables and constants from R.

◮ The output is a polynomial in R[x1, x2, . . . , xn].

This is called arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 6 / 73

slide-12
SLIDE 12

../IITK-Logo.jpg

An Example

+ ∗ + ∗ ∗ ∗ + ∗ u ∗ v + ∗ x ∗ y ∗ + ∗ ∗ −1 2 2 2 2 2 2 −1 inputs

  • utput = (ux + vy)2 + (vx − uy)2 − (u2 + v2) · (x2 + y2)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 7 / 73

slide-13
SLIDE 13

../IITK-Logo.jpg

Arithmetic Circuit Families

As in the boolean settings, arithmetic circuit model is a non-uniform model of computation. For each problem, one has, therefore, an infinite family of circuits computing its solution.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 8 / 73

slide-14
SLIDE 14

../IITK-Logo.jpg

Arithmetic Circuit Families

As in the boolean settings, arithmetic circuit model is a non-uniform model of computation. For each problem, one has, therefore, an infinite family of circuits computing its solution.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 8 / 73

slide-15
SLIDE 15

../IITK-Logo.jpg

Power of the Model

The model proposed by [Valiant 1979]. It can compute all of the following operations:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial, permanent

◮ Polynomial operations: addition, multiplication ◮ Multivariate polynomial factorization when the polynomial is fixed Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 9 / 73

slide-16
SLIDE 16

../IITK-Logo.jpg

Power of the Model

The model proposed by [Valiant 1979]. It can compute all of the following operations:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial, permanent

◮ Polynomial operations: addition, multiplication ◮ Multivariate polynomial factorization when the polynomial is fixed Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 9 / 73

slide-17
SLIDE 17

../IITK-Logo.jpg

Power of the Model

The model proposed by [Valiant 1979]. It can compute all of the following operations:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial, permanent

◮ Polynomial operations: addition, multiplication ◮ Multivariate polynomial factorization when the polynomial is fixed Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 9 / 73

slide-18
SLIDE 18

../IITK-Logo.jpg

Power of the Model

The model proposed by [Valiant 1979]. It can compute all of the following operations:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial, permanent

◮ Polynomial operations: addition, multiplication ◮ Multivariate polynomial factorization when the polynomial is fixed Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 9 / 73

slide-19
SLIDE 19

../IITK-Logo.jpg

Power of the Model

The model proposed by [Valiant 1979]. It can compute all of the following operations:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial, permanent

◮ Polynomial operations: addition, multiplication ◮ Multivariate polynomial factorization when the polynomial is fixed Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 9 / 73

slide-20
SLIDE 20

../IITK-Logo.jpg

Arithmetic Complexity

Crucial parameters associated with an arithmetic circuit are: Input length: number of input variables. Notice that the size of individual variables is not counted! Size: equals the number of operations in the circuit (measured as a function of input length). Depth: equals the length of the longest path from a variable to

  • utput of the circuit.

Degree: equals the formal degree of circuit defined inductively as: 1 for input variables, max for addition gates, and sum for multiplication gates. Fanin: equals the largest number of inputs to a gate in the circuit. We allow arbitrary fanin.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 10 / 73

slide-21
SLIDE 21

../IITK-Logo.jpg

Arithmetic Complexity

Crucial parameters associated with an arithmetic circuit are: Input length: number of input variables. Notice that the size of individual variables is not counted! Size: equals the number of operations in the circuit (measured as a function of input length). Depth: equals the length of the longest path from a variable to

  • utput of the circuit.

Degree: equals the formal degree of circuit defined inductively as: 1 for input variables, max for addition gates, and sum for multiplication gates. Fanin: equals the largest number of inputs to a gate in the circuit. We allow arbitrary fanin.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 10 / 73

slide-22
SLIDE 22

../IITK-Logo.jpg

Arithmetic Complexity

Crucial parameters associated with an arithmetic circuit are: Input length: number of input variables. Notice that the size of individual variables is not counted! Size: equals the number of operations in the circuit (measured as a function of input length). Depth: equals the length of the longest path from a variable to

  • utput of the circuit.

Degree: equals the formal degree of circuit defined inductively as: 1 for input variables, max for addition gates, and sum for multiplication gates. Fanin: equals the largest number of inputs to a gate in the circuit. We allow arbitrary fanin.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 10 / 73

slide-23
SLIDE 23

../IITK-Logo.jpg

Arithmetic Complexity

Crucial parameters associated with an arithmetic circuit are: Input length: number of input variables. Notice that the size of individual variables is not counted! Size: equals the number of operations in the circuit (measured as a function of input length). Depth: equals the length of the longest path from a variable to

  • utput of the circuit.

Degree: equals the formal degree of circuit defined inductively as: 1 for input variables, max for addition gates, and sum for multiplication gates. Fanin: equals the largest number of inputs to a gate in the circuit. We allow arbitrary fanin.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 10 / 73

slide-24
SLIDE 24

../IITK-Logo.jpg

Arithmetic Complexity

Crucial parameters associated with an arithmetic circuit are: Input length: number of input variables. Notice that the size of individual variables is not counted! Size: equals the number of operations in the circuit (measured as a function of input length). Depth: equals the length of the longest path from a variable to

  • utput of the circuit.

Degree: equals the formal degree of circuit defined inductively as: 1 for input variables, max for addition gates, and sum for multiplication gates. Fanin: equals the largest number of inputs to a gate in the circuit. We allow arbitrary fanin.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 10 / 73

slide-25
SLIDE 25

../IITK-Logo.jpg

Circuit Parameters

+ ∗ + ∗ ∗ ∗ + ∗ u ∗ v + ∗ x ∗ y ∗ + ∗ ∗ −1 2 2 2 2 2 2 −1 SIZE = 16 DEPTH = 4 DEGREE = 4 FANIN = 3

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 11 / 73

slide-26
SLIDE 26

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 12 / 73

slide-27
SLIDE 27

../IITK-Logo.jpg

Extension with Zero-test

Many other algebraic operations cannot be computed in arithmetic circuit model: solving system of linear equations, rank of a matrix, gcd of polynomials, primality testing . . . Generalize the model by including another operation: zero-test.

◮ This is a branching operation: check if the input is zero; if yes do A

else do B.

All the above operations can be computed in the new model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 13 / 73

slide-28
SLIDE 28

../IITK-Logo.jpg

Extension with Zero-test

Many other algebraic operations cannot be computed in arithmetic circuit model: solving system of linear equations, rank of a matrix, gcd of polynomials, primality testing . . . Generalize the model by including another operation: zero-test.

◮ This is a branching operation: check if the input is zero; if yes do A

else do B.

All the above operations can be computed in the new model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 13 / 73

slide-29
SLIDE 29

../IITK-Logo.jpg

Extension with Zero-test

Many other algebraic operations cannot be computed in arithmetic circuit model: solving system of linear equations, rank of a matrix, gcd of polynomials, primality testing . . . Generalize the model by including another operation: zero-test.

◮ This is a branching operation: check if the input is zero; if yes do A

else do B.

All the above operations can be computed in the new model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 13 / 73

slide-30
SLIDE 30

../IITK-Logo.jpg

BSS Model

The generalized model can still not compute simple functions, e.g., ”Is x < y?” [Blum-Shub-Smale 1989] replaced zero-test with ≤ operator.

◮ The operator makes sense only in rings with a total ordering, e.g., Z,

Q, R.

They showed that the model, for R = Z or Q restores access to bits, and is therefore equivalent to the standard boolean model. For R = R, they developed a new theory of complexity. We will not consider this model any further.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 14 / 73

slide-31
SLIDE 31

../IITK-Logo.jpg

BSS Model

The generalized model can still not compute simple functions, e.g., ”Is x < y?” [Blum-Shub-Smale 1989] replaced zero-test with ≤ operator.

◮ The operator makes sense only in rings with a total ordering, e.g., Z,

Q, R.

They showed that the model, for R = Z or Q restores access to bits, and is therefore equivalent to the standard boolean model. For R = R, they developed a new theory of complexity. We will not consider this model any further.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 14 / 73

slide-32
SLIDE 32

../IITK-Logo.jpg

BSS Model

The generalized model can still not compute simple functions, e.g., ”Is x < y?” [Blum-Shub-Smale 1989] replaced zero-test with ≤ operator.

◮ The operator makes sense only in rings with a total ordering, e.g., Z,

Q, R.

They showed that the model, for R = Z or Q restores access to bits, and is therefore equivalent to the standard boolean model. For R = R, they developed a new theory of complexity. We will not consider this model any further.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 14 / 73

slide-33
SLIDE 33

../IITK-Logo.jpg

BSS Model

The generalized model can still not compute simple functions, e.g., ”Is x < y?” [Blum-Shub-Smale 1989] replaced zero-test with ≤ operator.

◮ The operator makes sense only in rings with a total ordering, e.g., Z,

Q, R.

They showed that the model, for R = Z or Q restores access to bits, and is therefore equivalent to the standard boolean model. For R = R, they developed a new theory of complexity. We will not consider this model any further.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 14 / 73

slide-34
SLIDE 34

../IITK-Logo.jpg

BSS Model

The generalized model can still not compute simple functions, e.g., ”Is x < y?” [Blum-Shub-Smale 1989] replaced zero-test with ≤ operator.

◮ The operator makes sense only in rings with a total ordering, e.g., Z,

Q, R.

They showed that the model, for R = Z or Q restores access to bits, and is therefore equivalent to the standard boolean model. For R = R, they developed a new theory of complexity. We will not consider this model any further.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 14 / 73

slide-35
SLIDE 35

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 15 / 73

slide-36
SLIDE 36

../IITK-Logo.jpg

The Class P

For both the models, the class P can be defined in an analogous way to boolean settings: all problems that can be solved by a circuit family of polynomial size. In the arithmetic circuit model, a problem is simply a family of polynomials, typically parameterized by the number of variables, or degree, or both:

◮ Chebyshev polynomials

Td(x) =

⌊d/2⌋

  • k=0

d 2k

  • (x2 − 1)kxd−2k

by degree,

◮ Determinant polynomial by number of variables, and ◮ Elementary symmetric polynomials

Sd(x1, x2, . . . , xn) =

  • I⊆[1,n],|I|=d
  • j∈I

xj, by both degree and number of variables.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 16 / 73

slide-37
SLIDE 37

../IITK-Logo.jpg

The Class P

For both the models, the class P can be defined in an analogous way to boolean settings: all problems that can be solved by a circuit family of polynomial size. In the arithmetic circuit model, a problem is simply a family of polynomials, typically parameterized by the number of variables, or degree, or both:

◮ Chebyshev polynomials

Td(x) =

⌊d/2⌋

  • k=0

d 2k

  • (x2 − 1)kxd−2k

by degree,

◮ Determinant polynomial by number of variables, and ◮ Elementary symmetric polynomials

Sd(x1, x2, . . . , xn) =

  • I⊆[1,n],|I|=d
  • j∈I

xj, by both degree and number of variables.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 16 / 73

slide-38
SLIDE 38

../IITK-Logo.jpg

The Class P

For both the models, the class P can be defined in an analogous way to boolean settings: all problems that can be solved by a circuit family of polynomial size. In the arithmetic circuit model, a problem is simply a family of polynomials, typically parameterized by the number of variables, or degree, or both:

◮ Chebyshev polynomials

Td(x) =

⌊d/2⌋

  • k=0

d 2k

  • (x2 − 1)kxd−2k

by degree,

◮ Determinant polynomial by number of variables, and ◮ Elementary symmetric polynomials

Sd(x1, x2, . . . , xn) =

  • I⊆[1,n],|I|=d
  • j∈I

xj, by both degree and number of variables.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 16 / 73

slide-39
SLIDE 39

../IITK-Logo.jpg

The Class P

For both the models, the class P can be defined in an analogous way to boolean settings: all problems that can be solved by a circuit family of polynomial size. In the arithmetic circuit model, a problem is simply a family of polynomials, typically parameterized by the number of variables, or degree, or both:

◮ Chebyshev polynomials

Td(x) =

⌊d/2⌋

  • k=0

d 2k

  • (x2 − 1)kxd−2k

by degree,

◮ Determinant polynomial by number of variables, and ◮ Elementary symmetric polynomials

Sd(x1, x2, . . . , xn) =

  • I⊆[1,n],|I|=d
  • j∈I

xj, by both degree and number of variables.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 16 / 73

slide-40
SLIDE 40

../IITK-Logo.jpg

The Class P

For both the models, the class P can be defined in an analogous way to boolean settings: all problems that can be solved by a circuit family of polynomial size. In the arithmetic circuit model, a problem is simply a family of polynomials, typically parameterized by the number of variables, or degree, or both:

◮ Chebyshev polynomials

Td(x) =

⌊d/2⌋

  • k=0

d 2k

  • (x2 − 1)kxd−2k

by degree,

◮ Determinant polynomial by number of variables, and ◮ Elementary symmetric polynomials

Sd(x1, x2, . . . , xn) =

  • I⊆[1,n],|I|=d
  • j∈I

xj, by both degree and number of variables.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 16 / 73

slide-41
SLIDE 41

../IITK-Logo.jpg

Examples

In the arithmetic circuit model, the following problems are in P:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial

◮ Polynomial operations: addition, multiplication, elementary symmetric

polynomials

◮ Multivariate polynomial factorization when the polynomial is fixed ◮ In the arithmetic circuits with zero-test model, the following problems

are also in P: solving a system of linear equations, rank of a matrix, gcd of polynomials, primality testing.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 17 / 73

slide-42
SLIDE 42

../IITK-Logo.jpg

Examples

In the arithmetic circuit model, the following problems are in P:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial

◮ Polynomial operations: addition, multiplication, elementary symmetric

polynomials

◮ Multivariate polynomial factorization when the polynomial is fixed ◮ In the arithmetic circuits with zero-test model, the following problems

are also in P: solving a system of linear equations, rank of a matrix, gcd of polynomials, primality testing.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 17 / 73

slide-43
SLIDE 43

../IITK-Logo.jpg

Examples

In the arithmetic circuit model, the following problems are in P:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial

◮ Polynomial operations: addition, multiplication, elementary symmetric

polynomials

◮ Multivariate polynomial factorization when the polynomial is fixed ◮ In the arithmetic circuits with zero-test model, the following problems

are also in P: solving a system of linear equations, rank of a matrix, gcd of polynomials, primality testing.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 17 / 73

slide-44
SLIDE 44

../IITK-Logo.jpg

Examples

In the arithmetic circuit model, the following problems are in P:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial

◮ Polynomial operations: addition, multiplication, elementary symmetric

polynomials

◮ Multivariate polynomial factorization when the polynomial is fixed ◮ In the arithmetic circuits with zero-test model, the following problems

are also in P: solving a system of linear equations, rank of a matrix, gcd of polynomials, primality testing.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 17 / 73

slide-45
SLIDE 45

../IITK-Logo.jpg

Examples

In the arithmetic circuit model, the following problems are in P:

◮ Matrix operations: addition, multiplication, determinant, inverse,

characteristic polynomial

◮ Polynomial operations: addition, multiplication, elementary symmetric

polynomials

◮ Multivariate polynomial factorization when the polynomial is fixed ◮ In the arithmetic circuits with zero-test model, the following problems

are also in P: solving a system of linear equations, rank of a matrix, gcd of polynomials, primality testing.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 17 / 73

slide-46
SLIDE 46

../IITK-Logo.jpg

A Poor Definition of NP

Analogous definition of NP to the boolean settings fails. Consider arithmetic circuit model, where each computation results in a polynomial, over R = C. Say polynomial family Pn(x1, . . . , xn) is in NP if there exists another polynomial family Qn+m+1(x1, . . . , xn, y1, . . . , ym, z) in P such that:

◮ m = nO(1), and ◮ Pn(α1, . . . , αn) = γ iff there exists β1, . . ., βm with

Qn+m+1(α1, . . . , αn, β1, . . . , βm, γ) = 0.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 18 / 73

slide-47
SLIDE 47

../IITK-Logo.jpg

A Poor Definition of NP

Analogous definition of NP to the boolean settings fails. Consider arithmetic circuit model, where each computation results in a polynomial, over R = C. Say polynomial family Pn(x1, . . . , xn) is in NP if there exists another polynomial family Qn+m+1(x1, . . . , xn, y1, . . . , ym, z) in P such that:

◮ m = nO(1), and ◮ Pn(α1, . . . , αn) = γ iff there exists β1, . . ., βm with

Qn+m+1(α1, . . . , αn, β1, . . . , βm, γ) = 0.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 18 / 73

slide-48
SLIDE 48

../IITK-Logo.jpg

A Poor Definition of NP

Analogous definition of NP to the boolean settings fails. Consider arithmetic circuit model, where each computation results in a polynomial, over R = C. Say polynomial family Pn(x1, . . . , xn) is in NP if there exists another polynomial family Qn+m+1(x1, . . . , xn, y1, . . . , ym, z) in P such that:

◮ m = nO(1), and ◮ Pn(α1, . . . , αn) = γ iff there exists β1, . . ., βm with

Qn+m+1(α1, . . . , αn, β1, . . . , βm, γ) = 0.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 18 / 73

slide-49
SLIDE 49

../IITK-Logo.jpg

A Poor Definition of NP

By definition, Qn+m+1(α1, . . . , αn, y1, . . . , ym, z) = 0 iff z = γ. Therefore, Qn+m+1(α1, . . . , αn, y1, . . . , ym, z) = δ · (z − γ)t, t > 0. Since this is true for all α1, . . ., αn, we can reset Qn+m+1 to Qn+m+1(α1, . . . , αn, 0, . . . , 0, z).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 19 / 73

slide-50
SLIDE 50

../IITK-Logo.jpg

A Poor Definition of NP

By definition, Qn+m+1(α1, . . . , αn, y1, . . . , ym, z) = 0 iff z = γ. Therefore, Qn+m+1(α1, . . . , αn, y1, . . . , ym, z) = δ · (z − γ)t, t > 0. Since this is true for all α1, . . ., αn, we can reset Qn+m+1 to Qn+m+1(α1, . . . , αn, 0, . . . , 0, z).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 19 / 73

slide-51
SLIDE 51

../IITK-Logo.jpg

A Poor Definition of NP

By definition, Qn+m+1(α1, . . . , αn, y1, . . . , ym, z) = 0 iff z = γ. Therefore, Qn+m+1(α1, . . . , αn, y1, . . . , ym, z) = δ · (z − γ)t, t > 0. Since this is true for all α1, . . ., αn, we can reset Qn+m+1 to Qn+m+1(α1, . . . , αn, 0, . . . , 0, z).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 19 / 73

slide-52
SLIDE 52

../IITK-Logo.jpg

A Better Definition of NP

The Class NP [Valiant 1979]

Polynomial family {Pn} is in NP if there exists a family {Pn+m} ∈ P such that m = nO(1), and for every n: Pn(x1, . . . , xn) =

  • y1∈{0,1}

· · ·

  • ym∈{0,1}

Qn+m(x1, . . . , xn, y1, . . . , ym).

1 Here 0 and 1 are identities of R. 2 The definition can be easily generalized to arithmetic circuit with

zero-test model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 20 / 73

slide-53
SLIDE 53

../IITK-Logo.jpg

Examples

All problems in P, Permanent family, Jones polynomials: representing invariants of knots, Tutte polynomials: TG(x, y) =

  • A⊆E

(x − 1)k(A)−k(E)(y − 1)k(A)+|A|−|V | where G = (V , E) is an undirected graph and k(A) is the number of connected components in the subgraph (V , A).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 21 / 73

slide-54
SLIDE 54

../IITK-Logo.jpg

Examples

All problems in P, Permanent family, Jones polynomials: representing invariants of knots, Tutte polynomials: TG(x, y) =

  • A⊆E

(x − 1)k(A)−k(E)(y − 1)k(A)+|A|−|V | where G = (V , E) is an undirected graph and k(A) is the number of connected components in the subgraph (V , A).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 21 / 73

slide-55
SLIDE 55

../IITK-Logo.jpg

Examples

All problems in P, Permanent family, Jones polynomials: representing invariants of knots, Tutte polynomials: TG(x, y) =

  • A⊆E

(x − 1)k(A)−k(E)(y − 1)k(A)+|A|−|V | where G = (V , E) is an undirected graph and k(A) is the number of connected components in the subgraph (V , A).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 21 / 73

slide-56
SLIDE 56

../IITK-Logo.jpg

Examples

All problems in P, Permanent family, Jones polynomials: representing invariants of knots, Tutte polynomials: TG(x, y) =

  • A⊆E

(x − 1)k(A)−k(E)(y − 1)k(A)+|A|−|V | where G = (V , E) is an undirected graph and k(A) is the number of connected components in the subgraph (V , A).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 21 / 73

slide-57
SLIDE 57

../IITK-Logo.jpg

NP-complete Problems

Theorem [Valient 1979]

Computing permanent family is complete for NP in arithmetic circuit model: for every polynomial family {Qn} in NP, for every n, Qn can be expressed as permanent of a nO(1)-size matrix with variable and constant entries. Several other polynomial families are also NP-complete: Jones polynomials, Tutte polynomials, matching polynomial etc.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 22 / 73

slide-58
SLIDE 58

../IITK-Logo.jpg

NP-complete Problems

Theorem [Valient 1979]

Computing permanent family is complete for NP in arithmetic circuit model: for every polynomial family {Qn} in NP, for every n, Qn can be expressed as permanent of a nO(1)-size matrix with variable and constant entries. Several other polynomial families are also NP-complete: Jones polynomials, Tutte polynomials, matching polynomial etc.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 22 / 73

slide-59
SLIDE 59

../IITK-Logo.jpg

Is P = NP?

The classes P and NP of arithmetic circuit model roughly correspond to computing the boolean classes #L and #P respectively:

◮ Permanent is complete for #P in boolean model and for NP in

arithmetic circuit model.

◮ Determinant is complete for #L in boolean model and for P under

quasi-polynomial size reductions in arithmetic circuit model.

Therefore, it is a weaker question that P = NP in boolean model: If P = NP in boolean model then P = NP in arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 23 / 73

slide-60
SLIDE 60

../IITK-Logo.jpg

Is P = NP?

The classes P and NP of arithmetic circuit model roughly correspond to computing the boolean classes #L and #P respectively:

◮ Permanent is complete for #P in boolean model and for NP in

arithmetic circuit model.

◮ Determinant is complete for #L in boolean model and for P under

quasi-polynomial size reductions in arithmetic circuit model.

Therefore, it is a weaker question that P = NP in boolean model: If P = NP in boolean model then P = NP in arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 23 / 73

slide-61
SLIDE 61

../IITK-Logo.jpg

Is P = NP?

The classes P and NP of arithmetic circuit model roughly correspond to computing the boolean classes #L and #P respectively:

◮ Permanent is complete for #P in boolean model and for NP in

arithmetic circuit model.

◮ Determinant is complete for #L in boolean model and for P under

quasi-polynomial size reductions in arithmetic circuit model.

Therefore, it is a weaker question that P = NP in boolean model: If P = NP in boolean model then P = NP in arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 23 / 73

slide-62
SLIDE 62

../IITK-Logo.jpg

Is P = NP?

The classes P and NP of arithmetic circuit model roughly correspond to computing the boolean classes #L and #P respectively:

◮ Permanent is complete for #P in boolean model and for NP in

arithmetic circuit model.

◮ Determinant is complete for #L in boolean model and for P under

quasi-polynomial size reductions in arithmetic circuit model.

Therefore, it is a weaker question that P = NP in boolean model: If P = NP in boolean model then P = NP in arithmetic circuit model.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 23 / 73

slide-63
SLIDE 63

../IITK-Logo.jpg

Is P = NP?

Even for arithmetic circuit model, proving P = NP has been very challenging, and has remained a hypothesis. Henceforth, we restrict ourselves to the arithmetic model of computation. For arithmetic circuit model, the classes P and NP are called VP and VNP: named after Valiant. Over the years, this problem has become one of the most active areas

  • f research in complexity theory.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 24 / 73

slide-64
SLIDE 64

../IITK-Logo.jpg

Is P = NP?

Even for arithmetic circuit model, proving P = NP has been very challenging, and has remained a hypothesis. Henceforth, we restrict ourselves to the arithmetic model of computation. For arithmetic circuit model, the classes P and NP are called VP and VNP: named after Valiant. Over the years, this problem has become one of the most active areas

  • f research in complexity theory.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 24 / 73

slide-65
SLIDE 65

../IITK-Logo.jpg

Is P = NP?

Even for arithmetic circuit model, proving P = NP has been very challenging, and has remained a hypothesis. Henceforth, we restrict ourselves to the arithmetic model of computation. For arithmetic circuit model, the classes P and NP are called VP and VNP: named after Valiant. Over the years, this problem has become one of the most active areas

  • f research in complexity theory.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 24 / 73

slide-66
SLIDE 66

../IITK-Logo.jpg

Is P = NP?

Even for arithmetic circuit model, proving P = NP has been very challenging, and has remained a hypothesis. Henceforth, we restrict ourselves to the arithmetic model of computation. For arithmetic circuit model, the classes P and NP are called VP and VNP: named after Valiant. Over the years, this problem has become one of the most active areas

  • f research in complexity theory.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 24 / 73

slide-67
SLIDE 67

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 25 / 73

slide-68
SLIDE 68

../IITK-Logo.jpg

Reducing Depth to O(log d)

Theorem (Valiant-Skyum-Berkowitz-Rackoff, 1983)

If polynomial P(x1, . . . , xn) of degree d is computable by an arithmetic circuit of size s ≥ n, then it can also be computed by an arithmetic circuit

  • f size sO(1) whose depth is O(log d) and fanin of multiplication gates is

two. Another construction was given by [Allender-Jiao-Mahajan-Vinay 1994].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 26 / 73

slide-69
SLIDE 69

../IITK-Logo.jpg

Reducing Depth to 4

Theorem (A-Vinay 2008)

If polynomial P(x1, . . . , xn) of degree d is computable by an arithmetic circuit of size s = 2o(d+d log n

d ), then it can also be computed by an

arithmetic circuit of size sO(1) of depth 4. Extended by [Koiran 2012, Tavenas 2013].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 27 / 73

slide-70
SLIDE 70

../IITK-Logo.jpg

Proof

Let the polynomial P(x1, . . . , xn) be computed by an arithmetic circuit C

  • f size t = 2o(d+d log n

d ).

[Allender-Jiao-Mahajan-Vinay 1994] shows that C can be transformed to a circuit D of degree d, size tO(1) and depth O(log d) with multiplication gates of fanin two. We modify this transformation slightly to obtain a circuit D of degree d, size tO(1) and depth ≤ 2 log d with multiplication gates of fanin ≤ 6. Further, the circuit D consists of alternating layers of addition and multiplication gates. We now describe the construction of the circuit D.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 28 / 73

slide-71
SLIDE 71

../IITK-Logo.jpg

Proof

Let the polynomial P(x1, . . . , xn) be computed by an arithmetic circuit C

  • f size t = 2o(d+d log n

d ).

[Allender-Jiao-Mahajan-Vinay 1994] shows that C can be transformed to a circuit D of degree d, size tO(1) and depth O(log d) with multiplication gates of fanin two. We modify this transformation slightly to obtain a circuit D of degree d, size tO(1) and depth ≤ 2 log d with multiplication gates of fanin ≤ 6. Further, the circuit D consists of alternating layers of addition and multiplication gates. We now describe the construction of the circuit D.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 28 / 73

slide-72
SLIDE 72

../IITK-Logo.jpg

Proof

Let the polynomial P(x1, . . . , xn) be computed by an arithmetic circuit C

  • f size t = 2o(d+d log n

d ).

[Allender-Jiao-Mahajan-Vinay 1994] shows that C can be transformed to a circuit D of degree d, size tO(1) and depth O(log d) with multiplication gates of fanin two. We modify this transformation slightly to obtain a circuit D of degree d, size tO(1) and depth ≤ 2 log d with multiplication gates of fanin ≤ 6. Further, the circuit D consists of alternating layers of addition and multiplication gates. We now describe the construction of the circuit D.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 28 / 73

slide-73
SLIDE 73

../IITK-Logo.jpg

Proof

Let the polynomial P(x1, . . . , xn) be computed by an arithmetic circuit C

  • f size t = 2o(d+d log n

d ).

[Allender-Jiao-Mahajan-Vinay 1994] shows that C can be transformed to a circuit D of degree d, size tO(1) and depth O(log d) with multiplication gates of fanin two. We modify this transformation slightly to obtain a circuit D of degree d, size tO(1) and depth ≤ 2 log d with multiplication gates of fanin ≤ 6. Further, the circuit D consists of alternating layers of addition and multiplication gates. We now describe the construction of the circuit D.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 28 / 73

slide-74
SLIDE 74

../IITK-Logo.jpg

Construction of D: Setup

Make the circuit C layered with alternating layers of addition and multiplication gates. Make fanin of every multiplication gate two. Rearrange children of multiplication gates so that degree of the right child is greater than or equal to the degree of the left child.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 29 / 73

slide-75
SLIDE 75

../IITK-Logo.jpg

Construction of D: Setup

Make the circuit C layered with alternating layers of addition and multiplication gates. Make fanin of every multiplication gate two. Rearrange children of multiplication gates so that degree of the right child is greater than or equal to the degree of the left child.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 29 / 73

slide-76
SLIDE 76

../IITK-Logo.jpg

Construction of D: Setup

Make the circuit C layered with alternating layers of addition and multiplication gates. Make fanin of every multiplication gate two. Rearrange children of multiplication gates so that degree of the right child is greater than or equal to the degree of the left child.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 29 / 73

slide-77
SLIDE 77

../IITK-Logo.jpg

Construction of D: Proof Trees

A proof tree rooted at gate g of circuit C is a subcircuit of C obtained as follows: Start with the subcircuit of C that has gate g at the top and computes the polynomial at gate g. For every +-gate in the subcircuit, retain only one input to the gate deleting the remaining input lines. For every ∗-gate in the subcircuit, retain both the inputs to the gate. A proof tree rooted at gate g computes a monomial and the polynomial at g is the sum over monomials computed by all proof tress rooted at g.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 30 / 73

slide-78
SLIDE 78

../IITK-Logo.jpg

Construction of D: Proof Trees

A proof tree rooted at gate g of circuit C is a subcircuit of C obtained as follows: Start with the subcircuit of C that has gate g at the top and computes the polynomial at gate g. For every +-gate in the subcircuit, retain only one input to the gate deleting the remaining input lines. For every ∗-gate in the subcircuit, retain both the inputs to the gate. A proof tree rooted at gate g computes a monomial and the polynomial at g is the sum over monomials computed by all proof tress rooted at g.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 30 / 73

slide-79
SLIDE 79

../IITK-Logo.jpg

Construction of D: Proof Trees

A proof tree rooted at gate g of circuit C is a subcircuit of C obtained as follows: Start with the subcircuit of C that has gate g at the top and computes the polynomial at gate g. For every +-gate in the subcircuit, retain only one input to the gate deleting the remaining input lines. For every ∗-gate in the subcircuit, retain both the inputs to the gate. A proof tree rooted at gate g computes a monomial and the polynomial at g is the sum over monomials computed by all proof tress rooted at g.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 30 / 73

slide-80
SLIDE 80

../IITK-Logo.jpg

Construction of D: Defining Intermediate Polynomials

For every input variable xi, let [xi] stand for the polynomial xi. For every gate g of C, let [g] stand for polynomial computed at gate g. For every pair of gates g and h of C, let [g, h] be the polynomial: [g, h] =

  • T

m(T, h) where T runs over all proof trees rooted at g and m(T, h) is the monomial computed by proof tree T when gate h is replaced by 1 if gate h occurs in the rightmopst path of T, m(T, h) is 0 otherwise. It follows that [g] =

n

  • i=1

[g, xi][xi].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 31 / 73

slide-81
SLIDE 81

../IITK-Logo.jpg

Construction of D: Defining Intermediate Polynomials

For every input variable xi, let [xi] stand for the polynomial xi. For every gate g of C, let [g] stand for polynomial computed at gate g. For every pair of gates g and h of C, let [g, h] be the polynomial: [g, h] =

  • T

m(T, h) where T runs over all proof trees rooted at g and m(T, h) is the monomial computed by proof tree T when gate h is replaced by 1 if gate h occurs in the rightmopst path of T, m(T, h) is 0 otherwise. It follows that [g] =

n

  • i=1

[g, xi][xi].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 31 / 73

slide-82
SLIDE 82

../IITK-Logo.jpg

Construction of D: Defining Intermediate Polynomials

For every input variable xi, let [xi] stand for the polynomial xi. For every gate g of C, let [g] stand for polynomial computed at gate g. For every pair of gates g and h of C, let [g, h] be the polynomial: [g, h] =

  • T

m(T, h) where T runs over all proof trees rooted at g and m(T, h) is the monomial computed by proof tree T when gate h is replaced by 1 if gate h occurs in the rightmopst path of T, m(T, h) is 0 otherwise. It follows that [g] =

n

  • i=1

[g, xi][xi].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 31 / 73

slide-83
SLIDE 83

../IITK-Logo.jpg

Construction of D: Defining Intermediate Polynomials

For every input variable xi, let [xi] stand for the polynomial xi. For every gate g of C, let [g] stand for polynomial computed at gate g. For every pair of gates g and h of C, let [g, h] be the polynomial: [g, h] =

  • T

m(T, h) where T runs over all proof trees rooted at g and m(T, h) is the monomial computed by proof tree T when gate h is replaced by 1 if gate h occurs in the rightmopst path of T, m(T, h) is 0 otherwise. It follows that [g] =

n

  • i=1

[g, xi][xi].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 31 / 73

slide-84
SLIDE 84

../IITK-Logo.jpg

Construction of D: Defining gates [g, h]

If g is a +-gate with children g1, . . ., gt, then [g, h] =

t

  • i=1

[gi, h]. Let g be a ∗-gate with children gL (left child) and gR (right child). A rightmost path from g to h is a path from g to h in the circuit

  • btained from C by deleting input line from left child of every ∗-gate.

If there are only +-gates on every rightmost path from g to h then [g, h] = [gL].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 32 / 73

slide-85
SLIDE 85

../IITK-Logo.jpg

Construction of D: Defining gates [g, h]

If g is a +-gate with children g1, . . ., gt, then [g, h] =

t

  • i=1

[gi, h]. Let g be a ∗-gate with children gL (left child) and gR (right child). A rightmost path from g to h is a path from g to h in the circuit

  • btained from C by deleting input line from left child of every ∗-gate.

If there are only +-gates on every rightmost path from g to h then [g, h] = [gL].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 32 / 73

slide-86
SLIDE 86

../IITK-Logo.jpg

Construction of D: Defining gates [g, h]

If g is a +-gate with children g1, . . ., gt, then [g, h] =

t

  • i=1

[gi, h]. Let g be a ∗-gate with children gL (left child) and gR (right child). A rightmost path from g to h is a path from g to h in the circuit

  • btained from C by deleting input line from left child of every ∗-gate.

If there are only +-gates on every rightmost path from g to h then [g, h] = [gL].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 32 / 73

slide-87
SLIDE 87

../IITK-Logo.jpg

Construction of D: Defining [g, h]

Otherwise, there exists a ∗-gate p with children pL and pR in a rightmost path from g to h such that deg(p) ≥ 1

2(deg(g) + deg(h)) > deg(pR).

Then, we have: [g, h] =

  • p

[g, p] · [pL] · [pR, h] where the sum ranges over all gates p satisfying the above condition. deg(g) stands for degree of gate g

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 33 / 73

slide-88
SLIDE 88

../IITK-Logo.jpg

Construction of D: Defining [g, h]

Otherwise, there exists a ∗-gate p with children pL and pR in a rightmost path from g to h such that deg(p) ≥ 1

2(deg(g) + deg(h)) > deg(pR).

Then, we have: [g, h] =

  • p

[g, p] · [pL] · [pR, h] where the sum ranges over all gates p satisfying the above condition. deg(g) stands for degree of gate g

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 33 / 73

slide-89
SLIDE 89

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-90
SLIDE 90

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-91
SLIDE 91

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([g, p]) ≤

1 2(deg(g) − deg(h))

deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-92
SLIDE 92

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pR, h] ≤

1 2(deg(g) − deg(h))

deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-93
SLIDE 93

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-94
SLIDE 94

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-95
SLIDE 95

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-96
SLIDE 96

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-97
SLIDE 97

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-98
SLIDE 98

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-99
SLIDE 99

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s

+ [g, p] + [pL] + [pR, h] ∗ ∗ + [pL, xi]

xi xi’s

+ + [pj

L, xi]

j’s

∗ ∗

q’s

+ [pj

L, q]

+ [qL] + [qR, xi] [g, h] =

p[g, p][pL][pR, h].

deg([g, h]) = deg(g) − deg(h) deg([pL]) ≤ deg(g) − deg(h) [pL] =

i[pL, xi][xi],

pL =

j pj L.

[pj

L, xi] = q[pj L, q][qL][qR, xi].

deg([pj

L, q] ≤ 1 2 deg(pL)

deg([qL] ≤ 1

2 deg(pL)

deg([qR, xi] ≤ 1

2 deg(pL)

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 34 / 73

slide-100
SLIDE 100

../IITK-Logo.jpg

Construction of D: Defining [g, h]

+ [g, h] ∗ ∗

p’s, xi’s, j’s, q’s

+

[g, p]

+

[pj

L, q]

+

[qL]

+

[qR, xi]

+

[xi]

+

[pR, h]

Flatten the subcircuit to write [g, h] as: [g, h] =

  • p
  • i
  • j
  • q

[g, p][pL,j, q][qL][qR, xi][xi][pR, h] with degree of each of the six polynomials in the product bounded by 1

2 deg([g, h]).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 35 / 73

slide-101
SLIDE 101

../IITK-Logo.jpg

Construction of D

By adding dummy +-gates and merging adjacent +-gates, it can be ensured that the circuit has alternating layers of +- and ∗-gates. The size of resulting circuit is tO(1). Since the degree of children of a ∗-gate is at most half of the degree

  • f the gate, the depth of the circuit D is ≤ 2 log d.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 36 / 73

slide-102
SLIDE 102

../IITK-Logo.jpg

Construction of D

By adding dummy +-gates and merging adjacent +-gates, it can be ensured that the circuit has alternating layers of +- and ∗-gates. The size of resulting circuit is tO(1). Since the degree of children of a ∗-gate is at most half of the degree

  • f the gate, the depth of the circuit D is ≤ 2 log d.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 36 / 73

slide-103
SLIDE 103

../IITK-Logo.jpg

Replacing D

We now replace D by a depth four circuit. The circuit is defined by cutting D in two halves and replacing each half by a depth two circuit.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 37 / 73

slide-104
SLIDE 104

../IITK-Logo.jpg

Replacing D

We now replace D by a depth four circuit. The circuit is defined by cutting D in two halves and replacing each half by a depth two circuit.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 37 / 73

slide-105
SLIDE 105

../IITK-Logo.jpg

Cutting D

Let ℓ be any function such that ℓ ≤

d+d log d

n

log t

and ℓ = ω(1). Let u = 1

2 log6 ℓ.

Cut D into two halves with top half consisting of u layers of ∗-gates with the bottom layer being of ∗-gates. Let g1, g2, . . ., gk be the output gates of the bottom layer. Let the polynomial computed by gate gi be Pi(x1, x2, . . . , xn). The top layer can be viewed as computing a polynomial in k new variables; let this be P0(y1, y2, . . . , yk). Then: P(x1, . . . , xn) = P0(P1(x1, . . . , xn), P2(x1, . . . , xn), . . . , Pk(x1, . . . , xn)).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 38 / 73

slide-106
SLIDE 106

../IITK-Logo.jpg

Cutting D

Let ℓ be any function such that ℓ ≤

d+d log d

n

log t

and ℓ = ω(1). Let u = 1

2 log6 ℓ.

Cut D into two halves with top half consisting of u layers of ∗-gates with the bottom layer being of ∗-gates. Let g1, g2, . . ., gk be the output gates of the bottom layer. Let the polynomial computed by gate gi be Pi(x1, x2, . . . , xn). The top layer can be viewed as computing a polynomial in k new variables; let this be P0(y1, y2, . . . , yk). Then: P(x1, . . . , xn) = P0(P1(x1, . . . , xn), P2(x1, . . . , xn), . . . , Pk(x1, . . . , xn)).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 38 / 73

slide-107
SLIDE 107

../IITK-Logo.jpg

Cutting D

Let ℓ be any function such that ℓ ≤

d+d log d

n

log t

and ℓ = ω(1). Let u = 1

2 log6 ℓ.

Cut D into two halves with top half consisting of u layers of ∗-gates with the bottom layer being of ∗-gates. Let g1, g2, . . ., gk be the output gates of the bottom layer. Let the polynomial computed by gate gi be Pi(x1, x2, . . . , xn). The top layer can be viewed as computing a polynomial in k new variables; let this be P0(y1, y2, . . . , yk). Then: P(x1, . . . , xn) = P0(P1(x1, . . . , xn), P2(x1, . . . , xn), . . . , Pk(x1, . . . , xn)).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 38 / 73

slide-108
SLIDE 108

../IITK-Logo.jpg

Cutting D

Let ℓ be any function such that ℓ ≤

d+d log d

n

log t

and ℓ = ω(1). Let u = 1

2 log6 ℓ.

Cut D into two halves with top half consisting of u layers of ∗-gates with the bottom layer being of ∗-gates. Let g1, g2, . . ., gk be the output gates of the bottom layer. Let the polynomial computed by gate gi be Pi(x1, x2, . . . , xn). The top layer can be viewed as computing a polynomial in k new variables; let this be P0(y1, y2, . . . , yk). Then: P(x1, . . . , xn) = P0(P1(x1, . . . , xn), P2(x1, . . . , xn), . . . , Pk(x1, . . . , xn)).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 38 / 73

slide-109
SLIDE 109

../IITK-Logo.jpg

The Circuit E

A direct counting shows that each Pj, 0 ≤ j ≤ k, can be replaced by a depth two circuit of size 2o(d+d log n

d ).

Since k = 2o(d+d log n

d ), the resulting depth four circuit, E, is of size

2o(d+d log n

d ).

The fanin of second layer of ∗-gates in E is at most 6u = √ ℓ which is any small function in ω(1). The fanin of bottom layer of ∗-gates in E is at most

d 2u = o(d).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 39 / 73

slide-110
SLIDE 110

../IITK-Logo.jpg

The Circuit E

A direct counting shows that each Pj, 0 ≤ j ≤ k, can be replaced by a depth two circuit of size 2o(d+d log n

d ).

Since k = 2o(d+d log n

d ), the resulting depth four circuit, E, is of size

2o(d+d log n

d ).

The fanin of second layer of ∗-gates in E is at most 6u = √ ℓ which is any small function in ω(1). The fanin of bottom layer of ∗-gates in E is at most

d 2u = o(d).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 39 / 73

slide-111
SLIDE 111

../IITK-Logo.jpg

The Circuit E

A direct counting shows that each Pj, 0 ≤ j ≤ k, can be replaced by a depth two circuit of size 2o(d+d log n

d ).

Since k = 2o(d+d log n

d ), the resulting depth four circuit, E, is of size

2o(d+d log n

d ).

The fanin of second layer of ∗-gates in E is at most 6u = √ ℓ which is any small function in ω(1). The fanin of bottom layer of ∗-gates in E is at most

d 2u = o(d).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 39 / 73

slide-112
SLIDE 112

../IITK-Logo.jpg

Reducing Depth to 3

Theorem (Gupta-Kamath-Kayal-Saptharishi 2013)

If polynomial P(x1, . . . , xn) of degree d is computable by an arithmetic circuit of size s = 2o(d+d log n

d ), then it can also be computed by an

arithmetic circuit of size sO(1) of depth 3 if the underlying field has characteristic zero or large (= Ω(log s)).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 40 / 73

slide-113
SLIDE 113

../IITK-Logo.jpg

Proof Outline

Replace each layer of a depth four circuit by ∧ layers resulting in a ∧ ∧ circuit using [Fischer 1994]:

n

  • j=1

xj = 1 2n−1n!

  • r2,...,rn∈{−1,1}

(−1)wt(r)(x1 +

n

  • j=2

rjxj)n, where wt(r) = |{j | rj = −1}|. This works for char = 0 or > n. Replace ∧ ∧ by resulting in circuit using [Saxena 2008]: (α1xβ1

1 +α2xβ2 2 +· · ·+αnxβn n )d = degree d coefficient of d!· n

  • j=1

eαjx

βj j

z.

This works for char = 0 or > d.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 41 / 73

slide-114
SLIDE 114

../IITK-Logo.jpg

Proof Outline

Replace each layer of a depth four circuit by ∧ layers resulting in a ∧ ∧ circuit using [Fischer 1994]:

n

  • j=1

xj = 1 2n−1n!

  • r2,...,rn∈{−1,1}

(−1)wt(r)(x1 +

n

  • j=2

rjxj)n, where wt(r) = |{j | rj = −1}|. This works for char = 0 or > n. Replace ∧ ∧ by resulting in circuit using [Saxena 2008]: (α1xβ1

1 +α2xβ2 2 +· · ·+αnxβn n )d = degree d coefficient of d!· n

  • j=1

eαjx

βj j

z.

This works for char = 0 or > d.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 41 / 73

slide-115
SLIDE 115

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 42 / 73

slide-116
SLIDE 116

../IITK-Logo.jpg

Lower Bounds on Permanent and Determinant

[Jerrum-Snir 1982] Any monotone circuit family computing permanent is of exponential size. Monotone circuits are circuits with no negative constant. [Shpilka-Wigderson 1999] Any depth three circuit family computing permanent (or even determinant) over Q is of size Ω(n2). [Grigoriev-Razborov 2000] Any depth three circuit family computing permanent or determinant over a finite field is of exponential size.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 43 / 73

slide-117
SLIDE 117

../IITK-Logo.jpg

Lower Bounds on Permanent and Determinant

[Jerrum-Snir 1982] Any monotone circuit family computing permanent is of exponential size. Monotone circuits are circuits with no negative constant. [Shpilka-Wigderson 1999] Any depth three circuit family computing permanent (or even determinant) over Q is of size Ω(n2). [Grigoriev-Razborov 2000] Any depth three circuit family computing permanent or determinant over a finite field is of exponential size.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 43 / 73

slide-118
SLIDE 118

../IITK-Logo.jpg

Lower Bounds on Permanent and Determinant

[Jerrum-Snir 1982] Any monotone circuit family computing permanent is of exponential size. Monotone circuits are circuits with no negative constant. [Shpilka-Wigderson 1999] Any depth three circuit family computing permanent (or even determinant) over Q is of size Ω(n2). [Grigoriev-Razborov 2000] Any depth three circuit family computing permanent or determinant over a finite field is of exponential size.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 43 / 73

slide-119
SLIDE 119

../IITK-Logo.jpg

Lower Bounds on Permanent and Determinant

[Raz 2004] Any multilinear formula family computing permanent or determinant is of size nΩ(log n). Formulas are circuits with outdegree one. Multilinear formulas are formulas in which every gate computes a multilinear polynomial. [Kayal-Saha 2014] Any depth three circuit family of bottom fanin ≤ r computing a polynomial family in VP of degree d in n variables over fields of charecteristic zero, is of size nΩ( d

r ).

[Kayal-Limaye-Saha-Srinivasan 2014] 2Ω(√n log n) lower bound on homogeneous depth four circuits computing permanent over characteristic zero. A circuit is homogeneous if every internediate polynomial is homogeneous.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 44 / 73

slide-120
SLIDE 120

../IITK-Logo.jpg

Lower Bounds on Permanent and Determinant

[Raz 2004] Any multilinear formula family computing permanent or determinant is of size nΩ(log n). Formulas are circuits with outdegree one. Multilinear formulas are formulas in which every gate computes a multilinear polynomial. [Kayal-Saha 2014] Any depth three circuit family of bottom fanin ≤ r computing a polynomial family in VP of degree d in n variables over fields of charecteristic zero, is of size nΩ( d

r ).

[Kayal-Limaye-Saha-Srinivasan 2014] 2Ω(√n log n) lower bound on homogeneous depth four circuits computing permanent over characteristic zero. A circuit is homogeneous if every internediate polynomial is homogeneous.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 44 / 73

slide-121
SLIDE 121

../IITK-Logo.jpg

Lower Bounds on Permanent and Determinant

[Raz 2004] Any multilinear formula family computing permanent or determinant is of size nΩ(log n). Formulas are circuits with outdegree one. Multilinear formulas are formulas in which every gate computes a multilinear polynomial. [Kayal-Saha 2014] Any depth three circuit family of bottom fanin ≤ r computing a polynomial family in VP of degree d in n variables over fields of charecteristic zero, is of size nΩ( d

r ).

[Kayal-Limaye-Saha-Srinivasan 2014] 2Ω(√n log n) lower bound on homogeneous depth four circuits computing permanent over characteristic zero. A circuit is homogeneous if every internediate polynomial is homogeneous.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 44 / 73

slide-122
SLIDE 122

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 45 / 73

slide-123
SLIDE 123

../IITK-Logo.jpg

Definitions

PIT

Given an arithmetic circuit of size s over ring R, test if the polynomial computed by the circuit is non-zero.

Low Degree PIT (LPIT)

Given an arithmetic circuit of size s over ring R computing a polynomial of degree ≤ s, test if the polynomial computed by the circuit is non-zero.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 46 / 73

slide-124
SLIDE 124

../IITK-Logo.jpg

Definitions

PIT

Given an arithmetic circuit of size s over ring R, test if the polynomial computed by the circuit is non-zero.

Low Degree PIT (LPIT)

Given an arithmetic circuit of size s over ring R computing a polynomial of degree ≤ s, test if the polynomial computed by the circuit is non-zero.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 46 / 73

slide-125
SLIDE 125

../IITK-Logo.jpg

An Example

+ ∗ + ∗ ∗ ∗ + ∗ u ∗ v + ∗ x ∗ y ∗ + ∗ ∗ −1 2 2 2 2 2 2 −1 Is (ux + vy)2 + (vx − uy)2 − (u2 + v2) · (x2 + y2) = 0? [NO!]

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 47 / 73

slide-126
SLIDE 126

../IITK-Logo.jpg

Applications

Bipartite Matching : for graph G = (U, V , E), check if det    e1,1x1,1 · · · e1,nx1,n . . . ... . . . en,1xn,1 · · · en,nxn,n    = 0

  • ver any field, where E = [ei,j]. An example of LPIT.

Primality Testing : for number n, check if (x + y)n = xn + yn

  • ver ring Zn[x, y]/(xr − 1, ys − 1) for suitable r and s, both

logO(1) n.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 48 / 73

slide-127
SLIDE 127

../IITK-Logo.jpg

Applications

Bipartite Matching : for graph G = (U, V , E), check if det    e1,1x1,1 · · · e1,nx1,n . . . ... . . . en,1xn,1 · · · en,nxn,n    = 0

  • ver any field, where E = [ei,j]. An example of LPIT.

Primality Testing : for number n, check if (x + y)n = xn + yn

  • ver ring Zn[x, y]/(xr − 1, ys − 1) for suitable r and s, both

logO(1) n.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 48 / 73

slide-128
SLIDE 128

../IITK-Logo.jpg

Complexity of PIT

A number of randomized polynomial time algorithms are known for the problem. The simplest one is by [Schwartz, Zippel 1979]: Substitute random values from a small subset of R (using a small extension of R if required) for each variable, evaluate the circuit, and output NON-ZERO iff the result is a non-zero number. Others are [Chen-Kao 1997], [Lewis-Vadhan 1998], [A-Biswas 1999], . . ..

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 49 / 73

slide-129
SLIDE 129

../IITK-Logo.jpg

Complexity of PIT

A number of randomized polynomial time algorithms are known for the problem. The simplest one is by [Schwartz, Zippel 1979]: Substitute random values from a small subset of R (using a small extension of R if required) for each variable, evaluate the circuit, and output NON-ZERO iff the result is a non-zero number. Others are [Chen-Kao 1997], [Lewis-Vadhan 1998], [A-Biswas 1999], . . ..

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 49 / 73

slide-130
SLIDE 130

../IITK-Logo.jpg

Complexity of PIT

A number of randomized polynomial time algorithms are known for the problem. The simplest one is by [Schwartz, Zippel 1979]: Substitute random values from a small subset of R (using a small extension of R if required) for each variable, evaluate the circuit, and output NON-ZERO iff the result is a non-zero number. Others are [Chen-Kao 1997], [Lewis-Vadhan 1998], [A-Biswas 1999], . . ..

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 49 / 73

slide-131
SLIDE 131

../IITK-Logo.jpg

Deterministic Algorithm for PIT

Open Question

Is there a deterministic polynomial time algorithm for PIT? Long-standing open problem. A positive answer also yields a lower bound.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 50 / 73

slide-132
SLIDE 132

../IITK-Logo.jpg

Deterministic Algorithm for PIT

Open Question

Is there a deterministic polynomial time algorithm for PIT? Long-standing open problem. A positive answer also yields a lower bound.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 50 / 73

slide-133
SLIDE 133

../IITK-Logo.jpg

Deterministic Algorithm for PIT

Open Question

Is there a deterministic polynomial time algorithm for PIT? Long-standing open problem. A positive answer also yields a lower bound.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 50 / 73

slide-134
SLIDE 134

../IITK-Logo.jpg

Two Types of Deterministic Algorithms for PIT

White Box

A white-box time t(n) algorithm for PIT is a deterministic algorithm solving the problem in time at most t(n).

Black Box

A black-box time t(n) algorithm for PIT is a deterministic algorithm running in time t(n) that, given an arithmetic circuit, determines if it computes non-zero polynomial with access only to input-output lines and size of the circuit.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 51 / 73

slide-135
SLIDE 135

../IITK-Logo.jpg

Two Types of Deterministic Algorithms for PIT

White Box

A white-box time t(n) algorithm for PIT is a deterministic algorithm solving the problem in time at most t(n).

Black Box

A black-box time t(n) algorithm for PIT is a deterministic algorithm running in time t(n) that, given an arithmetic circuit, determines if it computes non-zero polynomial with access only to input-output lines and size of the circuit.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 51 / 73

slide-136
SLIDE 136

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 52 / 73

slide-137
SLIDE 137

../IITK-Logo.jpg

LPIT and Lower Bounds I

Theorem (Kabanets-Impagliazzo 2003)

If there exists a white-box polynomial-time algorithm for LPIT then NEXP requires superpolynomial size arithmetic circuits.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 53 / 73

slide-138
SLIDE 138

../IITK-Logo.jpg

LPIT and Lower Bounds I

Proof.

Assume NEXP has polynomial-size arithmetic circuits and PIT has a polynomial-time algorithm. Construct an NP machine to compute permanent that guesses the circuit for the permanent and verifies it recursively using PIT:

◮ If C(x1,1, . . . , x1,n, . . . , xn,1, . . . , xn,n) is circuit for permanent of n × n

matrices, then we can extract from it circuit Cj for permanent of j × j matrices for j < n.

◮ Using LPIT, verify the correctness of C:

Cj(¯ x) = x1,1Cj−1(¯ x1) + · · · + x1,jCj−1(¯ xj) where ¯ xi drops first row and ith column.

This implies #P is in NP. Since NEXP = #P by assumption, we get NEXP = NP contradicting time hierarchy theorem.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 54 / 73

slide-139
SLIDE 139

../IITK-Logo.jpg

LPIT and Lower Bounds I

Proof.

Assume NEXP has polynomial-size arithmetic circuits and PIT has a polynomial-time algorithm. Construct an NP machine to compute permanent that guesses the circuit for the permanent and verifies it recursively using PIT:

◮ If C(x1,1, . . . , x1,n, . . . , xn,1, . . . , xn,n) is circuit for permanent of n × n

matrices, then we can extract from it circuit Cj for permanent of j × j matrices for j < n.

◮ Using LPIT, verify the correctness of C:

Cj(¯ x) = x1,1Cj−1(¯ x1) + · · · + x1,jCj−1(¯ xj) where ¯ xi drops first row and ith column.

This implies #P is in NP. Since NEXP = #P by assumption, we get NEXP = NP contradicting time hierarchy theorem.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 54 / 73

slide-140
SLIDE 140

../IITK-Logo.jpg

LPIT and Lower Bounds I

Proof.

Assume NEXP has polynomial-size arithmetic circuits and PIT has a polynomial-time algorithm. Construct an NP machine to compute permanent that guesses the circuit for the permanent and verifies it recursively using PIT:

◮ If C(x1,1, . . . , x1,n, . . . , xn,1, . . . , xn,n) is circuit for permanent of n × n

matrices, then we can extract from it circuit Cj for permanent of j × j matrices for j < n.

◮ Using LPIT, verify the correctness of C:

Cj(¯ x) = x1,1Cj−1(¯ x1) + · · · + x1,jCj−1(¯ xj) where ¯ xi drops first row and ith column.

This implies #P is in NP. Since NEXP = #P by assumption, we get NEXP = NP contradicting time hierarchy theorem.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 54 / 73

slide-141
SLIDE 141

../IITK-Logo.jpg

LPIT and Lower Bounds I

Proof.

Assume NEXP has polynomial-size arithmetic circuits and PIT has a polynomial-time algorithm. Construct an NP machine to compute permanent that guesses the circuit for the permanent and verifies it recursively using PIT:

◮ If C(x1,1, . . . , x1,n, . . . , xn,1, . . . , xn,n) is circuit for permanent of n × n

matrices, then we can extract from it circuit Cj for permanent of j × j matrices for j < n.

◮ Using LPIT, verify the correctness of C:

Cj(¯ x) = x1,1Cj−1(¯ x1) + · · · + x1,jCj−1(¯ xj) where ¯ xi drops first row and ith column.

This implies #P is in NP. Since NEXP = #P by assumption, we get NEXP = NP contradicting time hierarchy theorem.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 54 / 73

slide-142
SLIDE 142

../IITK-Logo.jpg

LPIT and Lower Bounds I

Proof.

Assume NEXP has polynomial-size arithmetic circuits and PIT has a polynomial-time algorithm. Construct an NP machine to compute permanent that guesses the circuit for the permanent and verifies it recursively using PIT:

◮ If C(x1,1, . . . , x1,n, . . . , xn,1, . . . , xn,n) is circuit for permanent of n × n

matrices, then we can extract from it circuit Cj for permanent of j × j matrices for j < n.

◮ Using LPIT, verify the correctness of C:

Cj(¯ x) = x1,1Cj−1(¯ x1) + · · · + x1,jCj−1(¯ xj) where ¯ xi drops first row and ith column.

This implies #P is in NP. Since NEXP = #P by assumption, we get NEXP = NP contradicting time hierarchy theorem.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 54 / 73

slide-143
SLIDE 143

../IITK-Logo.jpg

LPIT and Lower Bounds II

Theorem (Heintz-Schnorr 1980, A 2005)

If there exist a black-box polynomial-time algorithm for LPIT then E requires exponential size arithmetic circuits.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 55 / 73

slide-144
SLIDE 144

../IITK-Logo.jpg

LPIT and Lower Bounds II

Proof.

Let A be a black-box polynomial-time algorithm for LPIT. For a circuit of size s on n variables, A will evaluate it on a sequence

  • f inputs and accept iff any of the outputs in non-zero.

Let these inputs be (α1,1, . . . , α1,n), . . . , (αt,1, . . . , αt,n) with t = sO(1). Let m = ⌈log(t + 1)⌉ = O(log s). Define polynomial rm as: rm(x1, x2, . . . , xm) =

  • S⊆[1,m]

cS

  • i∈S

xi. Coefficients cS ∈ F satisfy:

  • S⊆[1,m]

cS

  • i∈S

αj,i = 0 for every 1 ≤ j ≤ t.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 56 / 73

slide-145
SLIDE 145

../IITK-Logo.jpg

LPIT and Lower Bounds II

Proof.

Let A be a black-box polynomial-time algorithm for LPIT. For a circuit of size s on n variables, A will evaluate it on a sequence

  • f inputs and accept iff any of the outputs in non-zero.

Let these inputs be (α1,1, . . . , α1,n), . . . , (αt,1, . . . , αt,n) with t = sO(1). Let m = ⌈log(t + 1)⌉ = O(log s). Define polynomial rm as: rm(x1, x2, . . . , xm) =

  • S⊆[1,m]

cS

  • i∈S

xi. Coefficients cS ∈ F satisfy:

  • S⊆[1,m]

cS

  • i∈S

αj,i = 0 for every 1 ≤ j ≤ t.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 56 / 73

slide-146
SLIDE 146

../IITK-Logo.jpg

LPIT and Lower Bounds II

Proof.

Let A be a black-box polynomial-time algorithm for LPIT. For a circuit of size s on n variables, A will evaluate it on a sequence

  • f inputs and accept iff any of the outputs in non-zero.

Let these inputs be (α1,1, . . . , α1,n), . . . , (αt,1, . . . , αt,n) with t = sO(1). Let m = ⌈log(t + 1)⌉ = O(log s). Define polynomial rm as: rm(x1, x2, . . . , xm) =

  • S⊆[1,m]

cS

  • i∈S

xi. Coefficients cS ∈ F satisfy:

  • S⊆[1,m]

cS

  • i∈S

αj,i = 0 for every 1 ≤ j ≤ t.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 56 / 73

slide-147
SLIDE 147

../IITK-Logo.jpg

LPIT and Lower Bounds II

A non-zero rm always exists since it has ≥ t + 1 coefficients that satisfy t homogeneous linear equations. Polynomial rm can be computed by exponential size arithmetic circuits. Circuit complexity of rm is more than s = 2Ω(m).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 57 / 73

slide-148
SLIDE 148

../IITK-Logo.jpg

LPIT and Lower Bounds II

A non-zero rm always exists since it has ≥ t + 1 coefficients that satisfy t homogeneous linear equations. Polynomial rm can be computed by exponential size arithmetic circuits. Circuit complexity of rm is more than s = 2Ω(m).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 57 / 73

slide-149
SLIDE 149

../IITK-Logo.jpg

LPIT and Lower Bounds II

A non-zero rm always exists since it has ≥ t + 1 coefficients that satisfy t homogeneous linear equations. Polynomial rm can be computed by exponential size arithmetic circuits. Circuit complexity of rm is more than s = 2Ω(m).

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 57 / 73

slide-150
SLIDE 150

../IITK-Logo.jpg

Fixed Depth PIT

Depth d PIT

d-PIT is the problem to decide if a given arithmetic circuit of depth d (alternating sums and products with top gate being sum) computes a non-zero polynomial. d-PIT is a restriction of LPIT.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 58 / 73

slide-151
SLIDE 151

../IITK-Logo.jpg

Fixed Depth PIT

Depth d PIT

d-PIT is the problem to decide if a given arithmetic circuit of depth d (alternating sums and products with top gate being sum) computes a non-zero polynomial. d-PIT is a restriction of LPIT.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 58 / 73

slide-152
SLIDE 152

../IITK-Logo.jpg

3-PIT and Lower Bounds

Theorem (Gupta-Kamath-Kayal-Saptharishi 2013)

If there exist a polynomial-time black-box algorithm for 3-PIT then E requires exponential size arithmetic circuits if the underlying field has characteristic zero or large (= Ω(log s)).

Theorem

If there exists a white-box polynomial-time algorithm for 3-PIT then NEXP requires superpolynomial size arithmetic circuits.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 59 / 73

slide-153
SLIDE 153

../IITK-Logo.jpg

3-PIT and Lower Bounds

Theorem (Gupta-Kamath-Kayal-Saptharishi 2013)

If there exist a polynomial-time black-box algorithm for 3-PIT then E requires exponential size arithmetic circuits if the underlying field has characteristic zero or large (= Ω(log s)).

Theorem

If there exists a white-box polynomial-time algorithm for 3-PIT then NEXP requires superpolynomial size arithmetic circuits.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 59 / 73

slide-154
SLIDE 154

../IITK-Logo.jpg

Outline

1 Computation Over Rings

Arithmetic Circuit Model Generalizing Arithmetic Circuits

2 Classes P and NP 3 Depth Reduction 4 Status of Lower Bounds 5 Polynomial Identity Testing 6 LPIT and Lower Bounds 7 Algorithms for 2-PIT and 3-PIT

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 60 / 73

slide-155
SLIDE 155

../IITK-Logo.jpg

2-PIT

Theorem (Folklore)

There exists a polynomial-time black-box algorithm for 2-PIT.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 61 / 73

slide-156
SLIDE 156

../IITK-Logo.jpg

2-PIT

Proof.

A circuit computes a sparse polynomial. Let C be the given circuit of size s computing a polynomial of degree ≤ d. One of the substitutions (x1, . . . , xi, . . . , xn) = (y, . . . , y(d+1)i−1 (mod r), . . . , y(d+1)n−1 (mod r)), 1 < r < s2, will ensure that all terms of the polynomial remain distinct.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 62 / 73

slide-157
SLIDE 157

../IITK-Logo.jpg

2-PIT

Proof.

A circuit computes a sparse polynomial. Let C be the given circuit of size s computing a polynomial of degree ≤ d. One of the substitutions (x1, . . . , xi, . . . , xn) = (y, . . . , y(d+1)i−1 (mod r), . . . , y(d+1)n−1 (mod r)), 1 < r < s2, will ensure that all terms of the polynomial remain distinct.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 62 / 73

slide-158
SLIDE 158

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-159
SLIDE 159

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-160
SLIDE 160

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-161
SLIDE 161

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-162
SLIDE 162

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-163
SLIDE 163

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-164
SLIDE 164

../IITK-Logo.jpg

3-PIT with Bounded Top Fanin

Sequence of solutions for 3-PIT with top sum gate of fanin k: [Dvir-Shpilka 2005] White-box 2(log s)k2 time algorithm. [Kayal-Saxena 2006] White-box sO(k) time algorithm. [Karnin-Shpilka 2008] Black-box sO(logk s) time algorithm. [Saxena-Seshadri 2009] Black-box sk3 log s time algorithm. [Kayal-Saraf 2009] Black-box skk time algorithm over characteristic zero fields. [Saxena-Seshadri 2011] Black-box sO(k) time algorithm. [A-Saha-Saptharishi-Saxena 2012] Black-box sO(k) time algorithm for zero or large characteristic fields.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 63 / 73

slide-165
SLIDE 165

../IITK-Logo.jpg

Jacobian Based Algorithm

Let P = k

i=1 Ti, Ti = s j=1 Li,j be the given circuit with

Li,j = αi,j,0 + n

ℓ=1 αi,j,ℓxℓ.

Assume that P = 0 and Ti’s are algebraically independent:

◮ There is no polynomial Q(y1, y2, . . . , yk) such that

Q(T1, T2, . . . , Tk) = 0.

For characteristic zero or > sk: T1, . . ., Tk are algebraically independent iff J(T1, T2, . . . , Tk) has full rank, where J(y1, y2, . . . , yk) =   

∂y1 ∂x1 ∂y1 ∂x2

· · ·

∂y1 ∂xn

. . . . . . ... . . .

∂yk ∂x1 ∂yk ∂x1

· · ·

∂yk ∂xn

   .

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 64 / 73

slide-166
SLIDE 166

../IITK-Logo.jpg

Jacobian Based Algorithm

Let P = k

i=1 Ti, Ti = s j=1 Li,j be the given circuit with

Li,j = αi,j,0 + n

ℓ=1 αi,j,ℓxℓ.

Assume that P = 0 and Ti’s are algebraically independent:

◮ There is no polynomial Q(y1, y2, . . . , yk) such that

Q(T1, T2, . . . , Tk) = 0.

For characteristic zero or > sk: T1, . . ., Tk are algebraically independent iff J(T1, T2, . . . , Tk) has full rank, where J(y1, y2, . . . , yk) =   

∂y1 ∂x1 ∂y1 ∂x2

· · ·

∂y1 ∂xn

. . . . . . ... . . .

∂yk ∂x1 ∂yk ∂x1

· · ·

∂yk ∂xn

   .

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 64 / 73

slide-167
SLIDE 167

../IITK-Logo.jpg

Jacobian Based Algorithm

Let P = k

i=1 Ti, Ti = s j=1 Li,j be the given circuit with

Li,j = αi,j,0 + n

ℓ=1 αi,j,ℓxℓ.

Assume that P = 0 and Ti’s are algebraically independent:

◮ There is no polynomial Q(y1, y2, . . . , yk) such that

Q(T1, T2, . . . , Tk) = 0.

For characteristic zero or > sk: T1, . . ., Tk are algebraically independent iff J(T1, T2, . . . , Tk) has full rank, where J(y1, y2, . . . , yk) =   

∂y1 ∂x1 ∂y1 ∂x2

· · ·

∂y1 ∂xn

. . . . . . ... . . .

∂yk ∂x1 ∂yk ∂x1

· · ·

∂yk ∂xn

   .

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 64 / 73

slide-168
SLIDE 168

../IITK-Logo.jpg

Jacobian Based Algorithm

Therefore, J(T1, . . . , Tk) has rank k. We have: J(T1, T2, . . . , Tk) =   

∂T1 ∂x1 ∂T1 ∂x2

· · ·

∂T1 ∂xn

. . . . . . ... . . .

∂Tk ∂x1 ∂Tk ∂x1

· · ·

∂Tk ∂xn

   =     T1 d

j=1 α1,j,1 L1,j

· · · T1 d

j=1 α1,j,n L1,j

. . . ... . . . Tk d

j=1 αk,j,1 Lk,j

· · · Tk d

j=1 αk,j,n Lk,j

   

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 65 / 73

slide-169
SLIDE 169

../IITK-Logo.jpg

Jacobian Based Algorithm

Therefore, J(T1, . . . , Tk) has rank k. We have: J(T1, T2, . . . , Tk) =   

∂T1 ∂x1 ∂T1 ∂x2

· · ·

∂T1 ∂xn

. . . . . . ... . . .

∂Tk ∂x1 ∂Tk ∂x1

· · ·

∂Tk ∂xn

   =     T1 d

j=1 α1,j,1 L1,j

· · · T1 d

j=1 α1,j,n L1,j

. . . ... . . . Tk d

j=1 αk,j,1 Lk,j

· · · Tk d

j=1 αk,j,n Lk,j

   

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 65 / 73

slide-170
SLIDE 170

../IITK-Logo.jpg

Jacobian Based Algorithm

Assume, wlog, that columns corresponding to variables x1, x2, . . ., xk have rank k. Let ˆ P =

  • T1

d

j=1 α1,j,1 L1,j

· · · T1 d

j=1 α1,j,k L1,j

. . . ... . . . Tk d

j=1 αk,j,1 Lk,j

· · · Tk d

j=1 αk,j,k Lk,j

  • =

k

  • i=1

Ti ·

  • d

j=1 α1,j,1 L1,j

· · · d

j=1 α1,j,k L1,j

. . . ... . . . d

j=1 αk,j,1 Lk,j

· · · d

j=1 αk,j,k Lk,j

  • =

k

  • i=1

Ti · R, where R is a sparse rational function.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 66 / 73

slide-171
SLIDE 171

../IITK-Logo.jpg

Jacobian Based Algorithm

Assume, wlog, that columns corresponding to variables x1, x2, . . ., xk have rank k. Let ˆ P =

  • T1

d

j=1 α1,j,1 L1,j

· · · T1 d

j=1 α1,j,k L1,j

. . . ... . . . Tk d

j=1 αk,j,1 Lk,j

· · · Tk d

j=1 αk,j,k Lk,j

  • =

k

  • i=1

Ti ·

  • d

j=1 α1,j,1 L1,j

· · · d

j=1 α1,j,k L1,j

. . . ... . . . d

j=1 αk,j,1 Lk,j

· · · d

j=1 αk,j,k Lk,j

  • =

k

  • i=1

Ti · R, where R is a sparse rational function.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 66 / 73

slide-172
SLIDE 172

../IITK-Logo.jpg

Jacobian Based Algorithm

Since ˆ P is a product of sparse polynomials and rational functions, the set of substitutions as used for 2-PIT will ensure that ˆ P remains non-zero under one of them. For this substitution, the Jacobian has full rank and therefore the circuit output remains non-zero.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 67 / 73

slide-173
SLIDE 173

../IITK-Logo.jpg

Jacobian Based Algorithm

Since ˆ P is a product of sparse polynomials and rational functions, the set of substitutions as used for 2-PIT will ensure that ˆ P remains non-zero under one of them. For this substitution, the Jacobian has full rank and therefore the circuit output remains non-zero.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 67 / 73

slide-174
SLIDE 174

../IITK-Logo.jpg

3-PIT for Diagonal Circuits

Diagonal Circuits

Circuits where each multiplication gate is a powering gate.

Theorem (Forbes-Shpilka 2012, A-Saha-Saxena 2013)

There exists a sO(log s)-time black-box algorithm for diagonal 3-PIT.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 68 / 73

slide-175
SLIDE 175

../IITK-Logo.jpg

3-PIT for Diagonal Circuits

Diagonal Circuits

Circuits where each multiplication gate is a powering gate.

Theorem (Forbes-Shpilka 2012, A-Saha-Saxena 2013)

There exists a sO(log s)-time black-box algorithm for diagonal 3-PIT.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 68 / 73

slide-176
SLIDE 176

../IITK-Logo.jpg

Rank Concentration Based Algorithm

Let P = k

i=1 Ti, Ti = Ld i be the given circuit with

Li = αi,0 + n

ℓ=1 αi,ℓxℓ.

The polynomial can be rewritten as: P = ¯ 1 · (¯ u0 + ¯ u1x1 + · · · + ¯ unxn)d, where ¯ uℓ = [α1,ℓ · · · αk,ℓ].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 69 / 73

slide-177
SLIDE 177

../IITK-Logo.jpg

Rank Concentration Based Algorithm

Let P = k

i=1 Ti, Ti = Ld i be the given circuit with

Li = αi,0 + n

ℓ=1 αi,ℓxℓ.

The polynomial can be rewritten as: P = ¯ 1 · (¯ u0 + ¯ u1x1 + · · · + ¯ unxn)d, where ¯ uℓ = [α1,ℓ · · · αk,ℓ].

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 69 / 73

slide-178
SLIDE 178

../IITK-Logo.jpg

Rank Concentration Based Algorithm

Now consider the following polynomial with vectors over F k as coefficients: Q = (¯ u0 + ¯ u1x1 + · · · + ¯ unxn)d =

  • S∈[0,d]n

¯ vS¯ xS where S = (d1, d2, . . . , dn), ¯ vS is Hadamard product of d ¯ u’s, and ¯ xS = n

i=1 xdi i .

Consider the vectors ¯ vS ∈ F k. Let the dimension of the space spanned by these vectors be m ≤ k.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 70 / 73

slide-179
SLIDE 179

../IITK-Logo.jpg

Rank Concentration Based Algorithm

Now consider the following polynomial with vectors over F k as coefficients: Q = (¯ u0 + ¯ u1x1 + · · · + ¯ unxn)d =

  • S∈[0,d]n

¯ vS¯ xS where S = (d1, d2, . . . , dn), ¯ vS is Hadamard product of d ¯ u’s, and ¯ xS = n

i=1 xdi i .

Consider the vectors ¯ vS ∈ F k. Let the dimension of the space spanned by these vectors be m ≤ k.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 70 / 73

slide-180
SLIDE 180

../IITK-Logo.jpg

Rank Concentration Based Algorithm

ℓ-rank concentration is the property that ¯ vS of support ℓ (i.e., S with

  • nly ℓ non-zero di’s) span this space.

If there is ℓ-rank concentration, the PIT can be solved by setting all but ℓ x’s to zero and evaluating the resulting polynomial.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 71 / 73

slide-181
SLIDE 181

../IITK-Logo.jpg

Rank Concentration Based Algorithm

ℓ-rank concentration is the property that ¯ vS of support ℓ (i.e., S with

  • nly ℓ non-zero di’s) span this space.

If there is ℓ-rank concentration, the PIT can be solved by setting all but ℓ x’s to zero and evaluating the resulting polynomial.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 71 / 73

slide-182
SLIDE 182

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The space spanned by ¯ vS has log m-rank concentration:

◮ Consider a monomial ¯

xS with support > log m. It has > m monomials strictly below it in lex-ordering.

◮ There must be linear dependence between coefficients associated with

these lower monomials.

◮ Define a total ordering on monomials by fixing an arbitrary order

between variables.

◮ Take a linear dependence equation for lower monomial coefficients,

identify the largest monomial in total order, and multiply the equation with coefficient of a monomial such that the largest monomial becomes ¯ xS.

◮ This makes coefficient of ¯

xS linearly dependent on smaller monomial coefficients in total order.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 72 / 73

slide-183
SLIDE 183

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The space spanned by ¯ vS has log m-rank concentration:

◮ Consider a monomial ¯

xS with support > log m. It has > m monomials strictly below it in lex-ordering.

◮ There must be linear dependence between coefficients associated with

these lower monomials.

◮ Define a total ordering on monomials by fixing an arbitrary order

between variables.

◮ Take a linear dependence equation for lower monomial coefficients,

identify the largest monomial in total order, and multiply the equation with coefficient of a monomial such that the largest monomial becomes ¯ xS.

◮ This makes coefficient of ¯

xS linearly dependent on smaller monomial coefficients in total order.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 72 / 73

slide-184
SLIDE 184

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The space spanned by ¯ vS has log m-rank concentration:

◮ Consider a monomial ¯

xS with support > log m. It has > m monomials strictly below it in lex-ordering.

◮ There must be linear dependence between coefficients associated with

these lower monomials.

◮ Define a total ordering on monomials by fixing an arbitrary order

between variables.

◮ Take a linear dependence equation for lower monomial coefficients,

identify the largest monomial in total order, and multiply the equation with coefficient of a monomial such that the largest monomial becomes ¯ xS.

◮ This makes coefficient of ¯

xS linearly dependent on smaller monomial coefficients in total order.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 72 / 73

slide-185
SLIDE 185

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The space spanned by ¯ vS has log m-rank concentration:

◮ Consider a monomial ¯

xS with support > log m. It has > m monomials strictly below it in lex-ordering.

◮ There must be linear dependence between coefficients associated with

these lower monomials.

◮ Define a total ordering on monomials by fixing an arbitrary order

between variables.

◮ Take a linear dependence equation for lower monomial coefficients,

identify the largest monomial in total order, and multiply the equation with coefficient of a monomial such that the largest monomial becomes ¯ xS.

◮ This makes coefficient of ¯

xS linearly dependent on smaller monomial coefficients in total order.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 72 / 73

slide-186
SLIDE 186

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The space spanned by ¯ vS has log m-rank concentration:

◮ Consider a monomial ¯

xS with support > log m. It has > m monomials strictly below it in lex-ordering.

◮ There must be linear dependence between coefficients associated with

these lower monomials.

◮ Define a total ordering on monomials by fixing an arbitrary order

between variables.

◮ Take a linear dependence equation for lower monomial coefficients,

identify the largest monomial in total order, and multiply the equation with coefficient of a monomial such that the largest monomial becomes ¯ xS.

◮ This makes coefficient of ¯

xS linearly dependent on smaller monomial coefficients in total order.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 72 / 73

slide-187
SLIDE 187

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The algorithm is now simple: for all subsets of log m variables, set the remaining variables to zero, and test if the resulting polynomial is zero on dlog m distinct values. This gives a dO(log d)-time black-box algorithm. In certain situations, there may not be rank concentration to begin with. So first apply a transformation on variables that yields rank concentration. For certain other restrictions of 3-PIT, the following transformation works: xi → xi + tdi for small di’s.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 73 / 73

slide-188
SLIDE 188

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The algorithm is now simple: for all subsets of log m variables, set the remaining variables to zero, and test if the resulting polynomial is zero on dlog m distinct values. This gives a dO(log d)-time black-box algorithm. In certain situations, there may not be rank concentration to begin with. So first apply a transformation on variables that yields rank concentration. For certain other restrictions of 3-PIT, the following transformation works: xi → xi + tdi for small di’s.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 73 / 73

slide-189
SLIDE 189

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The algorithm is now simple: for all subsets of log m variables, set the remaining variables to zero, and test if the resulting polynomial is zero on dlog m distinct values. This gives a dO(log d)-time black-box algorithm. In certain situations, there may not be rank concentration to begin with. So first apply a transformation on variables that yields rank concentration. For certain other restrictions of 3-PIT, the following transformation works: xi → xi + tdi for small di’s.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 73 / 73

slide-190
SLIDE 190

../IITK-Logo.jpg

Rank Concentration Based Algorithm

The algorithm is now simple: for all subsets of log m variables, set the remaining variables to zero, and test if the resulting polynomial is zero on dlog m distinct values. This gives a dO(log d)-time black-box algorithm. In certain situations, there may not be rank concentration to begin with. So first apply a transformation on variables that yields rank concentration. For certain other restrictions of 3-PIT, the following transformation works: xi → xi + tdi for small di’s.

Manindra Agrawal (IIT Kanpur) Algebraic Complexity SLAC 2015 73 / 73