SLIDE 1
Equivalence test for the trace iterated matrix multiplication polynomial
Janaky Murthy M.Tech Research Advisor: Prof. Chandan Saha
Department of Computer Science and Automation IISc Bangalore 1
SLIDE 2 Overview
- Introduction and Motivation
- Problem Statement
- Our Results
- Approach
2
SLIDE 3
What are Equivalent polynomials?
Definition (Equivalent polynomials)
g(x1, x2) = x1 + x2
2
f (x1, x2) = x1 + x2 + x2
2 .
If we replace the variables of g as follows, we obtain f . x1 → x1 + x2 x2 → x2 .
3
SLIDE 4 What are Equivalent polynomials?
Definition (Equivalent polynomials)
g(x1, x2) = x1 + x2
2
f (x1, x2) = x1 + x2 + x2
2 .
If we replace the variables of g as follows, we obtain f . 1 1 1
· x1 x2
x
= x1 + x2 x2 ; f (x) = g(Ax)
4
SLIDE 5
What are Equivalent polynomials?
Definition (Equivalent polynomials)
Two n-variate, degree d polynomials f and g (over a field F ) are said to be equivalent if there exists an invertible matrix A ∈ Fn×n such that f (x) = g(Ax). The Equivalence Testing Problem: Can we efficiently check if two polynomials f and g are equivalent?
5
SLIDE 6 Complexity of Equivalence Testing
Depends on the underlying field.
- over finite fields: NP ∩ co-AM
[Thierauf(1998), Saxena(2006)]
- over Q: not even known if it is decidable or not!
- over other fields: reduces to solving system of polynomial
equations (which could possibly be a harder problem).
6
SLIDE 7
Relation to other Isomorphism problems
Isomorphism problem: Check if there is a bijection between two structures that preserves some relation on the structure. Examples: Graph Isomorphism, Algebra Isomorphism, Tensor Iso- morphism. Graph Isomorphism: Two graphs are isomorphic if there is a bijec- tion between the vertex sets which preserves the edge relation. Given two graphs, check if they are isomorphic.
7
SLIDE 8 Algebra Isomorphism
(A, +, ∗) is a F-Algebra if:
- (A, +) is a F-vector space.
- (A, +, ∗) is a ring.
- the ring multiplication is compatible with the scalar
multiplication of the field, i.e k(B ∗ C) = (kB) ∗ C = B ∗ (kC) for all B, C ∈ A and k ∈ F. Example The set of all m × m matrices (Mm, +, ∗). Algebra Isomorphism: Given bases of two algebras (as structure table), check if there is a bijection that preserves the + and ∗
8
SLIDE 9
d-tensor Isomorphism
Consider a partition of n variables into d sets. A d-tensor is a de- gree d homogeneous polynomial such that each monomial contains exactly one variable from each of the d variable sets. Example: f = x1x4 + x2x4 + x3x6 is a 2-tensor. d-tensor Isomorphism: Given two d-tensors f and g, check if there exists invertible matrices B1, . . . , Bd such that f (x1, . . . , xd) = g(B1x1, . . . , Bdxd).
9
SLIDE 10
Connections between the isomorphism problems
10
SLIDE 11
A natural variant of Equivalence Testing
Equivalence test for special polynomial families: Check if a polynomial f is equivalent to some g ∈ G where G = {g1, g2, . . .} is a polynomial family. Some popular polynomial families: Permanent, Determinant, Power Symmetric polynomial, Sum of Products polynomial, Elemen- tary Symmetric polynomial, Iterated Matrix Multiplication (IMM) polynomial, Trace Iterated Matrix Multiplication (Tr-IMM) polyno- mial, Design polynomials etc...
11
SLIDE 12 Motivation from Geometric Complexity Theory
An n-variate, degree d homogeneous polynomial f : A point in the vector space CN (where N = n+d
d
Orbit of f : O(f ) = {g : g(x) = f (Ax), A is invertible}. Orbit Closure of f : O(f ) - The Zariski closure of O.
12
SLIDE 13 Motivation from Geometric Complexity Theory
Perm vs Det problem: Show that padded permanent is not in the
- rbit closure of (poly-sized) determinant polynomial.
This question also makes sense for permanent vs any other polyno- mial family G where G is a complete for some low complexity circuit class C.
13
SLIDE 14
Equivalence test for some well known polynomial families
[Kayal(2012)] gave efficient randomized algorithms for equivalence testing of the Permanent polynomial family , Power Symmetric polynomial family, Sum of Product polynomial family, Elemen- tary Symmetric polynomial family over any field. From now on we assume a stronger search version of the equiva- lence testing problem.
14
SLIDE 15 Determinant Equivalence Testing
The Determinant polynomial family: {Det(Xn)}n≥1 , where Det(Xn) denotes the determinant of n × n symbolic matrix Xn. Determinant Equivalence Testing (DET)
- An efficient randomized algorithm is known over :
◮ C [Kayal(2012)] ◮ finite fields of sufficiently large characteristic - Garg,Gupta,Kayal,Saha [GGKS19]. ◮ For fixed n, DET can be efficiently done given oracle access to INTFACT [GGKS19].
- But it is as hard as Integer Factoring (INTFACT) over
Q [GGKS19].
15
SLIDE 16
IMM Equivalence Testing
The Iterated Matrix Multiplication Polynomial Family IMMw,d := (1, 1)-th entry of (X1 · X2 . . . Xd) where each Xi is a w × w symbolic matrix. An efficient randomized equivalence test for the Iterated Matrix Multiplication polynomial (IMM) over Q, C and finite fields is known from Kayal,Nair,Saha,Tavenas [KNST17].
16
SLIDE 17
IMM vs Determinant Equivalence testing
Both IMM and Determinant polynomial families are complete for the circuit class VBP, yet they can not have similar algo- rithmic complexity for the equivalence testing problem (over Q) unless INTFACT is easy.
17
SLIDE 18
The Trace Iterated Matrix Multiplication Polynomial
Definition (The Trace Iterated Matrix Multiplication Polynomial)
Q1 = x(1)
11
x(1)
12
x(1)
21
x(1)
22
; Q2 = x(2)
11
x(2)
12
x(2)
21
x(2)
22
; Q3 = x(3)
11
x(3)
12
x(3)
21
x(3)
22
. w = 2, d = 3. Tr-IMM2,3 = tr(Q1 · Q2 · Q3) .
18
SLIDE 19
The Trace Iterated Matrix Multiplication Polynomial
Definition (The Trace Iterated Matrix Multiplication Polynomial)
Let Q1, . . . , Qd be w × w symbolic matrices whose entries are distinct (formal) variables. Then the Trace Iterated Matrix Multiplication Polynomial denoted as Tr-IMMw,d is defined as the trace of the product of these matrices. Tr-IMMw,d = tr(Q1 · Q2 . . . Qd) .
19
SLIDE 20
Equivalence test for Tr-IMM (TRACE)
It is syntatically close to the IMM polynomial, which is the (1, 1)-th entry of the matrix product. Is the complexity of TRACE similar to the equivalence test for IMM polynomial?
20
SLIDE 21
Equivalence test for Tr-IMM (TRACE)
It is syntatically close to the IMM polynomial, which is the (1, 1)-th entry of the matrix product. Is the complexity of TRACE similar to the equivalence test for IMM polynomial? Or does it resemble that of DET?
20
SLIDE 22
Equivalence test for Tr-IMM (TRACE)
Problem Statement (Equivalence test for Tr-IMMw,d polynomial (TRACE))
Given blackbox access to an n-variate degree d polynomial f , check efficiently if f is equivalent to Tr-IMMw,d. If yes, then compute an invertible matrix A ∈ Fn×n such that f (x) = Tr-IMMw,d(Ax)
21
SLIDE 23
Could there be some relation between special cases of the iso- morphism problem and the special cases of equivalence test- ing?
22
SLIDE 24
Some special cases of the Isomorphism Problems
Full Matrix Algebra Isomorphism (FMAI) Given a basis of an algebra A ⊆ Mm, determine if A is isomorphic to Mw where w2 = dim(A). If yes, compute an isomorphism from A → Mw.
23
SLIDE 25
Some special cases of the Isomorphism Problems
Matrix Multiplication Tensor Isomorphism (MMTI) Given a 3- tensor f , check if it is isomorphic to any tensor in the Tr-IMMw,3 family, i.e check if f (x) = Tr-IMMw,3(B1x1, B2x2, B3x3) = Tr-IMMw,3(Bx) and if yes, output B1, B2, B3.
24
SLIDE 26
Some special cases of the Isomorphism Problems
Tensor Isomorphism for Tr-IMM (TRACE-TI) Given a d-tensor f , check if it is isomorphic to any tensor in the Tr-IMMw,d family, i.e check if f (x) = Tr-IMMw,d(B1x1, . . . , Bdxd) = Tr-IMMw,d(Bx). and if yes, output B1, . . . , Bd.
25
SLIDE 27
Results
26
SLIDE 28
Results
Theorem 1 (TRACE is randomized polynomial time Turing reducible to DET)
Given oracle access to DET over F, TRACE can be solved in randomized, polynomial time polynomial time: poly(n, β) running time randomized: 1 − o(1) success probability.
27
SLIDE 29
Approach
Input: Blackbox access to f Reduce to TRACE-TI Reduce TRACE-TI to DET and compute A Check if f (x) = Tr-IMM(Ax) using Schwartz-Zippel lemma
Figure: High level view of the Algorithm
28
SLIDE 30
Part-I: Reduction to TRACE-TI
TRACE: Is f (x) = Tr-IMMw,d(Ax) for some invertible matrix A? TRACE-TI: Is f (x) = Tr-IMMw,d(Bx) for some invertible, block- diagonal matrix B? Remark: An efficient randomized algorithm for TRACE-TI over C was given in [Grochow(2012)] which does not involve reduction to DET.
29
SLIDE 31
Part-I: Reduction to TRACE-TI
Tr-IMM(x) = tr(Q1 · Q2 . . . Qd) f = Tr-IMM(Ax) = tr(X1 · X2 . . . Xd) For example, Qi = x1 x2 x3 x4 , Xi = x1 + x6 2x1 x1 + 2x4 x4 − x9 Xi - space spanned by the linear forms in Xi. The Layer Spaces of f are X1, . . . , Xd.
30
SLIDE 32
Part-I: Reduction to TRACE-TI
1.Compute a bases for the layer spaces X1, . . . , Xd of f .
31
SLIDE 33 Part-I: Reduction to TRACE-TI
1.Compute a bases for the layer spaces X1, . . . , Xd of f .
- 2. Compute a linear map ˆ
A which maps each basis vector to a distinct variable.
31
SLIDE 34 Part-I: Reduction to TRACE-TI
1.Compute a bases for the layer spaces X1, . . . , Xd of f .
- 2. Compute a linear map ˆ
A which maps each basis vector to a distinct variable.
- 3. Define a new polynomial h(x) = f ( ˆ
Ax). Since we mapped each basis vector to a distinct variable, h is a d-tensor. h(x) = f ( ˆ Ax) = Tr-IMM(A ˆ Ax) We compute ˆ A such that A A is block-diagonal. This is the TRACE- TI problem!
31
SLIDE 35 Computing a basis for the layer spaces
Associated with any n-variate polynomial f , there is a vector space called the Lie Algebra gf (of the group of symmetries) of f which consists of n × n matrices E = (eij)n×n satisfying
eijxj ∂f ∂xi = 0 .
32
SLIDE 36
Computing a basis for the layer spaces
The basis elements of the Lie Algebra of Tr-IMM are block-diagonal matrices [Gesmundo(2016)].
33
SLIDE 37
Computing a basis for the layer spaces
The corresponding basis elements of the Lie Algebra of f ∼ Tr-IMM looks like:
34
SLIDE 38
Computing a basis for the layer spaces
We compute a bases of the Lie Algebra of f . We exploit this relationship to compute a bases for the irreducible invariant subspaces V1, . . . , Vd of gf . Given a bases of these irreducible invariant subspaces, we then com- pute a bases of the layer spaces of f and then reorder them ap- propriately.
35
SLIDE 39
Part-II: Reduction from TRACE-TI to DET
f = tr(X1 · X2 . . . Xd−1 · Xd)
36
SLIDE 40
Part-II: Reduction from TRACE-TI to DET
f = tr(X1 · X2 . . . Xd−1 · Xd) = Y1 · Y2 . . . Yd−1 · Yd where, Y1 = [X1(1, ∗), . . . , X1(w, ∗)]1×w2 Yd = [Xd(∗, 1)T, . . . , Xd(∗, w)T]w2×1 Yi = Xi ... Xi ... Xi
w2×w2
for i ∈ [2, d].
36
SLIDE 41 Part-II: Reduction from TRACE-TI to DET
- 1. Using set-multilinear ABP reconstruction [Klivans,Shpilka(2003)],
we compute Y ′
1, . . . , Y ′ d such that:
f = Y ′
1 · Y ′ 2 . . . Y ′ d−1 · Y ′ d
Y ′
i = T −1 i−1
Xi ... Xi ... Xi Ti for i ∈ [2, d − 1] Idea: Block-diagonalize the matrices Y ′
2, . . . , Y ′ d−1. 37
SLIDE 42 Part-II: Reduction from TRACE-TI to DET
- 2. For each intermediate matrix, compute blackbox access to circuit
computing ci · det(Xi) from Y ′
i .
Xi that satisfies exactly one of the follow- ing: Xi = Ai · Xi · Bi Xi = Ai · Xi
T · Bi 38
SLIDE 43
Part-II: Reduction from TRACE-TI to DET
Y ′
i = T −1 i−1
Xi ... Xi ... Xi Ti
39
SLIDE 44
Part-II: Reduction from TRACE-TI to DET
Y ′
i = T −1 i−1
Ai XiBi ... Ai XiBi ... Ai XiBi Ti
40
SLIDE 45 Part-II: Reduction from TRACE-TI to DET
Y ′
i = Pi
...
...
Qi
Pi, Qi for all i ∈ [2, d − 1]. (Ideally, we would want Pi
−1Y ′ i
Qi
−1 to be block-diagonal). 41
SLIDE 46 Part-II: Reduction from TRACE-TI to DET
Pi, Qi, Y ′
i ’s , we compute X ′ 2, . . . X ′ d−1 such that:
X ′
2 · X ′ 3 . . . X ′ d−1 = α · A · X2 · X3 . . . Xd−1 · B
1, X ′ d (using ABP reconstruction techniques):
X ′
1 = α−1 · X1 · A−1 and X ′ d = B · Xd
So, X1 · X2 . . . Xd−1 · Xd = X ′
1 · X ′ 2 . . . X ′ d−1 · X ′ d 42
SLIDE 47
Conclusion
43
SLIDE 48
Acknowledgements
I thank my advisor Prof. Chandan Saha for his guidance and support during the course of my research program. I want to thank Vineet Nair for mentoring me. I also want to thank him for his contributions to the thesis. Thanks to my labmates Vineet, Nikhil and Bhargav for their won- derful company.
44
SLIDE 49
Acknowledgements
The question regarding whether the equivalence test for IMM can be extended to Tr-IMM or not was asked by Avi Wigderson to Vi- neet Nair at CCC’17 after the presentation of their work on IMM equivalence test. Christian Ikenymeyer also pointed out that the Tr-IMM polynomial is more interesting to mathematicians compared to the IMM poly- nomial and encouraged to look at this problem.
45
SLIDE 50
Acknowledgements
I want to thank all my friends here at IISc and elsewhere for the wonderful memories I have had with them. Last but not the least, I want to thank amma, appa, Sankar and Meenu for their love and encouragement :)
46
SLIDE 51
Thomas Thierauf. The isomorphism problem for read-once branching programs and arithmetic circuits. In Chicago Journal of Theoretical Computer Science. Citeseer, 1998. Nitin Saxena. Morphisms of rings and applications to complexity. Indian Institute of Technology Kanpur, 2006. Neeraj Kayal. Affine projections of polynomials. In Proceedings of the forty-fourth annual ACM symposium on Theory of computing, pages 643–662. ACM, 2012. Joshua Abraham Grochow. Symmetry and equivalence relations in classical and geometric complexity theory. The University of Chicago, 2012. Fulvio Gesmundo.
46
SLIDE 52
Geometric aspects of iterated matrix multiplication. Journal of Algebra, 461:42–64, 2016.
46