
Computing the Nearest Singular Matrix Polynomial

Joseph Haraldson

with Mark Giesbrecht and George Labahn
David R. Cheriton School of Computer Science, University of Waterloo

January 24, 2018


Our Problem: Find the Nearest Singular Matrix Polynomial

Given a matrix A ∈ R[t]^{n×n}, find Â ∈ R[t]^{n×n} such that Â is singular and ‖A − Â‖ is minimized.

Equivalently, for some “reasonable” norm ‖·‖, solve the optimization problem

  min_{Â,b} ‖A − Â‖  subject to  Âb = 0,  ‖b‖ = 1,

where ‖A_ij‖₂² = Σ_{0≤k≤deg A_ij} A_ijk² and ‖A‖ = ‖A‖_F = (Σ_{1≤i,j≤n} ‖A_ij‖₂²)^{1/2}.

  • A matrix polynomial A ∈ R[t]^{n×n} is singular if det(A) ≡ 0
  • Sometimes we say that A is irregular
  • Singular ⇒ ∃ b ∈ R[t]^{n×1} \ {0} such that Ab ≡ 0 (checked concretely in the sketch below)
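As a quick illustration, the singularity test det(A) ≡ 0 can be run symbolically. This is a minimal sketch (mine, not part of the talk), using the 2 × 2 example that reappears on later slides.

```python
# Minimal sketch (not from the talk): checking det(A) = 0 symbolically
# with SymPy, for the 2 x 2 example used on later slides.
import sympy as sp

t = sp.symbols('t')
A = sp.Matrix([[t**2 - 1,       t + 1],
               [t**2 - 2*t + 1, t - 1]])

print(sp.expand(A.det()))  # 0, so A is singular as a matrix polynomial
print(A.nullspace())       # a kernel vector over R(t), proportional to (-1, t - 1)
```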


Previous Algorithms

We would like an algorithm that is...

  • Numerically robust, with rapid local convergence w.r.t. ‖·‖

Existing algorithms:

  • Restricted SVD [De Moor ’93, ’94]
      • At best linear convergence
  • Structured Total Least Norm (STLN) [Rosen, Park & Glick ’96]
      • Superlinear convergence (not quadratic) [Lemmerling ’99]
  • Variable Projection [Golub & Pereyra ’73, ’03]
      • Uses Gauss-Newton (pure Newton)
      • Converges superlinearly (at least quadratically if normalized)
      • Problem size is much larger
  • Lift and Project methods [Spaenlehauer & Schost ’16]


Our Contributions

Recall we want to solve the optimization problem

  min_{∆A,b} ‖∆A‖_F  subject to  ‖b‖_F = 1,  (A + ∆A)b = 0.

  1. Prove that minimal solutions exist
  2. Prove that minimal solutions are isolated:
       inf ‖∆A_opt − ∆A⋆_opt‖_F > 0 when rank A = n,
     for two distinct minimal perturbations ∆A_opt and ∆A⋆_opt
  3. Show that the problem is (locally) well-posed
       • Optimal value is isolated around minimizers
       • Algorithm is provably locally stable
  4. Derive and implement a quadratically convergent algorithm


Applications

  • Stability of solutions to linear time-invariant systems
      [Byers & Nichols ’93] and [Byers, He & Mehrmann ’98]
  • Stability of polynomial eigenvalue problems
  • Approximate GC(R)D of Ore polynomials
      [Giesbrecht, Haraldson & Kaltofen ’16]


Existence of Solutions

Theorem

Let ∆A ∈ R[t]^{n×n} have the same support as A, that is, deg ∆A_ij ≤ deg A_ij. The optimization problem

  min_{∆A,b} ‖∆A‖  subject to  ‖b‖ = 1,  (A + ∆A)b = 0

has an attainable global minimum (∆A⋆, b⋆).

  • The nearest singular matrix polynomial always exists,
      i.e. ∃ (∆A⋆, b⋆) s.t. (A + ∆A⋆)b⋆ = 0 and ‖∆A⋆‖ is minimal
  • Minimization of ‖∆A‖ under other structures has a solution if there is a finite solution to (A + ∆A)b = 0
  • In general, bilinear feasibility over Q is NP-hard


Embedding R[t]^{n×n} in R^{n(µ+d)×nµ}

Embed multiplication as a linear algebra problem over R.

  • Recall that A is n × n with entries of degree at most d
  • Define µ = nd + 1

Lemma

If A ∈ R[t]^{n×n} is singular then there exists b ∈ ker A such that deg b ≤ nd = µ − 1.

  • Embed A as 𝒜 ∈ R^{n(µ+d)×nµ} and b as B ∈ R^{nµ×1}
  • Ab = 0 ⇐⇒ 𝒜B = 0
  • The SVD of 𝒜 provides a cheap lower bound on the distance to a singular matrix, hence to a singular matrix polynomial (see the sketch below)
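A minimal sketch of this embedding follows; the coefficient and block conventions here are my assumptions (a standard convolution-matrix construction), not the authors' code.

```python
# Minimal sketch (conventions are my assumptions): embed A in R[t]^{n x n}
# as the block matrix calA in R^{n(mu+d) x n*mu}, where each entry A_ij
# contributes a (mu+d) x mu Toeplitz block, so that Ab = 0 over R[t]
# becomes the real linear system calA @ B = 0.
import numpy as np

def conv_block(coeffs, d, mu):
    """(mu + d) x mu convolution (Toeplitz) matrix of a polynomial given
    by its coefficient list (constant term first, degree <= d)."""
    M = np.zeros((mu + d, mu))
    for j in range(mu):                 # column j multiplies t^j
        for k, c in enumerate(coeffs):
            M[k + j, j] = c
    return M

def embed(A_coeffs, d):
    """A_coeffs[i][j] lists the coefficients of A_ij; returns calA."""
    n = len(A_coeffs)
    mu = n * d + 1                      # kernel degree bound: deg b <= nd
    return np.block([[conv_block(A_coeffs[i][j], d, mu) for j in range(n)]
                     for i in range(n)])

# The 2 x 2, degree-2 example from the next slide (mu = 5, calA is 14 x 10).
A_coeffs = [[[-1, 0, 1], [1, 1]],       # t^2 - 1,       t + 1
            [[1, -2, 1], [-1, 1]]]      # t^2 - 2t + 1,  t - 1
calA = embed(A_coeffs, d=2)
# sigma_min(calA) = 0 certifies a kernel vector; for a nonsingular A it
# gives a cheap lower bound on the distance to a singular matrix.
print(np.linalg.svd(calA, compute_uv=False)[-1])   # ~0, since this A is singular
```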


Embedding Example

The embedding preserves kernel vectors...

A = [ t² − 1        t + 1 ]
    [ t² − 2t + 1   t − 1 ]    and    ker A = span{ (−1, t − 1)ᵀ }

Here n = 2 and d = 2, so µ = 5: 𝒜 ∈ R^{14×10} is built from four 7 × 5 Toeplitz blocks, one per entry of A (this is the matrix constructed by the sketch above), and ker 𝒜 contains the embeddings of tᵏ · (−1, t − 1)ᵀ for k = 0, 1, 2, 3.

  • The embedding does not preserve rank information


Separation Bounds

Theorem

Suppose ∆A and ∆A⋆ are distinct (local) minimal solutions. Then

  ‖∆A − ∆A⋆‖_F ≥ ‖∆A − ∆A⋆‖₂ / (nd + 1) ≥ σ_min(𝒜) / (nd + 1).

Corollary

Minimal solutions (∆A⋆, b⋆) are isolated modulo equivalence classes of b⋆.

  • Normalize b over R[t] ⇒ locally unique solutions
  • Proof uses first-order information
  • A similar separation holds under ‖·‖₁ and other norms


Lagrange Multipliers

Definition

The Lagrangian is defined as

  L = ‖∆A‖_F² + λᵀ [ vec((𝒜 + ∆𝒜)B) ; BᵀB − 1 ].

  • ∇ is the gradient operator
  • Let x = (vec(∆A)ᵀ, vec(B)ᵀ)ᵀ
  • ∇² (∇²_xx) is the Hessian operator (w.r.t. x)
  • First-order necessary (Karush-Kuhn-Tucker, KKT) conditions:
      ∇L(∆A⋆, B⋆, λ⋆) = 0
  • Idea: solve ∇L = 0 by Newton’s method, i.e. compute
      (x_{k+1}, λ_{k+1}) = (x_k + ∆x_k, λ_k + ∆λ_k)  such that  ∇²L (∆x; ∆λ) = −∇L
    (a toy instance is sketched below)
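To make the iteration concrete, here is a self-contained toy sketch (my construction under simplifying assumptions, not the authors' implementation): the degree-0 instance of the problem, i.e. the nearest singular constant matrix, where the optimal ‖∆A‖_F is known to equal σ_min(A), so the Newton iterates can be checked against the SVD.

```python
# Minimal sketch (my construction, not the authors' implementation):
# Newton's method on grad L = 0 for the degree-0 (constant matrix) case,
#     min ||E||_F^2  subject to  (A + E) b = 0,  b^T b = 1,
# whose optimal ||E||_F is sigma_min(A).
import numpy as np

A = np.array([[2.0, 0.0],
              [0.0, 1.0]])
n = A.shape[0]

def grad_and_hess(E, b, lam):
    """Gradient and Hessian of L = ||E||_F^2 + lc.(A+E)b + ln(b.b - 1)
    in the joint variable z = (vec(E), b, lam), with row-major vec."""
    lc, ln = lam[:n], lam[n]
    gE = 2.0 * E.ravel() + np.kron(lc, b)               # dL/dvec(E)
    gb = (A + E).T @ lc + 2.0 * ln * b                  # dL/db
    gc = np.concatenate([(A + E) @ b, [b @ b - 1.0]])   # dL/dlam = c(x)
    # Jacobian of the constraints: J = [psi(b)  A+E; 0  2 b^T]
    J = np.zeros((n + 1, n * n + n))
    J[:n, :n * n] = np.kron(np.eye(n), b)               # psi(b)
    J[:n, n * n:] = A + E
    J[n, n * n:] = 2.0 * b
    # Hessian blocks of L with respect to x = (vec(E), b)
    Hxx = np.zeros((n * n + n, n * n + n))
    Hxx[:n * n, :n * n] = 2.0 * np.eye(n * n)
    Hxx[:n * n, n * n:] = np.kron(lc.reshape(-1, 1), np.eye(n))
    Hxx[n * n:, :n * n] = Hxx[:n * n, n * n:].T
    Hxx[n * n:, n * n:] = 2.0 * ln * np.eye(n)
    H = np.block([[Hxx, J.T], [J, np.zeros((n + 1, n + 1))]])
    return np.concatenate([gE, gb, gc]), H

# Initialize as the talk suggests: b from the SVD (right singular vector
# for sigma_min), E = 0, lam = 0; then iterate full Newton steps.
b = np.linalg.svd(A)[2][-1]
E, lam = np.zeros((n, n)), np.zeros(n + 1)
for _ in range(20):
    g, H = grad_and_hess(E, b, lam)
    dz = np.linalg.solve(H, -g)
    E += dz[:n * n].reshape(n, n)
    b += dz[n * n:n * n + n]
    lam += dz[n * n + n:]
    if np.linalg.norm(dz) < 1e-12:
        break

print(np.linalg.norm(E))   # ~1.0 = sigma_min(A)
```

In the matrix polynomial setting, x additionally carries the coefficient structure of ∆A through the embedding, but the step itself, solving ∇²L (∆x; ∆λ) = −∇L, has exactly this shape.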


Lagrange Multipliers

Definition

The Jacobian of the constraints is

  J = ∇ₓ [ vec((𝒜 + ∆𝒜)B) ; BᵀB − 1 ] = [ ψ(B)   𝒜 + ∆𝒜 ]
                                         [   0      2Bᵀ   ].

  • ψ(B) vec(𝒜 + ∆𝒜) = 0 ⇐⇒ (𝒜 + ∆𝒜)B = 0 (a numerical check follows below)
  • J has full rank ⇒ minimal solutions (∆A⋆, b⋆) are isolated
  • Multiple kernel vectors ⇒ J is rank deficient

Definition

The Hessian matrix of L is

  ∇²L = [ ∇²_xx L   Jᵀ ]
        [    J       0 ].

∇²L has full rank ⇐⇒ J has full rank.
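As a quick sanity check (my sketch; it assumes a column-stacked vec convention, which is an assumption rather than the paper's stated one), ψ(B) can be realized as a Kronecker product:

```python
# Quick check (my sketch, assuming column-stacked vec): one realization
# of psi(B) is B^T kron I, since  M @ B = (B^T kron I) vec(M).
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((14, 10))   # stands in for calA + Delta(calA)
B = rng.standard_normal(10)         # embedded kernel vector
psi = np.kron(B, np.eye(14))        # shape 14 x 140
assert np.allclose(psi @ M.flatten(order='F'), M @ B)
```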


Minimal Kernel Embedding

Definition

A kernel vector B corresponding to b ∈ ker(A + ∆A) is minimally degree embedded in 𝒜 + ∆𝒜 if

  1. ker(A + ∆A) = span(b) and b is primitive, and
  2. b comes from a column-echelon reduced basis.

Theorem

J has full rank at (∆A⋆, b⋆) if b⋆ is minimally degree embedded.

  • Minimally embed by deleting rows/columns of 𝒜/B
  • Compute via orthogonal eliminations

Corollary

Newton’s method converges quadratically to (∆A⋆, b⋆) with a suitable initial guess.


Minimal Kernel Example

A = [ t² − 1        t + 1 ]
    [ t² − 2t + 1   t − 1 ]    and    ker A = span{ (−1, t − 1)ᵀ }

The minimal embedding keeps one column for b₁ = −1 (degree 0) and two columns for b₂ = t − 1 (degree 1), giving

  A_min = [ −1    1    0 ]
          [  0    1    1 ]
          [  1    0    1 ]    and    B_min = [ −1 ]
          [  1   −1    0 ]                   [ −1 ]
          [ −2    1   −1 ]                   [  1 ]
          [  1    0    1 ]

with A_min B_min = 0 (checked below).
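A one-line check of the minimal embedding above (a small sketch; the matrices are transcribed from this slide):

```python
# Checking the minimal kernel embedding: A_min @ B_min should be 0.
import numpy as np

A_min = np.array([[-1,  1,  0],
                  [ 0,  1,  1],
                  [ 1,  0,  1],
                  [ 1, -1,  0],
                  [-2,  1, -1],
                  [ 1,  0,  1]])
B_min = np.array([-1, -1, 1])
print(A_min @ B_min)   # [0 0 0 0 0 0]
```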


Implementation Information

Our implementation:

  • Initialize ∆A and b via SVD (or another method)
  • Minimally degree embed the system over ∆𝒜 and B
  • Compute Lagrange multipliers via linear least squares (sketched below)

In general:

  • Use any method to approximate ∇L = 0
  • Minimally degree embed and switch to Newton’s method later

Cost per iteration:

  • Maximum cost per iteration is O(size⁶) = O(n¹²d⁶)
  • Exploiting structure and sparsity reduces costs considerably
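A sketch of the multiplier initialization, under my reading of this step (the function and shapes are illustrative, not the paper's code): since ∇ₓL = ∇ₓf + Jᵀλ, a good λ₀ minimizes ‖∇ₓf(x₀) + J(x₀)ᵀλ‖₂, which is a linear least-squares problem.

```python
# Sketch of the multiplier initialization (my reading of the slide):
# choose lambda0 to minimize || grad f(x0) + J(x0)^T lambda ||_2,
# i.e. solve J^T lambda = -grad f(x0) in the least-squares sense.
import numpy as np

def init_multipliers(J, grad_f):
    lam0, *_ = np.linalg.lstsq(J.T, -grad_f, rcond=None)
    return lam0

# Illustrative shapes only: 3 constraints, 6 variables.
rng = np.random.default_rng(1)
J = rng.standard_normal((3, 6))
grad_f = rng.standard_normal(6)
print(init_multipliers(J, grad_f))
```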


Algorithm

Input: Full rank A ∈ R[t]^{n×n} with structure ∆A, and C ∈ R[t]^{n×n} with (approximate) kernel vector b

Output: A + ∆A⋆ or an indication of failure

  1. Embed the input over R.
  2. Compute λ₀ by solving ∇L|_{x₀} = 0 via linear least squares.
  3. Compute (x + ∆x, λ + ∆λ) until ‖(∆x; ∆λ)‖₂ is sufficiently small.
  4. Return the locally optimal solution or an indication of failure.


Example

  • We generate a singular matrix polynomial of rank 2
  • Create A by adding noise; the relative error is ‖∆A‖_F/‖A‖_F ≈ .0512
  • Compare the algorithm with different approximate kernel vectors

Â = A + ∆A, with entries as displayed:

A:
   .11t³ − .18t² − .075t + .11        .062t³ + .12t² + .15t − .12
  −.034t³ − .057t² + .18t − .019      .075t³ − .0021t² − .16t + .053
   .092t³ + .11t² − .42t + .062      −.29t³ − .21t² + .48t − .10
   .13t³ − .079t² − .13t − .088       .075t³ − .0042t² − .042t + .066

∆A:
   .062t³ − .0084t² + .16t − .066    −.084t³ − .057t² − .048t − .029
  −.0042t³ − .017t² − .062t − .053   −.057t³ + .057t² − .0084t + .013
  −.0042t³ − .038t² + .18t + .034     .26t³ + .048t − .026
   .070t³ + .029t² + .14t + .10      −.097t³ + .026t² + .042t + .017


Example Cont.

b₁ = (  0.06105t⁴ + 0.06919t³ − 0.30118t² + 0.01628t,
        0.232t⁴ − 0.0936t³ − 0.334t² − 0.0855t − 0.0244,
        0.0,
        0.265t⁴ + 0.0326t³ − 0.789t² + 0.122t + 0.0977 )ᵀ

b₂ = ( −0.118t⁴ − 0.102t³ − 0.172t² − 0.314t + 0.251,
       −0.0392t⁴ − 0.0784t³ + 0.219t² − 0.165t + 0.188,
        0.255t⁴ + 0.0314t³ − 0.760t² + 0.118t + 0.0941,
        0.0 )ᵀ


Example Cont.

  • ‖∆A_b₁‖/‖A‖ ≈ .002756
  • ‖∆A_b₂‖/‖A‖ ≈ .002735
  • Actual separation: ‖Â_b₁ − Â_b₂‖/‖A‖ ≈ .002761
  • Lower bound on separation: ≈ .000051

  Iteration | ‖x_{i−1} − x_i‖₂ (from b₁) | ‖x_{i−1} − x_i‖₂ (from b₂)
  ----------+-----------------------------+----------------------------
      1     | 108.291                     | 18.797
      2     | 30.335                      | 12.297
      3     | 0.9396                      | 1.008
      4     | 2.6939e-3                   | 6.261e-4
      5     | 5.5337e-9                   | 2.2271e-11
      6     | 8.06113e-20                 | 1.8074e-25
      7     |                             |


Some Experiments

  • Generate a singular matrix polynomial and add noise
  • Initialize with a nearby singular matrix polynomial with a kernel vector

   n |  d | iterations | ‖A − A_init‖_F/‖A‖_F | ‖∆A‖_F/‖A‖_F | Status
  ---+----+------------+-----------------------+---------------+--------
   6 |  2 |      7     | 1.64e-02              | 3.29e-03      |
   6 |  6 |      6     | 1.58e-04              | 3.40e-05      |
   6 |  6 |      1     | 1.51e-02              | 6.85e-03      | S-FAIL
   6 | 10 |      6     | 1.85e-04              | 4.16e-05      |
   6 | 10 |      2     | 1.70e-02              | 2.98e-02      | FAIL
   9 |  2 |      5     | 1.69e-04              | 3.75e-05      |
   9 |  2 |      1     | 1.66e-02              | 6.11e-03      | S-FAIL
   9 |  4 |      9     | 1.68e-02              | 2.29e-03      |
  12 |  2 |      5     | 1.75e-04              | 2.21e-05      |
  12 |  2 |      9     | 1.67e-02              | 2.28e-03      |

  • S-FAIL: in the region of convergence, but the kernel vector is not primitive
  • FAIL: outside the region of convergence


Completed/In Progress

What we’ve done since...

  • Quadratically convergent algorithm for “rank-at-most” approximation [Giesbrecht, Haraldson & Labahn ’17]
  • The technique can be adapted to other affine structures:
      • Approximate (multivariate) polynomial GCD
      • Approximate multivariate polynomial factorization, i.e. using [Kaltofen, May, Yang & Zhi ’08]
  • Implementation online in Maple

...and we are currently looking into

  • Hardness results for the nearest singular matrix polynomial
  • Matrix polynomial approximate GCD
  • Nearest “interesting” Smith form
  • Wilkinson’s Problem for matrix polynomials
