computing the matrix square root in a matrix group
play

Computing the Matrix Square Root in a Matrix Group Nick Higham - PowerPoint PPT Presentation

Computing the Matrix Square Root in a Matrix Group Nick Higham Department of Mathematics The Victoria University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/ Joint work with Niloufer Mackey, D. Steven Mackey, and


  1. Computing the Matrix Square Root in a Matrix Group Nick Higham Department of Mathematics The Victoria University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/ Joint work with Niloufer Mackey, D. Steven Mackey, and Françoise Tisseur. Matrix Group Iterations – p. 1/20

  2. Matrix Square Root X is a square root of A ∈ C n × n ⇐ ⇒ X 2 = A . Number of square roots may be zero, finite or infinite. Principal Square Root For A with no eigenvalues on R − = { x ∈ R : x ≤ 0 } is the unique square root with spectrum in the open right half-plane. Denoted by A 1 / 2 . Matrix Group Iterations – p. 2/20

  3. Newton’s Method A = ( X + E ) 2 : drop second order term and solve for E . X 0 given, Solve X k E k + E k X k = A − X 2 � k k = 0 , 1 , 2 , . . . X k +1 = X k + E k Prohibitively expensive. Matrix Group Iterations – p. 3/20

  4. Newton’s Method A = ( X + E ) 2 : drop second order term and solve for E . X 0 given, Solve X k E k + E k X k = A − X 2 � k k = 0 , 1 , 2 , . . . X k +1 = X k + E k Prohibitively expensive. But observe that X 0 A = AX 0 ⇒ X k A = AX k for all k . Matrix Group Iterations – p. 3/20

  5. Newton Iteration 2 ( X k + X − 1 X k +1 = 1 k A ) , X 0 = A . If A ∈ C n × n has no eigenvalues on R − then Iterates X k nonsingular, converge quadratically to A 1 / 2 . Related to Newton sign function iterates S k +1 = 1 S 0 = A 1 / 2 2( S k + S − 1 k ) , by X k ≡ A 1 / 2 S k . Matrix Group Iterations – p. 4/20

  6. Newton Iteration 2 ( X k + X − 1 X k +1 = 1 k A ) , X 0 = A . If A ∈ C n × n has no eigenvalues on R − then Iterates X k nonsingular, converge quadratically to A 1 / 2 . Related to Newton sign function iterates S k +1 = 1 S 0 = A 1 / 2 2( S k + S − 1 k ) , by X k ≡ A 1 / 2 S k . Problem : numerical instability (Laasonen, 1958; H, 1996). Matrix Group Iterations – p. 4/20

  7. Newton Variants X k +1 = 1 X k + Y − 1 � � , X 0 = A, DB iteration : k 2 Y k +1 = 1 [Denman–Beavers, 1976] Y k + X − 1 � � , Y 0 = I. k 2 I + M k + M − 1 M k +1 = 1 � � k , M 0 = A, 2 2 Product form DB : X k +1 = 1 2 X k ( I + M − 1 [Cheng-Higham- k ) , X 0 = A, Kenney-Laub, 2001] Y k +1 = 1 2 Y k ( I + M − 1 k ) , Y 0 = I. Y k +1 = − Y k Z − 1 k Y k , Y 0 = I − A, CR iteration : [Meini, 2004] Z k +1 = Z k + 2 Y k +1 , Z 0 = 2( I + A ) . Matrix Group Iterations – p. 5/20

  8. Content ⋆ Characterizations of functions f that preserve automorphism group structure. ⋆ New, numerically stable square root iterations. ⋆ Unified stability analysis of square root iterations based on Fréchet derivatives. Matrix Group Iterations – p. 6/20

  9. Group Background Given nonsingular M and K = R or C , � x T My, real or complex bilinear forms, � x, y � M = sesquilinear forms. x ∗ My, Define automorphism group G = { A ∈ K n × n : � Ax, Ay � M = � x, y � M , ∀ x, y ∈ K n } . Adjoint A⋆ of A ∈ K n × n wrt �· , ·� M defined by � Ax, y � M = � x, A⋆y � M ∀ x, y ∈ K n . � M − 1 A T M, for bilinear forms, A⋆ = Can show: M − 1 A ∗ M, for sesquilinear forms. G = { A ∈ K n × n : A⋆ = A − 1 } . Matrix Group Iterations – p. 7/20

  10. Some Automorphism Groups A⋆ Space M Automorphism group , G Groups corresponding to a bilinear form R n A T I Real orthogonals C n A T I Complex orthogonals Σ p,q A T Σ p,q R n Σ p,q Pseudo-orthogonals RA T R R n Real perplectics R − JA T J R 2 n Real symplectics J − JA T J C 2 n Complex symplectics J Groups corresponding to a sesquilinear form C n Unitaries I A ∗ C n Pseudo-unitaries Σ p,q Σ p,q A ∗ Σ p,q C 2 n − JA ∗ J Conjugate symplectics J � � ...     1 0 In Ip 0 R = , J =  , Σp,q =     − In    0 0 − Iq 1 Matrix Group Iterations – p. 8/20

  11. Bilinear Forms Theorem 1 (a) For any f and A ∈ K n × n , f ( A⋆ ) = f ( A ) ⋆ . (b) For A ∈ G , f ( A ) ∈ G iff f ( A − 1 ) = f ( A ) − 1 . Proof . (a) We have f ( A⋆ ) = f ( M − 1 A T M ) = M − 1 f ( A T ) M = M − 1 f ( A ) T M = f ( A ) ⋆. (b) For A ∈ G , consider f ( A ) ⋆ f ( A⋆ ) = � f ( A − 1 ) Matrix Group Iterations – p. 9/20

  12. Bilinear Forms Theorem 1 (a) For any f and A ∈ K n × n , f ( A⋆ ) = f ( A ) ⋆ . (b) For A ∈ G , f ( A ) ∈ G iff f ( A − 1 ) = f ( A ) − 1 . Proof . (a) We have f ( A⋆ ) = f ( M − 1 A T M ) = M − 1 f ( A T ) M = M − 1 f ( A ) T M = f ( A ) ⋆. (b) For A ∈ G , consider f ( A ) ⋆ f ( A⋆ ) = � � f ( A ) − 1 f ( A − 1 ) = Matrix Group Iterations – p. 9/20

  13. Implications For bilinear forms, f preserves group structure of A when f ( A − 1 ) = f ( A ) − 1 . This condition holds for all A for Matrix sign function , sign( A ) = A ( A 2 ) − 1 / 2 . Any matrix power A α , subject to suitable choice of branches. In particular, the principal matrix p th root A 1 /p ( p ∈ Z + , Λ ( A ) ∩ R − = ∅ ): unique X such that 1. X p = A . 2. − π/p < arg( λ ( X )) < π/p . Matrix Group Iterations – p. 10/20

  14. Group Newton Iteration Theorem 2 Let A ∈ G ( any group ) , Λ ( A ) ∩ R − = ∅ , and Y k +1 = 1 2( Y k + Y − ⋆ ) k = 1 Y 1 = 1 2( Y k + M − 1 Y − T M ) , 2( I + A ) . k Then Y k → A 1 / 2 quadratically and Y k ≡ X k ( k ≥ 1 ) , where X k are the Newton iterates: X k +1 = 1 2 ( X k + X − 1 k A ) , X 0 = A . Matrix Group Iterations – p. 11/20

  15. Group Newton Iteration Theorem 2 Let A ∈ G ( any group ) , Λ ( A ) ∩ R − = ∅ , and Y k +1 = 1 2( Y k + Y − ⋆ ) k = 1 Y 1 = 1 2( Y k + M − 1 Y − T M ) , 2( I + A ) . k Then Y k → A 1 / 2 quadratically and Y k ≡ X k ( k ≥ 1 ) , where X k are the Newton iterates: X k +1 = 1 2 ( X k + X − 1 k A ) , X 0 = A . Cf. Cardoso, Kenney & Silva Leite (2003, App. Num. Math.): bilinear forms with M T = ± M , M T M = I . H (2003, SIREV): M = Σ p,q . Matrix Group Iterations – p. 11/20

  16. Generalized Polar Decomposition Theorem 2 For any group G , A ∈ K n × n has a unique generalized polar decomposition A = WS where S⋆ = S, ( i.e., W⋆ = W − 1 ) , W ∈ G and Λ ( S ) ∈ open right half-plane ( i.e., sign( S ) = I ) iff ( A⋆ ) ⋆ = A and Λ ( A⋆A ) ∩ R − = ∅ . Note ( A⋆ ) ⋆ = A holds for all G in the earlier table. Other gpd’s exist with different conditions on Λ ( S ) (Rodman & co-authors). Matrix Group Iterations – p. 12/20

  17. GPD Iteration & Square Root Theorem 3 Suppose the iteration X k +1 = X k h ( X 2 k ) , X 0 = A converges to sign( A ) with order m . If A has the generalized polar decomposition A = WS w.r.t. a scalar product then Y k +1 = Y k h ( Y ⋆ k Y k ) , Y 0 = A converges to W with order of convergence m . Matrix Group Iterations – p. 13/20

  18. GPD Iteration & Square Root Theorem 3 Suppose the iteration X k +1 = X k h ( X 2 k ) , X 0 = A converges to sign( A ) with order m . If A has the generalized polar decomposition A = WS w.r.t. a scalar product then Y k +1 = Y k h ( Y ⋆ k Y k ) , Y 0 = A converges to W with order of convergence m . Theorem 4 Let G be any automorphism group and A ∈ G . If Λ ( A ) ∩ R − = ∅ then I + A = WS is a generalized polar decomposition with W = A 1 / 2 and S = A − 1 / 2 + A 1 / 2 . Matrix Group Iterations – p. 13/20

  19. Application Newton Sign : X k +1 = X k · 1 2 ( I + ( X 2 k ) − 1 ) ≡ X k h ( X 2 k ) , X 0 = A . Group sqrt : Y k +1 = 1 k Y k ) − 1 ) = 1 2 Y k ( I + ( Y ⋆ 2( Y k + Y − ⋆ ) , Y 0 = I + A. k Schulz Sign : X k +1 = X k · 1 2 (3 I − X 2 k ) ≡ X k h ( X 2 k ) , X 0 = A . Group Schulz : Y k +1 = 1 2 Y k (3 I − Y ⋆ k Y k ) , Y 0 = I + A. Matrix Group Iterations – p. 14/20

  20. Class of Square Root Iterations Theorem 5 Suppose the iteration X k +1 = X k h ( X 2 k ) , X 0 = A converges to sign( A ) with order m . If Λ ( A ) ∩ R − = ∅ and Y k +1 = Y k h ( Z k Y k ) , Y 0 = A, Z k +1 = h ( Z k Y k ) Z k , Z 0 = I, then Y k → A 1 / 2 and Z k → A − 1 / 2 as k → ∞ , both with order m , and Y k = AZ k for all k . Moreover, if X ∈ G implies Xh ( X 2 ) ∈ G then A ∈ G implies Y k ∈ G and Z k ∈ G for all k . �� �� � A 1 / 2 � 0 A 0 Proof makes use of sign = . A − 1 / 2 I 0 0 Newton sign leads to DB iteration. Matrix Group Iterations – p. 15/20

  21. Padé Square Root Iterations Example: structure-preserving cubic: Y k +1 = Y k (3 I + Z k Y k )( I + 3 Z k Y k ) − 1 , Y 0 = A, Z k +1 = (3 I + Z k Y k )( I + 3 Z k Y k ) − 1 Z k , Z 0 = I. If Λ ( A ) ∩ R − = ∅ then • Y k → A 1 / 2 and Z k → A − 1 / 2 cubically, • A ∈ G ⇒ X k ∈ G , Y k ∈ G for all k . Matrix Group Iterations – p. 16/20

  22. Stability Define X k +1 = g ( X k ) to be stable in nbhd of fixed point X = g ( X ) if for X 0 := X + H 0 , with arbitrary error H 0 , the H k := X k − X satisfy H k +1 = L X ( H k ) + O ( � H k � 2 ) , where L X is a linear operator with bounded powers . Theorem 6 Consider the mapping � Y h ( ZY ) � G ( Y, Z ) = , h ( ZY ) Z where X k +1 = X k h ( X 2 k ) is any superlinear matrix sign iteration. Any P = ( B, B − 1 ) is a fixed point for G , and E − BFB dG P ( E, F ) = 1 � � . F − B − 1 EB − 1 2 dG P is idempotent , that is, dG P ◦ dG P = dG P . Matrix Group Iterations – p. 17/20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend