Consistency and efficient solution of the Sylvester equation for - - PowerPoint PPT Presentation

consistency and efficient solution of the sylvester
SMART_READER_LITE
LIVE PREVIEW

Consistency and efficient solution of the Sylvester equation for - - PowerPoint PPT Presentation

Consistency and efficient solution of the Sylvester equation for -congruence: AX + X B = C Fernando De Tern and Froiln M. Dopico ICMAT and Departamento de Matemticas, Universidad Carlos III de Madrid, Spain Special thanks to Daniel


slide-1
SLIDE 1

Consistency and efficient solution of the Sylvester equation for ⋆-congruence: AX + X⋆B = C

Fernando De Terán and Froilán M. Dopico

ICMAT and Departamento de Matemáticas, Universidad Carlos III de Madrid, Spain Special thanks to Daniel Kressner

CEDYA 2011. Palma de Mallorca, Spain, September 5-9, 2011

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 1 / 24

slide-2
SLIDE 2

Abstract Given A ∈ Cm×n, B ∈ Cn×m, and C ∈ Cm×m, we study the equations A X + X⋆ B = C , (X⋆ = XT or X∗), where X ∈ Cn×m is the unknown to be determined. More precisely:

1

Necessary and sufficient conditions for consistency (Wimmer 1994, De Terán and D. 2011).

2

Necessary and sufficient conditions for uniqueness of solutions (Byers, Kressner, Schröder, Watkins, 2006, 2009).

3

Efficient and stable numerical algorithm for computing the unique solution (De Terán and D. 2011).

4

Very briefly, general solution and dimension of solution space of A X + X⋆ B = 0 (De Terán, D., Guillery, Montealegre, Reyes, 2011) We establish parallelism/differences with well-known Sylvester equation A X − X B = C , A ∈ Cm×m, B ∈ Cn×n, C ∈ Cm×n .

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 2 / 24

slide-3
SLIDE 3

Abstract Given A ∈ Cm×n, B ∈ Cn×m, and C ∈ Cm×m, we study the equations A X + X⋆ B = C , (X⋆ = XT or X∗), where X ∈ Cn×m is the unknown to be determined. More precisely:

1

Necessary and sufficient conditions for consistency (Wimmer 1994, De Terán and D. 2011).

2

Necessary and sufficient conditions for uniqueness of solutions (Byers, Kressner, Schröder, Watkins, 2006, 2009).

3

Efficient and stable numerical algorithm for computing the unique solution (De Terán and D. 2011).

4

Very briefly, general solution and dimension of solution space of A X + X⋆ B = 0 (De Terán, D., Guillery, Montealegre, Reyes, 2011) We establish parallelism/differences with well-known Sylvester equation A X − X B = C , A ∈ Cm×m, B ∈ Cn×n, C ∈ Cm×n .

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 2 / 24

slide-4
SLIDE 4

Abstract Given A ∈ Cm×n, B ∈ Cn×m, and C ∈ Cm×m, we study the equations A X + X⋆ B = C , (X⋆ = XT or X∗), where X ∈ Cn×m is the unknown to be determined. More precisely:

1

Necessary and sufficient conditions for consistency (Wimmer 1994, De Terán and D. 2011).

2

Necessary and sufficient conditions for uniqueness of solutions (Byers, Kressner, Schröder, Watkins, 2006, 2009).

3

Efficient and stable numerical algorithm for computing the unique solution (De Terán and D. 2011).

4

Very briefly, general solution and dimension of solution space of A X + X⋆ B = 0 (De Terán, D., Guillery, Montealegre, Reyes, 2011) We establish parallelism/differences with well-known Sylvester equation A X − X B = C , A ∈ Cm×m, B ∈ Cn×n, C ∈ Cm×n .

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 2 / 24

slide-5
SLIDE 5

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 3 / 24

slide-6
SLIDE 6

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 4 / 24

slide-7
SLIDE 7

Motivation for studying A X + X⋆ B = C (I) It is well known that given a block upper triangular matrix (computed by the QR-algorithm for eigenvalues, when the matrix is real or several eigenvalues form a cluster), then I X I A C B I X I −1 =

  • A

C − (AX − XB) B

  • .

Therefore, to find a solution of the Sylvester equation AX − XB = C allows us to block-diagonalize block-triangular matrices via similarity

  • I

X I A C B I −X I

  • =
  • A

B

  • .

This is indeed done in practice in numerical algorithms (LAPACK, MATLAB) to compute bases of invariant subspaces (eigenvectors) of matrices, via the classical Bartels-Stewart algorithm (Comm ACM, 1972) or level-3 BLAS variants of it Jonsson-Kågström (ACM TMS, 2002).

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 5 / 24

slide-8
SLIDE 8

Motivation for studying A X + X⋆ B = C (I) It is well known that given a block upper triangular matrix (computed by the QR-algorithm for eigenvalues, when the matrix is real or several eigenvalues form a cluster), then I X I A C B I X I −1 =

  • A

C − (AX − XB) B

  • .

Therefore, to find a solution of the Sylvester equation AX − XB = C allows us to block-diagonalize block-triangular matrices via similarity I X I A C B I −X I

  • =

A B

  • .

This is indeed done in practice in numerical algorithms (LAPACK, MATLAB) to compute bases of invariant subspaces (eigenvectors) of matrices, via the classical Bartels-Stewart algorithm (Comm ACM, 1972) or level-3 BLAS variants of it Jonsson-Kågström (ACM TMS, 2002).

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 5 / 24

slide-9
SLIDE 9

Motivation for studying A X + X⋆ B = C (I) It is well known that given a block upper triangular matrix (computed by the QR-algorithm for eigenvalues, when the matrix is real or several eigenvalues form a cluster), then I X I A C B I X I −1 =

  • A

C − (AX − XB) B

  • .

Therefore, to find a solution of the Sylvester equation AX − XB = C allows us to block-diagonalize block-triangular matrices via similarity I X I A C B I −X I

  • =

A B

  • .

This is indeed done in practice in numerical algorithms (LAPACK, MATLAB) to compute bases of invariant subspaces (eigenvectors) of matrices, via the classical Bartels-Stewart algorithm (Comm ACM, 1972) or level-3 BLAS variants of it Jonsson-Kågström (ACM TMS, 2002).

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 5 / 24

slide-10
SLIDE 10

Motivation for studying A X + X⋆ B = C (II) Structured numerical algorithms for linear palindromic eigenproblems (Z + λZ⋆) compute an anti-triangular Schur form via unitary ⋆-congruence: Theorem (Kressner, Schröder, Watkins (Numer. Alg., 2009) and Mackey2, Mehl, Mehrmann (NLAA, 2009)) Let Z ∈ Cn×n. Then there exists a unitary matrix U ∈ Cn×n such that M = U ⋆ Z U =       ∗ · · · · · · ∗ . . . ... . . . ... ... . . . ∗ · · ·       The matrix M can be computed, for instance, through structure-preserving QR-type methods for matrices in anti-Hessenberg form (Kressner, Schröder, Watkins (Numer. Alg., 2009)), Jacobi-type methods (Mackey2, Mehl, Mehrmann (NLAA, 2009)), and compute eigenvalues of Z + λZ⋆ with exact pairing λ, 1/λ⋆.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 6 / 24

slide-11
SLIDE 11

Motivation for studying A X + X⋆ B = C (II) Structured numerical algorithms for linear palindromic eigenproblems (Z + λZ⋆) compute an anti-triangular Schur form via unitary ⋆-congruence: Theorem (Kressner, Schröder, Watkins (Numer. Alg., 2009) and Mackey2, Mehl, Mehrmann (NLAA, 2009)) Let Z ∈ Cn×n. Then there exists a unitary matrix U ∈ Cn×n such that M = U ⋆ Z U =       ∗ · · · · · · ∗ . . . ... . . . ... ... . . . ∗ · · ·       The matrix M can be computed, for instance, through structure-preserving QR-type methods for matrices in anti-Hessenberg form (Kressner, Schröder, Watkins (Numer. Alg., 2009)), Jacobi-type methods (Mackey2, Mehl, Mehrmann (NLAA, 2009)), and compute eigenvalues of Z + λZ⋆ with exact pairing λ, 1/λ⋆.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 6 / 24

slide-12
SLIDE 12

Motivation for studying A X + X⋆ B = C (II) Structured numerical algorithms for linear palindromic eigenproblems (Z + λZ⋆) compute an anti-triangular Schur form via unitary ⋆-congruence: Theorem (Kressner, Schröder, Watkins (Numer. Alg., 2009) and Mackey2, Mehl, Mehrmann (NLAA, 2009)) Let Z ∈ Cn×n. Then there exists a unitary matrix U ∈ Cn×n such that M = U ⋆ Z U =       ∗ · · · · · · ∗ . . . ... . . . ... ... . . . ∗ · · ·       The matrix M can be computed, for instance, through structure-preserving QR-type methods for matrices in anti-Hessenberg form (Kressner, Schröder, Watkins (Numer. Alg., 2009)), Jacobi-type methods (Mackey2, Mehl, Mehrmann (NLAA, 2009)), and compute eigenvalues of Z + λZ⋆ with exact pairing λ, 1/λ⋆.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 6 / 24

slide-13
SLIDE 13

Motivation for studying A X + X⋆ B = C (III) Given a block upper ANTI-triangular matrix (computed via structured algorithms for linear palindromic eigenproblems, when the matrix is real or several eigenvalues form a cluster), then

  • I

−X I ⋆ C A B I −X I

  • =
  • C − (A X + X⋆ B)

A B

  • .

Therefore, to find a solution of the Sylvester equation for ⋆-congruence allows us to block-ANTI-diagonalize block-ANTI-triangular matrices via ⋆-congruence I −X⋆ I C A B I −X I

  • =

A B

  • ,

and to compute deflating subspaces of palindromic pencils. GOAL: To understand Sylvester equations for ⋆-congruence and develop efficient and stable numerical algorithms for its solution in order to completely solve the linear palindromic eigenproblem numerically and to determine the conditioning of its deflating subspaces under structured perturbations.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 7 / 24

slide-14
SLIDE 14

Motivation for studying A X + X⋆ B = C (III) Given a block upper ANTI-triangular matrix (computed via structured algorithms for linear palindromic eigenproblems, when the matrix is real or several eigenvalues form a cluster), then

  • I

−X I ⋆ C A B I −X I

  • =
  • C − (A X + X⋆ B)

A B

  • .

Therefore, to find a solution of the Sylvester equation for ⋆-congruence allows us to block-ANTI-diagonalize block-ANTI-triangular matrices via ⋆-congruence I −X⋆ I C A B I −X I

  • =

A B

  • ,

and to compute deflating subspaces of palindromic pencils. GOAL: To understand Sylvester equations for ⋆-congruence and develop efficient and stable numerical algorithms for its solution in order to completely solve the linear palindromic eigenproblem numerically and to determine the conditioning of its deflating subspaces under structured perturbations.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 7 / 24

slide-15
SLIDE 15

Motivation for studying A X + X⋆ B = C (III) Given a block upper ANTI-triangular matrix (computed via structured algorithms for linear palindromic eigenproblems, when the matrix is real or several eigenvalues form a cluster), then

  • I

−X I ⋆ C A B I −X I

  • =
  • C − (A X + X⋆ B)

A B

  • .

Therefore, to find a solution of the Sylvester equation for ⋆-congruence allows us to block-ANTI-diagonalize block-ANTI-triangular matrices via ⋆-congruence I −X⋆ I C A B I −X I

  • =

A B

  • ,

and to compute deflating subspaces of palindromic pencils. GOAL: To understand Sylvester equations for ⋆-congruence and develop efficient and stable numerical algorithms for its solution in order to completely solve the linear palindromic eigenproblem numerically and to determine the conditioning of its deflating subspaces under structured perturbations.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 7 / 24

slide-16
SLIDE 16

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 8 / 24

slide-17
SLIDE 17

Consistency of A X + X⋆ B = C Theorem (Wimmer (LAA, 1994), De Terán and D. (ELA, 2011)) Let F be a field of characteristic different from two and let A ∈ Fm×n, B ∈ Fn×m, C ∈ Fm×m be given. There is some X ∈ Fn×m such that AX + X⋆B = C if and only if C A B

  • and

A B

  • are ⋆-congruent.

Remarks: The implication = ⇒ very easy: done in previous slide. The implication ⇐ = more challenging. Wimmer proved in 1994 the result, for F = C and ⋆ = ∗, without any reference to palindromic eigenproblems. His motivation was the study of standard Sylvester equations with Hermitian solutions.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 9 / 24

slide-18
SLIDE 18

Consistency of A X + X⋆ B = C Theorem (Wimmer (LAA, 1994), De Terán and D. (ELA, 2011)) Let F be a field of characteristic different from two and let A ∈ Fm×n, B ∈ Fn×m, C ∈ Fm×m be given. There is some X ∈ Fn×m such that AX + X⋆B = C if and only if C A B

  • and

A B

  • are ⋆-congruent.

Remarks: The implication = ⇒ very easy: done in previous slide. The implication ⇐ = more challenging. Wimmer proved in 1994 the result, for F = C and ⋆ = ∗, without any reference to palindromic eigenproblems. His motivation was the study of standard Sylvester equations with Hermitian solutions.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 9 / 24

slide-19
SLIDE 19

...to be compared with Roth’s criterion for standard Sylvester equation Theorem (Roth (Proc. AMS, 1952)) Let F be any field and let A ∈ Fm×m, B ∈ Fn×n, C ∈ Fm×n be given. There is some X ∈ Fm×n such that A X − X B = C if and only if A C B

  • and

A B

  • are similar.
  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 10 / 24

slide-20
SLIDE 20

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 11 / 24

slide-21
SLIDE 21

Uniqueness of solutions of A X + X⋆ B = C (I) Remarks: If the matrices A ∈ Fm×n and B ∈ Fn×m are rectangular (m = n), then the equation does not have a unique solution for every right-hand side C, that is, the operator Fn×m − → Fm×m X − → A X + X⋆ B is never invertible. It is of course possible that m > n and that for particular A, B and C, a solution exists and is unique, but we restrict ourselves here to the square case m = n.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 12 / 24

slide-22
SLIDE 22

Uniqueness of solutions of A X + X⋆ B = C (I) Remarks: If the matrices A ∈ Fm×n and B ∈ Fn×m are rectangular (m = n), then the equation does not have a unique solution for every right-hand side C, that is, the operator Fn×m − → Fm×m X − → A X + X⋆ B is never invertible. It is of course possible that m > n and that for particular A, B and C, a solution exists and is unique, but we restrict ourselves here to the square case m = n.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 12 / 24

slide-23
SLIDE 23

Uniqueness of solutions of A X + X⋆ B = C (I) Remarks: If the matrices A ∈ Fm×n and B ∈ Fn×m are rectangular (m = n), then the equation does not have a unique solution for every right-hand side C, that is, the operator Fn×m − → Fm×m X − → A X + X⋆ B is never invertible. It is of course possible that m > n and that for particular A, B and C, a solution exists and is unique, but we restrict ourselves here to the square case m = n.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 12 / 24

slide-24
SLIDE 24

Uniqueness of solutions of A X + X⋆ B = C (I) Remarks: If the matrices A ∈ Fm×n and B ∈ Fn×m are rectangular (m = n), then the equation does not have a unique solution for every right-hand side C, that is, the operator Fn×m − → Fm×m X − → A X + X⋆ B is never invertible. It is of course possible that m > n and that for particular A, B and C, a solution exists and is unique, but we restrict ourselves here to the square case m = n.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 12 / 24

slide-25
SLIDE 25

Uniqueness of solutions of A X + X⋆ B = C (II) Definition: a set {λ1, . . . , λn} ⊂ C is ⋆-reciprocal free if λi = 1/λ⋆

j for any

1 ≤ i, j ≤ n. We admit 0 and/or ∞ as elements of {λ1, . . . , λn}. Theorem (Byers, Kressner (SIMAX, 2006), Kressner, Schröder, Watkins, (Num. Alg., 2009)) Let A, B ∈ Cn×n be given. Then: A X + XT B = C has a unique solution X for every right-hand side C ∈ Cn×n if and only if the following conditions hold: ✶✮ The pencil A − λBT is regular, and ✷✮ the set of eigenvalues of A − λBT \{1} is T-reciprocal free and if 1 is an eigenvalue of A − λBT , then it has algebraic multiplicity 1. A X + X∗ B = C has a unique solution X for every right-hand side C ∈ Cn×n if and only if the following conditions hold: ✶✮ The pencil A − λB∗ is regular, and ✷✮ the set of eigenvalues of A − λB∗ is ∗-reciprocal free.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 13 / 24

slide-26
SLIDE 26

Uniqueness of solutions of A X + X⋆ B = C (II) Definition: a set {λ1, . . . , λn} ⊂ C is ⋆-reciprocal free if λi = 1/λ⋆

j for any

1 ≤ i, j ≤ n. We admit 0 and/or ∞ as elements of {λ1, . . . , λn}. Theorem (Byers, Kressner (SIMAX, 2006), Kressner, Schröder, Watkins, (Num. Alg., 2009)) Let A, B ∈ Cn×n be given. Then: A X + XT B = C has a unique solution X for every right-hand side C ∈ Cn×n if and only if the following conditions hold: ✶✮ The pencil A − λBT is regular, and ✷✮ the set of eigenvalues of A − λBT \{1} is T-reciprocal free and if 1 is an eigenvalue of A − λBT , then it has algebraic multiplicity 1. A X + X∗ B = C has a unique solution X for every right-hand side C ∈ Cn×n if and only if the following conditions hold: ✶✮ The pencil A − λB∗ is regular, and ✷✮ the set of eigenvalues of A − λB∗ is ∗-reciprocal free.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 13 / 24

slide-27
SLIDE 27

Uniqueness of solutions of A X + X⋆ B = C (II) Definition: a set {λ1, . . . , λn} ⊂ C is ⋆-reciprocal free if λi = 1/λ⋆

j for any

1 ≤ i, j ≤ n. We admit 0 and/or ∞ as elements of {λ1, . . . , λn}. Theorem (Byers, Kressner (SIMAX, 2006), Kressner, Schröder, Watkins, (Num. Alg., 2009)) Let A, B ∈ Cn×n be given. Then: A X + XT B = C has a unique solution X for every right-hand side C ∈ Cn×n if and only if the following conditions hold: ✶✮ The pencil A − λBT is regular, and ✷✮ the set of eigenvalues of A − λBT \{1} is T-reciprocal free and if 1 is an eigenvalue of A − λBT , then it has algebraic multiplicity 1. A X + X∗ B = C has a unique solution X for every right-hand side C ∈ Cn×n if and only if the following conditions hold: ✶✮ The pencil A − λB∗ is regular, and ✷✮ the set of eigenvalues of A − λB∗ is ∗-reciprocal free.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 13 / 24

slide-28
SLIDE 28

...to be compared with uniqueness conditions for standard Sylvester eq Theorem Let A ∈ Cm×m and B ∈ Cn×n be given. Then: A X − X B = C has a unique solution X for every right-hand side C ∈ Cm×n if and only if A and B have no eigenvalues in common. Remark: Comparison of both results brings to our attention a key difference that appears always between solution methods for AX + X⋆B = C and AX − XB = C: In AX + X⋆B = C, one starts dealing with the eigenproblem of A − λB⋆, that is, one deals from the very beginning simultaneously with A and B. By contrast in AX − XB = C, one starts dealing independently with the eigenproblems of A and B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 14 / 24

slide-29
SLIDE 29

...to be compared with uniqueness conditions for standard Sylvester eq Theorem Let A ∈ Cm×m and B ∈ Cn×n be given. Then: A X − X B = C has a unique solution X for every right-hand side C ∈ Cm×n if and only if A and B have no eigenvalues in common. Remark: Comparison of both results brings to our attention a key difference that appears always between solution methods for AX + X⋆B = C and AX − XB = C: In AX + X⋆B = C, one starts dealing with the eigenproblem of A − λB⋆, that is, one deals from the very beginning simultaneously with A and B. By contrast in AX − XB = C, one starts dealing independently with the eigenproblems of A and B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 14 / 24

slide-30
SLIDE 30

...to be compared with uniqueness conditions for standard Sylvester eq Theorem Let A ∈ Cm×m and B ∈ Cn×n be given. Then: A X − X B = C has a unique solution X for every right-hand side C ∈ Cm×n if and only if A and B have no eigenvalues in common. Remark: Comparison of both results brings to our attention a key difference that appears always between solution methods for AX + X⋆B = C and AX − XB = C: In AX + X⋆B = C, one starts dealing with the eigenproblem of A − λB⋆, that is, one deals from the very beginning simultaneously with A and B. By contrast in AX − XB = C, one starts dealing independently with the eigenproblems of A and B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 14 / 24

slide-31
SLIDE 31

...to be compared with uniqueness conditions for standard Sylvester eq Theorem Let A ∈ Cm×m and B ∈ Cn×n be given. Then: A X − X B = C has a unique solution X for every right-hand side C ∈ Cm×n if and only if A and B have no eigenvalues in common. Remark: Comparison of both results brings to our attention a key difference that appears always between solution methods for AX + X⋆B = C and AX − XB = C: In AX + X⋆B = C, one starts dealing with the eigenproblem of A − λB⋆, that is, one deals from the very beginning simultaneously with A and B. By contrast in AX − XB = C, one starts dealing independently with the eigenproblems of A and B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 14 / 24

slide-32
SLIDE 32

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 15 / 24

slide-33
SLIDE 33

The fundamental transformation In this section in A X + X⋆ B = C all matrices are in Cn×n and the solution is unique for every C. A X + X⋆ B = C is equivalent to a linear system for the n2 entries of X if ⋆ = T and to a linear system for the 2 n2 entries of (Re X , Im X) if ⋆ = ∗. From now on, we say simply “linear system” for X. Then, it is possible to use Gaussian elimination on the equivalent system (constructed via vec(X), vec(C), ⊗), but it costs O(n6) flops, which is prohibitive except for small n. IDEA: transform A X + X⋆ B = C into an equation of the same type but with much simpler coefficients instead of A and B and that can be easily solved to get a total cost of O(n3) flops. To this purpose, use QZ algorithm to compute in O(n3) flops the generalized Schur decomposition of A − λB⋆ = U(R − λS)V , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices If A, B real matrices: use real arithmetic to get quasi-triangular R. We do not consider this for brevity.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 16 / 24

slide-34
SLIDE 34

The fundamental transformation In this section in A X + X⋆ B = C all matrices are in Cn×n and the solution is unique for every C. A X + X⋆ B = C is equivalent to a linear system for the n2 entries of X if ⋆ = T and to a linear system for the 2 n2 entries of (Re X , Im X) if ⋆ = ∗. From now on, we say simply “linear system” for X. Then, it is possible to use Gaussian elimination on the equivalent system (constructed via vec(X), vec(C), ⊗), but it costs O(n6) flops, which is prohibitive except for small n. IDEA: transform A X + X⋆ B = C into an equation of the same type but with much simpler coefficients instead of A and B and that can be easily solved to get a total cost of O(n3) flops. To this purpose, use QZ algorithm to compute in O(n3) flops the generalized Schur decomposition of A − λB⋆ = U(R − λS)V , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices If A, B real matrices: use real arithmetic to get quasi-triangular R. We do not consider this for brevity.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 16 / 24

slide-35
SLIDE 35

The fundamental transformation In this section in A X + X⋆ B = C all matrices are in Cn×n and the solution is unique for every C. A X + X⋆ B = C is equivalent to a linear system for the n2 entries of X if ⋆ = T and to a linear system for the 2 n2 entries of (Re X , Im X) if ⋆ = ∗. From now on, we say simply “linear system” for X. Then, it is possible to use Gaussian elimination on the equivalent system (constructed via vec(X), vec(C), ⊗), but it costs O(n6) flops, which is prohibitive except for small n. IDEA: transform A X + X⋆ B = C into an equation of the same type but with much simpler coefficients instead of A and B and that can be easily solved to get a total cost of O(n3) flops. To this purpose, use QZ algorithm to compute in O(n3) flops the generalized Schur decomposition of A − λB⋆ = U(R − λS)V , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices If A, B real matrices: use real arithmetic to get quasi-triangular R. We do not consider this for brevity.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 16 / 24

slide-36
SLIDE 36

The fundamental transformation In this section in A X + X⋆ B = C all matrices are in Cn×n and the solution is unique for every C. A X + X⋆ B = C is equivalent to a linear system for the n2 entries of X if ⋆ = T and to a linear system for the 2 n2 entries of (Re X , Im X) if ⋆ = ∗. From now on, we say simply “linear system” for X. Then, it is possible to use Gaussian elimination on the equivalent system (constructed via vec(X), vec(C), ⊗), but it costs O(n6) flops, which is prohibitive except for small n. IDEA: transform A X + X⋆ B = C into an equation of the same type but with much simpler coefficients instead of A and B and that can be easily solved to get a total cost of O(n3) flops. To this purpose, use QZ algorithm to compute in O(n3) flops the generalized Schur decomposition of A − λB⋆ = U(R − λS)V , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices If A, B real matrices: use real arithmetic to get quasi-triangular R. We do not consider this for brevity.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 16 / 24

slide-37
SLIDE 37

The fundamental transformation In this section in A X + X⋆ B = C all matrices are in Cn×n and the solution is unique for every C. A X + X⋆ B = C is equivalent to a linear system for the n2 entries of X if ⋆ = T and to a linear system for the 2 n2 entries of (Re X , Im X) if ⋆ = ∗. From now on, we say simply “linear system” for X. Then, it is possible to use Gaussian elimination on the equivalent system (constructed via vec(X), vec(C), ⊗), but it costs O(n6) flops, which is prohibitive except for small n. IDEA: transform A X + X⋆ B = C into an equation of the same type but with much simpler coefficients instead of A and B and that can be easily solved to get a total cost of O(n3) flops. To this purpose, use QZ algorithm to compute in O(n3) flops the generalized Schur decomposition of A − λB⋆ = U(R − λS)V , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices If A, B real matrices: use real arithmetic to get quasi-triangular R. We do not consider this for brevity.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 16 / 24

slide-38
SLIDE 38

Algorithm to solve A X + X⋆ B = C in O(n3) flops INPUT: A, B, C ∈ Cn×n OUTPUT: X ∈ Cn×n Step 1. Compute via QZ algorithm R, S, U and V such that A = URV , B⋆ = USV , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices Step 2. Compute E = U ∗ C (U ⋆)∗ Step 3. Solve for W ∈ Cn×n the transformed equation R W + W ⋆ S⋆ = E Step 4. Compute X = V ∗ W U ⋆

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 17 / 24

slide-39
SLIDE 39

Algorithm to solve A X + X⋆ B = C in O(n3) flops INPUT: A, B, C ∈ Cn×n OUTPUT: X ∈ Cn×n Step 1. Compute via QZ algorithm R, S, U and V such that A = URV , B⋆ = USV , ✇❤❡r❡ R, S are upper triangular U, V are unitary matrices Step 2. Compute E = U ∗ C (U ⋆)∗ Step 3. How to solve for W ∈ Cn×n the transformed equation R W + W ⋆ S⋆ = E ? Step 4. Compute X = V ∗ W U ⋆

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 17 / 24

slide-40
SLIDE 40

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (4,4)-entry, then we get r44 w44 + w⋆

44

s⋆

44

= e44 , a scalar equation that allows us to determine w44 .

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-41
SLIDE 41

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (4,4)-entry, then we get r44 w44 + w⋆

44

s⋆

44

= e44 , a scalar equation that allows us to determine w44 .

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-42
SLIDE 42

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (3,4) and (4,3) entries, then we get s33 w34 + w⋆

43

r⋆

44

= e⋆

43 − s34 w44

r33 w34 + w⋆

43

s⋆

44

= e34 − r34 w44 , a 2 × 2 system of scalar equations that allows us to determine w34 and w43 simultaneously.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-43
SLIDE 43

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (3,4) and (4,3) entries, then we get s33 w34 + w⋆

43

r⋆

44

= e⋆

43 − s34 w44

r33 w34 + w⋆

43

s⋆

44

= e34 − r34 w44 , a 2 × 2 system of scalar equations that allows us to determine w34 and w43 simultaneously.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-44
SLIDE 44

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (2,4) and (4,2) entries, then we get s22 w24 + w⋆

42

r⋆

44

= e⋆

42 − s23 w34 − s24 w44

r22 w24 + w⋆

42

s⋆

44

= e24 − r23 w34 − r24 w44 , a 2 × 2 system of scalar equations that allows us to determine w24 and w42 simultaneously.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-45
SLIDE 45

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (2,4) and (4,2) entries, then we get s22 w24 + w⋆

42

r⋆

44

= e⋆

42 − s23 w34 − s24 w44

r22 w24 + w⋆

42

s⋆

44

= e24 − r23 w34 − r24 w44 , a 2 × 2 system of scalar equations that allows us to determine w24 and w42 simultaneously.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-46
SLIDE 46

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (1,4) and (4,1) entries, then we get s11 w14 + w⋆

41

r⋆

44

= e⋆

41 − s12 w24 − s13 w34 − s14 w44

r11 w14 + w⋆

41

s⋆

44

= e14 − r12 w24 − r13 w34 − r14 w44 , a 2 × 2 system of scalar equations that allows us to determine w14 and w41 simultaneously.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-47
SLIDE 47

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (1,4) and (4,1) entries, then we get s11 w14 + w⋆

41

r⋆

44

= e⋆

41 − s12 w24 − s13 w34 − s14 w44

r11 w14 + w⋆

41

s⋆

44

= e14 − r12 w24 − r13 w34 − r14 w44 , a 2 × 2 system of scalar equations that allows us to determine w14 and w41 simultaneously.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-48
SLIDE 48

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (1:3,1:3) submatrices , then we get

  r11 r12 r13 r22 r23 r33     w11 w12 w13 w21 w22 w23 w31 w32 w33   +   w⋆

11

w⋆

21

w⋆

31

w⋆

12

w⋆

22

w⋆

32

w⋆

13

w⋆

23

w⋆

33

    s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

  =   e11 e12 e13 e21 e22 e23 e31 e32 e33   −   r14 r24 r34   w41 w42 w43

  w⋆

41

w⋆

42

w⋆

43

  s⋆

14

s⋆

24

s⋆

34

  • which is a 3 × 3 matrix equation of the same type as the original one.
  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-49
SLIDE 49

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (I) We illustrate with 4 × 4 example for simplicity:     r11 r12 r13 r14 r22 r23 r24 r33 r34 r44         w11 w12 w13 w14 w21 w22 w23 w24 w31 w32 w33 w34 w41 w42 w43 w44     +     w⋆

11

w⋆

21

w⋆

31

w⋆

41

w⋆

12

w⋆

22

w⋆

32

w⋆

42

w⋆

13

w⋆

23

w⋆

33

w⋆

43

w⋆

14

w⋆

24

w⋆

34

w⋆

44

        s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

s⋆

14

s⋆

24

s⋆

34

s⋆

44

    =     e11 e12 e13 e14 e21 e22 e23 e24 e31 e32 e33 e34 e41 e42 e43 e44     If we equate the (1:3,1:3) submatrices , then we get

  r11 r12 r13 r22 r23 r33     w11 w12 w13 w21 w22 w23 w31 w32 w33   +   w⋆

11

w⋆

21

w⋆

31

w⋆

12

w⋆

22

w⋆

32

w⋆

13

w⋆

23

w⋆

33

    s⋆

11

s⋆

12

s⋆

22

s⋆

13

s⋆

23

s⋆

33

  =   e11 e12 e13 e21 e22 e23 e31 e32 e33   −   r14 r24 r34   w41 w42 w43

  w⋆

41

w⋆

42

w⋆

43

  s⋆

14

s⋆

24

s⋆

34

  • which is a 3 × 3 matrix equation of the same type as the original one.
  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 18 / 24

slide-50
SLIDE 50

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (II) INPUT: R, S, E ∈ Cn×n, with R and S upper triangular OUTPUT: W ∈ Cn×n for j = p : −1 : 1 solve rjjwjj + w⋆

jjs⋆ jj = ejj

to get wjj for i = j − 1 : −1 : 1 solve

  • siiwij + w⋆

jir⋆ jj = e⋆ ji − j k=i+1 sikwkj

riiwij + w⋆

jis⋆ jj = eij − j k=i+1 rikwkj

  • to get wij, wji

end E(1 : j − 1, 1 : j − 1) = E(1 : j − 1, 1 : j − 1) − R(1 : j − 1, j)W(j, 1 : j − 1) −(S(1 : j − 1, j)W(j, 1 : j − 1))⋆ end Cost 2n3 + O(n2) flops for real input matrices and a total cost 76n3 + O(n2) flops for the whole algorithm for AX + X⋆B = C.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 19 / 24

slide-51
SLIDE 51

Algorithm to solve the transformed equation R W + W ⋆ S⋆ = E (II) INPUT: R, S, E ∈ Cn×n, with R and S upper triangular OUTPUT: W ∈ Cn×n for j = p : −1 : 1 solve rjjwjj + w⋆

jjs⋆ jj = ejj

to get wjj for i = j − 1 : −1 : 1 solve

  • siiwij + w⋆

jir⋆ jj = e⋆ ji − j k=i+1 sikwkj

riiwij + w⋆

jis⋆ jj = eij − j k=i+1 rikwkj

  • to get wij, wji

end E(1 : j − 1, 1 : j − 1) = E(1 : j − 1, 1 : j − 1) − R(1 : j − 1, j)W(j, 1 : j − 1) −(S(1 : j − 1, j)W(j, 1 : j − 1))⋆ end Cost 2n3 + O(n2) flops for real input matrices and a total cost 76n3 + O(n2) flops for the whole algorithm for AX + X⋆B = C.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 19 / 24

slide-52
SLIDE 52

Remarks on algorithm to solve A X + X⋆ B = C Roundoff errors: X, computed solution of AX + X⋆B = C, satisfies A X + X⋆ B − CF ≤ α u n5/2 (AF + BF ) XF , with u unit roundoff and α small integer constant. The algorithm for solving RW + W ⋆ S⋆ = E is dominated by level-2 BLAS operations. In modern computers, a blocked-version dominated by level-3 BLAS operations would be more efficient (future work), but... for AX + X⋆B = C, cost is dominated by the QZ-alg on A − λB⋆. The algorithm should be compared with Bartels-Stewart algorithm for Sylvester equation AX − XB = C:

1

Compute independently triang. Schur forms TA and TB of A and B.

2

Solve TA Y − Y TB = D for Y .

3

Recover X from Y . Same flavor, but also differences: A − λB⋆ and TA Y − Y TB = D allows us to compute each entry of Y in terms on previous ones (from left and from bottom) without using 2 × 2 linear-systems.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 20 / 24

slide-53
SLIDE 53

Remarks on algorithm to solve A X + X⋆ B = C Roundoff errors: X, computed solution of AX + X⋆B = C, satisfies A X + X⋆ B − CF ≤ α u n5/2 (AF + BF ) XF , with u unit roundoff and α small integer constant. The algorithm for solving RW + W ⋆ S⋆ = E is dominated by level-2 BLAS operations. In modern computers, a blocked-version dominated by level-3 BLAS operations would be more efficient (future work), but... for AX + X⋆B = C, cost is dominated by the QZ-alg on A − λB⋆. The algorithm should be compared with Bartels-Stewart algorithm for Sylvester equation AX − XB = C:

1

Compute independently triang. Schur forms TA and TB of A and B.

2

Solve TA Y − Y TB = D for Y .

3

Recover X from Y . Same flavor, but also differences: A − λB⋆ and TA Y − Y TB = D allows us to compute each entry of Y in terms on previous ones (from left and from bottom) without using 2 × 2 linear-systems.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 20 / 24

slide-54
SLIDE 54

Remarks on algorithm to solve A X + X⋆ B = C Roundoff errors: X, computed solution of AX + X⋆B = C, satisfies A X + X⋆ B − CF ≤ α u n5/2 (AF + BF ) XF , with u unit roundoff and α small integer constant. The algorithm for solving RW + W ⋆ S⋆ = E is dominated by level-2 BLAS operations. In modern computers, a blocked-version dominated by level-3 BLAS operations would be more efficient (future work), but... for AX + X⋆B = C, cost is dominated by the QZ-alg on A − λB⋆. The algorithm should be compared with Bartels-Stewart algorithm for Sylvester equation AX − XB = C:

1

Compute independently triang. Schur forms TA and TB of A and B.

2

Solve TA Y − Y TB = D for Y .

3

Recover X from Y . Same flavor, but also differences: A − λB⋆ and TA Y − Y TB = D allows us to compute each entry of Y in terms on previous ones (from left and from bottom) without using 2 × 2 linear-systems.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 20 / 24

slide-55
SLIDE 55

Remarks on algorithm to solve A X + X⋆ B = C Roundoff errors: X, computed solution of AX + X⋆B = C, satisfies A X + X⋆ B − CF ≤ α u n5/2 (AF + BF ) XF , with u unit roundoff and α small integer constant. The algorithm for solving RW + W ⋆ S⋆ = E is dominated by level-2 BLAS operations. In modern computers, a blocked-version dominated by level-3 BLAS operations would be more efficient (future work), but... for AX + X⋆B = C, cost is dominated by the QZ-alg on A − λB⋆. The algorithm should be compared with Bartels-Stewart algorithm for Sylvester equation AX − XB = C:

1

Compute independently triang. Schur forms TA and TB of A and B.

2

Solve TA Y − Y TB = D for Y .

3

Recover X from Y . Same flavor, but also differences: A − λB⋆ and TA Y − Y TB = D allows us to compute each entry of Y in terms on previous ones (from left and from bottom) without using 2 × 2 linear-systems.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 20 / 24

slide-56
SLIDE 56

Remarks on algorithm to solve A X + X⋆ B = C Roundoff errors: X, computed solution of AX + X⋆B = C, satisfies A X + X⋆ B − CF ≤ α u n5/2 (AF + BF ) XF , with u unit roundoff and α small integer constant. The algorithm for solving RW + W ⋆ S⋆ = E is dominated by level-2 BLAS operations. In modern computers, a blocked-version dominated by level-3 BLAS operations would be more efficient (future work), but... for AX + X⋆B = C, cost is dominated by the QZ-alg on A − λB⋆. The algorithm should be compared with Bartels-Stewart algorithm for Sylvester equation AX − XB = C:

1

Compute independently triang. Schur forms TA and TB of A and B.

2

Solve TA Y − Y TB = D for Y .

3

Recover X from Y . Same flavor, but also differences: A − λB⋆ and TA Y − Y TB = D allows us to compute each entry of Y in terms on previous ones (from left and from bottom) without using 2 × 2 linear-systems.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 20 / 24

slide-57
SLIDE 57

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 21 / 24

slide-58
SLIDE 58

Theoretical method to solve A X + X⋆ B = 0 In case of consistency, but “nonuniqueness”, general solution of AX + X⋆B = C is X = Xp + Xh, where

1

Xp is a particular solution and

2

Xh is the general solution of AX + X⋆B = 0. The latter found a few weeks ago by De Terán, D., Guillery, Montealegre, Reyes (REU program, U. of California at S. Barbara, M.I. Bueno). KEY IDEA: If E − λF ⋆ is the Kronecker Canonical form (KCF) of A − λB⋆, then AX + X⋆B = 0 can be transformed into EY + Y ⋆F = 0. If E = E1 ⊕ · · · ⊕ Ed, F ⋆ = F ⋆

1 ⊕ · · · ⊕ F ⋆ d , and Y = [Yij] is partitioned

into blocks accordingly, then this equation decouples in EiYii + Y ⋆

iiFi = 0

and EiYij + Y ⋆

jiFj = 0

EjYji + Y ⋆

ijFi = 0

, (1 ≤ i < j ≤ d), which produce 14 different types of matrix (systems) equations, whose explicit solutions have been found. Much more complicated general solution than standard Sylvester eq: AX − XB = 0.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 22 / 24

slide-59
SLIDE 59

Theoretical method to solve A X + X⋆ B = 0 In case of consistency, but “nonuniqueness”, general solution of AX + X⋆B = C is X = Xp + Xh, where

1

Xp is a particular solution and

2

Xh is the general solution of AX + X⋆B = 0. The latter found a few weeks ago by De Terán, D., Guillery, Montealegre, Reyes (REU program, U. of California at S. Barbara, M.I. Bueno). KEY IDEA: If E − λF ⋆ is the Kronecker Canonical form (KCF) of A − λB⋆, then AX + X⋆B = 0 can be transformed into EY + Y ⋆F = 0. If E = E1 ⊕ · · · ⊕ Ed, F ⋆ = F ⋆

1 ⊕ · · · ⊕ F ⋆ d , and Y = [Yij] is partitioned

into blocks accordingly, then this equation decouples in EiYii + Y ⋆

iiFi = 0

and EiYij + Y ⋆

jiFj = 0

EjYji + Y ⋆

ijFi = 0

, (1 ≤ i < j ≤ d), which produce 14 different types of matrix (systems) equations, whose explicit solutions have been found. Much more complicated general solution than standard Sylvester eq: AX − XB = 0.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 22 / 24

slide-60
SLIDE 60

Theoretical method to solve A X + X⋆ B = 0 In case of consistency, but “nonuniqueness”, general solution of AX + X⋆B = C is X = Xp + Xh, where

1

Xp is a particular solution and

2

Xh is the general solution of AX + X⋆B = 0. The latter found a few weeks ago by De Terán, D., Guillery, Montealegre, Reyes (REU program, U. of California at S. Barbara, M.I. Bueno). KEY IDEA: If E − λF ⋆ is the Kronecker Canonical form (KCF) of A − λB⋆, then AX + X⋆B = 0 can be transformed into EY + Y ⋆F = 0. If E = E1 ⊕ · · · ⊕ Ed, F ⋆ = F ⋆

1 ⊕ · · · ⊕ F ⋆ d , and Y = [Yij] is partitioned

into blocks accordingly, then this equation decouples in EiYii + Y ⋆

iiFi = 0

and EiYij + Y ⋆

jiFj = 0

EjYji + Y ⋆

ijFi = 0

, (1 ≤ i < j ≤ d), which produce 14 different types of matrix (systems) equations, whose explicit solutions have been found. Much more complicated general solution than standard Sylvester eq: AX − XB = 0.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 22 / 24

slide-61
SLIDE 61

Theoretical method to solve A X + X⋆ B = 0 In case of consistency, but “nonuniqueness”, general solution of AX + X⋆B = C is X = Xp + Xh, where

1

Xp is a particular solution and

2

Xh is the general solution of AX + X⋆B = 0. The latter found a few weeks ago by De Terán, D., Guillery, Montealegre, Reyes (REU program, U. of California at S. Barbara, M.I. Bueno). KEY IDEA: If E − λF ⋆ is the Kronecker Canonical form (KCF) of A − λB⋆, then AX + X⋆B = 0 can be transformed into EY + Y ⋆F = 0. If E = E1 ⊕ · · · ⊕ Ed, F ⋆ = F ⋆

1 ⊕ · · · ⊕ F ⋆ d , and Y = [Yij] is partitioned

into blocks accordingly, then this equation decouples in EiYii + Y ⋆

iiFi = 0

and EiYij + Y ⋆

jiFj = 0

EjYji + Y ⋆

ijFi = 0

, (1 ≤ i < j ≤ d), which produce 14 different types of matrix (systems) equations, whose explicit solutions have been found. Much more complicated general solution than standard Sylvester eq: AX − XB = 0.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 22 / 24

slide-62
SLIDE 62

Outline

1

Motivation

2

Consistency of the Sylvester equation for ⋆-congruence

3

Uniqueness of solutions

4

Efficient and stable algorithm to compute unique solutions

5

General “nonunique” solution of AX + X⋆B = C

6

Conclusions

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 23 / 24

slide-63
SLIDE 63

Conclusions Several questions related to the Sylvester equation for ⋆-congruence A X + X⋆ B = C are well-understood:

1

Necessary and sufficient conds for consistency/uniqueness of sols.

2

Efficient and stable nume. algorithm for computing unique solution.

3

General solution of A X + X⋆ B = 0. Connections with strd. Syl. eq AX − XB = C but also relevant diffs:

1

Use of QZ-algor for pencil A−λB⋆ instead of QR-algor for matrices.

2

Use of KCF for pencil A − λB⋆ instead of JCF for matrices in general homogeneous solution: much more complicated solution.

3

The Canonical Form for Congruence only useful in the particular case AX + X⋆A = 0. Several problems still remain open. Among them:

1

Combine the algor for A X + X⋆ B = C with algors for computing the anti-triangular Schur form for completely solving the linear palindromic eigenproblem via congruence.

2

Numerical method for computing basis of the solution space of A X + X⋆ B = 0 via staircase-form of A − λB⋆.

3

Eigenvalues of the operator X − → AX + X⋆B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 24 / 24

slide-64
SLIDE 64

Conclusions Several questions related to the Sylvester equation for ⋆-congruence A X + X⋆ B = C are well-understood:

1

Necessary and sufficient conds for consistency/uniqueness of sols.

2

Efficient and stable nume. algorithm for computing unique solution.

3

General solution of A X + X⋆ B = 0. Connections with strd. Syl. eq AX − XB = C but also relevant diffs:

1

Use of QZ-algor for pencil A−λB⋆ instead of QR-algor for matrices.

2

Use of KCF for pencil A − λB⋆ instead of JCF for matrices in general homogeneous solution: much more complicated solution.

3

The Canonical Form for Congruence only useful in the particular case AX + X⋆A = 0. Several problems still remain open. Among them:

1

Combine the algor for A X + X⋆ B = C with algors for computing the anti-triangular Schur form for completely solving the linear palindromic eigenproblem via congruence.

2

Numerical method for computing basis of the solution space of A X + X⋆ B = 0 via staircase-form of A − λB⋆.

3

Eigenvalues of the operator X − → AX + X⋆B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 24 / 24

slide-65
SLIDE 65

Conclusions Several questions related to the Sylvester equation for ⋆-congruence A X + X⋆ B = C are well-understood:

1

Necessary and sufficient conds for consistency/uniqueness of sols.

2

Efficient and stable nume. algorithm for computing unique solution.

3

General solution of A X + X⋆ B = 0. Connections with strd. Syl. eq AX − XB = C but also relevant diffs:

1

Use of QZ-algor for pencil A−λB⋆ instead of QR-algor for matrices.

2

Use of KCF for pencil A − λB⋆ instead of JCF for matrices in general homogeneous solution: much more complicated solution.

3

The Canonical Form for Congruence only useful in the particular case AX + X⋆A = 0. Several problems still remain open. Among them:

1

Combine the algor for A X + X⋆ B = C with algors for computing the anti-triangular Schur form for completely solving the linear palindromic eigenproblem via congruence.

2

Numerical method for computing basis of the solution space of A X + X⋆ B = 0 via staircase-form of A − λB⋆.

3

Eigenvalues of the operator X − → AX + X⋆B.

  • F. M. Dopico (U. Carlos III, Madrid)

Sylvester equation for congruence CEDYA 2011 24 / 24