Understanding the QR algorithm, Part X David S. Watkins - - PowerPoint PPT Presentation

understanding the qr algorithm part x
SMART_READER_LITE
LIVE PREVIEW

Understanding the QR algorithm, Part X David S. Watkins - - PowerPoint PPT Presentation

Understanding the QR algorithm, Part X David S. Watkins watkins@math.wsu.edu Department of Mathematics Washington State University Glasgow 2009 p. 1 1. Understanding the QR algorithm, SIAM Rev., 1982 Glasgow 2009 p. 2 1.


slide-1
SLIDE 1

Understanding the QR algorithm, Part X

David S. Watkins

watkins@math.wsu.edu

Department of Mathematics Washington State University

Glasgow 2009 – p. 1

slide-2
SLIDE 2
  • 1. Understanding the QR algorithm, SIAM Rev., 1982

Glasgow 2009 – p. 2

slide-3
SLIDE 3
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991

Glasgow 2009 – p. 2

slide-4
SLIDE 4
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991
  • 3. Some perspectives on the eigenvalue problem, 1993

Glasgow 2009 – p. 2

slide-5
SLIDE 5
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991
  • 3. Some perspectives on the eigenvalue problem, 1993
  • 4. QR-like algorithms—an overview of convergence

theory and practice, AMS proceedings, 1996

Glasgow 2009 – p. 2

slide-6
SLIDE 6
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991
  • 3. Some perspectives on the eigenvalue problem, 1993
  • 4. QR-like algorithms—an overview of convergence

theory and practice, AMS proceedings, 1996

  • 5. QR-like algorithms for eigenvalue problems, JCAM,

2000

Glasgow 2009 – p. 2

slide-7
SLIDE 7
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991
  • 3. Some perspectives on the eigenvalue problem, 1993
  • 4. QR-like algorithms—an overview of convergence

theory and practice, AMS proceedings, 1996

  • 5. QR-like algorithms for eigenvalue problems, JCAM,

2000

  • 6. Fundamentals of Matrix Computations, Wiley, 2002

Glasgow 2009 – p. 2

slide-8
SLIDE 8
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991
  • 3. Some perspectives on the eigenvalue problem, 1993
  • 4. QR-like algorithms—an overview of convergence

theory and practice, AMS proceedings, 1996

  • 5. QR-like algorithms for eigenvalue problems, JCAM,

2000

  • 6. Fundamentals of Matrix Computations, Wiley, 2002
  • 7. The Matrix Eigenvalue Problem: GR and Krylov

Subspace Methods, SIAM, 2007.

Glasgow 2009 – p. 2

slide-9
SLIDE 9
  • 1. Understanding the QR algorithm, SIAM Rev., 1982
  • 2. Fundamentals of Matrix Computations, Wiley, 1991
  • 3. Some perspectives on the eigenvalue problem, 1993
  • 4. QR-like algorithms—an overview of convergence

theory and practice, AMS proceedings, 1996

  • 5. QR-like algorithms for eigenvalue problems, JCAM,

2000

  • 6. Fundamentals of Matrix Computations, Wiley, 2002
  • 7. The Matrix Eigenvalue Problem: GR and Krylov

Subspace Methods, SIAM, 2007.

  • 8. The QR algorithm revisited, SIAM Rev., 2008.

Glasgow 2009 – p. 2

slide-10
SLIDE 10

Some names associated with the QR algorithm

Glasgow 2009 – p. 3

slide-11
SLIDE 11

Some names associated with the QR algorithm (short list)

Glasgow 2009 – p. 3

slide-12
SLIDE 12

Some names associated with the QR algorithm (short list)

Rutishauser

Glasgow 2009 – p. 3

slide-13
SLIDE 13

Some names associated with the QR algorithm (short list)

Rutishauser Kublanovskaya

Glasgow 2009 – p. 3

slide-14
SLIDE 14

Some names associated with the QR algorithm (short list)

Rutishauser Kublanovskaya Francis

Glasgow 2009 – p. 3

slide-15
SLIDE 15

Some names associated with the QR algorithm (short list)

Rutishauser Kublanovskaya Francis

Implicitly Shifted QR algorithm

Glasgow 2009 – p. 3

slide-16
SLIDE 16

Some names associated with the QR algorithm (short list)

Rutishauser Kublanovskaya Francis

Implicitly Shifted QR algorithm

How should we understand it?

Glasgow 2009 – p. 3

slide-17
SLIDE 17

Some names associated with the QR algorithm (short list)

Rutishauser Kublanovskaya Francis

Implicitly Shifted QR algorithm

How should we understand it? ...view it?

Glasgow 2009 – p. 3

slide-18
SLIDE 18

Some names associated with the QR algorithm (short list)

Rutishauser Kublanovskaya Francis

Implicitly Shifted QR algorithm

How should we understand it? ...view it? ...teach it to our students?

Glasgow 2009 – p. 3

slide-19
SLIDE 19

The Standard Approach . . .

Glasgow 2009 – p. 4

slide-20
SLIDE 20

The Standard Approach . . . . . . dating from the work of Francis

Glasgow 2009 – p. 4

slide-21
SLIDE 21

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ...

Glasgow 2009 – p. 4

slide-22
SLIDE 22

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR

Glasgow 2009 – p. 4

slide-23
SLIDE 23

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A

Glasgow 2009 – p. 4

slide-24
SLIDE 24

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A repeat!

Glasgow 2009 – p. 4

slide-25
SLIDE 25

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A repeat! This is simple,

Glasgow 2009 – p. 4

slide-26
SLIDE 26

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A repeat! This is simple, appealing,

Glasgow 2009 – p. 4

slide-27
SLIDE 27

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A repeat! This is simple, appealing, does not require much preparation,

Glasgow 2009 – p. 4

slide-28
SLIDE 28

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A repeat! This is simple, appealing, does not require much preparation, but ...

Glasgow 2009 – p. 4

slide-29
SLIDE 29

The Standard Approach . . . . . . dating from the work of Francis

Start with the basic algorithm ... A = QR RQ = ˆ A repeat! This is simple, appealing, does not require much preparation, but ... ...it is far removed from versions of the QR algorithm that are actually used.

Glasgow 2009 – p. 4

slide-30
SLIDE 30

Refinements

Glasgow 2009 – p. 5

slide-31
SLIDE 31

Refinements

shifts of origin

Glasgow 2009 – p. 5

slide-32
SLIDE 32

Refinements

shifts of origin reduction to Hessenberg form

Glasgow 2009 – p. 5

slide-33
SLIDE 33

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis)

Glasgow 2009 – p. 5

slide-34
SLIDE 34

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR

Glasgow 2009 – p. 5

slide-35
SLIDE 35

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR multiple shift QR

Glasgow 2009 – p. 5

slide-36
SLIDE 36

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR multiple shift QR implicit-Q theorem

Glasgow 2009 – p. 5

slide-37
SLIDE 37

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR multiple shift QR implicit-Q theorem vs. Krylov subspaces

Glasgow 2009 – p. 5

slide-38
SLIDE 38

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR multiple shift QR implicit-Q theorem vs. Krylov subspaces Introducing Krylov subspaces improves understanding,

Glasgow 2009 – p. 5

slide-39
SLIDE 39

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR multiple shift QR implicit-Q theorem vs. Krylov subspaces Introducing Krylov subspaces improves understanding, allows more general results,

Glasgow 2009 – p. 5

slide-40
SLIDE 40

Refinements

shifts of origin reduction to Hessenberg form implicit shift technique (Francis) double shift QR multiple shift QR implicit-Q theorem vs. Krylov subspaces Introducing Krylov subspaces improves understanding, allows more general results, and prepares students for Krylov subspace methods.

Glasgow 2009 – p. 5

slide-41
SLIDE 41

The Implicitly Shifted QR Iteration

Glasgow 2009 – p. 6

slide-42
SLIDE 42

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form

Glasgow 2009 – p. 6

slide-43
SLIDE 43

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm

Glasgow 2009 – p. 6

slide-44
SLIDE 44

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6)

Glasgow 2009 – p. 6

slide-45
SLIDE 45

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI)

Glasgow 2009 – p. 6

slide-46
SLIDE 46

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI) expensive!

Glasgow 2009 – p. 6

slide-47
SLIDE 47

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI) expensive! compute p(A)e1

Glasgow 2009 – p. 6

slide-48
SLIDE 48

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI) expensive! compute p(A)e1 cheap!

Glasgow 2009 – p. 6

slide-49
SLIDE 49

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI) expensive! compute p(A)e1 cheap! Build unitary Q0 with q1 = αp(A)e1.

Glasgow 2009 – p. 6

slide-50
SLIDE 50

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI) expensive! compute p(A)e1 cheap! Build unitary Q0 with q1 = αp(A)e1. Perform similarity transform A → Q∗

0AQ0.

Glasgow 2009 – p. 6

slide-51
SLIDE 51

The Implicitly Shifted QR Iteration

matrix is in upper Hessenberg form pick some shifts ρ1, ..., ρm (m = 1, 2, 4, 6) p(A) = (A − ρ1I) · · · (A − ρmI) expensive! compute p(A)e1 cheap! Build unitary Q0 with q1 = αp(A)e1. Perform similarity transform A → Q∗

0AQ0.

Hessenberg form is disturbed.

Glasgow 2009 – p. 6

slide-52
SLIDE 52

An Upper Hessenberg Matrix

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

Glasgow 2009 – p. 7

slide-53
SLIDE 53

After the Transformation (Q∗

0AQ0)

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

Glasgow 2009 – p. 8

slide-54
SLIDE 54

After the Transformation (Q∗

0AQ0)

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

Now return the matrix to Hessenberg form.

Glasgow 2009 – p. 8

slide-55
SLIDE 55

Chasing the Bulge

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

Glasgow 2009 – p. 9

slide-56
SLIDE 56

Chasing the Bulge

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

Glasgow 2009 – p. 10

slide-57
SLIDE 57

Done

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

Glasgow 2009 – p. 11

slide-58
SLIDE 58

Done

❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅ ❅

The implicit QR step is complete!

Glasgow 2009 – p. 11

slide-59
SLIDE 59

Summary of Implicit QR Iteration

Glasgow 2009 – p. 12

slide-60
SLIDE 60

Summary of Implicit QR Iteration

Pick some shifts.

Glasgow 2009 – p. 12

slide-61
SLIDE 61

Summary of Implicit QR Iteration

Pick some shifts. Compute p(A)e1. (p determined by shifts)

Glasgow 2009 – p. 12

slide-62
SLIDE 62

Summary of Implicit QR Iteration

Pick some shifts. Compute p(A)e1. (p determined by shifts) Build Q0 with first column q1 = αp(A)e1.

Glasgow 2009 – p. 12

slide-63
SLIDE 63

Summary of Implicit QR Iteration

Pick some shifts. Compute p(A)e1. (p determined by shifts) Build Q0 with first column q1 = αp(A)e1. Make a bulge. (A → Q∗

0AQ0)

Glasgow 2009 – p. 12

slide-64
SLIDE 64

Summary of Implicit QR Iteration

Pick some shifts. Compute p(A)e1. (p determined by shifts) Build Q0 with first column q1 = αp(A)e1. Make a bulge. (A → Q∗

0AQ0)

Chase the bulge. (return to Hessenberg form)

Glasgow 2009 – p. 12

slide-65
SLIDE 65

Summary of Implicit QR Iteration

Pick some shifts. Compute p(A)e1. (p determined by shifts) Build Q0 with first column q1 = αp(A)e1. Make a bulge. (A → Q∗

0AQ0)

Chase the bulge. (return to Hessenberg form) ˆ A = Q∗AQ

Glasgow 2009 – p. 12

slide-66
SLIDE 66

Question

Glasgow 2009 – p. 13

slide-67
SLIDE 67

Question

This differs a lot from the basic QR step.

Glasgow 2009 – p. 13

slide-68
SLIDE 68

Question

This differs a lot from the basic QR step. A = QR RQ = ˆ A

Glasgow 2009 – p. 13

slide-69
SLIDE 69

Question

This differs a lot from the basic QR step. A = QR RQ = ˆ A Can we carve a reasonable pedagogical path that leads directly to the implicitly-shifted QR algorithm,

Glasgow 2009 – p. 13

slide-70
SLIDE 70

Question

This differs a lot from the basic QR step. A = QR RQ = ˆ A Can we carve a reasonable pedagogical path that leads directly to the implicitly-shifted QR algorithm, bypassing the basic QR algorithm entirely?

Glasgow 2009 – p. 13

slide-71
SLIDE 71

Question

This differs a lot from the basic QR step. A = QR RQ = ˆ A Can we carve a reasonable pedagogical path that leads directly to the implicitly-shifted QR algorithm, bypassing the basic QR algorithm entirely? That’s what we are going to do today.

Glasgow 2009 – p. 13

slide-72
SLIDE 72

Ingredients

Glasgow 2009 – p. 14

slide-73
SLIDE 73

Ingredients

subspace iteration (power method)

Glasgow 2009 – p. 14

slide-74
SLIDE 74

Ingredients

subspace iteration (power method) Krylov subspaces

Glasgow 2009 – p. 14

slide-75
SLIDE 75

Ingredients

subspace iteration (power method) Krylov subspaces and subspace iteration

Glasgow 2009 – p. 14

slide-76
SLIDE 76

Ingredients

subspace iteration (power method) Krylov subspaces and subspace iteration (unitary) similarity transformation (change of coordinate system)

Glasgow 2009 – p. 14

slide-77
SLIDE 77

Ingredients

subspace iteration (power method) Krylov subspaces and subspace iteration (unitary) similarity transformation (change of coordinate system) reduction to Hessenberg form

Glasgow 2009 – p. 14

slide-78
SLIDE 78

Ingredients

subspace iteration (power method) Krylov subspaces and subspace iteration (unitary) similarity transformation (change of coordinate system) reduction to Hessenberg form Hessenberg form and Krylov subspaces (instead of implicit-Q theorem)

Glasgow 2009 – p. 14

slide-79
SLIDE 79

Ingredients

subspace iteration (power method) Krylov subspaces and subspace iteration (unitary) similarity transformation (change of coordinate system) reduction to Hessenberg form Hessenberg form and Krylov subspaces (instead of implicit-Q theorem)

No Magic Shortcut!

Glasgow 2009 – p. 14

slide-80
SLIDE 80

Power Method, Subspace Iteration

Glasgow 2009 – p. 15

slide-81
SLIDE 81

Power Method, Subspace Iteration

v, Av, A2v, A3v, ...

Glasgow 2009 – p. 15

slide-82
SLIDE 82

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1|

Glasgow 2009 – p. 15

slide-83
SLIDE 83

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ...

Glasgow 2009 – p. 15

slide-84
SLIDE 84

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ... subspaces of dimension j

Glasgow 2009 – p. 15

slide-85
SLIDE 85

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ... subspaces of dimension j (|λj+1/λj |)

Glasgow 2009 – p. 15

slide-86
SLIDE 86

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ... subspaces of dimension j (|λj+1/λj |) Substitute p(A) for A

Glasgow 2009 – p. 15

slide-87
SLIDE 87

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ... subspaces of dimension j (|λj+1/λj |) Substitute p(A) for A (shifts, multiple steps)

Glasgow 2009 – p. 15

slide-88
SLIDE 88

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ... subspaces of dimension j (|λj+1/λj |) Substitute p(A) for A (shifts, multiple steps) S, p(A)S, p(A)2S, p(A)3S, ...

Glasgow 2009 – p. 15

slide-89
SLIDE 89

Power Method, Subspace Iteration

v, Av, A2v, A3v, ... convergence rate |λ2/λ1| S, AS, A2S, A3S, ... subspaces of dimension j (|λj+1/λj |) Substitute p(A) for A (shifts, multiple steps) S, p(A)S, p(A)2S, p(A)3S, ... convergence rate |p(λj+1)/p(λj)|

Glasgow 2009 – p. 15

slide-90
SLIDE 90

Krylov Subspaces . . .

Glasgow 2009 – p. 16

slide-91
SLIDE 91

Krylov Subspaces . . . . . . and Subspace Iteration

Glasgow 2009 – p. 16

slide-92
SLIDE 92

Krylov Subspaces . . . . . . and Subspace Iteration

Def: Kj(A, q) = span

  • q, Aq, A2q, . . . , Aj−1q
  • Glasgow 2009 – p. 16
slide-93
SLIDE 93

Krylov Subspaces . . . . . . and Subspace Iteration

Def: Kj(A, q) = span

  • q, Aq, A2q, . . . , Aj−1q
  • j = 1, 2, 3, ...

(nested subspaces)

Glasgow 2009 – p. 16

slide-94
SLIDE 94

Krylov Subspaces . . . . . . and Subspace Iteration

Def: Kj(A, q) = span

  • q, Aq, A2q, . . . , Aj−1q
  • j = 1, 2, 3, ...

(nested subspaces) Kj(A, q) are “determined by q”.

Glasgow 2009 – p. 16

slide-95
SLIDE 95

Krylov Subspaces . . . . . . and Subspace Iteration

Def: Kj(A, q) = span

  • q, Aq, A2q, . . . , Aj−1q
  • j = 1, 2, 3, ...

(nested subspaces) Kj(A, q) are “determined by q”. p(A)Kj(A, q) = Kj(A, p(A)q)

Glasgow 2009 – p. 16

slide-96
SLIDE 96

Krylov Subspaces . . . . . . and Subspace Iteration

Def: Kj(A, q) = span

  • q, Aq, A2q, . . . , Aj−1q
  • j = 1, 2, 3, ...

(nested subspaces) Kj(A, q) are “determined by q”. p(A)Kj(A, q) = Kj(A, p(A)q) ...because p(A)A = Ap(A)

Glasgow 2009 – p. 16

slide-97
SLIDE 97

Krylov Subspaces . . . . . . and Subspace Iteration

Def: Kj(A, q) = span

  • q, Aq, A2q, . . . , Aj−1q
  • j = 1, 2, 3, ...

(nested subspaces) Kj(A, q) are “determined by q”. p(A)Kj(A, q) = Kj(A, p(A)q) ...because p(A)A = Ap(A) Conclusion: Power method induces nested subspace iterations on Krylov subspaces.

Glasgow 2009 – p. 16

slide-98
SLIDE 98

power method: p(A)kq

Glasgow 2009 – p. 17

slide-99
SLIDE 99

power method: p(A)kq nested subspace iterations: p(A)kKj(A, q) = Kj(A, p(A)kq) j = 1, 2, 3, . . .

Glasgow 2009 – p. 17

slide-100
SLIDE 100

power method: p(A)kq nested subspace iterations: p(A)kKj(A, q) = Kj(A, p(A)kq) j = 1, 2, 3, . . . convergence rates: |p(λj+1)/p(λj)|, j = 1, 2, 3, . . .

Glasgow 2009 – p. 17

slide-101
SLIDE 101

(Unitary) Similarity Transforms

Glasgow 2009 – p. 18

slide-102
SLIDE 102

(Unitary) Similarity Transforms

A → Q∗AQ preserves eigenvalues

Glasgow 2009 – p. 18

slide-103
SLIDE 103

(Unitary) Similarity Transforms

A → Q∗AQ preserves eigenvalues transforms eigenvectors in a simple way (w → Q∗w)

Glasgow 2009 – p. 18

slide-104
SLIDE 104

(Unitary) Similarity Transforms

A → Q∗AQ preserves eigenvalues transforms eigenvectors in a simple way (w → Q∗w) is a change of coordinate system (v → Q∗v)

Glasgow 2009 – p. 18

slide-105
SLIDE 105

(Unitary) Similarity Transforms

A → Q∗AQ preserves eigenvalues transforms eigenvectors in a simple way (w → Q∗w) is a change of coordinate system (v → Q∗v) triangular form (eigenvalues)

Glasgow 2009 – p. 18

slide-106
SLIDE 106

(Unitary) Similarity Transforms

A → Q∗AQ preserves eigenvalues transforms eigenvectors in a simple way (w → Q∗w) is a change of coordinate system (v → Q∗v) triangular form (eigenvalues) relationship of invariant subspaces to triangular form

Glasgow 2009 – p. 18

slide-107
SLIDE 107

Subspace Iteration with change of coordinate system

Glasgow 2009 – p. 19

slide-108
SLIDE 108

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej}

Glasgow 2009 – p. 19

slide-109
SLIDE 109

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej} p(A)S = span{p(A)e1, . . . , p(A)ej} = span{q1, . . . , qj} (orthonormal)

Glasgow 2009 – p. 19

slide-110
SLIDE 110

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej} p(A)S = span{p(A)e1, . . . , p(A)ej} = span{q1, . . . , qj} (orthonormal) build unitary Q = [q1 · · · qj · · ·]

Glasgow 2009 – p. 19

slide-111
SLIDE 111

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej} p(A)S = span{p(A)e1, . . . , p(A)ej} = span{q1, . . . , qj} (orthonormal) build unitary Q = [q1 · · · qj · · ·] change coordinate system: ˆ A = Q∗AQ

Glasgow 2009 – p. 19

slide-112
SLIDE 112

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej} p(A)S = span{p(A)e1, . . . , p(A)ej} = span{q1, . . . , qj} (orthonormal) build unitary Q = [q1 · · · qj · · ·] change coordinate system: ˆ A = Q∗AQ qk → Q∗qk = ek

Glasgow 2009 – p. 19

slide-113
SLIDE 113

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej} p(A)S = span{p(A)e1, . . . , p(A)ej} = span{q1, . . . , qj} (orthonormal) build unitary Q = [q1 · · · qj · · ·] change coordinate system: ˆ A = Q∗AQ qk → Q∗qk = ek span{q1, . . . , qj} → span{e1, . . . , ej}

Glasgow 2009 – p. 19

slide-114
SLIDE 114

Subspace Iteration with change of coordinate system

take S = span{e1, . . . , ej} p(A)S = span{p(A)e1, . . . , p(A)ej} = span{q1, . . . , qj} (orthonormal) build unitary Q = [q1 · · · qj · · ·] change coordinate system: ˆ A = Q∗AQ qk → Q∗qk = ek span{q1, . . . , qj} → span{e1, . . . , ej} ready for next iteration

Glasgow 2009 – p. 19

slide-115
SLIDE 115

This version of subspace iteration ...

Glasgow 2009 – p. 20

slide-116
SLIDE 116

This version of subspace iteration ... ...holds the subspace fixed

Glasgow 2009 – p. 20

slide-117
SLIDE 117

This version of subspace iteration ... ...holds the subspace fixed while the matrix changes.

Glasgow 2009 – p. 20

slide-118
SLIDE 118

This version of subspace iteration ... ...holds the subspace fixed while the matrix changes. ...moving toward a matrix under which span{e1, . . . , ej} is invariant.

Glasgow 2009 – p. 20

slide-119
SLIDE 119

This version of subspace iteration ... ...holds the subspace fixed while the matrix changes. ...moving toward a matrix under which span{e1, . . . , ej} is invariant. A → A11 A12 A22

  • (A11 is j × j.)

Glasgow 2009 – p. 20

slide-120
SLIDE 120

Reduction to Hessenberg form

Glasgow 2009 – p. 21

slide-121
SLIDE 121

Reduction to Hessenberg form

Q → Q∗AQ = H (a similarity transformation)

Glasgow 2009 – p. 21

slide-122
SLIDE 122

Reduction to Hessenberg form

Q → Q∗AQ = H (a similarity transformation) can always be done (direct method, O(n3) flops)

Glasgow 2009 – p. 21

slide-123
SLIDE 123

Reduction to Hessenberg form

Q → Q∗AQ = H (a similarity transformation) can always be done (direct method, O(n3) flops) brings us closer to triangular form

Glasgow 2009 – p. 21

slide-124
SLIDE 124

Reduction to Hessenberg form

Q → Q∗AQ = H (a similarity transformation) can always be done (direct method, O(n3) flops) brings us closer to triangular form makes computations cheaper

Glasgow 2009 – p. 21

slide-125
SLIDE 125

Reduction to Hessenberg form

Q → Q∗AQ = H (a similarity transformation) can always be done (direct method, O(n3) flops) brings us closer to triangular form makes computations cheaper First column q1 can be chosen “arbitrarily”.

Glasgow 2009 – p. 21

slide-126
SLIDE 126

Reduction to Hessenberg form

Q → Q∗AQ = H (a similarity transformation) can always be done (direct method, O(n3) flops) brings us closer to triangular form makes computations cheaper First column q1 can be chosen “arbitrarily”. Example: q1 = αp(A)e1

Glasgow 2009 – p. 21

slide-127
SLIDE 127

Krylov Subspaces . . .

Glasgow 2009 – p. 22

slide-128
SLIDE 128

Krylov Subspaces . . . . . . and Hessenberg matrices . . .

Glasgow 2009 – p. 22

slide-129
SLIDE 129

Krylov Subspaces . . . . . . and Hessenberg matrices . . .

...go hand in hand.

Glasgow 2009 – p. 22

slide-130
SLIDE 130

Krylov Subspaces . . . . . . and Hessenberg matrices . . .

...go hand in hand. A properly upper Hessenberg = ⇒ Kj(A, e1) = span{e1, . . . , ej}.

Glasgow 2009 – p. 22

slide-131
SLIDE 131

Krylov Subspaces . . . . . . and Hessenberg matrices . . .

...go hand in hand. A properly upper Hessenberg = ⇒ Kj(A, e1) = span{e1, . . . , ej}. More generally ...

Glasgow 2009 – p. 22

slide-132
SLIDE 132

Krylov-Hessenberg Relationship

Glasgow 2009 – p. 23

slide-133
SLIDE 133

Krylov-Hessenberg Relationship

If H = Q∗AQ, and H is properly upper Hessenberg, then for j = 1, 2, 3, ..., span{q1, . . . , qj} = Kj(A, q1).

Glasgow 2009 – p. 23

slide-134
SLIDE 134

Krylov-Hessenberg Relationship

If H = Q∗AQ, and H is properly upper Hessenberg, then for j = 1, 2, 3, ..., span{q1, . . . , qj} = Kj(A, q1). Proof (sketch):

Glasgow 2009 – p. 23

slide-135
SLIDE 135

Krylov-Hessenberg Relationship

If H = Q∗AQ, and H is properly upper Hessenberg, then for j = 1, 2, 3, ..., span{q1, . . . , qj} = Kj(A, q1). Proof (sketch): Induction on j.

Glasgow 2009 – p. 23

slide-136
SLIDE 136

Krylov-Hessenberg Relationship

If H = Q∗AQ, and H is properly upper Hessenberg, then for j = 1, 2, 3, ..., span{q1, . . . , qj} = Kj(A, q1). Proof (sketch): Induction on j. AQ = QH

Glasgow 2009 – p. 23

slide-137
SLIDE 137

Krylov-Hessenberg Relationship

If H = Q∗AQ, and H is properly upper Hessenberg, then for j = 1, 2, 3, ..., span{q1, . . . , qj} = Kj(A, q1). Proof (sketch): Induction on j. AQ = QH Aqj =

n

  • i=1

qihij =

j

  • i=1

qihij + qj+1hj+1,j

Glasgow 2009 – p. 23

slide-138
SLIDE 138

Aqj =

j

  • i=1

qihij + qj+1hj+1,j

Glasgow 2009 – p. 24

slide-139
SLIDE 139

Aqj =

j

  • i=1

qihij + qj+1hj+1,j qj+1hj+1,j = Aqj −

j

  • i=1

qihij

Glasgow 2009 – p. 24

slide-140
SLIDE 140

Aqj =

j

  • i=1

qihij + qj+1hj+1,j qj+1hj+1,j = Aqj −

j

  • i=1

qihij Proof by induction follows immediately.

Glasgow 2009 – p. 24

slide-141
SLIDE 141

Aqj =

j

  • i=1

qihij + qj+1hj+1,j qj+1hj+1,j = Aqj −

j

  • i=1

qihij Proof by induction follows immediately. This also gives the student a preview of the Arnoldi process,

Glasgow 2009 – p. 24

slide-142
SLIDE 142

Aqj =

j

  • i=1

qihij + qj+1hj+1,j qj+1hj+1,j = Aqj −

j

  • i=1

qihij Proof by induction follows immediately. This also gives the student a preview of the Arnoldi process, the most important Krylov subspace method.

Glasgow 2009 – p. 24

slide-143
SLIDE 143

and now,

Glasgow 2009 – p. 25

slide-144
SLIDE 144

and now, the Implicit QR Iteration

Glasgow 2009 – p. 25

slide-145
SLIDE 145

and now, the Implicit QR Iteration

Work with Hessenberg form to get ...

Glasgow 2009 – p. 25

slide-146
SLIDE 146

and now, the Implicit QR Iteration

Work with Hessenberg form to get ... ...efficiency.

Glasgow 2009 – p. 25

slide-147
SLIDE 147

and now, the Implicit QR Iteration

Work with Hessenberg form to get ... ...efficiency. ...automatic nested subspace iterations.

Glasgow 2009 – p. 25

slide-148
SLIDE 148

and now, the Implicit QR Iteration

Work with Hessenberg form to get ... ...efficiency. ...automatic nested subspace iterations. Get some shifts ρ1, ..., ρm to define p.

Glasgow 2009 – p. 25

slide-149
SLIDE 149

and now, the Implicit QR Iteration

Work with Hessenberg form to get ... ...efficiency. ...automatic nested subspace iterations. Get some shifts ρ1, ..., ρm to define p. Compute p(A)e1. (power method)

Glasgow 2009 – p. 25

slide-150
SLIDE 150

and now, the Implicit QR Iteration

Work with Hessenberg form to get ... ...efficiency. ...automatic nested subspace iterations. Get some shifts ρ1, ..., ρm to define p. Compute p(A)e1. (power method) Transform A to upper Hessenberg form: ˆ A = Q∗AQ by a matrix Q that has q1 = αp(A)e1.

Glasgow 2009 – p. 25

slide-151
SLIDE 151

ˆ A = Q∗AQ where q1 = αp(A)e1.

Glasgow 2009 – p. 26

slide-152
SLIDE 152

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1

Glasgow 2009 – p. 26

slide-153
SLIDE 153

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1 power method with a change of coordinate system. Moreover ...

Glasgow 2009 – p. 26

slide-154
SLIDE 154

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1 power method with a change of coordinate system. Moreover ... p(A)Kj(A, e1) = Kj(A, p(A)e1)

Glasgow 2009 – p. 26

slide-155
SLIDE 155

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1 power method with a change of coordinate system. Moreover ... p(A)Kj(A, e1) = Kj(A, p(A)e1) i.e. p(A)span{e1, . . . , ej} = span{q1, . . . , qj}

Glasgow 2009 – p. 26

slide-156
SLIDE 156

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1 power method with a change of coordinate system. Moreover ... p(A)Kj(A, e1) = Kj(A, p(A)e1) i.e. p(A)span{e1, . . . , ej} = span{q1, . . . , qj} subspace iteration with a change of coordinate system

Glasgow 2009 – p. 26

slide-157
SLIDE 157

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1 power method with a change of coordinate system. Moreover ... p(A)Kj(A, e1) = Kj(A, p(A)e1) i.e. p(A)span{e1, . . . , ej} = span{q1, . . . , qj} subspace iteration with a change of coordinate system j = 1, 2, 3, ..., n − 1

Glasgow 2009 – p. 26

slide-158
SLIDE 158

ˆ A = Q∗AQ where q1 = αp(A)e1. q1 → Q∗q1 = e1 power method with a change of coordinate system. Moreover ... p(A)Kj(A, e1) = Kj(A, p(A)e1) i.e. p(A)span{e1, . . . , ej} = span{q1, . . . , qj} subspace iteration with a change of coordinate system j = 1, 2, 3, ..., n − 1 |p(λj+1)/p(λj)| j = 1, 2, 3, ..., n − 1

Glasgow 2009 – p. 26

slide-159
SLIDE 159

Details

Glasgow 2009 – p. 27

slide-160
SLIDE 160

Details

choice of shifts

Glasgow 2009 – p. 27

slide-161
SLIDE 161

Details

choice of shifts We change the shifts at each step.

Glasgow 2009 – p. 27

slide-162
SLIDE 162

Details

choice of shifts We change the shifts at each step. ⇒ quadratic or cubic convergence

Glasgow 2009 – p. 27

slide-163
SLIDE 163

Details

choice of shifts We change the shifts at each step. ⇒ quadratic or cubic convergence

Other Questions

Glasgow 2009 – p. 27

slide-164
SLIDE 164

Details

choice of shifts We change the shifts at each step. ⇒ quadratic or cubic convergence

Other Questions

...how to get BLAS 3 speed? ...how to parallelize?

Glasgow 2009 – p. 27

slide-165
SLIDE 165

In Conclusion

Glasgow 2009 – p. 28

slide-166
SLIDE 166

In Conclusion

A careful study of

Glasgow 2009 – p. 28

slide-167
SLIDE 167

In Conclusion

A careful study of the power method and its extensions,

Glasgow 2009 – p. 28

slide-168
SLIDE 168

In Conclusion

A careful study of the power method and its extensions, similarity transformations,

Glasgow 2009 – p. 28

slide-169
SLIDE 169

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form,

Glasgow 2009 – p. 28

slide-170
SLIDE 170

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form, and Krylov subspaces

Glasgow 2009 – p. 28

slide-171
SLIDE 171

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form, and Krylov subspaces leads directly to the implicitly shifted QR algorithm.

Glasgow 2009 – p. 28

slide-172
SLIDE 172

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form, and Krylov subspaces leads directly to the implicitly shifted QR algorithm. The basic, explicit QR algorithm is skipped.

Glasgow 2009 – p. 28

slide-173
SLIDE 173

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form, and Krylov subspaces leads directly to the implicitly shifted QR algorithm. The basic, explicit QR algorithm is skipped. The implicit-Q theorem is avoided.

Glasgow 2009 – p. 28

slide-174
SLIDE 174

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form, and Krylov subspaces leads directly to the implicitly shifted QR algorithm. The basic, explicit QR algorithm is skipped. The implicit-Q theorem is avoided. Krylov subspaces are emphasized.

Glasgow 2009 – p. 28

slide-175
SLIDE 175

In Conclusion

A careful study of the power method and its extensions, similarity transformations, Hessenberg form, and Krylov subspaces leads directly to the implicitly shifted QR algorithm. The basic, explicit QR algorithm is skipped. The implicit-Q theorem is avoided. Krylov subspaces are emphasized. Krylov subspace methods are foreshadowed.

Glasgow 2009 – p. 28

slide-176
SLIDE 176

One Last Question

Glasgow 2009 – p. 29

slide-177
SLIDE 177

One Last Question

In the implicitly shifted QR algorithm

Glasgow 2009 – p. 29

slide-178
SLIDE 178

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen.

Glasgow 2009 – p. 29

slide-179
SLIDE 179

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen. Should the implicitly-shifted QR algorithm be given a different name?

Glasgow 2009 – p. 29

slide-180
SLIDE 180

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen. Should the implicitly-shifted QR algorithm be given a different name? Some possibilities: ...

Glasgow 2009 – p. 29

slide-181
SLIDE 181

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen. Should the implicitly-shifted QR algorithm be given a different name? Some possibilities: ... ...unitary bulge-chasing algorithm

Glasgow 2009 – p. 29

slide-182
SLIDE 182

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen. Should the implicitly-shifted QR algorithm be given a different name? Some possibilities: ... ...unitary bulge-chasing algorithm ...Hessenberg-Krylov nonstationary progressive nested subspace iteration

Glasgow 2009 – p. 29

slide-183
SLIDE 183

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen. Should the implicitly-shifted QR algorithm be given a different name? Some possibilities: ... ...unitary bulge-chasing algorithm ...Hessenberg-Krylov nonstationary progressive nested subspace iteration ...Francis’s algorithm

Glasgow 2009 – p. 29

slide-184
SLIDE 184

One Last Question

In the implicitly shifted QR algorithm the QR decomposition is nowhere to be seen. Should the implicitly-shifted QR algorithm be given a different name? Some possibilities: ... ...unitary bulge-chasing algorithm ...Hessenberg-Krylov nonstationary progressive nested subspace iteration ...Francis’s algorithm Thank you for your attention.

Glasgow 2009 – p. 29