Intervals, Tridiagonal Matrices and the Lanczos Method Wolfgang W - - PowerPoint PPT Presentation

intervals tridiagonal matrices and the lanczos method
SMART_READER_LITE
LIVE PREVIEW

Intervals, Tridiagonal Matrices and the Lanczos Method Wolfgang W - - PowerPoint PPT Presentation

Outline Lanczos Algorithm Clustered Ritz values Conclusion Intervals, Tridiagonal Matrices and the Lanczos Method Wolfgang W ulling August, 21th 2007 Wolfgang W ulling Intervals, Tridiagonal Matrices and the Lanczos Method Outline


slide-1
SLIDE 1

Outline Lanczos Algorithm Clustered Ritz values Conclusion

Intervals, Tridiagonal Matrices and the Lanczos Method

Wolfgang W¨ ulling August, 21th 2007

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-2
SLIDE 2

Outline Lanczos Algorithm Clustered Ritz values Conclusion

1 Lanczos Algorithm

Exact Arithmetic Finite Precision Arithmetic The residual quantity

2 Clustered Ritz values

The Conjecture Estimates for Residual Quantity

3 Conclusion

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-3
SLIDE 3

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Symmetric Eigenvalue Problem

Given a large, sparse and symmetric matrix A ∈ RN×N, find (approximations to) eigenvalues λ and eigenvectors u, i.e. Au = λu Lanczos Method extracts solution / approximations (θ, z) to eigenpair (λ, u) via orthogonal projection from Krylov subspace(s) Az − θz ⊥ K = span(q, Aq, A2q, ..) Ritz value θ ∈ R: θ ≈ λ. Ritz vector z ∈ K, z = 0: z ≈ u

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-4
SLIDE 4

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Lanzcos Recursion, Matrix Notation

AQk = QkTk + βk+1qk+1e∗

k

(1) Q∗

kQk

= Ik (2) Tk =         α1 β2 β2 α2 β3 ... ... ... ... ... βk βk αk         ∈ Rk×k Qk = [q1, . . . , qk] ∈ RN×k orthogonal Lanczos vectors algorithm terminates with a βd = 0, d ≤ N + 1.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-5
SLIDE 5

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Ritz approximations

Spectral decomposition of Tk Tks• = θ•s• Tk = Skdiag(θ1, ..., θk)S∗

k,

S∗

kSk = SkS∗ k = Ik

Ritz values at step k are eigenvalues θ• of Tk Ritz vectors are z• = Qks•, where s• eigenvector of Tk.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-6
SLIDE 6

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Qualitiy of Ritz approximations, δk,•

Quality of Ritz approximation to eigenvalue can be controlled by easily computable residual quantity: δk,• := βk+1|sk,•| where s• =

  • .

. . sk,•

  • , sk,• bottom element of eigenvector of Tk.

min |λ − θ•| ≤ ||Az• − θ•z•|| ||z•|| = ||Az• − θ•z•|| = δk,•

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-7
SLIDE 7

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Stabilized Ritz values

Theorem (C. Paige) Persistence Theorem min

µ |θ• − µ| ≤ δk,•,

µ Ritz value in subsequent step. Definition: A Ritz value θ• is called stabilized to within δk,•. If δk,• is small θ• is called stabilized. Ritz value can stabilize only close to an eigenvalue

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-8
SLIDE 8

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Perturbed Lanczos Recursion

Rounding errors lead to perturbed Lanczos recursion AQk = QkTk + βk+qk+1e∗

k + rounding errors

(3) Q∗

kQk

= Ik + rounding errors (4) Lanczos vectors qk may lose orthogonality – even for small number of iterations, k ≪ N. It is not guaranteed that Algorithm terminates (with βd = 0), might run forever

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-9
SLIDE 9

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Quality of Ritz approximations, δk,•

Quality of Ritz approximations in f.p.computations: δk,• = βk+1|sk,•| controls convergence also in f.p. computations – thanks to Theorem (C. Paige) At any step k of the (f.p.) Lanczos algorithm the following is valid min |λ − θ•| ≤ max{2.5(δk,• + small in ǫ), small in ǫ} (5)

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-10
SLIDE 10

Outline Lanczos Algorithm Clustered Ritz values Conclusion Exact Arithmetic Finite Precision Arithmetic The residual quantity

Interim Conclusion

Residual Quantity The residual quantity δk,• = βk+1|sk,•| controls convergence in exact and finite precision computations!

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-11
SLIDE 11

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Tight, well separated cluster

λl λr γ δ If δ ≪ γ : tight, well separated cluster.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-12
SLIDE 12

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

The Conjecture

Rounding errors: Multiple copies of Ritz approximations to single eigenvalue are generated. Ritz values cluster closely to eigenvalues of A. Conjecture (Strakoˇ s, Greenbaum, 1992) For any Ritz value θ• being part of a tight, well separated cluster, the value of δk• is small, i.e δk,• ≪ 1. Furthermore, for any Ritz value in the cluster δk• is proportional to

  • δ

γ =

  • cluster diameter

gap in spectrum

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-13
SLIDE 13

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Strakoˇ s Matrix, Example 1

Example 1(Strakoˇ s) Apply Lanczos Algorithm to diagonal matrix A with eigenvalues λν = λ1 + ν − 1 N − 1(λN − λ1)ρN−ν, ν = 2, ..., N − 1, (6) where λ1 = 0.1, λN = 100, ρ = 0.7 and N = 24. We look at cluster of two Ritz values close to λ22 ≈= 44.7944.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-14
SLIDE 14

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Strakoˇ s Matrix, Example 1

Example 1: Cluster of two Ritz values close to λ22 ≈ 44.7944.

20 21 22 23 24 25 26 27 28 29 30 10

−18

10

−16

10

−14

10

−12

10

−10

10

−8

10

−6

10

−4

10

−2

10 10

2

Steps k of Lanczos Algorithm

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-15
SLIDE 15

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Counterexample

Consider diagonal matrix A ∈ R23 with eigenvalues λ1 = −100, λj+1 = λj + 0.1 for j = 1, . . . , 10 λ12 = 0 and λ12+j = −λ12−j for j = 1, . . . , 11. λ1 λ11 λ12 = 0 λ13 λ23 Starting vector q1 =

1 √ 23(1, 1, ..., 1)∗:

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-16
SLIDE 16

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Counterexample

k δk,first δk,second

  • δ

γ

9 4.6614 × 10−7

  • 10

7.0356 × 101 7.0356 × 101 9.7023 × 10−05 11 2.3067 × 10−9

  • 12

7.0365 × 101 7.0347 × 101 6.8258 × 10−06 13 1.0724 × 10−11

  • 14

6.7432 × 101 7.3162 × 101 4.6544 × 10−07 15 5.6899 × 10−14

  • 16

5.3119 × 101 6.0079 × 101 3.0700 × 10−08 17 4.1795 × 10−14

  • 18

3.7987 × 10−1 3.4773 × 10−1 1.8764 × 10−09 19 6.5049 × 10−12

  • 20

1.5658 × 10−3 4.8528 × 10−4 2.0156 × 10−10

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-17
SLIDE 17

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Last example: Clustered Ritz values close to eigenvalue, but δk,• ≫

  • δ

γ for all Ritz value in the cluster.

But in (Strakoˇ s, Greenbaum, 1992), for some particular cases estimates obtained with δk,• ≤

  • δ

γ O(||A||).

Idea from (Strakoˇ s, Greenbaum, 1992): Estimate δk,• using

  • nly information about Ritz values in several consecutive

steps. Since δk,• = βk+1

  • =O(||A||)

|sk,•|

  • eigenvector element

concentrate on bottom elements of Tk’s eigenvectors.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-18
SLIDE 18

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Constant Cluster Size: k − 1, k

Theorem Suppose that number of Ritz values in a tight, well separated cluster is constant at steps k − 1 and k. Then:

  • cluster

(sk,•)2 < 3 δ γ − δ

γ

.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-19
SLIDE 19

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Outline of the proof

Proof. Observe, at the kth step, for any Ritz value θ (in the cluster) we have ψk (θ) ψk−1(θ) = 0, ψ′

k(θ)

ψk−1(θ) = 0.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-20
SLIDE 20

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Outline of the proof

Proof. Observe, at the kth step, for any Ritz value θ (in the cluster) we have ψk (θ) ψk−1(θ) = 0, ψ′

k(θ)

ψk−1(θ) = 0. Hence, use formula for sk,• and apply Residue Theorem:

  • cluster

(sk,•)2 =

  • cluster

ψk−1(θ) ψ′

k(θ)

= 1 2πi

  • Γ

ψk−1(z) ψk(z) dz. and estimate line integral.

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-21
SLIDE 21

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Constant Cluster Size: k, k + 1

Theorem Suppose that number of Ritz values in a tight, well separated cluster is constant at steps k and k + 1. Then:

  • cluster

(sk,•)2 < 3 O(||A||) β2

k+1

δ γ − δ

2

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-22
SLIDE 22

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Decreasing or Increasing Cluster Size

Theorem a) Decreasing: Suppose c Ritz values at step k − 1 and c − 1 Ritz values at step k form cluster, then:

  • cluster

(sk,•)2 < 4 3 √ 3 δ2

  • γ − δ

2

2 . b) Increasing: Let c Ritz values at step k and c + 1 Ritz values at step k + 1 form cluster, then:

  • cluster

(sk,•)2 < 4 3 √ 3 O(||A||) β2

k+1

δ2

  • γ − δ

2

2 .

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-23
SLIDE 23

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Example 1, continued

20 22 24 26 28 30 32 34 36 38 40 10

−35

10

−30

10

−25

10

−20

10

−15

10

−10

10

−5

10 10

5

Steps k of Lanczos Algorithm sum of squares of bottom elements and upper bound

A third Ritz value approximating λ22≈ 44.7944

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-24
SLIDE 24

Outline Lanczos Algorithm Clustered Ritz values Conclusion The Conjecture Estimates for Residual Quantity

Stabilized Cluster

Corollary For a tight well separated cluster with diameter δ and spectral gap γ we have δk,min ≤ O(||A||)        no estimate , ck = ck−1 + 1

1 √ck

  • δ

γ− δ

2

, ck = ck−1

1 √ck δ γ− δ

2

, ck = ck−1 − 1

  • r

δk,min ≤ O(||A||)        no estimate , ck = ck+1 + 1

1 √ck

  • δ

γ− δ

2

, ck = ck+1

1 √ck δ γ− δ

2

, ck = ck+1 − 1

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-25
SLIDE 25

Outline Lanczos Algorithm Clustered Ritz values Conclusion

What we have seen ...

Tight, well separated cluster must approximate an eigenvalue

  • f A.

At least one Ritz value will always remain close to an eigenvalue (persistence theorem). Any tight well separated cluster stabilized. in most cases: δk,• small for all Ritz values in a tight, well separated cluster; only exception is an alternating number of Ritz values in the cluster. Intermediate cluster can comprise at most one Ritz value, see also (Knizhnerman, 1995).

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method

slide-26
SLIDE 26

Outline Lanczos Algorithm Clustered Ritz values Conclusion

... and the main ingredients of the proofs:

Idea to express / estimate anything in terms of Ritz values – taken from (Strakoˇ s, Greenbaum (1992)) Residue Theorem Interlacing of Ritz values (not on these slides, but needed for the proofs).

Wolfgang W¨ ulling Intervals, Tridiagonal Matrices and the Lanczos Method