[PPT] - A rational Krylov method for non-linear matrix problems Karl PowerPoint Presentation

SLIDE 1

A rational Krylov method for non-linear matrix problems

Karl Meerbergen (Joint work with Roel Van Beeumen and Wim Michiels)

KU Leuven

SIMAX – Valencia

SLIDE 2

Motivation

Solution of linear eigenvalue problem Ax = λBx is pretty well understood

◮ Appealing method: shift-and-invert Arnoldi ◮ Reliable for computing several eigenvalues near a shift point. ◮ Rational Krylov: more than one interpolation points = poles.

Polynomial eigenvalue problems

◮ We can use Krylov methods (when we ‘linearize’)

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 2 / 24

SLIDE 3

Motivation

Solution of linear eigenvalue problem Ax = λBx is pretty well understood

◮ Appealing method: shift-and-invert Arnoldi ◮ Reliable for computing several eigenvalues near a shift point. ◮ Rational Krylov: more than one interpolation points = poles.

Polynomial eigenvalue problems

◮ We can use Krylov methods (when we ‘linearize’)

Solution of non-linear eigenvalue problem A(λ)x = 0

◮ Residual inverse iteration and Newton iterations are locally convergent

methods

◮ Globally convergent methods by using polynomial approximation of

A(λ)

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 2 / 24

SLIDE 4

Outline

1

Motivation

2

Polynomial eigenvalue problem

3

Nonlinear eigenvalue problem

4

Rational Krylov

5

A statement on moment matching and projection

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 3 / 24

SLIDE 5

Polynomial eigenvalue problem

Polynomial eigenvalue problem: A0x + λA1x + · · · + λNANx = 0 Transformed to

(A − λB)x =           A0 A1 · · · Ap−1 I ... I      − λ      · · · · · · −AN I ... . . . I                x λx . . . λp−1x      =

Is a linear problem, so we can use Arnoldi’s method Is a well known technique.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 4 / 24

SLIDE 6

Linearization for polynomial eigenvalue problems

(Shift and) invert transform A−1B: λ−1      x λx . . . λp−1x      =      S1 · · · · · · Sp I ... . . . I           x λx . . . λp−1x      Krylov sequence:    v . . .   

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 5 / 24

SLIDE 7

Linearization for polynomial eigenvalue problems

(Shift and) invert transform A−1B: λ−1      x λx . . . λp−1x      =      S1 · · · · · · Sp I ... . . . I           x λx . . . λp−1x      Krylov sequence:    v . . .         w = S1v v . . .     

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 5 / 24

SLIDE 8

Linearization for polynomial eigenvalue problems

(Shift and) invert transform A−1B: λ−1      x λx . . . λp−1x      =      S1 · · · · · · Sp I ... . . . I           x λx . . . λp−1x      Krylov sequence:    v . . .         w v . . .             t = S1w + S2v w v . . .       

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 5 / 24

SLIDE 9

Linearization for polynomial eigenvalue problems

(Shift and) invert transform A−1B: λ−1      x λx . . . λp−1x      =      S1 · · · · · · Sp I ... . . . I           x λx . . . λp−1x      Krylov sequence:    v . . .         w v . . .             t w v . . .                 u = S1t + S2w + S3v t w v . . .         

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 5 / 24

SLIDE 10

Linearization for polynomial eigenvalue problems

Shift-and-invert transform: λ−1      x λx . . . λp−1x      =      S1 · · · · · · Sp I ... . . . I           x λx . . . λp−1x      Krylov sequence:               v1 v2 v3 v4 · · · vk v1 v2 v3 vk−1 . . . v1 v2 vk−2 . . . v1 . . . . . . . . . . . . v1              

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 6 / 24

SLIDE 11

Nonlinear eigenvalue problem

Straightforward idea: choose an approximation of a fixed degree and solve the corresponding polynomial eigenvalue problem:

◮ Polynomial expansion, e.g. Taylor, Chebyshev, . . .

A(λ) ≈ A0p0(λ) + · · · + ANpN(λ) [Amiraslani, Corless, Lancaster, 2009], [Effenberger, Kressner, 2011]

◮ Rational expansion, e.g. (Chebyshev) Pad´

e A(λ) ≈ A0 α0 λ − σ0 + · · · + AN αN λ − σN [Su & Bai, 2011]

◮ Spectral discretization

A(λ) ≈ A0 + λA0p0(λ) + · · · + ANpN(λ) α0p0(λ) + · · · + αNpN(λ) [Trefethen 2000], [Michiels, Niculescu 2007], [Breda, Maset, Vermiglio 2005]

In this talk, we explore a ‘flexible’ choice of N and pj.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 7 / 24

SLIDE 12

Taylor-Arnoldi

[Jarlebring, Meerbergen, Michiels, 2010] Power sequence on Taylor series A(λ) ≈ A0 + λA1 + λ2A2 + · · · For k iterations of Arnoldi, use Companion linearization with N > k Arnoldi:    v . . .    Breakthrough for solving the delay (equation) eigenvalue problem

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 8 / 24

SLIDE 13

Taylor-Arnoldi

[Jarlebring, Meerbergen, Michiels, 2010] Power sequence on Taylor series A(λ) ≈ A0 + λA1 + λ2A2 + · · · For k iterations of Arnoldi, use Companion linearization with N > k Arnoldi:    v . . .         w = −A−1

0 A1v

v . . .      Breakthrough for solving the delay (equation) eigenvalue problem

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 8 / 24

SLIDE 14

Taylor-Arnoldi

[Jarlebring, Meerbergen, Michiels, 2010] Power sequence on Taylor series A(λ) ≈ A0 + λA1 + λ2A2 + · · · For k iterations of Arnoldi, use Companion linearization with N > k Arnoldi:    v . . .         w v . . .             t = −A−1

0 (A1w + A2)v

w v . . .        Breakthrough for solving the delay (equation) eigenvalue problem

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 8 / 24

SLIDE 15

Taylor-Arnoldi

[Jarlebring, Meerbergen, Michiels, 2010] Power sequence on Taylor series A(λ) ≈ A0 + λA1 + λ2A2 + · · · For k iterations of Arnoldi, use Companion linearization with N > k Arnoldi:    v . . .         w v . . .             t w v . . .                 u = −A−1

0 (A1t + A2w + A3)v

t w v . . .          Breakthrough for solving the delay (equation) eigenvalue problem

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 8 / 24

SLIDE 16

Other polynomials

Use polynomials in order to

◮ avoid derivatives or ◮ interpolation in more than one point.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 9 / 24

SLIDE 17

Other polynomials

Use polynomials in order to

◮ avoid derivatives or ◮ interpolation in more than one point.

Newton polynomial Lagrange polynomial Chebyshev polynomial

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 9 / 24

SLIDE 18

Other polynomials

Use polynomials in order to

◮ avoid derivatives or ◮ interpolation in more than one point.

Newton polynomial Lagrange polynomial Chebyshev polynomial A (partly) successful generalization of Taylor-Arnoldi to rational Krylov

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 9 / 24

SLIDE 19

Newton polynomial basis

Newton polynomials: N0 := 1, N1 := (λ − σ0), N2 := (λ − σ0)(λ − σ1), . . . A(λ) = A0N0 + A1N1 + · · · with

◮ A0 = [A0] = A(σj) and ◮ Aj = [A0, . . . , Aj] = [A1, . . . , Aj] − [A0, . . . , Aj−1]

σj − σ0 (divided difference).

σ0, σ1, . . . are interpolation points. Linearization:

              A0 A1 · · · AN σ0I I σ1I I ... ... σN−2I I        − λ       · · · · · · I . . . ... . . . I                      x N1(λ)x N2(λ)x . . . . . .         = 0

A−1B is not a block upper Hessenberg matrix

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 10 / 24

SLIDE 20

Rational Krylov

Shift-and-invert step: (A − σjB)w = Bv First step: multiply by B: shift vector downwards B · v = w       · · · · · · I . . . ... . . . I       ·        v1 v2 . . . vN−1 vN        =        v1 v2 . . . vN−1        Then apply (A − σjB)−1: A − σjB =        A0 A1 · · · AN (σ0 − σj)I I (σ1 − σj)I I ... ... (σN−1 − σj)I I       

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 11 / 24

SLIDE 21

Rational Krylov

µj

i = σi − σj.

Shift-and-invert-step:

        A0 A1 A2 A3 A4 A5 σ0I I σ1I I σ2I I σ3I I σ4I I        

−1 

       ⋆         =         ⋆ ⋆ ⋆ ⋆ ⋆ ⋆        

(A − σjB)−1B applied to vj respects the desired zero structure in the vectors This is rational Krylov with poles σ1, σ2, . . . Linear recurrence relation of the form: AVk+1Hk = BVk+1Lk

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 12 / 24

SLIDE 22

Rational Krylov

µj

i = σi − σj.

Shift-and-invert-step: A − σ1B

        A0 A1 A2 A3 A4 A5 µ1

0I

I I µ1

2I

I µ1

3I

I µ1

4I

I        

−1 

       ⋆         =         ⋆ ⋆        

(A − σjB)−1B applied to vj respects the desired zero structure in the vectors This is rational Krylov with poles σ1, σ2, . . . Linear recurrence relation of the form: AVk+1Hk = BVk+1Lk

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 12 / 24

SLIDE 23

Rational Krylov

µj

i = σi − σj.

Shift-and-invert-step: A − σ2B

        A0 A1 A2 A3 A4 A5 µ2

0I

I µ2

1I

I I µ2

3I

I µ2

4I

I        

−1 

       ⋆ ⋆         =         ⋆ ⋆ ⋆        

(A − σjB)−1B applied to vj respects the desired zero structure in the vectors This is rational Krylov with poles σ1, σ2, . . . Linear recurrence relation of the form: AVk+1Hk = BVk+1Lk

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 12 / 24

SLIDE 24

Rational Krylov

µj

i = σi − σj.

Shift-and-invert-step: A − σ3B

        A0 A1 A2 A3 A4 A5 µ3

0I

I µ3

1I

I µ3

2I

I I µ3

4I

I        

−1 

       ⋆ ⋆ ⋆         =         ⋆ ⋆ ⋆ ⋆        

(A − σjB)−1B applied to vj respects the desired zero structure in the vectors This is rational Krylov with poles σ1, σ2, . . . Linear recurrence relation of the form: AVk+1Hk = BVk+1Lk

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 12 / 24

SLIDE 25

Rational Krylov

µj

i = σi − σj.

Shift-and-invert-step: A − σ4B

        A0 A1 A2 A3 A4 A5 µ4

0I

I µ4

1I

I µ4

2I

I µ4

3I

I I        

−1 

       ⋆ ⋆ ⋆ ⋆         =         ⋆ ⋆ ⋆ ⋆ ⋆        

(A − σjB)−1B applied to vj respects the desired zero structure in the vectors This is rational Krylov with poles σ1, σ2, . . . Linear recurrence relation of the form: AVk+1Hk = BVk+1Lk

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 12 / 24

SLIDE 26

Rational Krylov

With constant continuation vector v1, (A − σ1B)−1Bv1, (A − σ2B)−1Bv1, . . . On each iteration, we must solve the system: A(σj)wj = [A(σ0), A(σj)]v1 So, only two matrices have to be stored: A(σ0) and A(σj). Note: [A(σ0), A(σj)] ≈ A′(σj) Recurrence relation has small residual norm (rounding errors). Caveat: choice of continuation vector may be unstable. When the σj become Ritz values or converged Ritz value show up again, be worried.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 13 / 24

SLIDE 27

Rational Krylov

Usual choice of continuation vector v1, (A − σ1B)−1Bv1, (A − σ2B)−1Bv2, . . . where vj is the last subspace vector. On each iteration, we must solve the system: A(σj)wj =

j

i=1

Aiwj−1,i So, k divided differences have to be stored or computed Note: j

i=1 Aiwj−1,i ≈ A′(σj)w0,i

Recurrence relation has small residual norm (rounding errors). Is known to be stable in practice.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 14 / 24

SLIDE 28

Newton basis for scalar equation

Compute roots of α0 + α1N1(z) + · · · + αdNd(z) = 0 Linearization in iteration k (pole σk):                          

α0 · · · αd · · · σ0 1 . . . ... 1 σd 1 ... ... . . . σk−1 1

         − λ        

1 ... 1 ... ... 1

                                 

x0 . . . xd . . . . . . xk

         = 0 The d roots λ are eigenvalues ∞ is a defective eigenvalue (multiplicity k + 1 − d)

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 15 / 24

SLIDE 29

Newton basis for scalar equation

RKS Recurrence relation: AVkHk − BVkKk = fkeT

k

with fk = (A − σkB)vk+1 = 0 In this case, Vk = Ik+1,k Last RKS iteration: (A − σkB)−1B = C dT

with

C ∈ Ck×k From recurrence relation: (A − σkB)−1BVk − VkHk(Kk − σkHk)−1 = vk+1gT

k

C − Hk(Kk − σkHk)−1 = Thus the Ritz values computed from Kkz = λHkz are roots of the polynomial RKS residual is not zero!

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 16 / 24

SLIDE 30

Lagrange polynomial basis

Lagrange polynomials: Lj := ωj

i=j

(λ − σi) , ωj =  

i=j

(σj − σi)  

−1

, j = 0, . . . , N A(λ) = A0L0 + A1L1 + · · · with

◮ Aj = A(σj), j = 0, . . . , N and ◮ σ0, σ1, . . . the interpolation points.

Linearization:

           A0 · · · AN σ0I −ω0 ... . . . σN−1I −ωNI      − λ       · · · · · · I . . . ... . . . I                  L0x L1x . . . LNx      = 0

A−1B is not a block upper Hessenberg matrix

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 17 / 24

SLIDE 31

Rational Krylov

Apply shift: (A − σjB)u = λBu has the following form:

                A0 A1 · · · AN ⋆ −ω0I . . . ... ⋆ −ωNI         − λ        · · · · · · I I ... . . . I                       x L1x L2x . . . LNx        = 0

(A − σjB)−1B applied to vj respects the desired zero structure in the vectors The method does not see the last column of A (ωj): does not work for finding roots of a polynomial.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 18 / 24

SLIDE 32

Chebyshev series

A(λ) = A0T0 + A1T1 + · · · Linearization:

                    −A0 −A1 · · · ⋆ I I I ... ... ... ... ... I I           − λ        · · · · · · · · · −AN I 2I ... . . . 2I                  x = 0

No block upper Hessenberg structure Arnoldi iterations require computation of A(0), A′(0) expressed in Chebyshev basis. i.e. moment matching requires the full Chebyshev expansion (use a functional framework [Jarlebring, Michiels, M. 2011]) Shifting as for Newton basis does not help.

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 19 / 24

SLIDE 33

Numerical example: Hopf bifurcation

Nonlinear equation: ˙ x = f (x, λ) Hopf bifurcation of dynamical system: A(λ)u = iµu with µ real λ is an eigenvalue of (A(λ) ⊗ I + I ⊗ A(λ)x = 0 Usually: continuation method for λ Modified Olmstead model (n = 40) of the form: (A + λEu + (λ + 1)−1Eu + (λ + 2)−1/2Ev)u = 0 Pick ten poles in a Leja fashion in [6, 10], then use Ritz value. For related work, see [Guckerheimer et al. 1993, 1997], [Govaerts, 2000] [M. & Spence 2010], [Elman, M., Spence, Wu, 2012], [Elman, Wu, 2012]

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 20 / 24

SLIDE 34

Hopf bifurcations

20 iterations rational Krylov λ B−1

N ANx − λx2

A(λ)x2 8.127 1.0 · 10−12 6.0 · 10−12 8.127 1.1 · 10−11 7.4 · 10−11 7.405 8.6 · 10−11 4.3 · 10−10 Finding more than three eigenvalues is not possible for this problem with this method

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 21 / 24

SLIDE 35

Gun problem

Problem from the NLEVP collection (K − λM + i

λ − σ2

1W1 + i

λ − σ2

2W2)x = 0

where K has order 9956 and rank(W1) + rank(W2) = 84. The goal is to compute all eigenvalues in a halve circle in the λ-plane We chose five interpolation points equally distributed in the λ-plane

100 150 200 250 300 350 20 40 60 80 100 120 Re √ λ Im √ λ 10 20 30 40 50 60 10−16 10−14 10−12 10−10 10−8 10−6 10−4 10−2 100 iteration relative residual norm E(λ, x)

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 22 / 24

SLIDE 36

Moment matching

Linear system: A(s)x = f Krylov methods match moments of the transfer function AN(s)x = f :

◮ Arnoldi on Companion form: subspace spanned by x(j)(0),

j = 0, . . . , N.

◮ Rational Krylov: subspace spanned by x(σj) = A−1

N (σj)f .

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 23 / 24

SLIDE 37

Conclusions

Also attend talk by Roel Van Beeumen (CP38, Thursday 17:50) Advantages of rational Krylov:

◮ Low degree polynomials ◮ With or without derivatives ◮ Exploit structure, use symbolic tools (See talk Roel Van Beeumen) ◮ Not slower than Newton

Disadvantages of rational Krylov:

◮ Derivatives required for computing many eigenvalues (Hermite

interpolation)

◮ Newton polynomials require divided differences

K. Meerbergen (KU Leuven)

Nonlinear Rational Krylov SIMAX – Valencia 24 / 24