A framework for deflated BiCG and related solvers Martin H. - - PowerPoint PPT Presentation

a framework for deflated bicg and related solvers
SMART_READER_LITE
LIVE PREVIEW

A framework for deflated BiCG and related solvers Martin H. - - PowerPoint PPT Presentation

http://www.sam. ma th. et hz .h /~ mh g A framework for deflated BiCG and related solvers Martin H. Gutknecht Seminar for Applied Mathematics, ETH Zurich SIAM Conference on Applied Linear Algebra Valencia, 20 June 2012 Joint work


slide-1
SLIDE 1

A framework for deflated BiCG and related solvers

Martin H. Gutknecht

Seminar for Applied Mathematics, ETH Zurich

http://www.sam. ma th. et hz . h /~ mh g

SIAM Conference on Applied Linear Algebra Valencia, 20 June 2012 Joint work with André Gaul, Jörg Liesen, Reinhard Nabben

slide-2
SLIDE 2

Augment./Deflat. History CG BiCG Conclusions

Outline

Augmentation and Deflation: Basics History Galerkin: CG, CR, GCR, ... Petrov-Galerkin: BICG, BICR Conclusions

M.H. Gutknecht SIAM-ALA12

  • p. 2
slide-3
SLIDE 3

Augment./Deflat. History CG BiCG Conclusions

Iterative methods based on (Petrov-)Galerkin conditions

To solve: Ax = b with A ∈ CN×N nonsingular. Construct sequence xn such that rn :≡ b − Axn → o . Choose xn from search space x0 + Sn such that some Galerkin or Petrov-Galerkin condition is satisfied: xn ∈ x0 + Sn , rn = A(x⋆ − xn) ⊥ BH Sn with some (formal) inner product matrix BH , i.e., rn ∈ r0 + ASn , rn ⊥ BH Sn . r0 is approximated from ASn such that “error” rn ⊥ Sn . Two cases: Galerkin: Sn = Sn , Petrov-Galerkin: Sn = Sn

B :≡ I for CG/BICG, B :≡ AH for CR/GCR, B :≡ A for BICR.

M.H. Gutknecht SIAM-ALA12

  • p. 3
slide-4
SLIDE 4

Augment./Deflat. History CG BiCG Conclusions

Augmentation and deflation

Ass.: know basis U of approximately A-invariant subspace U, i.e., U = R(U), where U ∈ CN×k full rank. Search space Sn and test space Sn are split up: xn ∈ x0 + Sn , rn ⊥ BH Sn , Sn :≡ Kn ⊕ U ,

  • Sn :≡

Ln ⊕ U ,

  • Kn :≡ Kn(

A, r0) :≡ span { r0, . . . , An−1 r0} . Still to be specified:

  • A,
  • r0,
  • Ln,
  • U.

Galerkin case: Ln :≡ Kn,

  • U :≡ U.

Petrov-Galerkin case: e.g., Ln :≡ Kn( AH,

  • r0),

but other options for Kn and Ln exist.

M.H. Gutknecht SIAM-ALA12

  • p. 4
slide-5
SLIDE 5

Augment./Deflat. History CG BiCG Conclusions

Rationale of augmentation and deflation

Ideally: columns of U ∈ CN×k span A-invariant subspace U belonging to eigenvalues close to 0 . Let Z :≡ AU , Z :≡ AU = U. Note: images of the restriction A−1

  • Z are trivial to compute:

if z = Zc ∈ Z , then A−1z = Uc . Choose projector P such that N(P) = Z . Split up space: CN = R(P) ⊕ Z. Choose A :≡ PA and r0 :≡ Pr0 so that Kn ⊆ R(P). Split up r0 accordingly: r0 =

  • r0
  • ∈ R(P)

+ r0 − r0 ∈ Z . A−1 (r0 − r0) is trivial;

  • A−1

r0 is found with Krylov space solver acting on R(P) .

M.H. Gutknecht SIAM-ALA12

  • p. 5
slide-6
SLIDE 6

Augment./Deflat. History CG BiCG Conclusions

Since N(P) = Z , we have N( A) = N(PA) = U . So the (absolutely) small eigenvalues of A represented by U that caused trouble are replaced by a k-fold EVal o in A (deflation). Projector P is not fully determined by its nullspace since it may be oblique. Hopefully, A

  • R(P) =

A

  • R(P) .

(1) If A is Hermitian and U is A-invariant, and if P is chosen such that R(P) = U⊥ or R(P) = Z⊥ , Eq. (1) holds. If R(P) = Z⊥ , P is an orthogonal projector.

M.H. Gutknecht SIAM-ALA12

  • p. 6
slide-7
SLIDE 7

Augment./Deflat. History CG BiCG Conclusions

How to find an approximately invariant subspace?

It may be known from a theoretical analysis of the problem. It may result from the solution of previous systems with the same A . ( linear system with multiple right-hand sides) It may results from the solution of previous systems with nearby A . It may results from previous cycles of the solution process (if the method is restarted). There are lots of examples in the literature.

M.H. Gutknecht SIAM-ALA12

  • p. 7
slide-8
SLIDE 8

Augment./Deflat. History CG BiCG Conclusions

Things to distinguish

Augmented bases: xn ∈ x0 + Kn( A, r0) + U , where

  • A = A
  • r

spec( A) ⊂ spec(A) ∪ {0} (Spectral) deflation: A A :≡ PA s.t. small EVals 0 EVal translation: A A :≡ AP s.t. small EVals |λ

max|

Krylov space recycling: choice of U based on prev. cycles Flexible KSS: adaptation of P at each restart Two basic approaches: Augmentation of basis with or without spectral deflation. EVal translation by suitable preconditioning (no deflation!).

M.H. Gutknecht SIAM-ALA12

  • p. 8
slide-9
SLIDE 9

Augment./Deflat. History CG BiCG Conclusions

History

Early contributions (many more papers appeared since): Nicolaides ’85/’87SINUM: deflated 3-term CG (w/augm. basis) Dostál ’87/’88IntJCompMath: deflated 2-term CG (w/augm. basis) Morgan ’93/’95SIMAX: GMRES with augmented basis de Sturler ’93/’96JCAM: inner-outer GMRES/GCR (and, briefly, inner-outer BiCGStab/GCR) with augmented basis Chapman / Saad ’95/’97NLAA GMRES with augmented basis Saad ’95/’97SIMAX Analysis of KSS with augmented basis de Sturler ’97/’99SINUM inner-outer GMRES/GCR w/truncation Vuik / Segal / Meijerink ’98/’99JCP 2-term CG w/augm. basis Bristeau / Erhel ’98/’98NumAlg CG with augmented basis Erhel / Guyomarc’h ’97/’00SIMAX defl. 2-term CG w/augm. basis Saad / Yeung / Erhel / Guyomarc’h ’98/’00SISC the same

M.H. Gutknecht SIAM-ALA12

  • p. 9
slide-10
SLIDE 10

Augment./Deflat. History CG BiCG Conclusions

Galerkin: CG, CR, GCR, ... : some details

Given: A, B ∈ CN×k and U ∈ CN×k Most relevant cases: B = I for CG, B = AH for CR, GCR E :≡ UHBAU ∈ Ck×k , assumed nonsingular, M :≡ UE−1UH , P :≡ I − AMB , projector onto (BH U)⊥ along Z , Q :≡ I − MBA , projector onto (AHBH U)⊥ along U ,

  • A :≡ PA = AQ = PAQ

with N( A) = U , R( A) = (BHU)⊥,

  • rn :≡ P(b − A

xn) = Pb − A xn ∈ (BHU)⊥ ,

  • Kn :≡ K(

A, r0) ⊆ (BHU)⊥.

M.H. Gutknecht SIAM-ALA12

  • p. 10
slide-11
SLIDE 11

Augment./Deflat. History CG BiCG Conclusions

An equivalence theorem

THEOREM

For n ≥ 1 the two pairs of conditions, xn ∈ x0 + Kn + U , rn ⊥ B Kn + B U , (2) and

  • xn ∈ x0 +

Kn ,

  • rn ⊥ B

Kn . (3) are equivalent in the sense that xn = Q xn + MBHb and rn = rn . (4)

  • DEF. The direct deflation approach is given by (2),

the indirect deflation approach is given by (3)–(4).

M.H. Gutknecht SIAM-ALA12

  • p. 11
slide-12
SLIDE 12

Augment./Deflat. History CG BiCG Conclusions

Computing xn satisfying (3) means solving the singular linear system

  • A

x = Pb with a Krylov space solver characterized by (3). What are the properties of A ? N( A) = U , R( A) = (BHU)⊥. If U is A-invariant, the corresp. EVals become 0. What can we say about the others? Consider partitioned Jordan decomposition of A A = SJS−1 =

  • S1

S2 J1 J2 SH

1

  • SH

2

  • ,

where S1, S1 ∈ CN×k, S2, S2 ∈ CN×(N−k) and either R(S1) = U or R( S1) = BHU.

M.H. Gutknecht SIAM-ALA12

  • p. 12
slide-13
SLIDE 13

Augment./Deflat. History CG BiCG Conclusions

Theorem

(1) If U = R(S1), if U ∈ CN×k is any matrix satisfying R(U) = U, and if UHBAU is nonsingular, then

  • A = PA =
  • U

PS2 J2 U PS2 −1 (5) with

  • U

PS2 −1 =

  • BHU(UHBHU)−1
  • S2

H . (2) If BHU = R( S1), if U ∈ CN×k is any matrix satisfying R(U) = U, and if UHBAU is nonsingular, then

  • A = PA =
  • U

S2 J2 U S2 −1 (6) with

  • U

S2 −1 =

  • BHU(UHBHU)−1

QH S2 H . In particular, in both cases the spectrum Λ( A) of A is given

M.H. Gutknecht SIAM-ALA12

  • p. 13
slide-14
SLIDE 14

Augment./Deflat. History CG BiCG Conclusions

Petrov-Galerkin: BICG, BICR, ...

Generalized BICG (GENBICG) [G. ’90(CopperMtn), ’97ActaNum] with formal inner product matrix B requires A and B to commute (to maintain short recurrences). For deflated solvers based on Petrov-Galerkin condition we need projectors and operators for creating split dual spaces: Sn :≡ Kn ⊕ U ,

  • Sn :≡

Ln ⊕ U . May consider solving two dual systems at once [Ahuja ’09Diss]: Ax = b , AH x = b such that xn ∈ x0 + Sn ,

  • xn ∈

x0 + Sn , rn ⊥ BH Sn ,

  • rn ⊥ BSn .

M.H. Gutknecht SIAM-ALA12

  • p. 14
slide-15
SLIDE 15

Augment./Deflat. History CG BiCG Conclusions

Petrov-Galerkin: Projectors and other operators

definition null space range further properties E

  • UHBAU

{0} Ck assumed to be nonsingular M UE−1 UH

  • U⊥

U rank M = k P I − AMB Z (BH U)⊥ P2 = P Q I − MBA U (AHBH U)⊥ Q2 = Q

  • A

PA U (BH U)⊥

  • A = PA = AQ = PAQ
  • E

UHBHAH U {0} Ck assumed to be nonsingular

  • M
  • U

E−1UH U⊥

  • U

rank M = k

  • P

I − AH MBH

  • Z

(BU)⊥

  • P2 =

P

  • Q

I − MBHAH

  • U

(ABU)⊥

  • Q2 =

Q

  • A
  • PAH
  • U

(BU)⊥

  • A =

PAH = AH Q = PAH Q

Krylov spaces used:

  • Kn :≡ Kn(

A, r0) ,

  • Ln :≡ Kn(

A, r0) .

M.H. Gutknecht SIAM-ALA12

  • p. 15
slide-16
SLIDE 16

Augment./Deflat. History CG BiCG Conclusions

THEOREM

For n ≥ 1 the two sets of conditions xn ∈ x0 + Kn + U , rn ⊥ BH( Ln + U) ,

  • xn ∈

x0 + Ln + U ,

  • rn ⊥ B(

Kn + U) and

  • xn ∈ x0 +

Kn ,

  • rn⊥ BH

Ln ,

  • xn ∈

x0 + Ln ,

  • rn ⊥ B

Kn are equivalent in the sense that xn = Q xn + MBb and rn = rn , (7)

  • xn =

Q

  • xn + MHBH

b and

  • rn =
  • rn .

(8)

M.H. Gutknecht SIAM-ALA12

  • p. 16
slide-17
SLIDE 17

Augment./Deflat. History CG BiCG Conclusions

Deflated BICG

For solving a single system Ax = b with deflated BICG the indirect approach requires to solve A x = Pb such that

  • xn ∈ x0 +

Kn ,

  • rn ⊥

Ln . This works with BICG if A = AH . Fortunately, we have:

THEOREM

If B = I, then PH = Q and QH = P , and therefore AH = A . It is not clear if one can obtain an equally efficient deflated BICR (where B = A). But BICGSTAB, IDR(s), etc. can be done.

M.H. Gutknecht SIAM-ALA12

  • p. 17
slide-18
SLIDE 18

Augment./Deflat. History CG BiCG Conclusions

Conclusions

We have set up a framework for deflated Krylov space solvers based on a Galerkin condition. We are developing an analogue framework for those based

  • n a Petrov-Galerkin condition.

By our indirect approach we obtain a deflated BICG with coupled two-term recurrences (or three-term recurrences) which reduces to deflated CG when applied to an Hpd problem:

  • If B = I, BICG applied to the projected problem
  • A

x = Pb yields xn, rn, and shadow residuals

  • rn.
  • The transformation (7) yields xn.

M.H. Gutknecht SIAM-ALA12

  • p. 18
slide-19
SLIDE 19

Augment./Deflat. History CG BiCG Conclusions

References

  • 1. A. Gaul, M.H.G., J. Liesen, R. Nabben, A framework for

deflated and augmented Krylov subspace methods. Submitted (Jan. 2011 / June 2012).

  • 2. A. Gaul, M.H.G., J. Liesen, R. Nabben, Deflated and

Augmented Krylov Subspace Methods: A Framework for Deflated BiCG and Related Methods. In preparation.

  • 3. M.H.G., Spectral Deflation in Krylov Solvers: a Theory of

Coordinate Space Based Methods. Electronic Trans.

  • Numer. Anal. 39, 156–185 (2012).

M.H. Gutknecht SIAM-ALA12

  • p. 19