Adaptive Coarse Spaces and Multiple Search Directions: Tools for - - PowerPoint PPT Presentation

adaptive coarse spaces and multiple search directions
SMART_READER_LITE
LIVE PREVIEW

Adaptive Coarse Spaces and Multiple Search Directions: Tools for - - PowerPoint PPT Presentation

Adaptive Coarse Spaces and Multiple Search Directions: Tools for Robust Domain Decomposition Algorithms Nicole Spillane Center for Mathematical Modelling at the Universidad de Chile in Santiago. July 9th, 2015 DD23 Jeju Island, Korea


slide-1
SLIDE 1

Adaptive Coarse Spaces and Multiple Search Directions: Tools for Robust Domain Decomposition Algorithms

Nicole Spillane

Center for Mathematical Modelling at the Universidad de Chile in Santiago.

July 9th, 2015 – DD23 Jeju Island, Korea

slide-2
SLIDE 2

Acknowledgements

Collaborators

◮ Pierre Gosselet (CNRS, ENS Cachan) ◮ Frédéric Nataf (CNRS, Université Pierre et Marie Curie) ◮ Daniel J. Rixen (University of Munich) ◮ François-Xavier Roux (ONERA, Université Pierre et Marie

Curie)

Funding

Financial support by CONICYT through project Fondecyt 3150090.

Nicole Spillane (U. de Chile) 2 / 33

slide-3
SLIDE 3

Balancing Domain Decomposition (BDD): Ku∗ = f (K spd)

Ω Γ Ωs Γs

  • J. Mandel.

Balancing domain decomposition.

  • Comm. Numer. Methods

Engrg., 9(3):233–241, 1993.

BDD reduces the problem to the interface Γ : Au∗,Γ = b, where A := KΓΓ − KΓIK−1

II KIΓ,

b := fΓ − KΓIK−1

II fI.

The operator A is a sum of local contributions : A =

N

  • s=1

Rs⊤SsRs, Ss := Ks

ΓsΓs − Ks ΓsIs(Ks IsIs)−1Ks IsΓs,

Rs : Γ → Γs. The preconditioner H also: H :=

N

  • s=1

Rs⊤DsSs†DsRs, with

N

  • s=1

Rs⊤DsRs = I. The coarse space is range(U) :=

N

  • s=1

Rs⊤DsKer(Ss).

Nicole Spillane (U. de Chile) 3 / 33

slide-4
SLIDE 4

Illustration of the Problem: Heterogeneous Elasticity

N = 81 subdomains, ν = 0.4, E1 = 107 and E2 = 1012

Young’s modulus E Metis Partition

20 40 60 80 100 120 140 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale)

Problem: We want to design new DD methods with three objectives:

◮ Reliability: robustness and scalability. ◮ Efficiency: adapt automatically to difficulty. ◮ Simplicity: non invasive implementation.

Nicole Spillane (U. de Chile) 4 / 33

slide-5
SLIDE 5

Illustration of the Problem: Heterogeneous Elasticity

N = 81 subdomains, ν = 0.4, E1 = 107 and E2 = 1012

Young’s modulus E Metis Partition

20 40 60 80 100 120 140 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Global Local Simultaneous PPCG GenEO

Problem: We want to design new DD methods with three objectives:

◮ Reliability: robustness and scalability. ◮ Efficiency: adapt automatically to difficulty. ◮ Simplicity: non invasive implementation.

Nicole Spillane (U. de Chile) 4 / 33

slide-6
SLIDE 6

Contents

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Nicole Spillane (U. de Chile) 5 / 33

slide-7
SLIDE 7

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Projected PCG [Nicolaides, 1987 – Dostál, 1988]

for Ax∗ = b preconditioned by H and projection Π

◮ Assume that A, H ∈ Rn×n are spd and U ∈ Rn×n0 is full rank, ◮ Define Π := I − U(U⊤AU)−1U⊤A.

1 x0 = U(U⊤AU)−1U⊤b;

Initial Guess

2 r0 = b − Ax0 ;

Initial residual

3 z0 = Hr0; 4 p0 =Πz0 ;

Initial search direction

5 for i = 0, 1, . . . , convergence do 6

qi = Api;

7

αi = (q⊤

i pi)−1(p⊤ i ri); 8

xi+1 = xi + αipi; Update approximate solution

9

ri+1 = ri − αiqi; Update residual

10

zi+1 = Hri+1; Precondition

11

βi = (q⊤

i pi)−1(q⊤ i zi+1); 12

pi+1 = Πzi+1 − βipi; Project and orthogonalize

13 end 14 Return xi+1;

Convergence

[Kaniel, 66 – Meinardus, 63]

x∗ − xiA x∗ − x0A 2 √κ − 1 √κ + 1 i

◮ κ = λmax

λmin ,

◮ λmax and λmin:

extreme eigenvalues

  • f HAΠ excluding 0.

→ GenEO is a choice of range(U) that guarantees fast convergence.

Nicole Spillane (U. de Chile) 6 / 33

slide-8
SLIDE 8

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Bibliography (1/2) (coarse spaces based on generalized eigenvalue

problems)

Multigrid

  • M. Brezina, C. Heberton, J. Mandel, and P. Vaněk. Technical Report 140, University of

Colorado Denver, April 1999. Additive Schwarz

  • J. Galvis and Y. Efendiev. Multiscale Model. Simul., 2010.
  • Y. Efendiev, J. Galvis, R. Lazarov, and J. Willems. ESAIM Math. Model. Numer. Anal., 2012.
  • F. Nataf, H. Xiang, V. Dolean, and N. S. SIAM J. Sci. Comput., 2011.
  • V. Dolean, F. Nataf, R. Scheichl, and N. S. Comput. Methods Appl. Math., 2012.
  • N. S., V. Dolean, P. Hauret, F. Nataf, C. Pechstein, and R. Scheichl. Numer. Math., 2014.

Optimized Schwarz Methods

  • S. Loisel, H. Nguyen, and R. Scheichl. Technical report, submited, 2015.
  • R. Haferssas, P. Jolivet, and F. Nataf. Technical report, Submited, hal-01100926, 2015.

Nicole Spillane (U. de Chile) 7 / 33

slide-9
SLIDE 9

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Bibliography (2/2) (coarse spaces based on generalized eigenvalue

problems)

Substructuring Methods P.E. Bjørstad, J. Koster and P. Krzyżanowski. Applied Parallel Computing, 2001.

  • J. Mandel and B. Sousedík. Comput. Methods Appl. Mech. Engrg., 2007.
  • B. Sousedík, J. Šístek, and J. Mandel. Computing, 2013
  • N. S. and D. J. Rixen. Int. J. Numer. Meth. Engng., 2013.
  • H. H. Kim and E. T. Chung, Model. Simul., 2015.
  • A. Klawonn, P. Radtke, and O. Rheinbach. DD22 Proceedings, 2014.
  • A. Klawonn, P. Radtke, and O. Rheinbach. SIAM J. Numer. Anal., 2015.

And many other talks at this conference: ◮ Minisymposium on monday: Olof B. Widlund, Clark R. Dohrmann (with Clemens Pechstein), ◮ Pierre Jolivet (HPDDM https://github.com/hpddm), ◮ Frédéric Nataf’s plenary talk tomorrow ! ◮ Session on friday morning (CT 7) !

Nicole Spillane (U. de Chile) 8 / 33

slide-10
SLIDE 10

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Adaptive Coarse Space: Strategy (1/5)

Abstract Framework ([Toselli, Widlund (2005)]) DD method dependant Problem dependant

(1) λmin > c if stable decomposition (2) λmax < C if local solver is stable (1) holds (2) holds Prove (2) Prove (1) FETI BDD Additive Schwarz

Use trace theorems, Poincaré in- equalities, Korn inequalities...

Relative error bounded w.r.t C/c

[N. S., D. J. Rixen, 2013] [ N. S., V. Dolean, P. Hauret, F. Nataf, C. Pechstein, and R. Scheichl, 2014] Nicole Spillane (U. de Chile) 9 / 33

slide-11
SLIDE 11

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Adaptive Coarse Space: Strategy (2/5)

Abstract Framework ([Toselli, Widlund (2005)]) DD method dependant Problem dependant

(1) λmin > c if stable decomposition (2) λmax < C if local solver is stable (1) holds (2) holds Prove (2) Prove (1) FETI BDD Additive Schwarz

Use trace theorems, Poincaré in- equalities, Korn inequalities...

Relative error bounded w.r.t C/c

[N. S., D. J. Rixen, 2013] [ N. S., V. Dolean, P. Hauret, F. Nataf, C. Pechstein, and R. Scheichl, 2014] Nicole Spillane (U. de Chile) 10 / 33

slide-12
SLIDE 12

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Adaptive Coarse Space: Strategy (3/5)

Abstract Framework ([Toselli, Widlund (2005)]) DD method dependant Problem dependant

  • In

(1) λmin > c if stable decomposition (2) λmax < C if local solver is stable (1) holds (2) holds Prove (2) Prove (1) FETI BDD Additive Schwarz

Use trace theorems, Poincaré in- equalities, Korn inequalities...

Relative error bounded w.r.t C/c Make (1) or (2) local: xs⊤Ms xs τ xs⊤Ns xs for all xs. (Botleneck Estimate)

[N. S., D. J. Rixen, 2013] [ N. S., V. Dolean, P. Hauret, F. Nataf, C. Pechstein, and R. Scheichl, 2014] Nicole Spillane (U. de Chile) 11 / 33

slide-13
SLIDE 13

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Adaptive Coarse Space: Strategy (4/5)

Abstract Framework ([Toselli, Widlund (2005)]) DD method dependant Problem dependant

  • In

(1) λmin > c if stable decomposition (2) λmax < C if local solver is stable (1) holds (2) holds Prove (2) Prove (1) FETI BDD Additive Schwarz

Use trace theorems, Poincaré in- equalities, Korn inequalities...

Relative error bounded w.r.t C/c Solve Ms xs

k = λs kNs xs k.

botleneck estimate xs⊤Ms xs τ xs⊤Ns xs

◮ holds in span{xs k; λs k τ}, ◮ does not hold in span{xs k; λs k > τ}. [N. S., D. J. Rixen, 2013] [ N. S., V. Dolean, P. Hauret, F. Nataf, C. Pechstein, and R. Scheichl, 2014] Nicole Spillane (U. de Chile) 12 / 33

slide-14
SLIDE 14

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Adaptive Coarse Space: Strategy (5/5)

Abstract Framework ([Toselli, Widlund (2005)]) DD method dependant Problem dependant

  • In

(1) λmin > c if stable decomposition (2) λmax < C if local solver is stable (1) holds (2) holds Prove (2) Prove (1) FETI BDD Additive Schwarz

Use trace theorems, Poincaré in- equalities, Korn inequalities...

Relative error bounded w.r.t τ and number

  • f neighbours N

Solve Ms xs

k = λs kNs xs k.

botleneck estimate xs⊤Ms xs τ xs⊤Ns xs

◮ holds in span{xs k; λs k τ}, ◮ does not hold in span{xs k; λs k > τ}.

Π: A-orthogonal projection

  • nto the space where the

botleneck estimate holds Ax∗ = b is equivalent to : (I − Π)⊤Ax∗ = (I − Π)⊤b

  • Direct Solver

and Π⊤Ax∗ = Π⊤b.

  • DD method

[N. S., D. J. Rixen, 2013] [ N. S., V. Dolean, P. Hauret, F. Nataf, C. Pechstein, and R. Scheichl, 2014] Nicole Spillane (U. de Chile) 13 / 33

slide-15
SLIDE 15

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

GenEO Coarse Space for BDD

◮ λmin 1. ◮ the botleneck estimate for λmax is

xs⊤RsARs⊤xs (1/τ) xs⊤Ds−1SDs−1xs. So we solve in each subdomain : Ds−1SsDs−1 xs

k = λs k RsARs⊤ xs k,

and define the coarse space as range(U) = span{Rs⊤xs

k; s = 1, . . . , N and λs k τ}.

The the effective condition number is bounded by : κ(HAΠ) N τ ; N: number of neighbours of a subdomain.

Nicole Spillane (U. de Chile) 14 / 33

slide-16
SLIDE 16

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration: Heterogeneous Elasticity

N = 81 subdomains, ν = 0.4, E1 = 107 and E2 = 1012, τ = 0.1

Young’s modulus E Metis Partition

20 40 60 80 100 120 140 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) BDD BDD + GenEO

Size of the coarse space: n0 = 349 including 212 rigid body modes.

Nicole Spillane (U. de Chile) 15 / 33

slide-17
SLIDE 17

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Conclusion for GenEO and introduction for MPCG

◮ Convergence guaranteed in few iterations, ◮ This is achieved with local contributions to the coarse space.

→ Only drawback could be the cost of the eigensolves. Instead of precomputing a coarse space, take advantage of the local components already being computed: H :=

N

  • s=1

Rs⊤DsSs†DsRs

  • :=Hs

.

  • R. Bridson and C. Greif.

SIAM J. Matrix Anal. Appl., 27(4):1056–1068, 2006.

  • C. Greif, T. Rees, and D. Szyld.

DD 21 proceedings, 2014.

  • D. J. Rixen.

PhD thesis, Université de Liège, Belgium, Collection des Publications de la Faculté des Sciences appliquées, n.175, 1997.

  • P. Gosselet, D. J. Rixen, F.-X. Roux, and N. S..

International Journal for Numerical Methods in Engineering, Currently Published Online, 2015. Nicole Spillane (U. de Chile) 16 / 33

slide-18
SLIDE 18

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Multi Preconditioned CG for Ax∗ = b prec. by {Hs}s=1,...,N and Π

◮ A, H ∈ Rn×n spd ◮ U ∈ Rn×n0 full rank, ◮ H = N

s=1 Hs, where Hs spsd.

MPCG

1 x0 = U(U⊤AU)−1U⊤b;

Initial Guess

2 r0 = b − Ax0; 3 Z0 =

  • H1r0 | . . . | HNri+1
  • ;

4 P0 = ΠZ0;

Initial search directions

5 for i = 0, 1, . . . , convergence do 6

Qi = APi;

7

αi = (Q⊤

i Pi)† (Pi⊤ri); 8

xi+1 = xi + Piαi; Update approximate solution

9

ri+1 = ri − Qiαi; Update residual

10

Zi+1 =

  • H1ri+1 | . . . | HNri+1
  • ; Multi Precondition

11

βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1),

j = 0, . . . , i;

12

Pi+1 = ΠZi+1 − i

j=0 Pjβi,j;

Project and orthog.

13 end 14 Return xi+1;

Remark

ri, xi ∈ Rn Zi, Pi, Qi ∈ Rn×N βi,j ∈ RN×N αi ∈ RN ◮ n: size of problem, ◮ N: nb of precs.

Properties

  • 1. P⊤

i APj = 0 (i = j).

  • 2. r⊤

i Pj = 0 (j < i).

  • 3. no short recurrence.
  • 4. x∗ − xi+1A =

min{x∗ − xA; x ∈ xi + range(Pi)}.

Nicole Spillane (U. de Chile) 17 / 33

slide-19
SLIDE 19

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Multi Preconditioned CG for Ax∗ = b prec. by {Hs}s=1,...,N and Π

◮ A, H ∈ Rn×n spd ◮ U ∈ Rn×n0 full rank, ◮ H = N

s=1 Hs, where Hs spsd.

MPCG

1 x0 = U(U⊤AU)−1U⊤b;

Initial Guess

2 r0 = b − Ax0; 3 Z0 =

  • H1r0 | . . . | HNri+1
  • ;

4 P0 = ΠZ0;

Initial search directions

5 for i = 0, 1, . . . , convergence do 6

Qi = APi;

7

αi = (Q⊤

i Pi)† (Pi⊤ri); 8

xi+1 = xi + Piαi; Update approximate solution

9

ri+1 = ri − Qiαi; Update residual

10

Zi+1 =

  • H1ri+1 | . . . | HNri+1
  • ; Multi Precondition

11

βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1),

j = 0, . . . , i;

12

Pi+1 = ΠZi+1 − i

j=0 Pjβi,j;

Project and orthog.

13 end 14 Return xi+1;

PPCG x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0 ; z0 = Hr0; p0 =Πz0; for i = 0, 1, . . . , conv. do qi = Api; αi = (q⊤

i pi)−1(p⊤ i ri);

xi+1 = xi + αipi ; ri+1 = ri − αiqi; zi+1 = Hri+1; βi = (q⊤

i pi)−1(q⊤ i zi+1);

pi+1 = Πzi+1 − βipi; end Return xi+1;

Nicole Spillane (U. de Chile) 17 / 33

slide-20
SLIDE 20

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (Heterogeneous Elasticity E2/E1 = 105)

20 40 60 80 100 120 140 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Simultaneous (MPCG) PPCG GenEO 20 40 60 80 100 120 140 500 1,000 1,500 Iterations Dimension of the minimization space Simultaneous (MPCG) PPCG GenEO 20 40 60 80 100 120 140 0.5 1 1.5 2 2.5 ·104 Iterations Number of local solves Simultaneous (MPCG) PPCG Nicole Spillane (U. de Chile) 18 / 33

slide-21
SLIDE 21

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Simultaneous FETI: Tests with CPU time (F.-X. Roux)

100 subdomains, 17 × 106 dofs, ν = 0.45 or 0.4999, 1 E 105. 2.6 GHz 8-core Xeon processors, Intel fortran, MKL-pardiso. Checker Slices 1 Slices 2

Decomp. ν Solver #it dim Max solves Time (s) Slices 1 0.45 FETI > 800 > 800 > 1600 > 7300 ✗ S-FETI 48 4800 192 493 ✓ Slices2 0.45 FETI 409 409 818 1979 ✓ S-FETI 36 3600 144 363 ✓ Checker 0.4999 FETI 233 233 466 991 ✓ S-FETI 46 4600 276 320 ✓ Slices 1 0.4999 FETI > 800 > 800 > 1600 > 7300 ✗ S-FETI 152 15200 608 4653 ✓ Slices2 0.4999 FETI > 800 > 800 > 1600 > 7300 ✗ S-FETI 144 14400 576 4455 ✓

Nicole Spillane (U. de Chile) 19 / 33

slide-22
SLIDE 22

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Good convergence but two possible limitations

✓ Local contributions Hsri form a good minimization space. ✗ Cost of inverting P⊤

i APi ∈ RN×N at each iteration in

αi = (P⊤

i APi)† (Pi⊤ri) and βi,j = (P⊤ j APj)†(Q⊤ j Zi+1).

✗ Simultaneous BDD may work ‘too hard’ on ‘easy’ problems.

2 4 6 8 10 12 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Simultaneous (MPCG) PPCG GenEO 2 4 6 8 10 12 1,000 2,000 3,000 4,000 Iterations Number of local solves Simultaneous (MPCG) PPCG

Introduce adaptativity into multipreconditioned CG. From the GenEO section we know that:

◮ a few local vectors should suffice to accelerate convergence. ◮ they are the low frequency eigenvectors of:

Ds−1SsDs−1 xs

k = λs k RsARs⊤ xs k.

Nicole Spillane (U. de Chile) 20 / 33

slide-23
SLIDE 23

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Adaptive Multi Preconditioned CG for Ax∗ = b preconditioned

by N

s=1 Hs and projection Π. (τ ∈ R+ is chosen by the user)

1 x0 = U(U⊤AU)−1U⊤b; 2 r0 = b − Ax0; Z0 = Hr0; P0 = ΠZ0; 3 for i = 0, 1, . . . , convergence do 4

Qi = APi;

5

αi = (Q⊤

i Pi)† (Pi⊤ri); 6

xi+1 = xi + Piαi ;

7

ri+1 = ri − Qiαi ;

8

ti = (Piαi)⊤A(Piαi) r⊤

i+1Hri+1

;

9

if ti < τ then τ-test

10

Zi+1 =

  • H1ri+1 | . . . | HNri+1
  • 11

else

12

Zi+1 = Hri+1;

13

end

14

βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1),

j = 0, . . . , i;

15

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j;

16 end 17 Return xi+1;

Remark

xi, ri ∈ Rn, Zi, Pi, Qi ∈ Rn×N or Rn, αi ∈ RN or R, βi,j ∈ RN×N, RN, R1×N or R, Piαi ∈ Rn.

◮ n: size of problem, ◮ N: nb of precs.

Nicole Spillane (U. de Chile) 21 / 33

slide-24
SLIDE 24

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Theoretical Result (1/3): PPCG like Properties

x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0; P0 = ΠZ0; for i = 0, 1, . . . , convergence do Qi = APi; αi = (Q⊤

i Pi)† (Pi⊤ri);

xi+1 = xi + Piαi ; ri+1 = ri − Qiαi ; ti = (Piαi)⊤A(Piαi) r⊤

i+1Hri+1

; if ti < τ then Zi+1 =

  • H1ri+1 | . . . | HN ri+1
  • else

Zi+1 = Hri+1; end βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1);

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j; end Return xi+1;

Remark

No short recurrence property as soon as the minimization space has been augmented.

Theorem

◮ xn−n0 = x∗ . ◮ Blocs of search directions are

pairwise A-orthogonal: P⊤

j APi = 0 (i = j). ◮ Residuals are pairwise

H-orthogonal: Hrj, ri = 0 (i = j).

x∗ − xiA = min

  • x∗ − xA; x ∈ range(U) + i−1

j=0 range(Pj)

  • .

Nicole Spillane (U. de Chile) 22 / 33

slide-25
SLIDE 25

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Theoretical Result (2/3): Two types of iterations

x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0; P0 = ΠZ0; for i = 0, 1, . . . , convergence do Qi = APi; αi = (Q⊤

i Pi)† (Pi⊤ri);

xi+1 = xi + Piαi ; ri+1 = ri − Qiαi ; ti = (Piαi)⊤A(Piαi) r⊤

i+1Hri+1

; if ti < τ then Zi+1 =

  • H1ri+1 | . . . | HN ri+1
  • else

Zi+1 = Hri+1; end βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1);

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j; end Return xi+1;

Theorem

If the τ–test returns ti−1 τ then x∗ − xiA x∗ − xi−1A

  • 1

1 + λmin τ 1/2 , λmin = 1 for BDD.

Nicole Spillane (U. de Chile) 23 / 33

slide-26
SLIDE 26

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Theoretical Result (2/3): Two types of iterations

x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0; P0 = ΠZ0; for i = 0, 1, . . . , convergence do Qi = APi; αi = (Q⊤

i Pi)† (Pi⊤ri);

xi+1 = xi + Piαi ; ri+1 = ri − Qiαi ; ti = (Piαi)⊤A(Piαi) r⊤

i+1Hri+1

; if ti < τ then Zi+1 =

  • H1ri+1 | . . . | HN ri+1
  • else

Zi+1 = Hri+1; end βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1);

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j; end Return xi+1;

Theorem

If the τ–test returns ti−1 τ then x∗ − xiA x∗ − xi−1A

  • 1

1 + λmin τ 1/2 , λmin = 1 for BDD.

Proof

x∗ = x0 +

n−n0

  • i=0

Piαi = xi +

n−n0

  • j=i

Pjαj ⇒x∗ − xi2

A = n−n0

  • j=i

Pjαj2

A

⇒x∗ − xi−12

A = x∗ − xi2 A + Pi−1αi−12 A.

(Similar to [Axelsson and Kaporin, 2001].) x∗ − xi−12

A

x∗ − xi2

A

= 1 + ri2

H

di2

A

= di2

AHA/di2 A

Pi−1αi−12

A

ri2

H

1 + λmin Pi−1αi−12

A

ri2

H

  • =ti−1

1 + λminτ.

Nicole Spillane (U. de Chile) 23 / 33

slide-27
SLIDE 27

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Theoretical Result (3/3): No extra work for ‘easy’ pbs.

x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0; P0 = ΠZ0; for i = 0, 1, . . . , convergence do Qi = APi; αi = (Q⊤

i Pi)† (Pi⊤ri);

xi+1 = xi + Piαi ; ri+1 = ri − Qiαi ; ti = (Piαi)⊤A(Piαi) r⊤

i+1Hri+1

; if ti < τ then Zi+1 =

  • H1ri+1 | . . . | HN ri+1
  • else

Zi+1 = Hri+1; end βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1);

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j; end Return xi+1;

Theorem

If λmax(HA) 1/τ then the algorithm is the usual PPCG.

Nicole Spillane (U. de Chile) 24 / 33

slide-28
SLIDE 28

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Theoretical Result (3/3): No extra work for ‘easy’ pbs.

x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0; P0 = ΠZ0; for i = 0, 1, . . . , convergence do Qi = APi; αi = (Q⊤

i Pi)† (Pi⊤ri);

xi+1 = xi + Piαi ; ri+1 = ri − Qiαi ; ti = (Piαi)⊤A(Piαi) r⊤

i+1Hri+1

; if ti < τ then Zi+1 =

  • H1ri+1 | . . . | HN ri+1
  • else

Zi+1 = Hri+1; end βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1);

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j; end Return xi+1;

Theorem

If λmax(HA) 1/τ then the algorithm is the usual PPCG.

Proof

We begin with ri = ri−1 − Qi−1αi−1 and take the inner product by Hri: Hri, ri =✘✘✘

Hri, ri−1 − Hri, APi−1αi−1 ri1/2

HAH Pi−1αi−11/2 A

(C.S.),

  • r equivalently:

Hri, ri Hri, AHri Pi−1αi−1, APi−1αi−1 Hri, ri . Finally: τ 1 λmax

  • Hri, ri

Hri, AHri Pi−1αi−1, APi−1αi−1 Hri, ri = ti−1.

Nicole Spillane (U. de Chile) 24 / 33

slide-29
SLIDE 29

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Local Adaptive MPCG for N

s=1 Asx∗ = b preconditioned by

N

s=1 Hs and projection Π. (τ ∈ R+ is chosen by the user)

1 x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0;

P0 = ΠZ0;

2 for i = 0, 1, . . . , convergence do 3

Qi = APi;

4

αi = (Q⊤

i Pi)† (Pi⊤ri); 5

xi+1 = xi + Piαi ;

6

ri+1 = ri − Qiαi ;

7

Zi+1 = Hri+1; initialize Zi+1

8

for s = 1, . . . , N do

9

ts

i = Piαi, AsPiαi

r⊤

i+1Hsri+1

;

10

if ts

i < τ then

local τ-test

11

Zi+1 = [Zi+1 | Hsri+1];

12

end

13

end

14

βi,j = (Q⊤

j Pj)†(Q⊤ j Zi+1),

j = 0, . . . , i;

15

Pi+1 = ΠZi+1 −

i

  • j=0

Pjβi,j;

16 end 17 Return xi+1;

◮ A = N s=1 Rs⊤SsRs

  • :=As

.

◮ Between 1 and N

search directions per iteration.

◮ Reduces the cost of

inverting P⊤

i APi (one

limitation of MPCG)

Theorem

If the τ–test returns ts

i−1 τ

for all s = 1, . . . , N then x∗ − xiA x∗ − xi−1A

  • 1

1 + λmin τ 1/2 .

Nicole Spillane (U. de Chile) 25 / 33

slide-30
SLIDE 30

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Implementation for BDD: Saving local solves

1 x0 = U(U⊤AU)−1U⊤b; r0 = b − Ax0; Z0 = Hr0; ˜

P0 = Z0;

2 for i = 0, 1, . . . , convergence do 3

Qi = N

s=1 Qs i where Qs i = AsZi − AsU(U⊤AU)−1(AU)⊤Zi − i−1

  • j=0

Qs

j βi−1,j; 4

αi = (Q⊤

i Pi)† (Pi⊤ri); xi+1 = xi + Π˜

Piαi ; ri+1 = ri − Qiαi ;

5

Zi+1 = Hri+1;

6

for s = 1, . . . , N do

7

ts

i = Π˜

Piαi, Qs

i αi

r⊤

i+1Hsri+1

;

8

if ts

i < τ then 9

Zi+1 = [Zi+1 | Hsri+1];

10

Subtract Hsri+1 from the first column in Zi+1;

11

end

12

end

13

βi,j = (Q⊤

i Pi)†(Qj⊤Zi+1) (j = 0, . . . , i); ˜

Pi+1 = Zi+1 −

i

  • j=0

˜ Pjβi,j;

14 end 15 Return xi+1;

Same ‘trick’ as in :

  • P. Gosselet, D. J. Rixen, F.-X. Roux, and N. S.

Simultaneous FETI and block FETI: Robust domain decomposition with multiple search directions. International Journal for Numerical Methods in Engineering, 2015. Nicole Spillane (U. de Chile) 26 / 33

slide-31
SLIDE 31

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (1/4): Homogenous subdomains

2 4 6 8 10 12 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Global Local Simultaneous PPCG GenEO 2 4 6 8 10 12 200 400 600 800 Iterations Dimension of the minimization space Global Local Simultaneous PPCG GenEO 2 4 6 8 10 12 1,000 2,000 3,000 4,000 Iterations Number of local solves Global Local Simultaneous PPCG Nicole Spillane (U. de Chile) 27 / 33

slide-32
SLIDE 32

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (2/4): Metis subdomains

20 40 60 80 100 120 140 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Global Local Simultaneous PPCG GenEO 20 40 60 80 100 120 140 500 1,000 1,500 Iterations Dimension of the minimization space Global Local Simultaneous PPCG GenEO 20 40 60 80 100 120 140 0.5 1 1.5 2 2.5 ·104 Iterations Number of local solves Global Local Simultaneous PPCG Nicole Spillane (U. de Chile) 28 / 33

slide-33
SLIDE 33

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (2/4): Zoom on new methods

5 10 15 20 25 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Global Local Simultaneous GenEO

Nicole Spillane (U. de Chile) 29 / 33

slide-34
SLIDE 34

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (3/4): Channels

100 200 300 400 500 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Global Local Simultaneous PPCG GenEO 100 200 300 400 500 500 1,000 1,500 2,000 2,500 Iterations Dimension of the minimization space Global Local Simultaneous PPCG GenEO 100 200 300 400 500 2 4 6 8 ·104 Iterations Number of local solves Global Local Simultaneous PPCG Nicole Spillane (U. de Chile) 30 / 33

slide-35
SLIDE 35

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (3/4): Zoom on new methods

10 20 30 40 10−7 10−6 10−5 10−4 10−3 10−2 10−1 100 Iterations Error (log scale) Global Local Simultaneous GenEO

Nicole Spillane (U. de Chile) 31 / 33

slide-36
SLIDE 36

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (4/4)

Variable heterogeneity for N = 81 (k-scaling) : Global τ–test Local τ–test Simultaneous usual BDD GenEO E2/E1 it solves it solves it solves it solves it 1 26 4624 25 4602 14 7212 36 5832 23 10 26 5036 28 5213 14 7212 44 7128 23 102 30 6096 25 5164 15 7786 76 12312 21 103 23 5374 25 5133 16 8360 126 20412 22 104 22 5212 25 5176 16 8360 139 22518 22 105 22 5212 24 5041 16 8360 141 22842 23 dim < 554 dim < 423 dim < 1428 dim < 353 dim < 372 Variable number of subdomains for E2/E1 = 105 (k scaling) : Global τ–test Local τ–test Simultaneous usual BDD GenEO N it solves it solves it solves it solves it 25 20 1784 22 1447 17 2530 69 3450 20 36 24 2392 23 2150 16 3476 87 6264 20 49 20 3364 24 3146 16 4844 110 10780 20 64 21 5264 24 4137 17 7006 152 19456 20 dim < 693 dim < 379 dim < 1193 dim < 320 dim < 327

Nicole Spillane (U. de Chile) 32 / 33

slide-37
SLIDE 37

Adaptive Coarse Spaces (GenEO) Multi Preconditioned CG (Simultaneous BDD) Adaptive Multi Preconditioned CG

Numerical Illustration (4/4)

Variable heterogeneity for N = 81 (k-scaling) : Global τ–test Local τ–test Simultaneous usual BDD GenEO E2/E1 it solves it solves it solves it solves it 1 26 4624 25 4602 14 7212 36 5832 23 10 26 5036 28 5213 14 7212 44 7128 23 102 30 6096 25 5164 15 7786 76 12312 21 103 23 5374 25 5133 16 8360 126 20412 22 104 22 5212 25 5176 16 8360 139 22518 22 105 22 5212 24 5041 16 8360 141 22842 23 dim < 554 dim < 423 dim < 1428 dim < 353 dim < 372 Variable number of subdomains for E2/E1 = 105 (k scaling) : Global τ–test Local τ–test Simultaneous usual BDD GenEO N it solves it solves it solves it solves it 25 20 1784 22 1447 17 2530 69 3450 20 36 24 2392 23 2150 16 3476 87 6264 20 49 20 3364 24 3146 16 4844 110 10780 20 64 21 5264 24 4137 17 7006 152 19456 20 dim < 693 dim < 379 dim < 1193 dim < 320 dim < 327

Nicole Spillane (U. de Chile) 32 / 33

slide-38
SLIDE 38

Conclusion

Reliability, Efficiency and Simplicity through adaptive coarse spaces and adaptive multiple search directions.

Perspectives

◮ Test on industrial test cases and measure CPU times. ◮ Use the τ–test to recycle part of the minimization spaces. ◮ Simultaneous BDD: Theoretical Analysis at the PDE level. ◮ Adaptive Multi PCG for other DD methods: when λmax is

known or neither λmin or λmax.

  • N. S., D. J. Rixen,

Automatic spectral coarse spaces for robust FETI and BDD algorithms. IJNME, 2013.

  • P. Gosselet, D. J. Rixen, F.-X. Roux, and N. S.

Simultaneous FETI and block FETI: Robust domain decomposition with multiple search directions. International Journal for Numerical Methods in Engineering, 2015.

  • N. S.

Adaptive Multi Preconditioned Conjugate Gradient: Algorithm, Theory and an Application to Domain Decomposition Submited, Preprint available online <hal-01170059>, 2015. Downloadable at http://www.ann.jussieu.fr/~spillane/ Nicole Spillane (U. de Chile) 33 / 33