Challenges in achieving scalable and robust linear solvers L. - PowerPoint PPT Presentation

Challenges in achieving scalable and robust linear solvers L. Grigori Alpines Inria Paris and LJLL, Sorbonne University with H. Al Daas, MPI Magdebourg P. Jolivet, CNRS IRIT P. H. Tournier, LJLL, CNRS, Alpines, Sorbonne University September 18, 2019

Plan Motivation of our work Recap on Additive Schwarz methods A robust multilevel additive Schwarz preconditioner Theory of a class of robust two level methods in algebraic setting Extension to multilevel methods Enlarged Krylov methods Conclusions 2 of 49

Motivation of our work Challenge in getting scalable and robust solvers On large scale computers, Krylov solvers reach less than 2% of the peak performance. � Typically, each iteration of a Krylov solver requires � Sparse matrix vector product → point-to-point communication � Dot products for orthogonalization → global communication � When solving complex linear systems arising, e.g. from large discretized systems of PDEs with strongly heterogeneous coefficients most of the highly parallel preconditioners lack robustness � wrt jumps in coefficients / partitioning into irregular subdomains, e.g. one level DDM methods (Additive Schwarz, RAS) � A few small eigenvalues hinder the convergence of iterative methods 3 of 49

Motivation of our work Can we have both scalable and robust methods ? Difficult ... but crucial ... since complex and large scale applications very often challenge existing methods Focus on increasing scalability by reducing coummunication/increasing arithmetic intensity while preserving robustness/dealing with small eigenvalues. � Robust preconditioners that guarantee the condition number of preconditioned matrix � Robust multilevel Additive Schwarz, using Geneo framework � Enlarged Krylov methods � reduce communication, � increase arithmetic intensity - compute sparse matrix-set of vectors product. 4 of 49

Recap on Additive Schwarz methods Notations Solve M − 1 Ax = M − 1 b , where A ∈ R n × n is SPD Notations: � DOFs partitioned into { Ω 1 j } N 1 j =1 overlapping domains of dimensions n 11 , n 12 , . . . n 1 , N 1 � R 1 j ∈ R n 1 j × n restriction operator, R 1 j = I n (Ω 1 j , :) � A 1 j ∈ R n 1 j × n 1 j : restriction of A to domain j , A 1 j = R 1 j AR T 1 j � { D 1 j } N 1 j =1 : algebraic partition of unity , I n = � N 1 j =1 R T 1 j D 1 j R 1 j 1 1 1 1/2 1 1 1/2 1 A 1 5 of 49

Recap on Additive Schwarz methods Additive and Restrictive Additive Schwarz methods � Original idea from Schwarz algorithm at the continuous level (Schwarz 1870) � Symmetric formulation, Additive Schwarz (1989) defined as N 1 � M − 1 1 j A − 1 R T AS := 1 j R 1 j j =1 � Restricted Additive Schwarz (Cai & Sarkis 1999) defined as � N 1 M − 1 R T 1 j D 1 j A − 1 RAS := 1 j R 1 j j =1 � In practice, RAS more efficient than AS 6 of 49

Recap on Additive Schwarz methods Relation between IC0 and RAS � Consider an Alternating Min Max layers ordering for IC0 � Duplicate data on domain j , include all DOFs at distance 2 plus a constant number of other DOFs. 13 14 15 16 17 18 19 20 2 47 97 52 63 64 65 66 67 68 69 70 21 22 23 24 25 26 27 28 3 48 98 53 71 72 73 74 75 76 77 78 29 30 31 32 33 34 35 36 4 49 99 54 79 80 81 82 83 84 85 86 � With L j L T j the IC0 factor of 5 6 7 8 9 10 11 12 1 46 96 51 55 56 57 58 59 60 61 62 38 39 40 41 42 43 44 45 37 50 100 87 88 89 90 91 92 93 94 95 domain j , IC0 preconditioner is 138 139 140 141 142 143 144 145 137 150 200 187 188 189 190 191 192 193 194 195 105 106 107 108 109 110 111 112 101 146 196 151 155 156 157 158 159 160 161 162 113 114 115 116 117 118 119 120 102 147 197 152 163 164 165 166 167 168 169 170 121 122 123 124 125 126 127 128 103 148 198 153 171 172 173 174 175 176 177 178 � N 1 129 130 131 132 133 134 135 136 104 149 199 154 179 180 181 182 183 184 185 186 M − 1 R T 1 j D 1 j ( L j L T j ) − 1 R 1 j IC 0 := j =1 + ghost data for + ghost data for backward substitution forward substitution � For structured 2D grids, RAS with IC0 in subdomains and overlap 2 similar to IC0 (modulo a constant number of extra DOFs per subdomain) � with S. Moufawad and S. Cayrols (proofs in their Phds thesis) 7 of 49

Recap on Additive Schwarz methods Upper bound for the eigenvalues of M − 1 AS , 1 A Let k 1 c be number of distinct colours to colour the subdomains of A . The following holds (e.g. Chan and Mathew 1994) λ max ( M − 1 AS , 1 A ) ≤ k 1 c → Two level preconditioners are required 8 of 49

Recap on Additive Schwarz methods Two level preconditioners Given a coarse subspace S 1 , S 1 = span ( V 1 ), V 1 ∈ R n × n 2 , the coarse grid A 2 = V T 1 AV 1 . the two level AS preconditioner is, N 1 � AS , 2 := V 1 ( A 2 ) − 1 V T 1 j ( A 1 j ) − 1 R 1 j M − 1 R T 1 + j =1 Let k 1 c be minimum number of distinct colors so that { span { R T 1 j }} 1 ≤ i ≤ N 1 of the same color are mutually A -orthogonal. The following holds (e.g. Chan and Mathew 1994) λ max ( M − 1 AS , 2 ) ≤ k 1 c + 1 9 of 49

Recap on Additive Schwarz methods How to compute the coarse subspace S 1 = span ( V 1 ) � Nicolaides 87 (CG): kernel of the operator (constant vectors) � � R T V 1 := 1 j D 1 j R 1 j 1 j =1: N 1 � Other early references: [Morgan 92] (GMRES), [Chapman, Saad 92], [Kharchenko, Yeremin 92], [Burrage, Ehrel, and Pohl, 93] � Estimations of eigenvectors corresponding to smallest eigenvalues / knowledge from the physics � Geneo [Spillane et al., 2014]: through solving local Gen EVPs, bounds smallest eigenvalue for standard FE and bilinear forms, SPD input matrix subd dofs AS AS-ZEM ( V 1 ) GenEO ( V 1 ) 4 1452 79 54 (24) 16 (46) 8 29040 177 87 (48) 16 (102) 16 58080 378 145 (96) 16 (214) ( V 1 ): size of the coarse space AS-ZEM Nicolaides with rigid body motions, 6 per subdomain Results for 3D elasticity problem provided by F. Nataf 10 of 49

A robust multilevel additive Schwarz preconditioner Plan Motivation of our work Recap on Additive Schwarz methods A robust multilevel additive Schwarz preconditioner Theory of a class of robust two level methods in algebraic setting Extension to multilevel methods Enlarged Krylov methods Conclusions 11 of 49

A robust multilevel additive Schwarz preconditioner Theory of a class of robust two level methods in algebraic setting Fictitious space lemma (Nepomnyaschikh 1991) Let A ∈ R n × n , B ∈ R n B × n B be two SPD matrices. Suppose there exists R : R n B R n → v n B �→ R v n B , such that the following holds 1. The operator R is surjective 2. There exists c u > 0 such that ( R v n B ) T A ( R v n B ) ≤ c u v ⊤ ∀ v n B ∈ R n B n B Bv n B , 3. Stable decomposition: there exists c l > 0 such that ∀ v ∈ R n , ∃ v n B ∈ R n B with v = R v n B and n B Bv n B ≤ ( R v n B ) ⊤ A ( R v n B ) = v ⊤ Av c l v ⊤ Then, the spectrum of the operator R B − 1 R T A is in the segment [ c l , c u ]. 12 of 49

A robust multilevel additive Schwarz preconditioner Theory of a class of robust two level methods in algebraic setting Geneo two level DDM preconditioner Consider the generalized eigenvalue problem for each domain j , for given τ : F ind ( u 1 jk , λ 1 jk ) ∈ R n i , 1 × R , λ 1 jk ≤ 1 /τ s uch that ˜ A Neu 1 j u 1 jk = λ 1 jk D 1 j A 1 j D 1 j u 1 jk where ˜ A Neu is the Neumann matrix of domain i , V 1 basis of S 1 , 1 j N 1 � D 1 j R ⊤ S 1 := 1 j Z 1 j , Z 1 j = span { u 1 jk | λ 1 jk < 1 /τ } j =1 N 1 � − 1 � M − 1 V T V T � R T 1 j A − 1 := V 1 1 AV 1 1 + 1 j R 1 j AS , 2 Geneo j =1 Theorem (Spillane, Dolean, Hauret, Nataf, Pechstein, Scheichl’14) With two technical assumptions fulfilled by standard FE and bilinear forms � � M − 1 κ AS , 2 Geneo A ≤ ( k 1 c + 1) (2 + (2 k 1 c + 1) k 1 τ ) where k 1 c = number of distinct colours to colour the graph of A , k 1 = max number of domains that share a common vertex. 13 of 49

Challenges in achieving scalable and robust linear solvers L. - PowerPoint PPT Presentation

Challenges in achieving scalable and robust linear solvers L. Grigori Alpines Inria Paris and LJLL, Sorbonne University with H. Al Daas, MPI Magdebourg P. Jolivet, CNRS IRIT P. H. Tournier, LJLL, CNRS, Alpines, Sorbonne University September

Math 211 Math 211 Lecture #14 M ATLAB s ODE Solvers September 26, 2003 2 Matlab Solvers

OpenFOAMs basic solvers for linear systems of equations Solvers, preconditioners, smoothers

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

SMT Solvers: A Disruptive Technology John Rushby Computer Science Laboratory SRI International

Cache Coherence in Scalable Machines Scalable Cache Coherent Systems Scalable, distributed

Financial Impacts of Achieving Aggressive Financial Impacts of Achieving Aggressive Financial

ECS 231 Subspace projection methods for LS 1 / 38 Part I. Basics The landscape of solvers for

Robust Linear Quantum Systems Robust Linear Quantum Systems Theory Theory Ian R. Petersen

Performance Evaluation of GPU Accelerated Linear Solvers with TCAD Examples Ana Iontcheva

Short Course in Supervised Learning Robust Optimization and Machine Learning Robust Supervised

Introduction to SAT and SMT Solvers Interfacing Yosys and SMT Solvers for BMC and more using

SAT Solvers Aditya Parameswaran Luv Kumar 1st June, 2005 and 14th June, 2005 Aditya

Solvers Principles and Architecture (SPA) Part 2 SMT Solvers Master Sciences Informatique (Sif)

SAT and SMT Murphy Berzish Overview Boolean Satisfiability (SAT) problem SAT solvers:

Solvers Principles and Architecture (SPA) Part 1 Anatomy of SAT Solvers Master Sciences

Distributed Systems Introduction Matthias Kovatsch Distributed Systems Group Friday, 24

End of presentation

A Fibrational Approach to Automata Theory Eilenberg-type Correspondences in One Liang-Ting Chen

NTCIR 2014 Slides - TUW-IMP at the NTCIR-11 Math-2 Presentation February 2015 CITATIONS READS

Purp rpose Be aware of Educational Pathways after N Level Exams Criteria for admission

Agenda Compliance Reminder Retention of ELLs Policy 6000.1 ACCESS for ELLs

Lecture 1: Overview Reading: Sipser Ch. 0, 1.1 Course information Finite automata

Todays Presenters Shannon Peterson Youth Services Manager, Kitsap Regional Library,