SLIDE 1

Serial Iterative Methods Parallel Iterative Methods Preconditioning

Parallel Numerical Algorithms

Chapter 4 – Sparse Linear Systems Section 4.3 – Iterative Methods Michael T. Heath and Edgar Solomonik

Department of Computer Science University of Illinois at Urbana-Champaign

CS 554 / CSE 512

Michael T. Heath and Edgar Solomonik Parallel Numerical Algorithms 1 / 52

SLIDE 2

Outline

1. Serial Iterative Methods
   Stationary Iterative Methods
   Krylov Subspace Methods

2. Parallel Iterative Methods
   Partitioning
   Ordering
   Communication-Avoiding Iterative Methods
   Chaotic Relaxation

3. Preconditioning
   Simple Preconditioners
   Domain Decomposition
   Incomplete LU

SLIDE 3

Iterative Methods for Linear Systems

Iterative methods for solving linear system Ax = b begin with initial guess for solution and successively improve it until solution is as accurate as desired

In theory, infinite number of iterations might be required to converge to exact solution

In practice, iteration terminates when residual b − Ax, or some other measure of error, is as small as desired

Iterative methods are especially useful when matrix A is sparse because, unlike direct methods, no fill is incurred

SLIDE 4

Jacobi Method

Beginning with initial guess x^(0), Jacobi method computes next iterate by solving for each component of x in terms of the others:

x_i^(k+1) = ( b_i − Σ_{j≠i} a_ij x_j^(k) ) / a_ii,   i = 1, …, n

If D, L, and U are diagonal, strict lower triangular, and strict upper triangular portions of A, then Jacobi method can be written

x^(k+1) = D^-1 ( b − (L + U) x^(k) )
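The matrix form above translates directly into code. A minimal NumPy sketch (the function name `jacobi` and the residual-based stopping test are illustrative additions, not from the slides):

```python
import numpy as np

def jacobi(A, b, x0, iters=100, tol=1e-10):
    # Jacobi iteration: x^(k+1) = D^-1 (b - (L + U) x^(k)),
    # where D is the diagonal of A and L + U = A - D.
    d = np.diag(A)                 # diagonal entries (must be nonzero)
    R = A - np.diag(d)             # off-diagonal part L + U
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(iters):
        x = (b - R @ x) / d        # all components updated simultaneously
        if np.linalg.norm(b - A @ x) < tol:
            break
    return x
```

Note that the whole vector is updated at once, which is exactly the property that makes Jacobi easy to parallelize.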

SLIDE 5

Jacobi Method

Jacobi method requires nonzero diagonal entries, which can usually be accomplished by permuting rows and columns if not already true

Jacobi method requires duplicate storage for x, since no component can be overwritten until all new values have been computed

Components of new iterate do not depend on each other, so they can be computed simultaneously

Jacobi method does not always converge, but it is guaranteed to converge under conditions that are often satisfied (e.g., if matrix is strictly diagonally dominant), though convergence rate may be very slow

SLIDE 6

Gauss-Seidel Method

Faster convergence can be achieved by using each new component value as soon as it has been computed, rather than waiting until next iteration

This gives Gauss-Seidel method:

x_i^(k+1) = ( b_i − Σ_{j<i} a_ij x_j^(k+1) − Σ_{j>i} a_ij x_j^(k) ) / a_ii

Using same notation as for Jacobi, Gauss-Seidel method can be written

x^(k+1) = (D + L)^-1 ( b − U x^(k) )
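The component-wise update can be sketched as follows (names and the stopping test are illustrative, not from the slides):

```python
import numpy as np

def gauss_seidel(A, b, x0, iters=100, tol=1e-10):
    # Gauss-Seidel: each component uses the values already updated in
    # this sweep, i.e. x^(k+1) = (D + L)^-1 (b - U x^(k)).
    n = len(b)
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(iters):
        for i in range(n):
            s = A[i, :i] @ x[:i] + A[i, i+1:] @ x[i+1:]
            x[i] = (b[i] - s) / A[i, i]    # overwrite in place
        if np.linalg.norm(b - A @ x) < tol:
            break
    return x
```

Unlike Jacobi, `x` is overwritten in place, so no duplicate storage is needed, but the components must be computed in order.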

SLIDE 7

Gauss-Seidel Method

Gauss-Seidel requires nonzero diagonal entries

Gauss-Seidel does not require duplicate storage for x, since component values can be overwritten as they are computed

But each component depends on previous ones, so they must be computed successively

Gauss-Seidel does not always converge, but it is guaranteed to converge under conditions that are somewhat weaker than those for Jacobi method (e.g., if matrix is symmetric and positive definite)

Gauss-Seidel converges about twice as fast as Jacobi, but may still be very slow

SLIDE 8

SOR Method

Successive over-relaxation (SOR) uses step to next Gauss-Seidel iterate as search direction with fixed search parameter ω

SOR computes next iterate as

x^(k+1) = x^(k) + ω ( x_GS^(k+1) − x^(k) )

where x_GS^(k+1) is next iterate given by Gauss-Seidel

Equivalently, next iterate is weighted average of current iterate and next Gauss-Seidel iterate:

x^(k+1) = (1 − ω) x^(k) + ω x_GS^(k+1)

If A is symmetric, SOR can be written as the application of a symmetric matrix; this is the symmetric successive over-relaxation (SSOR) method
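The weighted-average form leads to a small modification of the Gauss-Seidel sweep. A sketch (names and the stopping test are illustrative):

```python
import numpy as np

def sor(A, b, x0, omega, iters=100, tol=1e-10):
    # SOR: weighted average of current iterate and Gauss-Seidel iterate,
    # x_i <- (1 - omega) x_i + omega x_i^GS, with 0 < omega < 2.
    n = len(b)
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(iters):
        for i in range(n):
            s = A[i, :i] @ x[:i] + A[i, i+1:] @ x[i+1:]
            x_gs = (b[i] - s) / A[i, i]    # next Gauss-Seidel value
            x[i] = (1 - omega) * x[i] + omega * x_gs
        if np.linalg.norm(b - A @ x) < tol:
            break
    return x
```

Setting `omega = 1` recovers Gauss-Seidel exactly.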

SLIDE 9

SOR Method

ω is fixed relaxation parameter chosen to accelerate convergence

ω > 1 gives over-relaxation, while ω < 1 gives under-relaxation (ω = 1 gives Gauss-Seidel method)

SOR diverges unless 0 < ω < 2, but choosing optimal ω is difficult in general except for special classes of matrices

With optimal value for ω, convergence rate of SOR method can be order of magnitude faster than that of Gauss-Seidel

SLIDE 10

Conjugate Gradient Method

If A is n × n symmetric positive definite matrix, then quadratic function

φ(x) = (1/2) x^T A x − x^T b

attains minimum precisely when Ax = b

Optimization methods have form

x_{k+1} = x_k + α s_k

where α is search parameter chosen to minimize objective function φ(x_k + α s_k) along search direction s_k

For method of steepest descent, s_k = −∇φ(x)

SLIDE 11

Conjugate Gradient Method

For special case of quadratic problem:

Negative gradient is residual vector, −∇φ(x) = b − Ax = r

Optimal line search parameter is given by α = r_k^T s_k / s_k^T A s_k

Successive search directions can easily be A-orthogonalized by three-term recurrence

Using these properties, we obtain conjugate gradient method (CG) for linear systems

SLIDE 12

Conjugate Gradient Method

x_0 = initial guess
r_0 = b − A x_0
s_0 = r_0
for k = 0, 1, 2, …
    α_k = r_k^T r_k / s_k^T A s_k
    x_{k+1} = x_k + α_k s_k
    r_{k+1} = r_k − α_k A s_k
    β_{k+1} = r_{k+1}^T r_{k+1} / r_k^T r_k
    s_{k+1} = r_{k+1} + β_{k+1} s_k
end
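The recurrences above can be transcribed almost line for line into NumPy (the `tol` stopping test is an added convenience, not part of the slide's algorithm):

```python
import numpy as np

def cg(A, b, x0, tol=1e-12):
    # Conjugate gradient method, following the recurrences on the slide.
    x = np.asarray(x0, dtype=float).copy()
    r = b - A @ x                  # initial residual
    s = r.copy()                   # initial search direction
    rr = r @ r
    for _ in range(len(b)):        # at most n steps in exact arithmetic
        As = A @ s
        alpha = rr / (s @ As)      # optimal step along s
        x = x + alpha * s
        r = r - alpha * As
        rr_new = r @ r
        if np.sqrt(rr_new) < tol:
            break
        beta = rr_new / rr         # keeps directions A-orthogonal
        s = r + beta * s
        rr = rr_new
    return x
```

Per iteration this needs one matrix-vector product, two inner products, and three vector updates, which is exactly the cost profile discussed in the parallel implementation section.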

SLIDE 13

Conjugate Gradient Method

Key features that make CG method effective:

Short recurrence determines search directions that are A-orthogonal (conjugate)

Error is minimal over space spanned by search directions generated so far

Minimum error property implies that method produces exact solution after at most n steps

In practice, rounding error causes loss of orthogonality that spoils finite termination property, so method is used iteratively

SLIDE 14

Conjugate Gradient Method

Error is reduced at each iteration by factor of (√κ − 1)/(√κ + 1) on average, where

κ = cond(A) = ‖A‖ · ‖A^-1‖ = λ_max(A)/λ_min(A)

Thus, convergence tends to be rapid if matrix is well-conditioned, but can be arbitrarily slow if matrix is ill-conditioned

But convergence also depends on clustering of eigenvalues of A

SLIDE 15

Nonsymmetric Krylov Subspace Methods

CG is not directly applicable to nonsymmetric or indefinite systems

CG cannot be generalized to nonsymmetric systems without sacrificing one of its two key properties (short recurrence and minimum error)

Nevertheless, several generalizations have been developed for solving nonsymmetric systems, including GMRES, QMR, CGS, BiCG, and Bi-CGSTAB

These tend to be less robust and require more storage than CG, but they can still be very useful for solving large nonsymmetric systems

SLIDE 16

Parallel Implementation

Iterative methods for linear systems are composed of basic operations such as

vector updates (saxpy)
inner products
matrix-vector multiplication
solution of triangular systems

In parallel implementation, both data and operations are partitioned across multiple tasks

In addition to communication required for these basic operations, necessary convergence test may require additional communication (e.g., sum or max reduction)

SLIDE 17

Partitioning of Vectors

Iterative methods typically require several vectors, including solution x, right-hand side b, residual r = b − Ax, and possibly others

Even when matrix A is sparse, these vectors are usually dense

These dense vectors are typically uniformly partitioned among p tasks, with given task holding same set of component indices of each vector

Thus, vector updates require no communication, whereas inner products of vectors require reductions across tasks, at cost we have already seen

SLIDE 18

Partitioning of Sparse Matrix

Sparse matrix A can be partitioned among tasks by rows, by columns, or by submatrices

Partitioning by submatrices may give uneven distribution of nonzeros among tasks; indeed, some submatrices may contain no nonzeros at all

Partitioning by rows or by columns tends to yield more uniform distribution because sparse matrices typically have about same number of nonzeros in each row or column

SLIDE 19

Row Partitioning of Sparse Matrix

Suppose that each task is assigned n/p rows, yielding p tasks, where for simplicity we assume that p divides n

In dense matrix-vector multiplication, since each task owns only n/p components of vector operand, communication is required to obtain remaining components

If matrix is sparse, however, few components may actually be needed, and these should preferably be stored in neighboring tasks

Assignment of rows to tasks by contiguous blocks or cyclically would not, in general, result in desired proximity of vector components
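The set of components a task must receive follows directly from the column indices of its rows. A small illustrative sketch (the helper name `halo_indices` and the `cols_of` row-to-columns map are hypothetical, not from the slides):

```python
def halo_indices(cols_of, my_rows):
    # For row-partitioned SpMV y = A x (vector partitioned to match),
    # a task owning rows `my_rows` needs x_j for every column j with a
    # nonzero a_ij in those rows; removing indices it already owns
    # leaves the components it must receive from other tasks.
    needed = set()
    for i in my_rows:
        needed.update(cols_of[i])
    return needed - set(my_rows)
```

For a tridiagonal matrix (1-D Laplacian) split into contiguous blocks, each task receives only the single boundary component from each neighboring block, illustrating why a good row assignment keeps communication small and local.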

SLIDE 20

Graph Partitioning

Desired data locality can be achieved by partitioning graph of matrix, or partitioning underlying grid or mesh for finite difference or finite element problem

For example, graph can be partitioned into p pieces by nested dissection, and vector components corresponding to nodes in each resulting piece assigned to same task, with neighboring pieces assigned to neighboring tasks

Then matrix-vector product requires relatively little communication, and only between neighboring tasks

SLIDE 21

Two-Dimensional Partitioning

SLIDE 22

Sparse MatVec with 2-D Partitioning

Partition entries of both x and y across processors, and partition entries of A accordingly

(a) Send entries x_j to processors with nonzero a_ij for some i
(b) Local multiply-add: y_i = y_i + a_ij x_j
(c) Send partial inner products to relevant processors
(d) Local sum: sum partial inner products

SLIDE 23

Sparse MatVec with 2-D Partitioning

SLIDE 24

Types of Graph Partitioning Methods

Coordinate-based:
  coordinate bisection
  inertial
  geometric
Multilevel
Coordinate-free:
  level structure
  spectral
  combinatorial refinement (e.g., Kernighan-Lin)

SLIDE 25

Graph Partitioning Software

Chaco
Jostle
Meshpart
Metis / ParMetis
Mondriaan
Party
Scotch
Zoltan

SLIDE 26

Coordinate-based partitioning

Assume graph embedded in d-dimensional space, so we have coordinates for nodes

Find centerpoints; every hyperplane that includes one is a good partition (G. Miller, S. Teng, W. Thurston, S. Vavasis, 1997)

Possible to find vertex separators of size O(n^((d−1)/d)) for meshes with constant aspect ratio (max relative distance of edges in space)

SLIDE 27

Coordinate-free partitioning

Partitioning is hard for arbitrary graphs

Level partitioning – breadth-first-search (BFS) tree levels

Maximal k-independent sets – select vertices separated by distance k and create partitions by combining vertices with nearest vertex in k-independent set

Spectral partitioning looks at eigenvalues of Laplacian matrix L of a graph G = (V, E):

l_ij = degree(v_i)   if i = j
       −1            if (i, j) ∈ E
       0             if (i, j) ∉ E

The eigenvector of L with the second smallest eigenvalue (the Fiedler vector) provides a good partition of G
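Spectral bisection can be sketched with a dense eigensolver (the function name `spectral_bisect` is illustrative; real partitioners use sparse eigensolvers, this is only a demonstration of the Fiedler-vector idea):

```python
import numpy as np

def spectral_bisect(edges, n):
    # Build the graph Laplacian: l_ii = degree(i), l_ij = -1 on edges.
    L = np.zeros((n, n))
    for i, j in edges:
        L[i, j] -= 1.0
        L[j, i] -= 1.0
        L[i, i] += 1.0
        L[j, j] += 1.0
    w, V = np.linalg.eigh(L)              # eigenvalues in ascending order
    fiedler = V[:, 1]                     # eigenvector of 2nd-smallest eigenvalue
    return fiedler >= np.median(fiedler)  # median split gives two halves
```

On a path graph, for example, the Fiedler vector is monotone along the path, so the split separates the two ends, which is the intuitively correct bisection.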

SLIDE 28

Parallel Jacobi

We have already seen example of this approach with Jacobi method for 1-D Laplace equation

Contiguous groups of variables are assigned to each task, so most communication is internal, and external communication is limited to nearest neighbors in 1-D mesh

More generally, Jacobi method usually parallelizes well if underlying grid is partitioned in this manner, since all components of x can be updated simultaneously

SLIDE 29

Parallel Gauss-Seidel and SOR

Unfortunately, Gauss-Seidel and SOR methods require successive updating of solution components in given order (in effect, solving triangular system), rather than permitting simultaneous updating as in Jacobi method

SLIDE 30

Row-Wise Ordering for 2-D Grid

[Figure: 4 × 4 grid G(A) with row-wise node ordering 1–16, and the corresponding nonzero pattern of matrix A]

SLIDE 31

Red-Black Ordering

Apparent sequential order can be broken, however, if components are reordered according to coloring of underlying graph

For 5-point discretization on square grid, for example, color alternate nodes in each dimension red and others black, giving color pattern of chess or checker board

Then all red nodes can be updated simultaneously, as can all black nodes, so algorithm proceeds in alternating phases, first updating all nodes of one color, then those of other color, repeating until convergence
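For the 5-point stencil, each color phase can be updated all at once. A sketch for the model problem −∆u = f with fixed (Dirichlet) boundary values, where the standard Gauss-Seidel update averages the four neighbors (names are illustrative, not from the slides):

```python
import numpy as np

def red_black_sweep(u, f, h):
    # One Gauss-Seidel sweep in red-black order for the 5-point
    # discretization of -Lap(u) = f. All nodes of one color are
    # updated together (vectorized slices), since every neighbor
    # of a node has the other color.
    for parity in (0, 1):                     # red phase, then black phase
        for i in range(1, u.shape[0] - 1):
            j0 = 1 + (i + parity) % 2         # first column of this color
            u[i, j0:-1:2] = 0.25 * (u[i-1, j0:-1:2] + u[i+1, j0:-1:2]
                                    + u[i, j0-1:-2:2] + u[i, j0+1::2]
                                    + h * h * f[i, j0:-1:2])
    return u
```

Each phase reads only nodes of the opposite color, so in a parallel setting all nodes of one color can be updated concurrently with a single halo exchange between phases.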

SLIDE 32

Red-Black Ordering for 2-D Grid

[Figure: 4 × 4 grid G(A) with red-black node ordering, and the corresponding nonzero pattern of matrix A]

SLIDE 33

Multicolor Orderings

More generally, arbitrary graph requires more colors, so there is one phase per color in parallel algorithm

Nodes must also be partitioned among tasks, and load should be balanced for each color

Reordering nodes may affect convergence rate, however, so gain in parallel performance may be offset somewhat by slower convergence rate

SLIDE 34

Sparse Triangular Systems

More generally, multicolor ordering of graph of matrix enhances parallel performance of sparse triangular solution by identifying sets of solution components that can be computed simultaneously (rather than in usual sequential order for triangular solution)

SLIDE 35

Parallel 1D 2-pt Stencil

Normally, halo exchange done before every stencil application

SLIDE 36

In-time Blocking

Instead apply stencil repeatedly before larger halo exchange

SLIDE 40

Analysis of in-time blocking for 1D mesh

For 1-D 2-pt stencil (3-pt stencil similar)

Consider t steps, and execute s of them without messages

Brings down latency cost by factor of s; for t = Θ(n), we improve latency cost from Θ(αn) to Θ(αp) with s = n/p

Also improves flop-to-byte ratio of local subcomputations
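The blocking can be verified locally: with halos of width s, a task can apply the stencil s times before communicating again, and the owned points come out identical to the step-by-step computation. A sketch with a 2-point averaging stencil (names are illustrative):

```python
import numpy as np

def stencil_s_steps(local, left_ghost, right_ghost, s):
    # Apply a 2-pt averaging stencil s times after a single halo
    # exchange of width s, instead of one width-1 exchange per step.
    u = np.concatenate([left_ghost, local, right_ghost])
    for _ in range(s):
        u = 0.5 * (u[:-2] + u[2:])    # usable region shrinks by 1 per side
    return u                          # the s-step result on owned points
```

The extra work is the repeated computation inside the halo region, which is the computation/latency trade-off analyzed on this slide.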

SLIDE 41

Analysis of in-time blocking for 2-D mesh

For 2-D mesh, there is more complexity

Consider t steps and execute s ≤ √(n/p) of them without messages

Need to do asymptotically more computation and interprocessor communication if s > √(n/p)

SLIDE 42

Asynchronous or Chaotic Relaxation

Using updated values for solution components in Gauss-Seidel and SOR methods improves convergence rate, but limits parallelism and requires synchronization

Alternatively, in computing next iterate, each processor could use most recent value it has for each solution component, rather than waiting for latest value on any processor

This approach, sometimes called asynchronous or chaotic relaxation, can be effective, but stochastic behavior complicates analysis of convergence and convergence rate

SLIDE 43

Preconditioning

Convergence rate of iterative methods depends on condition number and can often be substantially accelerated by preconditioning

Apply method to M^-1 A, where M is chosen so that M^-1 A is better conditioned than A, and systems of form Mz = y are easily solved for z

Typically, M is (block-)diagonal or triangular
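Preconditioning changes only where M^-1 is applied inside the iteration. A preconditioned CG sketch (a standard formulation, not taken verbatim from these slides; `apply_Minv(r)` is assumed to return z solving Mz = r):

```python
import numpy as np

def preconditioned_cg(A, b, x0, apply_Minv, tol=1e-12):
    # CG on the preconditioned system; apply_Minv(r) returns z with Mz = r.
    x = np.asarray(x0, dtype=float).copy()
    r = b - A @ x
    z = apply_Minv(r)
    s = z.copy()
    rz = r @ z
    for _ in range(len(b)):
        As = A @ s
        alpha = rz / (s @ As)
        x = x + alpha * s
        r = r - alpha * As
        if np.linalg.norm(r) < tol:
            break
        z = apply_Minv(r)          # the only preconditioner-specific step
        rz_new = r @ z
        beta = rz_new / rz
        s = z + beta * s
        rz = rz_new
    return x

# Diagonal (Jacobi) preconditioner: M = diag(A), so applying M^-1 is
# just componentwise division:
#   x = preconditioned_cg(A, b, x0, lambda r: r / np.diag(A))
```

In parallel, the cost and scalability of the preconditioner thus reduce to the cost of solving Mz = r, which is why the choices of M below emphasize easily inverted structure.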

SLIDE 44

Basic Preconditioners: M for M^-1 A

Polynomial (most commonly Chebyshev): M^-1 = poly(A)
  neither M nor M^-1 is explicitly formed; the latter is applied by multiple SpMVs

Diagonal (Jacobi, diagonal scaling): M = diag(d)
  can sometimes use d = diag(A); easy but often ineffective

SLIDE 45

Block Preconditioners: M for M^-1 A

Consider partitioning

A = [ B  E ]
    [ F  C ]

where B = diag(B_1, …, B_p) is block diagonal and E, F, C are partitioned conformally into blocks E_i, F_i, and C_ij

Block-diagonal (domain decomposition) preconditioner:

M = [ B  0 ]
    [ 0  I ]

so

M^-1 A [ y_1 ]   =   [ y_1 + B^-1 E y_2 ]
       [ y_2 ]       [ F y_1 + C y_2    ]

Iterative methods with M^-1 A can compute each B_i^-1 in parallel to get and apply B^-1
SLIDE 46

Sparse Preconditioners: M for M^-1 A

Incomplete LU (ILU) computes an approximate LU factorization, ignoring fill entries throughout regular sparse LU

Let S ⊆ {1, …, n}² be a sparsity mask and compute L, U only on S

Level-0 ILU factorization ILU[0]: (i, j) ∈ S if and only if a_ij ≠ 0

Given [L^(0), U^(0)] ← ILU[0](A), our preconditioner will be M = L^(0) U^(0) ≈ A

Level-1 ILU factorization ILU[1]: (i, j) ∈ S if a_ij ≠ 0 or l^(0)_ik u^(0)_kj ≠ 0 for some k

(generate fill only from updates involving non-newly-filled entries)
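A dense-storage sketch of ILU[0] (illustrative only; real implementations work on sparse data structures):

```python
import numpy as np

def ilu0(A):
    # ILU[0]: Gaussian elimination, but updates are applied only where
    # A itself is nonzero (the level-0 mask S), so no fill is created.
    n = A.shape[0]
    F = np.asarray(A, dtype=float).copy()
    mask = A != 0
    for k in range(n):
        for i in range(k + 1, n):
            if mask[i, k]:
                F[i, k] /= F[k, k]            # multiplier l_ik
                for j in range(k + 1, n):
                    if mask[i, j]:            # drop updates outside S
                        F[i, j] -= F[i, k] * F[k, j]
    L = np.tril(F, -1) + np.eye(n)            # unit lower triangular
    U = np.triu(F)
    return L, U
```

When A has no fill in exact LU (e.g., a tridiagonal matrix), ILU[0] reproduces the exact factorization; in general L^(0)U^(0) only approximates A, which is what makes it a preconditioner rather than a solver.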

SLIDE 47

Parallel Incomplete LU

When the number of nonzeros per row is small, computing ILU[0] is as hard as triangular solves with the factors

Elimination tree is given by spanning tree of original graph; filled graph = original graph

No need for symbolic factorization, and lower memory usage

However, no general accuracy guarantees

ILU can be approximated iteratively with high concurrency (see Chow and Patel, 2015)

SLIDE 48

References – Iterative Methods

• R. Barrett, M. Berry, T. Chan, J. Demmel, J. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. van der Vorst, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, SIAM, 1994
• A. Greenbaum, Iterative Methods for Solving Linear Systems, SIAM, 1997
• Y. Saad, Iterative Methods for Sparse Linear Systems, 2nd ed., SIAM, 2003
• H. A. van der Vorst, Iterative Krylov Methods for Large Linear Systems, Cambridge University Press, 2003

SLIDE 49

References – Parallel Iterative Methods

• J. W. Demmel, M. T. Heath, and H. A. van der Vorst, Parallel numerical linear algebra, Acta Numerica 2:111-197, 1993
• J. J. Dongarra, I. S. Duff, D. C. Sorensen, and H. A. van der Vorst, Numerical Linear Algebra for High-Performance Computers, SIAM, 1998
• I. S. Duff and H. A. van der Vorst, Developments and trends in the parallel solution of linear systems, Parallel Computing 25:1931-1970, 1999
• M. T. Jones and P. E. Plassmann, The efficient parallel iterative solution of large sparse linear systems, in A. George, J. R. Gilbert, and J. Liu, eds., Graph Theory and Sparse Matrix Computation, Springer-Verlag, 1993, pp. 229-245
• H. A. van der Vorst and T. F. Chan, Linear system solvers: sparse iterative methods, in D. E. Keyes, A. Sameh, and V. Venkatakrishnan, eds., Parallel Numerical Algorithms, pp. 91-118, Kluwer, 1997

SLIDE 50

References – Preconditioning

• T. F. Chan and H. A. van der Vorst, Approximate and incomplete factorizations, in D. E. Keyes, A. Sameh, and V. Venkatakrishnan, eds., Parallel Numerical Algorithms, pp. 167-202, Kluwer, 1997
• M. J. Grote and T. Huckle, Parallel preconditioning with sparse approximate inverses, SIAM J. Sci. Comput. 18:838-853, 1997
• Y. Saad, Highly parallel preconditioners for general sparse matrices, in G. Golub, A. Greenbaum, and M. Luskin, eds., Recent Advances in Iterative Methods, pp. 165-199, Springer-Verlag, 1994
• H. A. van der Vorst, High performance preconditioning, SIAM J. Sci. Stat. Comput. 10:1174-1185, 1989
• E. Chow and A. Patel, Fine-grained parallel incomplete LU factorization, SIAM J. Sci. Comput. 37(2):C169-C193, 2015

SLIDE 51

References – Graph Partitioning

• B. Hendrickson and T. Kolda, Graph partitioning models for parallel computing, Parallel Computing 26:1519-1534, 2000
• G. Karypis and V. Kumar, A fast and high quality multilevel scheme for partitioning irregular graphs, SIAM J. Sci. Comput. 20:359-392, 1999
• G. L. Miller, S.-H. Teng, W. Thurston, and S. A. Vavasis, Automatic mesh partitioning, in A. George, J. R. Gilbert, and J. Liu, eds., Graph Theory and Sparse Matrix Computation, Springer-Verlag, 1993, pp. 57-84
• A. Pothen, Graph partitioning algorithms with applications to scientific computing, in D. E. Keyes, A. Sameh, and V. Venkatakrishnan, eds., Parallel Numerical Algorithms, pp. 323-368, Kluwer, 1997
• C. Walshaw and M. Cross, Parallel optimisation algorithms for multilevel mesh partitioning, Parallel Comput. 26:1635-1660, 2000

SLIDE 52

References – Chaotic Relaxation

• R. Barlow and D. Evans, Synchronous and asynchronous iterative parallel algorithms for linear systems, Comput. J. 25:56-60, 1982
• R. Bru, L. Elsner, and M. Neumann, Models of parallel chaotic iteration methods, Linear Algebra Appl. 103:175-192, 1988
• G. M. Baudet, Asynchronous iterative methods for multiprocessors, J. ACM 25:226-244, 1978
• D. Chazan and W. L. Miranker, Chaotic relaxation, Linear Algebra Appl. 2:199-222, 1969
• A. Frommer and D. B. Szyld, On asynchronous iterations, J. Comput. Appl. Math. 123:201-216, 2000
