Flexible and Multishift Induced Dimension Reduction Algorithms - PowerPoint PPT Presentation

Flexible and Multishift Induced Dimension Reduction Algorithms Martin van Gijzen Joint work with Gerard Sleijpen and Jens Zemke NWO-JSPS Seminar, Delft, April 10-13, 2012 1 Delft University of Technology

Outline • Introduction • IDR Hessenberg relation • Variable preconditioners • Flexible QMRIDR( s ) • Numerical experiments • Shifted problems • Multishift QMRIDR( s ) • Numerical experiments • Conclusions 2 NWO-JSPS Seminar, Delft, April 10-13, 2012

Introduction (1) Almost all Krylov methods fall in one of the following two classes: 1. Residuals are explicitly generated that satisfy some desirable property. Examples are: CG, CR, BiCG, BiCGSTAB, GCR, IDR( s ) etc. The approximate solution is obtained as a side product. 2. A basis for the Krylov subspace is explicitly generated using a Hessenberg relation. The approximate solution is computed using the (partial) Hessenberg decomposition. Examples are: GMRES, FOM, QMR, etc. 3 NWO-JSPS Seminar, Delft, April 10-13, 2012

Introduction (2) The distinction is algorithmic rather than mathematical, mathematically equivalent methods exists from both classes (e.g. GMRES and GCR). For the second class the residual is normally not available, which gives them a less natural, less elegant flavour (personal opinion). The main advantage, however, is the explicit availability of the partial Hessenberg decomposition, which makes extension for solving other types of problems often quite straightforward. For example, it is trivial to calculate approximate eigenvalues with GMRES, or to use GMRES to solve a sequence of shifted systems. The same is true for QMR. 4 NWO-JSPS Seminar, Delft, April 10-13, 2012

Introduction (3) In this talk we present an IDR-based method of the second type, i.e. IDR is used to make a partial Hessenberg decomposition. The aim is to derive a flexible IDR method (cf. flexible GMRES), and a multishift IDR method (cf. multishift GMRES). To computed the ’IDR basis vectors’ we use orthogonalisation when possible, and to compute approximate solutions we use quasi-minimization, which results in GMRES-like algorithms that still use short recurrences . 5 NWO-JSPS Seminar, Delft, April 10-13, 2012

IDR Hessenberg relation (1) 6 NWO-JSPS Seminar, Delft, April 10-13, 2012

IDR Hessenberg relation (1) • IDR( s ) constructs vectors that are in a sequence of nested subspace G j of shrinking dimension. 6 NWO-JSPS Seminar, Delft, April 10-13, 2012

IDR Hessenberg relation (1) • IDR( s ) constructs vectors that are in a sequence of nested subspace G j of shrinking dimension. • The main steps are as follows: - Compute vector v ∈ G j − 1 ⊥ p 1 , · · · , p s ; - Compute vector g ∈ G j : g = ( I − ω j A ) v . 6 NWO-JSPS Seminar, Delft, April 10-13, 2012

IDR Hessenberg relation (1) • IDR( s ) constructs vectors that are in a sequence of nested subspace G j of shrinking dimension. • The main steps are as follows: - Compute vector v ∈ G j − 1 ⊥ p 1 , · · · , p s ; - Compute vector g ∈ G j : g = ( I − ω j A ) v . • To compute a vector ∈ G j , s + 1 vectors ∈ G j − 1 are needed. 6 NWO-JSPS Seminar, Delft, April 10-13, 2012

IDR Hessenberg relation (1) • IDR( s ) constructs vectors that are in a sequence of nested subspace G j of shrinking dimension. • The main steps are as follows: - Compute vector v ∈ G j − 1 ⊥ p 1 , · · · , p s ; - Compute vector g ∈ G j : g = ( I − ω j A ) v . • To compute a vector ∈ G j , s + 1 vectors ∈ G j − 1 are needed. • Intermediate vectors ∈ G j are not unique. For stability they can be orthonormalised wrt the vectors ∈ G j . 6 NWO-JSPS Seminar, Delft, April 10-13, 2012

IDR Hessenberg relation (2) The vectors g satisfy a generalised Hessenberg relation. AV n = G n +1 H n V n = G n C n with • G n = [ g 1 , · · · g n ] ; • V n = [ v 1 , · · · v n ] ; • C n : n × n upper triangular; • H n : ( n + 1) × n extended Hessenberg, with upper bandwidth s . 7 NWO-JSPS Seminar, Delft, April 10-13, 2012

Solution by quasi-minimisation 1 For solving a linear system Ax = b we take g 1 = � b � b and search for a solution x n = V n y n by minimising the residual norm: min � b − AV n y n � ⇔ min � G n +1 ( � b � e 1 − H n y n ) � y n y n We ’quasi-minimise’ this expression by solving min �� b � e 1 − H n y n � y n 8 NWO-JSPS Seminar, Delft, April 10-13, 2012

Some remarks • The derivation of the algorithm is analogous to GMRES and QMR. • As in QMR, the algorithm can be implemented using short recurrences (of length s + 2 ) only. • Since the initial s vectors g i form an orthonormal set, our QMRIDR( s ) variant is mathematically equivalent to GMRES in the first s iterations. • An earlier QMRIDR method was proposed by Du, Sogabe, and Zhang (JCAM, 2011), aiming to obtain smoother convergence. Their algorithm is based on the ’prototype’ IDR algorithm, and is not mathematically equivalent to GMRES in the first s steps. 9 NWO-JSPS Seminar, Delft, April 10-13, 2012

Flexible QMRIDR( s ) (1) If right-preconditioning is used, the Hessenberg relation becomes AP − 1 V n = G n +1 H n or AZ n = G n +1 H n , Z n = P − 1 V n . If a variable preconditioner P − 1 is used, this relation still holds. n Z n is then defined by Z n = [ P − 1 1 v 1 , · · · , P − 1 n v n ] 10 NWO-JSPS Seminar, Delft, April 10-13, 2012

Flexible QMRIDR( s ) (2) To find an approximate solution x n put x n = Z n y n and follow the ’quasi-minimisation’ procedure explained before. In the first s steps Flexible QMRIDR( s ) is mathematically equivalent with FGMRES. 11 NWO-JSPS Seminar, Delft, April 10-13, 2012

Geophysical example The test problem models propagation of a sound wave with frequency f in the earth crust. It mimics three layers with a simple heterogeneity. 8 − ∆ p − ( 2 πf c ( x ) ) 2 p = s, in Ω = (0 , 600) × (0 , 1000) m 2 > > < s = δ ( x 1 − 300 , x 2 ) , x 1 = (0 , 600) , x 2 = (0 , 1000) > ∂n = − 2 πif with Neumann conditions ∂p > p on Γ ≡ ∂ Ω . : c 12 NWO-JSPS Seminar, Delft, April 10-13, 2012

Discretisation and preconditioning Discretisation with Finite Element Method gives system ( K − z 1 M ) u = f . System matrix is indefinite → difficult for iterative methods. We use the Shifted Laplace preconditioner P = K − z 2 M with z 2 = − i | z 1 | (Erlangga, Vuik and Oosterlee, 2004, 2006) 13 NWO-JSPS Seminar, Delft, April 10-13, 2012

Shifted Laplace preconditioner (2) In practice, a cheap approximation of P − 1 is used. The idea behind the shifted Laplace preconditioner is that this matrix is much easier to approximate than the discrete Helmholtz operator. Erlangga et al. used one geometric multigrid cycle to approximate the inverse of P . The MG-method we use is AGMG, by Notay (ETNA, 2010). AGMG uses a so-called K-cycle multigrid scheme, which uses a Krylov solver at each level. As a consequence, the outer iteration has to be a flexible Krylov method. 14 NWO-JSPS Seminar, Delft, April 10-13, 2012

Example In the experiment we use the following parameters: • z = (2 πf ) 2 , f = 8 Hz . • h = 12 . 5 m . • 3700 equations. 15 NWO-JSPS Seminar, Delft, April 10-13, 2012

Convergence for increasing s Convergence FQMRIDR(s) 0 s=1 s=2 −1 s=4 s=8 −2 s=16 s=32 −3 s=64 s=128 log(|r|/|b|) fgmres −4 −5 −6 −7 −8 −9 0 200 400 600 800 1000 1200 1400 1600 1800 2000 Number of MATVECS () Wedge problem 16 NWO-JSPS Seminar, Delft, April 10-13, 2012

Observations • Convergence curve for s = 128 coincides with FGMRES. • Convergence improves for larger s (for this example, not true in general if preconditioner is very variable). • General remark: for more variable preconditioner it is often better to use a small value for s . 17 NWO -JSPS Seminar, Delft, April 10-13, 2012

The multiple-frequency problem In practice, often a sequence of systems at different frequencies has to be solved: ( A − z i I ) x z i = b , i = 1 , n z A number of Krylov methods for this problems exist, e.g. • Multishift BiCG, Bi-CGSTAB, BiCGstab( ℓ ) (Jegerlehner, 1996, Frommer 2002) • Multishift QMR (Freund, 1994) • Restarted GMRES (Frommer and Glassner, 1998), FOM (Simoncini, 2003) • CGLS (Van den Eshof, Sleijpen, 2004) 18 NWO -JSPS Seminar, Delft, April 10-13, 2012

Multishift QMRIDR( s ) To solve shifted system ( A − z i I ) x z i = b we use again AV n = G n +1 H n . We search for a solution x z i n = V n y z i n . The norm of the residual of the shifted system is ’quasi-minimised’: � b − ( A − z i I ) V n y z i �� b � e 1 − ( H n − z i C n ) y z i min n � ⇔ min n � y zi y zi n n Here C n is C n with a zero row appended. 19 NWO -JSPS Seminar, Delft, April 10-13, 2012

Remarks The only difference between our QMRIDR( s ) and multishift QMRIDR( s ) is that the quasi-minimisation is performed on (small) shifted Hessenberg systems. The solutions to the large shifted systems are computed using short recurrences. • Computation of Hessenberg decomposition is the same as for QMRIDR( s ) → requires one matvec per iteration plus 3 s vector operations (inner product or update), and storage for s + 1 g vectors. • Computation of solution vectors for the shifted systems requires for each shift s + 1 update vectors. Computational cost consists of n z ( s + 2) vector updates plus scalar operations. 20 NWO -JSPS Seminar, Delft, April 10-13, 2012

Flexible and Multishift Induced Dimension Reduction Algorithms - PowerPoint PPT Presentation

Flexible and Multishift Induced Dimension Reduction Algorithms Martin van Gijzen Joint work with Gerard Sleijpen and Jens Zemke NWO-JSPS Seminar, Delft, April 10-13, 2012 1 Delft University of Technology Outline Introduction IDR

T=0 V out Q d V V v Induced current Induced charge T=1 V out Q d V V Induced current

Dimension Reduction and Nearest Neighbor Search Advanced Algorithms Nanjing University, Fall

Dimension Reduction CSE 6242 / CX 4242 Thanks : Prof. Jaegul Choo , Dr. Ramakrishnan Kannan,

The The Beverly Beverly Middle Middle School School Flexible Flexible Learning Learning

Linear Dimension Reduction (in L 2 ) Linear Dimension Reduction: R D R d Goal: Find a low-dim.

VC-dimension and Erd os-P osa property Nicolas Bousquet LIRMM, University Montpellier II

Personalized Learning Flexible Seating and Space Flexible Seating and Space Flexible Seating and

The Human Dimension Sue Manns Regional Director Pegasus The Human Dimension The Human

Electromagnetic Induction Current Induced in a Straight Conductor Current Induced in a Loop

OVERVIEW Contact-induced change Contact-induced differentiation (CID) A cognitive model

Packing Dimension Results for Anisotropic Gaussian Random Fields Dongsheng Wu Department of

The Metric Dimension Problem. J. D az Monash U., May 2018 The Metric Dimension problem

Nonparametric Variable Selection via Sufficient Dimension Reduction Lexin Li Workshop on Current

Extreme Value Theory and Dimension GARDES Inference on reduction for the study of hyperspectral

Flexible Instruction Day Parent Presentation Flexible Instruction Day March 16 - 20 - Flexible

Flexible Infrastructure Qualification What Is Flexible Infrastructure/Benefits Flexible

1 AN EMPIRICAL COMPARISON OF CONVERTIBLE BOND VALUATION MODELS Robert Jones, Chris Veld, and Yuriy

Some Thoughts on LFCS at 30 Samson Abramsky Department of Computer Science, University of Oxford

Local convergence for random permutations The case of uniform pattern-avoiding permutations

[12] The Eigenvector Two interest-bearing accounts Suppose Account 1 yields 5% interest and

Revelation Series Lesson #006 June 13, 2004 Dean Bible Ministries www.deanbibleministries.org

SLDG schemes for 1st and 2nd order PDEs Olivier Bokanowski Laboratory Jacques Louis Lions (Paris

Invariants for gapped ground state phases in dimensions one and higher 1 Bruno Nachtergaele (UC

srrt s