Krylov subspace methods and exascale computations: good match or lost case?
Zdenˇ ek Strakoš
Charles University in Prague and Czech Academy of Sciences http://www.karlin.mff.cuni.cz/˜strakos SPPEXA Symposium, Münich, January 2016
Krylov subspace methods and exascale computations: good match or - - PowerPoint PPT Presentation
Krylov subspace methods and exascale computations: good match or lost case? Zden ek Strako Charles University in Prague and Czech Academy of Sciences http://www.karlin.mff.cuni.cz/strakos SPPEXA Symposium, Mnich, January 2016
Zdenˇ ek Strakoš
Charles University in Prague and Czech Academy of Sciences http://www.karlin.mff.cuni.cz/˜strakos SPPEXA Symposium, Münich, January 2016
2
Strakos, Z., Efficiency and Optimizing of Algorithms and Programs on the Host Computer / Array Processor System, Parallel Computing, 4, 1987,
Basic problems and principles are not even after thirty years that much different.
3
r0 = b − Ax0, solve Mz0 = r0, p0 = z0 For n = 1, . . . , nmax αn−1 = z∗
n−1rn−1
p∗
n−1Apn−1
xn = xn−1 + αn−1pn−1 , stop when the stopping criterion is satisfied rn = rn−1 − αn−1Apn−1 Mzn = rn , solve for zn βn = z∗
nrn
z∗
n−1rn−1
pn = zn + βnpn−1 End
4
However, for the approximation power of the methods, nonlinearity is essential!
5
State-of-the-art in the algorithmic developments:
and Practice, PhD Thesis, UC at Berkeley, CA, 2015.
6
7
should be given priority to overproduction of algorithmic variations.
to the inner nature of the problem.” (C. Lanczos, 1947). Infinite dimensional considerations are very useful.
restricted to the operator only, universal contraction-based bounds, asymptotic considerations, unjustified or hidden restrictive assumptions.
are instead unwanted and often labeled as “negative.”
8
9
10
Let V be a real (infinite dimensional) Hilbert space with the inner product (·, ·)V : V × V → R, let V # be the dual space of bounded linear functionals on V . Consider a bounded and coercive operator A : V → V # and the equation in V # Ax = b , A : V → V #, x ∈ V, b ∈ V # . Using the Riesz map, (τAx − τb, v)V = 0 for all v ∈ V . The Riesz map τ can be interpreted as transformation of the original problem Ax = b in V # into the equation in V τAx = τb, τA : V → V, x ∈ V, τb ∈ V , which is (unfortunately) called preconditioning.
11
Let B (= τA) be a bounded linear operator on the Hilbert space V . Choosing z0 (= τb − τAx0) ∈ V . Consider the Krylov sequence z0, z1 = Bz0, z2 = Bz1 = B2z0, . . . , zn = Bzn−1 = Bnzn−1, . . . Determine a sequence of operators Bn defined on the sequence of nested subspaces Vn = span {z0, . . . , zn−1} , with the projector En
z1 = Bz0 = Bnz0, z2 = B2z0 = (Bn)2z0, . . . zn−1 = Bn−1z0 = (Bn)n−1z0, Enzn = EnBnz0 = (Bn)nz0.
12
B x = f ← → ω(λ),
↑ ↑ Tn yn = fV e1 ← → ω(n)(λ),
n
ω(n)
i
F
i
λU
λL
λ−1 dω(λ) =
n
ω(n)
i
i
−1 + u − un2
a
f2
V
Stieltjes (1894) and Vorobyev (1958) moment problems for self-adjoint bounded operators reduce to the Gauss-Christoffel quadrature (1814). No one would consider describing it by contraction.
13
r0 = b − Ax0 ∈ V #, p0 = τr0 ∈ V For n = 1, 2, . . . , nmax αn−1 = rn−1, τrn−1 Apn−1, pn−1 = (τrn−1, τrn−1)V (τApn−1, pn−1)V xn = xn−1 + αn−1pn−1 , stop when the stopping criterion is satisfied rn = rn−1 − αn−1Apn−1 βn = rn, τrn rn−1, τrn−1 = (τrn, τrn)V (τrn−1, τrn−1)V pn = τrn + βnpn−1 End Hayes (1954); Vorobyev (1958, 1965); Karush (1952); Stesin (1954) Superlinear convergence for (identity + compact) operators.
14
15
20 40 60 80 100 10
−15
10
−10
10
−5
10
100 200 300 400 500 600 700 800 10
−1510
−1010
−510 iteration number residual smooth ubound backward error loss of orthogonality approximate solution error
Here numerical inexactness due to roundoff. How much may we relax accuracy of the most costly operations without causing an unwanted delay and/or affecting the maximal attainable accuracy?
16
17
Let Φh = (φ(h)
1 , . . . , φ(h) N )
be a basis of the subspace Vh ⊂ V , let Φ#
h = (φ(h)# 1
, . . . , φ(h)#
N
) be the canonical basis of its dual V #
h ,
( V #
h = AVh) . Using the coordinates in
Φh and in Φ#
h ,
f, v → v∗f , (u, v)V → v∗Mu, (Mij) = ((φj, φi)V )i,j=1,...,N , Au → Au , Au = AΦhu = Φ#
h Au ;
(Aij) = (a(φj, φi))i,j=1,...,N , τf → M−1f , τf = τΦ#
h f = ΦhM−1f ;
we get with b = Φ#
h b , xn = Φh xn , pn = Φh pn , rn = Φ# h rn
the algebraic CG formulation.
18
r0 = b − Ax0, solve Mz0 = r0, p0 = z0 For n = 1, . . . , nmax αn−1 = z∗
n−1rn−1
p∗
n−1Apn−1
xn = xn−1 + αn−1pn−1 , stop when the stopping criterion is satisfied rn = rn−1 − αn−1Apn−1 Mzn = rn , solve for zn βn = z∗
nrn
z∗
n−1rn−1
pn = zn + βnpn−1 End Günnel, Herzog, Sachs (2014); Málek, S (2015)
19
M = I , corresponds to the discretization basis Φ orthonormal wrt (·, ·)V .
unpreconditioned algebraic CG that is applied to the preconditioned algebraic system. The resulting matrix of this preconditioned algebraic system is not sparse!
from discretization can be interpreted within this operator preconditioning framework.
20
21
Tn = α1 β2 β2 ... ... ... ... ... ... ... βn βn αn is the Jacobi matrix of the orthogonalization coefficients and the CG method is formulated by Tnyn = τr0V e1, xn = x0 + Qnyn , xn ∈ Vn . Infinite dimensional Krylov subspace methods perform discretization via model reduction.
22
23
−1 −0.5 0.5 1 −1 1 0.2 0.4 0.6 0.8 1 1.2 1.4
−1 1 −1 1 −4 −2 2 4 x 10
−4
Exact solution x (left) and the discretisation error x − xh (right) in the Poisson model problem, linear FEM, adaptive mesh refinement. Quasi equilibrated discretization error over the domain.
24
−1 1 −1 1 −4 −2 2 4 x 10
−4
−1 1 −1 1 −4 −2 2 4 x 10
−4
Algebraic error xh − x(n)
h
(left) and the total error x − x(n)
h
(right) after the number of CG iterations guaranteeing x − xha = ∇(x − xh) ≫ x − xnA .
25
adaptation is the key to their efficient use.
not be efficient without being fast, i.e., without powerful preconditioning.
not be considered separately within isolated disciplines. They form a single problem.
issues, including numerical stability and a posteriori error analysis leading to appropriate stopping criteria.
resolve them, we should fairly admit that they exist.
26
in the Context of Solving PDEs. SIAM Spotlight Series, SIAM (2015)
Chebyshev polynomials and finite precision conjugate gradient computations, Numer. Alg. 65, 759-782 (2014)
algebraic error in numerical solution of partial differential equations, Linear Alg. Appl. 449, 89-114 (2014)
Oxford University Press (2013)
. Tichý, On efficient numerical approximation of the bilinear form c∗A−1b, SIAM J. Sci. Comput. 33, 565-587 (2011)
27
28
Kr´ alick´ y Snˇ eˇ zn´ ık - Glatzer Schneeberg German Artist’ Union Jetscher Artist Amei Hallenger Made by Co F¨
e Hory) 1932