Karl Meerbergen: Matrix algorithms for parametric dynamical systems - - PowerPoint PPT Presentation

karl meerbergen matrix algorithms for parametric
SMART_READER_LITE
LIVE PREVIEW

Karl Meerbergen: Matrix algorithms for parametric dynamical systems - - PowerPoint PPT Presentation

Karl Meerbergen: Matrix algorithms for parametric dynamical systems Applications Structures and vibrations, pharma-industry Uncertainty quantification Design optimization, PDE constrained optimization Bifurcation analysis (Distance


slide-1
SLIDE 1

Karl Meerbergen: Matrix algorithms for parametric dynamical systems

Applications

Structures and vibrations, pharma-industry

◮ Uncertainty quantification ◮ Design optimization, PDE constrained optimization

Bifurcation analysis (Distance problems)

(KU Leuven) Research overview May 6, 2015 1 / 20

slide-2
SLIDE 2

Karl Meerbergen: Matrix algorithms for parametric dynamical systems

Applications

Structures and vibrations, pharma-industry

◮ Uncertainty quantification ◮ Design optimization, PDE constrained optimization

Bifurcation analysis (Distance problems)

Linear algebra problems

Eigenvalue problems: linear, non-linear Model reduction: linear, non-linear Linear systems: preconditioning Tensors

(KU Leuven) Research overview May 6, 2015 1 / 20

slide-3
SLIDE 3

Karl Meerbergen: Matrix algorithms for parametric dynamical systems

Applications

Structures and vibrations, pharma-industry

◮ Uncertainty quantification ◮ Design optimization, PDE constrained optimization

Bifurcation analysis (Distance problems)

Linear algebra problems

Eigenvalue problems: linear, non-linear Model reduction: linear, non-linear Linear systems: preconditioning Tensors

Exascale computing

Flanders ExaScience Lab (Intel) founded in June 2010. Co-design lab for future hardware for HPC

(KU Leuven) Research overview May 6, 2015 1 / 20

slide-4
SLIDE 4

Parametric matrices in uncertainty analysis/optimization

5 10 15 20 25 30 35 40 10

−16

10

−15

10

−14

10

−13

10

−12

10

−11

10

−10

f [Hz] |y| [m2/(rad/s)2] k = 60 − q = 30

Full QMMR

Large linear systems depending on parameters (frequency, physical parameters)

◮ Preconditioning for indefinite systems (= hard) ◮ Many linear systems to solve −

→ parametric model reduction

Eigenvalue problem depending on (physical) parameters

◮ Eigenvalues play an important role in the dynamics of such problem ◮ Parametric modal truncation: Maryam Saadvandi, Wim Desmet

(mechanical engineering)

(KU Leuven) Research overview May 6, 2015 2 / 20

slide-5
SLIDE 5

Recycling/ Quadratic output

SISO linear system (K − ω2M)x = f y = x∗Sx with S symmetric. (Joint with Van Beeumen, Van Nimmen, Lombaert) Method:

◮ Krylov space with f ⇒ Vk = [v1, . . . , vk] ◮ Extract Ritz vectors (M. & Bai) ◮ Use combination of modal superposition (recycle Ritz vectors) with

block Krylov for SVk

10

−2

10

−1

10 10

1

10

2

10

3

|y| |y| |y | |y |

(KU Leuven) Research overview May 6, 2015 3 / 20

slide-6
SLIDE 6

Dominant poles for nonlinear frequency dependent models

SISO Nonlinear system A(s)x = fu(s) y = dTx For many problems, y can be developed in a modal expansion y =

  • j=1

Rj s − λj where λj is a root of det(A(s)) = 0. Dominant pole algorithm makes a reduced model based on the dominant terms: y ≈

k

  • j=1

Rj s − λj Joint work with Maryam Saadvandi and Wim Desmet (Mechanical engineering): compute dominant poles for parametric systems

(KU Leuven) Research overview May 6, 2015 4 / 20

slide-7
SLIDE 7

Dominant poles for parametric equations

(1 + 0.02i)K0 + (k1 + iωc1)K1 − ω2M

  • x

= f y = c∗x.

5 10 15 20 25 30 10

−7

10

−6

10

−5

10

−4

iω(rad/s) |H(iω)|

γ(1) γ(2) γ(3) γ(4)

# iterations time (min) γ(1) 8 0.45 γ(2) 4 0.25 γ(3) 7 0.43 γ(4) 3 0.19 Total 22 1.32

(KU Leuven) Research overview May 6, 2015 5 / 20

slide-8
SLIDE 8

Model order reduction (MOR) in optimization context

Dynamical system: A(ω, γ)x = f y = dTx g = ωmax

ωmin

|y|2dω (energy) Objective: minimize g(γ) Model reduction using moment matching = Pad´ e approximation via Krylov methods Allows for cheap computation of g and ∇γg Penalty and trust region methods based on error estimation of reduced model (Yao Yue)

(KU Leuven) Research overview May 6, 2015 6 / 20

slide-9
SLIDE 9

Example of the Lamot footbridge

OPTEC Application Problem 4 Lamot bridge finite element model (n = 25, 962) The goal is to determine the

  • ptimal stiffness and damping

coefficient of four bridge dampers (=8 parameters). Computation times:

◮ without reduced modeling: 545 for one function evaluation, 70 function

evaluations needed.

◮ with reduced model to evaluate g and ∇g: 879 sec. ◮ trust region method: 200 sec. ◮ penalty method: 189 sec. (KU Leuven) Research overview May 6, 2015 7 / 20

slide-10
SLIDE 10

Minimal maximal eigenvalue

Parametric eigenvalue problem A(x)u = λB(x)u with x ∈ H ⊂ Rm

−15 −10 −5 5 10 15 20 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4 λ1(x) θ1(x) l(x;v1) l(x;v2)

A and B symmetric and B positive definite for x ∈ H Largest eigenvalue is a quasi convex function Our numerical method (Jeroen De Vlieger):

◮ subspace projection method (= bundle method) ◮ Solve the projected problem by any method for non-smooth problems or

a method that approximates the solution by sequence of a 1D problems

◮ Provably convergent (KU Leuven) Research overview May 6, 2015 8 / 20

slide-11
SLIDE 11

Minimal maximal eigenvalue

(KU Leuven) Research overview May 6, 2015 9 / 20

slide-12
SLIDE 12

Parametric eigenvalue problems for stability analysis

Find parameter values p so that A(p)x = λBx has purely imaginary eigenvalues λ. Is translated to (A(p) ⊗ B + B ⊗ A(p))z = 0 Exploit the specific Kronecker structure for efficient solvers Appears to be as efficient as solving eigenvalues of A(p) for fixed p Much more efficient than continuation Work with Spence, Elman, Voss, Schroeder, Vandebril

(KU Leuven) Research overview May 6, 2015 10 / 20

slide-13
SLIDE 13

Nonlinear eigenvalue problems

Nonlinear eigenvalues problems arise in many applications: A(λ)x = 0 Classical iteration schemes converge to one eigenvalue at a time Approximate A(λ) ≃ A0 + λA1 + λ2A2 + · · · Apply Arnoldi’s method to the Companion linearization: several eigenvalues converge at the same time With Michiels, Jarlebring, Van Beeumen

20 40 60 80 100 10

−20

10

−10

10 |λ−λ*| −6 −4 −2 −10 −5 5 10 imag exact eigenvalues Reciprocal Ritz values

(KU Leuven) Research overview May 6, 2015 11 / 20

slide-14
SLIDE 14

Nonlinear eigenvalue problems

Rational Krylov method: pole can change Approximate A(λ) by Newton interpolation in σ1, . . . , σk A(λ) ≃ A0n0(λ) + A1n1(λ) + · · · + Aknk(λ) nj =

j

  • i=1

(λ − σi) Rational Krylov method with starting vector v corresponds to moment matching: A(σ1)−1v, A(σ2)−1v, . . . , A(σk)−1v are contained in the subspace.

(KU Leuven) Research overview May 6, 2015 12 / 20

slide-15
SLIDE 15

Nonlinear eigenvalue problems

Gun problem: F(λ)x = 0 =

  • K − λM + i
  • λ − σ2

1W1 + i

  • λ − σ2

2W2

  • x

Compute eigenvalues near 5 poles.

(KU Leuven) Research overview May 6, 2015 13 / 20

slide-16
SLIDE 16

Incomplete multifrontal factorization

Large sparse linear system Ax = b where A is given in finite element format: A =

  • e∈E

PeAePT

e

◮ Ae is dense (full) matrix ◮ Pe selects degrees of freedom associated with the element

Multilevel preconditioning using element structure instead of sparse. Dk ≡ , , , Dk ≡ diag(, , , ) IMF (Nick Van Nieuwenhoven)

(KU Leuven) Research overview May 6, 2015 14 / 20

slide-17
SLIDE 17

Incomplete multifrontal factorization

Advantages

◮ Use dense matrices: high throughput ◮ More efficient preconditioner than other multilevel techniques

2D Navier-Stokes equation, flow around obstacle, Reynolds number 4000, 16 128 elems (Q1-Q1), 49 440 unknowns, 1 243 530 nnzs.

10−06 10−05 10−04 10−03 10−02 10−01 10+00 10+01 10+02 20 40 60 80 100 120 140 160 Norm of residual Execution time (s) ILUT(0.6) ILU(5) ARMS(0.1) IMF(2) IMF(4)

(KU Leuven) Research overview May 6, 2015 15 / 20

slide-18
SLIDE 18

Sequentially truncated HOSVD

A ≈ ( ˆ U1, ˆ U2, ˆ U3) · ˆ S A ≈ ˆ U1 ˆ U2 ˆ U

3

ˆ S Rank (r1, r2, r3) orthogonal Tucker approximation to A Columns of      ˆ U1 ∈ Rn1×r1 ˆ U2 ∈ Rn2×r2 ˆ U3 ∈ Rn3×r3 can be extended to a basis of      Rn1 Rn2 Rn3

(KU Leuven) Research overview May 6, 2015 16 / 20

slide-19
SLIDE 19

Compression of simulation results I

Results for Nick Vannieuwenhoven’s ST-HOSVD. Compression of numerical solution of the heat equation on a square domain computed by explicit Euler. Tensor of size 101 × 101 × 10001 (x × y × t). Partially symmetric, 102.0 million non-zeros. T-HOSVD and ST-HOSVD truncated to absolute error of 10−4.

(KU Leuven) Research overview May 6, 2015 17 / 20

slide-20
SLIDE 20

Compression of simulation results II

T-HOSVD ST-HOSVD

  • Abs. error

8.512 · 10−5 9.587 · 10−5 Rank (22, 22, 20) (22, 21, 19) T-HOSVD ST-HOSVD Storage (nb. of values) 214144 203140 Factorization time 2h 46m 1m 14.7s 133x speedup!

(KU Leuven) Research overview May 6, 2015 18 / 20

slide-21
SLIDE 21

Flanders ExaScience Lab

Project to give hints for future (exascale) HPC hardware Partners: all Flemish universities, imec, intel, later Janssen pharma Power requirements:

◮ Exascale machine limited by 25MW ◮ With Blue Gene technology, 593MW (Nuclear power plant: 1008MW) ◮ Energy budget: ⋆ Reduce clock speed (= lower voltage) ⋆ Increase number of cores ⋆ Therefore higher power efficiency (= ops/watt) ⋆ Less fault correction hardware ⋆ Higher chances of hardware errors to be solved by software

Work done:

◮ Support for Space Weather simulation: Adaptive Mesh Refinement,

Particle in Cell.

◮ Iterative linear system solvers ◮ Load balancing ◮ Tensor computations for parameter estimation for PDE and for

machine learning

(KU Leuven) Research overview May 6, 2015 19 / 20

slide-22
SLIDE 22

Latency hiding in Krylov methods

Communication avoidance (Demmel et al.): perform s Krylov steps within one bulk synchronisation/communication step. Classical Krylov:

p1 p2 p3 p4 SpMV local dot

+ + +

reduction b-cast axpy Gram-Schmidt norm

+ + +

reduction b-castscale Normalization H update

Latency hiding:

p1 p2 p3 p4 H update

+ + +

reduction bcastscale axpy correction local dot (KU Leuven) Research overview May 6, 2015 20 / 20