Parallel Solution of Large-Scale Algebraic Bernoulli Equations with - PowerPoint PPT Presentation

✬ ✩ Parallel Solution of Large-Scale Algebraic Bernoulli Equations with the matrix Sign Function Method Enrique S. Quintana-Ort´ ı Depto. de Ingenier´ ıa y Ciencia de Computadores Universidad Jaume I de Castell´ on (Spain) quintana@icc.uji.es Joint work with: Sergio Barrachina (UJI), Peter Benner (TU-Chemnitz) Oslo - June 2005 ✫ ✪

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Algebraic Bernoulli Equation (ABE) Solve the equation: A T X + XA − XGX = 0 , where • A ∈ R n × n , • G ∈ R n × n is symmetric, • X ∈ R n × n is the sought-after solution. ✫ ✪ 1

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 What is the ABE? A degenerate Algebraic Riccati Equation (ARE): A T X + XA − XGX + Q = 0 , with Q = 0 , as well as a special case of the more general equation   k − 1 �  = 0 , L ( X ) + X A j X  j =1 where • L ( X ) is a linear operator, • A j ∈ R n × n , j = 1 , . . . , k − 1 . ✫ ✪ 2

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Interest? Dealing with dynamical linear time-invariant (LTI) systems: x (0) = x 0 , x ( t ) = Ax ( t ) + Bu ( t ) , ˙ t > 0 , • n state-space variables, i.e., n is the order of the system; • m inputs. Applications in systems and control theory: • Stabilization. • Model reduction. ✫ ✪ 3

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Interest? (Cont. I) Stabilization problem: Find u ( t ) s.t. the solution of x (0) = x 0 , x ( t ) = Ax ( t ) + Bu ( t ) , ˙ t > 0 , asymptotically converges to zero. The maximal solution X ∗ of the ABE A T X + XA − XBB T X = 0 , defines the feedback law u ( t ) = − B T X ∗ x ( t ) , t ≥ 0 , which stabilizes the system! ✫ ✪ 4

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Interest? (Cont. II) Large-scale unstable dynamical LTI systems [Kamon, Wang, White’98]: RLC cicuits VLSI chip design Large-scale means n as large as 10 3 − 10 4 ! ✫ ✪ 5

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Numerical Solvers Specialized cases of ARE solvers [Mehrmann’91]: Compute a basis U T V T � T , U, V ∈ R n × n , � for the invariant subspace of the 2 n × 2 n matrix � A � G H := 0 − A T associated with the eigenvalues of H in the open left half plane. Then, the stabilizing solution of the ABE (provided it exists) can be computed by X ∗ := − V U − 1 . ✫ ✪ 6

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Numerical Solvers (Cont. I) Computing the appropriate basis: • Compute the real Schur form of H (actually, just A ) and reorder the eigenvalues. • Use a spectral projector such as the matrix sign function [Roberts, 80]. Computational and storage costs O ( n 3 ) and O ( n 2 ) , respectively. = ⇒ Need for parallel computing! ✫ ✪ 7

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Numerical Solvers (Cont. II) • Real Schur form of H via the QR algorithm: – Implicitly iterative. – Composed of fine grain operations. – Hard to parallelize, but doable [ScaLAPACK’97]. – Parallel reordering procedure not available :-( • Matrix sign function: – Explicitly iterative. – Claim: Easy to parallelize. The convergence of the methods depends on the problem: = ⇒ Difficult to infer general results! ✫ ✪ 8

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Sign Function Method The classical Newton iteration for the matrix sign function Z k +1 ← 1 2( Z k + Z − 1 Z 0 ← Z, k ) , k = 0 , 1 , 2 , . . . , � A � G when applied to H := boils down to 0 − A T � � c k A k + c k A − 1 1 1 A 0 ← A, A k +1 ← , k 2 � � c k G k + c k A − 1 k G k A − T 1 1 G 0 ← G, G k +1 ← , k 2 k = 0 , 1 , 2 , . . . . ✫ ✪ 9

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Sign Function Method (Cont. I) Scaling for acceleration of convergence: • Determinantal: c k := | det( H k ) | 1 /n . • Optimal norm: � � H k � 2 c k := . � H − 1 k � 2 • Approximate optimal norm: � � � 2 � A k � 2 F + � G k � 2 � H k � F � F c k := = . � � H − 1 � � k � F 2 � A − 1 F + � A − 1 k G k A − T k � 2 k � 2 F ✫ ✪ 10

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Sign Function Method (Cont. II) Convergence criterion: • Stop when � A k +1 − A k � F ≤ τ · � A � F , where τ is a tolerance threshold. • Use τ = √ ε , with ε as the machine precision, and perform 1–3 additional iterations once this criterion is satisfied. At convergence, after ¯ k iterations, solve the full-rank linear least-squares problem � A ¯ � � � G ¯ k + I n k X = . I n − A T 0 n ¯ k ✫ ✪ 11

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Operation Costs • Matrix factorization and triangular linear systems solves. . . A k + A − 1 A k +1 ← 1 � � , k 2 G k + A − 1 k G k A − T G k +1 ← 1 � � . k 2 Cost: 6 n 3 flops per iteration. • Full-rank least squares problem via QR factorization: � A ¯ � � � G ¯ k + I n k X = . I n − A T 0 n ¯ k 3 n 3 flops. Cost: 14 ✫ ✪ 12

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Implementation Easy parallelization using, e.g., ScaLAPACK. Computing the iterates: A k +1 ← 1 G k +1 ← 1 A k + A − 1 G k + A − 1 k G k A − T � � � � , . k k 2 2 • LU factorization of A , followed by triangular linear system solve, and matrix inversion from LU factors. • A − 1 needs to be computed anyway! Invert A first and then perform two matrix products. ✫ ✪ 13

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Implementation (Cont. I) Matrix inversion: • LU factorization, triangular matrix inversion, and triangular linear system solve. ı 2 , Sun, van • Direct inversion via Gauss-Jordan elimination [Quintana-Ort´ de Geijn’01]. – Well suited for distributed memory machines. – Cyclic distribution not necessary for load balance. – All computations in rank- k update: High performance. ✫ ✪ 14

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Numerical Experiments Unstable systems in the ARE benchmark [Benner, Laub, Mehrmann’95] (AREb) and others. Stabilization of ( ˆ A, B ) = ( A + δI n , B ) . Test: • Relative residual R 1 ( X ∗ ) := � ˆ A T X ∗ + X ∗ ˆ A − X ∗ BB T X ∗ � 1 . � X ∗ � 1 • Stabilized closed-loop A − BB T X ∗ ? ✫ ✪ 15

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Numerical Experiments (Cont. I) Example n δ Iter. R 1 ( X ∗ ) Stab.? Observ. AREb 1 2 1.0e-4 4 4.9e-28 Yes AREb 2 2 0.0 5 5.8e-15 Yes AREb 7 2 0.0 5 0.0e+00 Yes AREb 9 2 1.0 4 1.3e-15 Yes AREb 10 2 0.0 5 8.5e-09 Yes AREb 11 2 0.0 5 1.0e-15 Yes AREb 12 3 0.0 7 3.3e-09 Yes AREb 13 4 1.0e-8 7 2.9e-10 Yes AREb 14 4 0.0 5 5.5e-11 Yes AREb 15 39 1.0e-6 6 8.4e-11 Yes AREb 16 64 1.0e-4 17 1.2e-11 Yes AREb 17 21 1.0 8 5.1e-01 No All eig. at 0.0 AREb 19 60 1.0e-4 17 1.1e-14 Yes RLC 199 1.0e-6 31 1.1e-14 Yes Gen. system ✫ ✪ 16

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Experiments Double precision arithmetic using random unstable systems of order n . Two parallel platforms: • HPCx system with POWER4+ Regatta nodes: – 32 POWER4+ processor@1.7 GHz, with 1.5 Mbytes of L2 cache, and 32 Gbytes RAM. • Cluster of Intel processors: – 20 Intel Xeon@2.4 GHz, with 1 Gbyte of RAM, connected via Myrinet switches. ✫ ✪ 17

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Experiments (Cont. I) Reduction in execution time for ABE of dimension n =3200. ✫ ✪ 18

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Experiments (Cont. II) Speed-up for ABE of dimension n =3200. n p 2 4 6 8 10 12 HPCx 1.92 3.31 4.75 6.41 7.30 9.38 Intel cluster 1.49 2.62 3.47 3.93 4.90 4.89 Efficiency on Linux cluster quite more reduced! ✫ ✪ 19

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Experiments (Cont. III) � Scalability for ABEs of dimension n/ ( n p ) =3200. ✫ ✪ 20

✬ ✩ Parallel solution of large-scale ABEs with the matrix sign function Oslo - June 2005 Parallel Experiments (Cont. IV) Impact of matrix inversion procedure (not included in ABE solvers): ✫ ✪ 21

Parallel Solution of Large-Scale Algebraic Bernoulli Equations with - PowerPoint PPT Presentation

Parallel Solution of Large-Scale Algebraic Bernoulli Equations with the matrix Sign Function Method Enrique S. Quintana-Ort Depto. de Ingenier a y Ciencia de Computadores Universidad Jaume I de Castell on (Spain)

Binomial and Normal Distributions Bernoulli Trials A Bernoulli trial is a random experiment with

Special Discrete Distributions Bernoulli Distribution A Bernoulli trial is an experiment with

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

Bernoulli numbers and the unity of mathematics B. Mazur (Very rough notes for the Bartlett

2. Independence and Bernoulli Trials (Euler, Ramanujan and Bernoulli Numbers) Independence :

Bernoulli Equations Bernd Schr oder logo1 Bernd Schr oder Louisiana Tech University,

Ergodicity and type of nonsingular Bernoulli actions Richard Kadison and his mathematical legacy

Ch5: Special Discrete Distributions 5.1 Bernoulli and binomial random variables The sample

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

2/17/2017 Continued from yesterday >java RealQueen 5 SOLUTION: 1 3 5 2 4 SOLUTION: 1 4 2 5

The Tricentenary of the Weak Law of Large Numbers. Eugene Seneta presented by Peter Taylor July

Parallel Hybrid Solution with PHT Parallel Hybrid Solution Pourquoi envisager une vritable

A. Operations with algebraic Algebra practice part 1 expressions 3 4 A. Operations with

Parallel Processing in Algebraic Number Theory Bill Hart February 1, 2007 Bill Hart Parallel

13. The Weak Law and the Strong Law of Large Numbers James Bernoulli proved the weak law of large

Bernoulli convolutions associated with some algebraic numbers De-Jun Feng The Chinese University

Optimization of nonlocal distributed feedback controllers with time delay F . Trltzsch

CENG4480 Lecture 10: Clock Bei Yu byu@cse.cuhk.edu.hk (Latest update: December 3, 2020) Fall

The Next Generation of Microservices Phil Calado http://philcalcado.com @pcalcado We can

1 The induced emf in a closed coil of wire, according to Faradays Law, is proportional to: I.

Access to Event Object event variable (HTML): <div

Embedding a chat widget within the SFX menu CARLI SFX

Scoreboard R EX R ADD IR Issue ADDR MEM R WB R DIV MULT R 1 1 Scoreboard Algorithm

Thoughts on the optimization of the VLENF VLENF meeting at Fermilab, USA September 1, 2011

Parallel Solution of Large-Scale Algebraic Bernoulli Equations with - PowerPoint PPT Presentation

Parallel Solution of Large-Scale Algebraic Bernoulli Equations with the matrix Sign Function Method Enrique S. Quintana-Ort Depto. de Ingenier a y Ciencia de Computadores Universidad Jaume I de Castell on (Spain)

Binomial and Normal Distributions Bernoulli Trials A Bernoulli trial is a random experiment with

Special Discrete Distributions Bernoulli Distribution A Bernoulli trial is an experiment with

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

Bernoulli numbers and the unity of mathematics B. Mazur (Very rough notes for the Bartlett

2. Independence and Bernoulli Trials (Euler, Ramanujan and Bernoulli Numbers) Independence :

Bernoulli Equations Bernd Schr oder logo1 Bernd Schr oder Louisiana Tech University,

Ergodicity and type of nonsingular Bernoulli actions Richard Kadison and his mathematical legacy

Ch5: Special Discrete Distributions 5.1 Bernoulli and binomial random variables The sample

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

2/17/2017 Continued from yesterday &gt;java RealQueen 5 SOLUTION: 1 3 5 2 4 SOLUTION: 1 4 2 5

The Tricentenary of the Weak Law of Large Numbers. Eugene Seneta presented by Peter Taylor July

Parallel Hybrid Solution with PHT Parallel Hybrid Solution Pourquoi envisager une vritable

A. Operations with algebraic Algebra practice part 1 expressions 3 4 A. Operations with

Parallel Processing in Algebraic Number Theory Bill Hart February 1, 2007 Bill Hart Parallel

13. The Weak Law and the Strong Law of Large Numbers James Bernoulli proved the weak law of large

Bernoulli convolutions associated with some algebraic numbers De-Jun Feng The Chinese University

Optimization of nonlocal distributed feedback controllers with time delay F . Trltzsch

CENG4480 Lecture 10: Clock Bei Yu byu@cse.cuhk.edu.hk (Latest update: December 3, 2020) Fall

The Next Generation of Microservices Phil Calado http://philcalcado.com @pcalcado We can

1 The induced emf in a closed coil of wire, according to Faradays Law, is proportional to: I.

Access to Event Object event variable (HTML): &lt;div

Embedding a chat widget within the SFX menu CARLI SFX

Scoreboard R EX R ADD IR Issue ADDR MEM R WB R DIV MULT R 1 1 Scoreboard Algorithm

Thoughts on the optimization of the VLENF VLENF meeting at Fermilab, USA September 1, 2011

2/17/2017 Continued from yesterday >java RealQueen 5 SOLUTION: 1 3 5 2 4 SOLUTION: 1 4 2 5

Access to Event Object event variable (HTML): <div