17th International Conference on Domain Decomposition Methods
Mark F. Adams
- St. Wolfgang/Strobl Austria- 3 July 2006
Algebraic multigrid methods for mechanical engineering applications - - PowerPoint PPT Presentation
Algebraic multigrid methods for mechanical engineering applications Mark F. Adams St. Wolfgang/Strobl Austria - 3 July 2006 17th International Conference on Domain Decomposition Methods 0 Outline Algebraic multigrid (AMG) Coarse grid
17th International Conference on Domain Decomposition Methods
17th International Conference on Domain Decomposition Methods 1
17th International Conference on Domain Decomposition Methods 2
smoothing
Finest Grid
Prolongation (P=RT)
The Multigrid V-cycle
First Coarse Grid
Restriction (R) Note: smaller grid
17th International Conference on Domain Decomposition Methods 3
(Schwarz)
problematic
Algebraic coarse grid (Galerkin) AH = RAhP
17th International Conference on Domain Decomposition Methods 4
17th International Conference on Domain Decomposition Methods 5
Start with kernel vectors B of operator
eg, 6 RBMs in elasticity
Nodal aggregation B P0
One Jacobi iteration: P ( I - ω D-1 A ) P0
17th International Conference on Domain Decomposition Methods 6
17th International Conference on Domain Decomposition Methods 7
17th International Conference on Domain Decomposition Methods 8
hide communication
Example: 2D, 4 proc
17th International Conference on Domain Decomposition Methods 9
Cray T3E - 24 Processors – About 30,000 dof Per Processor
17th International Conference on Domain Decomposition Methods 10
17th International Conference on Domain Decomposition Methods 11
17th International Conference on Domain Decomposition Methods 12
17th International Conference on Domain Decomposition Methods 13
17th International Conference on Domain Decomposition Methods 14
17th International Conference on Domain Decomposition Methods 15
17th International Conference on Domain Decomposition Methods 16
17th International Conference on Domain Decomposition Methods 17
17th International Conference on Domain Decomposition Methods 18
Cortical bone Trabecular bone 5-mm Cube
FE mesh generation
17th International Conference on Domain Decomposition Methods 19
17th International Conference on Domain Decomposition Methods 20
Athena: Parallel FE ParMetis
Parallel Mesh Partitioner
(Univerisity of Minnesota)
Prometheus
Multigrid Solver
FEAP
Serial general purpose FE application (University of
California)
PETSc
Parallel numerical libraries
(Argonne National Labs)
µFE Mesh Input File
Athena
ParMetis
FE input file (in memory) FE input file (in memory) Partition to SMPs
Athena Athena
ParMetis
File File File File
FEAP FEAP FEAP FEAP
Material Card
Silo DB Silo DB Silo DB Silo DB
Visit
Prometheus
PETSc ParMetis
METIS METIS METIS METIS
pFEAP
Olympus
17th International Conference on Domain Decomposition Methods 21
17th International Conference on Domain Decomposition Methods 22
80 µm w/ shell
17th International Conference on Domain Decomposition Methods 23
80 µm w/o shell
17th International Conference on Domain Decomposition Methods 24
17th International Conference on Domain Decomposition Methods 25
17th International Conference on Domain Decomposition Methods 26
2 70 26 36 11 5 19 22 20 14 5 5 2 70 26 36 11 5 19 22 20 14 5 6 2 70 26 36 11 5 19 22 20 14 5 4 2 70 26 36 11 5 19 22 20 14 5 3 2 70 26 36 11 5 20 20 20 14 5 2 2 70 25 35 11 5 18 21 20 14 5 1 6 5 4 3 2 1 5 4 3 2 1 Large (537M dof) Small (7.5M dof) Newton Load
17th International Conference on Domain Decomposition Methods 27
2.78 1.00 model 74 76 Flop rate 2.61 1.00 Measured run time 33.0K 19.3K #elems/pr 68 50 #nnz/row 897 450 #iteration 537M dof 7.5M dof
17th International Conference on Domain Decomposition Methods 28
17th International Conference on Domain Decomposition Methods 29
17th International Conference on Domain Decomposition Methods 30
17th International Conference on Domain Decomposition Methods 31
17th International Conference on Domain Decomposition Methods 32
17th International Conference on Domain Decomposition Methods 33
1) Use Identity: I
+ +
I P C C K I R P C RC P RK C C K
i i i i i i i
T i T i
1 T 1 1
+ + + + + + + + + + + + i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 T 1 1
T i T i
P P C C K R R P C R P C R P K R C C K
2) Constraint coarsening:
i i 1 +
R
+ +
C C K
1 T 1 1 i i i
i
i i
T i
T i i
17th International Conference on Domain Decomposition Methods 34
2 1 1 3 2
17th International Conference on Domain Decomposition Methods 35
l T l
+1
T
T [I,J] = PiIT ijP jJ =
ij iI , j J
iI, j J T ij
TClP
T P
TT P
17th International Conference on Domain Decomposition Methods 36
17th International Conference on Domain Decomposition Methods 37
C K
T =
C K
C K I
T 1
T 1C
CK S
C K
T
=
1
S C K
C K I
T 1
17th International Conference on Domain Decomposition Methods 38
constraints
17th International Conference on Domain Decomposition Methods 39
T (Ri A Ri T)-1 Ri, exact subdomain solves
17th International Conference on Domain Decomposition Methods 40
1) GMRES / KKT-AMG
2) Uzawa outer iterations / CG inner iterations
17th International Conference on Domain Decomposition Methods 41
17th International Conference on Domain Decomposition Methods 42
17th International Conference on Domain Decomposition Methods 43
# domains (processors) Iteration counts Iteration counts 106 106 127 127 122 122 100 100 Uzawa 32 32 35 35 33 33 30 30 ILU 31 31 37 37 34 34 35 35 CCS (multiplicative) 42 42 47 47 43 43 47 47 CCS (additive) 47 47 52 52 57 57 51 51 Segregated 8 4 2 1 Smoothers
17th International Conference on Domain Decomposition Methods 44
17th International Conference on Domain Decomposition Methods 45
355 (304) 238 1057 Uzawa 1025 (304) 607 2096 ILU 182 (306) 257 910 CCS(mult) 654 (313) 255 1400 CCS (add) 293 (309) 259 1060 Segregated solve (# of solves) setup end-to-end Smoothers
17th International Conference on Domain Decomposition Methods 46
2 54 8 4 10 28 4 4 284 9 5 8 29 3 17 8016 8 4 7 29 2 L.M primal Segre- gated CCS (mult.) CCS (add.) Uzawa Levels Dof (approx.) Iterations (1st solve)
17th International Conference on Domain Decomposition Methods 47
Ultrascalable implicit finite element analyses in solid mechanics with over a half a billion degrees of freedom M.F. Adams, H.H. Bayraktar,T.M. Keaveny,
ACM/IEEE Proceedings of SC2004: High Performance Networking and Computing
17th International Conference on Domain Decomposition Methods 48
17th International Conference on Domain Decomposition Methods 49
17th International Conference on Domain Decomposition Methods 50
plateau 64-128 nodes
switch Bisection bandwidth
17th International Conference on Domain Decomposition Methods 51
MG-V(A,f)
MG-V(PTAP, rH )
recursion (Galerkin)
17th International Conference on Domain Decomposition Methods 52
17th International Conference on Domain Decomposition Methods 53
28
1 13
Chebyshev
3 19
Chebyshev
2 27
damped Jacobi
2 53
damped Jacobi/Cheb.
1 10
red-black Gauss-Seidel
3 11
red-black Gauss-Seidel
2 20
red-black Gauss-Seidel
1 13
3 16
2 Iterations Smoother Order
17th International Conference on Domain Decomposition Methods 54
Micro-Computed Tomography µCT @ 22 µm resolution 3D image Mechanical Testing E, εyield, σult, etc. 2.5 mm cube 44 µm elements
µFE mesh
17th International Conference on Domain Decomposition Methods 55
17th International Conference on Domain Decomposition Methods 56
17th International Conference on Domain Decomposition Methods 57
anything (ie, constraint oblivious)
17th International Conference on Domain Decomposition Methods 58
Non-aggressive primal coarsening Aggressive primal coarsening
1 1 1 1 1 1 C
2 3 4 5 6
2D Model Problem
TC T =
1 1 1 1 1 1 1 1 1
1 1 1 1 1 1 P
T = CPP
TC T =
1 0 1 0 1
6X
17th International Conference on Domain Decomposition Methods 59