Sparse direct solvers on top of runtime systems
ANR SOLHAR
E. Agullo, G. Bosilca, A. Buttari, A. Guermouche and F. Lopez
Université de Toulouse-IRIT
ANR SOLHAR meeting 2014
The multifrontal QR method
be assembled into the father’s front
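The assembly step mentioned above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes that, as is usual in multifrontal QR, the blocks coming from the children are stacked row-wise into the father's front, with each block's columns mapped to the father's column positions. The function name `assemble_front` and the data layout (lists of rows plus global column indices) are hypothetical.

```python
def assemble_front(parent_cols, blocks):
    """Stack child blocks row-wise into a parent (father) front.

    parent_cols: global column indices of the parent front, in order.
    blocks: list of (cols, rows) pairs, where cols are the global column
            indices of the block and rows is a list of row value lists.
            Every index in cols is assumed to appear in parent_cols.
    """
    # Map each global column index to its local position in the parent.
    pos = {c: j for j, c in enumerate(parent_cols)}
    front = []
    for cols, rows in blocks:
        for r in rows:
            new_row = [0.0] * len(parent_cols)
            for c, v in zip(cols, r):
                new_row[pos[c]] = v
            front.append(new_row)
    return front


# Two hypothetical child blocks sharing global column 5.
cb1 = ([2, 5], [[1.0, 2.0], [0.0, 3.0]])
cb2 = ([5, 7], [[4.0, 5.0]])
front = assemble_front([2, 5, 7], [cb1, cb2])
```

Note that no values are summed here: rows are only scattered and stacked, which is the assumption that distinguishes this sketch from the extend-add used in multifrontal LU or Cholesky.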
more concurrency, more complex dependencies, more tasks
memory transfers, CUDA kernels management
MPI layer
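To make the dependency-tracking part concrete, here is a minimal sketch of how a runtime can infer task dependencies from data accesses. It assumes the sequential task flow model (used, for instance, by StarPU): tasks are submitted in program order with the data they read and write, and the runtime derives the dependency graph. The function `infer_dependencies`, the task names and the data handles are hypothetical and are not taken from the slides.

```python
def infer_dependencies(tasks):
    """Infer dependencies from tasks given in submission order.

    tasks: list of (name, reads, writes), where reads and writes are
           lists of data handles (any hashable identifiers).
    Returns a dict mapping each task name to the set of task names
    it must wait for.
    """
    last_writer = {}  # handle -> last task that wrote it
    readers = {}      # handle -> tasks that read it since the last write
    deps = {}
    for name, reads, writes in tasks:
        d = set()
        # Read-after-write: wait for the last writer of each input.
        for h in reads:
            if h in last_writer:
                d.add(last_writer[h])
        # Write-after-write and write-after-read on each output.
        for h in writes:
            if h in last_writer:
                d.add(last_writer[h])
            d.update(readers.get(h, ()))
        # Record this task's accesses for later submissions.
        for h in reads:
            readers.setdefault(h, []).append(name)
        for h in writes:
            last_writer[h] = name
            readers[h] = []
        deps[name] = d
    return deps


# Hypothetical example: one panel task followed by two updates,
# then a second panel on a block produced by the first update.
deps = infer_dependencies([
    ("p1", [], ["c1"]),
    ("u2", ["c1"], ["c2"]),
    ("u3", ["c1"], ["c3"]),
    ("p2", [], ["c2"]),
])
```

In this example the two update tasks depend only on `p1` and can run concurrently, while `p2` must wait for `u2`.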
necessarily square) with flat, binary (communication avoiding) or hybrid panel reduction trees
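The three kinds of panel reduction tree named above can be sketched as elimination schedules. This is an illustration under stated assumptions: the tiles of a panel are numbered 0 to n-1, each pair `(i, j)` means "tile j is eliminated against tile i", and pairs in the same step are independent. The "hybrid" variant shown here (flat inside groups of `b` tiles, then binary across group heads) is one common definition and is an assumption, as are all function names.

```python
def flat_tree(n):
    """Flat tree: tile 0 eliminates tiles 1..n-1 one after another."""
    return [[(0, i)] for i in range(1, n)]


def binary_tree(n):
    """Binary tree: neighbouring survivors are paired at each step."""
    steps = []
    stride = 1
    while stride < n:
        steps.append([(i, i + stride)
                      for i in range(0, n - stride, 2 * stride)])
        stride *= 2
    return steps


def hybrid_tree(n, b):
    """Flat reduction inside groups of b tiles, binary across groups."""
    heads = list(range(0, n, b))
    steps = []
    # Flat reduction inside every group; groups proceed concurrently.
    for k in range(1, b):
        step = [(h, h + k) for h in heads if h + k < n]
        if step:
            steps.append(step)
    # Binary reduction across the surviving group heads.
    for step in binary_tree(len(heads)):
        steps.append([(heads[i], heads[j]) for i, j in step])
    return steps


flat = flat_tree(4)
binary = binary_tree(4)
hybrid = hybrid_tree(6, 3)
```

For four tiles the flat tree needs three sequential steps while the binary tree needs two, which reflects the shorter critical path of the binary variant.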
[Figure: three-node tree (nodes 1, 2, 3) and the associated task graphs, with tasks labelled a, p1 to p4, u2 to u4, s2 to s4 and c.]
[Figure: diagram with labels fi, r and c1, c2, ..., cj-1, cj.]
 #  Matrix         Gflops  Ordering
 1  LargeRegFile       19  Metis
 2  EternityII_A       39  Metis
 3  EternityII_E      107  Metis
 4  cont11_l          112  Metis
 5  sc205-2r          160  Metis
 6  cat_ears_4_4      184  Metis
 7  karted            335  Metis
 8  degme             558  Metis
 9  flower_7_4        724  Metis
10  hirlam           1112  Metis
11  e18              1286  Metis
12  Rucci1           5179  Metis
13  TF17            15663  Metis
14  sls             26363  Metis
@ 2.4 GHz, 4 × 6 cores
[Figure: plot comparing qrm_starpu and qrm_parsec, each in 1D and 2D variants, on the test matrices LargeRegFile, EternityII_A, EternityII_E, cont11_l, sc205-2r, cat_ears, karted, degme, flower, hirlam, e18, Rucci1, TF17 and sls; axis ticks from 5 to 30.]
structure)