1 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010 Graham_CAC_2010
1 Managed by UT-Battelle for the Department of Energy
Hierarchy Aware Blocking and Nonblocking Collective - - PowerPoint PPT Presentation
Hierarchy Aware Blocking and Nonblocking Collective Communications-The Effects of Shared Memory Communications in the Cray XT Environment Richard L. Graham, Joshua S. Ladd, Manjunath Venkata 1 Managed by UT-Battelle 1 Managed by
1 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010 Graham_CAC_2010
1 Managed by UT-Battelle for the Department of Energy
2 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
3 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
4 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
5 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
6 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
! " # $ %&'()**+,-./ 012) 3'&/ 4'225.1(-61'. ,)75(61'. 8*)+,)*596 $ :
! " # $ %&'()**+,-./ 012) 3'&/ 4'225.1(-61'. ,)75(61'. 8*)+,)*596 $ : ;'1*)
Ideal Algorithm Impact of System Noise
7 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
Offloaded Algorithm Nonblocking Algorithm
! " # $ %&'()**+,-./ 012) 3'&/ 4'225.1(-61'. ,)75(61'. 8*)+,)*596 :)9);-61'.+<;).6 $ = ! " # $ %&'()**+,-./ 012) 3'&/ 4'225.1(-61'. ,)75(61'. 8*)+,)*596 :)9);-61'.+<;).6 $ =
8 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
9 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
1 2 3 4 1 2 3 4 1 2 3 4 Host 1 Host 2 Inter Host Communication Step 1 Step 2
10 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
1 2 3 4 1 2 3 4 1 2 3 4 Host 1 Host 2 Inter Host Communication Step 1 Step 2 1 2 3 4 Step 3
11 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
12 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
Basic Collectives (bcol) Framework Subgroup Framework
IB OFFLOAD
Pt2Pt SM NUMA IBNET MUMA Collective Framework Tuned (pt2pt) Collectives Comp. MLNX OFED ML – Hierarchical Collectives Comp. Module Component Architecture OMPI
13 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
14 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
15 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
1 2 3 4 5 6 7 8 9 2 4 6 8 10 12
Latency of the Barrier (usecs)
Shared Memory pt-2-pt
16 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
2 4 6 8 10 12 2 4 6 8 10 12 14 16
Latency of the Barrier (usecs)
Processes
Shared Memory pt-2-pt
17 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
Processes on Different Sockets Processes on Same Socket
1 1.5 2 2 Processes 4 Latency of the Barrier (usecs) 0.5
18 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
Processes on Different Sockets Processes on Same Socket
1 1.5 2 2 Processes 4 Latency of the Barrier (usecs) 0.5
19 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010
Message Traffic between Sockets Message Traffic within Socket
1 1.5 2 4 Processes Latency of the Barrier (usecs) 0.5
20 Managed by UT-Battelle for the Department of Energy
Graham_CAC_2010