Designing and Evaluating MPI-2 Dynamic Process Management Support for InfiniBand
Tejus Gangadharappa, Matthew Koop and
- Dhabaleswar. K. (DK) Panda
Designing and Evaluating MPI-2 Dynamic Process Management Support - - PowerPoint PPT Presentation
Designing and Evaluating MPI-2 Dynamic Process Management Support for InfiniBand Tejus Gangadharappa, Matthew Koop and Dhabaleswar. K. (DK) Panda Computer Science & Engineering Department The Ohio State University Outline Motivation
Inter-Communicator Creation
1 2 3 4 Initial Process group *0 *1 *2 *3 *4 Spawned Process group Parent root Child root
MPI Application Dynamic Process Interface Startup Spawn Scheduling Communication MPI Communication Point-to-Point One-Sided Collectives
Parent Process group Spawned Process group
MPI_Init MPI_Comm_spawn MPI_Comm_accept MPI_Init
MPI_Comm_get_parent
MPI_Comm_connect Process group information exchange Inter-Communicator Creation
– Open-source MPI project for InfiniBand and 10GigE/iWARP – Empowers many TOP500 systems – Used by more than 975 organizations in 51 countries – Available as a part of OFED and from many vendors and Linux Distributions (RedHat, SuSE, etc.) – http://mvapich.cse.ohio-state.edu
– http://mvapich/cse.ohio-state.edu/benchmarks/
0.00 5.00 10.00 15.00 20.00 25.00 30.00 35.00 40.00 1 2 4 8 16 32 64 128 256 512
Latency (usec) Number of Processes
MV2-MPD-RC MV2-MPD-UD MV2-mpirun_rsh-RC MV2-mpirun_rsh-UD OpenMPI
Cyclic Rank Allocation
however mpirun_rsh performs better as job size increases
Block Rank Allocation
0.00 5.00 10.00 15.00 20.00 25.00 30.00 35.00 40.00 1 2 4 8 16 32 64 128 256 512
Latency (usec) Number of Processes
MV2-MPD-RC MV2-MPD-UD MV2-mpirun_rsh-RC MV2-mpirun_rsh-UD OpenMPI
time
2 4 6 8 10 12 1 2 4 8 16 32 64 128 256 512
Spawn Rate Number of Processes
MV2-MPD-RC MV2-MPD-UD MV2-mpirun_rsh-RC MV2-mpirun_rsh-UD OpenMPI
the higher cost of creating and destroying RC queue pairs
jobs due to the higher initial overhead in the later case
with increasing job size.
10 20 30 40 50 60 70 80 1 4 16 64 256 1024 4096 16384 65536
Latency (usec) Number of Processes
MV2-Intra MV2-Inter OpenMPI-Intra OpenMPI-Inter
rendezvous threshold values)
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 2 4 8 16 32 64
Application Run-time (s) Number of Processes
MV2-MPD-RC MV2-MPD-UD MV2-mpirun_rsh-RC2 MV2-mpirun_rsh-UD Traditional(MV2)
– Latest version is MVAPICH2 1.4RC2 – Downloadable from http://mvapich.cse.ohio-state.edu
Future Work:
based on job size
dynamic process interface