Design challenges of High- performance and Scalable MPI over InfiniBand
Presented by
Design challenges of High- performance and Scalable MPI over - - PowerPoint PPT Presentation
Design challenges of High- performance and Scalable MPI over InfiniBand Presented by Karthik Presentation Overview In depth analysis of High-Performance and scalable MPI with Reduced Memory Usage Zero Copy protocol using Unreliable
Presented by
High Performance and Scalable MPI with Reduced Memory usage
High Performance and Scalable MPI with Reduced Memory usage IB provides several types of transport services –
Upper level software service High Performance and Scalable MPI with Reduced Memory usage
Shared Receive Queue
(even for connection oriented transport)
High Performance and Scalable MPI with Reduced Memory usage
Remote Direct Memory Access (RDMA)
High Performance and Scalable MPI with Reduced Memory usage
MVAPICH Design Overview
MVAPICH uses two major protocols –
receiver side
High Performance and Scalable MPI with Reduced Memory usage
1 . Adaptive RDMA with Send/Receive
increase, this channel is adaptive.
transferred using RDMA.
High Performance and Scalable MPI with Reduced Memory usage
High Performance and Scalable MPI with Reduced Memory usage
High Performance and Scalable MPI with Reduced Memory usage
NAS Benchmark
High Performance and Scalable MPI with Reduced Memory usage
High Performance Linpack
the world’s fastest supercomputers
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
linearly with the number of connected processes. Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Traditional Zero-Copy
matches it with the posted receive operations.
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
UD vs RC memory usage
For 16k connections – UD = 40 MB / process RC = 240 MB / process
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Challenges for true zero copy design
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Proposed Design
transport
transmit to a QP at a single time.
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Solutions to design challenges
receiver’s end and the sender is notified.
the application receive buffer is posted to it.
to the receiver as part of the completion entry.
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Experimental Evaluation
Ping Pong Latency
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram Uni-Directional Bandwidth
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram Bi-Directional Bandwidth
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
This paper seeks to address two mains questions -
How well do they perform at scale ?
dynamically select protocols to optimized for performance and scalability ?
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand IB provides several types of transport services –
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Message Channel
Eager Protocol Channel
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Message Channel
Rendezvous Protocol Channel
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Channel Evaluation
Performance : Eager Latency
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Channel Evaluation
Performance : Uni-Directional Bandwidth
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Channel Evaluation
Scalability Test : Memory Usage
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Channel Evaluation
Scalability Test : Latency
MVAPICH-Aptus : Scalable High-Performance Multi-Transport MPI over InfiniBand
Channel Characteristics Summary
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Overview of Design
to achieve performance and scalability.
for performance as well as scalability. Design Challenges
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Channel Allocation
different message size
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Channel Usage
level to meet application needs without changing the code within MPI library.
Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram
Performance Evaluation