SLIDE 1
Open MPI on the Cray XT
Presented by Richard L. Graham and Galen Shipman
SLIDE 2
Open MPI Is…
- Open source project / community
- Consolidation and evolution of several prior MPI implementations
- All of MPI-1 and MPI-2
- Production quality
- Vendor-friendly
- Research- and academic-friendly
SLIDE 3
Current Membership
- 14 members, 9 contributors, 1 partner
− 4 US DOE labs
− 8 universities
− 10 vendors
− 1 individual
SLIDE 4
Some Current Highlights
- Production MPI on SNL’s Thunderbird
- Production MPI on LANL’s Roadrunner
- Work under way to bring it up on TACC’s Ranger
- The MPI used for the EU QosCosGrid: Quasi-Opportunistic Complex System Simulations on Grid
- Tightly integrated with VampirTrace (as of v1.3)
SLIDE 5
Modular Component Architecture
- Framework:
  − An API targeted at a specific task, e.g.:
    - PTP message management
    - PTP transfer layer
    - Collectives
    - Process startup …
- Component:
  − An implementation of a framework’s API
- Module:
  − An instance of a component
[Diagram: a user application calls into Open MPI, which hosts frameworks A, B, …; each framework contains components (Comp A, Comp B, …), and each component is instantiated as one or more modules (Mod A1, Mod A2, Mod B, …).]
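At run time, a component within a framework is selected through MCA parameters. A minimal sketch of such an invocation, choosing the ob1 point-to-point component and restricting the transfer layer to the shared-memory, Portals, and loopback components (./app is a placeholder executable):

    mpirun --mca pml ob1 --mca btl sm,portals,self -np 16 ./app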
SLIDE 6
Open MPI’s CNL Port
- Portals port from Catamount to CNL
  − Enhance the point-to-point BTL component
- ALPS support added
  − Process control components added for ALPS
  − mpirun wraps multiple calls to aprun to:
    - Support MPI-2 dynamic process control (sketched below)
    - Support recovery from process failure
    - Support an arbitrary number of procs per node (even oversubscription)
- Pick up full MPI-2.0 support
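A minimal sketch of the MPI-2 dynamic process support this enables, assuming a hypothetical worker executable named ./worker; on CNL, each such spawn is backed by a further aprun invocation from mpirun:

    #include <mpi.h>

    int main(int argc, char **argv)
    {
        MPI_Comm children;

        MPI_Init(&argc, &argv);

        /* Ask the runtime to start 4 additional processes. */
        MPI_Comm_spawn("./worker", MPI_ARGV_NULL, 4, MPI_INFO_NULL,
                       0, MPI_COMM_WORLD, &children,
                       MPI_ERRCODES_IGNORE);

        /* ... exchange data with the children over the resulting
           intercommunicator ... */

        MPI_Comm_disconnect(&children);
        MPI_Finalize();
        return 0;
    }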
SLIDE 7
Modular Component Architecture - Data Transfer
[Diagram: the user application calls Open MPI’s point-to-point framework, which selects among transfer components (TCP, InfiniBand, Portals, …); each component drives its devices: NIC 0 and NIC 1 for TCP, an HCA for InfiniBand, and the SeaStar for Portals.]
SLIDE 8
Process Startup on CNL - Start
[Diagram: mpirun invokes aprun, which starts a daemon on each allocated node; the daemons then launch the application processes.]
SLIDE 9
Process Startup on CNL - Spawn
[Diagram: on an MPI-2 spawn, mpirun issues a second aprun, starting daemons and the newly spawned application processes on additional allocated nodes alongside the original set.]
SLIDE 10
Features in Open MPI for Multi-Core Support
- Shared-memory point-to-point communications
  − On par with other network devices
  − Does not use any network resources
- Shared-memory collective optimizations
  − On-host-communicator optimization
  − Hierarchical collectives on the way
SLIDE 11
Hierarchical Collectives
- Exist in the code base (HLRS / University of Houston)
- Need to be tested with the new shared-memory module
- Need to be optimized (a plain-MPI sketch of the two-level idea follows)
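The idea behind these hierarchical collectives can be sketched with plain MPI-2 calls: reduce on-host first (through shared memory), let one leader per host cross the network, then fan the result back out. The hostname-hash split below is an assumption for illustration; the real modules discover the topology internally.

    #include <mpi.h>
    #include <unistd.h>

    /* Map each host name to a nonnegative color (illustrative only;
       hash collisions across hosts are ignored in this sketch). */
    static int host_color(void)
    {
        char name[256];
        unsigned h = 5381;
        char *p;
        gethostname(name, sizeof(name));
        for (p = name; *p; p++)
            h = h * 33u + (unsigned char)*p;
        return (int)(h & 0x7fffffff);
    }

    /* Two-level allreduce: on-host reduce, inter-host allreduce
       among the per-host leaders, then an on-host broadcast. */
    void hier_allreduce_sum(double *val, MPI_Comm comm)
    {
        MPI_Comm onhost, leaders;
        int rank, hrank;
        double partial = 0.0, result = 0.0;

        MPI_Comm_rank(comm, &rank);
        MPI_Comm_split(comm, host_color(), rank, &onhost);
        MPI_Comm_rank(onhost, &hrank);

        /* Stage 1: shared-memory reduction, no network traffic. */
        MPI_Reduce(val, &partial, 1, MPI_DOUBLE, MPI_SUM, 0, onhost);

        /* Stage 2: only the host leaders talk over the network. */
        MPI_Comm_split(comm, hrank == 0 ? 0 : MPI_UNDEFINED, rank,
                       &leaders);
        if (leaders != MPI_COMM_NULL) {
            MPI_Allreduce(&partial, &result, 1, MPI_DOUBLE, MPI_SUM,
                          leaders);
            MPI_Comm_free(&leaders);
        }

        /* Stage 3: fan the result back out within each host. */
        MPI_Bcast(&result, 1, MPI_DOUBLE, 0, onhost);
        *val = result;
        MPI_Comm_free(&onhost);
    }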
SLIDE 12
Collective Communication Pattern - per process
[Diagram: four hosts with four processes each; in the flat, per-process pattern every process sends its own collective traffic across the network (stages I-IV).]
SLIDE 13
Collective Communication Pattern - Total Interhost traffic
[Diagram: with shared memory (sm), each host first combines its four processes’ contributions locally, so only one message per host crosses the network (stages I-II); a factor-of-four reduction in inter-host traffic for this 4x4 example.]
SLIDE 14
Performance Data
SLIDE 15
Ping-Pong 0 byte MPI latency : Inter-node
Latency (uSec) by MPI / protocol (benchmark sketched below):
- Cray MPT (3.0.7): 7.44
- Open MPI / OB1, no ack: 7.24
- Open MPI / OB1: 8.65
- Open MPI / CM: 6.18
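A minimal sketch of the kind of ping-pong benchmark behind these numbers (the iteration count and output format are illustrative assumptions):

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv)
    {
        int rank, i, iters = 10000;
        double t0;

        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        MPI_Barrier(MPI_COMM_WORLD);
        t0 = MPI_Wtime();
        for (i = 0; i < iters; i++) {
            if (rank == 0) {          /* rank 0 sends first */
                MPI_Send(NULL, 0, MPI_BYTE, 1, 0, MPI_COMM_WORLD);
                MPI_Recv(NULL, 0, MPI_BYTE, 1, 0, MPI_COMM_WORLD,
                         MPI_STATUS_IGNORE);
            } else if (rank == 1) {   /* rank 1 echoes */
                MPI_Recv(NULL, 0, MPI_BYTE, 0, 0, MPI_COMM_WORLD,
                         MPI_STATUS_IGNORE);
                MPI_Send(NULL, 0, MPI_BYTE, 0, 0, MPI_COMM_WORLD);
            }
        }
        if (rank == 0)  /* one-way latency = half the round trip */
            printf("0-byte latency: %.2f uSec\n",
                   (MPI_Wtime() - t0) * 1e6 / (2.0 * iters));

        MPI_Finalize();
        return 0;
    }

Switching between the OB1 and CM paths is then just a matter of the MCA parameters shown earlier.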
SLIDE 16
Ping-Pong 0 byte MPI latency
- CM
  − 0 bytes: 6.18 uSec
  − 16 bytes: 6.88 uSec
  − 17 bytes: 9.69 uSec (measured on a different system)
- OB1
  − 0 bytes, with ACK: 8.65 uSec
  − 0 bytes, without ACK: 7.24 uSec
  − 1 byte, without ACK: 10.14 uSec (measured on a different system)
SLIDE 17
Ping-Pong 0 byte MPI latency : Intra-node
Latency (uSec) by MPI / protocol:
- Cray MPT (3.0.7): 0.51
- Open MPI / OB1: 0.64
(Open MPI / OB1, no ack, and Open MPI / CM appear without values)
SLIDE 18
Ping-Pong Latency Data - Off Host
SLIDE 19
Ping-Pong Data - Off Host
SLIDE 20
Ping-Pong Data - On Host
SLIDE 21
Ping-Pong Bandwidth Data - On Host
SLIDE 22
Barrier - 16 cores per host
SLIDE 23
Barrier - 16 cores per host - Hierarchical
SLIDE 24
Barrier - XT
SLIDE 25
Shared-Memory Reduction - 16 processes
SLIDE 26
Reduction - 16 core nodes - 8 Bytes
SLIDE 27
Reduction - 16 core nodes - 8 Bytes - Hierarchical
SLIDE 28
Shared-Memory Reduction - 16 Processes
SLIDE 29
Reduction - 16 core nodes - 512 KBytes
SLIDE 30
Reduction - XT
SLIDE 31
Shared Memory Allreduce - 16 processes
SLIDE 32
Allreduce - 16 cores per node - 8 Bytes
SLIDE 33
Allreduce - 16 cores per node - 8 Bytes - Hierarchical
SLIDE 34
Shared Memory Allreduce - 16 processes
SLIDE 35