SLIDE 1

All the things you need to know about Intel MPI Library

Jerome Vienne viennej@tacc.utexas.edu

Texas Advanced Computing Center, The University of Texas at Austin, Austin, TX. November 12th, 2016

SLIDE 2

A Heterogeneous Environment

MPI performance depends on many factors

▶ CPUs (number of cores, cache sizes, frequency)
▶ Memory (amount, frequency)
▶ Network speed (10, 20, 40 ... Gbit/s)
▶ Size of the job
▶ Type of code: hybrid (e.g., OpenMP+MPI) or pure MPI

MPI libraries have to make choices

▶ Why? Because the number of combinations is too large.
▶ Are these choices optimal for my application? Not necessarily.
▶ Can we change them? Yes, and this is why we are here.

SLIDE 4

Aim of this talk

▶ "How to tune MPI" is not something you can easily find in books.
▶ Show that MPI libraries are not black boxes.
▶ Describe concepts that are common across MPI libraries.
▶ Understand the differences between MPI libraries.
▶ Provide some useful commands for Intel MPI.
▶ Result: help you reduce the runtime and memory footprint of your MPI application.

SLIDE 5

Before We Start

Warnings!

▶ Talk based on Intel MPI (with a few references to MVAPICH2 and Open MPI).
▶ All experiments were done on the Stampede supercomputer at TACC.
▶ Tuning options are specific to an MPI library! But the concepts are common.
▶ Options can have counter-effects!
▶ MPI libraries have a lot of tuning options; we will only cover the most important ones.
▶ Tuning can be time consuming, but in the long term it might be worth it.

SLIDE 6

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 10

The Choice of Benchmarks

Different MPI library = tuning based on different benchmarks

▶ Intel MPI: Intel MPI Benchmarks (IMB)
▶ MVAPICH2: OSU Micro-Benchmarks (OMB)

IMB or OMB, which one is the best to use?

▶ Both are communication intensive, without computation
▶ It depends on your application
▶ The best benchmark is your application!

But... let's take a look at them in detail!

SLIDE 12

Intel MPI Benchmarks (IMB)

Details

▶ Originally known as the Pallas MPI Benchmarks (PMB)
▶ Supports point-to-point and collective operations
▶ One program with a lot of options for the classical MPI functions (IMB-MPI1)
▶ The root changes after each iteration for collectives

SLIDE 13

Intel MPI Benchmarks (IMB)

Intel MPI vs MVAPICH2 using IMB Bcast with 256 cores

[Plot: Time (us) vs. Message Size (Bytes, 4 B to 4 MB), MVAPICH2 2.2 vs. Intel MPI 2017]

SLIDE 14

OSU Micro-Benchmarks (OMB)

Details

▶ Very simple to use
▶ Supports point-to-point and collective operations
▶ Multiple programs with simple options
▶ Keeps the same root during all iterations + uses a barrier

SLIDE 15

OSU Micro-Benchmarks (OMB)

Intel MPI vs MVAPICH2 using OMB Bcast with 256 cores

[Plot: Time (us) vs. Message Size (Bytes, 4 B to 1 MB), MVAPICH2 2.2 vs. Intel MPI 2017]

SLIDE 16

OSU Micro-Benchmarks (OMB)

Tuned Intel MPI vs MVAPICH2 using OMB Bcast with 256 cores

[Plot: Time (us) vs. Message Size (Bytes, 4 B to 1 MB), MVAPICH2 2.2 vs. tuned Intel MPI 2017]

SLIDE 17

Benchmarks: What you need to know

To summarize

▶ Don't trust them!
▶ They have different behaviors, so KNOW your benchmark!
▶ They do not necessarily give you the best results by default.
▶ Be sure that you tune things correctly if you want to compare two MPI libraries.
▶ Collective tuning for a particular benchmark/application can be painful, as we will see later :)

SLIDE 18

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 19

To know what you need to tune first

Why is MPI profiling important?

▶ To identify which MPI functions are used, you have two choices:
    ▶ Look at the code
    ▶ Profile your application
▶ Profiling provides all the information regarding MPI communications (sizes, time spent, functions called, etc.)
▶ It can be integrated in the MPI library (e.g., Intel MPI)
▶ A lot of tools can help you profile your application (TAU, Scalasca, IPM, mpiP ...)

SLIDE 20

How to profile?

With Intel MPI at runtime

mpiexec -genv I_MPI_STATS=ipm -genv I_MPI_STATS_FILE=myprofile.txt ...
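As a minimal sketch (the executable name, task count, and hostfile below are placeholders), the same statistics can also be requested by exporting the variables before the run; the summary then lands in myprofile.txt:

    export I_MPI_STATS=ipm
    export I_MPI_STATS_FILE=myprofile.txt
    mpiexec -np 64 -hostfile host ./my_app
    cat myprofile.txt    # per-function MPI time, call counts, message sizes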

Tools

▶ MPI Performance Snapshot (MPS)
▶ Intel Trace Analyzer and Collector (ITAC)

SLIDE 21

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 22

Impact of the hostfile

Example of command:

mpirun -np 4 -hostfile host ./a.out

▶ The hostfile provides the list of nodes that will be used (see the example below)
▶ Depending on the MPI library, the same hostfile can lead to different results!
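For instance, a plain hostfile for a 2-node run simply lists the node names, one per line (node1 and node2 here stand in for real host names); how the ranks are spread over those nodes is then up to the library:

    $ cat host
    node1
    node2
    $ mpirun -np 4 -hostfile host ./a.out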

SLIDE 29

A Quick Performance Example

NAS SP-MZ on Stampede

2 nodes, 2 MPI tasks/node with 8 OpenMP threads

mpirun -np 4 -hostfile host ./sp-mz.C.4

Nodes: node1, node2

MVAPICH2

▶ Default: 176 sec.
▶ Correct hostfile: 176 sec.
▶ + Process placement: 19 sec.

Intel MPI

▶ Default: 51 sec.
▶ Correct hostfile/command: 19 sec.

SLIDE 30

How are the MPI tasks propagated?

mpirun -np 4 -hostfile host ./a.out on Stampede

Nodes: node1, node2

MVAPICH2: Rank 0 on node1, Rank 1 on node2, Rank 2 on node1, Rank 3 on node2

Open MPI: Rank 0 on node1, Rank 1 on node1, Rank 2 on node2, Rank 3 on node2

Intel MPI: Rank 0 on node1, Rank 1 on node1, Rank 2 on node1, Rank 3 on node1

SLIDE 31

How to set your hostfile correctly?

MVAPICH2 (hostfile)

node1:2
node2:2

Open MPI (hostfile)

node1 slots=2
node2 slots=2

Intel MPI

I_MPI_PERHOST=2
or
mpirun -perhost 2
or
mpirun -ppn 2

Result

Rank 0 on node1, Rank 1 on node1, Rank 2 on node2, Rank 3 on node2
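Putting it together for Intel MPI, either of the following forms (a sketch, reusing the same hypothetical hostfile) gives the two-ranks-per-node placement shown above:

    $ I_MPI_PERHOST=2 mpirun -np 4 -hostfile host ./a.out
    $ mpirun -np 4 -ppn 2 -hostfile host ./a.out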

SLIDE 32

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 33

Generic Way to Map Processes

Binding can be user-defined (at socket or core level), preset by the library (compact or scatter), or disabled (no binding). Default preset binding per MPI library:

MVAPICH2:  Compact-Core
Intel MPI: Scatter-Shared Socket or Core
Open MPI:  Before 1.7.4: No Binding; After 1.7.4: Scatter-Core|Socket

SLIDE 34

Quick Examples with Intel MPI

osu_latency results between cores (latency in us):

Pair    0 byte    8 KB     Description
2-4     0.2       1.77     Same socket, shared L3, best performance
0-2     0.2       1.80     Same socket, shared L3, but core 0 handles interrupts
2-10    0.41      2.52     Different sockets, does not share L3
0-8     0.42      2.53     Different sockets, does not share L3, core 0 handles interrupts

NAS SP-MZ (do you remember it?)

2 nodes, 2 MPI tasks/node with 8 OpenMP threads using MVAPICH2

▶ Default mapping: 176 seconds
▶ Optimal mapping: 19 seconds

SLIDE 35

Default MPI library mappings of four MPI tasks (each with two OpenMP threads) to the two 4-core processors of a dual-socket node.

[Figure: placement of ranks 0-3 across sockets S1 and S2 (with HCA) for (a) MVAPICH2 2.1, (b) Open MPI 1.8.8, (c) Intel MPI 5.0.2]

SLIDE 36

Impact of Mapping (MVAPICH2 / osu_latency)

[Plot: Latency (us) vs. Message Size (Bytes, 4 B to 16 KB), comparing Core 0 and Core 8]

SLIDE 37

Default MPI library mappings of eight pure MPI tasks to the two 4-core processors of a dual-socket node.

[Figure: placement of ranks 0-7 across sockets S1 and S2 (with HCA) for (a) MVAPICH2 2.1, (b) Open MPI 1.8.8, (c) Intel MPI 5.0.2]

SLIDE 38

Setting the Mapping

Intel MPI

▶ I_MPI_PIN_PROCESSOR_LIST: defines a processor subset and the mapping rules for pinning MPI processes to separate processors of this subset
▶ I_MPI_PIN_DOMAIN (for hybrid code); see the sketch below
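A minimal sketch for a hybrid run like the NAS SP-MZ case above (2 tasks per node, 8 OpenMP threads each). I_MPI_PIN_DOMAIN=omp, which sizes each pinning domain from OMP_NUM_THREADS, is one common setting; check the Intel MPI reference for the values supported by your version:

    export OMP_NUM_THREADS=8
    export I_MPI_PIN_DOMAIN=omp     # one domain per MPI task, sized by OMP_NUM_THREADS
    mpirun -np 4 -ppn 2 -hostfile host ./sp-mz.C.4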

MVAPICH2

▶ MV2_CPU_BINDING_POLICY=bunch|scatter
▶ MV2_CPU_BINDING_LEVEL=core|socket|numanode
▶ Manual: MV2_CPU_MAPPING=0:8:9-15:1-7

SLIDE 39

Reporting the Mapping

Most MPI libraries provide a mechanism to report the mapping

▶ Intel MPI: -print-rank-map
▶ MVAPICH2: MV2_SHOW_CPU_BINDING=1
▶ Open MPI: --report-bindings

Example (MVAPICH2)

c421-502$ mpirun_rsh -np 2 -hostfile hosts MV2_CPU_MAPPING=2-4:10-12 MV2_SHOW_CPU_BINDING=1 ./osu_latency
-------------CPU AFFINITY-------------
RANK:0 CPU_SET: 2 3 4
RANK:1 CPU_SET: 10 11 12
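The Intel MPI equivalent is just the extra flag listed above (a sketch, reusing the hypothetical hostfile); the rank-to-node map is printed at startup. Raising I_MPI_DEBUG (e.g. to 4) should also report pinning details, depending on the library version:

    $ mpirun -print-rank-map -np 4 -ppn 2 -hostfile host ./a.out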

SLIDE 40

Basic Level

Benchmarking: needed to evaluate the performance and to tune an MPI library, but it is important to know the behavior of the benchmark to do a correct evaluation.

Profiling: useful to know where most of the MPI time is spent and what you need to tune.

Hostfile: write it correctly (or use the right command) or your performance will be bad.

Mapping: important, especially for hybrid code.

These 4 'basic' things are easy to do and can really improve the performance of your code!

SLIDE 45

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 47

Eager/Rendezvous protocol I

▶ There are multiple different protocols for sending messages.
▶ We will only focus here on the eager / rendezvous protocols.
▶ The switch point between these two protocols can be called the threshold, eager threshold or eager limit.
▶ It is an implementation technique; it is not part of the MPI standard.

Eager: the sender process eagerly sends the entire message to the receiver. Typically used for 'short' messages.

Rendezvous: based on 'Request To Send' / 'Clear To Send' (RTS/CTS) techniques. Typically used for 'long' messages.

SLIDE 48

Eager vs Rendezvous

Osu_latency and MVAPICH2

[Plot: Time (us) vs. Message Size (Bytes, 4 B to 4 MB), Eager vs. Rendezvous protocol]

SLIDE 49

Eager/Rendezvous: Pro/Con

Eager

Pro: reduces synchronization delays; best for latency.
Con: significant buffering may be required to provide space for messages; can cause memory exhaustion / program termination when the receive process buffer is exceeded.

Rendezvous

Pro: scalable; robust, by preventing memory exhaustion.
Con: delay due to handshaking between sender and receiver.

SLIDE 50

Setting the eager threshold

Intel MPI

I_MPI_EAGER_THRESHOLD=<nbytes>

MVAPICH2

MV2_IBA_EAGER_THRESHOLD=<nbytes>

Open MPI

--mca btl_openib_eager_limit <nbytes>
--mca btl_openib_rndv_eager_limit <nbytes>

The default threshold can be platform specific (MVAPICH2, Open MPI) or identical on all platforms (Intel MPI). See the sketch below.
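For example, a sketch that raises the Intel MPI switch point to 256 KB so that messages up to that size stay on the eager path (the task counts and executable are placeholders; remember that a larger threshold means more memory spent on internal buffers):

    export I_MPI_EAGER_THRESHOLD=262144    # in bytes
    mpirun -np 64 -ppn 16 -hostfile host ./my_app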

SLIDE 51

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 52

Intra-node optimization

Different mechanisms exist

With the number of cores per node increasing in modern clusters, an efficient implementation of intra-node communication is critical for application performance. We will introduce two different mechanisms here:

▶ Shared memory
▶ Kernel assisted

SLIDE 53

Shared Memory

▶ Used by all MPI implementations
▶ Double-copy implementation: a shared buffer space is used by local processes to exchange messages
▶ Best approach for small messages
▶ Not ideal for large messages (ties down the CPU, cache pollution)

SLIDE 54

Kernel Assisted I

▶ Single-copy mechanism
▶ Preferred approach for medium or large messages
▶ You need to use either:
    ▶ a kernel module: LiMIC or KNEM
    ▶ a kernel feature: CMA

SLIDE 55

Kernel Assisted II

CMA

▶ Cross Memory Attach
▶ Introduced with Linux kernel 3.2 and back-ported to some Linux distributions
▶ Available on Stampede; supported by Intel MPI (since 5.0u2), MVAPICH2, Open MPI, etc.
▶ CMA is enabled automatically for large messages since Intel MPI 5.1u2

SLIDE 56

To summarize

Which one is the best?

▶ For short messages (eager protocol): shared memory is better
▶ For large messages (rendezvous protocol): kernel assisted is better
▶ Intel MPI control: I_MPI_SHM_LMT=shm|direct|off (see the sketch below)
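A quick way to see the effect (a sketch; the IMB-MPI1 path is an assumption) is to re-run an intra-node ping-pong with each setting and compare the curves, as on the next slide:

    mpirun -genv I_MPI_SHM_LMT shm    -np 2 ./IMB-MPI1 PingPong
    mpirun -genv I_MPI_SHM_LMT direct -np 2 ./IMB-MPI1 PingPong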

SLIDE 57

IMB PingPong on a compute node, intra-socket

[Plot: Bandwidth (MB/s) vs. Message Size (Bytes, 4 B to 4 MB), comparing I_MPI_SHM_LMT=off, shm and direct]

SLIDE 58

NAS results on a large-memory node with 32 cores using Intel MPI

Benchmark   Class   Shared (s)   CMA (s)   Speedup
CG          C       10.29        9.66      +6.12%
EP          C       3.89         3.88      +0%
FT          C       16.04        12.07     +24.75%
IS          C       1.37         1.04      +24.08%
CG          D       381.95       382.03    -0.02%
EP          D       62.07        62.08     +0.8%
FT          D       365.84       289.32    +20.91%
IS          D       26.1         20.92     +19.8%

SLIDE 59

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 60

Collective tuning

Collective communications

▶ Used when a communication involves more than 2 MPI tasks
▶ Behind each collective there are many algorithms (binomial tree, recursive doubling, ring exchange, etc.)
▶ The choice of algorithm depends on many parameters (number of cores, network speed, architecture, message size, etc.)
▶ The default tuning is not necessarily the best one for you.
▶ MPI libraries provide mechanisms to select which algorithms should be used (see the sketch below).
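With Intel MPI this goes through the I_MPI_ADJUST family of variables. A sketch matching the gather experiments on the next two slides (the algorithm numbering is version specific, so check the Intel MPI reference for what each number means; osu_gather and the hostfile are the same assumptions as before):

    export I_MPI_ADJUST_GATHER=3      # force algorithm 3 for MPI_Gather
    mpirun -np 256 -ppn 16 -hostfile host ./osu_gather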

SLIDE 61
osu_gather with Intel MPI, 256 cores (I_MPI_ADJUST_GATHER)

[Plot: Latency (us) vs. Message Size (Bytes, 4 B to 1 MB), default (Ref) vs. Algorithms 1, 2 and 3]

SLIDE 62
osu_gather with Intel MPI, 1024 cores (I_MPI_ADJUST_GATHER)

[Plot: Latency (us) vs. Message Size (Bytes, 4 B to 1 MB), default (Ref) vs. Algorithms 1, 2 and 3]

SLIDE 63

Intermediate Level

Inter-node: you can play with the eager threshold to improve the performance of your communication, but it will cost you more memory!

Intra-node: different mechanisms exist to improve the performance of large messages.

Collectives: there are different algorithms for each collective; if you see that your code is spending a lot of time inside one of them, try to see if changing the algorithm helps.

Inter-node and collective tuning can be time consuming, but intra-node optimization can be simple and gives very good results!

SLIDE 67

Plan

  • Basic Tuning
      The Choice of the Benchmark
      Profiling
      Hostfile
      Process Placement
      To conclude

  • Intermediate Tuning
      Inter-node Point-to-Point Optimization
      Intra-node Point-to-Point Optimization
      Collective Tuning
      To conclude

  • Conclusion

SLIDE 68

Conclusion

Every good thing has an end...

We saw a lot of things today; don't panic if you don't remember everything. Keep in mind the following:

▶ Read the documentation :)
▶ MPI libraries behave differently
▶ Mechanisms exist to easily improve the performance of your code (process mapping, CMA, ...)
▶ Mechanisms also exist to reduce the memory footprint of your MPI library
▶ Don't be afraid to ask for help