Non-Intrusively Avoiding Scaling Problems in and out of MPI Collectives
Hongbo Li, Zizhong Chen, Rajiv Gupta, and Min Xie
May 21st, 2018
Outline
Scaling Problem
Avoidance Framework
Evaluation
Conclusion
Scaling Problem
A scaling problem is a type of bug that occurs when a program runs at a large scale in terms of
the number of processes (P), the input size, or both
Scaling problems frequently arise with the use of MPI collectives, as collective communication involves a group of processes and a message size (input size)
An Example of MPI Collective
MPI_Gather using two processes (P = 2), with each process transferring two elements (n = 2) to the root process.
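A minimal runnable sketch of this example in C (assuming it is launched with exactly 2 processes; the buffer contents are illustrative):

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, P;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &P);   /* run with P = 2 */

    const int n = 2;                     /* elements per process */
    int sendbuf[2] = { 2 * rank, 2 * rank + 1 };
    int recvbuf[4];                      /* root's buffer: P * n elements */

    /* Each process sends n elements; the root (process 0) receives P * n. */
    MPI_Gather(sendbuf, n, MPI_INT, recvbuf, n, MPI_INT, 0, MPI_COMM_WORLD);

    if (rank == 0)
        for (int i = 0; i < P * n; i++)
            printf("recvbuf[%d] = %d\n", i, recvbuf[i]);

    MPI_Finalize();
    return 0;
}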
Scaling Problem
The root cause of a scaling problem with the use of MPI collectives can be
inside MPI collectives, or
outside MPI collectives
Many scaling problems are challenging to deal with:
They escape testing during the development phase
Users wait days or even months for an official fix
Difficulty exists in bug reproduction, root-cause diagnosis, and fixing
Inside MPI
Scaling problems reported online.
Root causes include environment settings, connection failures, integer overflow, OS/platform issues, and unknown causes.
Outside MPI
In user code, the displacement array displs (C int, commonly 32 bits) of irregular collectives can easily be corrupted by integer overflow.
In MPI_Gatherv, when displs is not corrupted, the root process calculates the address of the message incoming from each process i (i = 0, 1, 2, ..., P-1) as:
Calculate address: recvbuf + displs[i] * s
[Figure: each process' sendbuf lands at its own offset within the root's recvbuf]
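A self-contained sketch of that calculation in C (simplified relative to what an MPI library does internally; the displacements and element type are illustrative):

#include <stdio.h>
#include <stdlib.h>

int main(void) {
    const int P = 4, n = 2;          /* 4 senders, 2 ints each */
    const size_t s = sizeof(int);    /* element extent: 4 bytes */
    int displs[4] = { 0, 2, 4, 6 };  /* displacements, in elements */
    int *recvbuf = malloc((size_t)P * n * s);

    for (int i = 0; i < P; i++) {
        /* destination address of process i's incoming message */
        char *dst = (char *)recvbuf + (size_t)displs[i] * s;
        printf("process %d -> recvbuf + %td bytes\n", i, dst - (char *)recvbuf);
    }
    free(recvbuf);
    return 0;
}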
Outside MPI
When displs is corrupted, the entry for some process i has overflowed to a negative value:
displs[i] < 0
Calculate address: recvbuf + displs[i] * s
[Figure: the computed address now falls before the start of the root's recvbuf]
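A sketch of how the corruption typically arises: displacements are commonly built as a prefix sum of the per-process counts in 32-bit int, which wraps once the sum passes INT_MAX (the counts below are chosen just to force the wrap; strictly, signed overflow is undefined behavior in C, but on common platforms it wraps to a negative value):

#include <stdio.h>
#include <limits.h>

int main(void) {
    const int P = 4;
    int recvcounts[4] = { INT_MAX / 2, INT_MAX / 2, INT_MAX / 2, INT_MAX / 2 };
    int displs[4];

    displs[0] = 0;
    for (int i = 1; i < P; i++)
        displs[i] = displs[i - 1] + recvcounts[i - 1];  /* wraps at i = 3 */

    for (int i = 0; i < P; i++)
        printf("displs[%d] = %d%s\n", i, displs[i],
               displs[i] < 0 ? "  <-- negative: address calculation goes wrong" : "");
    return 0;
}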
Outside MPI
For MPI_Gatherv, the number of elements (N) received by the root process satisfies:
N < displs[P-1] + INT_MAX  →  N < 2 × INT_MAX
For MPI_Gather (a regular collective):
N ≤ P × INT_MAX
Huge gap: 2 × INT_MAX vs. P × INT_MAX
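To put numbers on the gap (with 32-bit int, INT_MAX = 2^31 - 1 ≈ 2.1 × 10^9): MPI_Gatherv can deliver fewer than 2 × INT_MAX ≈ 4.3 × 10^9 elements to the root no matter how many processes participate, whereas MPI_Gather can deliver up to P × INT_MAX, e.g., about 1.6 × 10^12 elements at P = 768 (the scale used in the evaluation below), a factor of P/2 = 384 more.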
Outside MPI
The irregular collectives' limitation is due to the displacement array displs having data type C int.
Replace int with long long int? Discussed, yet never done, because of backward compatibility.
An immediate remedy is needed!
Outline
Scaling Problem
Avoidance Framework
Evaluation
Conclusion
Avoidance Framework
Pair each scaling problem's trigger with a workaround strategy.
Trigger (1) [Outside MPI]
The irregular collectives' limitation is triggered when
displs[i] < 0
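A minimal sketch of detecting this trigger before invoking an irregular collective (the helper name is illustrative):

#include <stdbool.h>

/* Trigger (1): true if any displacement has overflowed to a negative
   value, so recvbuf + displs[i] * s would fall outside recvbuf. */
static bool displs_corrupted(const int *displs, int P) {
    for (int i = 0; i < P; i++)
        if (displs[i] < 0)
            return true;
    return false;
}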
Trigger (2) [Inside MPI]
User-side testing: users manifest potential scaling problems of the MPI routines they are interested in.
It tells users whether there is a scaling problem
It also tells at what scale the problem occurs
Do users really need a fancy supercomputer to perform testing? Not necessary!
Most scaling problems involving MPI collectives relate to both the parallelism scale and the message size.
With ONLY 2 nodes, each having 24 cores and 64 GB of memory, we easily found 4 scaling problems inside released MPI libraries; we have not yet found scaling problems related only to the number of processes.
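One possible shape of such user-side testing, sketched in C below: keep P fixed at what the small cluster offers and grow the per-process message size until the collective fails or memory runs out (the doubling loop, the error handling, and the use of MPI_Gather as the routine under test are illustrative choices, not the paper's exact procedure):

#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, P;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &P);
    /* Return errors instead of aborting, so the failing scale is observable. */
    MPI_Comm_set_errhandler(MPI_COMM_WORLD, MPI_ERRORS_RETURN);

    for (long long n = 1 << 20; n <= (1LL << 30); n *= 2) {
        char *sendbuf = malloc((size_t)n);
        char *recvbuf = (rank == 0) ? malloc((size_t)n * P) : NULL;

        /* Stop on all ranks together once any rank hits the memory limit. */
        int ok = sendbuf != NULL && (rank != 0 || recvbuf != NULL), all_ok;
        MPI_Allreduce(&ok, &all_ok, 1, MPI_INT, MPI_LAND, MPI_COMM_WORLD);
        if (!all_ok) { free(sendbuf); free(recvbuf); break; }

        int rc = MPI_Gather(sendbuf, (int)n, MPI_CHAR,
                            recvbuf, (int)n, MPI_CHAR, 0, MPI_COMM_WORLD);
        if (rank == 0)
            printf("n = %lld B/process: %s\n", n,
                   rc == MPI_SUCCESS ? "ok" : "SCALING PROBLEM");

        free(sendbuf);
        free(recvbuf);
    }
    MPI_Finalize();
    return 0;
}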
Workarounds
(W1) Partition communication
(W1-A) Partition processes
(W1-B) Partition the message
(W2) Build big data type
Workaround (1)
Partitioning one MPI_Gatherv communication using two strategies, supposing the bug is triggered when nP > 4. Four processes (P = 4) are involved, each sending two elements (n = 2), and process 0 is the root process.
[Figure legend: empty recvbuf, filled recvbuf, temporary buffer]
Each partitioned communication satisfies nP ≤ 4.
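A sketch of W1-B for MPI_Gatherv in C, splitting one gather into k smaller ones so that each round stays below the triggering scale (the even split, and the assumption that every process contributes the same count n divisible by k, are illustrative simplifications):

#include <mpi.h>
#include <stdlib.h>

/* W1-B sketch: every process contributes n ints, sent as k chunks of n / k. */
void gatherv_partition_message(const int *sendbuf, int n, int *recvbuf,
                               const int *displs, int P, int k, MPI_Comm comm) {
    int chunk = n / k;
    int *counts = malloc(P * sizeof(int));
    int *d = malloc(P * sizeof(int));

    for (int r = 0; r < k; r++) {
        for (int i = 0; i < P; i++) {
            counts[i] = chunk;
            d[i] = displs[i] + r * chunk;  /* round r's slot inside each block */
        }
        MPI_Gatherv(sendbuf + r * chunk, chunk, MPI_INT,
                    recvbuf, counts, d, MPI_INT, 0, comm);
    }
    free(counts);
    free(d);
}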
Workaround (2)
Build big data type
Message size = s × n. A bigger data type (bigger s) means a smaller count n.
Only effective when the scaling problem is unrelated to s:
Effective case: the bug is triggered when nP > c. Ineffective case: the bug is triggered when snP > c.
Workaround (2)
Build a big data type for MPI_Gather to avoid a bug triggered when nP > c.
[Figure: before, n = 4, s = 1 B, P = 2, so nP = 8 and the bug is triggered; after building a 4-byte data type, n = 1, s = 4 B, P = 2, so nP = 2 < c and the bug is avoided. The message size s × n = 4 B is unchanged.]
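A sketch of W2 for MPI_Gather in C: pack k base elements into one contiguous derived type, so the count passed to MPI shrinks by k while the message size s × n is unchanged (k is an illustrative factor assumed to divide n):

#include <mpi.h>

void gather_big_datatype(const int *sendbuf, int n, int *recvbuf, int k,
                         MPI_Comm comm) {
    MPI_Datatype big;
    MPI_Type_contiguous(k, MPI_INT, &big);  /* one "big" element = k ints: s grows k-fold */
    MPI_Type_commit(&big);

    /* Same bytes on the wire, but the count drops from n to n / k. */
    MPI_Gather(sendbuf, n / k, big, recvbuf, n / k, big, 0, comm);

    MPI_Type_free(&big);
}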
Outline
Scaling Problem
Avoidance Framework
Evaluation
Conclusion
Evaluation – Setting
Tianhe-2:
Each node has 24 cores and 64 GB DRAM
One process per core
Benchmark: MPI_Gatherv
Evaluated: effectiveness of avoiding the scaling problem, and performance
Evaluation – Effectiveness
Workarounds for MPI_Gatherv that avoid the irregular-collective limitation problem.
- n_max: the maximal workable n (unit: 1 M, i.e., 2^20)
- M_max: the maximal memory consumption on one node, calculated according to the MPI standard
23X increase!
Our workarounds are effective until the memory limit is hit.
Evaluation – Performance
MPI_Gatherv [P = 768, s = 1 B; the bug occurs when n > 2.625 M].
Evaluation – Summary
Effectiveness: W1-B is the best.
Performance: W2 is the best. The time cost of a collective based on either W1-A or W1-B increases linearly as n increases.
Outline
Scaling Problem
Avoidance Framework
Evaluation
Conclusion
Conclusion
Scaling problems are hard to fix, and thus users often need to wait days or even months for an official fix.