Network and Load-Aware Resource Manager for MPI Programs
SLIDE 1

Network and Load-Aware Resource Manager for MPI Programs

Ashish Kumar, Naman Jain, Preeti Malakar

Indian Institute of Technology, Kanpur

SRMPDS, International Conference on Parallel Processing 2020

SLIDE 2

Introduction

Distributed-memory parallel programs and MPI:
• More than one processing element, each using its own local memory.
• Nodes work cooperatively to solve a single big problem.
• Data is exchanged by sending and receiving messages.
• The Message Passing Interface (MPI) is the de facto standard for message passing.
• Runs on a cluster (shared or dedicated) or a supercomputer.

SLIDE 3

Introduction

• MPI jobs need a set of nodes allocated to them before they can run.
• In this work, we address the problem of allocating a good set of nodes to run MPI jobs on a shared, non-dedicated cluster.

SLIDE 4

Non-Dedicated/Shared Clusters and Challenges

• Non-exclusive access to nodes: the cluster is shared among many users, so the same node can be used by different users/processes at the same time for different purposes.
• Resource usage varies across time and across nodes.
• Which nodes should we run our job on? What parameters should be considered?

SLIDE 5

Node Resource Usage Variation

[Figure: variation in node resource usage across time and nodes in a shared cluster]

SLIDE 6

Network Usage Variation

[Figure: variation in network usage between nodes in a shared cluster]

SLIDE 7

Towards Our Approach

• Use knowledge of these variations across nodes, time, and the network to allocate resources better.
• Take into account both static and dynamic attributes of resources, including network availability.

SLIDE 8

Overview

1. Node Allocation Algorithm
2. Resource Monitoring
3. Experiments
4. Conclusions and Future Work

SLIDE 9

Allocation as Sub-Graph Selection

[Figure: example graph over nodes v1..v4 with edge loads 80, 90, 85, 60, 75, 90]

Node   Compute load   #Cores
v1     50.2           6
v2     43.5           8
v3     54.7           10
v4     38.3           4

Model the cluster as a graph G = (V, E):
• Vertex v ∈ V: a compute node with compute load CL_v and available processor count pc_v.
• Edge e ∈ E: the network load NL_(u,v) between compute nodes u and v.
• n: the number of processes to be allocated.

Goal: find a sub-graph such that the overall cost/load of the sub-graph is minimized and the process demand is fulfilled.
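
To make the model concrete, here is a minimal sketch of this graph as plain Python dictionaries (not the authors' code). The node values come from the table above; the assignment of the six edge loads to particular edges is an assumption, since the original figure is not recoverable.

# Graph model G = (V, E) for allocation (illustrative sketch).
CL = {"v1": 50.2, "v2": 43.5, "v3": 54.7, "v4": 38.3}  # compute load CL_v
pc = {"v1": 6, "v2": 8, "v3": 10, "v4": 4}             # available cores pc_v

# Network load NL_(u,v) per link; the edge-to-weight mapping is assumed.
NL = {
    ("v1", "v2"): 80, ("v1", "v3"): 90, ("v1", "v4"): 85,
    ("v2", "v3"): 60, ("v2", "v4"): 75, ("v3", "v4"): 90,
}

def network_load(u, v):
    # Links are undirected, so accept either orientation.
    return NL.get((u, v), NL.get((v, u)))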

SLIDE 10

Some Definitions

Compute load: a measure of the overall load on a node.
• Static attributes: clock speed, core count, total memory.
• Dynamic attributes: CPU load, CPU utilization, available memory.
• Compute load: CL_v = Σ_{a ∈ attributes} w_a · val_{v,a}

Network load: a measure of the load on a point-to-point network link.
• Attributes: latency and bandwidth.
• Network load: NL_(u,v) = w_lt · LT_(u,v) + w_bw · BW_(u,v)

Available processors: a measure of the effective number of free processors.
• pc_v = coreCount_v − (Load_v mod coreCount_v)

Weights can be tuned according to the program's needs/type.
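
A minimal sketch of these three definitions in code, assuming the attribute values have already been collected and normalized; the default latency/bandwidth weights are the ones reported later in the slides, and everything else is illustrative.

def compute_load(attr_values, weights):
    # CL_v = sum over attributes a of w_a * val_{v,a}
    return sum(weights[a] * attr_values[a] for a in weights)

def network_load(lt, bw, w_lt=0.25, w_bw=0.75):
    # NL_(u,v) = w_lt * LT_(u,v) + w_bw * BW_(u,v)
    return w_lt * lt + w_bw * bw

def available_processors(core_count, load):
    # pc_v = coreCount_v - (Load_v mod coreCount_v)
    return core_count - (load % core_count)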

SLIDE 11

Allocation Algorithm

• Find a candidate sub-graph corresponding to each node.
• For each sub-graph G_v = (V_v, E_v), define:
  Compute load: CG_v = Σ_{u ∈ V_v} CL_u
  Network load: NG_v = Σ_{(x,y) ∈ E_v} NL_(x,y)
  Total load: TG_v = α × CG_v^normalized + β × NG_v^normalized
• Allocate the best candidate on the basis of total load.
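
The slides do not spell out the normalization; the sketch below assumes min-max normalization across all candidate sub-graphs before taking the weighted sum, which is one common choice rather than the authors' stated method.

def normalize(values):
    # Min-max normalization across candidates (assumption).
    lo, hi = min(values), max(values)
    return [(x - lo) / (hi - lo) if hi > lo else 0.0 for x in values]

def total_loads(CG, NG, alpha=0.4, beta=0.6):
    # TG_v = alpha * CG_v_normalized + beta * NG_v_normalized
    CG_n, NG_n = normalize(CG), normalize(NG)
    return [alpha * c + beta * n for c, n in zip(CG_n, NG_n)]

# The candidate sub-graph with the smallest TG_v is allocated.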

SLIDE 12-13

Allocation Algorithm: Worked Example

[Figure: graph over nodes v1..v4 with edge loads 45, 90, 85, 70, 75, 65]

Setup: required process count = 16; compute weight (α) = 0.4; network weight (β) = 0.6; start node v4 with available core count 4.

Addition loads w.r.t. v4:

Node   Compute load   Network load   Addition load   #Available cores
v1     52             45             47.8            6
v2     47             70             60.8            8
v3     74             65             68.6            5

Steps:
1. Pick v4 (the start node): allocated process count = 4.
2. Pick v1 (lowest addition load, 47.8): allocated process count = 4 + 6 = 10.
3. Pick v2 (next lowest, 60.8): allocated process count = 10 + 8 = 18 ≥ 16, so the demand is met.
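
The addition-load column and the picking order above can be reproduced with a few lines (values copied from the example; exact printed floats may differ by rounding):

alpha, beta = 0.4, 0.6
CL = {"v1": 52, "v2": 47, "v3": 74}        # compute loads
NL = {"v1": 45, "v2": 70, "v3": 65}        # network load w.r.t. v4
pc = {"v4": 4, "v1": 6, "v2": 8, "v3": 5}  # available cores

A = {u: alpha * CL[u] + beta * NL[u] for u in CL}
# A == {'v1': 47.8, 'v2': 60.8, 'v3': 68.6} up to float rounding

allocated, picked = pc["v4"], ["v4"]       # the start node goes first
for u in sorted(A, key=A.get):             # increasing addition load
    if allocated >= 16:
        break
    picked.append(u)
    allocated += pc[u]
print(picked, allocated)                   # ['v4', 'v1', 'v2'] 18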

SLIDE 14

Candidate Selection Algorithm

• Compute the addition cost of every node w.r.t. the start node:
  Addition cost: A_v(u) = α × CL(u) + β × NL(v, u)
• Keep adding nodes to the sub-graph until the allocated process count reaches the required number of processes.

SLIDE 15

Candidate Selection Algorithm

Input : node v; graph G; PC, the list of effective processor counts; n, the requested number of processes
Output: G_v, a sub-graph with v included

V_v ← φ; allocated_processes ← 0; k ← total number of nodes in G;
A_v(v) ← 0; calculate A_v(u) for each node u other than v;
let u_1, u_2, ..., u_k be the vertices sorted in increasing order of addition load A_v(u);
i ← 1;
while allocated_processes < n do
    V_v ← V_v ∪ {u_i};
    allocated_processes ← allocated_processes + pc_{u_i};
    i ← i + 1;
end

Algorithm 4: Candidate Selection Algorithm
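
A runnable sketch of this pseudocode, reusing the dictionary-based graph model sketched earlier; it is an illustration, not the authors' implementation.

def select_candidate(v, nodes, CL, NL, pc, n, alpha=0.4, beta=0.6):
    # Grow a sub-graph around start node v by repeatedly adding the node
    # with the lowest addition cost A_v(u) = alpha*CL(u) + beta*NL(v,u).
    def addition_cost(u):
        if u == v:
            return 0.0                      # A_v(v) = 0: v is picked first
        link = NL.get((v, u), NL.get((u, v)))
        return alpha * CL[u] + beta * link

    chosen, allocated = [], 0
    for u in sorted(nodes, key=addition_cost):
        if allocated >= n:
            break
        chosen.append(u)
        allocated += pc[u]                  # effective processor count pc_u
    return chosen, allocated

# Example with the values from the worked example:
nodes = ["v1", "v2", "v3", "v4"]
CL = {"v1": 52, "v2": 47, "v3": 74}        # CL of the start node is never used
NL = {("v4", "v1"): 45, ("v4", "v2"): 70, ("v4", "v3"): 65}
pc = {"v1": 6, "v2": 8, "v3": 5, "v4": 4}
print(select_candidate("v4", nodes, CL, NL, pc, n=16))
# (['v4', 'v1', 'v2'], 18)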

SLIDE 16

Resource Monitoring

• Developed a distributed monitoring system for flexibility.
• Light-weight daemons periodically update:
  • Live hosts: nodes which are up and running.
  • Node statistics: available memory, CPU load, CPU utilization, etc.
  • Network statistics: available bandwidth and latency.
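
The slides give no implementation details for the daemons; as a rough sketch of what a node-statistics daemon could look like, here is a periodic sampling loop built on the psutil library. The update interval and the publish() sink are placeholders.

# Illustrative node-statistics daemon (requires: pip install psutil).
import time
import psutil

INTERVAL = 5  # seconds between updates (placeholder)

def sample_node_stats():
    load1, _, _ = psutil.getloadavg()       # 1-minute CPU load average
    return {
        "cpu_load": load1,
        "cpu_util": psutil.cpu_percent(interval=1),
        "avail_mem": psutil.virtual_memory().available,
        "cores": psutil.cpu_count(logical=True),
    }

def publish(stats):
    # Placeholder: a real daemon would push this to a central store
    # that the allocator queries.
    print(stats)

if __name__ == "__main__":
    while True:
        publish(sample_node_stats())
        time.sleep(INTERVAL)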

SLIDE 17

Experimental Setup and Benchmarks

Experimental setup:
• 40 12-core Intel Core nodes (4.6 GHz) and 20 8-core Intel Core nodes (2.8 GHz).
• The cluster has a tree-like hierarchical topology with 4 switches.

Benchmarks:
• Mantevo miniMD: a simple, parallel molecular dynamics mini-application.
• Mantevo miniFE: a proxy application for unstructured implicit finite element codes; it sets up a brick-shaped problem domain of hexahedral elements.

Comparison with:
• Random allocation
• Sequential allocation
• Load-aware allocation

SLIDE 18

Weights for miniMD and miniFE

Table: relative weights for compute load

Attribute            Weight
CPU Load             0.3
CPU Utilization      0.2
Node Bandwidth       0.2
Used memory          0.1
Logical core count   0.1
Clock Speed          0.05
Total Memory         0.05

• Weights were determined empirically.
• Relative weights for latency and bandwidth were set to 0.25 and 0.75, respectively.
• Relative weights for compute and network load were set to 0.3 and 0.7, respectively, for miniMD, and 0.4 and 0.6, respectively, for miniFE.

SLIDE 19

Experiments: miniMD

SLIDE 20

Performance Gain: miniMD

Runtime gain of the network- and load-aware allocation over each baseline policy:

Allocation Policy   Average Gain   Median Gain   Maximum Gain
Random              49.9%          50.7%         87.8%
Sequential          43.1%          42.1%         84.5%
Load-Aware          32.4%          29.8%         87.7%

SLIDE 21

CPU Load: miniMD

Average CPU load per logical core:
• Load-aware = 0.31
• Network and load-aware = 0.43
• Sequential = 0.68
• Random = 0.78

SLIDE 22

Experiments: miniFE

SLIDE 23

Performance Gain: miniFE

Runtime gain of the network- and load-aware allocation over each baseline policy:

Allocation Policy   Average Gain   Median Gain   Maximum Gain
Random              47.9%          50.4%         92.1%
Sequential          31.1%          28.0%         80.4%
Load-Aware          34.8%          38.7%         91.0%

SLIDE 24

Experiment: Resource Allocation Analysis

Configuration: 32 processes, 4 processes per node

Table: usage of the allocated resource group at allocation time

Algorithm                Avg. CPU load   Avg. bandwidth   Avg. latency
Random                   1.242           17.07            546.46
Sequential               1.262           10.72            304.25
Load-aware               0.453           18.64            354.51
Network and load-aware   0.633           5.36             82.90

Total execution time:
• Random: 27.61 s
• Sequential: 24.91 s
• Load-aware: 12.31 s
• Network and load-aware: 4.43 s

SLIDE 25

Experiments: Resource Allocation Analysis

[Figure: peer-to-peer bandwidth and CPU load]

SLIDE 26

Shortcomings and Future Work

• The network- and load-aware algorithm reduces runtime by more than 38% over random, sequential, and load-aware allocations, owing to less interference from external factors.
• Determining the relative weights for resource attributes and computation/communication characteristics is challenging for large applications; we plan to enhance profiling tools for this purpose.
• Time-series estimation methods may be used for bandwidth forecasting.
• Extension to large-scale systems spanning multiple clusters.
• Exploring integration of our tool as a plugin for the SLURM job scheduler.