Design of MPI Passive Target Synchronization for a Non-Cache- - PowerPoint PPT Presentation

Design of MPI Passive Target Synchronization for a Non-Cache- Coherent Many-Core Processor 27th PARS Workshop, Hagen, Germany, May 5 2017 Steffen Christgau , Bettina Schnor Operating Systems and Distributed Systems Institute for Computer Science University of Potsdam, Germany

Motivation: Distributed Hash Table (DHT) • hash table as cache for computational results in MPI application PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 1 / 14

Motivation: Distributed Hash Table (DHT) • hash table as cache for computational results in MPI application • large amount of data → distribute across processes → DHT PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 1 / 14

Motivation: Distributed Hash Table (DHT) • hash table as cache for computational results in MPI application • large amount of data → distribute across processes → DHT local local local DHT DHT part DHT part DHT part ... rank n − 1 rank 0 rank 1 PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 1 / 14

Motivation: Distributed Hash Table (DHT) • hash table as cache for computational results in MPI application • large amount of data → distribute across processes → DHT local local local DHT DHT part DHT part DHT part ... rank n − 1 rank 0 rank 1 • accessing distributed data: hash function returns arbitrary process and address difficult to program with two-sided message passing MPI passive target one-sided communication to the rescue synchronization required PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 1 / 14

Motivation: nCC Systems • Future many-cores may not provide (global) cache coherence. PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 2 / 14

Motivation: nCC Systems • Future many-cores may not provide (global) cache coherence. Intel Knights Landing: coherent multi-socket systems not feasible https://www.extremetech.com/wp-content/uploads/2016/04/KnightsLanding.png PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 2 / 14

Motivation: nCC Systems • Future many-cores may not provide (global) cache coherence. Intel Knights Landing: coherent multi-socket systems not feasible HPE "The Machine", EuroServer: coherence islands https://regmedia.co.uk/2016/11/22/the_machine_universal_memory_pool_access.jpg PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 2 / 14

Research Platform • nCC many-core research system: Intel SCC 48 Pentium cores with L1/2 caches no HW cache coherence L2$ Core MC 2 MC 3 MIU MPB L2$ Core R Tile MC 0 MC 1 PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 3 / 14

Research Platform • nCC many-core research system: Intel SCC 48 Pentium cores with L1/2 caches no HW cache coherence L2$ Core MC 2 MC 3 MIU MPB L2$ Core R Tile MC 0 MC 1 • This talk: design of synchronization on nCC platform. PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 3 / 14

Agenda MPI Passive Target One-Sided Communication Design for Passive Target Synchronization on the SCC Data Structures and Algorithms Data Placement Outlook and Future Work PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 4 / 14

MPI One-Sided Communication • process memory exposed via windows process ’ address space local DHT part local DHT part local DHT part DHT rank 0 rank 1 ... rank n − 1 PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows process ’ address space local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows • access to windows with window object (handle) process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows • access to windows with window object (handle) process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 • key concept : only one communication partner issues communication operations PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows • access to windows with window object (handle) process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 • key concept : only one communication partner issues communication operations origin processes issue communication operations PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows • access to windows with window object (handle) process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 • key concept : only one communication partner issues communication operations origin processes issue communication operations target processes are addressed by operations PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows • access to windows with window object (handle) process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 • key concept : only one communication partner issues communication operations origin processes issue communication operations target processes are addressed by operations typical RMA operations: PUT, GET, . . . PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI One-Sided Communication • process memory exposed via windows • access to windows with window object (handle) process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) rank 0 rank 1 ... rank n − 1 • key concept : only one communication partner issues communication operations origin processes issue communication operations target processes are addressed by operations typical RMA operations: PUT, GET, . . . explicit synchronization required PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 5 / 14

MPI Passive Target Synchronization • locks as means for synchronization, used by origins only • no participation of targets in synchronization (passive targets) PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 6 / 14

MPI Passive Target Synchronization • locks as means for synchronization, used by origins only • no participation of targets in synchronization (passive targets) • usage similar to shared memory locks WIN_LOCK(win, rank, ...) 1. acquire lock for target window PUT(win, rank, ...) 2. perform operations WIN_UNLOCK(win, rank) 3. release lock PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 6 / 14

MPI Passive Target Synchronization • locks as means for synchronization, used by origins only • no participation of targets in synchronization (passive targets) • usage similar to shared memory locks WIN_LOCK(win, rank, ...) 1. acquire lock for target window PUT(win, rank, ...) 2. perform operations WIN_UNLOCK(win, rank) 3. release lock MPI de fi nes two lock types: shared concurrent accesses on target window allowed exclusive prevent concurrent accesses on same target window PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 6 / 14

Distributed Hash Table with MPI OSC process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) ... rank n − 1 rank 0 rank 1 PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 7 / 14

Distributed Hash Table with MPI OSC process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) ... rank n − 1 rank 0 rank 1 DHT_read LOCK(window_obj, target, SHARED) GET(window_obj, target, &data) UNLOCK(window_obj, target) PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 7 / 14

Distributed Hash Table with MPI OSC process ’ address space window object window object window object local DHT part local DHT part local DHT part DHT (window) (window) (window) ... rank n − 1 rank 0 rank 1 DHT_read DHT_write LOCK(window_obj, target, SHARED) LOCK(window_obj, target, EXCLUSIVE) GET(window_obj, target, &data) PUT(window_obj, target, data) UNLOCK(window_obj, target) UNLOCK(window_obj, target) PARS 2017 S. Christgau (U Potsdam): MPI Passive Target Synchronization 7 / 14

Design of MPI Passive Target Synchronization for a Non-Cache- - PowerPoint PPT Presentation

Design of MPI Passive Target Synchronization for a Non-Cache- Coherent Many-Core Processor 27th PARS Workshop, Hagen, Germany, May 5 2017 Steffen Christgau , Bettina Schnor Operating Systems and Distributed Systems Institute for Computer

Passive Gas System Design PRESENTED BY BRYAN WELDON P.E. Passive System Overview 01 Passive

MPI is too High-Level MPI is too Low-Level Marc Snir High-Level MPI MPI is an Application

The MPI+MPI programming model and why we need shared-memory MPI libraries Jeff Hammond Extreme

Introduction to MPI T opics to be covered MPI vs shared memory Initializing MPI MPI

Message Passing Programming with MPI What is MPI? Message Passing Programming with MPI 1

MPI-IO: A Retrospective Rajeev Thakur 25 th Anniversary of MPI Workshop Argonne, IL, Sept 25,

Message Passing Programming with MPI Message Passing Programming with MPI 1 What is MPI?

Programming Miscellaneous MPI-IO topics MPI-IO Errors Unlike the rest of MPI, MPI-IO errors

Passive Fire Protection For the Oil & Gas Industry Passive Fire Protection What is purpose

Content Synchronization Content Synchronization March 2nd 2005 Jukka Honkola T-110.456

MPI & MPICH Presenter: Naznin Fauzia CSE 788.08 Winter 2012 Outline MPI-1 standards

Open MPI on the Cray XT presented by Richard L. Graham Galen Shipman Open MPI Is Open

Advanced MPI USER-DEFINED DATATYPES MPI datatypes MPI datatypes are used for communication

Passive Transport (no energy input required) Passive Transport Passive transport is the

Passive Intermodulation (PIM), an interference challenge for the radio Passive Intermodulation

What Are Active and Passive Voice? Can you write definitions for active and passive

Distributed hybrid Grbner bases computation Heinz Kredel University of Mannheim ECDS at CISIS

Measurement and Analysis of Hajime: a Peer-to-peer IoT Botnet Stephen Herwig Katura Harvey

Connectivity Properties of Mainline BitTorrent DHT Nodes Raul Jimenez, Flutra Osmani, Bjrn

The time scales of a stochastic network with failures Mathieu Feuillet joint work with Philippe

Floodless in SEATTLE: A Scalable Ethernet Architecture for Large Enterprises Full paper

Introduction Need for a highly available Distributed Data Store During the holiday shopping

Randomized Composable Core-sets for Distributed Op7miza7on

Consistent Hashing in your python applications Europython 2017 @ultrabug Gentoo Linux developer

Design of MPI Passive Target Synchronization for a Non-Cache- - PowerPoint PPT Presentation

Design of MPI Passive Target Synchronization for a Non-Cache- Coherent Many-Core Processor 27th PARS Workshop, Hagen, Germany, May 5 2017 Steffen Christgau , Bettina Schnor Operating Systems and Distributed Systems Institute for Computer

Passive Gas System Design PRESENTED BY BRYAN WELDON P.E. Passive System Overview 01 Passive

MPI is too High-Level MPI is too Low-Level Marc Snir High-Level MPI MPI is an Application

The MPI+MPI programming model and why we need shared-memory MPI libraries Jeff Hammond Extreme

Introduction to MPI T opics to be covered MPI vs shared memory Initializing MPI MPI

Message Passing Programming with MPI What is MPI? Message Passing Programming with MPI 1

MPI-IO: A Retrospective Rajeev Thakur 25 th Anniversary of MPI Workshop Argonne, IL, Sept 25,

Message Passing Programming with MPI Message Passing Programming with MPI 1 What is MPI?

Programming Miscellaneous MPI-IO topics MPI-IO Errors Unlike the rest of MPI, MPI-IO errors

Passive Fire Protection For the Oil &amp; Gas Industry Passive Fire Protection What is purpose

Content Synchronization Content Synchronization March 2nd 2005 Jukka Honkola T-110.456

MPI &amp; MPICH Presenter: Naznin Fauzia CSE 788.08 Winter 2012 Outline MPI-1 standards

Open MPI on the Cray XT presented by Richard L. Graham Galen Shipman Open MPI Is Open

Advanced MPI USER-DEFINED DATATYPES MPI datatypes MPI datatypes are used for communication

Passive Transport (no energy input required) Passive Transport Passive transport is the

Passive Intermodulation (PIM), an interference challenge for the radio Passive Intermodulation

What Are Active and Passive Voice? Can you write definitions for active and passive

Distributed hybrid Grbner bases computation Heinz Kredel University of Mannheim ECDS at CISIS

Measurement and Analysis of Hajime: a Peer-to-peer IoT Botnet Stephen Herwig Katura Harvey

Connectivity Properties of Mainline BitTorrent DHT Nodes Raul Jimenez, Flutra Osmani, Bjrn

The time scales of a stochastic network with failures Mathieu Feuillet joint work with Philippe

Floodless in SEATTLE: A Scalable Ethernet Architecture for Large Enterprises Full paper

Introduction Need for a highly available Distributed Data Store During the holiday shopping

Randomized Composable Core-sets for Distributed Op7miza7on

Consistent Hashing in your python applications Europython 2017 @ultrabug Gentoo Linux developer

Passive Fire Protection For the Oil & Gas Industry Passive Fire Protection What is purpose

MPI & MPICH Presenter: Naznin Fauzia CSE 788.08 Winter 2012 Outline MPI-1 standards