Mercury: RPC for High-Performance Computing
Jerome Soumagne, The HDF Group
June 23, 2017, CS/NERSC Data Seminar
RPC and High-Performance Computing
Remote Procedure Call (RPC)
Allow local calls to be executed on remote resources
Already widely used to support distributed services
– Google Protocol Buffers, etc.

Typical HPC applications are SPMD
No need for RPC: control flow implicit on all nodes
A series of SPMD programs sequentially produces & analyzes data

Distributed HPC workflow
Nodes/systems dedicated to a specific task
Multiple SPMD applications/jobs execute concurrently and interact

Importance of RPC growing
Compute nodes with a minimal/non-standard environment
Heterogeneous systems (node-specific resources)
More “service-oriented” and more complex applications
Workflows and in-transit processing instead of sequences of SPMD programs
Mercury
Objective
Create a reusable RPC library for use in HPC that can serve as a basis for services such as storage systems, I/O forwarding, analysis frameworks, and other forms of inter-application communication.

Why not reuse existing RPC frameworks?
– They do not support efficient large data transfers or asynchronous calls
– Mostly built on top of TCP/IP protocols
  ◮ Need support for native transports
  ◮ Need to be easy to port to new systems

Similar previous approaches with some differences
– I/O Forwarding Scalability Layer (IOFSL) – ANL
– NEtwork Scalable Service Interface (Nessie) – Sandia
– Lustre RPC – Intel
Overview
Designed to be both easily integrated and extended
– “Client” / “Server” notions abstracted
  ◮ (A server may also act as a client and vice versa; see the sketch below)
– “Origin” / “Target” used instead

[Figure: compute nodes (c1..c3) as origins and service nodes (s1..s3, e.g., storage, visualization) as targets; origin c1 has target s2, while s1 and s3 are targets of s2]

Basis for accessing and enabling resilient services
– Ability to reclaim resources after failure is imperative
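The origin/target roles are per-operation rather than per-process. A minimal sketch, not from the original slides, of one process that listens for RPCs (target) while also forwarding an RPC to another service (origin); the variable names mirror the later snippets and other_service_name / other_addr are hypothetical:

/* Sketch: one process acting as both target and origin.
 * Listening is enabled at init time; forwarding uses the same class. */
hg_class_t   *hg_class   = HG_Init("ofi+tcp://eth0:22222", HG_TRUE /* listen */);
hg_context_t *hg_context = HG_Context_create(hg_class);

/* Target role: register the RPCs this process serves */
hg_id_t rpc_id = MERCURY_REGISTER(hg_class, "open", open_in_t, open_out_t, open_rpc_cb);

/* Origin role: forward the same RPC to another service from this process */
HG_Addr_lookup_wait(hg_context, other_service_name, &other_addr);
HG_Create(hg_context, other_addr, rpc_id, &hg_handle);
HG_Forward(hg_handle, rpc_done_cb, &rpc_done_args, &in_struct);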
Overview
Function arguments / metadata transferred with the RPC request
– Two-sided model with unexpected / expected messaging
– Message size limited to a few kilobytes (low latency)

Bulk data transferred using a separate and dedicated API
– One-sided model that exposes RMA semantics (high bandwidth)

Network Abstraction Layer
– Allows definition of multiple network plugins (selected at init time; see the sketch below)
  ◮ MPI and BMI were the first plugins
  ◮ Shared-memory plugin (mmap + CMA, supported on Cray with CLE6)
  ◮ CCI plugin contributed by ORNL
  ◮ Libfabric plugin contributed by Intel (support for Cray GNI)

[Figure: origin and target RPC layers sit on the Network Abstraction Layer; metadata flows via unexpected/expected messaging, bulk data via RMA transfers]
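The plugin is chosen through the info string passed to HG_Init(), as in the snippets later in these slides. A minimal sketch, not from the original slides; only the "ofi+tcp" string appears in the original examples, the other plugin/protocol names are assumptions and depend on how Mercury and its plugins were built:

/* Sketch: plugin selection via the HG_Init() info string */
hg_class_t *hg_class;

/* Libfabric (OFI) plugin over TCP (used in the later snippets) */
hg_class = HG_Init("ofi+tcp://eth0:22222", HG_TRUE /* listen */);

/* Shared-memory plugin for node-local transfers (assumed name) */
/* hg_class = HG_Init("na+sm", HG_TRUE); */

/* BMI plugin over TCP (assumed name) */
/* hg_class = HG_Init("bmi+tcp://eth0:22222", HG_TRUE); */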
Remote Procedure Call
Mechanism used to send an RPC request (the origin may also ignore the response)

[Figure: origin and target each maintain a table of registered RPC ids (id1 ... idN)]

1. Origin and target register the call and get a request id
2. Origin (pre-posts a receive for the target response and) posts an unexpected send with the request id and serialized parameters; target posts a receive for the unexpected request and makes progress
3. Target executes the call
4. Origin makes progress; (target posts a send with the serialized response)
Progress Model
Callback-based model with a completion queue
Explicit progress with HG_Progress() and HG_Trigger()
– Allows the user to create a workflow
– No need for an explicit wait call (shim layers possible)
– Facilitates operation scheduling, multi-threaded execution and cancellation

[Figure: HG_Progress() pushes completed operations onto the completion queue; HG_Trigger() pops callbacks and executes them; callbacks may be wrapped around pthreads, etc.]
do {
    unsigned int actual_count = 0;
    do {
        ret = HG_Trigger(context, 0, 1, &actual_count);
    } while ((ret == HG_SUCCESS) && actual_count);
    if (done)
        break;
    ret = HG_Progress(context, HG_MAX_IDLE_TIME);
} while (ret == HG_SUCCESS);
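Since callbacks may be wrapped around pthreads, this loop is often run in a dedicated progress thread. A minimal sketch, not from the original slides, assuming the application manages the context and a done flag:

/* Sketch: dedicated progress thread running the loop above */
extern volatile int done;   /* set by the application to stop the loop */

static void *
progress_thread(void *arg)
{
    hg_context_t *context = (hg_context_t *) arg;
    hg_return_t ret;

    do {
        unsigned int actual_count = 0;
        /* Drain the completion queue and run the associated callbacks */
        do {
            ret = HG_Trigger(context, 0, 1, &actual_count);
        } while ((ret == HG_SUCCESS) && actual_count);
        if (done)
            break;
        /* Make network progress, blocking until something completes */
        ret = HG_Progress(context, HG_MAX_IDLE_TIME);
    } while (ret == HG_SUCCESS);

    return NULL;
}

/* Started with: pthread_create(&tid, NULL, progress_thread, hg_context); */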
Remote Procedure Call: Example
Origin snippet (Callback model):
open_in_t in_struct;

/* Initialize the interface and get target address */
hg_class = HG_Init("ofi+tcp://eth0:22222", HG_FALSE);
hg_context = HG_Context_create(hg_class);
[...]
HG_Addr_lookup_wait(hg_context, target_name, &target_addr);

/* Register RPC call */
rpc_id = MERCURY_REGISTER(hg_class, "open", open_in_t, open_out_t);

/* Set input parameters */
in_struct.in_param0 = in_param0;

/* Create RPC request */
HG_Create(hg_context, target_addr, rpc_id, &hg_handle);

/* Send RPC request */
HG_Forward(hg_handle, rpc_done_cb, &rpc_done_args, &in_struct);

/* Make progress */
[...]
Remote Procedure Call: Example
Origin snippet (next):
hg_return_t
rpc_done_cb(const struct hg_cb_info *callback_info)
{
    open_out_t out_struct;

    /* Get output */
    HG_Get_output(callback_info->handle, &out_struct);

    /* Get output parameters */
    ret = out_struct.ret;
    out_param0 = out_struct.out_param0;

    /* Free output */
    HG_Free_output(callback_info->handle, &out_struct);

    return HG_SUCCESS;
}
Cancellation: HG_Cancel() on the handle (see the sketch below)
– Callback still triggered (canceled = completion)
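A minimal sketch of canceling the request forwarded above, not from the original slides; it assumes the handle from HG_Create()/HG_Forward() is still live and that the completion status is reported in the callback info:

/* Sketch: cancel an in-flight RPC; rpc_done_cb is still triggered */
HG_Cancel(hg_handle);

/* Inside rpc_done_cb, distinguish normal completion from cancellation
 * (assumes callback_info->ret carries the completion status) */
if (callback_info->ret != HG_SUCCESS) {
    /* Canceled or failed: skip HG_Get_output() and reclaim resources */
}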
Remote Procedure Call: Example
Target snippet (main loop):
int
main(int argc, char *argv[])
{
    /* Initialize the interface and listen */
    hg_class = HG_Init("ofi+tcp://eth0:22222", HG_TRUE);
    [...]

    /* Register RPC call */
    MERCURY_REGISTER(hg_class, "open", open_in_t, open_out_t, open_rpc_cb);

    /* Make progress */
    [...]

    /* Finalize the interface */
    [...]
}
Remote Procedure Call: Example
Target snippet (RPC callback):
hg_return_t
open_rpc_cb(hg_handle_t handle)
{
    open_in_t in_struct;
    open_out_t out_struct;

    /* Get input */
    HG_Get_input(handle, &in_struct);
    in_param0 = in_struct.in_param0;

    /* Execute call */
    out_param0 = open(in_param0, ...);

    /* Set output */
    out_struct.out_param0 = out_param0;

    /* Send response back to origin */
    HG_Respond(handle, NULL, NULL, &out_struct);

    /* Free input and destroy handle */
    HG_Free_input(handle, &in_struct);
    HG_Destroy(handle);

    return HG_SUCCESS;
}
Bulk Data Transfers
Definition: bulk data is variable-length data that is (or could be) too large to send eagerly and might need special processing.

– Transfer controlled by the target (better flow control)
– Memory buffer(s) abstracted by a handle
– Handle must be serialized and exchanged using other means

1. Origin and target register a local memory segment and get a handle
2. Origin sends the serialized memory handle (e.g., as part of the RPC input)
3. Target posts a push/pull operation using the local and deserialized remote handles
4. Target tests completion of the remote put/get
Bulk Data Transfers: Example
Origin snippet (contiguous):
/* Initialize the interface and get target address */
[...]

/* Create bulk handle (only change) */
HG_Bulk_create(hg_info->hg_bulk_class, 1, &buf, &buf_size,
               HG_BULK_READ_ONLY, &bulk_handle);

/* Attach bulk handle to input parameters */
[...]
in_struct.bulk_handle = bulk_handle;

/* Create RPC request */
HG_Create(hg_context, target_addr, rpc_id, &hg_handle);

/* Send RPC request */
HG_Forward(hg_handle, rpc_done_cb, &rpc_done_args, &in_struct);

/* Make progress */
[...]
Bulk Data Transfers: Example
Target snippet (RPC callback):
/* Get input parameters and bulk handle */
HG_Get_input(handle, &in_struct);
[...]
origin_bulk_handle = in_struct.bulk_handle;

/* Get size of data and allocate buffer */
nbytes = HG_Bulk_get_size(origin_bulk_handle);

/* Create local bulk handle to read data into */
HG_Bulk_create(hg_info->hg_bulk_class, 1, NULL, &nbytes,
               HG_BULK_READWRITE, &local_bulk_handle);

/* Start pulling bulk data (execute call / send response in callback) */
HG_Bulk_transfer(hg_info->bulk_context, bulk_transfer_cb, bulk_args,
                 HG_BULK_PULL, hg_info->addr, origin_bulk_handle, 0,
                 local_bulk_handle, 0, nbytes, HG_OP_ID_IGNORE);
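The bulk_transfer_cb passed to HG_Bulk_transfer() above is not shown on the slides. A minimal sketch of what it could look like, assuming a hypothetical bulk_args_t structure that carries the RPC handle, input struct, and local bulk handle, and a hypothetical helper that operates on the pulled data:

/* Sketch (not from the original slides): completion callback for the pull */
static hg_return_t
bulk_transfer_cb(const struct hg_cb_info *hg_cb_info)
{
    struct bulk_args_t *args = (struct bulk_args_t *) hg_cb_info->arg;
    open_out_t out_struct;

    /* Bulk data now sits in the buffer behind args->local_bulk_handle;
     * execute the call on it (hypothetical helper) */
    out_struct.ret = process_pulled_data(args);

    /* Send response back to origin */
    HG_Respond(args->handle, NULL, NULL, &out_struct);

    /* Release bulk and RPC resources */
    HG_Bulk_free(args->local_bulk_handle);
    HG_Free_input(args->handle, &args->in_struct);
    HG_Destroy(args->handle);
    free(args);

    return HG_SUCCESS;
}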
Non-contiguous Bulk Data Transfers
Non-contiguous memory is registered through the bulk data interface...

hg_return_t
HG_Bulk_create(
    hg_bulk_class_t *hg_bulk_class,
    hg_size_t        count,
    void           **buf_ptrs,
    const hg_size_t *buf_sizes,
    hg_uint8_t       flags,
    hg_bulk_t       *handle
);

...and allows for scatter/gather memory transfers using virtual memory offsets and lengths
– Origin remains unaware of the target memory layout (see the sketch below)
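A minimal sketch of registering two non-contiguous segments on the target side, not from the original slides; the buffer names and sizes are placeholders and the surrounding variables are the same as in the earlier target snippet:

/* Sketch: two scattered target buffers exposed through one bulk handle */
void      *buf_ptrs[2]  = { header_buf, data_buf };   /* placeholder buffers */
hg_size_t  buf_sizes[2] = { header_size, data_size }; /* placeholder sizes   */
hg_bulk_t  local_bulk_handle;

HG_Bulk_create(hg_info->hg_bulk_class, 2, buf_ptrs, buf_sizes,
               HG_BULK_READWRITE, &local_bulk_handle);

/* A single pull scatters the origin's data into both local segments;
 * the origin only ever sees offsets and lengths, not the layout */
HG_Bulk_transfer(hg_info->bulk_context, bulk_transfer_cb, bulk_args,
                 HG_BULK_PULL, hg_info->addr, origin_bulk_handle, 0,
                 local_bulk_handle, 0, nbytes, HG_OP_ID_IGNORE);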
Macros
Generate as much boilerplate code as possible for
– Serialization / deserialization of parameters
– Sending / executing RPC

Single include header file shared between origin and target
Make use of the BOOST preprocessor for macro definitions
– Generates the serialization / deserialization functions and the structure that contains the parameters
Macros: Serialization / Deserialization
Macro: MERCURY_GEN_PROC(struct_type_name, fields)

MERCURY_GEN_PROC(open_in_t,
    ((hg_string_t)(path))
    ((int32_t)(flags))
    ((uint32_t)(mode))
)

Generated code (both the struct and its proc routine):

/* Define open_in_t */
typedef struct {
    hg_string_t path;
    int32_t flags;
    uint32_t mode;
} open_in_t;

/* Define hg_proc_open_in_t */
static inline hg_return_t
hg_proc_open_in_t(hg_proc_t proc, void *data)
{
    hg_return_t ret = HG_SUCCESS;
    open_in_t *struct_data = (open_in_t *) data;

    ret = hg_proc_hg_string_t(proc, &struct_data->path);
    if (ret != HG_SUCCESS) {
        HG_LOG_ERROR("Proc error");
        ret = HG_FAIL;
        return ret;
    }
    ret = hg_proc_int32_t(proc, &struct_data->flags);
    if (ret != HG_SUCCESS) {
        HG_LOG_ERROR("Proc error");
        ret = HG_FAIL;
        return ret;
    }
    ret = hg_proc_uint32_t(proc, &struct_data->mode);
    if (ret != HG_SUCCESS) {
        HG_LOG_ERROR("Proc error");
        ret = HG_FAIL;
        return ret;
    }
    return ret;
}
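The open_out_t used in the earlier RPC snippets would be generated the same way. A minimal sketch, not shown on the original slides; the field types are assumptions based on how ret and out_param0 are used above:

/* Sketch: output struct for the "open" RPC (assumed field types) */
MERCURY_GEN_PROC(open_out_t,
    ((int32_t)(ret))
    ((int32_t)(out_param0))
)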
Mercury in HDF5 Stack
[Figure: HDF5 software stack showing the HDF5 API on top of the Virtual Object Layer (VOL) with Native (H5), Metadata Server, Raw Mapping, and Remote VOL plugins; the Virtual File Layer (VFL) with posix, sec, mpiio, and split drivers; the file system underneath; and Mercury]
Other projects that already use Mercury
– Mochi (ANL)
– DAOS (Intel)
– DeltaFS (CMU)
– PDC (LBNL)
– MDHIM? / Legion? (LANL)
Current and Future Work
– Support cancel operations of ongoing RPC calls (done)
– Shared-memory plugin and multi-progress (done)
– Transparent shared-memory selection (ongoing)
– Libfabric plugin and DRC support (auth keys) (ongoing)
– Group membership and publish/subscribe model (ongoing)
Where to go next
Mercury project page
– http://mercury-hpc.github.io/
– https://www.mcs.anl.gov/research/projects/mochi/tutorials/
– https://github.com/mercury-hpc
– Download / Documentation / Source / Mailing lists
Current and previous contributors (non-exhaustive): Phil Carns (ANL), Rob Ross (ANL), Scott Atchley (ORNL), Chuck Cranor (CMU), Xuezhao Liu (Intel), Quincey Koziol, Mohamad Chaarawi, John Jenkins, Dries Kimpe
Work supported by the DOE Office of Science, Advanced Scientific Computing Research.