Just-in-time Staging of Large Input Just-in-time Staging of Large - PowerPoint PPT Presentation

Just-in-time Staging of Large Input Just-in-time Staging of Large Input Data for Supercomputing Jobs Data for Supercomputing Jobs Henry Monti, Ali R. Butt Sudharshan S. Vazhkudai

HPC Center Data Stage-in Problem HPC Center Data Stage-in Problem  Data stage-in entails moving all necessary input files for a job to a center’s local storage • Requires significant commitment of center resources while waiting for the job to run • Storage failures are common, and users may be required to restage data  Delaying input data causes costly job rescheduling  Staging data too early is undesirable • From a center standpoint: • Wastes scratch space that could be used for other jobs • From a user job standpoint: • Potential job rescheduling due to storage system failure ⇒ Coinciding Input Data Stage-in time with job execution time improves HPC center serviceability 2

Current Methods to Stage-in Data Current Methods to Stage-in Data  No standardized method  Employ common point-to-point transfer tools: • GridFTP, HSI, scp, …  Limitations • Entail early stage-in to ensure data availability • Do not leverage orthogonal bandwidth • Oblivious to deadlines or job start times Not an optimal approach for HPC data stage-in 3

Our Contribution: Our Contribution: A Just-in-time Data Stage-in Service A Just-in-time Data Stage-in Service  Coincides data stage-in with job start time  Attempts to reduce overall scratch space usage  Uses intermediate locations for temporary storage  Provides for quick restaging after storage failures  Integrates with real-world tools • Portable Batch System (PBS) • BitTorrent  Supports a fault-tolerant way to stage data while 4 efficiently utilizing scratch space

Job Rescheduled! Transfer completes much earlier than job startup time Storage system failures may entail re-staging of data! 5

Time between stage-in and job startup is small Fast transfers  better opportunities for JIT Staging 6

Challenges Faced in JIT Staging Challenges Faced in JIT Staging 1. Obtaining accurate job start times 1. Obtaining accurate job start times 2. Adapting to dynamic network behavior 2. Adapting to dynamic network behavior 3. Ensuring data reliability and availability 3. Ensuring data reliability and availability 4. Managing deadlines during stage-in 4. Managing deadlines during stage-in 5. Utilizing intermediate nodes 6. Providing incentives to participate 6. Providing incentives to participate 7

Obtaining Accurate Job Start Times Obtaining Accurate Job Start Times  Accurate estimates of job start time needed to avoid job rescheduling  Solution: Use Batch Queue Prediction (BQP) • Provides statistical upper bound on job wait • Predicts probability of job starting by the deadline  Obtain predictions of job start time from BQP  Stage-in data using this deadline 8

Adapting Data Distribution To Adapting Data Distribution To Dynamic Network Behavior Dynamic Network Behavior  Available bandwidth can change • Distribute data randomly – may not be effective • Utilize network monitoring 4 Mb/s  Solution: Use Network Weather Service (NWS) 5 Mb/s • Provides bandwidth Measurement 10 Mb/s 1 Mb/s • Predicts future bandwidth ?  Choose dynamically changing data paths  Select enough nodes to satisfy a given deadline  Monitor and update the selected nodes 9

Protecting Data from Intermediate Protecting Data from Intermediate Storage Location Failure Storage Location Failure  Problem: Node failure may cause data loss  Solution: 1. Use data replication • Achieved through multiple data flow paths 2. Employ Erasure coding • Can be done by the user or at the intermediates 10

Managing Deadlines during Stage-in Managing Deadlines during Stage-in  Use NWS to measure available bandwidths • Use Direct if it can meet a deadline • Otherwise, perform decentralized stage-in  If end host fails or cannot meet deadline • Utilize decentralized stage-in approach T Stage <= T JobStartup 11

Intermediate Node Discovery Intermediate Node Discovery  User specifies known and trusted nodes  Utilize P2P Overlay  Nodes advertise their availability to others  Receiving nodes discovers the advertiser 2 128 -1 0 Identifier space  Discovered nodes utilized as necessary 12

P2P Data Storage and Dissemination P2P Data Storage and Dissemination  P2P-based storage • Enables robust storage of data on loosely coupled distributed participants: CFS, PAST, OceanStore, …  P2P-based multicast  Enables application-level one to many communication  Example: BitTorrent • Uses a scatter-gather protocol to distribute files • Leverages Seeds - peers that store entire files • Employs a tracker to maintain lists of peers • Uses a “torrent file” containing metadata for data retrieval 13

Incentives to Participate in Incentives to Participate in Stage-in Process Stage-in Process  Modern HPC jobs are often collaborative • “Virtual Organizations” - set of geographically distributed users from different sites • Jobs in TeraGrid usually from such organizations  Resource bartering among participants to facilitate each others stage-in over time  Nodes specified and trusted by the user 14

Integrating Stage-in with PBS Integrating Stage-in with PBS  Provide new PBS directives • Specifies destination, intermediate nodes, and deadline #PBS -N myjob #PBS -l nodes=128, walltime=12:00 mpirun -np 128 ~/MyComputation #Stagein file://SubmissionSite:/home/user/input1 #InterNode node1.Site1:49665:50GB ... #InterNode nodeN.SiteN:49665:30GB #Deadline 1/14/2007:12:00 15

Adapting BitTorrent Functionality to Data Stage-in  Tailor BitTorrent to meet the needs of our stage-in  Restrict the amount of result-data sent to a peer • Peers with less storage than the input size can be utilized  Incorporate global information into peer selection • Use NWS bandwidth measurements • Use knowledge of node capacity from PBS scripts • Choose the appropriate nodes with storage capacity  Recipients are not necessarily end-hosts • They may simply pass data onward 16

Evaluation: Experimental Setup Evaluation: Experimental Setup  Objectives • Compare with direct transfer, and BitTorrent • Validate our method as an alternative to other stage-in methods  PlanetLab test bed • 6 PlanetLab nodes: center + end user + 4 intermediate nodes  Experiments: Compare the proposed method with • Point-to-point transfer (scp) • Standard BitTorrent  Observe the effect of bandwidth changes 17

Results: Data Transfer Times with Results: Data Transfer Times with Respect to Direct Transfer Respect to Direct Transfer File Size 100 MB 240 MB 500 MB 2.1 GB Direct 172 351 794 3082 Client Offload 139 258 559 2164 Pull 43 106 193 822 A JIT stage-in is capable of significantly improving transfer times Times are in seconds 18

Results: Data Transfer Times with Results: Data Transfer Times with Respect to Standard BitTorrent BitTorrent Respect to Standard Phase BitTorrent Our Method Send to all intermediate nodes 2653 2164 (Client Offload) HPC Center download (Pull) 960 822 Monitoring based stage-in is capable of outperforming standard BitTorrent Times are in seconds Transferring 2.1 GB file 19

Conclusion Conclusion  A fresh look at Data Stage-in • Decentralized approach • Monitoring-based adaptation  Considers deadlines and job start times  Integrated with real-world tools  Outperforms direct transfer by 73.3% in our experiments 20

Future Work Future Work  Measuring scratch space savings  Measuring potential job delays  Testing other stage-in scenarios  Contact • Virginia Tech. • Distributed Systems and Storage Lab. http://research.cs.vt.edu/dssl/ • {hmonti, butta}@cs.vt.edu • ORNL • http://www.csm.ornl.gov/~vazhkuda/Storage.html • vazhkudaiss@ornl.gov 21

Just-in-time Staging of Large Input Just-in-time Staging of Large - PowerPoint PPT Presentation

Just-in-time Staging of Large Input Just-in-time Staging of Large Input Data for Supercomputing Jobs Data for Supercomputing Jobs Henry Monti, Ali R. Butt Sudharshan S. Vazhkudai HPC Center Data Stage-in Problem HPC Center Data Stage-in

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Staging Drupal Change Management Strategies for Drupal DrupalCamp CT 2010 Staging Drupal

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Last time: staging basics . < e > . 1/ 54 Staging recap Goal : specialise with available

Last time: staging basics . < e > . 1/ 41 Staging pow let rec pow x n = if n = 0 then .

Input Devices Managing text and positional input 1 CS 349 - Input Devices iPod Wheel 2 CS 349

Expanding the Reach of Fuzzing Caroline Lemieux September 8 th , 2020 Fuzzcon Europe

Wayland Input Methods Michael Hasselmann Openismus GmbH Wayland Input Methods Input methods?

Pro- and cons of staging complex EVAR Is there more to staging than lower paraplegia? Barend

PREOPERATIVE STAGING PREOPERATIVE STAGING IN RECTAL CANCER IN RECTAL CANCER Jacqueline A.

Kings Staging Ammar Al-Chalabi ENCALS Training for TRICALS Milan, 21-22 May 2016

SOCIAL MEDIA AND MDIAS SOCIAUX THE STAGING OF ET MISE EN SCNE HISTORY DE LHISTOIRE

problems of direct input and solutions Input devices vs. Finger-based input Indirect vs. Direct

Exercise 1: Basic Input Exercise 1: Basic Input FLUKA Beginners Course Exercise 1: Basic Input

Data Staging and Asynchronous I/O in ADIOS Hasan Abbasi ORNL Jong Choi ORNL Greg Eisenhauer

Staging in High Powered Rocketry Presenters: Mai Vo, Carson Dobbs Authors: Carson Coursey, Mai

Kuznets Curve: A Simple Stages Dynamical System-Based What Happens on the . . . What Happens on

5 STAGES OF INTIMACY Dr. Jessica Higgins THE DEVELOPMENT OF INTIMACY Every couple goes

Welcome to Market Leader Power Hour!! Todays call will be recorded View todays

Last time: effects effect E : t 1/ 38 This time: staging . < e > . 2/ 38 Review:

Staging Parser Combinators for Efficient Data Processing Parsing @ SLE, 14 September 2014

Staging studies Callum Wilkinson University of Bern April 15, 2019 CPV 1 kt MW yr (0.04 actual

Just-in-time Staging of Large Input Just-in-time Staging of Large - PowerPoint PPT Presentation

Just-in-time Staging of Large Input Just-in-time Staging of Large Input Data for Supercomputing Jobs Data for Supercomputing Jobs Henry Monti, Ali R. Butt Sudharshan S. Vazhkudai HPC Center Data Stage-in Problem HPC Center Data Stage-in

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Staging Drupal Change Management Strategies for Drupal DrupalCamp CT 2010 Staging Drupal

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Last time: staging basics . &lt; e &gt; . 1/ 54 Staging recap Goal : specialise with available

Last time: staging basics . &lt; e &gt; . 1/ 41 Staging pow let rec pow x n = if n = 0 then .

Input Devices Managing text and positional input 1 CS 349 - Input Devices iPod Wheel 2 CS 349

Expanding the Reach of Fuzzing Caroline Lemieux September 8 th , 2020 Fuzzcon Europe

Wayland Input Methods Michael Hasselmann Openismus GmbH Wayland Input Methods Input methods?

Pro- and cons of staging complex EVAR Is there more to staging than lower paraplegia? Barend

PREOPERATIVE STAGING PREOPERATIVE STAGING IN RECTAL CANCER IN RECTAL CANCER Jacqueline A.

Kings Staging Ammar Al-Chalabi ENCALS Training for TRICALS Milan, 21-22 May 2016

SOCIAL MEDIA AND MDIAS SOCIAUX THE STAGING OF ET MISE EN SCNE HISTORY DE LHISTOIRE

problems of direct input and solutions Input devices vs. Finger-based input Indirect vs. Direct

Exercise 1: Basic Input Exercise 1: Basic Input FLUKA Beginners Course Exercise 1: Basic Input

Data Staging and Asynchronous I/O in ADIOS Hasan Abbasi ORNL Jong Choi ORNL Greg Eisenhauer

Staging in High Powered Rocketry Presenters: Mai Vo, Carson Dobbs Authors: Carson Coursey, Mai

Kuznets Curve: A Simple Stages Dynamical System-Based What Happens on the . . . What Happens on

5 STAGES OF INTIMACY Dr. Jessica Higgins THE DEVELOPMENT OF INTIMACY Every couple goes

Welcome to Market Leader Power Hour!! Todays call will be recorded View todays

Last time: effects effect E : t 1/ 38 This time: staging . &lt; e &gt; . 2/ 38 Review:

Staging Parser Combinators for Efficient Data Processing Parsing @ SLE, 14 September 2014

Staging studies Callum Wilkinson University of Bern April 15, 2019 CPV 1 kt MW yr (0.04 actual

Last time: staging basics . < e > . 1/ 54 Staging recap Goal : specialise with available

Last time: staging basics . < e > . 1/ 41 Staging pow let rec pow x n = if n = 0 then .

Last time: effects effect E : t 1/ 38 This time: staging . < e > . 2/ 38 Review: