a design for interchangable simulation and implementation Klaus - PowerPoint PPT Presentation

a design for interchangable simulation and implementation Klaus Birkelund Jensen Brian Vinter August 25, 2015 Niels Bohr Institute

outline 1. Introduction, background and motivation Some context to understand why ISI was developed. 2. The current state of storage simulation What techniques are we using today, and what are the advantages and disadvantages? 3. Our approach to interchangeability What is interchangeability in simulation and implementation? 4. Scalability results What makes the ISI approach viable for large scale (storage) simulation? 5. Summary 2

introduction and motivation

motivation To understand how large scientific data sets can be stored efficiently. Efficiency in • Performance • Resources usage • Locality • Energy consumption We focus on energy consumption. 4

about me Former systems operator at HPC/UCPH. Did storage and compute. • Nordic T1 facility (storage & compute for ATLAS and ALICE) • Multi PB disk, multi PB tape, thousands of compute cores. Now, PhD student on the CINEMA project, working on storage techniques. 5

motivation 6

the problems The energy bill associated with storage is an ever larger part of the data center budget. Most common technique to reduce energy consumption and maximize performance: • Hierarchical Storage Management (HSM) The notion of managing data according to popularity, age, size etc. Move passive data to cheaper lower tier storage (usually tape). SSDs HDDs Tape faster cheaper 7

the problems HSM uses many reasonably good techniques including (but not limited to): • LRU-caching and aging • Manual tagging of data (i.e. “please do NOT move my data!”). • Generally, on-demand retrieval. No prediction. 8

the problems HSM is too general to efficiently store what we define as known data sets . We focus on scientific and industrial tomography imaging. Imaging data exhibits known workloads and structure. We should acknowledge and exploit that. 9

the problems In the data center, durability and reliability is most commonly provided by large RAID systems, but erasure codes are rapidly gaining traction. In RAID, all drives must spin simultaneusly. There are solutions to this in the literature, including: • Power-aware RAID (or gear shifting ). • Intelligent data placement (e.g. locality optimized). They are all general in nature. 10

the problems The principle of “optimizing for the common case” has always been a good strategy. “This data was just used — let’s keep it around for a week... or so ” But, the common case isn’t at all common when working with well-defined scientific data. “This data has just been acquired — the physicist won’t use it for months... if ever ” 11

solutions What is possible if we exploit what is known ? • Raw data can be moved directly to tape • Stream filtering But how to quantify any possible benefits? Simulation of storage hierarchies, workloads and data acquisition and consumption. 12

building a storage system Developing a large-scale storage system where the design isn’t exactly known in advance, could go something like this: 1. Simulate a model and identify a design. 2. Implement a prototype from the design. 3. Measure the prototype and validate it and the model against predictions. 4. Repeat . Feed the results of the validation back into the simulator and/or model and repeat from step 1. The process is sound, but can we improve it? 13

improvements Interchangeability of simulation and implementation Eliminating the simulation – prototyping – measure cycle. Simulation Prototyping Measurement ISI Design Implementation Validation 14

storage simulation Simulate the system model using Discrete Event Simulation (DES). A DES is a priority queue of events, handled sequentially. Each event has a time stamp, updates the model and adds new events to the queue when handled. 15

storage simulation 4: 7: end procedure end while 6: 5: Main loop of a DES. 16 2: 3: 1: procedure DES-LOOP( Q ) Algorithm 1 Discrete Event Simulation while Q ̸ = ∅ do e ← Dequeue ( Q ) T ← Clock ( e ) ▷ update world clock Process ( e ) ▷ process event and add new

storage simulation An event is processed by a handler. Typically a huge function with a single switch-statement. Parallel DES (PDES), generalizes this by allowing multiple processes to have a local priority queue. 17

parallel des ROSS (Rensselaer’s Optimistic Simulation System) is an optimistic PDES. • Extremely high performance • Runs on millions of cores • Relies on Reverse Computation In summary: a savage beast 18

interchangeable simulation and implementation Model the system components as the individual processes they are. The process logic directly implements a prototype. Requires an environment supporting millions of independently communicating processes: • Library based: ZeroMQ Substantial reduction in time spent going from modeling to prototype. 19 • Language based: Go, Erlang, occam- π

interchangeable simulation and implementation Do measurement at the same points that does simulation. No (explicit) priority queues. Communication is done directly between interacting entities. Communicate instead of dictating events. 20

discrete vs. real-time Simulated durations are calculated in the processes that does the work. Interchangeability allows components to be swapped around and possibly mixing discrete time for some components with real-time for other components. 21

simulating a rather huge tape library • 90 days of constant I/O • Three types of entities: clients, tape drives and changers • Up to 250,000 processes simulated. 22 • Fixed ratio of 16 : 8 : 1

i/o communication path client i ch changers ch drives 1: req{ ch client i } 2: req{ ch changer i } drive i 5: req{ ch client i } 23 6: ch client i ← resp{} 4: ch client i ← resp{ ch drive i } 3: ch changer i ← resp{ ch drive i }

go Open source concurrent programming language, created and primarily developed by Google. Designed to be highly productive and easy to learn. Follows the principle of least surprise . Key features: low level language features. • Garbage-collected • Compiled • Statically typed 24 • CSP and π -calculus style channels and processes as

func client(lib *library) { ch := make(chan response, *chanBufSize) for { lib.changers <- request{mount, ch, clock} resp = <-ch clock = clock.Add(resp.t) waitTime += resp.t t += resp.t resp.ch <- request{read, ch, clock} resp = <-ch clock = clock.Add(resp.t) t += resp.t ioTime += resp.t } } 25

scalability results

results (sequential) 27 Runtime of Tape Library Simulation on 1 core 10000 1000 Runtime (seconds) 100 10 unbu ff ered bufsize=100 bufsize=1000 1 10 100 1000 10000 100000 1e+06 Number of total processes (multiples of 8 drives, 1 changer, 16 clients)

results (parallel) 28 Runtime of Tape Library Simulation with bu ff ered channels (size 100) 1000 100 Runtime (seconds) 10 1 core 2 cores 4 cores 8 cores 1 10 100 1000 10000 100000 1e+06 Number of total processes (multiples of 8 drives, 1 changer, 16 clients)

unbuffered channels 652.45 70.10 75.45 10000 292.40 365.42 585.33 243.83 25000 397.34 419.40 528.45 245.64 100000 881.00 726.77 902.13 1788.21 250000 1839.43 1307.85 1392.19 3671.10 80.37 2500 Processes 9.22 1 core 2 cores 4 cores 8 cores 25 2.14 4.14 4.04 4.00 100 4.96 37.77 5.82 6.15 250 23.90 10.77 10.71 13.39 1000 101.09 37.75 32.00 29

buffered channels 176.50 76.30 72.19 10000 123.91 121.59 174.01 315.17 25000 136.47 123.72 322.98 110.83 100000 153.77 136.59 184.69 309.25 250000 691.15 139.34 191.30 311.50 122.83 2500 Processes 5.11 1 core 2 cores 4 cores 8 cores 25 2.22 2.11 2.96 2.91 100 4.96 33.19 4.44 4.58 250 27.09 12.63 9.86 10.78 1000 110.72 43.19 32.16 30

summary and future work • Rapid transition from simulation/modeling to prototype • Communicate instead of dictating events • No reverse computation • Scales well with at least Go • Further refinement and packageing of the ISI patterns. • Look into locality management of Goroutines. 31

Thank you Questions? 32

a design for interchangable simulation and implementation Klaus - PowerPoint PPT Presentation

a design for interchangable simulation and implementation Klaus Birkelund Jensen Brian Vinter August 25, 2015 Niels Bohr Institute outline 1. Introduction, background and motivation Some context to understand why ISI was developed. 2. The

Outline Narcisse Ngada DESY, MKK 1) What is simulation ? 14.05.2014 2) Why simulation ? 3)

Grid simulation (AliEn) Outline GRID simulation Simulation tool Ptolemy (Berkeley)

T7 Cloud Simulation On-demand access simulation December 2016 T7 Cloud Simulation December 2016

Simulation Simulation CHAPTER 1 INTRODUCTION TO SIMULATION 2 MODELING CHAPTER 1 INTRODUCTION

Solar Cells Simulation and Design Solar Cell Simulation and Design - Outline Solar cells are

Simulation of stationary processes Timo Tiihonen 2014 Tactical aspects of simulation

Automated Configuration of Co-simulation with Domain Specific Hints Co-simulation on the rise

MD3311 Simulation Results Joschua Dilly 28.01.2019 MD3311 Simulation Results 2 Introduction

Surgical Simulation: Surgical Simulation: We dont need simulation. We dont need

Why Bayesian methods in Simulation? Simulation Simulation Model Inputs BAYESIAN IDEAS

Statistical Simulation in Python Tushar Shanker Data Scientist DataCamp Statistical Simulation

Chapter 2 Simulation Examples Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Architecture without explicit locks for logic Importance Of Simulation simulation on SIMD

Simulation Monte Carlo Monte Carlo simulation Outcome of a single stochastic simulation run

CPS 108, Fall 1999 Program Design and Implementation Software Design and Implementation

The Modelling and Simulation Process 1. History of Modelling and Simulation 2. Modelling and

Radio-isotope Identification Using Dictionary Learning Approach for Plastic Spectra Junhyeok

Technical Sessions & 2012 14 & 15 th th TECHNOLOGY IN 21 CENTURY ST NATIONAL

A Compressive Sensing Framework for Multirate Signal Estimation Ender M. Ek sio glu, A.

DSP Algorithms for CBE Architecture Jan Kryl (krylj1@fel.cvut.cz) Petr Kurtin

LABOUR FORCE MODELING BASED ON AGENT-BASED SIMULATION Mustafa DN Ph.D., Rtd. Captain (N)

Testing & Validation Human Workload Modeling for Autonomous Ground Vehicles Presen enting

Beta Presentation Matrix: Vehicle Simulator The Capstone Experience Team GM Sam Moore Duncan

CE634 Tel: 6616 6685 Email: Nobari @ sharif Presented by Sadegh Nobari Simulation What is

a design for interchangable simulation and implementation Klaus - PowerPoint PPT Presentation

a design for interchangable simulation and implementation Klaus Birkelund Jensen Brian Vinter August 25, 2015 Niels Bohr Institute outline 1. Introduction, background and motivation Some context to understand why ISI was developed. 2. The

Outline Narcisse Ngada DESY, MKK 1) What is simulation ? 14.05.2014 2) Why simulation ? 3)

Grid simulation (AliEn) Outline GRID simulation Simulation tool Ptolemy (Berkeley)

T7 Cloud Simulation On-demand access simulation December 2016 T7 Cloud Simulation December 2016

Simulation Simulation CHAPTER 1 INTRODUCTION TO SIMULATION 2 MODELING CHAPTER 1 INTRODUCTION

Solar Cells Simulation and Design Solar Cell Simulation and Design - Outline Solar cells are

Simulation of stationary processes Timo Tiihonen 2014 Tactical aspects of simulation

Automated Configuration of Co-simulation with Domain Specific Hints Co-simulation on the rise

MD3311 Simulation Results Joschua Dilly 28.01.2019 MD3311 Simulation Results 2 Introduction

Surgical Simulation: Surgical Simulation: We dont need simulation. We dont need

Why Bayesian methods in Simulation? Simulation Simulation Model Inputs BAYESIAN IDEAS

Statistical Simulation in Python Tushar Shanker Data Scientist DataCamp Statistical Simulation

Chapter 2 Simulation Examples Banks, Carson, Nelson &amp; Nicol Discrete-Event System Simulation

Architecture without explicit locks for logic Importance Of Simulation simulation on SIMD

Simulation Monte Carlo Monte Carlo simulation Outcome of a single stochastic simulation run

CPS 108, Fall 1999 Program Design and Implementation Software Design and Implementation

The Modelling and Simulation Process 1. History of Modelling and Simulation 2. Modelling and

Radio-isotope Identification Using Dictionary Learning Approach for Plastic Spectra Junhyeok

Technical Sessions &amp; 2012 14 &amp; 15 th th TECHNOLOGY IN 21 CENTURY ST NATIONAL

A Compressive Sensing Framework for Multirate Signal Estimation Ender M. Ek sio glu, A.

DSP Algorithms for CBE Architecture Jan Kryl (krylj1@fel.cvut.cz) Petr Kurtin

LABOUR FORCE MODELING BASED ON AGENT-BASED SIMULATION Mustafa DN Ph.D., Rtd. Captain (N)

Testing &amp; Validation Human Workload Modeling for Autonomous Ground Vehicles Presen enting

Beta Presentation Matrix: Vehicle Simulator The Capstone Experience Team GM Sam Moore Duncan

CE634 Tel: 6616 6685 Email: Nobari @ sharif Presented by Sadegh Nobari Simulation What is

Chapter 2 Simulation Examples Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Technical Sessions & 2012 14 & 15 th th TECHNOLOGY IN 21 CENTURY ST NATIONAL

Testing & Validation Human Workload Modeling for Autonomous Ground Vehicles Presen enting