
slide-1
SLIDE 1

MAX-PLANCK-GESELLSCHAFT

ASYNCHRONICITY THE CHALLENGE OF FINE-GRAINED PARALLELISM

Luis Kornblueh September 29, 2016

Max-Planck-Institut für Meteorologie

slide-2
SLIDE 2

PERHAPS . . .

slide-3
SLIDE 3

LATEST HARDWARE DEPLOYMENT (courtesy of Miriam, 7a)

3

slide-4
SLIDE 4

SYSTEM CHARACTERISTICS

  • 24 nodes with Broadcom BCM2835 SoC (700 MHz ARM 1176JZF-S, VideoCore IV GPU)
  • Non-blocking fat-tree high-speed network, IEEE 802.3u (100BASE-TX), via USB 2.0 bus (aggregated 64.8 MB/s)
  • NFSv4 network filesystem, SLURM, GCC, mpich
  • Linux Debian jessie (kernel 4.4)

4


slide-8
SLIDE 8

SYSTEM CHARACTERISTICS

  • 24 nodes with Broadcom BCM2835 SoC (700 MHz ARM 1176JZF-S, VideoCore IV GPU)
  • Non-blocking fat-tree high-speed network, IEEE 802.3u (100BASE-TX), via USB 2.0 bus (aggregated 64.8 MB/s)
  • NFSv4 network filesystem, SLURM, GCC, mpich
  • Linux Debian jessie (kernel 4.4)

Successfully ran echam 4.6 T31L19 (CVS version 6.00, 2000-09-19 08:26:58; Git: da9d477; no code changes) using the full system.

4

slide-9
SLIDE 9

ENERGY CONSUMPTION: 100 W (courtesy of Miriam, 7a)

5

slide-10
SLIDE 10

SETTING THE STAGE

slide-14
SLIDE 14

WHAT IS DRIVING NEW DEVELOPMENTS?

Redefinition: the models we talk about consist of all components used in the workflow! The development of global circulation models in its current form has to change and respond to major challenges in hardware development. Example:

  • old node: 12 cores at 2.5 GHz
  • new node: 18 cores at 2.1 GHz

Consequence: more and more, fine-grained parallelism is required to achieve the performance necessary to answer the scientific questions posed.

7
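A quick back-of-the-envelope check of the example, under the simplifying (and in practice wrong) assumption that performance scales with cores times clock frequency:

```python
# Aggregate clock throughput of one node: cores x frequency (GHz).
# Illustrative only: real performance also depends on memory bandwidth,
# vectorization, and scaling efficiency, not just clock sums.
old_node = 12 * 2.5   # 30.0 "core-GHz"
new_node = 18 * 2.1   # 37.8 "core-GHz"

per_core_change = (2.1 - 2.5) / 2.5                  # -16% per-core clock
aggregate_change = (new_node - old_node) / old_node  # +26% aggregate

print(f"per core: {per_core_change:+.0%}, aggregate: {aggregate_change:+.0%}")
```

The aggregate only grows if a code can actually use the extra cores, which is exactly why finer-grained parallelism becomes mandatory.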

slide-15
SLIDE 15

OBJECTIVES

Key points are

  • to keep all critical hardware resources concurrently in use,
  • to minimize or hide the response time for remote access and service requests,
  • to reduce the share of parallel resources and task scheduling not used for computational work itself, and
  • to minimize resource access conflicts.

8


slide-19
SLIDE 19

ALGORITHMS

The solution framework consists of

  • a functional description of the processing algorithms, and
  • a directed acyclic graph (DAG) representation of the processing (to be used for optimization and parallelization).

9
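Such a DAG representation can be sketched with Python's standard `graphlib`; the operator names here are hypothetical, and each edge encodes which operator must finish before another may start:

```python
from graphlib import TopologicalSorter

# Hypothetical processing graph: each key is an operator, each value lists
# the operators whose output it needs (its predecessors in the DAG).
dag = {
    "read":  [],
    "op1":   ["read"],
    "op2":   ["read"],
    "op3":   ["op1", "op2"],
    "store": ["op3"],
}

# A scheduler derives a valid execution order (or parallel batches) from
# the DAG alone -- the functional description stays declarative.
order = list(TopologicalSorter(dag).static_order())
print(order)  # 'read' comes first, 'store' last
```

Because "op1" and "op2" have no edge between them, a parallel scheduler is free to run them concurrently.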

slide-20
SLIDE 20

PROCESSES COMPACTION

slide-21
SLIDE 21

COARSE-GRAINED ASYNCHRONOUS PROCESS

  • atmosphere
  • radiation
  • ocean
  • bio-geo-chemistry

[Diagram: time vs. number of cores; the components advance through time-integration phases separated by barriers]

11
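The barrier pattern in the diagram can be sketched with threads: all components step concurrently, and collecting their results acts as the barrier before the next integration step (the component names are from the slide; the `step` function is a hypothetical stand-in for advancing a component):

```python
from concurrent.futures import ThreadPoolExecutor

def step(component, t):
    # Stand-in for one time-integration step of a model component.
    return f"{component}@t{t}"

components = ["atmosphere", "radiation", "ocean", "bio-geo-chemistry"]
log = []
with ThreadPoolExecutor(max_workers=len(components)) as pool:
    for t in range(2):  # two time-integration steps
        # all components run concurrently within one step ...
        futures = [pool.submit(step, c, t) for c in components]
        # ... and waiting on every result is the barrier before step t+1
        log.extend(f.result() for f in futures)

print(log)
```

The coarse-grained cost is visible even in the sketch: the slowest component determines when the barrier opens, and the others idle until then.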

slide-22
SLIDE 22

HOW A VECTOR PIPELINING PROCESSING MODEL WORKS

[Diagram: node-thread space vs. time; data enters in slots 0 to 4, each slot passes through read, operators 1 to 3, and store, so different slots occupy different pipeline stages at the same time]

12
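A minimal sketch of the pipelining schedule the diagram depicts, assuming one clock tick per stage and the stage names shown (read, three operators, store):

```python
stages = ["read", "op1", "op2", "op3", "store"]
n_slots = 5  # slot 0 .. slot 4

# At clock tick t, slot s executes stage (t - s), if that stage exists:
# while slot 0 runs op2, slot 1 runs op1 and slot 2 reads -- once filled,
# the pipeline keeps every stage unit busy.
schedule = []
for t in range(len(stages) + n_slots - 1):  # ticks to fill and drain
    active = [(s, stages[t - s])
              for s in range(n_slots)
              if 0 <= t - s < len(stages)]
    schedule.append(active)

print(len(schedule))  # 9 ticks for 5 slots x 5 stages
print(schedule[0])    # only slot 0 is reading
print(schedule[4])    # all five slots busy at once
```

The weakness the next slide addresses: this schedule is fixed in advance, so a slow stage stalls every slot behind it.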

slide-23
SLIDE 23

MOVING TO A DAG BASED PROCESSING MODEL

[Diagram: node-thread space vs. time; each slot starts its chain of operators 1 to 3 when its data arrives and sends its result when done, independently of the other slots]

13
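For contrast with the lock-step pipeline, a data-driven sketch: each slot runs its operator chain as soon as its data arrives ("arrive") and emits its result ("send") when done, with hypothetical arrival times and operator costs:

```python
# Hypothetical operator chain with assumed per-operator costs (time units).
chain = [("op1", 2), ("op2", 1), ("op3", 3)]

# Hypothetical arrival times per slot -- deliberately out of order.
arrivals = {0: 0, 1: 5, 2: 1, 3: 7, 4: 2}

send_time = {}
for slot, t in arrivals.items():
    for _op, cost in chain:
        t += cost          # each slot advances independently, no barrier
    send_time[slot] = t

print(send_time)  # slot 2 sends at t=7, before slot 3 has even arrived
```

No slot ever waits for another; the DAG (here a simple chain per slot) is the only synchronization.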

slide-24
SLIDE 24

DAG BASED META-SCHEDULING

Cylc (Hilary Oliver, NIWA)

14

slide-25
SLIDE 25

FUTURE

slide-26
SLIDE 26

DEVELOPMENT ACTIVITIES

  • Development of a DAG-based worker/broker toolkit, with arithmetic operators as a first test, later adding cdo; Hermes, Florian Rathgeber and Tiago Quintino (ECMWF)
  • Refactoring of cdo by moving to C++ and disentangling command-line and operator handling
  • Develop an evaluation hierarchy for cdo operators

16
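A minimal sketch of the worker/broker idea (not the actual toolkit): a broker releases tasks from a hypothetical DAG once their dependencies are met, and workers pull them from a queue and report back; this follows the parallel-processing pattern from the `graphlib` documentation:

```python
import queue
import threading
from graphlib import TopologicalSorter

# Hypothetical task graph: key -> list of predecessor tasks.
dag = {"a": [], "b": ["a"], "c": ["a"], "d": ["b", "c"]}
ts = TopologicalSorter(dag)
ts.prepare()

todo, finished = queue.Queue(), queue.Queue()

def worker():
    while True:
        task = todo.get()
        if task is None:       # poison pill: shut down
            break
        finished.put(task)     # stand-in for running an operator

threads = [threading.Thread(target=worker) for _ in range(2)]
for th in threads:
    th.start()

order = []
while ts.is_active():          # broker loop
    for task in ts.get_ready():  # dependencies satisfied -> hand out
        todo.put(task)
    done = finished.get()      # wait for any worker to report back
    ts.done(done)              # unlocks the successors of `done`
    order.append(done)

for _ in threads:
    todo.put(None)
for th in threads:
    th.join()

print(sorted(order))           # every task ran exactly once
```

The broker owns the DAG state; workers stay stateless, which is what makes swapping arithmetic test operators for cdo operators straightforward.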


slide-29
SLIDE 29

WHAT NEXT?

  • Get a working prototype of post-processing tools and scheduling
  • Use meta-scheduling for applicable problems
  • Rethink the time operator splitting of the model physics to allow for a more functional, concurrently usable representation of processes, or resolve those explicitly . . .
  • Development and application of model-developer-friendly Domain Specific Languages (DSLs)

17


slide-33
SLIDE 33

ADDITIONAL CONSTRAINTS

slide-34
SLIDE 34

UNKNOWNS

There are two more aspects contributing to effective system usage: power consumption and the system's reliability.

The influence of these parameters on future developments is not in the primary scope of these considerations, but they are expected to have a strong impact on solutions.

19