  1. Concurrent Programming
  Romolo Marotta
  Data Centers and High Performance Computing

  2. Amdahl Law—Fixed-size Model (1967)
  • The workload is fixed: it studies how the behaviour of the same program varies when adding more computing power

    $$S_{\text{Amdahl}} = \frac{T_s}{T_p} = \frac{T_s}{\alpha T_s + \frac{(1-\alpha) T_s}{p}} = \frac{1}{\alpha + \frac{1-\alpha}{p}}$$

  • where:
    ◦ $\alpha \in [0, 1]$: serial fraction of the program
    ◦ $p \in \mathbb{N}$: number of processors
    ◦ $T_s$: serial execution time
    ◦ $T_p$: parallel execution time
  • It can be expressed as well vs. the parallel fraction $P = 1 - \alpha$
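  To make the formula concrete, here is a minimal C sketch (an addition of this writeup, not from the slides) that evaluates the Amdahl speedup for a given serial fraction and processor count:

      #include <stdio.h>

      /* Amdahl speedup: fixed workload, serial fraction alpha, p processors */
      double amdahl_speedup(double alpha, unsigned p) {
          return 1.0 / (alpha + (1.0 - alpha) / p);
      }

      int main(void) {
          /* alpha = 0.2: the speedup saturates at 1/alpha = 5 */
          for (unsigned p = 1; p <= 1024; p *= 4)
              printf("p = %4u  S = %.3f\n", p, amdahl_speedup(0.2, p));
          return 0;
      }

  Running it shows the speedup creeping toward, but never exceeding, $1/\alpha$.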

  3. Fixed-size Model

  4. Speed-up According to Amdahl
  [Plot "Parallel Speedup vs. Serial Fraction": speedup vs. number of processors (1 to 10) for α = 0.2, 0.5, 0.8, 0.95, against the linear bound]

  5. How Real is This?

    $$\lim_{p \to \infty} \frac{1}{\alpha + \frac{1-\alpha}{p}} = \frac{1}{\alpha}$$

  6. How Real is This? (cont.)
  • So if the sequential fraction is 20%, we have:

    $$\lim_{p \to \infty} S_{\text{Amdahl}} = \frac{1}{0.2} = 5$$

  • A speedup of at most 5, even with infinitely many processors!

  7. Gustafson Law—Fixed-time Model (1989)
  • The execution time is fixed: it studies how the behaviour of a scaled program varies when adding more computing power

    $$W' = \alpha W + (1-\alpha) p W$$
    $$S_{\text{Gustafson}} = \frac{W'}{W} = \alpha + (1-\alpha) p$$

  • where:
    ◦ $\alpha \in [0, 1]$: serial fraction of the program
    ◦ $p \in \mathbb{N}$: number of processors
    ◦ $W$: original workload
    ◦ $W'$: scaled workload
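  A matching C sketch (again an addition, not from the slides) shows the contrast with Amdahl: under the fixed-time model the speedup grows linearly with p instead of saturating:

      #include <stdio.h>

      /* Gustafson speedup: fixed time, workload scaled with p processors */
      double gustafson_speedup(double alpha, unsigned p) {
          return alpha + (1.0 - alpha) * p;
      }

      int main(void) {
          /* alpha = 0.2: unlike Amdahl, no saturation as p grows */
          for (unsigned p = 1; p <= 1024; p *= 4)
              printf("p = %4u  S = %.1f\n", p, gustafson_speedup(0.2, p));
          return 0;
      }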

  8. Fixed-time Model

  9. Speed-up According to Gustafson
  [Plot "Parallel Speedup vs. Serial Fraction": speedup vs. number of processors (1 to 10) for α = 0.2, 0.5, 0.8, 0.95, against the linear bound]

  10. Amdahl vs. Gustafson—a Driver's Experience
  Amdahl Law: A car is travelling between two cities 60 km apart, and has already covered half the distance at 30 km/h. No matter how fast you drive the last half, it is impossible to achieve an average of 90 km/h before reaching the second city: the first half has already taken 1 hour, and the total distance is only 60 km, so even driving infinitely fast you would only average 60 km/h.
  Gustafson Law: A car has been travelling for some time at less than 90 km/h. Given enough time and distance to travel, the car's average speed can always eventually reach 90 km/h, no matter how long or how slowly it has already travelled. If the car spent one hour at 30 km/h, it could achieve this by driving at 120 km/h for two additional hours.

  11. Sun, Ni Law—Memory-bounded Model (1993)
  • The workload is scaled, bounded by memory

    $$S_{\text{Sun-Ni}} = \frac{\text{sequential time for workload } W^*}{\text{parallel time for workload } W^*} = \frac{\alpha W + (1-\alpha) G(p) W}{\alpha W + \frac{(1-\alpha) G(p) W}{p}} = \frac{\alpha + (1-\alpha) G(p)}{\alpha + \frac{(1-\alpha) G(p)}{p}}$$

  • where:
    ◦ $G(p)$ describes the workload increase as the memory capacity increases
    ◦ $W^* = \alpha W + (1-\alpha) G(p) W$

  12. Memory-bounded Model

  13. Speed-up According to Sun, Ni

    $$S_{\text{Sun-Ni}} = \frac{\alpha + (1-\alpha) G(p)}{\alpha + \frac{(1-\alpha) G(p)}{p}}$$

  14. Speed-up According to Sun, Ni (cont.)
  • If $G(p) = 1$ (fixed workload):

    $$S_{\text{Sun-Ni}} = \frac{1}{\alpha + \frac{1-\alpha}{p}} = S_{\text{Amdahl}}$$

  15. Speed-up According to Sun, Ni (cont.)
  • If $G(p) = p$ (workload scaled with the number of processors):

    $$S_{\text{Sun-Ni}} = \frac{\alpha + (1-\alpha) p}{\alpha + (1-\alpha)} = \alpha + (1-\alpha) p = S_{\text{Gustafson}}$$

  16. Speed-up According to Sun, Ni (cont.)
  • In general, $G(p) > p$ gives a higher scale-up
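  The three laws can be unified in one C sketch (an illustration added here, not from the slides): the Sun, Ni formula parameterized by G(p) reproduces Amdahl for G(p) = 1 and Gustafson for G(p) = p:

      #include <stdio.h>

      /* Sun-Ni speedup with a pluggable workload-scaling function G(p) */
      double sun_ni_speedup(double alpha, unsigned p, double (*G)(unsigned)) {
          double g = G(p);
          return (alpha + (1.0 - alpha) * g) /
                 (alpha + (1.0 - alpha) * g / p);
      }

      double g_fixed(unsigned p)  { (void)p; return 1.0; }   /* -> Amdahl    */
      double g_linear(unsigned p) { return (double)p; }       /* -> Gustafson */

      int main(void) {
          unsigned p = 16;
          double alpha = 0.2;
          printf("G(p)=1: S = %.3f (Amdahl)\n",    sun_ni_speedup(alpha, p, g_fixed));
          printf("G(p)=p: S = %.3f (Gustafson)\n", sun_ni_speedup(alpha, p, g_linear));
          return 0;
      }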

  17. Application Model for Parallel Computers
  [Diagram: workload vs. machine size; the fixed-workload model is communication bound, the fixed-memory model is memory bound, with the fixed-time model in between]

  18. Scalability
  • Efficiency: $E = \frac{\text{speed-up}}{\text{number of processors}}$
  • Strong Scalability: the efficiency is kept fixed while increasing the number of processes and maintaining a fixed problem size
  • Weak Scalability: the efficiency is kept fixed while increasing the problem size and the number of processes at the same rate
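  As a quick numeric illustration (with made-up timings, not from the slides), efficiency is just the measured speedup divided by the processor count:

      #include <stdio.h>

      /* Efficiency from measured serial and parallel execution times */
      double efficiency(double t_serial, double t_parallel, unsigned p) {
          double speedup = t_serial / t_parallel;
          return speedup / p;
      }

      int main(void) {
          /* hypothetical timings: 100 s serial, 16 s on 8 processors */
          printf("E = %.2f\n", efficiency(100.0, 16.0, 8));  /* 6.25/8 = 0.78 */
          return 0;
      }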

  19. Superlinear Speedup
  • Can we have a Speed-up $> p$?

  20. Superlinear Speedup (cont.)
  • Yes!
    ◦ The workload increases more than the computing power ($G(p) > p$)
    ◦ Cache effect: larger accumulated cache size. More (or even all) of the working set fits into the caches, and memory access time drops dramatically
    ◦ RAM effect: the dataset can move from disk into RAM, drastically reducing the time required, e.g., to search it
    ◦ The parallel algorithm uses some form of search, such as a random walk: the more processors are walking, the less distance has to be walked in total before reaching what you are looking for

  21. Parallel Programming
  • Ad-hoc concurrent programming languages
  • Development tools
    ◦ Compilers try to optimize the code
    ◦ MPI, OpenMP, libraries, ...
    ◦ Tools to ease the task of debugging parallel code (gdb, valgrind, ...)
  • Writing parallel code is for artists, not scientists!
    ◦ There are approaches, not prepackaged solutions
    ◦ Every machine has its own singularities
    ◦ Every problem to face has different requirements
    ◦ The most efficient parallel algorithm is not the most intuitive one

  22. Ad-hoc languages
  Ada, Alef, ChucK, Clojure, Curry, Cω, E, Eiffel, Erlang, Go, Java, Julia, Joule, Limbo, Occam, Orc, Oz, Pict, Rust, SALSA, Scala, SequenceL, SR, Unified Parallel C, XProc

  23. Classical Approach to Concurrent Programming
  • Based on blocking primitives
    ◦ Semaphores
    ◦ Lock acquisition
    ◦ ...

  Shared state:
      Semaphore p = 0, c = 0;
      Buffer b;

  PRODUCER:
      while(1) {
          <Write on b>
          signal(p);
          wait(c);
      }

  CONSUMER:
      while(1) {
          wait(p);
          <Read from b>
          signal(c);
      }
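  Below is a minimal runnable C version of this scheme (an elaboration of the slide's pseudocode, assuming a single-slot integer buffer and a bounded loop so it terminates), using POSIX semaphores and pthreads:

      #include <stdio.h>
      #include <pthread.h>
      #include <semaphore.h>

      sem_t p_sem, c_sem;   /* both initialized to 0, as on the slide */
      int buffer;           /* single-slot buffer */

      void *producer(void *arg) {
          (void)arg;
          for (int i = 0; i < 5; i++) {
              buffer = i;            /* <Write on b> */
              sem_post(&p_sem);      /* signal(p): item available */
              sem_wait(&c_sem);      /* wait(c): consumer done reading */
          }
          return NULL;
      }

      void *consumer(void *arg) {
          (void)arg;
          for (int i = 0; i < 5; i++) {
              sem_wait(&p_sem);                 /* wait(p): wait for an item */
              printf("consumed %d\n", buffer);  /* <Read from b> */
              sem_post(&c_sem);                 /* signal(c): slot free again */
          }
          return NULL;
      }

      int main(void) {
          pthread_t prod, cons;
          sem_init(&p_sem, 0, 0);
          sem_init(&c_sem, 0, 0);
          pthread_create(&prod, NULL, producer, NULL);
          pthread_create(&cons, NULL, consumer, NULL);
          pthread_join(prod, NULL);
          pthread_join(cons, NULL);
          return 0;
      }

  Compile with "cc -pthread". The two semaphores enforce strict alternation: the consumer never reads before a write, and the producer never overwrites an unread item.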

  24. Parallel Programs Properties
  • Safety: nothing wrong happens
    ◦ It is called Correctness as well

  25. Parallel Programs Properties (cont.)
  • Liveness: eventually something good happens
    ◦ It is called Progress as well

  26. Correctness
  • What does it mean for a program to be correct?
    ◦ What exactly is a concurrent FIFO queue?
    ◦ FIFO implies a strict temporal ordering
    ◦ Concurrent implies an ambiguous temporal ordering
  • Intuitively, if we rely on locks, changes happen in a non-interleaved fashion, resembling a sequential execution
  • We can say a concurrent execution is correct only because we can associate it with a sequential one, whose functioning we understand
  • A concurrent execution is correct if it is equivalent to a correct sequential execution

  27. A simplified model of a concurrent system
  • A concurrent system is a collection of sequential threads that communicate through shared data structures called objects
  • An object has a unique name and a set of primitive operations
  • An invocation of an operation op on the object x is written as "A op(args*) x", where A is the invoking thread and args* is the sequence of arguments
  • A response to an operation invocation on x is written as "A ret(res*) x", where A is the invoking thread and res* is the sequence of results

  28. A simplified model of a concurrent execution
  • A history is a sequence of invocations and responses generated on an object by a set of threads

  29. A simplified model of a concurrent execution (cont.)
  • A sequential history is a history where every invocation has an immediate response

  Sequential H':
      A op() x
      A ret() x
      B op() x
      B ret() x
      A op() y
      A ret() y

  30. A simplified model of a concurrent execution (cont.)
  • A concurrent history is a history that is not sequential

  Concurrent H:
      A op() x
      B op() x
      A ret() x
      A op() y
      B ret() x
      A ret() y
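  These definitions map naturally onto code. Here is a small C sketch (an added illustration, not from the slides) that represents a history as an array of events and checks whether it is sequential, i.e., whether every invocation is immediately followed by its matching response from the same thread on the same object:

      #include <stdbool.h>
      #include <stdio.h>
      #include <string.h>

      typedef enum { INV, RET } EventKind;

      typedef struct {
          EventKind kind;
          char thread;          /* e.g. 'A', 'B' */
          const char *object;   /* e.g. "x", "y" */
      } Event;

      /* A history is sequential iff it is a sequence of (INV, RET) pairs,
         each pair issued by the same thread on the same object */
      bool is_sequential(const Event *h, size_t n) {
          if (n % 2 != 0) return false;
          for (size_t i = 0; i < n; i += 2) {
              if (h[i].kind != INV || h[i + 1].kind != RET) return false;
              if (h[i].thread != h[i + 1].thread) return false;
              if (strcmp(h[i].object, h[i + 1].object) != 0) return false;
          }
          return true;
      }

      int main(void) {
          /* the concurrent history H from the slide above */
          Event H[] = {
              {INV, 'A', "x"}, {INV, 'B', "x"}, {RET, 'A', "x"},
              {INV, 'A', "y"}, {RET, 'B', "x"}, {RET, 'A', "y"},
          };
          printf("H sequential? %s\n",
                 is_sequential(H, sizeof H / sizeof H[0]) ? "yes" : "no");
          return 0;
      }

  For H the check prints "no", since B's invocation on x is interleaved with A's operations.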

  31. A simplified model of a concurrent execution (2)
  • A process subhistory H|P of a history H is the subsequence of all events in H whose process name is P

  H:
      A op() x
      B op() x
      A ret() x
      A op() y
      B ret() x
      A ret() y

  32. A simplified model of a concurrent execution (2) (cont.)
  • For the history H above, the process subhistory H|A is:

  H|A:
      A op() x
      A ret() x
      A op() y
      A ret() y
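  Continuing the sketch after slide 30 (again an added illustration, reusing its Event type), extracting a process subhistory is a simple filter over the event array:

      /* Copy the subhistory h|p into out; returns the number of events kept.
         Event is the struct defined in the previous sketch. */
      size_t subhistory(const Event *h, size_t n, char p, Event *out) {
          size_t k = 0;
          for (size_t i = 0; i < n; i++)
              if (h[i].thread == p)
                  out[k++] = h[i];
          return k;
      }

  Applied to H with p = 'A', this yields exactly the four events of H|A shown above.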
