Parallel Programming and Heterogeneous Computing Shared-Memory: - PowerPoint PPT Presentation

Parallel Programming and Heterogeneous Computing Shared-Memory: Concurrency Max Plauth, Sven Köhler , Felix Eberhardt, Lukas Wenzel and Andreas Polze Operating Systems and Middleware Group

Von Neumann Model Processor executes a sequence of instructions ■ Arithmetic operations – Memory to be read / written – Address of next instruction – Software layering tackles complexity of instruction stream ■ Parallelism adds coordination problem between multiple instruction ■ streams being executed Memory ParProg 2019 Control Unit Shared-Memory: Concurrency Bus Central Unit Output Sven Köhler Arithmetic Logic Unit Input Chart 2

Concurrency in History 1961, Atlas Computer, Kilburn & Howarth ■ Based on Germanium transistors, □ assembler only First use of interrupts to simulate concurrent □ execution of multiple programs - multiprogramming 60 ‘ s and 70 ‘ s: Foundations for concurrent ■ software developed 1965, Cooperating Sequential Processes, □ E.W.Dijkstra ParProg 2019 First principles of concurrent programming – Shared-Memory: Basic concepts: Critical section, mutual Concurrency – exclusion, fairness, Sven Köhler speed independence Chart 3

Cooperating Sequential Processes [Dijkstra] 1965, Cooperating Sequential Processes, Edsger Wybe Dijkstra ■ Comparison of sequential and non-sequential machine Example: Sequential electromagnetic solution to find the largest □ value in an array Current lead through magnet coil – Switch to magnet with larger current – Progress of time is relevant □ ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 4

Cooperating Sequential Processes [Dijkstra] Progress of time is relevant ■ After applying one step, machine needs □ ParProg 2019 some time to show the result Shared-Memory: Concurrency Same line differs only in left operand □ Sven Köhler Concept of a parameter that comes from history, □ leads to alternative setup for the same behavior Chart 5 Rules of behavior form a program ■

Cooperating Sequential Processes [Dijkstra] Idea: Many programs for expressing the same intent ■ Example: Consider repetitive nature of the problem ■ Invest in a variable j □ à generalize the solution for any number of items ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 6

Cooperating Sequential Processes [Dijkstra] Assume we have multiple of these sequential programs ■ How about the cooperation between such, maybe loosely coupled, ■ sequential processes ? Beside rare moments of communication, □ processes run autonomously Disallow any assumption about the relative speed ■ Aligns to understanding of sequential process, □ which is not affected in its correctness by execution time If this is not fulfilled, it might bring „analogue interferences “ □ ParProg 2019 Note: Dijkstra already identified the „race condition “ problem ■ Shared-Memory: Idea of a critical section for two cyclic sequential processes Concurrency ■ Sven Köhler At any moment, at most one process is engaged in the section □ Implemented through common variables □ Chart 7 Demands atomic read / write behavior □

Critical Section Shared Resource (e.g. memory regions) Critical Section ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 8

Critical Section N threads has some code - critical section - with shared data access ■ Mutual Exclusion demand ■ Only one thread at a time is allowed into its critical section, among all threads □ that have critical sections for the same resource. Progress demand ■ If no other thread is in the critical section, the decision for entering should not be □ postponed indefinitely. Only threads that wait for entering the critical section are allowed to participate in decisions. ParProg 2019 Bounded Waiting demand Shared-Memory: ■ Concurrency It must not be possible for a thread requiring access to a critical section to be □ Sven Köhler delayed indefinitely by other threads entering the section ( starvation problem ) Chart 9

Cooperating Sequential Processes [Dijkstra] Attempt to develop a ■ critical section concept in ALGOL60 parbegin / parend □ extension Atomicity on source □ code line level First approach: ■ Too restrictive, since □ strictly alternating ParProg 2019 Shared-Memory: One process may die □ Concurrency or hang outside of Sven Köhler the critical section (no progress) Chart 10

Cooperating Sequential Processes [Dijkstra] Separate indicators ■ for enter/ leave More fine-grained ■ waiting approach Too optimistic, both ■ processes may end up in the critical section (no mutual exclusion) ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 11

Cooperating Sequential Processes [Dijkstra] First ,raise the flag ‘ , ■ then check for the other Concept of a selfish process ■ Mutual exclusion works ■ If c1=0, then c2=1, □ and vice versa Variables change outside ■ of the critical section only Danger of mutual □ ParProg 2019 blocking ( deadlock ) Shared-Memory: Concurrency Sven Köhler Chart 12

Cooperating Sequential Processes [Dijkstra] Reset locking of critical ■ section if the other one is already in Problem due to assumption ■ of relative speed Process 1 may run much □ faster, always hits the point in time were c2=1 Can lead for one process □ to ,wait forever ‘ without ParProg 2019 any progress Shared-Memory: Concurrency or live lock (both spinning) □ Sven Köhler Chart 13

Cooperating Sequential Processes [Dijkstra] Solution: Dekker ‘ s algorithm, referenced by Dijkstra ■ Combination of fourth approach and turn ,variable ‘ , □ which realizes mutual blocking avoidance through prioritization Idea: Spin for section entry only if it is your turn □ ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 14

Bakery Algorithm [Lamport] def lock(i) { # wait until we have the smallest num choosing[i] = True; num[i] = max(num[0],num[1] ...,num[n-1]) + 1; choosing[i] = False; for (j = 0; j < n; j++) { while (choosing[i]) ; while ((num[j] != 0) && ((num[j],j) “<” (num[i],i))) {};}} def unlock(i) { ParProg 2019 num[i] = 0; } Shared-Memory: Concurrency lock(i) Sven Köhler … critical section … unlock(i) Chart 15

Critical Sections Dekker provided first correct solution only based on shared memory, ■ guarantees three major properties Mutual exclusion □ Freedom from deadlock □ Freedom from starvation □ Generalization by Lamport with the Bakery algorithm ■ Relies only on memory access atomicity □ Both solutions assume atomicity and predictable sequential execution on ■ machine code level ParProg 2019 Shared-Memory: Hardware today: Unpredictable sequential instruction stream ■ Concurrency Out-of-order execution – Sven Köhler Re-ordered memory access – Chart 16 Compiler optimizations –

Test-and-Set Test-and-set processor instruction, wrapped by the operating system ■ Write to a memory location and return its old value as atomic step □ Also known as compare-and-swap (CAS) or read-modify-write □ Idea: Spin in writing 1 to a memory cell, until the old value was 0 ■ Between writing and test, no other operation can modify the value □ Busy waiting for acquiring a (spin) lock ■ function Lock(boolean *lock) { while (test_and_set (lock)) Efficient especially for short ■ ; ParProg 2019 waiting periods } Shared-Memory: Concurrency #define LOCKED 1 int TestAndSet(int* lockPtr) { Sven Köhler int oldValue; oldValue = SwapAtomic(lockPtr, LOCKED); return oldValue == LOCKED; Chart 17 }

ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 18

Binary and General Semaphores [Dijkstra] Find a solution to allow waiting sequential processes to sleep ■ Special purpose integer called semaphore , two atomic operations ■ wait ( S ): P -operation: Decrease value of its argument semaphore by 1, while ( S <= 0); □ “wait” if the semaphore is already zero S --; V -operation: Increase value of its argument semaphore by 1, □ signal ( S ): useful as „signal “ operation S++; Solution for critical section shared between N processes ■ ParProg 2019 Original proposal by Dijkstra did not mandate any wakeup order ■ Shared-Memory: Concurrency Later debated from operating system point of view □ Sven Köhler „Bottom layer should not bother with macroscopic considerations “ □ Chart 19

Example: Binary Semaphore ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 20

Example: General Semaphore ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 21

ParProg 2019 Shared-Memory: Concurrency Sven Köhler Chart 22 https://www.youtube.com/watch?v=6sIlKP2LzbA

Parallel Programming and Heterogeneous Computing Shared-Memory: - PowerPoint PPT Presentation

Parallel Programming and Heterogeneous Computing Shared-Memory: Concurrency Max Plauth, Sven Khler , Felix Eberhardt, Lukas Wenzel and Andreas Polze Operating Systems and Middleware Group Von Neumann Model Processor executes a sequence of

Parallel Computing: Opportunities and Challenges Victor Lee Parallel Computing Lab (PCL), Intel

Parallel Programming and Heterogeneous Computing A2 - Parallel Hardware Max Plauth, Sven Khler,

Parallel Computing the Why and the How Albert-Jan Yzelman February, 2010 Albert-Jan Yzelman

Cluster Basics Hana Sevcikova University of Washington DataCamp Parallel Programming in R

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Outline Overview Theoretical background Parallel computing systems Parallel

Overview Parallel computing platforms Approaches to building parallel computers

PARALLEL Joachim Nitschke PROGRAMMING Project Seminar Parallel Programming, Summer

Parallel Programming and Heterogeneous Computing B2 - Shared-Memory: Programming Models Max

Parallel and Distributed Programming Introduction Kenjiro Taura 1 / 21 Contents 1 Why Parallel

Parallel Numerical Algorithms Chapter 2 Parallel Thinking Section 2.2 Parallel

Parallel Programming and Heterogeneous Computing Shared-Nothing Parallelism Models Max Plauth,

Adventures in HPC and R: Going Parallel What is Parallel Computing? Justin Harrington &

Introduction to OpenMP ! Introduction to parallel computing ! Classification of parallel

Introduction to Parallel Computing George Karypis Parallel Programming Platforms Elements of a

COMP 633 - Parallel Computing Lecture 15 October 1, 2020 Programming Accelerators using

2110412 Parallel Comp Arch Performance and Benchmarking Natawut Nupairoj, Ph.D. Department of

Tiz Tizen en based ased re remote ote contro co ntroller ller CA CAR R usi sing ng i2

Thre read S d Synchro ronization: Too M Much M Milk 1 Implementing Critical Sections in

Physics 2D Lecture Slides Lecture 5: Jan 10 2005 Vivek Sharma UCSD Physics Announcements

THE GRAPHS OF HOFFMAN-SINGLETON, HIGMAN-SIMS, MCLAUGHLIN, AND THE HERMITIAN CURVE OF DEGREE 6 IN

Building a Great, Authentic Patient Engagement Plan Presented by Jami Brown and Jacquelyne

Spiritual Counsel in Nahjul Balagha Session One Ponder and Comment Speech is the spirit s

Edge-connectivity of permutation hypergraphs Zolt an Szigeti Laboratoire G-SCOP INP Grenoble,

Parallel Programming and Heterogeneous Computing Shared-Memory: - PowerPoint PPT Presentation

Parallel Programming and Heterogeneous Computing Shared-Memory: Concurrency Max Plauth, Sven Khler , Felix Eberhardt, Lukas Wenzel and Andreas Polze Operating Systems and Middleware Group Von Neumann Model Processor executes a sequence of

Parallel Computing: Opportunities and Challenges Victor Lee Parallel Computing Lab (PCL), Intel

Parallel Programming and Heterogeneous Computing A2 - Parallel Hardware Max Plauth, Sven Khler,

Parallel Computing the Why and the How Albert-Jan Yzelman February, 2010 Albert-Jan Yzelman

Cluster Basics Hana Sevcikova University of Washington DataCamp Parallel Programming in R

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Outline Overview Theoretical background Parallel computing systems Parallel

Overview Parallel computing platforms Approaches to building parallel computers

PARALLEL Joachim Nitschke PROGRAMMING Project Seminar Parallel Programming, Summer

Parallel Programming and Heterogeneous Computing B2 - Shared-Memory: Programming Models Max

Parallel and Distributed Programming Introduction Kenjiro Taura 1 / 21 Contents 1 Why Parallel

Parallel Numerical Algorithms Chapter 2 Parallel Thinking Section 2.2 Parallel

Parallel Programming and Heterogeneous Computing Shared-Nothing Parallelism Models Max Plauth,

Adventures in HPC and R: Going Parallel What is Parallel Computing? Justin Harrington &amp;

Introduction to OpenMP ! Introduction to parallel computing ! Classification of parallel

Introduction to Parallel Computing George Karypis Parallel Programming Platforms Elements of a

COMP 633 - Parallel Computing Lecture 15 October 1, 2020 Programming Accelerators using

2110412 Parallel Comp Arch Performance and Benchmarking Natawut Nupairoj, Ph.D. Department of

Tiz Tizen en based ased re remote ote contro co ntroller ller CA CAR R usi sing ng i2

Thre read S d Synchro ronization: Too M Much M Milk 1 Implementing Critical Sections in

Physics 2D Lecture Slides Lecture 5: Jan 10 2005 Vivek Sharma UCSD Physics Announcements

THE GRAPHS OF HOFFMAN-SINGLETON, HIGMAN-SIMS, MCLAUGHLIN, AND THE HERMITIAN CURVE OF DEGREE 6 IN

Building a Great, Authentic Patient Engagement Plan Presented by Jami Brown and Jacquelyne

Spiritual Counsel in Nahjul Balagha Session One Ponder and Comment Speech is the spirit s

Edge-connectivity of permutation hypergraphs Zolt an Szigeti Laboratoire G-SCOP INP Grenoble,

Adventures in HPC and R: Going Parallel What is Parallel Computing? Justin Harrington &