

SLIDE 1

Extending the Polyhedral Compilation Model for Debugging and Optimization of SPMD-style Explicitly Parallel Programs

Prasanth Chatarasi

Masters Thesis Defense Habanero Extreme Scale Software Research Group Department of Computer Science Rice University

April 24th, 2017

SLIDE 2

40 Years of Microprocessor Trend

[Figure: 40 years of microprocessor trend data. Log-scale (10^0 to 10^7) plots of transistors (thousands), single-thread performance (SpecINT x 10^3), frequency (MHz), typical power (watts), and number of logical cores vs. year, 1970 to 2020]

Moore’s law still continues, but performance is now driven by parallelism rather than single-thread improvements

https://www.karlrupp.net/2015/06/40-years-of-microprocessor-trend-data/

Chatarasi, Prasanth (Rice University) Masters Thesis Defense April 24th, 2017 1

SLIDE 3

A major challenge facing the overall computer field

Programming multi-core processors: "how to exploit the parallelism in large-scale parallel hardware without undue programmer effort" (Mary Hall et al., Communications of the ACM, 2009)

Two major compiler approaches to tackling the challenge:

Automatic parallelization of sequential programs

The compiler extracts the parallelism. Little burden on the programmer, but many limitations exist!

Manual parallelization of programs

Full burden on the programmer, but higher performance is achievable! Can compilers help the programmer?

SLIDE 4

Focus of this work – SPMD-style parallelism

We focus on SPMD-style parallel programs

• All processors execute the same program: sequential code redundantly, parallel code cooperatively
• Examples: OpenMP for multi-cores, CUDA/OpenCL for accelerators, MPI for distributed systems

SLIDE 5

Focus of this work – Polyhedral compilation model

Polyhedral compilation model

• Algebraic framework to reason about loop nests
• Wide range of applications: automatic parallelization, high-level synthesis, communication optimizations
• Used in production compilers (LLVM, GCC), just-in-time compilers (PolyJIT), and DSL compilers (PolyMage, Halide)

http://pluto-compiler.sourceforge.net/

SLIDE 6

Thesis Statement

Though the polyhedral compilation model was designed for analysis and optimization of sequential programs, our thesis is that it can be extended to support SPMD-style parallel programs as input, with benefits to debugging and optimization of such programs.

Chatarasi et al. (LCPC 2016), An Extended Polyhedral Model for SPMD Programs and its use in Static Data Race Detection
Chatarasi et al. (ACM SRC, PACT 2015), Extending Polyhedral Model for Analysis and Transformation of OpenMP Programs

SLIDE 7

Overall flow of the talk

SLIDE 8

Polyhedral Compilation Model

Compiler (algebraic) techniques for analysis and transformation of codes with nested loops

Advantages over Abstract Syntax Tree (AST) based frameworks:

• Reasoning at the level of statement instances in loops
• Unifies many loop transformations into a single transformation
• Powerful code generation algorithms

SLIDE 9

Polyhedral Representation of Programs - Schedule

for(int i = 1; i < M; i++) {
  for(int j = 1; j < N; j++) {
    A[i][j] = MAX(A[i-1][j], A[i-1][j-1], A[i][j-1]); // S
  }
}

Schedule (θ) – a key element of the polyhedral representation

• Assigns a time-stamp to each statement instance S(i, j)
• Statement instances are executed in increasing order of time-stamps
• Captures the program execution order (a total order in general)

[Figure: iteration space of S, with loop i on one axis and loop j on the other]

θ(S(i,j)) = (i,j)
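As a toy illustration of this time-stamp view (a sketch added here, not part of the PolyOMP tooling), the schedule θ(S(i,j)) = (i,j) can be modeled in a few lines of Python:

```python
# Toy model of a polyhedral schedule: theta assigns each instance of
# statement S the time-stamp (i, j); instances execute in
# lexicographically increasing time-stamp order.
def theta(i, j):
    return (i, j)

def execution_order(M, N):
    """Return all instances of S(i, j), sorted by their time-stamps."""
    instances = [(i, j) for i in range(1, M) for j in range(1, N)]
    return sorted(instances, key=lambda inst: theta(*inst))

# For M = N = 3: S(1,1) runs first, then S(1,2), S(2,1), S(2,2).
print(execution_order(3, 3))  # [(1, 1), (1, 2), (2, 1), (2, 2)]
```

Dependences such as S(i, j) reading A[i-1][j] are satisfied because the time-stamp (i-1, j) lexicographically precedes (i, j).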

SLIDE 10

Limitations of Polyhedral Model

(a) An SPMD-style program

#pragma omp parallel num_threads(2)
{
  {S1;}
  #pragma omp barrier // B1
  {S2;}
  {S3;}
  #pragma omp barrier // B2
}

(b) Program execution order

SLIDE 11

Limitations of Polyhedral Model

(a) An SPMD-style program

#pragma omp parallel num_threads(2)
{
  {S1;}
  #pragma omp barrier // B1
  {S2;}
  {S3;}
  #pragma omp barrier // B2
}

(b) Program execution order


Currently, there are no approaches that capture the partial orders of SPMD programs and express them as schedules

SLIDE 12

Overall workflow (PolyOMP)

SLIDE 13

What is important in SPMD program execution?

#pragma omp parallel
{
  for(int i = 0; i < N; i++)
  {
    for(int j = 0; j < N; j++)
    {
      {S1;} //S1(i, j)
      #pragma omp barrier //B1(i, j)
      {S2;} //S2(i, j)
    }
    #pragma omp barrier //B2(i)
    #pragma omp master
    {S3;} //S3(i)
  }
}

Program execution order for N = 2

Two things matter most: 1) threads and 2) phases

SLIDE 14

Extension 1 – Thread/Space/Allocation Mapping

Space Mapping (θA)

Assigns a logical processor id to each statement instance

#pragma omp parallel
{
  for(int i = 0; i < N; i++)
  {
    for(int j = 0; j < N; j++)
    {
      {S1;} //S1(i, j)
      #pragma omp barrier //B1(i, j)
      {S2;} //S2(i, j)
    }
    #pragma omp barrier //B2(i)
    #pragma omp master
    {S3;} //S3(i)
  }
}

For example, θA(S3(i)) = 0

SLIDE 15

Extension 2 – Phase Mapping

Phase Mapping (θP)

Assigns a logical phase id to each statement instance

#pragma omp parallel
{
  for(int i = 0; i < N; i++)
  {
    for(int j = 0; j < N; j++)
    {
      {S1;} //S1(i, j)
      #pragma omp barrier //B1(i, j)
      {S2;} //S2(i, j)
    }
    #pragma omp barrier //B2(i)
    #pragma omp master
    {S3;} //S3(i)
  }
}

For example, θP(S3(0)) = 3

SLIDE 16

How to compute phase mappings?

We define phase mappings in terms of reachable barriers

Reachable barriers (RB) of a statement instance: the set of barrier instances that can execute after the statement instance without an intervening barrier instance

Examples: RB(S2(0,1)) = B2(0) and RB(S3(0)) = B1(1,0)
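To make the definition concrete, here is a small sketch (an illustration added here, with an assumed per-thread trace of the master thread for the example program with N = 2) that computes the reachable barrier of a statement instance as the next barrier instance in the thread's execution sequence:

```python
# Per-thread trace (master thread) for the running example with N = 2:
# for i in 0..1: for j in 0..1: S1(i,j); B1(i,j); S2(i,j); then B2(i); S3(i).
trace = [
    ("S1", (0, 0)), ("B1", (0, 0)), ("S2", (0, 0)),
    ("S1", (0, 1)), ("B1", (0, 1)), ("S2", (0, 1)),
    ("B2", (0,)), ("S3", (0,)),
    ("S1", (1, 0)), ("B1", (1, 0)), ("S2", (1, 0)),
    ("S1", (1, 1)), ("B1", (1, 1)), ("S2", (1, 1)),
    ("B2", (1,)), ("S3", (1,)),
]

def reachable_barrier(trace, stmt):
    """First barrier instance after `stmt` with no barrier in between."""
    idx = trace.index(stmt)
    for name, iters in trace[idx + 1:]:
        if name.startswith("B"):
            return (name, iters)
    return None  # no later barrier: stmt is in the final phase

print(reachable_barrier(trace, ("S2", (0, 1))))  # ('B2', (0,))
print(reachable_barrier(trace, ("S3", (0,))))    # ('B1', (1, 0))
```

The two printed results match the slide's examples: RB(S2(0,1)) = B2(0) and RB(S3(0)) = B1(1,0).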

SLIDE 17

How to compute phase mappings?

Observation: two statement instances are in the same phase if they have the same set of reachable barrier instances.

θP(S3(0)) = RB(S3(0)) = B1(1,0)
θP(S1(1,0)) = RB(S1(1,0)) = B1(1,0)
⇒ θP(S3(0)) = θP(S1(1,0))

To compute absolute phase mappings: θP(S) = θ(RB(S))

SLIDE 18

Execution order in SPMD-style programs

In general, partial orders are expressed through May-Happen-in-Parallel (MHP) or Happens-Before (HB) relations. We define MHP relations in terms of the space and phase mappings.

MHP: two statement instances can run in parallel if they are run by different threads and are in the same phase of computation.

Program order information in the polyhedral model is now the triple (Space (θA), Phase (θP), Schedule (θ)).
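A minimal sketch of this MHP predicate (an illustration added here; assigning thread id 1 to S1(1,0) is an arbitrary choice, since S1 is executed by every thread):

```python
# Sketch: MHP(s, t) holds iff s and t are mapped to different threads
# (space mapping theta_A) but to the same phase (phase mapping theta_P).
def mhp(space, phase, s, t):
    return space[s] != space[t] and phase[s] == phase[t]

# From the running example: S3(0) runs on the master thread (id 0);
# consider the copy of S1(1,0) executed by thread 1. Both instances
# have reachable barrier B1(1,0), i.e., the same phase.
space = {"S3(0)": 0, "S1(1,0)": 1}
phase = {"S3(0)": "B1(1,0)", "S1(1,0)": "B1(1,0)"}
print(mhp(space, phase, "S3(0)", "S1(1,0)"))  # True
```

So S3(0) and S1(1,0) may happen in parallel, exactly the kind of fact that a purely sequential schedule cannot express.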

SLIDE 19

Overall workflow (PolyOMP)

SLIDE 20

Debugging of SPMD-style programs - Data races

Data races are common bugs in SPMD shared-memory programs.

Definition: a race occurs when two or more threads perform conflicting accesses to a shared variable without any intervening synchronization.

Data races result in non-deterministic behavior: a race manifests in only a few of the possible schedules of a parallel program, making it extremely hard to reproduce and debug!

SLIDE 21

Motivating benchmark

#pragma omp parallel shared(U, V, k)
{
  while (k <= Max) // S1
  {
    #pragma omp for nowait
    for (i = 0 to N)
      U[i] = V[i];
    #pragma omp barrier

    #pragma omp for nowait
    for (i = 1 to N-1)
      V[i] = U[i-1] + U[i] + U[i+1];
    #pragma omp barrier

    #pragma omp master
    { k++; } // S2
  }
}

• 1-dimensional stencil from the OmpSCR suite
• Race between S1 and S2 on variable 'k'
• Our goal: detect such races at compile-time

SLIDE 22

Our approach for race detection

1. Generate race conditions for every pair of read/write accesses of all statements:

   Race(S, T) = true on 'k'
   ⇒ MHP(S, T) = true and S, T conflict on 'k'
   ⇒ θA(S) ≠ θA(T) and θP(S) = θP(T) and S, T conflict on 'k'

2. Solve the race conditions for the existence of solutions. If there are no solutions, there are no data races.
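PolyOMP solves these conditions symbolically over the iteration space; purely as an illustration, the condition for the stencil example can be checked by brute-force enumeration over small bounds (the function name and bounds here are assumptions of this sketch):

```python
# Enumerate candidate witnesses for the race condition on 'k':
#   space: theta_A(S1) != theta_A(S2) and theta_A(S2) = 0 (master)
#   phase: x' = x'' + 1  (the read of k by S1 in iteration x' is in the
#          same phase as the write by S2 in iteration x'')
def find_race(num_threads, max_iter):
    for t1 in range(num_threads):           # theta_A(S1)
        for t2 in range(num_threads):       # theta_A(S2)
            for x1 in range(max_iter):      # x' : while-iteration of S1
                for x2 in range(max_iter):  # x'': while-iteration of S2
                    if t1 != t2 and t2 == 0 and x1 == x2 + 1:
                        return (t1, t2, x1, x2)
    return None  # no witness found: no race under these bounds

print(find_race(2, 2))  # (1, 0, 1, 0): a race witness exists
```

A witness means the race conditions are satisfiable, so the tool reports a race between S1 and S2 on 'k'.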

Chatarasi et.al (LCPC 2016), An Extended Polyhedral Model for SPMD Programs and its use in Static Data Race Detection

SLIDE 23

Our approach on motivating benchmark

#pragma omp parallel shared(U, V, k)
{
  while (k <= Max) // S1 (loop-x)
  {
    #pragma omp for nowait
    for (i = 0 to N)
      U[i] = V[i];
    #pragma omp barrier // B1

    #pragma omp for nowait
    for (i = 1 to N-1)
      V[i] = U[i-1] + U[i] + U[i+1];
    #pragma omp barrier

    #pragma omp master
    { k++; } // S2
  }
}

Race condition between S1(x') and S2(x''):

Space: θA(S1) ≠ θA(S2) ∧ θA(S2) = 0
Phase: θP(S1) = θP(S2) → B1(x') = B1(x'' + 1) → x' = x'' + 1
Conflict: TRUE (same location 'k')

Satisfiable assignment: θA(S1) = 1, θA(S2) = 0, x' = 1, x'' = 0

SLIDE 24

Experimental Setup

Quad-core i7 machine (2.2 GHz) with 16 GB main memory

Benchmark suites: OmpSCR Benchmark Suite, Polybench-ACC OpenMP Benchmark Suite

SLIDE 25

Experiments - OmpSCR Benchmark suite

• Evaluation on 12 benchmarks
• Identified all documented races (5)
• False positives arise because of linearized subscripts

SLIDE 26

Experiments - Polybench-ACC OpenMP Benchmark suite

• Evaluation on 22 benchmarks
• No false positives (all reports verified)
• Majority of races come from shared scalar variables inside work-sharing loops

SLIDE 27

Strengths and Limitations of our approach

Strengths

• Input independent and schedule independent
• Guaranteed to be exact if the input program satisfies all the standard preconditions of the polyhedral model

Limitations

• Textually aligned barriers (all threads must encounter the same sequence of barriers)
• Pointer aliasing

SLIDE 28

Closely related static approaches for race detection

Pathg (Yu et al.): supports OpenMP worksharing loops, simple barriers, and atomics; approach: thread automata; guarantees: per number of threads
OAT (Ma et al.): supports OpenMP worksharing loops, barriers, locks, atomics, single, and master; approach: symbolic execution; guarantees: per number of threads
ompVerify (Basupalli et al.): supports OpenMP 'parallel for'; approach: polyhedral (dependence analysis); guarantees: per worksharing loop
PolyOMP (our approach): supports OpenMP worksharing loops, barriers in arbitrarily nested loops, single, and master; approach: polyhedral (MHP relations); guarantees: per program

SLIDE 29

Overall workflow (PolyOMP)

SLIDE 30

Optimization of SPMD-style programs - Redundant barriers

Redundant use of barriers is a common performance issue.

Definition: a barrier is redundant if its removal does not change the program semantics (i.e., introduces no data races).

Hence, we assume input programs are data-race-free.

Chatarasi, Prasanth (Rice University) Masters Thesis Defense April 24th, 2017 29

slide-31
SLIDE 31

Optimization of SPMD-style programs - Redundant barriers

1   #pragma omp parallel
2   {
3     #pragma omp for
4     for(int i = 0; i < N; i++) {
5       for(int j = 0; j < N; j++)
6         for(int k = 0; k < N; k++)
7           E[i][j] = A[i][k] * B[k][j]; //S1
8     }
9
10    #pragma omp for
11    for(int i = 0; i < N; i++) {
12      for(int j = 0; j < N; j++)
13        for(int k = 0; k < N; k++)
14          F[i][j] = C[i][k] * D[k][j]; //S2
15    }
16
17    #pragma omp for
18    for(int i = 0; i < N; i++) {
19      for(int j = 0; j < N; j++)
20        for(int k = 0; k < N; k++)
21          G[i][j] = E[i][k] * F[k][j]; //S3
22    }
23  }

A sequence of matrix multiplications, i.e., E = A×B; F = C×D; G = E×F;

SLIDE 32

Optimization of SPMD-style programs - Redundant barriers

1   #pragma omp parallel
2   {
3     #pragma omp for
4     for(int i = 0; i < N; i++) {
5       for(int j = 0; j < N; j++)
6         for(int k = 0; k < N; k++)
7           E[i][j] = A[i][k] * B[k][j]; //S1
8     }
9
10    #pragma omp for
11    for(int i = 0; i < N; i++) {
12      for(int j = 0; j < N; j++)
13        for(int k = 0; k < N; k++)
14          F[i][j] = C[i][k] * D[k][j]; //S2
15    }
16
17    #pragma omp for
18    for(int i = 0; i < N; i++) {
19      for(int j = 0; j < N; j++)
20        for(int k = 0; k < N; k++)
21          G[i][j] = E[i][k] * F[k][j]; //S3
22    }
23  }

The implicit barrier at the end of the first worksharing loop (line 8) is redundant
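One way to see why (a simplified, array-level view added for illustration; the actual analysis works on statement instances and subscripts): the first two loop nests touch disjoint sets of arrays, so no conflicting access pair crosses the first implicit barrier, while S3 reads what S1 and S2 write:

```python
# Array footprints of the three worksharing loops (S1, S2, S3).
writes = {"S1": {"E"}, "S2": {"F"}, "S3": {"G"}}
reads  = {"S1": {"A", "B"}, "S2": {"C", "D"}, "S3": {"E", "F"}}

def conflicts(s, t):
    """True if s and t contain a conflicting access pair at the array
    level (write/write, write/read, or read/write)."""
    return bool(writes[s] & (reads[t] | writes[t]) or
                reads[s] & writes[t])

print(conflicts("S1", "S2"))  # False: the barrier between them is redundant
print(conflicts("S2", "S3"))  # True : the barrier before S3 must stay
```

Since S1 and S2 do not conflict, removing the barrier on line 8 introduces no race; S3 conflicts with both S1 (on E) and S2 (on F), so the barrier on line 15 is required.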

SLIDE 33

Our approach for identification of redundant barriers

1. Remove all barriers from the program and compute the data races (using our race detection approach).

2. Map each barrier to the set of races that barrier can fix: for each barrier, recompute the phases and check whether the source and sink of the race fall in different phases.

3. Greedily pick a set of barriers from the map so that all races are covered.

4. Subtract the required barriers from the set of initial barriers; the rest are redundant.
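The greedy covering step can be sketched as follows (a minimal, assumed illustration: the barrier and race names are hypothetical, and PolyOMP's phase recomputation is abstracted into the precomputed 'fixes' map):

```python
def select_barriers(fixes):
    """Greedy set cover: `fixes` maps each barrier to the set of races
    it can separate; pick barriers until every race is covered."""
    uncovered = set().union(*fixes.values())
    required = set()
    while uncovered:
        # Pick the barrier that fixes the most still-uncovered races.
        best = max(fixes, key=lambda b: len(fixes[b] & uncovered))
        required.add(best)
        uncovered -= fixes[best]
    return required

# Hypothetical example: B15 already separates the race that B8 fixes.
fixes = {"B8": {"r1"}, "B15": {"r1", "r2"}, "B22": {"r3"}}
required = select_barriers(fixes)
redundant = set(fixes) - required
print(sorted(required), sorted(redundant))  # ['B15', 'B22'] ['B8']
```

Here B8 ends up outside the required set, so it would be reported as redundant; exact minimum barrier selection is set cover (NP-hard), which is why a greedy choice is a reasonable design.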

SLIDE 34

Experimental Setup

Benchmark suites: OmpSCR Benchmark Suite, Polybench-ACC OpenMP Benchmark Suite

Two platforms: Intel Knights Corner (KNC) and IBM Power 8

                    Intel KNC           IBM Power 8
Microarchitecture   Xeon Phi            PowerPC
Total threads       228                 192
Compiler            Intel ICC v15.0.0   IBM XLC v13.1.2
Compiler flags      -O3 -fast           -O3

Two variants:

• Original OpenMP program
• OpenMP program after removing redundant barriers

SLIDE 35

Experiments - OmpSCR Benchmark suite

• Evaluation on 12 benchmarks
• Detected 4 benchmarks as race-free
• All barriers are necessary to respect program semantics

SLIDE 36

Experiments - Polybench-ACC OpenMP Benchmark suite

• Evaluation on 22 benchmarks
• Detected 14 benchmarks as race-free
• Limited improvement because the work-sharing loops are well load-balanced
• The IBM XLC barrier implementation is more efficient than the Intel ICC barrier

SLIDE 37

Closely related work in barrier analysis

Kamil et al. (LCPC'05): SPMD style; key idea: tree traversal on a concurrency graph; limitation: conservative MHP when barriers are enclosed in loops
Tseng et al. (PPoPP'95): SPMD + fork-join; key idea: communication analysis between computation partitions; limitation: restricted structure of loops enclosing barriers
Zhao et al. (PACT'10): fork-join; key idea: SPMDization by loop transformations; limitation: join (barrier) synchronization only from for-all loops
Our approach: SPMD; key idea: precise MHP analysis with extensions to the polyhedral model; can support barriers in arbitrarily nested loops

SLIDE 38

PolyOMP Infrastructure

SLIDE 39

Conclusions

• Extensions (space and phase mappings) to the polyhedral model to capture partial order in SPMD-style programs
• Formalization of May-Happen-in-Parallel (MHP) relations from the extensions
• Approaches for static data race detection and redundant barrier detection in SPMD-style programs
• Demonstration of our approaches on 34 OpenMP programs from the OmpSCR and PolyBench-ACC benchmark suites

SLIDE 40

Future work

• Enhancing OpenMP dynamic analysis tools for race detection with our MHP analysis
• Replacing barriers with fine-grained synchronization for better performance
• Repair of OpenMP programs with barriers
• Enabling classic scalar optimizations (e.g., code motion) in OpenMP programs

SLIDE 41

Acknowledgments

Thesis Committee

  • Prof. Vivek Sarkar,
  • Prof. John M. Mellor-Crummey,
  • Prof. Keith D. Cooper, and
  • Dr. Jun Shirako

• Co-author: Dr. Martin Kong
• Rice Habanero Extreme Scale Software Research Group
• Polyhedral research community
• Family and friends

SLIDE 42

Finally,

“Extending the polyhedral compilation model for explicitly parallel programs is a new direction to the multi-core programming challenge.”

Thank you!
