INSTRUCTION LEVEL PARALLELISM Mahdi Nazm Bojnordi Assistant - PowerPoint PPT Presentation

INSTRUCTION LEVEL PARALLELISM Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture

Overview ¨ Announcement ¤ HW1 solutions will be posted in Canvas n Recall that late submission = no submission n One of your lowest assignment scores will be dropped ¤ Homework 2 will be released tonight (due on Feb. 13 th ) ¨ This lecture ¤ Impacts of data dependence ¤ Pipeline performance ¤ Instruction level parallelism

Data Dependence ¨ Point of production ¤ The pipeline stage where an instruction produces a value that can be used by its following instructions ¨ Point of consumption ¤ The pipeline stage where an instruction consumes a produced data PoC PoP Ints. 1: producer Inst. 2: consumer

Problem ¨ Consider a 10-stage pipeline processor, where point of production and point of consumption are separated by 4 cycles. Assume that half the instructions do not introduce a data hazard and half the instructions depend on their preceding instruction. What is the maximum attainable IPC?

Problem ¨ Consider a 10-stage pipeline processor, where point of production and point of consumption are separated by 4 cycles. Assume that half the instructions do not introduce a data hazard and half the instructions depend on their preceding instruction. What is the maximum attainable IPC? Stall Cycles 2 … IPC = = 0.4 5 Instructions

Performance vs. Pipeline Depth ¨ Impact of stall cycles on performance ¤ Independent instructions ¤ Dependent instructions No Stalls 1 𝑚𝑏𝑢𝑑ℎ 𝑚𝑏𝑢𝑓𝑜𝑑𝑧 Performance Pipeline Depth (number of stages)

Performance vs. Pipeline Depth ¨ Impact of stall cycles on performance ¤ Independent instructions ¤ Dependent instructions No Stalls Fully Stalled 1 𝑚𝑏𝑢𝑑ℎ 𝑚𝑏𝑢𝑓𝑜𝑑𝑧 Performance Pipeline Depth (number of stages)

Performance vs. Pipeline Depth ¨ Impact of stall cycles on performance ¤ Independent instructions ¤ Dependent instructions No Stalls Fully Stalled Average 1 𝑚𝑏𝑢𝑑ℎ 𝑚𝑏𝑢𝑓𝑜𝑑𝑧 Performance Increase overlap among instructions in the pipeline (Instruction Level Parallelism) Pipeline Depth (number of stages)

Instruction Level Parallelism ¨ Potential overlap among instructions ¤ A property of the program dataflow Code 1 Code 2 ADD R1, R2, R3 ADD R1, R2, R3 SUB R4, R1, R5 SUB R4, R6, R5 XOR R6, R4, R7 XOR R8, R2, R7 AND R8, R6, R9 AND R9, R6, R0 ILP = 1 ILP = 4 Fully serial Fully parallel

Instruction Level Parallelism ¨ Potential overlap among instructions ¤ A property of the program dataflow ¤ Influenced by compiler X ß A + B + C + D Code 1: ADD R5, R1, R2 ADD R5, R5, R3 ADD R5, R5, R4

Instruction Level Parallelism ¨ Potential overlap among instructions ¤ A property of the program dataflow ¤ Influenced by compiler X ß A + B + C + D Code 1: Code 2: ADD R5, R1, R2 ADD R6, R1, R2 ADD R5, R5, R3 ADD R7, R3, R4 ADD R5, R5, R4 ADD R5, R6, R7 Average ILP = 3/3 = 1 Average ILP = 3/2 = 1.5 Five registers Seven registers

Instruction Level Parallelism ¨ Potential overlap among instructions ¤ A property of the program dataflow ¤ Influenced by compiler ¨ An upper limit for attainable IPC for a given code ¤ IPC represents exploited ILP ADD R5, R1, R2 ADD R6, R1, R2 ADD R5, R5, R3 ADD R7, R3, R4 ADD R5, R5, R4 ADD R5, R6, R7 Average ILP = 3/3 = 1 Average ILP = 3/2 = 1.5 Five registers Seven registers

Instruction Level Parallelism ¨ Potential overlap among instructions ¤ A property of the program dataflow ¤ Influenced by compiler ¨ An upper limit for attainable IPC for a given code ¤ IPC represents exploited ILP ¨ Can be exploited by HW-/SW-intensive techniques ¤ Dynamic scheduling in hardware ¤ Static scheduling in software (compiler)

INSTRUCTION LEVEL PARALLELISM Mahdi Nazm Bojnordi Assistant - PowerPoint PPT Presentation

INSTRUCTION LEVEL PARALLELISM Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement HW1 solutions will be posted in Canvas n Recall that late submission =

Hardware Parallelism vs. Software Parallelism USENIX Workshop on Hot Topics in Parallelism March

Instruction-Level Parallelism (ILP) Fine-grained parallelism Obtained by: instruction

CSCI341 Lecture 37, Introduction to Parallelism PIPELINING Exploits potential parallelism

MLP yes! Definitions ILP no ! MLP ILP = Instruction Level = Memory Level Parallelism Work

Chapter 17: Parallel Databases Introduction I/O Parallelism Interquery Parallelism

Data-Level Parallelism Nima Honarmand Fall 2015 :: CSE 610 Parallel Computer Architectures

Chapter 2 Chapter 2 Instruction-Level Parallelism and Its Exploitation p 1 Overview

Exploitation of instruction level parallelism Computer Architecture J. Daniel Garca Snchez

Chapter 2 Instruction-Level Parallelism and Its E Exploitation l it ti 1 Overview

Pervasive Parallelism Laboratory Stanford University ppl.stanford.edu Make parallelism

Advanced OpenMP Lecture 6: Nested parallelism Nested parallelism Nested parallelism is

Dataflow Computers Motivation: exploit instruction-level parallelism on a massive scale

Chapter 3: Instruction Level Parallelism (ILP) and its exploitation Pipeline CPI = Ideal

SIMD Single Instruction Multiple Data Parallelism through simultaneous operations on different

Unit 8: Superscalar Pipelines Then: Static & dynamic scheduling Extract much more

Parallel Models Different ways to exploit parallelism Outline Shared-Variables Parallelism

Decision Trees and Nave Bayes 3/29/17 Hypothesis Spaces Decision Trees and K-Nearest

Applications Involving the Sine Law MCR3U: Functions Example Two surveyors, Alice and Bob, need

Abstract Space distribution of ionisation produced by charged particle in gas is discussed. Energy

EMPOWERING VIRUS SEQUENCE RESEARCH THROUGH CONCEPTUAL MODELING ANNA BERNASCONI, ARIF CANAKOGLU,

Ia Iatr trogenic ogenic bile duct bile duct injur injury Eduard Jonas Surgical

More Polymorphism Tiziana Ligorio 1 Details There is a lot of detail one needs to pay

Inheritance II Is-a versus has-a When an object of

MATH 12002 - CALCULUS I 3.3: Information from the Graph of the Derivative Professor Donald L.

INSTRUCTION LEVEL PARALLELISM Mahdi Nazm Bojnordi Assistant - PowerPoint PPT Presentation

INSTRUCTION LEVEL PARALLELISM Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement HW1 solutions will be posted in Canvas n Recall that late submission =

Hardware Parallelism vs. Software Parallelism USENIX Workshop on Hot Topics in Parallelism March

Instruction-Level Parallelism (ILP) Fine-grained parallelism Obtained by: instruction

CSCI341 Lecture 37, Introduction to Parallelism PIPELINING Exploits potential parallelism

MLP yes! Definitions ILP no ! MLP ILP = Instruction Level = Memory Level Parallelism Work

Chapter 17: Parallel Databases Introduction I/O Parallelism Interquery Parallelism

Data-Level Parallelism Nima Honarmand Fall 2015 :: CSE 610 Parallel Computer Architectures

Chapter 2 Chapter 2 Instruction-Level Parallelism and Its Exploitation p 1 Overview

Exploitation of instruction level parallelism Computer Architecture J. Daniel Garca Snchez

Chapter 2 Instruction-Level Parallelism and Its E Exploitation l it ti 1 Overview

Pervasive Parallelism Laboratory Stanford University ppl.stanford.edu Make parallelism

Advanced OpenMP Lecture 6: Nested parallelism Nested parallelism Nested parallelism is

Dataflow Computers Motivation: exploit instruction-level parallelism on a massive scale

Chapter 3: Instruction Level Parallelism (ILP) and its exploitation Pipeline CPI = Ideal

SIMD Single Instruction Multiple Data Parallelism through simultaneous operations on different

Unit 8: Superscalar Pipelines Then: Static &amp; dynamic scheduling Extract much more

Parallel Models Different ways to exploit parallelism Outline Shared-Variables Parallelism

Decision Trees and Nave Bayes 3/29/17 Hypothesis Spaces Decision Trees and K-Nearest

Applications Involving the Sine Law MCR3U: Functions Example Two surveyors, Alice and Bob, need

Abstract Space distribution of ionisation produced by charged particle in gas is discussed. Energy

EMPOWERING VIRUS SEQUENCE RESEARCH THROUGH CONCEPTUAL MODELING ANNA BERNASCONI, ARIF CANAKOGLU,

Ia Iatr trogenic ogenic bile duct bile duct injur injury Eduard Jonas Surgical

More Polymorphism Tiziana Ligorio 1 Details There is a lot of detail one needs to pay

Inheritance II Is-a versus has-a When an object of

MATH 12002 - CALCULUS I 3.3: Information from the Graph of the Derivative Professor Donald L.

Unit 8: Superscalar Pipelines Then: Static & dynamic scheduling Extract much more