Chapter 6: Designing a Pipelined CPU What are our resources? 1 - PowerPoint PPT Presentation

Chapter 6: Designing a Pipelined CPU • What are our resources? 1 washer, 1 dryer, 1 folder (you), 1 “put awayer” (roommate) What % of the time are they idle? 1

2 Chapter 6: Designing a Pipelined CPU

Chapter 6: Designing a Pipelined CPU What % of the time are resources idle? - steady-state - ramp up - ramp down 3

Chapter 6: Designing a Pipelined CPU What is our roommate takes off? What happens to the pipeline? 4

Chapter 6: Designing a Pipelined CPU Massive Laundry Pile What if our roommate is gone? What happens to the pipeline? 5

Chapter 6: Designing a Pipelined CPU Massive Laundry Pile What if our roommate is gone? What happens to the pipeline? 6

Chapter 6: Designing a Pipelined CPU No Laundry Pile Scheduling work later reduces “laundry pile” 7

Chapter 6: Designing a Pipelined CPU Scheduling work later reduces “laundry pile” 8

Execution in a Pipelined Datapath TIME CC1 CC2 CC3 CC4 CC5 CC6 CC7 CC8 CC9 IF ID EX MEM WB ALU lw IM Reg DM Reg IF ID EX MEM WB ALU lw IM Reg DM Reg IF ID EX MEM WB ALU lw IM Reg DM Reg IF ID EX MEM WB lw ALU IM Reg DM Reg IF ID EX MEM WB ALU lw IM Reg DM Reg steady state 9

Instruction Latencies and Throughput Single-Cycle CPU Cycle 2 Cycle 1 Latency: Load IF Dec EX Mem WB Throughput: Load IF Dec EX Mem WB Multiple Cycle CPU Cycle 6 Cycle 7 Cycle 8 Cycle 9Cycle 10 Cycle 1 Cycle 2 Cycle 3 Cycle 4 Cycle 5 Latency: Load IF Dec EX Mem WB Throughput: Load IF Dec EX Mem WB Pipelined CPU Cycle 1 Cycle 2 Cycle 3 Cycle 4 Cycle 5 Cycle 6 Cycle 7 Cycle 8 Latency: Load IF Dec EX Mem WB Throughput: Load IF Dec EX Mem WB Load IF Dec EX Mem WB Load IF Dec EX Mem WB 10

Self Check! • If my single cycle CPU has a cycle time of 14ns and my multicycle CPU has a cycle time of 3ns and my pipelined CPU has a cycle time of 3ns, what is the relative performance of my machines? - What kind of answer would you provide? - What kind of information do you need to know? ET = IC * CPI * CT What differs across machines? CT and CPI Single: CT = 14ns, CPI = 1 Multi: CT = 3ns CPI=??? NEED DYN INST LOAD INFO PIPELINED: CT = 3ns CPI = ?? WHAT IS IT? ALWAYS 5? 11

Pipelining Advantages • Higher maximum throughput • Higher utilization of CPU resources • But, more hardware needed, perhaps complex control before, a simple FSM could guide execution of one instruction at a time but, now if we implemented the FSM, it would need to control 5 instructions simultaneously! 12

Mixed Instructions in the Pipeline Cycle # CC1 CC2 CC3 CC4 CC5 CC6 WB IF Dec EX Mem lw What’s add WB IF Dec EX wrong with this? 13

To avoid structural hazard, schedule resource usage homogeneously Cycle # CC1 CC2 CC3 CC4 CC5 CC6 WB IF Dec EX Mem lw add Mem WB IF Dec EX 14

Pipeline Principles • All instructions that share a pipeline must have the same stages in the same order. - therefore, add does nothing during Mem stage - sw does nothing during WB stage • All intermediate values must be latched each cycle. • There is no functional block reuse - example: we need 2 adders and ALU (like in single- cycle) IF ID EX MEM WB ALU IM Reg DM Reg 15

Pipelined Datapath Instruction Fetch Instruction Decode/ Execute/ Memory Access Write Back Register Fetch Address Calculation 0 M u x 1 IF/ID ID/EX EX/MEM MEM/WB Add Add 4 Add result Shift left 2 Read register 1 Address PC Read data 1 Read register 2 Zero Instruction Registers ALU Read ALU memory 0 Read Write Address data 2 1 result data register M M u Data u Write x memory x data 1 0 Write data 16 32 Sign extend 16 Is this more similar to multicycle or single cycle datapath?

Chapter 6: Designing a Pipelined CPU What are our resources? 1 - PowerPoint PPT Presentation

Chapter 6: Designing a Pipelined CPU What are our resources? 1 washer, 1 dryer, 1 folder (you), 1 put awayer (roommate) What % of the time are they idle? 1 2 Chapter 6: Designing a Pipelined CPU Chapter 6: Designing a Pipelined CPU

TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES

Networks Computer-Computer Comm CPU CPU CPU CPU Memory Device Device Memory Memory

DLX Pipeline 2-stage fully pipelined Adder 4-stage fully pipelined Multiplier 5-cycle

Review: FP Pipeline Model 4-stage fully pipelined adder, Non-pipelined multiplier and divider A1

Router Architectures CPU CPU Memory Memory packets NFE NFE Processor Processor Line Card

CPU Scheduling CPU Scheduling CPU Scheduling 101 CPU Scheduling 101 The CPU scheduler makes a

CPU Scheduling CPU Scheduling CPU Scheduling 101 CPU Scheduling 101 The CPU scheduler makes a

CPU scheduling CPU 1 P k P 3 P 2 P 1 . . . CPU 2 . . . CPU n The scheduling problem: - Have

CPU Scheduling Heechul Yun 1 Agenda Introduction to CPU scheduling Classical CPU

Designing for Designing for Greenspace Greenspace Greenspace Designing for Designing for

Class 14 Slides SLIDE what is the designing principle how does designing principle

CPU Scheduling Eric McCreath Introduction CPU scheduling is at the heart of a multiprogrammed

Lecture 16: Basic CPU Design Todays topics: Single-cycle CPU Multi-cycle CPU

CPU Scheduling Mehdi Kargahi School of ECE University of Tehran Spring 2008 CPU and I/O Bursts

CPU Scheduling Heechul Yun 1 Administrative Midterm Mar. 15, 2016 Closed book,

CPSC 410/611: Week 4 Threads CPU Scheduling Synchronization (Part I) CPU

Biorthogonal Filter Pairs und Wavelets WTBV January 20, 2016 WTBV Biorthogonal Filter Pairs und

A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis M.

FOR C ONSERVATION L AWS : N UMERICAL ANALYSIS Margarete O. Domingues 1 , S onia M. Gomes 2 ,

h"p://icv.ims.ut.ee shb@ut.ee Conventional

BUILDING INCLUSIVE ECONOMIES Advancing the research agenda Kay McGowan Global Development Lab

NLO QCD corrections to Wb b/Zb b production at hadron colliders Laura Reina RADCOR 07,

Computer Systems Lecture 15 Pipelining and Hazards CS 230 - Spring 2020 3-1 Pipelining CS

On Fair Selection in the Presence of Implicit Variance Emelianov, Gast, Gummadi, Loiseau (EC 2020)

Chapter 6: Designing a Pipelined CPU What are our resources? 1 - PowerPoint PPT Presentation

Chapter 6: Designing a Pipelined CPU What are our resources? 1 washer, 1 dryer, 1 folder (you), 1 put awayer (roommate) What % of the time are they idle? 1 2 Chapter 6: Designing a Pipelined CPU Chapter 6: Designing a Pipelined CPU

TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES TXN/SEC CPU CORES

Networks Computer-Computer Comm CPU CPU CPU CPU Memory Device Device Memory Memory

DLX Pipeline 2-stage fully pipelined Adder 4-stage fully pipelined Multiplier 5-cycle

Review: FP Pipeline Model 4-stage fully pipelined adder, Non-pipelined multiplier and divider A1

Router Architectures CPU CPU Memory Memory packets NFE NFE Processor Processor Line Card

CPU Scheduling CPU Scheduling CPU Scheduling 101 CPU Scheduling 101 The CPU scheduler makes a

CPU Scheduling CPU Scheduling CPU Scheduling 101 CPU Scheduling 101 The CPU scheduler makes a

CPU scheduling CPU 1 P k P 3 P 2 P 1 . . . CPU 2 . . . CPU n The scheduling problem: - Have

CPU Scheduling Heechul Yun 1 Agenda Introduction to CPU scheduling Classical CPU

Designing for Designing for Greenspace Greenspace Greenspace Designing for Designing for

Class 14 Slides SLIDE what is the designing principle how does designing principle

CPU Scheduling Eric McCreath Introduction CPU scheduling is at the heart of a multiprogrammed

Lecture 16: Basic CPU Design Todays topics: Single-cycle CPU Multi-cycle CPU

CPU Scheduling Mehdi Kargahi School of ECE University of Tehran Spring 2008 CPU and I/O Bursts

CPU Scheduling Heechul Yun 1 Administrative Midterm Mar. 15, 2016 Closed book,

CPSC 410/611: Week 4 Threads CPU Scheduling Synchronization (Part I) CPU

Biorthogonal Filter Pairs und Wavelets WTBV January 20, 2016 WTBV Biorthogonal Filter Pairs und

A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis M.

FOR C ONSERVATION L AWS : N UMERICAL ANALYSIS Margarete O. Domingues 1 , S onia M. Gomes 2 ,

h&quot;p://icv.ims.ut.ee shb@ut.ee Conventional

BUILDING INCLUSIVE ECONOMIES Advancing the research agenda Kay McGowan Global Development Lab

NLO QCD corrections to Wb b/Zb b production at hadron colliders Laura Reina RADCOR 07,

Computer Systems Lecture 15 Pipelining and Hazards CS 230 - Spring 2020 3-1 Pipelining CS

On Fair Selection in the Presence of Implicit Variance Emelianov, Gast, Gummadi, Loiseau (EC 2020)

h"p://icv.ims.ut.ee shb@ut.ee Conventional