
SLIDE 1

Control Hazards

1

SLIDE 2

Today

  • Quiz 5
  • Mini project #1 solution
  • Mini project #2 assigned
  • Stalling recap
  • Branches!

2

SLIDE 3

Key Points: Control Hazards

  • Control hazards occur when we don’t know

which instruction to execute next

  • Mostly caused by branches
  • Strategies for dealing with them
  • Stall
  • Guess!
  • Leads to speculation
  • Flushing the pipeline
  • Strategies for making better guesses
  • Understand the difference between stall and flush

3

SLIDE 4

Ideal operation

4

[Pipeline diagram: seven instructions flowing through Fetch, Decode, EX, Mem, and Write back, one entering per cycle, with no stalls]

SLIDE 5

Stalling for Load

5

[Pipeline diagram: Load $s1, 0($s1) followed by Addi $t1, $s1, 4; the addi and everything behind it are held for one cycle while the load finishes]

To “stall” we insert a noop in place of the instruction and freeze the earlier stages of the pipeline.

All stages of the pipeline earlier than the stall stand still. Only four stages are occupied. What’s in Mem?

SLIDE 6

Inserting Noops

6

To “stall” we insert a noop in place of the instruction and freeze the earlier stages of the pipeline

[Pipeline diagram: Load $s1, 0($s1) followed by Addi $t1, $s1, 4, with a noop occupying the stage between them]

Noop inserted. The noop is in Mem.

SLIDE 7

Control Hazards

  • Computing the new PC

add $s1, $s3, $s2
sub $s6, $s5, $s2
beq $s6, $s7, somewhere
and $s2, $s3, $s1

[Pipeline diagram: Fetch, Decode, EX, Mem, Write back]

7

SLIDE 8

Computing the PC

  • Non-branch instruction
  • PC = PC + 4
  • When is PC ready?

8

[Pipeline diagram: Fetch, Decode, EX, Mem, Write back]

No Hazard.

SLIDE 9

Computing the PC

  • Branch instructions
  • bne $s1, $s2, offset
  • if ($s1 != $s2) { PC = PC + offset} else {PC = PC + 4;}
  • When is the value ready?

9

[Pipeline diagram: Fetch, Decode, EX, Mem, Write back]
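The rule above can be sketched directly (hypothetical Python; like the slide, it simplifies away MIPS's word-scaled offsets and the delay slot):

```python
# Sketch of the slide's bne rule: branch if the two register values differ.
def next_pc(pc, rs_val, rt_val, offset):
    if rs_val != rt_val:       # branch taken
        return pc + offset
    return pc + 4              # fall through to the next instruction

print(next_pc(100, 7, 7, 40))  # 104: values equal, not taken
print(next_pc(100, 7, 8, 40))  # 140: values differ, taken
```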

SLIDE 10

Solution #1: Stall on branches

  • Worked for loads and ALU ops.
  • But wait! When do we know whether the instruction is a branch? We would have to stall on every instruction.

[Pipeline diagram: Fetch, Decode, EX, Mem, Write back]

10

SLIDE 11

There is a constant control hazard

  • We don’t even know what kind of instruction we have until decode.
  • What do we do?

11

SLIDE 12

Smart ISA design

  • Make it very easy to tell if the instruction is a branch -- maybe a single bit or just a couple.
  • Decoding these bits is nearly trivial.
  • In MIPS the branches and jumps are opcodes 0-7, so if the high-order bits are zero, it’s a control instruction.

12

[Pipeline diagram: add $s0, $t0, $t1 and sub $t2, $s0, $t3 flowing through Fetch, Decode, EX, Mem, Write back]

SLIDE 13

Dealing with Branches: Option 1 -- stall

  • What does this do to our CPI?
  • Speedup?

13

[Pipeline diagram: bne $t2, $s0, somewhere followed by sll $s4, $t6, $t5; and $s4, $t0, $t1; add $s0, $t0, $t1. Fetch repeats while the branch resolves. Stall: no instructions in Decode or Execute]

SLIDE 14

Performance Impact of Stalling

  • ET = I * CPI * CT
  • Branches are about 1 in 5 instructions

  • What’s the CPI for branches? 1 + 2 = 3
  • Amdahl’s Law: Speedup = 1/(.2/(1/3) + .8) = 0.714
  • ET = 1 * (.2*3 + .8*1) * 1 = 1.4

14
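As a check on the arithmetic, here is a small Python sketch (the variable names are mine, not the slides'):

```python
# CPI/ET arithmetic from the slide: branches are 1 in 5 instructions,
# and each branch stalls for 2 extra cycles (CPI of 3 for branches).
branch_frac = 0.2
branch_cpi = 1 + 2                        # 1 base cycle + 2 stall cycles
cpi = branch_frac * branch_cpi + (1 - branch_frac) * 1
et = 1 * cpi * 1                          # ET = I * CPI * CT, with I = CT = 1
speedup = 1 / cpi                         # vs. the ideal CPI-of-1 pipeline
print(round(cpi, 2), round(et, 2), round(speedup, 3))  # 1.4 1.4 0.714
```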

SLIDE 15

Option 2: The compiler

  • Use “branch delay” slots.
  • The next N instructions after a branch are always executed

  • Much like load-delay slots
  • Good
  • Simple hardware
  • Bad
  • N cannot change.
  • MIPS has one branch delay slot
  • It’s a big pain!

15

SLIDE 16

Delay slots.

16

[Pipeline diagram: bne $t2, $s0, somewhere with add $s0, $t0, $t1 in the branch delay slot; the branch is taken, so fetch continues at somewhere: sub $t2, $s0, $t3]

SLIDE 17

Option 3: Simple Prediction

  • Can a processor tell the future?
  • For non-taken branches, the new PC is ready immediately.
  • Let’s just assume the branch is not taken
  • Also called “branch prediction” or “control speculation”
  • What if we are wrong?

17

SLIDE 18

Predict Not-taken

  • We start the ‘add’ and the ‘and’, and then, when we discover the branch outcome, we “squash” them.
  • This means we turn them into noops.

18

[Pipeline diagram: bne $t2, $s4, else followed by add $s0, $t0, $t1 and and $s4, $t0, $t1; when the branch resolves taken, the two instructions after it are squashed (become noops) and fetch continues at else: sub $t2, $s0, $t3]

SLIDE 19

Simple “static” Prediction

  • “static” means before run time
  • Many prediction schemes are possible
  • Predict taken
  • Pros?
  • Predict not-taken
  • Pros?

19

  • Predict taken: loops are common.
  • Predict not-taken: not all branches are for loops.
  • Backward taken/forward not taken: best of both worlds.

SLIDE 20

Implementing Backward taken/forward not taken

[Datapath figure: illegible in this export]

SLIDE 21

Implementing Backward taken/forward not taken

[Datapath figure: the pipelined MIPS datapath (IFetch/Dec, Dec/Exec, Exec/Mem, Mem/WB registers) with instruction memory, register file, ALU, and data memory. An added adder and shift-left-2 compute the branch target; the offset’s sign bit and the branch comparison result control inserting a bubble to flush]

SLIDE 22

Implementing Backward taken/forward not taken

  • Changes in control
  • New inputs to the control unit
  • The sign of the offset
  • The result of the branch
  • New outputs from control
  • The flush signal.
  • Inserts “noop” bits in datapath and control

22

SLIDE 23

Performance Impact

  • ET = I * CPI * CT
  • Back taken, forward not taken is 80% accurate
  • Branches are 20% of instructions
  • Changing the pipeline increases the cycle time by 10%
  • What is the speedup of Bt/Fnt compared to just stalling on every branch?

23

SLIDE 24

Performance Impact

  • ET = I * CPI * CT
  • Back taken, forward not taken is 80% accurate
  • Branches are 20% of instructions
  • Changing the front end increases the cycle time by 10%
  • What is the speedup of Bt/Fnt compared to just stalling on every branch?
  • Btfnt
  • CPI = 0.2*0.2*(1 + 2) + (1 - .2*.2)*1 = 1.08
  • CT = 1.1
  • ET = 1.188
  • Stall
  • CPI = .2*3 + .8*1 = 1.4
  • CT = 1
  • ET = 1.4
  • Speedup = 1.4/1.188 = 1.18

24
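The slide's numbers can be reproduced with a short sketch (hypothetical Python; the names are mine):

```python
# Speedup of backward-taken/forward-not-taken (BTFNT) over stalling on
# every branch: 20% of instructions are branches, BTFNT mispredicts 20%
# of them, a miss (or a stall) costs 2 extra cycles, and BTFNT's extra
# hardware stretches the cycle time by 10%.
branch_frac, miss_rate, penalty = 0.2, 0.2, 2

cpi_btfnt = branch_frac * miss_rate * (1 + penalty) \
            + (1 - branch_frac * miss_rate) * 1
et_btfnt = cpi_btfnt * 1.1                 # CT = 1.1

cpi_stall = branch_frac * (1 + penalty) + (1 - branch_frac) * 1
et_stall = cpi_stall * 1.0                 # CT = 1

print(round(et_btfnt, 3), round(et_stall, 3),
      round(et_stall / et_btfnt, 2))       # 1.188 1.4 1.18
```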

SLIDE 25

The Importance of Pipeline depth

  • There are two important parameters of the pipeline that determine the impact of branches on performance
  • Branch decode time -- how many cycles does it take to identify a branch (in our case, this is less than 1)
  • Branch resolution time -- cycles until the real branch outcome is known (in our case, this is 2 cycles)

25

SLIDE 26

Pentium 4 pipeline (Willamette)

1. Branches take 19 cycles to resolve.
2. Identifying a branch takes 4 cycles.
   1. The P4 fetches 3 instructions per cycle.
3. Stalling is not an option.

  • Pentium 4 pipelines peaked at 31 stages!
  • Current CPUs have about 12-14 stages.
SLIDE 27

Performance Impact

  • ET = I * CPI * CT
  • Back taken, forward not taken is 80% accurate
  • Branches are 20% of instructions
  • Changing the front end increases the cycle time by 10%
  • What is the speedup of Bt/Fnt compared to just stalling on every branch?
  • Btfnt
  • CPI = .2*.2*(1 + 2) + (1 - .2*.2)*1 = 1.08
  • CT = 1.1
  • ET = 1.188
  • Stall
  • CPI = .2*3 + .8*1 = 1.4
  • CT = 1
  • ET = 1.4
  • Speedup = 1.4/1.188 = 1.18

27

What if the 2-cycle branch penalty were 20?

SLIDE 28

Performance Impact

  • ET = I * CPI * CT
  • Back taken, forward not taken is 80% accurate
  • Branches are 20% of instructions
  • Changing the front end increases the cycle time by 10%
  • What is the speedup of Bt/Fnt compared to just stalling on every branch?
  • Btfnt
  • CPI = .2*.2*(1 + 20) + (1 - .2*.2)*1 = 1.8
  • CT = 1.1
  • ET = 1.98
  • Stall
  • CPI = .2*(1 + 20) + .8*1 = 5
  • CT = 1
  • ET = 5
  • Speedup = 5/1.98 = 2.53

28

SLIDE 29

Dynamic Branch Prediction

  • Long pipes demand higher accuracy than static schemes can deliver.
  • Instead of making the guess once, make it every time we see the branch.
  • Predict future behavior based on past behavior.

29

SLIDE 30

Predictable control

  • Use previous branch behavior to predict future branch behavior.
  • When is branch behavior predictable?

30

SLIDE 31

Predictable control

  • Use previous branch behavior to predict future branch behavior.
  • When is branch behavior predictable?
  • Loops -- for(i = 0; i < 10; i++) {} gives 9 taken branches and 1 not-taken branch. All 10 are pretty predictable.
  • Run-time constants
  • Foo(int v) { for (i = 0; i < 1000; i++) { if (v) {...} } }
  • The branch is always taken or always not taken.
  • Correlated control
  • a = 10; b = <something usually larger than a>
  • if (a > 10) {}
  • if (b > 10) {}
  • Function calls
  • LibraryFunction() -- converts to a jr (jump register) instruction, but it’s always the same target.
  • BaseClass * t; // t is usually of subclass SubClass
  • t->SomeVirtualFunction() // will usually call the same function
31

SLIDE 32

Dynamic Predictor 1: The Simplest Thing

  • Predict that this branch will go the same way as the previous one.
  • Pros? Dead simple. Keep a bit in the fetch stage. Works OK for simple loops. The compiler might be able to arrange things to make it work better.
  • Cons? An unpredictable branch in a loop will mess everything up. It can’t tell the difference between branches.

32
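A quick simulation (hypothetical Python, not from the slides) shows the main con: on a branch stream that simply alternates, "same as the previous branch" mispredicts almost every time.

```python
# Sketch: "predict the same direction as the previous branch",
# tried on an alternating outcome stream (worst case for this scheme).
outcomes = [True, False] * 20   # taken, not taken, taken, ...
last = True                     # the single bit kept in the fetch stage
hits = 0
for actual in outcomes:
    if actual == last:          # the prediction is just the previous outcome
        hits += 1
    last = actual
print(hits / len(outcomes))     # 0.025 -- only the very first guess is right
```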

SLIDE 33

Dynamic Prediction 2: A table of bits

  • Give each branch its own bit in a table
  • How big does the table need to be? Infinite! Bigger is better, but don’t mess with the cycle time. Index into it using the low-order bits of the PC.
  • Look up the prediction bit for the branch
  • Pros: it can differentiate between branches. Bad behavior by one won’t mess up others... mostly (why not always?)
  • Cons: accuracy is still not great.

33

SLIDE 34

Dynamic Prediction 2: A table of bits

34

  • What’s the accuracy for the inner loop’s branch?

for(i = 0; i < 10; i++) { for(j = 0; j < 4; j++) { } }

iteration  actual     prediction  new prediction
1          taken      not taken   taken
2          taken      taken       taken
3          taken      taken       taken
4          not taken  taken       not taken
1          taken      not taken   taken
2          taken      taken       taken
3          taken      taken       taken

50% accuracy -- 2 mispredictions per loop
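The table above can be reproduced with a short simulation (hypothetical Python; the predictor bit starts at "not taken"):

```python
# Sketch: a 1-bit predictor for the inner-loop branch, whose outcome
# pattern is taken, taken, taken, not taken, repeating.
outcomes = ([True] * 3 + [False]) * 10
bit = False                  # prediction bit, initially "not taken"
hits = 0
for actual in outcomes:
    if bit == actual:
        hits += 1
    bit = actual             # the bit just records the last outcome
print(hits / len(outcomes))  # 0.5 -- two mispredictions per loop
```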

SLIDE 35

Dynamic prediction 3: A table of counters

  • Instead of a single bit, keep two. This gives four possible states.
  • Taken branches move the state to the right. Not-taken branches move it to the left.
  • The net effect is that we wait a bit before changing our mind.

35

State                     Prediction
00 -- strongly not taken  not taken
01 -- weakly not taken    not taken
10 -- weakly taken        taken
11 -- strongly taken      taken

SLIDE 36

Dynamic Prediction 3: A table of counters

36

  • What’s the accuracy for the inner loop’s branch? (start in weakly taken)

for(i = 0; i < 10; i++) { for(j = 0; j < 4; j++) { } }

iteration  actual     state           prediction  new state
1          taken      weakly taken    taken       strongly taken
2          taken      strongly taken  taken       strongly taken
3          taken      strongly taken  taken       strongly taken
4          not taken  strongly taken  taken       weakly taken
1          taken      weakly taken    taken       strongly taken
2          taken      strongly taken  taken       strongly taken
3          taken      strongly taken  taken       strongly taken

25% misprediction -- 1 per loop
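The same experiment with a 2-bit saturating counter can be sketched as follows (hypothetical Python; the state encoding follows the slide, starting in weakly taken):

```python
# Sketch: a 2-bit saturating counter (states 0..3; state >= 2 predicts
# taken) on the same inner-loop pattern, starting in "weakly taken" (2).
outcomes = ([True] * 3 + [False]) * 10
state = 2
misses = 0
for actual in outcomes:
    predicted_taken = state >= 2
    if predicted_taken != actual:
        misses += 1
    # taken moves toward 3 (strongly taken); not taken moves toward 0
    state = min(state + 1, 3) if actual else max(state - 1, 0)
print(misses / len(outcomes))  # 0.25 -- one misprediction per loop
```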

SLIDE 37

Two-bit Prediction

  • The two-bit prediction scheme is used very widely and in many ways.
  • Make a table of 2-bit predictors.
  • Devise a way to associate a 2-bit predictor with each dynamic branch.
  • Use the 2-bit predictor for each branch to make the prediction.
  • In the previous example we associated the predictors with branches using the PC. We’ll call this “per-PC” prediction.

37

SLIDE 38

Associating Predictors with Branches: Using the low-order PC bits

  • When is branch behavior predictable?
  • Loops -- for(i = 0; i < 10; i++) {} gives 9 taken branches and 1 not-taken branch. All 10 are pretty predictable.
  • Run-time constants
  • Foo(int v) { for (i = 0; i < 1000; i++) { if (v) {...} } }
  • The branch is always taken or always not taken.
  • Correlated control
  • a = 10; b = <something usually larger than a>
  • if (a > 10) {}
  • if (b > 10) {}
  • Function calls
  • LibraryFunction() -- converts to a jr (jump register) instruction, but it’s always the same target.
  • BaseClass * t; // t is usually of subclass SubClass
  • t->SomeVirtualFunction() // will usually call the same function

38

Per-PC prediction: loops -- OK (we miss one per loop); run-time constants -- good; correlated control -- poor (no help); function calls -- not applicable.

SLIDE 39

Predicting Loop Branches Revisited

39

  • What’s the pattern we need to identify?

for(i = 0; i < 10; i++) { for(j = 0; j < 4; j++) { } }

iteration  actual
1          taken
2          taken
3          taken
4          not taken
1          taken
2          taken
3          taken
4          not taken
1          taken
2          taken
3          taken
4          not taken

SLIDE 40

Dynamic prediction 4: Global branch history

  • Instead of using the PC to choose the predictor, use a bit vector made up of the previous branch outcomes.

40

iteration          actual     branch history  steady-state prediction
1                  taken      11111
2                  taken      11111
3                  taken      11111
4                  not taken  11111
outer loop branch  taken      11110           taken
1                  taken      11101           taken
2                  taken      11011           taken
3                  taken      10111           taken
4                  not taken  01111           not taken
outer loop branch  taken      11110           taken
1                  taken      11101           taken
2                  taken      11011           taken
3                  taken      10111           taken
4                  not taken  01111           not taken
outer loop branch  taken      11110           taken
1                  taken      11101           taken
2                  taken      11011           taken
3                  taken      10111           taken
4                  not taken  01111           not taken

Nearly perfect
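A minimal global-history predictor can be simulated directly (hypothetical Python; the table maps each 5-outcome history pattern to the outcome last seen after it, defaulting to taken):

```python
# Sketch: choose the prediction from the last five branch outcomes
# (global history) instead of the PC. Each history pattern remembers
# the outcome that followed it last time.
history = (1, 1, 1, 1, 1)
table = {}                        # history pattern -> remembered outcome
outcomes = [1, 1, 1, 0] * 40      # the inner-loop branch stream
misses = 0
for actual in outcomes:
    pred = table.get(history, 1)  # unseen pattern: default to taken
    if pred != actual:
        misses += 1
    table[history] = actual       # learn the outcome for this pattern
    history = history[1:] + (actual,)
print(misses, len(outcomes))      # 2 160 -- only warm-up mispredictions
```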

SLIDE 41

Dynamic prediction 4: Global branch history

  • How long should the history be? Infinite is a bad choice -- we would learn nothing.
  • Imagine N bits of history and a loop that executes K iterations
  • If K <= N, history will do well.
  • If K > N, history will do poorly, since the history register will always be all 1’s for the last K-N iterations. We will mis-predict the last branch.

41
SLIDE 42

Associating Predictors with Branches: Global history

  • When is branch behavior predictable?
  • Loops -- for(i = 0; i < 10; i++) {} gives 9 taken branches and 1 not-taken branch. All 10 are pretty predictable.
  • Run-time constants
  • Foo(int v) { for (i = 0; i < 1000; i++) { if (v) {...} } }
  • The branch is always taken or always not taken.
  • Correlated control
  • a = 10; b = <something usually larger than a>
  • if (a > 10) {}
  • if (b > 10) {}
  • Function calls
  • LibraryFunction() -- converts to a jr (jump register) instruction, but it’s always the same target.
  • BaseClass * t; // t is usually of subclass SubClass
  • t->SomeVirtualFunction() // will usually call the same function

42

Global history: loops -- not so great; run-time constants -- good; correlated control -- pretty good, as long as the history is not too long; function calls -- not applicable.