CS 3330: Pipelining 6 October 2016 1 Human pipeline: laundry - PowerPoint PPT Presentation

CS 3330: Pipelining 6 October 2016 1

Human pipeline: laundry whites sheets sheets sheets colors colors colors whites whites whites colors colors colors whites whites 14:00 Washer 13:00 12:00 11:00 Table Folding Dryer Washer 14:00 13:00 12:00 11:00 Table Folding Dryer 2

Waste (1) whites wasted time! wasted time! sheets sheets sheets colors colors colors whites Washer whites 14:00 13:00 12:00 11:00 Table Folding Dryer 3

Waste (2) whites sheets sheets sheets colors colors colors whites whites Washer 14:00 13:00 12:00 11:00 Table Folding Dryer 4

Latency — Time for One colors normal latency (1.8 h) colors colors colors pipelined latency (2.1 h) sheets sheets sheets colors colors Washer whites whites whites 14:00 13:00 12:00 11:00 Table Folding Dryer 5

Throughput — Rate of Many colors time between starts (0.83 h) loads/h h load time between fjnishes (0.83 h) sheets sheets sheets colors colors Washer whites whites whites 14:00 13:00 12:00 11:00 Table Folding Dryer 6

Throughput — Rate of Many Washer time between starts (0.83 h) time between fjnishes (0.83 h) sheets sheets sheets colors colors colors whites whites whites 14:00 13:00 12:00 11:00 Table Folding Dryer 6 1 load 0 . 83 h = 1 . 2 loads/h

times three circuit 7 10 results/ns throughput 100 ps latency 100 ps 50 ps 0 ps 21 14 add add ADD ADD ADD ADD 7 A 2 × A 3 × A

times three circuit 7 10 results/ns throughput 100 ps latency 100 ps 50 ps 0 ps 21 14 7 ADD ADD ADD ADD A 2 × A 3 × A A add A + A 2 × A add 2 A + A 3 × A

times three circuit 7 100 ps 50 ps 0 ps 21 14 7 ADD ADD ADD ADD A 2 × A 3 × A 100 ps latency = ⇒ 10 results/ns throughput A add A + A 2 × A add 2 A + A 3 × A

times three and repeat 2 21 17 34 51 4 8 12 1 3 7 23 46 69 0 ps 100 ps 200 ps 300 ps 400 ps 500 ps 14 add 8 2 7 14 17 34 4 8 add 1 23 46 0 ps 100 ps 200 ps 300 ps 400 ps 500 ps A add A + A 2 × A add 2 A + A 3 × A

times three and repeat 2 21 17 34 51 4 8 12 1 3 7 23 46 69 0 ps 100 ps 200 ps 300 ps 400 ps 500 ps 14 8 2 23 7 14 17 34 4 8 1 46 0 ps 100 ps 200 ps 300 ps 400 ps 500 ps A add A + A 2 × A add 2 A + A 3 × A A add A + A 2 × A add 2 A + A 3 × A

pipelined times three ( 51 34 17 21 14 7 7 ) ( ) ) ( ) ( ADD ADD ADD ADD 9 A ( t + 2 ) 2 × A ( t + 1 ) 3 × A ( t + 0 ) A ( t + 1 )

pipelined times three 7 51 34 17 21 14 7 9 ADD ADD ADD ADD A ( t + 2 ) 2 × A ( t + 1 ) 3 × A ( t + 0 ) A ( t + 1 ) A ( t + 2 ) A ( t + 1 ) 2 × A ( t + 1 ) 3 × A ( t + 0 )

register tolerances register output register input output changes input must not change register delay 10

times three pipeline timing throughput: G operations/sec ps 11 ADD ADD ADD ADD A ( t + 2 ) 2 × A ( t + 1 ) 3 × A ( t + 0 ) A ( t + 1 ) 10 ps 50 ps 10 ps 50 ps 10 ps

times three pipeline timing ADD throughput: 11 ADD ADD ADD A ( t + 2 ) 2 × A ( t + 1 ) 3 × A ( t + 0 ) A ( t + 1 ) 10 ps 50 ps 10 ps 50 ps 10 ps 1 60 ps ≈ 16 G operations/sec

deeper pipeline ps Problem: Can we even do this? Problem: How much faster can we get? partial results partial results G ops/sec ps throughput: ps ps ps ps ps ps ps ps ADD ADD ADD ADD 12 A ( t + 2 ) 2 × A 2 × A ( t + 1 ) 3 × A 3 × A ( t + 0 ) A ( t + 1 ) A

deeper pipeline throughput: Problem: Can we even do this? Problem: How much faster can we get? partial results partial results G ops/sec ps 12 ADD ADD ADD ADD A ( t + 2 ) 2 × A 2 × A ( t + 1 ) 3 × A 3 × A ( t + 0 ) A ( t + 1 ) A 10 ps 25 ps 10 ps 25 ps 10 ps 25 ps 10 ps 25 ps 10 ps

deeper pipeline throughput: Problem: Can we even do this? Problem: How much faster can we get? partial results partial results 12 ADD ADD ADD ADD A ( t + 2 ) 2 × A 2 × A ( t + 1 ) 3 × A 3 × A ( t + 0 ) A ( t + 1 ) A 10 ps 25 ps 10 ps 25 ps 10 ps 25 ps 10 ps 25 ps 10 ps 1 35 ps ≈ 28 G ops/sec

diminishing returns: register delays . 10 ps . . . . . . . logic (3/3) . . . . 1 ps 11 ps per cycle … 33 ps 10 ps logic (all) 33 ps 100 ps 110 ps per cycle 10 ps logic (1/2) 50 ps 60 ps per cycle 10 ps logic (2/2) 50 ps 10 ps logic (1/3) 33 ps 43 ps per cycle 10 ps logic (2/3) 13 10 ps 1 ps 10 ps 1 ps 10 ps 1 ps 10 ps

diminishing returns: register delays number of stages time per completion (ps) 14 120 100 80 60 40 20 0 2 4 6 8 10 12 14

diminishing returns: register delays register delay time per completion (ps) number of stages 14 120 100 80 60 40 20 0 2 4 6 8 10 12 14

diminishing returns: register delays register delay time per completion (ps) number of stages 1.02x speedup 1.83x speedup 14 120 100 80 60 40 20 0 2 4 6 8 10 12 14

diminishing returns: register delays 1.83x throughput throughput (ops/ns) number of stages 1.02x throughput 15 100 80 60 40 20 0 2 4 6 8 10 12 14

diminishing returns: register delays 1.83x throughput throughput (ops/ns) number of stages max. rate of register updates 1.02x throughput 15 100 80 60 40 20 0 2 4 6 8 10 12 14

diminishing returns: uneven split . 10 ps logic (3/3) 30 ps 10 ps . . . . logic (2/3) . . . . . . . 35 ps 10 ps Can we split up some logic (e.g. adder) arbitrarily? 60 ps Probably not... logic (all) 100 ps 110 ps per cycle 10 ps logic (1/2) 70 ps per cycle per cycle 10 ps logic (2/2) 45 ps 10 ps logic (1/3) 40 ps 50 ps 17

addq processor split signal skips two stages writeback execute decode PC update fetch and add 2 ADD ADD 0xF R[srcB] PC R[srcA] next R[dstE] next R[dstM] dstE dstM srcB srcA register fjle Mem. Instr. 18

pipelined addq processor R[srcB] fetch/fetch execute/writeback decode/execute fetch/decode add 2 ADD ADD 0xF split R[srcA] PC next R[dstE] next R[dstM] dstE dstM srcB srcA register fjle Mem. Instr. 19

CS 3330: Pipelining 6 October 2016 1 Human pipeline: laundry - PowerPoint PPT Presentation

CS 3330: Pipelining 6 October 2016 1 Human pipeline: laundry whites sheets sheets sheets colors colors colors whites whites whites colors colors colors whites whites 14:00 Washer 13:00 12:00 11:00 Table Folding Dryer

Pipelining Instruction Pipelining is the use of pipelining to allow more than one instruction to

CS 3330 Introduction Daniel and Charles CS 3330 Computer Architecture 1 Charles and I will

Pipelining 1 Today Quiz Introduction to pipelining 2 Pipelining L L a a Logic

Chapter 3: Pipelining and Parallel Processing Keshab K. Parhi Outline Introduction

Appendix A Appendix A Pipelining: Basic and Intermediate Concepts p 1 Overview Basics of

Computer Systems Lecture 15 Pipelining and Hazards CS 230 - Spring 2020 3-1 Pipelining CS

Lecture 2 (I ): Lecture 2 (I ): Pipelining & Retiming Pipelining & Retiming

Hg Tutorial For : COP 3330. Object oriented Programming (Using C++)

C++ Tour About me: Piyush Kumar Phone: 645-2355 Email: piyush@cs.fsu.edu For : COP 3330.

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Overview Basics of Pipelining Pipeline Hazards Appendix A Pipeline Implementation

Computer Architecture Summer 2020 Pipelining Tyler Bletsch Duke University Includes material

Appendix A Pipelining: Basic and Intermediate C Concepts t 1 Overview Basics of

Chapter Six 1 2004 Morgan Kaufmann Publishers Pipelining The laundry analogy for

EE 457 Unit 6a Basic Pipelining Techniques 2 Pipelining Introduction Consider a drink

Pipelining PIPELINING what Seymour Cray taught the laundry industry How to correctly pipeline

Survivance Understanding & Surviving Mental Illness & Suicide In Indian Country 1

Introduction to Experiments February 4 1 / 42 Outline for today 1. Introductions 2. Overview of

Git for Teams of One or More Git for Teams of One or More Emma Jane Westby Twitter: emmajanehw

Welcome to Python! Justin Kiggins Product Manager DataCamp Python for MATLAB Users

The Publics Response to Biological Terrorism: A Possible Scenario Involving the Release of

Mikhail V. Medvedev (KU) Students (at KU): Simulations: Sarah Reynolds, Ken-Ichi Nishikawa (U.

Wicket 6 Jochen Mader Chief Developer @ Senacor Technologies AG http://www.senacor.com

Wicket Explained Ease your way into Ease your way into Component Based Web Application

Sambuz

Useful Links

Newsletter

Mail Us

CS 3330: Pipelining 6 October 2016 1 Human pipeline: laundry - PowerPoint PPT Presentation

CS 3330: Pipelining 6 October 2016 1 Human pipeline: laundry whites sheets sheets sheets colors colors colors whites whites whites colors colors colors whites whites 14:00 Washer 13:00 12:00 11:00 Table Folding Dryer

Pipelining Instruction Pipelining is the use of pipelining to allow more than one instruction to

CS 3330 Introduction Daniel and Charles CS 3330 Computer Architecture 1 Charles and I will

Pipelining 1 Today Quiz Introduction to pipelining 2 Pipelining L L a a Logic

Chapter 3: Pipelining and Parallel Processing Keshab K. Parhi Outline Introduction

Appendix A Appendix A Pipelining: Basic and Intermediate Concepts p 1 Overview Basics of

Computer Systems Lecture 15 Pipelining and Hazards CS 230 - Spring 2020 3-1 Pipelining CS

Lecture 2 (I ): Lecture 2 (I ): Pipelining &amp; Retiming Pipelining &amp; Retiming

Hg Tutorial For : COP 3330. Object oriented Programming (Using C++)

C++ Tour About me: Piyush Kumar Phone: 645-2355 Email: piyush@cs.fsu.edu For : COP 3330.

Object Oriented Object 3 Programming Object 1 Object 2 Object 4 For : COP 3330. Object

Overview Basics of Pipelining Pipeline Hazards Appendix A Pipeline Implementation

Computer Architecture Summer 2020 Pipelining Tyler Bletsch Duke University Includes material

Appendix A Pipelining: Basic and Intermediate C Concepts t 1 Overview Basics of

Chapter Six 1 2004 Morgan Kaufmann Publishers Pipelining The laundry analogy for

EE 457 Unit 6a Basic Pipelining Techniques 2 Pipelining Introduction Consider a drink

Pipelining PIPELINING what Seymour Cray taught the laundry industry How to correctly pipeline

Survivance Understanding &amp; Surviving Mental Illness &amp; Suicide In Indian Country 1

Introduction to Experiments February 4 1 / 42 Outline for today 1. Introductions 2. Overview of

Git for Teams of One or More Git for Teams of One or More Emma Jane Westby Twitter: emmajanehw

Welcome to Python! Justin Kiggins Product Manager DataCamp Python for MATLAB Users

The Publics Response to Biological Terrorism: A Possible Scenario Involving the Release of

Mikhail V. Medvedev (KU) Students (at KU): Simulations: Sarah Reynolds, Ken-Ichi Nishikawa (U.

Wicket 6 Jochen Mader Chief Developer @ Senacor Technologies AG http://www.senacor.com

Wicket Explained Ease your way into Ease your way into Component Based Web Application

Sambuz

Useful Links

Newsletter

Mail Us

Lecture 2 (I ): Lecture 2 (I ): Pipelining & Retiming Pipelining & Retiming

Survivance Understanding & Surviving Mental Illness & Suicide In Indian Country 1