CS4617 Computer Architecture
Lecture 2 Dr J Vaughan September 10, 2014
1/26
CS4617 Computer Architecture Lecture 2 Dr J Vaughan September 10, - - PowerPoint PPT Presentation
CS4617 Computer Architecture Lecture 2 Dr J Vaughan September 10, 2014 1/26 Amdahls Law Speedup = Execution time for entire task without using enhancement Execution time for entire task using enhancement when possible Speedup
1/26
Speedupenhanced 2/26
10
3/26
10 ≈ 1.22
1.6 ≈ 1.23 4/26
5/26
6/26
7/26
8/26
9/26
10/26
11/26
◮ ARM: Thumb ◮ MIPS: MIPS16 12/26
13/26
◮ Transistor count on a chip doubles every 18 to 24 months
◮ Capacity per DRAM chip doubles every 2-3 years, but this rate
◮ Standard for personal mobile devices (PMDs) ◮ Capacity per chip doubles every 2 years approximately ◮ 15-20 times cheaper per bit than DRAM
◮ Density doubles every 3 years approximately. ◮ 15-20 times cheaper per bit than flash ◮ 300-500 times cheaper than DRAM ◮ Central to server and warehouse-scale storage
◮ Depends on performance of switches ◮ Depends on performance of the transmission system 14/26
◮ Possible to design single-chip 32-bit microprocessor ◮ ...then microprocessors + L1 cache ◮ ...then multicores + caches
15/26
16/26
17/26
18/26
19/26
◮ Turn off clock of inactive modules, e.g., FP unit, idle cores to
◮ Reduce clock frequency and/or voltage when highest
◮ Most µPs now offer a range of operating frequencies and
◮ PMDs and laptops are often idle ◮ Use low power mode DRAM to save energy ◮ Spin disk at lower rate ◮ PCs use emergency slowdown if program execution causes
20/26
◮ Run at higher clock rate on a few cores until temperature rises ◮ 3.3 GHz Core i7 can run in short bursts at 3.6 GHz
◮ Powerstatic ∝ Currentstatic × Voltage ◮ Current flows in transistors even when idle: leakage current ◮ Leakage ranges from 25% to 50% of total power ◮ Power Gating turns off power to inactive modules
◮ Processor is only part of system cost ◮ Use faster, less energy-efficient processor to allow the rest of
21/26
◮ Performance per mm2 of Si
◮ Performance per Watt ◮ Tasks per Joule
22/26
23/26
◮ The first term is the wafer area divided by die area ◮ However, the wafer is circular and the die is rectangular ◮ So the second term divides the circumference (2πR) by the
◮ Subtracting the partial dies along the rim gives the maximum
24/26
◮ Wafer yield accounts for wafers that are completely bad, with
◮ Defects per unit area accounts for random manufacturing
◮ N = process complexity factor, measures manufacturing
25/26
◮ Find the number of dies per 300mm wafer for a die that is 1.5
26/26