CSE775: Computer Architecture
Chapter 1: Fundamentals of Chapter 1: Fundamentals of Computer Design
1
CSE775: Computer Architecture Chapter 1: Fundamentals of Chapter 1: - - PowerPoint PPT Presentation
CSE775: Computer Architecture Chapter 1: Fundamentals of Chapter 1: Fundamentals of Computer Design 1 Computer Architecture Topics Input/Output and Storage Disks, WORM, Tape p RAID Emerging Technologies DRAM Interleaving Memories
1
Disks, WORM, Tape RAID Input/Output and Storage DRAM p Emerging Technologies Interleaving Memories L2 Cache Coherence, Bandwidth, Latency Memory Hierarchy Addressing, L1 Cache Latency VLSI y
Instruction Set Architecture
Pipelining, Hazard Resolution, S l R d i Protection, Exception Handling VLSI Pipelining and Instruction
2
Superscalar, Reordering, Prediction, Speculation, Vector, DSP Level Parallelism
M P M P M P M P ° ° °
Interconnection Network S
Processor-Memory-Switch
Processor-Memory-Switch
3
Design
g Analysis
Cost / Performance Analysis Good Ideas Good Ideas
4
5
6
7
CMOS ( l t t l id i d t ) VLSI
dominates older technologies like TTL (Transistor Transistor Logic) in cost AND performance
8
I 90’ th i f i ti i t d i h f RISC st le
9
In 90’s, the main source of innovations in computer design has come from RISC-style pipelined processors. In the last several years, the annual growth rate is (only) 10-20%.
10
11
12
13
IC cost = Die cost + Testing cost + Packaging cost Final test yield Die cost = Wafer cost Dies per Wafer * Die yield Di f š * ( W f di / 2)2 š * W f di T di Dies per wafer = š * ( Wafer_diam / 2)2 – š * Wafer_diam – Test dies Die Area ¦ 2 * Die Area Die Yield = Wafer yield * 1 + Defects_per_unit_area * Die_Area
− α
14
DAP.S98 1
α
15
Throughput Plane Boeing 747 Speed 610 mph DC to Paris 6 5 hours Passengers 470 Throughput (pmph) 286 700 Boeing 747 BAD/Sud 610 mph 1350 mph 6.5 hours 3 hours 470 132 286,700 178 200
Concodre 1350 mph 3 hours 132 178,200
– Execution time, response time, latency
16
– Throughput, bandwidth
17
Application Answers per month Programming Language Application Answers per month Operations per second Compiler
ISA
(millions) of Instructions per second: MIPS (millions) of (FP) operations per second: MFLOP/s Datapath Control
ISA
Function Units (millions) of (FP) operations per second: MFLOP/s Megabytes per second Transistors Wires Pins Function Units Cycles per second (clock rate)
18
19
20
21
– SPECWeb SPECFS
22
– SPECFS
23
24
25
26
27
28
29
30
31
Speedup due to enhancement E:
ExTime w/o E Performance w/ E / / Speedup(E) = ------------- = ------------------- ExTime w/ E Performance w/o E
Suppose that enhancement E accelerates a fraction F
task is unaffected
32
DAP.S98 32
ExTimenew = ExTimeold x (1 - Fractionenhanced) + Fractionenhanced Speedupenhanced Speedupoverall = ExTimeold = 1 (1 F ti ) + F ti p poverall ExTimenew (1 - Fractionenhanced) + Fractionenhanced Speedupenhanced
33
34
CPU time = Seconds = Instructions x Cycles x Seconds Program Program Instruction Cycle
35
CPI = (CPU Time * Clock Rate) / Instruction Count
n CPI = (CPU Time * Clock Rate) / Instruction Count = Cycles / Instruction Count CPU time = CycleTime * CPI * I i = 1 i i
CPI =
n i i i i
i = 1 i Instruction Count
36
Base Machine (Reg / Reg) Base Machine (Reg / Reg) Op Freq Cycles CPI(i) (% Time) ALU 50% 1 .5 (33%) L d 20% 2 4 (27%) Load 20% 2 .4 (27%) Store 10% 2 .2 (13%) Branch 20% 2 .4 (27%)
Typical Mix
1.5
37