Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
Radu Teodorescu Department of Computer Science and Engineering The Ohio State University http://arch.cse.ohio-state.edu
computer architecture research lab
Designing Energy-Efficient Microprocessors in the Era of - - PowerPoint PPT Presentation
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors Radu Teodorescu Department of Computer Science and Engineering The Ohio State University http://arch.cse.ohio-state.edu T computer architecture research lab
computer architecture research lab
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
2
Energy efficiency is now crucial to all computing markets, in particular the growth areas: mobile and cloud computing.
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
3
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Voltage
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
4
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
5
1E-10 1E-08 1E-06 1E-04 1E-02 1E+00 900 825 750 675 600 525 450 375 300 Supply Voltage - millivolts Probability of SRAM Bit Failure Intel Vcc-min NTV 5% error rate
0.5 1 1.5 2 Frequency Distribution
nominal NTV%
Voltage Emergency
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
6
Delay Nom. Delay NTV
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
7
Data Parity Parity
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
8
Data Parity Parity
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
9
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
10
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
11
56 100 351 511 437 351 100 437 511 56 87 204 5 2 511 56 351 499 201 1 511 73
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
12
Data Block (cache line) Parity encoders Parity encoders Parity encoders PW PW PW ... PW PW PW ... PW PW PW ... ... Parity Group 0 Parity Group 1 Parity Group N Permutation Network Permutation 0 Permutation 1 Permutation N
Line 0 Line 1 Line 2 Line 3 Line 4 Line 5 Line 6 Line 7
Redundant bits
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
13
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
14
Vdd Freq + Latency Nominal 0.9V 3GHz NTHigh 0.375V 460Mhz 4 NTMid 0.350V 355Mhz 4 NTLow 0.337V 300Mhz 6
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
15
“Improving cache lifetime reliability at ultra-low voltages,” in International Symposium on Microarchitecture, December 2009.
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
16
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
17
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
18
Data Parity Parity
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
19
0.5 1 1.5 2 Frequency Distribution
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
0.5 1 1.5 2 Frequency Distribution
20
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
21
Frequency Execution progress Wasted Perf. Execution progress Frequency NTV Variation No variation Bottleneck
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
22
Core Core 1 Core 2 Core 3 Core 4 Core 5 Core 6 Core 7 Core 8 Core 9 Core 10 Core 11 Core 12 Core 13 Core 14 Core 15 Core Core 1 Core 2 Core 3 Core 4 Core 5 Core 6 Core 7 Core 8 Core 9 Core 10 Core 11 Core 12 Core 13 Core 14 Core 15
Vdd High Vdd Low
Vdd High & Low, with Fhigh and Flow
two rails and Fhigh and Flow
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
23
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
24
Slow Fast
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
VAR:
25
Blocked Normal Critical
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
26
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
27
0.5 0.6 0.7 0.8 0.9 1 1.1 barnes
water-nsqd cholesky fft lu radix blackscholes fluidanimate swaptions dedup streamclster g.mean Normalized Execution Time Hetero Scheduling Booster VAR Booster SYNC Heterogeneous
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
28
0.5 0.6 0.7 0.8 0.9 1 1.1 radiosity raytrace volrend bodytrack g.mean Normalized Execution Time Hetero Scheduling Booster VAR Booster SYNC Heterogeneous
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
29
Data Parity Parity
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
30
0.0010 0.0015 0.3 0.4 0.5 0.6 0.7
V(out) (V) time (s) V(out) I(load)
5 10 15 20 25 30 35 40 45
I(load) (A)
+10%
0.0010 0.0015 0.3 0.4 0.5 0.6 0.7
V(out) (V) time (s) V(out) I(load)
5 10 15 20 25 30 35 40 45
I(load) (A)
+10%
Guardband
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
31
Voltage Emergency
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 35 40 45 50 55 4 8 12 16
32
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 30 40 50 60 70 80 1 2 3 4
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 30 35 40 45 50 55 60 65 70 2 4 6 8
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
VRSync saves 33% energy
33
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
34
First enter Execution Blocked on barrier 1 Delay All in barrier 1 T0 T7 Linear schedule t7 T1 T2 T3 T4 T5 T6 Time Threads
Linear schedule
First enter Execution Blocked on barrier 1 All in barrier Delay All out T0 T7 Threads T1 T2 T3 T4 T5 T6 Time
Bulk schedule
50 100 150 200 250 300 400 450 500 550 600 650 700
V(out)
V(out) (mV) time (µs) 4 8 12 16 20 24 28 32 36
50 100 150 200 250 300 400 450 500 550 600 650 700
V(out)
time (µs) V(out) (mV) 4 8 12 16 20 24 28 32 36
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
35
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
36
Power (Watts) Cores in Barrier Time (milliseconds) Emergency Emergency Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 80 90 46.98 47 47.02 47.04 47.06 8 16 24 32 Power (Watts) Cores in Barrier Time (milliseconds) Emergency Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 80 90 49.44 49.47 49.5 49.53 49.56 49.59 8 16 24 32 Power (Watts) Cores in Barrier Time (milliseconds) Emergency Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 80 90 49.38 49.41 49.44 49.47 49.5 49.53 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
37
VRSync Bulk! Baseline!
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
38
Baseline! VRSync Linear!
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
39
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 r a d i
i t y b a r n e s
e a n r a y t r a c e w a t e r
s q u a r e d c h
e s k y f f t l u r a d i x b l a c k s c h
e s b
y t r a c k f l u i d a n i m a t e s w a p t i
s d e d u p s t r e a m c l u s t e r g . m e a n Normalized Execution Time
2.1
Linear Bulk
11% 6%
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
40
Technique! Guardband! Runtime! Power! Energy! Baseline with High Guardband! 210mV! 1.0! 1.563! 1.563! VRSync Linear! 60mV! 1.112! 0.98! 1.086! VRSync Bulk! 60mV! 1.063! 0.99! 1.049! VRSync Bulk Fast! 160mV! 1.045! 1.361! 1.422!
31% 33%
VRSync Bulk is 33% more energy efficient than baseline with high guardband
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
41
Data Parity Parity
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
margins (voltage speculation)
42
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
SPECfp, and SPECint benchmarks
43
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
44
Observation: Correctable errors always triggered before uncorrectable ones, while running a stress test workload.
2 4 6 8 10 12 14 16 18 0.96 0.98 1 1.02 1.04 1.06 1.08 1.1 Error Rate (errors/minute) Supply Voltage Unsafe Vdd Itanium Core
Failure Vdd Correctable error range
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
45
Core-to-core variation in safe/min Vdd: 0.96-1V
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
Voltage - determined post-manufacturing by running stress test workload
46
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
47 Safety padding (10 mV) First Error Voltage Margin Voltage Supply Voltage Time Correctable Errors Core Vdd Core errors Discovery phase Runtime
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
48 Safety padding (10 mV) First Error Voltage Margin Voltage Supply Voltage Time Core Vdd Correctable errors Max error threshold
burst testing
Min error threshold
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
0.965 0.97 0.975 0.98 0.985 0.99 0.995 1 5 10 15 20 10 20 30 40 50 Supply Voltage (V) Error Rate (per minute) Time (minutes) Margin Voltage Error rate Core Voltage
49
SPECjbb
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
0.5 0.6 0.7 0.8 0.9 1 Specjbb2005 SPECint SPECfp Relative Power Cores-only CPU Total
50
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
50 100 150 200 250 300 core0 core1 core2 core3 core4 core5 core6 core7 Correctable Errors Correctable Cache Errors Correctable RF Errors
(a) Processor A
50 100 150 200 250 core0 core1 core2 core3 core4 core5 core6 core7 Correctable Errors
(b) Processor B
51
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
52
Data Parity Parity
Power (Watts) Cores in Barrier Time (milliseconds) Cores in Barrier Power (Watts) 10 20 30 40 50 60 70 36 38 40 42 44 46 48 50 8 16 24 32
Designing Energy-Efficient Microprocessors in the Era of Unpredictable Transistors
T
architecture research lab
now Assist. Prof. @ SUNY Binghamton
53