Dark Silicon and its Implications for Future Processor Design Max - - PowerPoint PPT Presentation

dark silicon and its implications for future processor
SMART_READER_LITE
LIVE PREVIEW

Dark Silicon and its Implications for Future Processor Design Max - - PowerPoint PPT Presentation

Dark Silicon Dark Silicon and its Implications for Future Processor Design Max Menges 22. December 2015 1 / 21 Dark Silicon Introduction: What is dark silicon? Motivation 10 25 3500 3000 8 20 Frequency [GHz] Utilization [%] 2500


slide-1
SLIDE 1

Dark Silicon

Dark Silicon and its Implications for Future Processor Design

Max Menges

  • 22. December 2015

1 / 21

slide-2
SLIDE 2

Dark Silicon Introduction: What is dark silicon?

Motivation

2 4 6 8 10 90nm TSMC 45nm TSMC 32nm ITRS Frequency [GHz] 500 1000 1500 2000 2500 3000 3500 90nm TSMC 45nm TSMC 32nm ITRS Power [W] 5 10 15 20 25 90nm TSMC 45nm TSMC 32nm ITRS Utilization [%]

2 / 21

slide-3
SLIDE 3

Dark Silicon Introduction: What is dark silicon?

What is dark silicon?

The utilization wall refers to the part of a chip which can actively be used within the power budget at full frequency. This is dropping exponentially with each process generation. The unused silicon that is left unpowered is referred to as dark silicon.

3 / 21

slide-4
SLIDE 4

Dark Silicon Introduction: What is dark silicon?

1 Introduction: What is dark silicon? 2 Background: Where does dark silicon come from? 3 Conservation Cores: Utilizing dark silicon.

4 / 21

slide-5
SLIDE 5

Dark Silicon Background: Where does dark silicon come from?

What is in a CPU?

Sudha Yalamanchili, Architectural Alternatives for Energy Efficient Performance Scaling, VLSI Conference, 2013 5 / 21

slide-6
SLIDE 6

Dark Silicon Background: Where does dark silicon come from?

Power Consumption

Dynamic power when switching Pdyn = αCLV 2f Subthreshold leakage Pleak ∝ e

VGS −Vth nVT

Gate-oxide leakage due to quantum meachanical tunneling

D-cache 6% Data path 38% Register file 14% Fetch/ decode 19% I-cache 23% Baseline CPU 91 pJ/instr.

Goulding-Hotta et al. The GreenDroid Mobile Application Processor: An Architecture for Silicon’s Dark Future, Micro IEEE, vol.31, no.2, 2011 6 / 21

slide-7
SLIDE 7

Dark Silicon Background: Where does dark silicon come from?

Technology Scaling

Scale geometries by factor S = 1.4, e.g. from 90nm to 65nm Ideally scale all voltages etc. accordingly Devices per chip at constant area A increases by S2 ≈ 2 No increase in power due to constant energy density PS = 1

S C · 1 S2V 2 · Sf = 1 S2P Param. Description Rel. Classical Scaling W , L Transistor dimensions 1/S Vdd, Vth Supply & threshold voltages 1/S tox Oxide thicknes 1/S C Gate capacitance WL/tox 1/S p Power per device CV 2f 1/S2 P Full die, full power Dp 1 U Utilization B/P 1

7 / 21

slide-8
SLIDE 8

Dark Silicon Background: Where does dark silicon come from?

Dennard Scaling

Source Drain Gate

Subthreshold Gate-leakage

Dennard’s Law: The power density in a transistor stays constant as geometries shrink Breakdown of Dennard scaling due to leakage current at about 2005-2007, around the 65nm process Limited by subthreshold leakage current and QM tunneling effects at thin gate oxide

8 / 21

slide-9
SLIDE 9

Dark Silicon Background: Where does dark silicon come from?

Technology Scaling II

Post Dennard scaling is leakage limited, Vdd and Vth cannot be lowered Continue to scale geometries by factor S = 1.4 Utilization will decrease with a factor of 1/S2 with each new process generation

Param. Description Rel. Classical Scaling Leakage Limited W , L Transistor dimensions 1/S 1/S Vdd, Vth Supply & threshold voltages 1/S 1 tox Oxide thicknes 1/S 1/S C Gate capacitance WL/tox 1/S 1/S p Power per device CV 2f 1/S2 1 P Full die, full power Dp 1 1/S2 U Utilization B/P 1 1/S2

9 / 21

slide-10
SLIDE 10

Dark Silicon Background: Where does dark silicon come from?

Muticore CPUs

Single core CPUs derive speedup from frequency gains

Single core

Core 0 Core 1

Manycore little BIG ? 10 / 21

slide-11
SLIDE 11

Dark Silicon Background: Where does dark silicon come from?

Muticore CPUs

Single core CPUs derive speedup from frequency gains Transition to multicore CPUs

Reduce clock frequency to 80% Power: PM = 0.512 · PS Gain 1.6x performance by adding a second core

Single core

Core 0 Core 1

Manycore little BIG ? 10 / 21

slide-12
SLIDE 12

Dark Silicon Background: Where does dark silicon come from?

Muticore CPUs

Single core CPUs derive speedup from frequency gains Transition to multicore CPUs

Reduce clock frequency to 80% Power: PM = 0.512 · PS Gain 1.6x performance by adding a second core

Low frequency, throughput orientated manycores for regular floating point arithmetics

Single core

Core 0 Core 1

Manycore little BIG ? 10 / 21

slide-13
SLIDE 13

Dark Silicon Background: Where does dark silicon come from?

Muticore CPUs

Single core CPUs derive speedup from frequency gains Transition to multicore CPUs

Reduce clock frequency to 80% Power: PM = 0.512 · PS Gain 1.6x performance by adding a second core

Low frequency, throughput orientated manycores for regular floating point arithmetics Heterogeneous cores for energy efficient computations

Single core

Core 0 Core 1

Manycore little BIG ? 10 / 21

slide-14
SLIDE 14

Dark Silicon Background: Where does dark silicon come from?

Muticore CPUs

Single core CPUs derive speedup from frequency gains Transition to multicore CPUs

Reduce clock frequency to 80% Power: PM = 0.512 · PS Gain 1.6x performance by adding a second core

Low frequency, throughput orientated manycores for regular floating point arithmetics Heterogeneous cores for energy efficient computations Specialized hardware

Single core

Core 0 Core 1

Manycore little BIG ? 10 / 21

slide-15
SLIDE 15

Dark Silicon Background: Where does dark silicon come from?

1 Introduction: What is dark silicon? 2 Background: Where does dark silicon come from? 3 Conservation Cores: Utilizing dark silicon.

11 / 21

slide-16
SLIDE 16

Dark Silicon Conservation Cores: Utilizing dark silicon.

The GreenDroid

The GreenDroid is a proposed energy efficient chip design targeted at Andriod mobile phones Power limitation of 3W Android OS ideal as a limited software platform

User applications run in a VM Common applications: Web browser, e-mail, media player Short replacement cycle

Utilize dark silicon in energy efficient conservation cores

12 / 21

slide-17
SLIDE 17

Dark Silicon Conservation Cores: Utilizing dark silicon.

What is a C-Core?

Specialized core which implements software functions in hardware Analyze most frequently used functions and translate to verilog code C-cores are coupled to a host CPU via L1 cache and scan chain

13 / 21

slide-18
SLIDE 18

Dark Silicon Conservation Cores: Utilizing dark silicon.

Tile layout

Chip layout Tile layout

Venkatesh et al. Conservation Cores: Reducing the Energy of Mature Computations, SIGARCH Comput. Archit. News, March 2010 14 / 21

slide-19
SLIDE 19

Dark Silicon Conservation Cores: Utilizing dark silicon.

Generating the Cores I

Characterize workload and identify regions of hot code Translate CFG to state machine Compile code

Compare c-core specs and code Generate stubs that allow execution on c-core or CPU

computeArraySum { sum = 0; for(i = 0; i < n; i++) { sum += a[i]; } return(sum); }

i = 0 sum = 0 phi(i) phi(sum) i < n sum+=a[i] i++ return(sum) F

Venkatesh et al. Conservation Cores: Reducing the Energy of Mature Computations, SIGARCH Comput.

  • Archit. News, March 2010

15 / 21

slide-20
SLIDE 20

Dark Silicon Conservation Cores: Utilizing dark silicon.

Generating the Cores II

Change code to allow patching

Constants Operators Control flow

Insert scan chains and synthesise to hardware Add exception bit to each state transition

i sum a + ld unit addr valid en value + + 1 muxSel muxSel ldEn ldValid < n cond

Data Path sInit s1 s2 s3 ldValid==0 sRet cond==0 Control Path Cache Interface Scan Chain Interface Scan Chain

Venkatesh et al. Conservation Cores: Reducing the Energy of Mature Computations, SIGARCH Comput. Archit. News, March 2010 16 / 21

slide-21
SLIDE 21

Dark Silicon Conservation Cores: Utilizing dark silicon.

Execution

Decide at runtime to execute code on CPU or c-core Pass function arguments via scan chain Start c-core execution with a single bit master scan chain Once complete the c-core throws an exception and transfers controll back to the CPU

In case of a patched core, pass control back and forth between CPU and c-core

17 / 21

slide-22
SLIDE 22

Dark Silicon Conservation Cores: Utilizing dark silicon.

Results I

Goulding-Hotta et al. The GreenDroid Mobile Application Processor: An Architecture for Silicon’s Dark Future, Micro IEEE, vol.31, no.2, 2011

Reduce energy by removing instruction fetch and decode and simplifying the data path C-core executions in one example tile span ≈ 10%

  • f code

18 / 21

slide-23
SLIDE 23

Dark Silicon Conservation Cores: Utilizing dark silicon.

Results II

Normalized application execution time

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8

1.00 1.01

Exception Argument Transfer Dispatch Overhead Initialization Conservation Core MIPS MIPS C-Core

Normalized application energy

0.2 0.4 0.6 0.8 1 1.2 1.4

1 . 1.00 0.67

Conservation Core Leakage Core Leakage D−Cache Leakage Core Clock Conservation Core Dynamic Core Dynamic D−Cache Dynamic MIPS C-Core

Venkatesh et al. Conservation Cores: Reducing the Energy of Mature Computations, SIGARCH Comput. Archit. News, March 2010

Average energy and execution times of bzip2, cjpeg, djpeg, mcf and vpr in c-cores and the MIPS core.

19 / 21

slide-24
SLIDE 24

Dark Silicon Conservation Cores: Utilizing dark silicon.

Conclusion

Dark silicon is the part of chip which cannot be operated within the power budget

20 / 21

slide-25
SLIDE 25

Dark Silicon Conservation Cores: Utilizing dark silicon.

Conclusion

Dark silicon is the part of chip which cannot be operated within the power budget It is a result of scaling process technologies without reducing supply voltage accordingly

20 / 21

slide-26
SLIDE 26

Dark Silicon Conservation Cores: Utilizing dark silicon.

Conclusion

Dark silicon is the part of chip which cannot be operated within the power budget It is a result of scaling process technologies without reducing supply voltage accordingly Introduce specialized energy efficient hardware to utilize dark silicon

20 / 21

slide-27
SLIDE 27

Dark Silicon Conservation Cores: Utilizing dark silicon.

Thank you for your attention. Any questions?

21 / 21