Integrated Systems Integrated Systems Challenges and Opportunities - - PowerPoint PPT Presentation

integrated systems integrated systems challenges and
SMART_READER_LITE
LIVE PREVIEW

Integrated Systems Integrated Systems Challenges and Opportunities - - PowerPoint PPT Presentation

Integrated Systems Integrated Systems Challenges and Opportunities Challenges and Opportunities Giovanni De Micheli Giovanni De Micheli Centre Syst Syst mes mes Int Int gr gr s s Centre Integrated systems Ubiquitous


slide-1
SLIDE 1

Integrated Systems Integrated Systems Challenges and Opportunities Challenges and Opportunities

Giovanni De Giovanni De Micheli Micheli Centre Centre Syst Systè èmes mes Int Inté égr gré és s

slide-2
SLIDE 2

De Micheli 2

Integrated systems

  • Ubiquitous presence of integrated

circuits and systems in products

  • The market pull:

– Run demanding SW applications with minimal energy consumption

  • The technology push:

– Pushing the physical limits of computational structures

slide-3
SLIDE 3

De Micheli 3

Anecdotes

  • I think there is a world market for maybe 5 computers -
  • T. Watson – IBM, 1949
  • There is no reason anyone would want a computer in

their home - K. Olsen – DEC 1977

  • I see no advantage whatsoever to a graphical user

interface – B. Gates - Microsoft, 1983

  • The cost of silicon in a car is higher than the cost of

steel - circa 2000

  • Communications of ACM dedicates a full issue to

internet games - November 2006

slide-4
SLIDE 4

De Micheli 4

Multi-processor Systems on Chips

  • Large-scale systems

– Billion-transistor chips – Multi-cores, multi-threaded SW – Power-consumption limited

  • Very expensive to design

– Non recurring engineering costs – Require large market

IBM Cell Multi-Processor

slide-5
SLIDE 5

De Micheli 5

Platforms

  • Address application-specific needs

– Domain-specific hardware – Differentiation via software

  • Examples

– Telecom:

  • Philips Nexperia
  • ST Nomadic

– Automotive

slide-6
SLIDE 6

De Micheli 6

Designing a large chip

CMOS, mostly digital, 65nm, >200mm2 System Blocks 103 RTL RTL lines 105 Mask Trapezoids 1013 Layout Polygons 1012 Circuit Transistors 109 Netlist Gates 108

Interrupt Interrupt Controller Controller UART UART AMBA AHB AMBA AHB AMBA APB AMBA APB Arbitration Arbitration & Decode & Decode & & Mux Mux GPIO GPIO Watchdog Watchdog Timer Timer AHB/APB AHB/APB Bridge Bridge SDRAM SDRAM I/F I/F AHB/ AHB/PCIe PCIe Bridge Bridge USB 2.0 USB 2.0 ISB 1.1 ISB 1.1 Ethernet Ethernet SATA SATA 1394a 1394a USB PHY USB PHY PCI Express PCI Express PCIe PHY PCIe PHY SATA PHY SATA PHY Boot Boot Rom Rom CPU CPU Custom Custom Block Block Ethernet PHY Ethernet PHY 1394 PHY 1394 PHY

Process begin Wait until not Clock = 1; If (Enable =‘1’) then Toggle = Not Toggle; Endif End process;

slide-7
SLIDE 7

De Micheli 7

Challenges

  • Complexity (giga scale)

– Intractable large scale problems

  • Technology (nano scale)

– Ever shrinking CMOS – New disruptive technologies

  • Architectures

– Multi-processing – Structured communication

  • Objectives

– High performance – Low-energy consumption – Small footprint - low cost – Dependable

  • Synthesis technology

– Model HW with languages – Compile into masks

  • Issues

– Design closure – Handling new technologies – HW/SW co-design – Deal with multiple objectives – Verifying correctness – …

& solutions

slide-8
SLIDE 8

De Micheli 8

Outline

  • The nanotechnology challenge

– Variability management – Error tolerance

  • The energy consumption challenge

– Temperature management

  • The communication bottleneck

– Networks on chips

  • A vision and conclusions
slide-9
SLIDE 9

De Micheli 9

Where are we heading?

  • Medium term:

– More Moore

  • More scaling - More complex chips - Fewer players

– More than Moore

  • Use silicon technologies beyond computational

structures

  • Interaction with environment, sensors, etc…
  • System integration
  • Long term:

– A multi-furcation of Moore’s scaling law beyond the 22 nanometer node – New technologies:

  • There is plenty of room at the bottom

Sensor Sensor

slide-10
SLIDE 10

De Micheli 10

Will a new nano-electronic technology prevail?

  • The skeptical view:

– Investments in CMOS silicon are huge – We will not need localized computing power beyond what is achievable with a 1 cm2 die in 22 nm silicon CMOS – Wiring is the bottleneck: making transistor smaller does not help

  • The optimistic view:

– We will always need increasing computing power and storage capacity – We need to curb the increasing costs of manufacturing – We will invent new computing architectures, storage media and communication means

slide-11
SLIDE 11

De Micheli 11

How is the transition path?

  • When will current semiconductor technologies run out of steam?
  • What factor will provide a radical change in technology?

– Performance, power density, cost?

  • Will the transition eliminate previous CMOS technologies?

– Are the new nanoelectronic technologies compatible with standard silicon?

  • How will we design nanoelectronic circuits:

– What are the common characteristics, from a design technology standpoint?

slide-12
SLIDE 12

De Micheli 12

Common characteristics of nano-devices

  • Self-assembly can be used to create structures

– Manufacturing paradigm is both bottom-up and top down – Attempt to avoid lithography bottleneck

  • Combined presence of micro and nano-structures

– Interfacing and compatibility issues

  • More physical defects and higher failure rates

– 10-15% defective devices according to recent estimates – Design must deal with nonworking and short-lived devices

  • Advantage stems from the high density of devices

– Two orders higher than scaled CMOS

slide-13
SLIDE 13

De Micheli 13

Design issues

  • Variability

– Physical parameter variation – Molecular structural effects

  • Reliability

– Higher failure rate – Higher environmental exposure – Transient and permanent errors

slide-14
SLIDE 14

De Micheli 14

Variability

  • Variations within/across chips

– Fast/slow transistors and interconnect

  • Design objective:

– Achieve better than worse-case performance

  • Solutions

– Statistical timing analysis – Statistical logic synthesis – Asynchronous design – System-level approaches

slide-15
SLIDE 15

De Micheli 15

Self-calibrating circuits

  • Adapt to inter-chip variations and to

environmental changes

– Use on-line adaptation policy

  • Examples:

– Dynamic voltage scaling of bus swings [Worm,Ienne –EPFL] – Dynamic voltage scaling in processors

  • Razor [Austin – U Michigan]

– Dynamic latency adjustment for NoCs

  • Terror [Tamhankar -Stanford]
  • Autonomic computing

– Systems that understand and react to environment [IBM]

dd

v FIFO

ch

F

Controller

FIFO

n

dd

v

Encoder

Decoder

Ack

ch

v

errors

ch

v

Error_L

Error

comparator

RAZOR FF

clk_del Main Flip-Flop clk Shadow Latch Q1 D1 1

slide-16
SLIDE 16

De Micheli 16

Reliability:

coping with transient malfunctions

  • Soft errors

– Data corruption due external radiation exposure

  • Crosstalk

– Data corruption due to internal field exposure

  • Both malfunctions manifest

themselves as timing errors

– Error containment

slide-17
SLIDE 17

De Micheli 17

Soft error rates

  • Vary with altitude and latitude
slide-18
SLIDE 18

De Micheli 18

Propagation of soft errors

slide-19
SLIDE 19

De Micheli 19

Logic protection techniques

Redundancy (TMR) Detection + System Correction Hardened Libraries

100% to 200% overhead

Protection Transistor embedded in the cell

ERR signal used by system (Hardware or Software) to correct the error

MODULE DUPLICATED MODULE TRIPLICATED MODULE

VOTE

Data Out Clock

Q Q

DFF DFF Dup δ

D D

Combinational Logic Comp ERR

Sequential Element D Q

clock

Other Techniques

D Q

clock

Scan Sequential Element

scan clock scan in scan out

SCAN hardware used as DETECTION hardware in functional mode

Redundancy Shielding Others [Source IROC]

slide-20
SLIDE 20

De Micheli 20

Reliability: aging of materials

  • Failure mechanisms:

– Electromigration – Oxide Breakdown – Thermo-mechanical stress

  • Temperature dependence

– Arrhenius law – Gradients

time

Failure rate

slide-21
SLIDE 21

De Micheli 21

Summary:

coping with variability & reliability

  • Design chips so that they are insensitive

to local timing variations

  • Exploit redundancy to replace failing

devices

  • Manage power consumption and

temperature of on-chip components

slide-22
SLIDE 22

De Micheli 22

Outline

  • The nanotechnology challenge

– Variability management – Error tolerance

  • The energy challenge

– Temperature management

  • The communication bottleneck

– Networks on chips

  • A vision and conclusions
slide-23
SLIDE 23

De Micheli 23

Execution core

120oC

Cache AGU Temp

(oC)

Thermal map

1.5 GHz Itanium-2

[Source: Intel Corporation and Prof. V. Oklobdzija]

slide-24
SLIDE 24

De Micheli 24

Thermal map: multiprocessosr

slide-25
SLIDE 25

De Micheli 25

Thermal management modeling

Thermal Manager Policy Queue 4 3 2 1 Sleep Active Idle Sleep System Active Idle E nvironment

slide-26
SLIDE 26

De Micheli 26

Thermal management policies

  • Objective:

– Increase energy efficiency and enhance reliability by controlling activity and temperature – Control systems on chips while running

  • Mathematical problem:

– Compute a policy that shuts down / slows down components – Policy complexity depends on requested accuracy

  • Markov, Semi-Markov, Time-Index Markov models, …

– Under mild assumptions, the policy can be computed exactly by LP – Convex optimization can be used

∑∑∑ ∑ ∑∑ ∑ ∑∑ ∑

∈ ∈ ∈ = ∈ ∈ ∈ ∈ ∈ =

= ∀ ≤ ∀ < ∀ = ∀ ∀ = −

F i A a S s i core c s const c const N c c perf A a S s s A a A a S s s N c c energy

a s f a s y a s c l Tpl c Perf t c a s f a s T c s a s f a s s M a s f t s t ) , ( ) , ( ) , ( ; Re ) ( ; cos ; 1 ) , ( ) , ( , ; ) , ' ( ) , | ' ( ) , ( . . cos min

1 , ' 1 ,

λ λ λ

slide-27
SLIDE 27

De Micheli 27

Effect of power management policy

  • n MTTF
  • At high temperatures, EM

and breakdown dominate

– DPM helps reliability

  • At low temperatures, thermal

stress dominates

– DPM lowers MTTF

  • As the temperature gap

between the active and sleep state widens, thermal stress tends to dominate

10.0 11.0 12.0 13.0 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2

power [W] MTTF[year]

T=80C T=90C T=100C T=110C T=120C

  • Any management policy must address both power consumption and

temperature management

slide-28
SLIDE 28

De Micheli 28

Summary: power and thermal management

  • Power and temperature management addresses:

– Battery lifetime extension for mobile systems – Component lifetime extension because of reliability enhancement

  • Management policies become increasingly more

important as geometries scale down:

– More devices dissipate more power (per unit area) – Smaller devices are more prone to fail

slide-29
SLIDE 29

De Micheli 29

Outline

  • The nanotechnology challenge

– Variability management – Error tolerance

  • The energy challenge

– Temperature management

  • The communication bottleneck

– Networks on chips

  • A vision and conclusions
slide-30
SLIDE 30

De Micheli 30

Why on-chip networks?

  • Provide a structured methodology

for realizing on-chip communication

– Modularity – Flexibility

  • Cope with inherent limitations of busses

– Performance and power of busses do not scale up

  • Support reliable operation

– Layered approach to error detection and correction

PE Network Interface Packets Routes

slide-31
SLIDE 31

De Micheli 31

NoC multi-processors: the RAW architecture

  • Fully programmable SoC

– Homogenous array of tiles:

  • Processor cores

with local storage

  • Each tile has a router

[Agrawal MIT]

  • The raw architecture is exposed to the compiler

– Cores and routers are programmable – Compiler determines which wires are used at each cycle – Compiler pipelines long wires

slide-32
SLIDE 32

The BONE roadmap

[Source: KAIST]

slide-33
SLIDE 33

De Micheli 33

Metrics for NoC design

  • Low communication latency

– Streamlined control protocols – Data and control signals can be separate

  • High communication bandwidth

– To support demanding SW applications

  • Low energy consumption

– Wiring switched capacitance dominates – Local storage in register buffers is expensive

  • Error resiliency

– To compensate/correct electrical-level errors

  • Flexibility and programmability
slide-34
SLIDE 34

De Micheli 34

Flexibility in NoC design

  • NoCs have modular structure

– Core interfaces – Switches/routers – High-speed links

  • NoCs can be tailored to

applications

– Topology selection – Switch/link sizing – Protocols

  • Several parameters for optimization

and a large design space

– NoC synthesis and optimization

CPU Memory DSP Memory

link switch network interface

CPU

slide-35
SLIDE 35

De Micheli 35

Netchip tool flow

Topology Synthesis includes: Floorplanner NoC Router

Communication characteristics NoC Area models User

  • bjectives

NoC Power models User Constraints

SUNFLOOR

System specs RTL Simulation SystemC code FPGA Emulation Synthesis IP Core models

To fab

Placement& Routing Codesign Simulation

Application

Platform Generation

NoC component library Platform Generation (xpipes- Compiler)

xpipes

[Source: Murali]

slide-36
SLIDE 36

De Micheli 36 Processor- memory cluster

SUNFLOOR vs. manual design

multimedia chip with 30 cores

P-processors, M-private memories, T-traffic generators, S-shared slaves Hand-mapped topology SUNFLOOR custom topology

Bi-directional links Bi-directional links Uni-directional links

slide-37
SLIDE 37

De Micheli 37

Design layouts

Hand-design (custom mesh) SUNFLOOR Design From Cadence SoC Encounter

slide-38
SLIDE 38

De Micheli 38

SUNFLOOR vs. manual design

Manual design:

  • Topology: 5x3 mesh

(15 switches)

  • Operating frequency:

885 MHz (post-layout)

  • Power consumption:

368 mW

  • Floorplan area:

35.4 mm2

  • Design time: several

weeks

  • 0.13 µm technology

Manual design:

  • Topology: 5x3 mesh

(15 switches)

  • Operating frequency:

885 MHz (post-layout)

  • Power consumption:

368 mW

  • Floorplan area:

35.4 mm2

  • Design time: several

weeks

  • 0.13 µm technology

SUNFLOOR:

  • Topology: custom

(8 switches)

  • Operating frequency:

885 MHz (post-layout)

  • Power consumption:

277 mW (-25%)

  • Floorplan area:

37 mm2 (+4%)

  • Design time: 4 hours

design to layout

  • 0.13 µm technology

SUNFLOOR:

  • Topology: custom

(8 switches)

  • Operating frequency:

885 MHz (post-layout)

  • Power consumption:

277 mW (-25%)

  • Floorplan area:

37 mm2 (+4%)

  • Design time: 4 hours

design to layout

  • 0.13 µm technology

Benchmark execution times comply with application requirements and, in fact, are even 10% better on the SUNFLOOR topology

slide-39
SLIDE 39

De Micheli 40

Summary: networks on chips

  • Networks on Chips are the structured

interconnect of the future

– NoCs exists in many forms and flavors

  • Networks on Chips are necessary for chips

designed in 45 nm technology and beyond

– NoCs deal with wire delay variability – NoCs provide reliability enhancement

slide-40
SLIDE 40

De Micheli 41

Outline

  • The nanotechnology challenge

– Variability management – Error tolerance

  • The energy challenge

– Temperature management

  • The communication bottleneck

– Networks on chips

  • Vision and conclusions
slide-41
SLIDE 41

De Micheli 42

A vision for the future

  • Mobile, ubiquitous, pervasive computing

– Ultra low-power demands low-voltage operation

  • High performance requires parallel computation
  • High reliability is achieved by redundancy
  • New paradigm for computation:

– Array-based computation (e.g., RAW) – Array-oriented communication (NoC)

slide-42
SLIDE 42

De Micheli 43

Conclusions

  • High-performance, ultra low-power, reliable circuits will be

required by distributed embedded systems

  • Novel nanotechnologies will provide us with unprecedented

levels of functional integration and performance

  • Novel design tools and methodologies will be needed to

leverage the technology:

– Cope with variability, reliability, thermal and other issues

slide-43
SLIDE 43

De Micheli 44

It is only a beginning … … the challenges are still ahead

[Source: Kubrick]