Integrated Systems Integrated Systems Challenges and Opportunities - - PowerPoint PPT Presentation
Integrated Systems Integrated Systems Challenges and Opportunities - - PowerPoint PPT Presentation
Integrated Systems Integrated Systems Challenges and Opportunities Challenges and Opportunities Giovanni De Micheli Giovanni De Micheli Centre Syst Syst mes mes Int Int gr gr s s Centre Integrated systems Ubiquitous
De Micheli 2
Integrated systems
- Ubiquitous presence of integrated
circuits and systems in products
- The market pull:
– Run demanding SW applications with minimal energy consumption
- The technology push:
– Pushing the physical limits of computational structures
De Micheli 3
Anecdotes
- I think there is a world market for maybe 5 computers -
- T. Watson – IBM, 1949
- There is no reason anyone would want a computer in
their home - K. Olsen – DEC 1977
- I see no advantage whatsoever to a graphical user
interface – B. Gates - Microsoft, 1983
- The cost of silicon in a car is higher than the cost of
steel - circa 2000
- Communications of ACM dedicates a full issue to
internet games - November 2006
De Micheli 4
Multi-processor Systems on Chips
- Large-scale systems
– Billion-transistor chips – Multi-cores, multi-threaded SW – Power-consumption limited
- Very expensive to design
– Non recurring engineering costs – Require large market
IBM Cell Multi-Processor
De Micheli 5
Platforms
- Address application-specific needs
– Domain-specific hardware – Differentiation via software
- Examples
– Telecom:
- Philips Nexperia
- ST Nomadic
– Automotive
De Micheli 6
Designing a large chip
CMOS, mostly digital, 65nm, >200mm2 System Blocks 103 RTL RTL lines 105 Mask Trapezoids 1013 Layout Polygons 1012 Circuit Transistors 109 Netlist Gates 108
Interrupt Interrupt Controller Controller UART UART AMBA AHB AMBA AHB AMBA APB AMBA APB Arbitration Arbitration & Decode & Decode & & Mux Mux GPIO GPIO Watchdog Watchdog Timer Timer AHB/APB AHB/APB Bridge Bridge SDRAM SDRAM I/F I/F AHB/ AHB/PCIe PCIe Bridge Bridge USB 2.0 USB 2.0 ISB 1.1 ISB 1.1 Ethernet Ethernet SATA SATA 1394a 1394a USB PHY USB PHY PCI Express PCI Express PCIe PHY PCIe PHY SATA PHY SATA PHY Boot Boot Rom Rom CPU CPU Custom Custom Block Block Ethernet PHY Ethernet PHY 1394 PHY 1394 PHYProcess begin Wait until not Clock = 1; If (Enable =‘1’) then Toggle = Not Toggle; Endif End process;
De Micheli 7
Challenges
- Complexity (giga scale)
– Intractable large scale problems
- Technology (nano scale)
– Ever shrinking CMOS – New disruptive technologies
- Architectures
– Multi-processing – Structured communication
- Objectives
– High performance – Low-energy consumption – Small footprint - low cost – Dependable
- Synthesis technology
– Model HW with languages – Compile into masks
- Issues
– Design closure – Handling new technologies – HW/SW co-design – Deal with multiple objectives – Verifying correctness – …
& solutions
De Micheli 8
Outline
- The nanotechnology challenge
– Variability management – Error tolerance
- The energy consumption challenge
– Temperature management
- The communication bottleneck
– Networks on chips
- A vision and conclusions
De Micheli 9
Where are we heading?
- Medium term:
– More Moore
- More scaling - More complex chips - Fewer players
– More than Moore
- Use silicon technologies beyond computational
structures
- Interaction with environment, sensors, etc…
- System integration
- Long term:
– A multi-furcation of Moore’s scaling law beyond the 22 nanometer node – New technologies:
- There is plenty of room at the bottom
Sensor Sensor
De Micheli 10
Will a new nano-electronic technology prevail?
- The skeptical view:
– Investments in CMOS silicon are huge – We will not need localized computing power beyond what is achievable with a 1 cm2 die in 22 nm silicon CMOS – Wiring is the bottleneck: making transistor smaller does not help
- The optimistic view:
– We will always need increasing computing power and storage capacity – We need to curb the increasing costs of manufacturing – We will invent new computing architectures, storage media and communication means
De Micheli 11
How is the transition path?
- When will current semiconductor technologies run out of steam?
- What factor will provide a radical change in technology?
– Performance, power density, cost?
- Will the transition eliminate previous CMOS technologies?
– Are the new nanoelectronic technologies compatible with standard silicon?
- How will we design nanoelectronic circuits:
– What are the common characteristics, from a design technology standpoint?
De Micheli 12
Common characteristics of nano-devices
- Self-assembly can be used to create structures
– Manufacturing paradigm is both bottom-up and top down – Attempt to avoid lithography bottleneck
- Combined presence of micro and nano-structures
– Interfacing and compatibility issues
- More physical defects and higher failure rates
– 10-15% defective devices according to recent estimates – Design must deal with nonworking and short-lived devices
- Advantage stems from the high density of devices
– Two orders higher than scaled CMOS
De Micheli 13
Design issues
- Variability
– Physical parameter variation – Molecular structural effects
- Reliability
– Higher failure rate – Higher environmental exposure – Transient and permanent errors
De Micheli 14
Variability
- Variations within/across chips
– Fast/slow transistors and interconnect
- Design objective:
– Achieve better than worse-case performance
- Solutions
– Statistical timing analysis – Statistical logic synthesis – Asynchronous design – System-level approaches
De Micheli 15
Self-calibrating circuits
- Adapt to inter-chip variations and to
environmental changes
– Use on-line adaptation policy
- Examples:
– Dynamic voltage scaling of bus swings [Worm,Ienne –EPFL] – Dynamic voltage scaling in processors
- Razor [Austin – U Michigan]
– Dynamic latency adjustment for NoCs
- Terror [Tamhankar -Stanford]
- Autonomic computing
– Systems that understand and react to environment [IBM]
dd
v FIFO
ch
F
Controller
FIFO
n
dd
v
Encoder
Decoder
Ack
ch
v
errors
ch
v
Error_L
Error
comparator
RAZOR FF
clk_del Main Flip-Flop clk Shadow Latch Q1 D1 1
De Micheli 16
Reliability:
coping with transient malfunctions
- Soft errors
– Data corruption due external radiation exposure
- Crosstalk
– Data corruption due to internal field exposure
- Both malfunctions manifest
themselves as timing errors
– Error containment
De Micheli 17
Soft error rates
- Vary with altitude and latitude
De Micheli 18
Propagation of soft errors
De Micheli 19
Logic protection techniques
Redundancy (TMR) Detection + System Correction Hardened Libraries
100% to 200% overhead
Protection Transistor embedded in the cell
ERR signal used by system (Hardware or Software) to correct the error
MODULE DUPLICATED MODULE TRIPLICATED MODULE
VOTE
Data Out Clock
Q Q
DFF DFF Dup δ
D D
Combinational Logic Comp ERR
Sequential Element D Q
clock
Other Techniques
D Q
clock
Scan Sequential Element
scan clock scan in scan out
SCAN hardware used as DETECTION hardware in functional mode
Redundancy Shielding Others [Source IROC]
De Micheli 20
Reliability: aging of materials
- Failure mechanisms:
– Electromigration – Oxide Breakdown – Thermo-mechanical stress
- Temperature dependence
– Arrhenius law – Gradients
time
Failure rate
De Micheli 21
Summary:
coping with variability & reliability
- Design chips so that they are insensitive
to local timing variations
- Exploit redundancy to replace failing
devices
- Manage power consumption and
temperature of on-chip components
De Micheli 22
Outline
- The nanotechnology challenge
– Variability management – Error tolerance
- The energy challenge
– Temperature management
- The communication bottleneck
– Networks on chips
- A vision and conclusions
De Micheli 23
Execution core
120oC
Cache AGU Temp
(oC)
Thermal map
1.5 GHz Itanium-2
[Source: Intel Corporation and Prof. V. Oklobdzija]
De Micheli 24
Thermal map: multiprocessosr
De Micheli 25
Thermal management modeling
Thermal Manager Policy Queue 4 3 2 1 Sleep Active Idle Sleep System Active Idle E nvironment
De Micheli 26
Thermal management policies
- Objective:
– Increase energy efficiency and enhance reliability by controlling activity and temperature – Control systems on chips while running
- Mathematical problem:
– Compute a policy that shuts down / slows down components – Policy complexity depends on requested accuracy
- Markov, Semi-Markov, Time-Index Markov models, …
– Under mild assumptions, the policy can be computed exactly by LP – Convex optimization can be used
∑∑∑ ∑ ∑∑ ∑ ∑∑ ∑
∈ ∈ ∈ = ∈ ∈ ∈ ∈ ∈ =
= ∀ ≤ ∀ < ∀ = ∀ ∀ = −
F i A a S s i core c s const c const N c c perf A a S s s A a A a S s s N c c energy
a s f a s y a s c l Tpl c Perf t c a s f a s T c s a s f a s s M a s f t s t ) , ( ) , ( ) , ( ; Re ) ( ; cos ; 1 ) , ( ) , ( , ; ) , ' ( ) , | ' ( ) , ( . . cos min
1 , ' 1 ,
λ λ λ
De Micheli 27
Effect of power management policy
- n MTTF
- At high temperatures, EM
and breakdown dominate
– DPM helps reliability
- At low temperatures, thermal
stress dominates
– DPM lowers MTTF
- As the temperature gap
between the active and sleep state widens, thermal stress tends to dominate
10.0 11.0 12.0 13.0 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2
power [W] MTTF[year]
T=80C T=90C T=100C T=110C T=120C
- Any management policy must address both power consumption and
temperature management
De Micheli 28
Summary: power and thermal management
- Power and temperature management addresses:
– Battery lifetime extension for mobile systems – Component lifetime extension because of reliability enhancement
- Management policies become increasingly more
important as geometries scale down:
– More devices dissipate more power (per unit area) – Smaller devices are more prone to fail
De Micheli 29
Outline
- The nanotechnology challenge
– Variability management – Error tolerance
- The energy challenge
– Temperature management
- The communication bottleneck
– Networks on chips
- A vision and conclusions
De Micheli 30
Why on-chip networks?
- Provide a structured methodology
for realizing on-chip communication
– Modularity – Flexibility
- Cope with inherent limitations of busses
– Performance and power of busses do not scale up
- Support reliable operation
– Layered approach to error detection and correction
PE Network Interface Packets Routes
De Micheli 31
NoC multi-processors: the RAW architecture
- Fully programmable SoC
– Homogenous array of tiles:
- Processor cores
with local storage
- Each tile has a router
[Agrawal MIT]
- The raw architecture is exposed to the compiler
– Cores and routers are programmable – Compiler determines which wires are used at each cycle – Compiler pipelines long wires
The BONE roadmap
[Source: KAIST]
De Micheli 33
Metrics for NoC design
- Low communication latency
– Streamlined control protocols – Data and control signals can be separate
- High communication bandwidth
– To support demanding SW applications
- Low energy consumption
– Wiring switched capacitance dominates – Local storage in register buffers is expensive
- Error resiliency
– To compensate/correct electrical-level errors
- Flexibility and programmability
De Micheli 34
Flexibility in NoC design
- NoCs have modular structure
– Core interfaces – Switches/routers – High-speed links
- NoCs can be tailored to
applications
– Topology selection – Switch/link sizing – Protocols
- Several parameters for optimization
and a large design space
– NoC synthesis and optimization
CPU Memory DSP Memory
link switch network interface
CPU
De Micheli 35
Netchip tool flow
Topology Synthesis includes: Floorplanner NoC Router
Communication characteristics NoC Area models User
- bjectives
NoC Power models User Constraints
SUNFLOOR
System specs RTL Simulation SystemC code FPGA Emulation Synthesis IP Core models
To fab
Placement& Routing Codesign Simulation
Application
Platform Generation
NoC component library Platform Generation (xpipes- Compiler)
xpipes
[Source: Murali]
De Micheli 36 Processor- memory cluster
SUNFLOOR vs. manual design
multimedia chip with 30 cores
P-processors, M-private memories, T-traffic generators, S-shared slaves Hand-mapped topology SUNFLOOR custom topology
Bi-directional links Bi-directional links Uni-directional links
De Micheli 37
Design layouts
Hand-design (custom mesh) SUNFLOOR Design From Cadence SoC Encounter
De Micheli 38
SUNFLOOR vs. manual design
Manual design:
- Topology: 5x3 mesh
(15 switches)
- Operating frequency:
885 MHz (post-layout)
- Power consumption:
368 mW
- Floorplan area:
35.4 mm2
- Design time: several
weeks
- 0.13 µm technology
Manual design:
- Topology: 5x3 mesh
(15 switches)
- Operating frequency:
885 MHz (post-layout)
- Power consumption:
368 mW
- Floorplan area:
35.4 mm2
- Design time: several
weeks
- 0.13 µm technology
SUNFLOOR:
- Topology: custom
(8 switches)
- Operating frequency:
885 MHz (post-layout)
- Power consumption:
277 mW (-25%)
- Floorplan area:
37 mm2 (+4%)
- Design time: 4 hours
design to layout
- 0.13 µm technology
SUNFLOOR:
- Topology: custom
(8 switches)
- Operating frequency:
885 MHz (post-layout)
- Power consumption:
277 mW (-25%)
- Floorplan area:
37 mm2 (+4%)
- Design time: 4 hours
design to layout
- 0.13 µm technology
Benchmark execution times comply with application requirements and, in fact, are even 10% better on the SUNFLOOR topology
De Micheli 40
Summary: networks on chips
- Networks on Chips are the structured
interconnect of the future
– NoCs exists in many forms and flavors
- Networks on Chips are necessary for chips
designed in 45 nm technology and beyond
– NoCs deal with wire delay variability – NoCs provide reliability enhancement
De Micheli 41
Outline
- The nanotechnology challenge
– Variability management – Error tolerance
- The energy challenge
– Temperature management
- The communication bottleneck
– Networks on chips
- Vision and conclusions
De Micheli 42
A vision for the future
- Mobile, ubiquitous, pervasive computing
– Ultra low-power demands low-voltage operation
- High performance requires parallel computation
- High reliability is achieved by redundancy
- New paradigm for computation:
– Array-based computation (e.g., RAW) – Array-oriented communication (NoC)
De Micheli 43
Conclusions
- High-performance, ultra low-power, reliable circuits will be
required by distributed embedded systems
- Novel nanotechnologies will provide us with unprecedented
levels of functional integration and performance
- Novel design tools and methodologies will be needed to
leverage the technology:
– Cope with variability, reliability, thermal and other issues
De Micheli 44
It is only a beginning … … the challenges are still ahead
[Source: Kubrick]