Working Around the Limits of Working Around the Limits of CMOS - - PowerPoint PPT Presentation

working around the limits of working around the limits of
SMART_READER_LITE
LIVE PREVIEW

Working Around the Limits of Working Around the Limits of CMOS - - PowerPoint PPT Presentation

Working Around the Limits of Working Around the Limits of CMOS CMOS CMOS CMOS Mary Jane Irwin, Penn State Mary Jane Irwin, Penn State NSF Workshop: Electronic Design Automation Past, Present, and Future Past, Present, and Future


slide-1
SLIDE 1

Working Around the Limits of Working Around the Limits of CMOS CMOS CMOS CMOS

Mary Jane Irwin, Penn State Mary Jane Irwin, Penn State

NSF Workshop: Electronic Design Automation – Past, Present, and Future Past, Present, and Future July 8 and 9, 2009

slide-2
SLIDE 2

Ab t t Ab t t Abstract Abstract

 

The design constraints of improved performance better energy efficiency increased reliability and

The design constraints of improved performance, better energy efficiency, increased reliability, and constrained design costs challenge EDA researchers as silicon technology continues to scale according to Moore’s Law. However, there are functions that our “standard” silicon technology – CMOS – just doesn’t do well. For functions such as global interconnects, on-chip non-volatile memory, and massive (high bandwidth) input/output, technologies other than CMOS combined with 3D integration holds great

  • promise. For example, a network-on-chip in a second layer exploiting optical and/or RF technology can

provide high performance energy efficient and reliable global interconnects SRAM/DRAM memory provide high performance, energy efficient, and reliable global interconnects. SRAM/DRAM memory stacking allows massively parallel memory access helping to mitigate the memory wall and dramatically reducing the large off-chip memory energy consumption. Additionally, stacking emerging non-volatile memory which is immune to radiation-induced soft errors can provide on-chip non-volatile storage while consuming zero standby power. Stacked layers of chemoresistive sensors, mass-sensitive nanoresonators, and biologically-selective FETs fabricated via a directed-assembly approach can provide radically new input/output mechanisms. input/output mechanisms.

 

But to achieve the promise of 3D integration as a way to sustain Moore’s law as well as to enable More- than-Moore requires advances by the EDA community working with the design community, as well as interdisciplinary efforts with chemist, biologists, and material scientists. Fundamental research challenges for the designer include determining a functional partitioning that maximizes the benefits of vertical g g p g connections while achieving optimal performance and energy efficiency, designing the interface circuitry between the CMOS “brains” and the non-CMOS technologies, and ensuring temperature stability across and between layers. T

  • meet these challenges, design methodologies and design tools necessary to

implement and simulate/validate 3D architectures which integrate these new technologies and must be developed.

–July 22, 2008 –2

slide-3
SLIDE 3

EDA Designer EDA Researcher

–July 22, 2008 –3

slide-4
SLIDE 4

Bookkeeping fabric Bookkeeping fabric Bookkeeping fabric Bookkeeping fabric

 No need to be blazingly

fast

 “Old” CMOS suffices

(>250nm)

  • Better reliability

L l k

  • Less leakage
  • No or little process

variation variation

  • Existing tools (mostly)

work fine

–July 22, 2008 –4

slide-5
SLIDE 5

Adding compute power fabric Adding compute power fabric Adding compute power fabric Adding compute power fabric

 Take advantage of scaling

for compute power (<90nm) No hi hl s sce tible

 Now highly susceptible

to faults, variation, leakage, etc. g

  • Must be dynamically

reconfigurable

  • Must have to have a way
  • Must have to have a way

to monitor the “health” of the compute elements

–July 22, 2008 –5

slide-6
SLIDE 6

variability reliability probabilistic behavior

Picture of SET PADOX Structure : Fabrication method for IC-oriented Si single-electron transistors Ono, Y.; Takahashi, Y.; Yamazaki, K.; Nagase, M. Namatsu, H.; Kurihara, K.; Murase, K.

slide-7
SLIDE 7

Monitoring the compute fabric Monitoring the compute fabric Monitoring the compute fabric Monitoring the compute fabric

 Performance/power/

p fault “sensors”

  • Hardware counters
  • Temperature sensors

C t l k b

 Control knobs

  • Turn off idle and faulty

cores & links cores & links

  • Apply DVFS

–July 22, 2008 –7

slide-8
SLIDE 8

NBTI and oxide wearout sensors NBTI and oxide wearout sensors NBTI and oxide wearout sensors NBTI and oxide wearout sensors

LIFO + Control Bits DUT

  • L. Co

14.4

20b Register 20b Register 20b Register 20b Counter

Oscillator nverters

21.42 m 40 m 21.42 m

DUT

P1 St d D i Stressed Device P0 T emperature Calibration Device

NAND Oscillator Level Converters

slide-9
SLIDE 9

NBTI sensors NBTI sensors NBTI sensors NBTI sensors

164 m 164 m 164 m 164 m 164 m 164 m

10 12

Stress Phase Recovery Phase

 Other kind of

LIFO + Scan Control 1 LIFO + Scan Control LIFO + Scan Control 1 LIFO + Scan Control 1 LIFO + Scan Control LIFO + Scan Control 1 LIFO + Scan Control 1 LIFO + Scan Control LIFO + Scan Control 1

6 8 10

(mV)

sensors

 Number and

16 Oxide Sensors 16 Oxide Sensors 96 m 16 Oxide Sensors 16 Oxide Sensors 16 Oxide Sensors 16 Oxide Sensors 96 m 16 Oxide Sensors 16 Oxide Sensors 96 m 16 Oxide Sensors 16 Oxide Sensors 16 Oxide Sensors 16 Oxide Sensors 96 m 16 Oxide Sensors 16 Oxide Sensors 96 m 16 Oxide Sensors 16 Oxide Sensors 16 Oxide Sensors 16 Oxide Sensors 96 m

2 4 6

Vth (

Vstress = 1.7V Temperature = 130C

location of sensors A ti f

Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors Sensors

DISCRETES DISCRETES DISCRETES DISCRETES

2 50 100 150 200

Time (103 s)

Temperature = 130C Sampling Rate = 10s

 Aggregation of

sensor information

196 m 144 Oxide Sensors 96 NBTI Sensors 1 00 196 m 196 m 144 Oxide Sensors 96 NBTI Sensors 1 00 196 m

information

m m m 1500 m m m m m 1500 m m

– David Blaauw, UMich

slide-10
SLIDE 10

Adding a communication network Adding a communication network Adding a communication network Adding a communication network

 By moving from 2D to 3D

have many more close neighbors

 Design space exploration  Design space exploration

  • Many TSV’s → higher

bandwidth → lower yield

 Other interconnect

technologies like RF and g

  • ptical
  • Interface circuitry issues
  • Thermal issues
  • Thermal issues

–July 22, 2008 –10

slide-11
SLIDE 11

3D challenges 3D challenges 3D challenges 3D challenges

Architecture/ Design/

Memory-stacked CMP 3D applications Thermal i l ti

Design/ Applications

3D uProc 3D applications

System-level design analysis tools

Cost-driven 3D simulation Physical design Scan

Design automation

Cost-driven 3D EDA tools OpenAccess 3D EDA tools Clocktree T est access Scan design Pre bond

T esting and design-for- testability

synthesis T est scheduling Fault models Pre-bond wafer/die test

testability Process

–11

Nearly Mature Research initiated, Some tools/techniques available No solutions available, research needed

– IBM 11/2002

slide-12
SLIDE 12

Adding (nonvolatile) memory Adding (nonvolatile) memory Adding (nonvolatile) memory Adding (nonvolatile) memory

 Use 3D memory stacking

to take advantage of the increased bandwidth and reduced latency

 Will probably require a

redesign of the memory

  • rganization/interface
  • rganization/interface

 Stacking NVM (MRAM,

PCRAM, …)

  • Instant on/off, rad hardened
  • Extremely low leakage
  • Long write latencies and large
  • Long write latencies and large

write energy

–July 22, 2008 –12

slide-13
SLIDE 13

MRAM (L2 cache) stacking MRAM (L2 cache) stacking MRAM (L2 cache) stacking MRAM (L2 cache) stacking

2M-SRAM-SNUCA 2M-SRAM-DNUCA 8M Hybrid DNUCA 8M-MRAM-DNUCA 0 8 1 2M SRAM SNUCA 2M SRAM DNUCA 8M Hybrid DNUCA 8M MRAM DNUCA 0.6 0.8 0.2 0.4

galgel apsi equake fma3d swim streamcluster

–13

T

  • tal Power Comparison

– Yuan Xie, PSU

slide-14
SLIDE 14

Adding I/O Adding I/O Adding I/O Adding I/O

 “Traditional” I/O with

  • ptical device stacking

– HP Labs ISCA 08 HP Labs, ISCA 08

 Nontraditional I/O

–July 22, 2008 –14

slide-15
SLIDE 15

Nontraditional I/O Nontraditional I/O Nontraditional I/O Nontraditional I/O

 Chemical sensors

Chemical sensors

 Electrofluidic nanowire and

nanobead self assembly

– Tom Mallouk, Chemistry, PSU –Theresa Mayer, EE, PSU y

–15

slide-16
SLIDE 16

A A nano nano nose application nose application A A nano nano nose application nose application

 Input – digitized

p g responses from (100x100) gas sensor array

 Memory – threshold

l value storage

 Compute fabric –

neighborhood neighborhood aggregation (e.g., systolic array) systolic array)

–July 22, 2008 –16

slide-17
SLIDE 17

More nontraditional I/O More nontraditional I/O More nontraditional I/O More nontraditional I/O

 Biomedical imaging

g g

  • Measure tiny magnetic fields

 Use magnetoelectric

g sensors (Magnetic field →

Strain → Electric field) M i i ( i l

  • Magnetostriction (materials

that change their shape in response to a magnetic field) p g )

  • Piezoelectricity (generate

electric potential in response to applied mechanical stress)

–July 22, 2008 –17

to applied mechanical stress)

slide-18
SLIDE 18

Integrated sensor transistor Integrated sensor transistor Integrated sensor transistor Integrated sensor transistor

Fig.4a Integrated sensor systems with associated parastics g g y p Fig.4b Process sequence for the fabrication and direct integration of

  • Low parasitics from

elimination of lossy cable

the magnetoelectric cantilever structure. The resonant

  • scillation of the

cantilever structure

elimination of lossy cable

  • Very low external noise
  • Easy batch fabrication for

is capacitively coupled to the floating gate, triggering the transistor drain –18

array demonstration

transistor drain current response.

– Suman Datta, PSU

slide-19
SLIDE 19

Adding a power supply Adding a power supply Adding a power supply Adding a power supply

 Energy buffer (battery or

gy ( y capacitor)

  • Recharge issues (especially

if the device is implanted)

 Energy scavenging

S RF ib i

  • Stray RF sources, vibration,

kinetic energy, thermoelectric generators, g micro wind turbines, etc. (en.wikipedia.org/wiki/Ener gy harvesting ) gy_harvesting )

–July 22, 2008 –19

slide-20
SLIDE 20

Material Material Scientists/Engr. Chemists, Physicists Application Designer EDA Researcher pp Domain Experts D i Device Physicists / Fabricators

–July 22, 2008 –20