Integration, Architecture, and Applications of 3D CMOS Memristor - - PowerPoint PPT Presentation

integration architecture and applications of 3d cmos
SMART_READER_LITE
LIVE PREVIEW

Integration, Architecture, and Applications of 3D CMOS Memristor - - PowerPoint PPT Presentation

Integration, Architecture, and Applications of 3D CMOS Memristor Circuits K. T. Tim Cheng and Dimitri Strukov Univ. of California, Santa Barbara ISPD 2012 1 3D Hybrid CMOS/NANO add-on nanodevices layer top nanowire level CMOS


slide-1
SLIDE 1

K.‐T. Tim Cheng and Dimitri Strukov

  • Univ. of California, Santa Barbara

ISPD 2012

1

Integration, Architecture, and Applications of 3D CMOS‐ Memristor Circuits

slide-2
SLIDE 2

3D Hybrid ‐ CMOS/NANO

  • CMOS stack + nano add-on
  • nanowire crossbar of two-terminal devices (memristors)

add-on CMOS stack bottom nanowire level top nanowire level similar two-terminal nanodevices at each crosspoint

2

nanodevices layer CMOS layer

slide-3
SLIDE 3

Resistive Switching “Memristive” Devices

(latching switches, a.k.a. resistive switches, a.k.a. programmable diodes, a.k.a. memristive switches)

3

+ Wide range of material systems and physical phenomena

  • J. Yang Iet al. Natue Nano, (2008)

<50 ns ‐200 ‐100 100 200 Current ( uA ) ‐2 ‐1 1 2 Voltage ( V ) 50 nm hp Pt Pt TiO2 TiOx

V + ‐

slide-4
SLIDE 4

Area‐Distributed “CMOL” Interfaces

4

CMOS stack (just a cartoon) interface pins gold nanowire levels (nanoimprint) MOSFET nanodevices (latching switches) interface via (“pin”) Si wafer

  • K. Likharev (2004, 2005); D. Strukov and K. Likharev (2006)

Tip radii 2-10 nm

http://www.oxfordplasma.de/ process/sibo_wtc.htm

slide-5
SLIDE 5

AFOSR‐MURI HyNano: 3D Hybrid CMOS‐Nano Circuits

5

slide-6
SLIDE 6

The HyNano Team

Michael Chabinyc Materials, UCSB Tim Cheng ECE, UCSB (Director) Susanne Stemmer Materials, UCSB Dmitri Strukov ECE, UCSB Marivi Fernandez‐Serra Physics, Stony Brook Qiangfei Xia ECE, UM Amherst Konstantin K. Likharev Physics, Stony Brook Luke Theogarajan ECE, UCSB Wei Lu EECS, Michigan

6

slide-7
SLIDE 7

APPLICATIONS ARCHITECTURES/CIRCUITS

solid electrolyte metal oxide

  • rganic

a‐Si nanoimprint e‐beam

3D CMOS/nano circuits

  • w. area‐distributed interface

3D hybrid memories information processing mixed‐signal CrossNets

compact models

DEVICES MATERIALS

Project Overview

7

Optical lithography

3D hybrid SoC drift diffusion and ab‐initio modeling reproducible, high‐performance, high‐endurance devices

slide-8
SLIDE 8

Thrust Area #1: Application/Architecture/Ckt Exploration

  • Memory arrays for high‐performance computing
  • CMOL‐based FPGA
  • Neuromorphic networks for bio‐inspired information

processing

  • Evolvable analog circuits
  • Tunable bias network for analog design
  • Weighted multiply and add circuits
  • High precision Digital‐to‐Analog converter

8

slide-9
SLIDE 9

“CMOL” Interface – Integrating CMOS with Crossbar Memory Array

CMOS stack (just a cartoon) interface pins gold nanowire levels (nanoimprint) MOSFET nanodevices (latching switches) interface via (“pin”) Si wafer

slide-10
SLIDE 10
  • There are two types of pins – Blues and Reds
  • Each array of pins has its own decoding scheme

Addressing Crossbar Memory Array

Double decoding scheme:

  • An array of N2 blue pins

uniquely accessed with 2N control signals.

  • Another 2N control signals for

the corresponding N2 red pins

slide-11
SLIDE 11

Double Decoding Scheme

  • Four decoders:

11

data I/O memory cell array select decoder demux mux/demux select decoder

slide-12
SLIDE 12

Crossbar Construction – Top View

slide-13
SLIDE 13

Crossbar Construction – Top View

slide-14
SLIDE 14

Crossbar Construction – Side View

CMOS stack (just a cartoon) interface pins gold nanowire levels (nanoimprint) MOSFET nanodevices (latching switches) interface via (“pin”) Si wafer

slide-15
SLIDE 15

Crossbar Construction – Top View

slide-16
SLIDE 16

Crossbar Construction – Bottom Level

slide-17
SLIDE 17

Crossbar Construction – Top Level

slide-18
SLIDE 18

Crossbar Construction

slide-19
SLIDE 19

Crossbar Construction

slide-20
SLIDE 20

Crossbar Construction

Connectivity Domain

slide-21
SLIDE 21

Crossbar Construction

slide-22
SLIDE 22

Crossbar Construction

slide-23
SLIDE 23

Address space provided by yellow cells is wasted!

Unused Address Space

The red pin can only interact with blue pins in its connectivity domain

slide-24
SLIDE 24

Key Geometric Parameters

  • Distance between nanowires is 2FNANO
  • Size of cell is 2βFCMOS
  • β2 = r2 + 1 where r is an odd integer > 1.
  • Crossbar is tilted by an angle α equal to ArcTan(1/r)

with respect to the pins.

  • # of reachable crosspoints per wire segment is r2 – 1
slide-25
SLIDE 25

Crossbar Construction – Bottom Level

slide-26
SLIDE 26

Adding a Second Crossbar Layer

Connectivity domain in the first crossbar layer Connectivity domain in the second crossbar layer

The mapping is done through pin translation wires Blue pins are common to all crossbar layers. Red pins are "redefined" for each layer using the pin translation wires.

slide-27
SLIDE 27

First layer of red pins.

slide-28
SLIDE 28

First layer of red and blue pins.

slide-29
SLIDE 29

Layer of (bluish) wires connected to the blue pins.

slide-30
SLIDE 30

Single (orange) wire connected to a red pin. The cross‐points with the bottom wires are shown in green.

slide-31
SLIDE 31

First complete crossbar layer.

slide-32
SLIDE 32

A single pin translation wire (in yellow).

slide-33
SLIDE 33

Every orange wire is “translated” into another point using the same type

  • f pin translation wire.
slide-34
SLIDE 34

The first crossbar layer with its pins translation wires are then “buried” in SiO2

slide-35
SLIDE 35

We start to build the next crossbar layer (bluish wires)

slide-36
SLIDE 36

We start to build the next crossbar layer (bluish wires)

slide-37
SLIDE 37

We add the orange wires (the cross‐points are formed)

slide-38
SLIDE 38

And we add the pins translation wires and repeat the process…

slide-39
SLIDE 39

Maximum Number of Layers

  • Each layer has N2 cells.
  • There are r2 – 1 cross points per cell.
  • That gives us a total of N2(r2 – 1) cross points

per layer.

  • The double decoding scheme allows us to

address up to N4 locations

  • Which means that we can (potentially) have

up to N2/(r2 – 1) crossbar layers.

slide-40
SLIDE 40

How Does it Stand Up as a Memory?

40

Memristor PCM STTRAM DRAM Flash HDD Density (F2) <4 8–16 37–64 6–8 4–6 2/3 Energy per bit† (pJ) 0.1–3 2–27 0.1 2 10000 1–10x109 Read time (ns) 10-100(?) 20–70 10–30 10–50 25000 5–8x106 Write time (ns) ~10 50–500 13–95 10–50 200000 5–8x106 Retention years years weeks?

<<second

years years Endurance (cycles) >1012 107 1015 1015 106 104

slide-41
SLIDE 41

If Successful, 3D Hybrids Can Achieve…..

  • Unprecedented memory density

– Footprint of a nano‐device is 4Fnano

2/K, for K

vertically integrated crossbar layers – Potentially up to 1014 bits on a single 1‐cm2chip

  • Enormous memory bandwidth

– Potentially up to 1018 bits/second/cm2

  • At manageable power dissipation
  • With abundant redundancy for yield/reliability

41

slide-42
SLIDE 42

Thrust Area #1: Application/Architecture/Ckt Exploration

  • Memory arrays for high‐performance computing
  • CMOL‐based FPGA
  • Neuromorphic networks for bio‐inspired information

processing

  • Evolvable analog circuits
  • Tunable bias network for analog design
  • Weighted multiply and add circuits
  • High precision Digital‐to‐Analog converter

42

slide-43
SLIDE 43

CMOL‐Based FPGA

  • Programming for xpoint memristors similar to CMOL digital

memories

  • Uniform fabric with CMOS inverter cells
  • Crossbar wires for routings

43

A B A+B cell A B F CMOS inverter nanodevices A+B A RON Rpass Cwire B

slide-44
SLIDE 44

Density: CMOS vs. CMOL

44

Metrics (units) 2009 2010 2011 2012 2013 Comments Half-pitch FCMOS (nm) 50 45 40 36 32 In accordance with ITRS Half-pitch Fnano (nm) 20 18 16 14 12

  • CMOS memories (Gbits/cm2)

6.7 8.2 10.5 13 16 Follows ITRS (with A = 6F2

CMOS)

CMOL memories (Gbits/cm2) 4 10 23 36 67 Initial progress impacted by q CMOS FPGA (Mgates/cm2) 0.4 0.5 0.6 0.8 1.0 Rescaled from 0.18 μm rules CMOL FPGA (Mgates/cm2) 625 775 1,000 1,200 1,500

  • Metrics (units)

2016 2019 2022 2025 2028 Comments Half-pitch FCMOS (nm) 30 28 26 24 22 Grows slower than in ITRS Half-pitch Fnano (nm) 10 6 4 3.5 3

  • CMOS memories (Gbits/cm2)

18 21 25 29 35 Follows A = 6F2

CMOS

CMOL memories (Gbits/cm2) 100 350 900 1,200 1,700 Spectacular progress at lower q CMOS FPGA (Mgates/cm2) 1.1 1.3 1.5 1.7 2.1 Rescaled from 0.18 μm rules CMOL FPGA (Mgates/cm2) 1,700 2,000 2,300 2,700 3,200

slide-45
SLIDE 45

Thrust Area #1: Application/Architecture/Ckt Exploration

  • Memory arrays for high‐performance computing
  • CMOL‐based FPGA
  • Neuromorphic networks for bio‐inspired information

processing

  • Evolvable analog circuits
  • Tunable bias network for analog design
  • Weighted multiply and add circuits
  • High precision Digital‐to‐Analog converter

45

slide-46
SLIDE 46

Thrust Areas: # 2: High‐Performance/‐Yield Devices # 3: 3D Hybrids Integration

Using:

  • Nanoimprint
  • E‐beam lithography
  • Optical lithography
  • Heterogeneous wafer‐level

integration

46

(a) (b)

50 μm 100 μm

<20nm Overlay Alignment (Xia) E‐Beam Crossbar Arrays (Lu)

Integrating CMOS with devices

  • f different materials:
  • a‐Si
  • Metal oxide
  • Organic
  • Solid‐state electrolyte
slide-47
SLIDE 47

Integrated Crossbar Array/CMOS System

47

PI: Lu

Kim et al. Nano Lett., 12, 389–395 (2012).

Integrated crossbar/CMOS chip with probe card attached

CMOS Crossbar array

slide-48
SLIDE 48

Integrated Crossbar Array/CMOS System

48

slide-49
SLIDE 49

Performance of a‐Si and Metal‐Oxide Device Array

  • Tight distribution from 256 devices measured
  • Devices shown excellent on/off and intrinsic diode

characteristics

49

100nm

filament

“on” “off”

slide-50
SLIDE 50

APPLICATIONS ARCHITECTURES/CIRCUITS

solid electrolyte metal oxide

  • rganic

a‐Si nanoimprint e‐beam

3D CMOS/nano circuits

  • w. area‐distributed interface

3D hybrid memories information processing mixed‐signal CrossNets

compact models

DEVICES MATERIALS

Project Overview

50

Optical lithography

3D hybrid SoC drift diffusion and ab‐initio modeling reproducible, high‐performance, high‐endurance devices

slide-51
SLIDE 51

BACKUP SLIDES

51

slide-52
SLIDE 52

Thrust Area #3:

High‐Performance/‐Yield/‐Reproducibility Devices

  • a‐Si (Lu)
  • Metal oxide (Stemmer, Xia)
  • Organic (Chabinyc)
  • Solid‐state electrolyte (Lu)

52

slide-53
SLIDE 53

a‐Si Memristive Devices and Arrays

53

Lu et al., Nano Lett. (2008, 2009)

53

2.9 2.6 3.2 3.5 3.8 4.1 4.4 10 20 30 40 50 Vth (V)

# of devices

PI: Lu

100nm

  • 4
  • 2

2 4 6 0.0 0.5 1.0 1.5 2.0 2.5 3.0 1st cycle After 2nd cycle Voltage (V)

  • 4 -2

2 4 10

  • 12

10

  • 6

10

  • 7

10

  • 8

10

  • 9

10

  • 10

10

  • 11

Current (100nA)

Voltage

filament

“on” “off”

slide-54
SLIDE 54

Project Organization

UCSB

CMOS circuit design for CMOL integration MBE fabrication

  • f memristive

devices; Organic memristive devices Memristive device modeling Digital and analog 3D hybrid circuit architectures

Cheng, Strukov, Theogarajan Stemmer Chabinyc Strukov Cheng, Strukov, Theogarajan

  • U. Michigan

UMass Stony Brook University

a-Si & solid electrolyte devices; 3D integration with CMOS Metal oxide memristive devices; 3D integration with CMOS Ab-initio simulation of memristive devices Mixed-signal neuromorphic 3D hybrid circuit architectures

Lu Xia Fernandez-Serra, Likharev Likharev

<------- experiment ---------> <---------theory/modeling------->

54

slide-55
SLIDE 55

Crossbar Architecture

V V A =Vr =Vw Read Write i v vw ‐vw vr

V V

Vr/2

=

V V =Vr/2 V V

Vw/2

=

V V =Vw/2

July 2011 MURI Kickoff 55

bottom (nano)wire level similar two‐terminal devices at each crosspoint top (nano)wire level

‐ Xbar to preserve density ‐ Passive (no transistors) but nonlinear I‐ V ‐ Common way (from periphery) CMOS for decoding and sensing

slide-56
SLIDE 56

Generic Memory Array

  • Asserting a word line makes the access element to place the

contents of the memory element in the bit line. A particular bit is then selected with a MUX.

Memory element Access element decode r multiplex er

slide-57
SLIDE 57

Generic Memory Array

  • An array of N2 memory elements can be uniquely accessed

using 2N control signals (word+bit lines).

Other representation

  • f the same array
slide-58
SLIDE 58

Area‐Distributed “CMOL” Interfaces (II)

58

Most important feature: pin array tilt by angle  = arcsin(Fnano/FCMOS) = arctan(1/r) Every nanowire (and hence every crosspoint) may be addressed from CMOS!

2FCMOS pin 2A 2Fnano  pin 1 pin 2B 2rFnano

A B

  • K. Likharev (2004, 2005); D. Strukov and K. Likharev (2006)
slide-59
SLIDE 59

A Possible Solution

With this particular connectivity domain geometry (r=3), we can cover all the plane... But that is not always the case. The pin translation wires are another layer of wires on top of the crossbar We can add more crossbar layers by simply inserting a layer of pin translation wires between them.

slide-60
SLIDE 60

Crossbar Analysis

The crossbar is rotated by an angle α such that: Where r is an integer (an odd integer greater than 1). Once we set r and β (the CMOS cell complexity), the angle α and Fnano are fixed as well the length of the wires in the crossbar and the number of memristive devices reachable per wire segment.

slide-61
SLIDE 61

Crossbar Analysis

The parameter r also sets the maximum, minimum and average paths the electric signals have to propagate to access a bit (a memristive device). This paths are given by: Maximum (worst) case: 2Fnano * (r2 - r + 1) Minimum (best) case: 2Fnano * r Average (real) case: 2Fnano * (r2 + 1)/2

slide-62
SLIDE 62

3D Hybrid Integration with Multi‐Layer Crossbars

  • D. Strukov and R. S. Williams (2009)

1 2 3 4 5 A B C D E 1 2 3 4 5 A B C D E

N2access devices/vias

~N2β2

crosspoint devices per layer (out of N4 total) crosspoint device in 1st layer crosspoint device in 2nd layer crossbar layer via translation layer CMOS layer N data/control lines via translation wires connectivity domain in 1st layer connectivity domain in 2nd layer 62

slide-63
SLIDE 63

APPLICATIONS (Cheng, Likharev, Strukov, Theogarajan) ARCHITECTURES/CIRCUITS (Cheng, Likharev, Strukov, Theogarajan)

solid electrolyte (Lu) metal oxide (Xia, Stemmer)

  • rganic

(Chabinyc) a‐Si (Lu) nanoimprint (Xia, Chabinyc) e‐beam (Lu)

3D CMOS/nano circuits

  • w. area‐distributed interface

3D hybrid memories information processing mixed‐signal CrossNets

compact models (Likharev, Strukov)

DEVICES MATERIALS

Project Overview

(Fernandez‐ Serra, Strukov)

63

Optical lithography (Strukov, Stemmer)

3D hybrid SoC drift diffusion and ab‐initio modeling reproducible, high‐performance, high‐endurance devices

slide-64
SLIDE 64

64

IC Applications Continue to Demand More Memory and Higher Bandwidth

Chip Size

100 200 300 400 500 600

PA PM Peripheral I/O Controller Baseband Networking BT/WiFi GPU, CPU, Chipset & FPGA Memory Application Processor Transceiver Discrete

I/O

Switch

High High

For most applications running on high‐ end SoCs, amount of available memory and memory bandwidth have been and will continue to be the bottlenecks

slide-65
SLIDE 65

Bistable Two‐Terminal Devices

(latching switches, a.k.a. resistive switches, a.k.a. programmable diodes, a.k.a. memristive switches)

Demonstrated with many materials; no clear winner yet; few reproducibility reports, e.g.: Si /α-Si / M: Ti / Pt / TiO2 / Pt:

  • S. H. Jo and W. Lu (2008)
  • Q. Xia et al. (2009);
  • J. Borghetti et al. (2010)

65

slide-66
SLIDE 66

Device Requirements Vary for Different Ckts/Architectures/Applications

Dynamic range

  • f resistance

Signal DC AC Small Tuning ‐ Large Memory, FPGA, DAC MAC

66

slide-67
SLIDE 67

(c) (d) (e) (a) (b)

Xia, Strukov et al. (2009)

CMOS‐CMOL Integration: Initial Demonstration

67

PI: Xia