Stress Aw are Active Area Sizing, Gate Sizing and Repeater Insertion - - PowerPoint PPT Presentation

stress aw are active area sizing gate sizing and repeater
SMART_READER_LITE
LIVE PREVIEW

Stress Aw are Active Area Sizing, Gate Sizing and Repeater Insertion - - PowerPoint PPT Presentation

Stress Aw are Active Area Sizing, Gate Sizing and Repeater Insertion Ashutosh Chakraborty David Z. Pan ashutosh@cerc.utexas.edu dpan@ece.utexas.edu ECE Department, University of Texas at Austin Outline Intro. to source/drain (S/D) SiGe


slide-1
SLIDE 1

Stress Aw are Active Area Sizing, Gate Sizing and Repeater Insertion

Ashutosh Chakraborty David Z. Pan

ashutosh@cerc.utexas.edu dpan@ece.utexas.edu

ECE Department, University of Texas at Austin

slide-2
SLIDE 2

Outline

  • Intro. to source/drain (S/D) SiGe technology

Active Area (AA) aware Delay Model AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions

slide-3
SLIDE 3

Outline

  • Intro. to source/drain (S/D) SiGe technology

SiGe Active Area (AA) aware Delay Model AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions

slide-4
SLIDE 4

Stress/Strain Basics

Squeezing lattice produces compressive stress Pulling lattice apart produces tensile stress In direction of charge carrier flow,

› Compressive stress improves PMOS performance › Tensile stress improves NMOS performance › Larger stress means more performance benefit.

slide-5
SLIDE 5

Basics of S/D SiGe Technology

SiGe instead of Si in S/D regions. Imparts compressive strain Increase PMOS speed Popular? Feasible?

› Yes. Now routinely used by processor manufacturers. › AMD using it for 45nm, plans also for 32nm [RTP ’08] › Intel used it for 65nm and 45nm [IEDM ‘07] › Sony has used it [VLSI Symp ‘08] › Manufacturing cost up by only 4% [VLSI Symp 08]

Si Si Si

slide-6
SLIDE 6

S/D SiGe Aw are active area (AA) sizing

Factor affecting this mobility enhancement:

› Active area dimension (Lpp) › Concentration of Ge › Recess depth R

Layout designer can control active area (AA) size

› Traditionally, trained to minimize it › However, with S/D SiGe, increasing AA helps!

slide-7
SLIDE 7

Stress as function of SiGe AA size

Based on [IMEC, 2006] SiGe AA Increase (times) We used this setting

slide-8
SLIDE 8

Previous Works

Modeling SiGe AA increase impact

› Eneman, VLSI Symp ’05 › Simoen, Trans Elect. Dev. ’08 › Applied Materials ’07 report

SiGe AA aware layout optimization

› Chakraborty, DATE 08 › Joshi, ISPD 08 › Joshi, DAC 08

SiGe AA aware physical synthesis

› None existing. › This work targets this void.

RTL Physical Synthesis Layout Fabrication Fabrication Layout Physical Synthesis

slide-9
SLIDE 9

Motivational Example

You have a product without S/D SiGe (1 GHz) Soon will use S/D SiGe. (magically get 1.5 GHz) Is change required at physical synthesis stage?

› Gate sizing algorithms › Repeater insertion algorithms › Buffer planning tools

  • Yes. Must change these to exploit fully

› Approximately 10% lesser module delay › Approximately 10% lesser global interconnect delay › Can get 1.65 GHz!

slide-10
SLIDE 10

Outline

  • Intro. to source/drain (S/D) SiGe technology

SiGe Active Area (AA) aware Delay Model AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions

slide-11
SLIDE 11

Cell Delay Model Derivation

Analyze cell layout to obtain RC switch model

› Consider AA aware PMOS resistance values › Consider increased self-loading capacitances

Compute new fall and rise time Average fall and rise delay to get cell delay

S C D G W L Lpp Lpp S C D G W L L’pp L’pp

slide-12
SLIDE 12

PMOS Resistance Decrease

Stress Mobility (µ) RON (ON resistance) By curve fitting Relates PMOS RON decrease to SiGe AA increase. Value of “A” depends on Ge conc and recess

  • depth. For our settings, A = 3.4.
slide-13
SLIDE 13

Example [NAND Gate]

D(K) = RC ((F+1)(K+0.5* h+1) + 0.5) Characteristic Delay Equation

slide-14
SLIDE 14

For FO-4, 2X Lpp => 20% delay decrease

NAND Cell Delay (different fan-outs)

For FO-10, 2.5X Lpp => 25% delay decrease

D(K)/D(1)

SiGe AA Increase (K)

slide-15
SLIDE 15

Delay Decrease for other Gates

3 input NOR 2 input NOR 3 input NAND 2 input NAND INV Gate

  • 16.1%

RC ((F+1)(3K+0.5*h+1) + 3FK)

  • 16.4%

RC ((F+1)(2K+0.5*h+0.5) + FK)

  • 16.2%

RC ((F+1)(2K+0.5*h+1.5) + 1.5)

  • 21.9%

RC ((F+1)(K+0.5*h+1) + 0.5)

  • 17.9%

RC (F+1)(K+0.5*h+0.5) ΔD @ FO4 Characteristic Delay Equation

slide-16
SLIDE 16

Outline

  • Intro. to source/drain (S/D) SiGe technology

Cell Delay Model Derivation AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions

slide-17
SLIDE 17

Optimal Repeater Insertion (basics)

(basics)

Target: Minimize delay through interconnects. Divide a long interconnect into several parts. A

repeater is inserted to drive each of these.

AA sizing aware ORI: Apart from gate size,

number of repeaters, also determine optimal AA size of the repeater cell.

slide-18
SLIDE 18

AA Sizing Aw are Repeater Insertion

Source Sink R R R R R Repeater Insertion Length Per Unit Resistance Per Unit Capacitance # Of Repeaters Interconnect Length Repeater Sizing Rw Cw M L S

slide-19
SLIDE 19

Minimum Interconnect Delay

Minimizing the delay equation analytically…

slide-20
SLIDE 20

Results [ORI for Performance]

1.04*M (=> +4%) M # Repeaters 0.87*S (=> -13%) S Gate Size 1.7 (=> +70%) 1 AA Size Total Power Delay Metric 0.91*D (=> -9%) D AA Sizing + ORI Traditional ORI 1.1*Ptotal (=>+10%) Ptotal Thus, 9% better delay than the “optimal” repeater insertion solution without SiGe AA size change. What if the aim is not to maximize performance? i.e. iso-delay case (compared to traditional ORI)

slide-21
SLIDE 21

Results [Reducing # of interconnects]

Reduce no. of repeaters until AA sizing aware

sub-optimal repeater insertion delay is same as traditional post ORI delay.

45% reduction in number of repeaters!

› Very interesting for layout level timing closure stability

R R R R R R R R R R R R D < DORI D == DORI

slide-22
SLIDE 22

Outline

  • Intro. to source/drain (S/D) SiGe technology

Cell Delay Model Derivation AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions

slide-23
SLIDE 23

CGAS: Concurrent Gate and AA Sizing

Target: Minimize a convex objective

› Delay through the module, or › Power under delay budget, or other.

Determine gate size of each cell and its active

area sizing.

slide-24
SLIDE 24

CGAS: Formulation

Let tuple {S, C, K} represent gate size, input pin

capacitance, and active area sizing for a gate.

Delay of gate i:

slide-25
SLIDE 25

CGAS: Formulation

Are all constraints convex?

› All except first are convex trivially. › First can be proven to be posynomial (see paper) as long as fitting parameter A is >= 1

slide-26
SLIDE 26

Results [ CGAS on IWLS benchmarks ]

0.9 10.1 Average: 1.5 11.0 400 449 2007 C5315 1.3 13.9 233 270 3759 des 0.6 10.2 199 222 628 vda 0.3 9.8 236 262 481 large 0.4 6.4 687 734 2581 C7552 0.5 8.7 295 323 1163 k2 0.2 10.7 159 178 149 frg1 0.4 9.0 309 340 502 C880 3.2 11.0 1175 1320 3316 C6288 % ∆ Cap. %Perf Imprv Delay CGAS Delay GS Num Gates Bench More than 10% reduction with CGAS over traditional Gate Sizing (GS) Less than 1% capacitance increase due to larger active area

Note: All delay values are multiples of RC

slide-27
SLIDE 27

Outline

  • Intro. to source/drain (S/D) SiGe technology

Cell Delay Model Derivation AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions

slide-28
SLIDE 28

Conclusions

When moving to S/D SiGe, physical synthesis

must be revisited to extract maximum benefit.

Proposed SiGe AA sizing aware RC model with

cap increase and PMOS RON decrease.

For long global interconnects, with SiGe AA

sizing of repeaters, delay reduced further by 9%.

slide-29
SLIDE 29

Conclusions (contd…)

Or reduce repeater count by 45%. Break cycle:

timing analysis buffering layout legalization

Concurrent gate and SiGe AA sizing (CGAS)

proposed and proven as a convex problem.

For module delay reduction, CGAS reduces

delay by 10% over non-AA aware sizing.

slide-30
SLIDE 30

References

“Scalability of the SiGe S/D technology for the

45-nm technology node and beyond,” in IEEE Transactions on Electron Devices, July 2006.

L.Washington et al., “pMOSFET with 200%

mobility enhancement induced by multiple stressors,” Electron Device Letters, IEEE, vol. 27,

  • no. 6, pp. 511–513,June 2006
  • S. Boyd et al. , Convex Optimization. Cambridge
  • Univ. Press, March 2004.
slide-31
SLIDE 31

Notation

In the rest of the work:

Increasing the active area of a gate by K times reduces its PMOS’s resistance by F times. These are related by the formula A = 1 : PMOS resistance independent of K A < 1 : PMOS resistance increases with higher K A > 1 : PMOS resistance reduces with higher K In our curve fit, A = 3.4

slide-32
SLIDE 32

Optimal K Value

Depends on fabrication technology

› i.e. on single fitting parameter A

slide-33
SLIDE 33

Flow Used for CGAS

Benchmark Optmz and Tech map in SIS 2-nand 2-nor inv decomposition used C++ tool writes out the constraints MOSEK + AMPL solvers report results

slide-34
SLIDE 34

Link to the paper

slide-35
SLIDE 35

S C D G W L Lpp Lpp S C D G W L L’pp L’pp

slide-36
SLIDE 36

AA Sizing Aw are Repeater Insertion

Source Sink R R R R R Repeater Insertion Length Per Unit Resistance Per Unit Capacitance # Of Repeaters Interconnect Length Repeater Sizing Rw Cw M L S