SLIDE 1
Stress Aw are Active Area Sizing, Gate Sizing and Repeater Insertion
Ashutosh Chakraborty David Z. Pan
ashutosh@cerc.utexas.edu dpan@ece.utexas.edu
ECE Department, University of Texas at Austin
SLIDE 2 Outline
- Intro. to source/drain (S/D) SiGe technology
Active Area (AA) aware Delay Model AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions
SLIDE 3 Outline
- Intro. to source/drain (S/D) SiGe technology
SiGe Active Area (AA) aware Delay Model AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions
SLIDE 4
Stress/Strain Basics
Squeezing lattice produces compressive stress Pulling lattice apart produces tensile stress In direction of charge carrier flow,
› Compressive stress improves PMOS performance › Tensile stress improves NMOS performance › Larger stress means more performance benefit.
SLIDE 5
Basics of S/D SiGe Technology
SiGe instead of Si in S/D regions. Imparts compressive strain Increase PMOS speed Popular? Feasible?
› Yes. Now routinely used by processor manufacturers. › AMD using it for 45nm, plans also for 32nm [RTP ’08] › Intel used it for 65nm and 45nm [IEDM ‘07] › Sony has used it [VLSI Symp ‘08] › Manufacturing cost up by only 4% [VLSI Symp 08]
Si Si Si
SLIDE 6
S/D SiGe Aw are active area (AA) sizing
Factor affecting this mobility enhancement:
› Active area dimension (Lpp) › Concentration of Ge › Recess depth R
Layout designer can control active area (AA) size
› Traditionally, trained to minimize it › However, with S/D SiGe, increasing AA helps!
SLIDE 7
Stress as function of SiGe AA size
Based on [IMEC, 2006] SiGe AA Increase (times) We used this setting
SLIDE 8
Previous Works
Modeling SiGe AA increase impact
› Eneman, VLSI Symp ’05 › Simoen, Trans Elect. Dev. ’08 › Applied Materials ’07 report
SiGe AA aware layout optimization
› Chakraborty, DATE 08 › Joshi, ISPD 08 › Joshi, DAC 08
SiGe AA aware physical synthesis
› None existing. › This work targets this void.
RTL Physical Synthesis Layout Fabrication Fabrication Layout Physical Synthesis
SLIDE 9 Motivational Example
You have a product without S/D SiGe (1 GHz) Soon will use S/D SiGe. (magically get 1.5 GHz) Is change required at physical synthesis stage?
› Gate sizing algorithms › Repeater insertion algorithms › Buffer planning tools
- Yes. Must change these to exploit fully
› Approximately 10% lesser module delay › Approximately 10% lesser global interconnect delay › Can get 1.65 GHz!
SLIDE 10 Outline
- Intro. to source/drain (S/D) SiGe technology
SiGe Active Area (AA) aware Delay Model AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions
SLIDE 11
Cell Delay Model Derivation
Analyze cell layout to obtain RC switch model
› Consider AA aware PMOS resistance values › Consider increased self-loading capacitances
Compute new fall and rise time Average fall and rise delay to get cell delay
S C D G W L Lpp Lpp S C D G W L L’pp L’pp
SLIDE 12 PMOS Resistance Decrease
Stress Mobility (µ) RON (ON resistance) By curve fitting Relates PMOS RON decrease to SiGe AA increase. Value of “A” depends on Ge conc and recess
- depth. For our settings, A = 3.4.
SLIDE 13
Example [NAND Gate]
D(K) = RC ((F+1)(K+0.5* h+1) + 0.5) Characteristic Delay Equation
SLIDE 14
For FO-4, 2X Lpp => 20% delay decrease
NAND Cell Delay (different fan-outs)
For FO-10, 2.5X Lpp => 25% delay decrease
D(K)/D(1)
SiGe AA Increase (K)
SLIDE 15 Delay Decrease for other Gates
3 input NOR 2 input NOR 3 input NAND 2 input NAND INV Gate
RC ((F+1)(3K+0.5*h+1) + 3FK)
RC ((F+1)(2K+0.5*h+0.5) + FK)
RC ((F+1)(2K+0.5*h+1.5) + 1.5)
RC ((F+1)(K+0.5*h+1) + 0.5)
RC (F+1)(K+0.5*h+0.5) ΔD @ FO4 Characteristic Delay Equation
SLIDE 16 Outline
- Intro. to source/drain (S/D) SiGe technology
Cell Delay Model Derivation AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions
SLIDE 17
Optimal Repeater Insertion (basics)
(basics)
Target: Minimize delay through interconnects. Divide a long interconnect into several parts. A
repeater is inserted to drive each of these.
AA sizing aware ORI: Apart from gate size,
number of repeaters, also determine optimal AA size of the repeater cell.
SLIDE 18
AA Sizing Aw are Repeater Insertion
Source Sink R R R R R Repeater Insertion Length Per Unit Resistance Per Unit Capacitance # Of Repeaters Interconnect Length Repeater Sizing Rw Cw M L S
SLIDE 19
Minimum Interconnect Delay
Minimizing the delay equation analytically…
SLIDE 20
Results [ORI for Performance]
1.04*M (=> +4%) M # Repeaters 0.87*S (=> -13%) S Gate Size 1.7 (=> +70%) 1 AA Size Total Power Delay Metric 0.91*D (=> -9%) D AA Sizing + ORI Traditional ORI 1.1*Ptotal (=>+10%) Ptotal Thus, 9% better delay than the “optimal” repeater insertion solution without SiGe AA size change. What if the aim is not to maximize performance? i.e. iso-delay case (compared to traditional ORI)
SLIDE 21
Results [Reducing # of interconnects]
Reduce no. of repeaters until AA sizing aware
sub-optimal repeater insertion delay is same as traditional post ORI delay.
45% reduction in number of repeaters!
› Very interesting for layout level timing closure stability
R R R R R R R R R R R R D < DORI D == DORI
SLIDE 22 Outline
- Intro. to source/drain (S/D) SiGe technology
Cell Delay Model Derivation AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions
SLIDE 23
CGAS: Concurrent Gate and AA Sizing
Target: Minimize a convex objective
› Delay through the module, or › Power under delay budget, or other.
Determine gate size of each cell and its active
area sizing.
SLIDE 24
CGAS: Formulation
Let tuple {S, C, K} represent gate size, input pin
capacitance, and active area sizing for a gate.
Delay of gate i:
SLIDE 25
CGAS: Formulation
Are all constraints convex?
› All except first are convex trivially. › First can be proven to be posynomial (see paper) as long as fitting parameter A is >= 1
SLIDE 26
Results [ CGAS on IWLS benchmarks ]
0.9 10.1 Average: 1.5 11.0 400 449 2007 C5315 1.3 13.9 233 270 3759 des 0.6 10.2 199 222 628 vda 0.3 9.8 236 262 481 large 0.4 6.4 687 734 2581 C7552 0.5 8.7 295 323 1163 k2 0.2 10.7 159 178 149 frg1 0.4 9.0 309 340 502 C880 3.2 11.0 1175 1320 3316 C6288 % ∆ Cap. %Perf Imprv Delay CGAS Delay GS Num Gates Bench More than 10% reduction with CGAS over traditional Gate Sizing (GS) Less than 1% capacitance increase due to larger active area
Note: All delay values are multiples of RC
SLIDE 27 Outline
- Intro. to source/drain (S/D) SiGe technology
Cell Delay Model Derivation AA aware Optimal Repeater Insertion (ORI) Concurrent AA and Gate Sizing Conclusions
SLIDE 28
Conclusions
When moving to S/D SiGe, physical synthesis
must be revisited to extract maximum benefit.
Proposed SiGe AA sizing aware RC model with
cap increase and PMOS RON decrease.
For long global interconnects, with SiGe AA
sizing of repeaters, delay reduced further by 9%.
SLIDE 29
Conclusions (contd…)
Or reduce repeater count by 45%. Break cycle:
timing analysis buffering layout legalization
Concurrent gate and SiGe AA sizing (CGAS)
proposed and proven as a convex problem.
For module delay reduction, CGAS reduces
delay by 10% over non-AA aware sizing.
SLIDE 30 References
“Scalability of the SiGe S/D technology for the
45-nm technology node and beyond,” in IEEE Transactions on Electron Devices, July 2006.
L.Washington et al., “pMOSFET with 200%
mobility enhancement induced by multiple stressors,” Electron Device Letters, IEEE, vol. 27,
- no. 6, pp. 511–513,June 2006
- S. Boyd et al. , Convex Optimization. Cambridge
- Univ. Press, March 2004.
SLIDE 31
Notation
In the rest of the work:
Increasing the active area of a gate by K times reduces its PMOS’s resistance by F times. These are related by the formula A = 1 : PMOS resistance independent of K A < 1 : PMOS resistance increases with higher K A > 1 : PMOS resistance reduces with higher K In our curve fit, A = 3.4
SLIDE 32
Optimal K Value
Depends on fabrication technology
› i.e. on single fitting parameter A
SLIDE 33
Flow Used for CGAS
Benchmark Optmz and Tech map in SIS 2-nand 2-nor inv decomposition used C++ tool writes out the constraints MOSEK + AMPL solvers report results
SLIDE 34
Link to the paper
SLIDE 35
S C D G W L Lpp Lpp S C D G W L L’pp L’pp
SLIDE 36
AA Sizing Aw are Repeater Insertion
Source Sink R R R R R Repeater Insertion Length Per Unit Resistance Per Unit Capacitance # Of Repeaters Interconnect Length Repeater Sizing Rw Cw M L S