new approaches to harness global interconnects
play

New Approaches to Harness Global Interconnects Jason Cong Computer - PDF document

PART V New Approaches to Harness Global Interconnects Jason Cong Computer Science Department University of California at Los Angeles Email: cong@cs.ucla.edu Tel: 310-206-2775 http://cadlab.cs.ucla.edu/~cong DAC'2000 Tutorial Jason Cong 1


  1. PART V New Approaches to Harness Global Interconnects Jason Cong Computer Science Department University of California at Los Angeles Email: cong@cs.ucla.edu Tel: 310-206-2775 http://cadlab.cs.ucla.edu/~cong DAC'2000 Tutorial Jason Cong 1 Part V Outline I Interconnect Interconnect-Centric Design Flow Centric Design Flow I Interconnect Performance Estimation Interconnect Performance Estimation I Examples of Interconnect Planning Examples of Interconnect Planning N Problem formulation Problem formulation N Buffer block planning Buffer block planning N Wire width planning Wire width planning I System System-Level Partitioning with Retiming Level Partitioning with Retiming N Hierarchical Performance Hierarchical Performance-Driven Partitioning Driven Partitioning with retiming with retiming I Concluding Remarks Concluding Remarks DAC'2000 Tutorial Jason Cong 2

  2. Interconnect-Centric Design Methodology I Proposed transition Proposed transition interconnect device device interconnect device/function centric interconnect/communication centric I Analogy Analogy Data/Objects Programs Programs Data/Objects DAC'2000 Tutorial Jason Cong 3 Interconnect-Centric Design Flow I Key steps in an interconnect Key steps in an interconnect-centric design flow: centric design flow: N Interconnect Planning Interconnect Planning N Interconnect Synthesis Interconnect Synthesis N Interconnect Layout Interconnect Layout I Other supporting tools to enable an interconnect Other supporting tools to enable an interconnect- centric design flow centric design flow N Interconnect performance estimation Interconnect performance estimation N Interconnect performance verification Interconnect performance verification DAC'2000 Tutorial Jason Cong 4

  3. Interconnect Performance Estimation I Introduction & Motivation Introduction & Motivation I Problem Formulation Problem Formulation I Interconnect Delay Estimation Models under Interconnect Delay Estimation Models under Various Layout Optimizations Various Layout Optimizations I Application and Conclusion Application and Conclusion DAC'2000 Tutorial Jason Cong 5 Interconnect Layout Optimization I E.g., UCLA E.g., UCLA TRIO TRIO (Tree, Repeater, Interconnect (Tree, Repeater, Interconnect Optimization) Package Optimization) Package N Interconnect topology optimization Interconnect topology optimization N Optimal buffer insertion Optimal buffer insertion N Wiresizing Wiresizing optimization optimization N Global interconnect sizing and spacing Global interconnect sizing and spacing N Simultaneous driver, buffer, and interconnect sizing Simultaneous driver, buffer, and interconnect sizing N Simultaneous topology generation with buffer insertion and Simultaneous topology generation with buffer insertion and wiresizing wiresizing Available from Available from http:// http://cadlab cadlab.cs cs.ucla ucla.edu edu/~cong /~cong I Delay can be improved by up to 7x ! Delay can be improved by up to 7x ! DAC'2000 Tutorial Jason Cong 6

  4. Impact of Interconnect Optimization on Future Technology Generations 5 2cm DS 4.5 4 2cm BIS 3.5 Delay (ns) 2cm BISWS 3 2.5 G DS: Driver Sizing only 2 G BIS: Buffer Insertion 1.5 and Sizing 1 G BISWS: Simultaneous 0.5 Buffer Insertion/Sizing and Wiresizing 0 0.25 0.18 0.15 0.13 0.1 0.07 Technology ( u m) DAC'2000 Tutorial Jason Cong 7 Complexity of Existing Interconnect Opt. Algorithms I 2cm line, W=20, B=10, segment every 500um 2cm line, W=20, B=10, segment every 500um I Use Use best available best available algorithms: algorithms: N Local Refinement ( Local Refinement (LR LR) ) N Dynamic Programming ( Dynamic Programming (DP DP) ) N Hybrid of Hybrid of DP+LR DP+LR DP DP+LR LR Algorithm OWS BI+OWS BIWS BISWS Delay (ns) 4.5 1.6 1.02 0.81 CPU (s) 0.06 0.42 4.5 12.4 ( HSPICE needs additional 60 seconds! ) ( HSPICE needs additional 60 seconds! ) DAC'2000 Tutorial Jason Cong 8

  5. Needs for Efficient Interconnect Estimation Models I Efficiency Efficiency I Abstraction Abstractionto hide detailed design information to hide detailed design information N granularity of wire segmentation granularity of wire segmentation N number of wire widths, buffer sizes, ... number of wire widths, buffer sizes, ... I Explicit relation Explicit relationto enable optimal design decision at to enable optimal design decision at high levels high levels I Ease of interaction Ease of interaction with logic/high level synthesis tools with logic/high level synthesis tools DAC'2000 Tutorial Jason Cong 9 Interconnect Performance Estimation Modeling [Cong-Pan, ASPDAC’99, TAU’99, DAC’99] I Develop a set of Develop a set of interconnect performance estimation interconnect performance estimation models models (IPEM IPEM), under different optimization alternatives: ), under different optimization alternatives: N Optimal Wire Sizing Optimal Wire Sizing (OWS) (OWS) N Simultaneous Driver and Wire Sizing Simultaneous Driver and Wire Sizing (SDWS) (SDWS) N Simultaneous Buffer Insertion and Wire Sizing Simultaneous Buffer Insertion and Wire Sizing (BIWS) (BIWS) N Simultaneous Buffer Insertion/Sizing and Wire Sizing Simultaneous Buffer Insertion/Sizing and Wire Sizing (BISWS) (BISWS) I IPEM have IPEM have N closed closed-form formula or simple characteristic equations form formula or simple characteristic equations N constant running time in practice constant running time in practice N high accuracy (about 90% accuracy on average) high accuracy (about 90% accuracy on average) DAC'2000 Tutorial Jason Cong 10

  6. Problem Formulation G l G 0 C L Input driver effective resistance of the input stage driver effective resistance of the input stage G 0 I R d0 d0 driver effective resistance of driver effective resistance of G I R d interconnect wire length interconnect wire length I l loading capacitance loading capacitance I C L What is the optimized delay? Do not run TRIO or other optimization tools ! DAC'2000 Tutorial Jason Cong 11 Parameters and Notations I Interconnect Interconnect N c a area capacitance coefficient area capacitance coefficient fringing capacitance coefficient fringing capacitance coefficient N c f sheet resistance sheet resistance N r I Device Device intrinsic gate delay intrinsic gate delay N t g N c g input capacitance of the minimum gate input capacitance of the minimum gate N r g output resistance of the minimum gate output resistance of the minimum gate I Based on 1997 National Technology Roadmap for Based on 1997 National Technology Roadmap for Semiconductors (NTRS’97) Semiconductors (NTRS’97) DAC'2000 Tutorial Jason Cong 12

  7. Delay/Area Estimation under OWS I Closed Closed-form form delay estimation formula delay estimation formula  α α  l 2 l = 1 + 1 + ⋅   T ( R , l , C ) R c R rc c l l + ows d L d f d a f α α 2   W ( l ) W ( l ) 2 2 where rc α α 1 1 a rc 1 = , 2 = a 4 2 R C d L w = W(x) is Lambert’s W function defined as we x I Closed Closed-form form area estimation formula area estimation formula + r ( c l 2 C ) = ⋅ f L A ( R , l , C ) l ows d L 2 R c d a DAC'2000 Tutorial Jason Cong 13 Property of DEM-OWS I Theorem: Theorem: T ows ows is a sub is a sub-quadratic, convex function of quadratic, convex function of length length l l , wiring delay ∝ l 2 , I Note: Without Note: Without wiresizing wiresizing, wiring delay , as used in as used in some previous layout some previous layout-driven logic synthesis systems, driven logic synthesis systems, such as [ such as [Ramachandran Ramachandranet al., ICCAD et al., ICCAD-92] 92] – no longer no longer accurate! accurate! I Closed Closed-form DEM form DEM-OWS will serve as a basis for OWS will serve as a basis for deriving SDWS, BIWS and BISWS deriving SDWS, BIWS and BISWS DAC'2000 Tutorial Jason Cong 14

  8. Comparison of IPEM-OWS vs. TRIO Delay modeling Model ns TRIO 1.00 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.00 0 2000 4000 6000 8000 10000 12000 14000 16000 length(um) n 0.18um, R d = r g /100, C L = c g x 100 n For expt., max wire width is 20x min, wire is segmented in every 10um DAC'2000 Tutorial Jason Cong 15 Area Estimation for OWS Model TRIO w i d t h ( u m ) 2 1.5 1 0.5 0 0 4000 8000 12000 16000 20000 length(um) DAC'2000 Tutorial Jason Cong 16

  9. Critical Length for BI under OWS No buffer T ( R , l , C ) ows d L R d Solve for l , => C L critical length l crit l (b, R d , C L ) - Computed by 1 best buffer bisection method R d b - Constant time in α l (1- α )l C L practice { } = α + + − α T ( R , l , C ) T ( R , l , C ) t T ( R , ( 1 ) l , C min 1 biws d L ows d b g ows b L ≤ α ≤ 0 1 DAC'2000 Tutorial Jason Cong 17 Critical Lengths l crit (b, R b , C b ) Decrease Technology (um) 0.25 0.18 0.15 0.13 0.10 0.07 4.12 3.80 3.97 3.61 2.92 2.08 b=10x 6.40 5.81 6.01 5.51 4.45 3.30 b=50x 7.47 6.83 7.04 6.39 5.30 3.91 b=100x 8.65 7.92 8.14 7.43 6.35 4.49 b=200x b=500x 9.98 9.10 9.30 8.57 7.13 5.21 unit: mm Min. WS 2.52 2.23 2.14 1.94 1.50 1.43 - Cf. [OttenISPD’98, Otten-BraytonDAC’98] (uniform wire width) - Denote l c = l crit (b, R b , C b ) DAC'2000 Tutorial Jason Cong 18

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend