
PART V: New Approaches to Harness Global Interconnects

Jason Cong
Computer Science Department, University of California at Los Angeles
Email: cong@cs.ucla.edu
Tel: 310-206-2775
http://cadlab.cs.ucla.edu/~cong
DAC'2000 Tutorial

Part V Outline
- Interconnect-Centric Design Flow
- Interconnect Performance Estimation
- Examples of Interconnect Planning
  - Problem formulation
  - Buffer block planning
  - Wire width planning
- System-Level Partitioning with Retiming
  - Hierarchical Performance-Driven Partitioning with retiming
- Concluding Remarks

Interconnect-Centric Design Methodology
- Proposed transition: device/function centric -> interconnect/communication centric
- Analogy: programs and data/objects
[Figure: relative emphasis of device and interconnect, and of programs and data/objects, before and after the transition]

Interconnect-Centric Design Flow
- Key steps in an interconnect-centric design flow:
  - Interconnect Planning
  - Interconnect Synthesis
  - Interconnect Layout
- Other supporting tools to enable an interconnect-centric design flow:
  - Interconnect performance estimation
  - Interconnect performance verification

Interconnect Performance Estimation
- Introduction & Motivation
- Problem Formulation
- Interconnect Delay Estimation Models under Various Layout Optimizations
- Application and Conclusion

Interconnect Layout Optimization
- E.g., UCLA TRIO (Tree, Repeater, Interconnect Optimization) Package
  - Interconnect topology optimization
  - Optimal buffer insertion
  - Wiresizing optimization
  - Global interconnect sizing and spacing
  - Simultaneous driver, buffer, and interconnect sizing
  - Simultaneous topology generation with buffer insertion and wiresizing
  Available from http://cadlab.cs.ucla.edu/~cong
- Delay can be improved by up to 7x!

Impact of Interconnect Optimization on Future Technology Generations
[Figure: delay (ns) versus technology (um: 0.25, 0.18, 0.15, 0.13, 0.1, 0.07) for a 2cm line under DS, BIS, and BISWS]
- DS: Driver Sizing only
- BIS: Buffer Insertion and Sizing
- BISWS: Simultaneous Buffer Insertion/Sizing and Wiresizing

Complexity of Existing Interconnect Opt. Algorithms
- 2cm line, W=20, B=10, segment every 500um
- Use best available algorithms:
  - Local Refinement (LR)
  - Dynamic Programming (DP)
  - Hybrid of DP+LR

Algorithm     OWS     BI+OWS   BIWS    BISWS
Delay (ns)    4.5     1.6      1.02    0.81
CPU (s)       0.06    0.42     4.5     12.4

Method labels shown above the table on the slide: DP, DP+LR, LR.
(HSPICE needs an additional 60 seconds!)

Needs for Efficient Interconnect Estimation Models
- Efficiency
- Abstraction to hide detailed design information
  - granularity of wire segmentation
  - number of wire widths, buffer sizes, ...
- Explicit relation to enable optimal design decisions at high levels
- Ease of interaction with logic/high-level synthesis tools

Interconnect Performance Estimation Modeling [Cong-Pan, ASPDAC'99, TAU'99, DAC'99]
- Develop a set of interconnect performance estimation models (IPEM) under different optimization alternatives:
  - Optimal Wire Sizing (OWS)
  - Simultaneous Driver and Wire Sizing (SDWS)
  - Simultaneous Buffer Insertion and Wire Sizing (BIWS)
  - Simultaneous Buffer Insertion/Sizing and Wire Sizing (BISWS)
- IPEM have
  - closed-form formulas or simple characteristic equations
  - constant running time in practice
  - high accuracy (about 90% accuracy on average)

Problem Formulation
[Figure: Input -> G_0 -> G -> wire of length l -> load C_L]
- R_d0: driver effective resistance of the input stage G_0
- R_d: driver effective resistance of G
- l: interconnect wire length
- C_L: loading capacitance
What is the optimized delay? Do not run TRIO or other optimization tools!

Parameters and Notations
- Interconnect
  - c_a: area capacitance coefficient
  - c_f: fringing capacitance coefficient
  - r: sheet resistance
- Device
  - t_g: intrinsic gate delay
  - c_g: input capacitance of the minimum gate
  - r_g: output resistance of the minimum gate
- Based on the 1997 National Technology Roadmap for Semiconductors (NTRS'97)

Delay/Area Estimation under OWS
- Closed-form delay estimation formula

  $$T_{ows}(R_d, l, C_L) = \left( \frac{\alpha_1 l}{W^2(\alpha_2 l)} + \frac{2 \alpha_1 l}{W(\alpha_2 l)} + R_d c_f + \sqrt{R_d \, r \, c_a c_f \, l} \right) \cdot l$$

  where $\alpha_1 = \frac{1}{4} r c_a$, $\alpha_2 = \frac{1}{2} \sqrt{\frac{r c_a}{R_d C_L}}$, and $w = W(x)$ is Lambert's W function defined as $w e^w = x$.

- Closed-form area estimation formula

  $$A_{ows}(R_d, l, C_L) = \sqrt{\frac{r (c_f l + 2 C_L)}{2 R_d c_a}} \cdot l$$

Property of DEM-OWS
- Theorem: $T_{ows}$ is a sub-quadratic, convex function of length $l$.
- Note: Without wiresizing, wiring delay $\propto l^2$. This model, as used in some previous layout-driven logic synthesis systems such as [Ramachandran et al., ICCAD-92], is no longer accurate!
- Closed-form DEM-OWS will serve as a basis for deriving SDWS, BIWS and BISWS.
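The closed-form OWS estimates can be evaluated directly, which is the point of not running TRIO. The following is a minimal sketch, assuming the delay formula reads T_ows = (a1*l/W^2(a2*l) + 2*a1*l/W(a2*l) + R_d*c_f + sqrt(R_d*r*c_a*c_f*l)) * l with a1 = r*c_a/4 and a2 = (1/2)*sqrt(r*c_a/(R_d*C_L)), and the area formula reads A_ows = l * sqrt(r*(c_f*l + 2*C_L)/(2*R_d*c_a)). The function names, the Newton-based Lambert W helper, and all numeric parameter values are illustrative assumptions (not NTRS'97 data). Units are assumed to be ohm, fF, and um, so delays come out in fs.

```python
import math

def lambert_w(x, tol=1e-12):
    """Principal branch of Lambert's W for x >= 0 (solves w * exp(w) = x)."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    if x == 0:
        return 0.0
    w = math.log1p(x)  # starts at or above the root, so Newton descends to it
    for _ in range(100):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + w):
            break
    return w

def t_ows(R_d, l, C_L, r, c_a, c_f):
    """Delay estimate under optimal wire sizing, as read from the slide."""
    if l <= 0:
        return R_d * C_L  # limit of the formula as l -> 0: driver charging the load
    a1 = 0.25 * r * c_a
    a2 = 0.5 * math.sqrt(r * c_a / (R_d * C_L))
    w = lambert_w(a2 * l)
    return (a1 * l / w**2 + 2.0 * a1 * l / w
            + R_d * c_f + math.sqrt(R_d * r * c_a * c_f * l)) * l

def area_ows(R_d, l, C_L, r, c_a, c_f):
    """Wire area estimate under optimal wire sizing, as read from the slide."""
    return l * math.sqrt(r * (c_f * l + 2.0 * C_L) / (2.0 * R_d * c_a))

# Hypothetical placeholder values, chosen only to exercise the formulas.
R_D, C_LOAD = 100.0, 100.0         # ohm, fF
WIRE = (0.068, 0.06, 0.064)        # r (ohm/sq), c_a (fF/um^2), c_f (fF/um)

for length_um in (2000.0, 4000.0, 8000.0, 16000.0):
    delay_ns = t_ows(R_D, length_um, C_LOAD, *WIRE) * 1e-6  # fs -> ns
    area_um2 = area_ows(R_D, length_um, C_LOAD, *WIRE)
    print(f"l = {length_um:7.0f} um  delay = {delay_ns:.3f} ns  area = {area_um2:.0f} um^2")
```

Under these assumptions, doubling the length should less than quadruple the delay, which is a quick way to see the sub-quadratic behavior stated in the theorem.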

Comparison of IPEM-OWS vs. TRIO
[Figure: delay modeling, delay (ns, 0 to 1.00) versus length (um, 0 to 16000), Model vs. TRIO]
- 0.18um, R_d = r_g/100, C_L = c_g x 100
- For the experiment, max wire width is 20x min; wire is segmented every 10um

Area Estimation for OWS
[Figure: width (um, 0 to 2) versus length (um, 0 to 20000), Model vs. TRIO]

Critical Length for BI under OWS
- No buffer: driver R_d drives a wire of length l into load C_L, with delay $T_{ows}(R_d, l, C_L)$
- 1 best buffer: buffer b splits the wire into segments of length $\alpha l$ and $(1-\alpha) l$, with delay

  $$T_{1biws}(R_d, l, C_L) = \min_{0 \le \alpha \le 1} \left\{ T_{ows}(R_d, \alpha l, C_b) + t_g + T_{ows}(R_b, (1-\alpha) l, C_L) \right\}$$

- Solve for l => critical length $l_{crit}(b, R_d, C_L)$
  - Computed by bisection method
  - Constant time in practice

Critical Lengths l_crit(b, R_b, C_b)

Technology (um)   0.25   0.18   0.15   0.13   0.10   0.07
b=10x             4.12   3.80   3.97   3.61   2.92   2.08
b=50x             6.40   5.81   6.01   5.51   4.45   3.30
b=100x            7.47   6.83   7.04   6.39   5.30   3.91
b=200x            8.65   7.92   8.14   7.43   6.35   4.49
b=500x            9.98   9.10   9.30   8.57   7.13   5.21
Min. WS           2.52   2.23   2.14   1.94   1.50   1.43
unit: mm

- Overall trend: critical lengths decrease with technology scaling
- Cf. [Otten ISPD'98, Otten-Brayton DAC'98] (uniform wire width)
- Denote l_c = l_crit(b, R_b, C_b)
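The bisection step for the critical length can be sketched as follows. This is an illustration under stated assumptions, not the tutorial's implementation: the critical length is taken to be the length at which the unbuffered OWS delay equals the delay with one optimally placed buffer, the OWS delay formula is assumed to read as in the "Delay/Area Estimation under OWS" slide, the buffer is assumed to have R_b = r_g/b and C_b = c_g*b, and the minimum over the buffer position is found by ternary search (relying on the convexity of T_ows in length). All names and numeric values are hypothetical placeholders in ohm, fF, um, and fs.

```python
import math

def lambert_w(x, tol=1e-12):
    """Principal branch of Lambert's W for x >= 0 (solves w * exp(w) = x)."""
    if x == 0:
        return 0.0
    w = math.log1p(x)  # starts at or above the root, so Newton descends to it
    for _ in range(100):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + w):
            break
    return w

def t_ows(R, l, C, r, c_a, c_f):
    """OWS delay estimate as read from the slide; tends to R * C as l -> 0."""
    if l <= 0:
        return R * C
    a1 = 0.25 * r * c_a
    a2 = 0.5 * math.sqrt(r * c_a / (R * C))
    w = lambert_w(a2 * l)
    return (a1 * l / w**2 + 2.0 * a1 * l / w
            + R * c_f + math.sqrt(R * r * c_a * c_f * l)) * l

def t_1biws(R_d, l, C_L, R_b, C_b, t_g, wire):
    """Delay with one buffer, minimized over its position alpha in [0, 1]."""
    def delay(alpha):
        return (t_ows(R_d, alpha * l, C_b, *wire) + t_g
                + t_ows(R_b, (1.0 - alpha) * l, C_L, *wire))
    lo, hi = 0.0, 1.0
    for _ in range(60):  # ternary search on a convex function of alpha
        m1, m2 = lo + (hi - lo) / 3.0, hi - (hi - lo) / 3.0
        if delay(m1) < delay(m2):
            hi = m2
        else:
            lo = m1
    return delay(0.5 * (lo + hi))

def gap(l, R_d, C_L, R_b, C_b, t_g, wire):
    """Unbuffered delay minus one-buffer delay; positive when the buffer helps."""
    return t_ows(R_d, l, C_L, *wire) - t_1biws(R_d, l, C_L, R_b, C_b, t_g, wire)

def critical_length(R_d, C_L, R_b, C_b, t_g, wire, tol=1e-6):
    """Bisection for the length where gap() changes sign."""
    args = (R_d, C_L, R_b, C_b, t_g, wire)
    if gap(0.0, *args) > 0:
        raise ValueError("buffer already helps at l = 0; no sign change to bracket")
    lo, hi = 0.0, 1000.0
    while gap(hi, *args) <= 0:  # grow the bracket until the buffer helps
        lo, hi = hi, 2.0 * hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid, *args) > 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

# Hypothetical placeholder values (not NTRS'97 data).
WIRE = (0.068, 0.06, 0.064)            # r (ohm/sq), c_a (fF/um^2), c_f (fF/um)
r_g, c_g, t_g, b = 10000.0, 1.0, 30000.0, 100
R_b, C_b = r_g / b, c_g * b            # assumed scaling of a b-times-minimum buffer
l_c = critical_length(R_b, C_b, R_b, C_b, t_g, WIRE)
print(f"critical length ~ {l_c / 1000.0:.2f} mm")
```

The example calls critical_length with the driver and load equal to the buffer itself, matching the l_crit(b, R_b, C_b) convention of the critical-length table.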
