gregory shklover ben emanuel
play

Gregory Shklover Ben Emanuel Intel Corporation Motivation Data Gate - PowerPoint PPT Presentation

Gregory Shklover Ben Emanuel Intel Corporation Motivation Data Gate Sizing by Lagrangian Relaxation (LR) Clock & Data Gate Sizing Algorithm Experimental Results G. Shklover, ISPD '12 2 Methodology Methodology Class Class Structure


  1. Gregory Shklover Ben Emanuel Intel Corporation

  2. Motivation Data Gate Sizing by Lagrangian Relaxation (LR) Clock & Data Gate Sizing Algorithm Experimental Results G. Shklover, ISPD '12 2

  3. Methodology Methodology Class Class Structure Structure Methods Methods Skew, Dynamic Variability, Programming, Tree Non ‐ Convex … … Lagrangian Timing, Relaxation, Convex Graph Power, Analytic, DP… … G. Shklover, ISPD '12 3

  4. FF Balance power and timing in both clock and data for better global solution. G. Shklover, ISPD '12 4

  5. Proposed by C. Chen et al Exploits the nature of timing constraints to reduce complexity Efficient, suitable for industrial design flows (standard library with Vt/sizing). G. Shklover, ISPD '12 5

  6. � � �∈���∩��� �∈����� Timing propagation Setup constraints constraints � � � �→� � � �,��� �,� � � � � G. Shklover, ISPD '12 6

  7. Lagrangian multipliers ( � ) Initialize + KKT ‐ derived simplification Multipliers Size Gates � �,� �→� �→�∈� Update Timing �,� � �∈��� Update � Multipliers �∈����� G. Shklover, ISPD '12 7

  8. � � �,� � ����� � � � �,��� � ����� � ����� � � ����� � � � �,��� � ����� ���→� � � �,� � ����� ���→� � � �,� � ������ � � ������ � clk d q FF �,� �,� � �,� �,� �,��� � �∈��� �∈����� G. Shklover, ISPD '12 8

  9. �,��� ����→� �→� �→� �,� �,� � � �∈����� �∈������ � ������ � ����� � � G. Shklover, ISPD '12 9

  10. Dynamic Programming (DP) Algorithm Initialize Multipliers • Originates from buffered tree Size Gates construction by Van Ginneken • Systematically explores solution Update Timing space by building partial solutions bottom-up Update Multipliers G. Shklover, ISPD '12 10

  11. � � � � �∈����� �∈����� Set of solutions per tree node � � � Pruning criterion � � � � (differs from minimal delay objectives) G. Shklover, ISPD '12 11

  12. Gate sizing: ��� ���� ��� ���� ���� Solution merge: ��� � � ��� � � Leaf nodes: FF � � G. Shklover, ISPD '12 12

  13. ? Approximation Approximation + + Input slews Input slews convergence convergence A Approximation Approximation Side-load Side-load + + effects effects convergence convergence B G. Shklover, ISPD '12 13

  14. � � � � . . . k-Sampling k-Sampling Exponential O(max(k,L)kN) O(max(k,L)kN) number of solutions Objective(a clk ) “Cooling” “Cooling” Convergence � � ����� � � � ����� � ��� | � � ��� ��� | � � ��� |����� � |����� � G. Shklover, ISPD '12 14

  15. Reference: Separate optimization Data sizing for given clock schedule Timing ‐ preserving clock sizing Test: Simultaneous clock & data sizing Same objective as above, but clock and data sized simultaneously G. Shklover, ISPD '12 15

  16. Total Slack Leakage ClkDPwr Total Power Block ref new ref new ref new ref new block1 ‐ 0.038 ‐ 0.044 2.26 2.10 2.07 1.77 4.33 3.87 block2 ‐ 0.051 ‐ 0.015 1.80 1.77 1.38 1.36 3.19 3.14 block3 ‐ 2.387 ‐ 1.902 6.59 6.22 5.51 5.18 12.10 11.40 block4 ‐ 0.032 ‐ 0.030 1.42 1.39 1.46 1.44 2.88 2.84 Total Slack Leakage ClkDPwr Total Power block5 ‐ 0.275 ‐ 0.206 3.86 3.77 4.44 4.20 8.30 7.97 Block ref new ref new ref new ref new block6 ‐ 0.087 ‐ 0.056 6.05 5.95 0.25 0.27 6.31 6.22 block7 ‐ 0.207 ‐ 0.158 3.61 3.57 3.42 3.33 7.03 6.90 Total ‐ 6.02 ‐ 4.03 60.88 58.03 33.32 31.52 94.20 89.55 block8 ‐ 0.407 ‐ 0.179 5.61 5.09 2.30 2.26 7.92 7.35 block9 ‐ 1.075 ‐ 0.537 6.49 6.24 0.96 0.89 7.44 7.12 block10 ‐ 0.108 ‐ 0.066 3.31 3.08 1.65 1.55 4.96 4.63 Useful skew: better timing, Natively balances block11 ‐ 0.794 ‐ 0.529 7.73 7.42 2.84 2.70 10.57 10.12 block12 ‐ 0.154 ‐ 0.121 3.47 2.98 2.44 2.39 5.91 5.37 lower gate leakage clock power vs timing block13 ‐ 0.171 ‐ 0.058 3.00 2.93 0.50 0.52 3.50 3.44 block14 ‐ 0.168 ‐ 0.072 2.57 2.51 1.78 1.70 4.35 4.20 block15 ‐ 0.062 ‐ 0.063 3.10 3.02 2.33 1.97 5.43 4.99 Total ‐ 6.02 ‐ 4.03 60.88 58.03 33.32 31.52 94.20 89.55 G. Shklover, ISPD '12 16

  17. Extend traditional gate sizing to simultaneous clock & data optimization Benefits of global optimization Balances between useful skew, clock power and data power Future directions: Extend optimization objective Topological changes G. Shklover, ISPD '12 17

  18. Prof. C. Chen for participating in discussion and reviews Yoram Aloni and Lior Nissim for supporting this effort G. Shklover, ISPD '12 18

  19. G. Shklover, ISPD '12 19

  20. G. Shklover, ISPD '12 20

  21. +80ps 21 FF G. Shklover, ISPD '12 -20ps ? power Objective

  22. Total Slack Block cooling off cooling on Convergence control eliminates block1 ‐ 0.023 ‐ 0.023 block2 ‐ 0.019 ‐ 0.019 overshoot while optimizing block3 ‐ 2.649 ‐ 1.885 block4 ‐ 0.036 ‐ 0.013 piecewise linear objective. block5 ‐ 0.166 ‐ 0.160 block6 ‐ 0.153 ‐ 0.064 block7 ‐ 0.126 ‐ 0.118 block8 ‐ 0.224 ‐ 0.211 block9 ‐ 0.693 ‐ 0.535 ��� � � block10 ‐ 0.185 ‐ 0.083 ��� � � block11 ‐ 0.662 ‐ 0.553 � ��� block12 ‐ 0.102 ‐ 0.118 block13 ‐ 0.073 ‐ 0.032 block14 ‐ 0.055 ‐ 0.053 block15 ‐ 0.130 ‐ 0.052 Total ‐ 5.29 ‐ 3.92 G. Shklover, ISPD '12 22

  23. G. Shklover, ISPD '12 23

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend