GDP and More: Performance and Power Solutions for Multi-Core VLSI - PowerPoint PPT Presentation

GDP and More: Performance and Power Solutions for Multi-Core VLSI Systems Hai Wang University of Electronic Science & Technology of China Homepage (English): https://wanghaiuestc.github.io Homepage (Chinese): http://faculty.uestc.edu.cn/wanghai1 2020

Motivation and Background

The new challenges in IC industry Core #: Leakage from multi increases to many 3D Dark integration silicon l Scaling causes new challenges in IC industry. l Solutions needed for new challenges.

The leakage problems Normalized leakage current 1 HSPICE 0.8 Curve Fitting Core #: Leakage 0.6 from multi increases 0.4 to many 0.2 0 20 40 60 80 100 120 Temperature (° C) 3D Dark integration silicon l Leakage power becomes significant. l Leakage power highly and nonlinearly relates to temperature: dangerous and difficult to model.

The many-core challenge Core #: Leakage from multi increases to many 3D Dark integration silicon l Core # increases: tens or more cores on a single die. l Difficult to coordinate cores for best performance under thermal constraint.

The problem of 3D integration Core #: Leakage from multi increases to many 3D Dark integration silicon (a) Temperature ( K ) distribution. (b) Von Mises thermal stress ( MPa ) distribution. l 3D IC: go vertical for higher integration density. l High power density leads to high temperature, large stress, and reliability issues.

The dark silicon hazard Core #: Leakage from multi scaling increases to many 4-core with 64 nm 3D Dark integration silicon 16-core with 32 nm l Not all cores can be on simultaneously anymore. l Which cores should be on and how much power can be consumed for best performance?

Outline l Leakage Matters: o Leakage-aware thermal estimation (IEEE Trans. on Computers, 2018) o Leakage-aware thermal management (white-box model) (ASP-DAC Best Paper Nomination, 2019) (IEEE Trans. on Industrial Informatics, 2020) o Leakage-aware thermal management (black-box model) (IEEE Trans. on CAD of Integrated Circuits and Systems, 2019) l Many-Core Solutions: o Hierarchical thermal management (ACM Trans. on Design Automation of Electronic Systems, 2016)

Outline l 3D Integration: o Runtime stress estimation using ANN (ACM Trans. on Design Automation of Electronic Systems, 2019) o STREAM: Stress-aware reliability management (IEEE Trans. on CAD of Integrated Circuits and Systems, 2018) l Dark Silicon Hazard: o GDP: Greedy based dynamic power budgeting (IEEE Trans. on Computers 2019) o Performance optimization of 3-D microprocessors (IEEE Trans. on Computers 2020)

Leakage Matters Leakage-aware thermal estimation • H. Wang, J. Wan, et al. , “A fast leakage-aware full-chip transient thermal estimation method”, IEEE Trans. on Computers, 2018 Leakage-aware thermal management • White-box model through PWL approximation • X. Guo, H. Wang, et al. , “Leakage-aware thermal management for multi-core systems using piecewise linear model predictive control”, ASP-DAC Best Paper Nomination, 2019 H. Wang, L. Hu, X. Guo et al. , “Compact piecewise linear model based temperature control of multi-core systems considering leakage power”, IEEE Transactions on Industrial Informatics, 2020 Black-box model using Echo State Network (ESN) • H. Wang, X. Guo, et al. , “Leakage-aware predictive thermal management for multi-core systems using echo state network”, IEEE Trans. on CAD of Integrated Circuits and Systems, 2019

Nonlinear leakage problem in thermal estimation l Leakage power depends on temperature nonlinearly. Normalized leakage current 1 HSPICE 0.8 Curve Fitting 0.6 0.4 0.2 0 20 40 60 80 100 120 Temperature (° C) l Difficult to compute temperature l Initial guess and iteration needed to solve the nonlinear thermal model (white-box model)! GT ð t Þ þ C dT ð t Þ ¼ BP ð T; t Þ ; dt Y ð t Þ ¼ LT ð t Þ ;

Piecewise linear based thermal estimation l Build local linear thermal models by Taylor expansion P s ¼ P 0 þ A s T; let G l ¼ G � BA s , G l T ð t Þ þ C dT ð t Þ ¼ B ð P d ð t Þ þ P 0 Þ ; dt Y ð t Þ ¼ LT ð t Þ : l Change Taylor expansion points on the fly Temp Expansion points Time

Leakage-aware thermal management problem l Dynamic power is controllable l Change core’s V/f l Switch tasks by scheduling l Leakage power is uncontrollable l Depends mainly on temperature l How to compute the dynamic power recommendation in leakage-aware thermal management? Thermal management (white-box model) Dynamic power Multi-core Thermal sensor recommendation system plant readings

Basic framework of Predictive DTM l The basic idea of predictive DTM l Compute the dynamic power recommendation P d , which tracks the given target temperature l P d can be solved by optimization using thermal prediction Temp Target Temp Control step Current Temp Now Time thermal prediction Formulate optimization using white-box minimize thermal model

Determine expansion points in thermal management l Build PWL white-box thermal model for DTM l A systematic way to choose Taylor expansion points l Simulate the extreme curve (black) to determine points l Normal curves (orange, blue) share the points of the extreme

Using black-box model for DTM l When detailed structure unavailable l Build black-box thermal model l Training using input (power) and output (temp.) pairs l Remarks l Input should be dynamic power l Model should be nonlinear l Leakage handled implicitly inside model Thermal management (black-box model) Dynamic power Multi-core Thermal sensor recommendation system plant readings

First try (failed): RNN based model l Using recurrent neural network (RNN) l Nonlinear model specially for dynamic system modeling l Training using back propagation through time (BPTT) l First try failed! Due to exploding gradient in training l Large error using RNN x r ( k ) = f ( A r P d ( k ) + D r T r ( k − 1) + α ) , T r ( k ) = E r x r ( k ) + β , 3 2.8 2.6 2.4 κ i 2.2 2 1.8 1.6 0 500 1000 1500 2000 2500 Time (s) Singular value > 1: exploding gradient

ESN to avoid exploding gradient l Echo State Network (ESN) is a special RNN l Fixing the recurrent weights in hidden units l Only train the input and output weights l Training does not propagate through time (vs. BPTT) l Good accuracy in leakage-aware thermal modeling x ( k ) = (1 − γ ) x ( k − 1) + γ f ( AP d ( k ) + Dx ( k − 1)) , T ( k ) = Ex ( k ) + HP d ( k ) , Simple training via least square, No exploding gradient problem: � T � x (1) , x (2) , . . . , x ( n k ) S = P tr (1) , P tr (2) , . . . , P tr ( n k ) O = [ T tr (1) , T tr (2) , . . . , T tr ( n k )] T W out = ( S † O ) T

Many-Core Solutions Hierarchical thermal management • H. Wang, J. Ma, et al. , “Hierarchical dynamic thermal management method for high-performance many-core microprocessors”, ACM Trans. on Design Automation of Electronic Systems, 2016

Model predictive control in thermal management • We want to match the desired power profile using current power profile, by using task migration and DVFS. Current power profile Current thermal profile MPC Matching problem Desired power profile Desired thermal profile

The many-core system DTM problem • Computing time increases as core number increases • Large control delay reduces efficiency An example of 100-core chip, assuming core in red is in charge of the DTM computing.

Two-level Hierarchical method • Lower level matching • Simply group spatially adjacent cores into blocks. • Do matching inside each block (intra block) l Upper level matching l Do Matching using lower level unmatched ones (inter block) Lower level matching Upper level matching

GDP and More: Performance and Power Solutions for Multi-Core VLSI - PowerPoint PPT Presentation

GDP and More: Performance and Power Solutions for Multi-Core VLSI Systems Hai Wang University of Electronic Science & Technology of China Homepage (English): https://wanghaiuestc.github.io Homepage (Chinese):

Gross domestic product 1 st quarter 2020 30 June 2020 Embargo: 11:30am @StatsSA #GDP GDP

Gross domestic product 4 th quarter 2019 3 March 2020 Embargo: 11:30am @StatsSA #GDP GDP

Gross domestic product 3 rd quarter 2019 3 December 2019 Embargo: 11:30am @StatsSA #GDP GDP

Gross domestic product 2 nd quarter 2019 3 September 2019 Embargo: 11:30am @StatsSA #GDP

Learn more Do more Be more Learn more Do more Be more UNITY Learn more Do

Defect Detection Thomas Zimmermann The First Bug September 9, 1947 More Bugs More Bugs More

Why Transformers Work. More info blablabla More info blablabla More info blablabla More

Figure 1: GDP per capita before and after a democratization. 25 Change in GDP per capita log

Invest in Lombardy Lombardy at a glance Almost one quarter of Italys total GDP

Tie public debt-to-GDP ratios for CEE countries have improved substantially Debt reduction PEAK

1Q20 Duration of the quarantine and GDP forecasts Impact of COVID-19 on GDP and the unemployment

Why Has the Unemployment Rate Fared Better than GDP Growth? Answer: Between 2007 and 2014, GDP

Economics 2 Professor Christina Romer Spring 2018 Professor David Romer LECTURE 15 MEASUREMENT

World trade has grown since 1930.... World Trade (in percent of GDP), 1930-2015 70 60 50 World

Economics 2 Professor Christina Romer Spring 2019 Professor David Romer LECTURE 15 MEASUREMENT

Boulder Economic Forecast Business Research Division Leeds School of Business University of

Variational Inference for Bayes vMF Mixture Hanxiao Liu September 23, 2014 1 / 14 Variational

Extended Variational Inference for Non-Gaussian Statistical Models Zhanyu Ma

Natural Language Processing Lecture 132/26/2015 Martha Palmer Today Start on Parsing

Kai-Wei Chang UCLA References: http://kwchang.net Kai-Wei Chang

design of the radiation cooled positron target Sabine Riemann (DESY), Andriy Ushakov (UHH),

A Photon Dump Study for ILC Undulator Positron Source Yu Morikawa 2017/9/20 1 ILC Beam Dumps

Estimation in nonparametric regression with discrete errors Wolfgang Wefelmeyer (University of

Collective dynamics in life sciences Lecture 2: the Vicsek model Pierre Degond Imperial College

Sambuz

Useful Links

Newsletter

Mail Us

GDP and More: Performance and Power Solutions for Multi-Core VLSI - PowerPoint PPT Presentation

GDP and More: Performance and Power Solutions for Multi-Core VLSI Systems Hai Wang University of Electronic Science & Technology of China Homepage (English): https://wanghaiuestc.github.io Homepage (Chinese):

Gross domestic product 1 st quarter 2020 30 June 2020 Embargo: 11:30am @StatsSA #GDP GDP

Gross domestic product 4 th quarter 2019 3 March 2020 Embargo: 11:30am @StatsSA #GDP GDP

Gross domestic product 3 rd quarter 2019 3 December 2019 Embargo: 11:30am @StatsSA #GDP GDP

Gross domestic product 2 nd quarter 2019 3 September 2019 Embargo: 11:30am @StatsSA #GDP

Learn more Do more Be more Learn more Do more Be more UNITY Learn more Do

Defect Detection Thomas Zimmermann The First Bug September 9, 1947 More Bugs More Bugs More

Why Transformers Work. *More info blablabla *More info blablabla *More info blablabla *More

Figure 1: GDP per capita before and after a democratization. 25 Change in GDP per capita log

Invest in Lombardy Lombardy at a glance Almost one quarter of Italys total GDP

Tie public debt-to-GDP ratios for CEE countries have improved substantially Debt reduction PEAK

1Q20 Duration of the quarantine and GDP forecasts Impact of COVID-19 on GDP and the unemployment

Why Has the Unemployment Rate Fared Better than GDP Growth? Answer: Between 2007 and 2014, GDP

Economics 2 Professor Christina Romer Spring 2018 Professor David Romer LECTURE 15 MEASUREMENT

World trade has grown since 1930.... World Trade (in percent of GDP), 1930-2015 70 60 50 World

Economics 2 Professor Christina Romer Spring 2019 Professor David Romer LECTURE 15 MEASUREMENT

Boulder Economic Forecast Business Research Division Leeds School of Business University of

Variational Inference for Bayes vMF Mixture Hanxiao Liu September 23, 2014 1 / 14 Variational

Extended Variational Inference for Non-Gaussian Statistical Models Zhanyu Ma

Natural Language Processing Lecture 132/26/2015 Martha Palmer Today Start on Parsing

Kai-Wei Chang UCLA References: http://kwchang.net Kai-Wei Chang

design of the radiation cooled positron target Sabine Riemann (DESY), Andriy Ushakov (UHH),

A Photon Dump Study for ILC Undulator Positron Source Yu Morikawa 2017/9/20 1 ILC Beam Dumps

Estimation in nonparametric regression with discrete errors Wolfgang Wefelmeyer (University of

Collective dynamics in life sciences Lecture 2: the Vicsek model Pierre Degond Imperial College

Sambuz

Useful Links

Newsletter

Mail Us

Why Transformers Work. More info blablabla More info blablabla More info blablabla More