A Statistical Framework for Designing On-chip Thermal Sensing Infrastructure
Yufu Zhang, Bing Shi, Ankur Srivastava
University of Maryland, College Park
{yufuzh, bingshi, ankurs}@umd.edu
A Statistical Framework for Designing On-chip Thermal Sensing - - PowerPoint PPT Presentation
A Statistical Framework for Designing On-chip Thermal Sensing Infrastructure Yufu Zhang, Bing Shi, Ankur Srivastava University of Maryland, College Park {yufuzh, bingshi, ankurs}@umd.edu Outline Motivation/overview Fusion center design
University of Maryland, College Park
{yufuzh, bingshi, ankurs}@umd.edu
2
Motivation/overview Fusion center design Sensor design/compression
Noisy sensor behavior Exploiting the correlation
Sensor placement Overall flow and interplay Results and conclusion
3
Thermal/power stress
Heavy task execution Increasing chip density Leakage power
Dynamic thermal management (DTM)
Essentially sacrificing performance for lower temperature Need accurate runtime thermal information
4
On-chip sensors can sample the thermal state of the
Counter Sensor Output Enabled for a fixed period of time tp
EN
A simple ring
thermal sensor
Sensors cannot go everywhere Sensors are subject to noise Resource is limited
Sensor design/compression Sensor placement Data fusion
5
6
Central register (finite size M)
Could be a single or multiple actual registers
Fusion algorithm
Model the thermal profile as a random vector T Predict (T) given the sensor obs vector (TS) Exploit statistical information (mean, var, correlation etc.)
Bayesian Estimation Philosophy
7
1
S S T S TS SS
−
TS T S T S S S
Scalar case: Vector case:
85 90 95 100 105 110 115 500 1000 1500 2000 2500 3000 3500 4000 Temperature (degrees Celsius) Number of samples
Given sensor input, the variance of T is reduced to:
Diagonal elements – variance of the thermal estimates. Reflects the fundamental uncertainty of our estimation.
(how far away our estimates are from the real temperature)
Used to drive sensor placement.
A better metric to drive sensor placement?
Sensors are not like cameras Generate the probability of capturing all hotspots
8
1
TT s s TT TS SS ST
−
Noisy sensor behavior (Monte Carlo Simulation) Sensor readings are
Hypothesis testing
9
1 1 ( )
PHL PLH
f P N t t = = +
3 4 2 1 ln ( / ) ( ) 2
t DD t PHL n
n DD t DD t DD
V V V C t C W L V V V V V µ − = + − −
0.002( )
t t
V V T T = + −
1.5 /
( / )
n p
T T µ µ
−
=
50 100 150 200 250 300 350 0.5 1 1.5 2 2.5 3 3.5x 10
4
Sensor frequency(MHz) Number of samples
T = 20°C T = 40°C T = 100°C T = 80°C T = 60°C
Counter Sensor Output Enabled for a fixed period of time tp
EN
Target: minimize the expected prediction error:
Optimal decision rule: Implement as an encoder at the sensor output
10
1
(| | ) | | ( | )
pred real
n pred i real i
i
Cost E T T T T H prob T H T
=
= − = − ⋅ =
1
( | ) ( ) ( | ) ( | )
real i i
real i i n
real j j j
prob T T H P prob T prob T T H P prob T T H P
=
= ⋅ = = ⋅ = = ⋅
1...
pred n
pred
pred real
T H H
=
Bayes rule
They fit into the central register Collectively they provide better accuracy
Decide how to allocate a total of M bits to n sensors so that the
Minimize Subject to
11
1 2
n
i i i i
s b s M ≤ ≤ = ∑
Target: to reduce the overall expected error caused by
Different compression scheme leads to different overall error. Can be formulated as a optimization problem (see details in our
paper).
12
1 2 : 1
( , ,..., ) | ( | ) ( | ) | | ( ) |
n c a s s i i grids i c a s s TS SS rows
TotalCost E error s s s E E T T E T T E T T
∀ − ∀
= = − = Σ Σ −
1
( | ) ( )
i
S i S S T TS SS
E T T T µ µ
−
= + Σ Σ −
Let “S” and “T” represent the set of sensor
Problem formulation: As mentioned earlier represents the
13
TT
1 TT TT TS SS ST −
14
15
Sensor placement Bit allocation/ sensor compression Evaluate overall E(error) = Fusion center design total size of CR = M Design spec Statistical info Too much error? Yes Increase number of sensors No Done
( )
TT
trace Σ
16
1200 1300 1400 1500 1600 1700 1800 75 80 85 90 95 100 105 Time (seconds) Temperature (degrees Celsius) Actual temperature Our estimates Range-based estimates
17
the number of sensors
We presented a unified statistical framework for
Significant improvement in thermal sensing accuracy can
Our methodology has the capability of trading off
18
19