Is Reuse Distance Applicable to Data Locality Analysis
- n Chip Multiprocessors?
Yunlian Jiang Eddy Z. Zhang Kai Tian Xipeng Shen (presenter)
Department of Computer Science The College of William and Mary, VA, USA
Is Reuse Distance Applicable to Data Locality Analysis on Chip - - PowerPoint PPT Presentation
Is Reuse Distance Applicable to Data Locality Analysis on Chip Multiprocessors? Yunlian Jiang Eddy Z. Zhang Kai Tian Xipeng Shen (presenter) Department of Computer Science The College of William and Mary, VA, USA Cache Sharing A common
Yunlian Jiang Eddy Z. Zhang Kai Tian Xipeng Shen (presenter)
Department of Computer Science The College of William and Mary, VA, USA
The College of William and Mary
2
The College of William and Mary
3
The College of William and Mary
4
RD histogram
The College of William and Mary
5
The College of William and Mary
+:SIGMETRICS04,Zhong+:PACT03]
6
The College of William and Mary
7
The College of William and Mary
8
The College of William and Mary
9
The College of William and Mary
10
a b b c d a p q p q P1: P2: SRD = 3; CRD =3+2=5
The College of William and Mary
11
a b c b a
SRD = 2 CRD = 2 + x r = speed(P2)/speed(P1) The larger r is, the greater x tends to be. Dependance on relative running speeds of co-runners.
The College of William and Mary
12
relative speed original: r = IPCi/IPCj after instrumentation: r’ = IPC’i/IPC’j changes of relative speed: |r-r’|/r
The College of William and Mary
13
The College of William and Mary
14
training SRD SRD IPC
training platform testing platform predictor = for single runs
training CRD CRD IPC
training platform testing platform chicken-egg dilemma for co-runs
The College of William and Mary
15
training CRD CRD IPC
training platform testing platform
The College of William and Mary
16
training CRD CRD IPC
training platform testing platform
The College of William and Mary
17
The College of William and Mary
18
The College of William and Mary
19
The College of William and Mary
20
SRDT1 SRDT2 SRDTm ... CRDT1 CRDT2 CRDTm ... prob. model
The College of William and Mary
21
a ... a ∆ trace of T1: ∆ dT1 dT2 dTm ... CRDT1= dT1 + dT1 + ... + dTm
assuming no data sharing
The College of William and Mary
22
time distance
Shows the probability for an access to have a certain TD.
The College of William and Mary
23
Pi(∆ ) = Pi(∆-1) + qi(∆ ) ∆ Pi(∆-1) = Pi(∆-2) + qi(∆-1) Pi(1) = Pi(0) + qi(1) ...
i(Δ) =
τ =1 Δ
qi(∆ ): Oi is accessed at time point ∆, but not at the ∆-1 points ahead.
TDH
The College of William and Mary
24
τ
δ =τ +1 T
TDH TDH
i(Δ) = ni
Δ
δ =τ +1 T
The College of William and Mary
25
d
TDH See paper for details.
The College of William and Mary
26
a b X X b X c d X a X: references by T2
should not be counted.
The College of William and Mary
27
S: set of all shared data. N1, N2: data size of T1 and T2. n1, n2: # of distinct data accessed by T1 and T2 in an interval of length V. C: intersection of data sets referenced by T1 and T2 in the interval.
See paper for details.
The College of William and Mary
s: sharing ratio n1, n2: data sizes
The College of William and Mary
29
The College of William and Mary
30
The College of William and Mary
31
Difficult in general. Reliance on relative speeds; loss of hardware-indep; falling into a chicken-egg dilemma. Yes for a class of multithreading applications. A probabilistic model facilitates the derivation of CRD.
The College of William and Mary
32