A Relaxed Criterion for Loop Tiling
Riyadh Baghdadi, Albert Cohen, Sven Verdoolaege
UPMC/INRIA/ENS
September 22, 2015
Tiling
Main benefit: enhance data locality
Useful in architectures
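To make the locality claim concrete, here is a minimal sketch of tiling applied to matrix multiplication. The function names and the `tile` parameter are illustrative assumptions, not taken from the slides. The tiled version visits the same iteration points as the naive one, only grouped into small blocks so that the same elements are reused while they are likely still in fast memory. Plain Python lists will not show the cache effect; the sketch only shows the reordering and that the result is unchanged.

```python
def matmul_naive(a, b):
    """Reference triple loop: c[i][j] = sum over k of a[i][k] * b[k][j]."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            for k in range(m):
                c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_tiled(a, b, tile=2):
    """Same computation, with each loop split into a tile loop and a point loop.

    `tile` is a hypothetical tile size chosen for illustration only.
    """
    n, m, p = len(a), len(b), len(b[0])
    c = [[0] * p for _ in range(n)]
    # Outer loops enumerate tiles of the (i, j, k) iteration space.
    for ii in range(0, n, tile):
        for jj in range(0, p, tile):
            for kk in range(0, m, tile):
                # Inner loops enumerate the points inside one tile;
                # min() handles sizes that are not a multiple of `tile`.
                for i in range(ii, min(ii + tile, n)):
                    for j in range(jj, min(jj + tile, p)):
                        for k in range(kk, min(kk + tile, m)):
                            c[i][j] += a[i][k] * b[k][j]
    return c
```

With integer inputs both functions return identical results, since tiling only changes the order in which the partial products are accumulated.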
[Figure: iterations j and j+1 with statement instances s1(j), s2(j), s1(j+1), s2(j+1); labels: live range, WAR dependence]
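The labels in the figure (a live range and a WAR dependence across iterations j and j+1) can be illustrated with a small sketch. The statement bodies are assumptions made for illustration: s1(j) is taken to write a scalar temporary `t` and s2(j) to read it, so the live range of `t` runs from s1(j) to s2(j). Reusing `t` in the next iteration then creates a write-after-read (anti) dependence from s2(j) to s1(j+1). The helper names `trace` and `war_pairs` are hypothetical.

```python
def trace(n):
    """Record the accesses to scalar t for n iterations.

    Assumed loop body (not from the slides):
        s1(j): t = a[j] * 2     -> write of t
        s2(j): b[j] = t + 1     -> read of t
    """
    events = []
    for j in range(n):
        events.append(("s1", j, "W", "t"))
        events.append(("s2", j, "R", "t"))
    return events


def war_pairs(events):
    """Pair each read with the next later write of the same variable (WAR)."""
    pairs = []
    for idx, (stmt, it, kind, var) in enumerate(events):
        if kind != "R":
            continue
        for stmt2, it2, kind2, var2 in events[idx + 1:]:
            if kind2 == "W" and var2 == var:
                pairs.append(((stmt, it), (stmt2, it2)))
                break
    return pairs
```

For three iterations, `war_pairs(trace(3))` yields the pairs s2(0) to s1(1) and s2(1) to s1(2); the last read has no later write and so produces no pair.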
[Figure: grid of alternating S1 and S2 statement instances]
[Figure: speedups (logarithmic scale, axis ticks 0.2, 0.5, 1.0, 2.0, 4.0, 10.0) for OpenCV-OpenCL and PPCG-OpenCL on the benchmarks resize, dilate, color conversion, affine warping, 2D convolution, gaussian smoothing, and basic histogram]