1
ONE VIEW: a tool for performance analysis agregation
- W. Jalby
ONE VIEW: a tool for performance analysis agregation W. Jalby - - PowerPoint PPT Presentation
ONE VIEW: a tool for performance analysis agregation W. Jalby University of Versailles Saint Quentin en Yvelines/ECR 1 Stage Architectures, Applications and last but not least Tools also are becoming very complex A simple example:
1
2
3
4
5
6
7
8
9
10
1 1,1 1,2 1,3 1,4 1,5 1,6 1,7 1,8 1,9 2 1 4 7 11 15 18 21 25 28 32 35 39 46 52 56 61 65 69 73 80 83 86 90 94 99 102 109 113 120 123 127 132 140 145 151 155 159 163 167 170 175
If Clean If FP vectorized If fully vectorized
11
ID Coverage (% app. time) CQA speedup if clean CQA speedup if FP arith vectorized CQA speedup if fully vectorized CQA speedup if no inter- iteration dependency CQA speedup if next bottleneck killed Loop 39633 22,02 Path 1 1.23 1.45 3.94 1.00 1.03 Loop 39632 5,87 Path 1 1.00 1.31 3.82 1.00 1.20 Loop 7865 4,51 Path 1 1.02 1.00 1.21 1.00 3.65 Loop 21866 2,83 Path 1 1.65 1.54 4.00 1.00 1.42 Loop 7862 2,5 Path 1 1.07 1.07 4.00 1.00 1.05 Loop 39623 2,01 Path 1 1.03 1.04 4.00 1.00 1.13 Loop 25990 1,25 Path 1 1.39 1.69 4.00 1.00 1.13 Loop 7836 1,2 Path 1 1.09 1.06 3.49 1.00 1.09 Loop 4729 1,2 Path 1 1.57 1.60 5.68 1.00 1.33 Path 2 1.73 1.65 5.85 1.00 1.36 Path 3 1.73 1.65 5.85 1.00 1.36 Path 4 2.00 1.73 6.10 1.00 1.39
12
1,01 1,02 1,03 1,04 1,05 1,06 1,07 1,08 1,09 1,1 1,11 39633 39632 7865 21866 7862 39623 25990 7836 4729 4756 22001 2809 5121 2584 25986 5132 22005 43388 2813 44910 2569 39673 43395 7880 43400 45900 5118 5106 5120 7947 43397 4773 21887 39671 43164 21886 2561 2505 43403 43120 22198 2576
Cumul Speedup If Clean
13
1 1,02 1,04 1,06 1,08 1,1 1,12 1,14 1,16 1,18 1,2 39633 7865 7862 25990 4729 22001 5121 25986 22005 2813 2569 43395 43400 5118 5120 43397 21887 43164 2561 43403 22198 4760 4753 4781 45899 43121 43234 22209 45274 4717 4779 5570 5618 7925 43128 22195 4758 999 875 4780 43271 22213 4844 4813
Cumul Speedup If FP Arith Vectorized
14
0,2 0,4 0,6 0,8 1 1,2 1,4 1,6 1,8 2 39633 39632 7865 21866 7862 39623 25990 7836 4729 4756 22001 2809 5121 2584 25986 5132 22005 43388 2813 44910 2569 39673 43395 7880 43400 45900 5118 5106 5120 7947 43397 4773 21887 39671 43164 21886 2561 2505 43403 43120 22198
Cumul Speedup If Fully Vectorized
15
16
17
18
1 1,01 1,02 1,03 1,04 1,05 1,06 1,07 1,08 1,09 1,1
Cumul Speedup If L1
19