Sana Damani and Vivek Sarkar Habanero Extreme Scale Software Research Lab Georgia Institute of Technology
Common Subexpression Convergence (CSC)
Short paper at LCPC ’19, Atlanta, GA
Image credits: https://devblogs.nvidia.com/inside-volta
Figure: threads labeled by threadIdx.x (1 2 3 4 5 6 7), with annotations (1) through (5).
Figure: entry block with the definitions b = ... and c = ...
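Bottom-Up Traversal Through CDG
Figure: control dependence graph with branch conditions tid%2 and tid%3 (T and F edges); a=b*c appears under three different branch targets.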
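The figure labels above (branches on tid%2 and tid%3, with a=b*c repeated in three places) suggest the situation CSC targets: the same expression evaluated separately on divergent paths of one warp. Below is a minimal Python model of that situation, not the authors' implementation. It assumes a particular nesting (tid%3 tested on the false side of tid%2), a 32-lane warp, and that only the multiply is counted; the names PATHS, issue, before_csc, after_csc, and simt_efficiency are hypothetical helpers introduced for illustration.

```python
WARP_SIZE = 32  # assumed warp width (32 lanes on NVIDIA GPUs)

# Assumed branch nest: tid%2 is tested first, tid%3 on its false side.
# Each predicate selects the lanes that reach one copy of a = b * c.
PATHS = [
    lambda tid: tid % 2 != 0,
    lambda tid: tid % 2 == 0 and tid % 3 != 0,
    lambda tid: tid % 2 == 0 and tid % 3 == 0,
]


def issue(active, b, c, a):
    """Issue a = b * c once for the warp; only lanes in `active` execute."""
    for tid in active:
        a[tid] = b[tid] * c[tid]
    return len(active)  # number of active lanes for this issue


def before_csc(b, c):
    """Divergent version: one issue of the multiply per control path."""
    a = [None] * WARP_SIZE
    counts = []
    for on_path in PATHS:
        active = [tid for tid in range(WARP_SIZE) if on_path(tid)]
        counts.append(issue(active, b, c, a))
    return a, counts


def after_csc(b, c):
    """Converged version: the common multiply is issued once, all lanes active."""
    a = [None] * WARP_SIZE
    counts = [issue(list(range(WARP_SIZE)), b, c, a)]
    return a, counts


def simt_efficiency(counts):
    """Average fraction of active lanes per issued warp instruction."""
    return sum(counts) / (len(counts) * WARP_SIZE)


# Per-lane operands; the values are arbitrary test data.
b = list(range(WARP_SIZE))
c = [2 * tid + 1 for tid in range(WARP_SIZE)]

a_before, counts_before = before_csc(b, c)
a_after, counts_after = after_csc(b, c)

print(counts_before, simt_efficiency(counts_before))
print(counts_after, simt_efficiency(counts_after))
```

In this toy model the divergent version issues the multiply three times with 16, 10, and 6 active lanes, while the converged version issues it once with all 32 lanes active, and both produce the same values of a. These figures describe the model only; they are not measurements from the paper.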
Note: nvprof shows that the major gains come from a reduction of up to 27% in global reads with CSC (common-address reads/coalesced accesses).
Figure: SIMT efficiency (%), before and after. Benchmark labels as extracted: Hoist Sink Split Function Nested Switch.
Figure: Speedup. Benchmark labels as extracted: Hoist Sink Split Switch.
Figure: SIMT Efficiency for Bitonic Sort, before and after (min, max, and avg efficiency).
Figure: Run Time for Bitonic Sort, before and after (min, max, and avg run time).