SLIDE 20
20/48
thread 1 L(0,0) L(2,1) L(2,2) L(3,1) L(3,2) L(3,3) L(6,0) L(6,2) L(6,5) L(6,6) thread 2 L(1,1) L(4,0) L(4,1) L(4,4) L(5,2) L(5,5) thread 3 L(7,0) L(7,1) L(7,2) L(7,7) thread 4 thread 5 thread 6 thread 1 L(0,0) L(2,1) L(2,2) L(4,0) L(4,4) L(5,2) L(5,5) L(7,0) L(7,7) thread 2 L(4,1) L(7,1) thread 3 L(7,2) thread 4 L(1,1) L(3,1) L(3,2) L(3,3) L(6,0) L(6,5) L(6,6) thread 5 L(6,2) thread 6 thread 1 L(0,0) L(6,0) L(6,2) L(6,5) L(6,6) thread 2 L(1,1) L(7,0) L(7,1) L(7,2) L(7,7) thread 3 L(2,1) L(2,2) thread 4 L(3,1) L(3,2) L(3,3) thread 5 L(4,0) L(4,1) L(4,4) thread 6 L(5,2) L(5,5) warp 1 warp 2 warp 1 warp 2 warp 1 warp 2 time (a) Level-Set SpTRSV. (b) Warp-Level Synchronization-Free SpTRSV. (c) Thread-Level Synchronization-Free SpTRSV (CapelliniSpTRSV). Level 0 Level 1 Level 2 Level 3 Data transmission