SLIDE 15 IIIT Hyderabad!
GPU BWT vs. Bzip2 BWT"
0.092 0.212 0.18 0.397 0.021 0.021 0.02 0.02 0.07 0.209 0.104 0.09 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5
enwik8, MSD = 960, ASD = 298 wiki-xml, MSD = 960, ASD = 614 silesia.tar MSD=16320, ASD=1406 linux-2.6.11.tar, MSD = 65472, ASD = 2836
Average runtime (secs/per block) for CPU and GPU BWT Algorithms, Block Size : 900KB
GPU Sort (2/3rd + 1/3rd) CPU Merge (1/3rd + 2/3rd) CPU BWT (bzip2) GPU BWT Time (Increases with MSD/ASD) Constant Time Merge Operation
0.28 0.878 0.75 3.152 0.183 0.186 0.171 0.162 0.46 1.69 0.768 0.95 0.5 1 1.5 2 2.5 3 3.5 4
enwik8, MSD = 960, ASD = 576 wiki-xml, MSD = 960, ASD = 874 silesia.tar MSD=16320, ASD=4075 linux-2.6.11.tar, MSD = 65472, ASD = 10078
Average runtime (secs/per block) for CPU and GPU BWT Algorithms, Block Size : 4.5MB
GPU Sort (2/3rd + 1/3rd) CPU Merge (1/3rd + 2/3rd) CPU BWT (bzip2) GPU BWT Time increase with MSD/ASD Constant Time Merge Operation
0.526 1.81 1.57 9.39 0.414 0.406 0.385 0.412 1.08 4.53 1.79 2.31 1 2 3 4 5 6 7 8 9 10
enwik8, MSD = 960, ASD = 813 wiki-xml, MSD = 960, ASD = 929 silesia.tar MSD=16320, ASD=8430 linux-2.6.11.tar, MSD = 262080, ASD = 27340
Average runtime (secs/per block) for CPU and GPU BWT Algorithms, Block Size : 9MB
GPU Sort (2/3rd + 1/3rd) CPU Merge (1/3rd + 2/3rd) CPU BWT (bzip2) GPU BWT Time increase with MSD/ASD Constant Time Merge Operation
No'speedup'for' small'blocks' ' GPU'not'uHlized' sufficiently' Speedup'on'large' blocks'' ' GPU'sHll'slow'for' worstIcase'linux' dataset'
May 29, 2015 AsHES 2015