Evaluation of a performance portable lattice Boltzmann code using OpenCL
Simon McIntosh-Smith Dan Curran
Computer Science University of Bristol
1
Twitter: @simonmcs
Evaluation of a performance portable lattice Boltzmann code using - - PowerPoint PPT Presentation
Evaluation of a performance portable lattice Boltzmann code using OpenCL Simon McIntosh-Smith Dan Curran Computer Science University of Bristol Twitter: @simonmcs 1 Motivation Our BUDE molecular docking code turned out to show strong
1
Twitter: @simonmcs
2
"High Performance in silico Virtual Drug Screening on Many-Core Processors",
3
4
5
at (128,1,1) for all OpenCL runs on all devices.
same way as the OpenCL execution, with a blocksize of (128,1,1).
OpenCL/CUDA versions
6
7
Single precision results
8
OpenCL single precision results 57% 67%
9
10
OpenCL single precision results
11
OpenCL single precision results AMD GPUs NVIDIA GPUs Intel CPU
12
13
14
15
Core Processors", S. McIntosh-Smith, J. Price, R.B. Sessions, A.A. Ibarra, IJHPCA 2014. doi: 10.1177/1094342014528252
many-core computer architectures", S.N. McIntosh-Smith, M. Boulton, D. Curran and J.R. Price. To appear, International Supercomputing, Leipzig, June 2014.
CUDA", Herdman, J., Gaudin, W., McIntosh-Smith, S., Boulton, M., Beckingsale, D., Mallinson, A., Jarvis, S. In: High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:. (Nov 2012) 465–471.
16
17