An Agile Approach to Building a GPU-enabled and Performance- portable Global Cloud-resolving Atmospheric Model
- Dr. Richard Loft*
An Agile Approach to Building a GPU-enabled and Performance- - - PowerPoint PPT Presentation
An Agile Approach to Building a GPU-enabled and Performance- portable Global Cloud-resolving Atmospheric Model Dr. Richard Loft* Director, Technology Development CISL/NCAR *National Center for Atmospheric Research GTC, San Jose, CA March 26,
2
3
– A set of non-linear partial differential equations (PDE) – Capture features of atmospheric flow around the Earth
RBF-FD solution to SWE test case “Flow over an isolated mountain” using 655,532 points [1]
3
An example of 75-point stencil
Evaluate differential
Stencil points Non-stencil points Cone-shaped mountain Day 1 Day 15
4
Insufficient Workload Parallelism Sufficient Workload Parallelism
predicts performance well, even for more complicated algorithms.
DRAM BW limit when cache size is exceeded, with some state reuse.
less sensitive to problem size that Xeon, saturates with CI figure.
fits CI model GPU’s require higher levels of parallelism to reach saturation.
5
6
Simulation of 2012 Tropical Cyclones at 4Km Resolution – Courtesy of Falko Judt, NCAR
7
8
MPAS Dynamics MPAS Physics Problem Reports and Support Ideas and Results
9
10 Problem Reports and Support
https://github.com/NCAR/KGen
11
13
14