PARALLEM: massively Parallel Landscape Evolution Modelling
Tuesday 28th November 2017 The University of Sheffield
- A. Stephen McGough, Darrel Maddy
- J. Wainwright, S. Liang, M. Rapoportas, A. Trueman,
- R. Grey, G. Kumar Vinod, and James Bell
PARALLEM: massively Parallel Landscape Evolution Modelling Tuesday - - PowerPoint PPT Presentation
PARALLEM: massively Parallel Landscape Evolution Modelling Tuesday 28 th November 2017 The University of Sheffield A. Stephen McGough , Darrel Maddy J. Wainwright, S. Liang, M. Rapoportas, A. Trueman, R. Grey, G. Kumar Vinod, and James Bell
31 22 32 33 32 25 33 34 29 26 27 39 36 27 26 41 44 50 45 44 40 51 55 39 44 46
7 10 7 10 5 8 9 5 9 6 4 6 7 8 4 8 7 9 8 7 9 8 4 6 5 6
Flow Routing Flow Accumulation Erosion/ Deposition
1 1 3 1 1 7 2 1 1 5 1 1 1 1 1 2 1 1 1 1 1 1 6 1 2
How much material will be removed? How much material will be deposited? Sequential version is much slower than this…
1M years
this would take 83 – 694 days!
8
Single flow direction vs multiple flow direction MFD is ‘better’ but much more computationally demanding
Flow Routing Accumulation Correct 1 1 1 1 1 1 1 1 1 1 1 3 2 2 2 2 4 6 3 4 5 6 19 7 14
0.01 0.1 1 10 100 1000 0.01 0.1 1 10 100 Time (s) DEM size CybErosion-slim T esla single iteration 580 single iteration T esla average 10 580 average 10 (millions)
0.0001 0.001 0.01 0.1 1 10 100 1000 0.001 0.01 0.1 1 10 100 Time (s) DEM size (millions) Sequential Flow Direction T esla Flow Direction 580 Flow Direction T erraflow Flow Direction
0.001 0.01 0.1 1 10 100 1000 0.001 0.01 0.1 1 10 100 Time (s) DEM size (millions) Sequential Flow Accumulation T esla Flow Accumulation 580 Flow Accumulation T erraflow Flow Accumulation T esla K20 Flow Accumulation T esla K20 - removed conditionals
Upper Thames Valley + 120K
GPU 1 GPU 2 GPU 3 GPU 4
5E+10 1E+11 1.5E+11 2E+11 2.5E+11 1 2 3 4 5 6 7
Wallclock Runtime (nanoseconds)
Compute Transfer
GPU Count
1 10 100 1000 10000 1 2 3 4 5 6
Wall-clock runtime (seconds) GPU Count
5m Active Cells (Kepler K40/K80) 20m Active Cells (Kepler K40/K80) 5m Active Cells (Pascal Titan XP) 5m Active Cells Sequential (CPU)
10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 138
10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 70 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 50 70 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 25 50 70 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 20 25 50 70 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 15 20 25 50 70 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 10 15 20 25 50 70 115 125 138 10 20 30 40 50 60 70 80 90 100 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5 10 15 20 25 50 70 115 125 138
70 72 74 76 78 80 82 84 500 1000 1500 2000 2500 3000 1k 20kSFD 20kMFD
20 40 60 80 100 1 10 100 1000 10000 Percentage Complete Iteration
26/11/2017 Iapetuslogo.svg file:///Users/asm/Desktop/Iapetuslogo.svg 1/1
One potential PhD position to work on this
stephen.mcgough@newcastle.ac.uk darrel.maddy@newcastle.ac.uk
and James Bell We Are recruiting:
26/11/2017 Iapetuslogo.svg file:///Users/asm/Desktop/Iapetuslogo.svg 1/1
One potential PhD position to work on this