High Performance In-Situ Visualization
- n Thousands of GPUs
Jeroen Bédorf Simon Portegies Zwart
Leiden Observatory
High Performance In-Situ Visualization on Thousands of GPUs Jeroen - - PowerPoint PPT Presentation
High Performance In-Situ Visualization on Thousands of GPUs Jeroen Bdorf Evghenii Gaburov Simon Portegies Zwart Peter Messmer Leiden Observatory Compute machine Simulation I/O layer disk I/O software Storage
Leiden Observatory
Compute machine Simulation Ex-situ visualization machine I/O layer I/O layer
analysis & visualization software
disk I/O software Storage disk I/O software
Compute & in-situ visualization machine Simulation
analysis & visualization, simulation steering sw
I/O layer Storage disk I/O software
http://github.com/treecode/Bonsai
http://github.com/treecode/Bonsai
Compute & in-situ visualization machine Bonsai
analysis & visualization, simulation steering sw
I/O layer
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Display Display (240 ms) Display Compositing
1
Simulation step (80 ms)
2
Data partition (50 ms)
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Simulation step …
1 2 3 4 5 7 8 9 6
1 2 3 4 5 7 8 9 6
depth
depth
depth
4 3 2 5 1
depth
P Q L
1 2 3 4 5 7 8 9 6
P Q L
1 2 3 4 5 7 8 9 6
P Q L
1 2 9 1 2 3 4 5 7 8 9 6
1 2 3 4 5 7 8 9 6
P Q L
1 2 9
1 2 3 4 5 7 8 9 6
P Q L
1 2 9 3 4 3 5 4 5 6 7
2 3 4 5 6 7 8 9
P Q L
1
1 2 3 4 5 6 7 8 9
P Q L
1 4 8 7
2 3 5 6 9
SFC Recursive multi-section
SFC Recursive multi-section
GPU-0 GPU-1 GPU-2 GPU-3 GPU-4 GPU-5 GPU-6 GPU-7 GPU-8
1 4 8 7
2 3 5 6 9
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 7 7 7 7 7 1 1 1 1,3 1,3 3 7 7 7 7 7 1 1 1,6 1,3,6 1,3,6 3 7 7 7 7 7 6 3,6 3,6 3 7 7 7 7 7 6 6 6 6 6 6
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
MPI_Alltoallv(..)
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 7 7 7 7 7 1 1 1 1,3 1,3 3 7 7 7 7 7 1 1 1,6 1,3,6 1,3,6 3 7 7 7 7 7 6 3,6 3,6 3 7 7 7 7 7 6 6 6 6 6 6
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 7 7 7 7 7 1 1 1 1+3 1+3 3 7 7 7 7 7 1 1 1+6
1+3+6 1+3+6
3 7 7 7 7 7 6 3+6 3+6 3 7 7 7 7 7 6 6 6 6 6 6
proc 0 proc 1 proc 2 proc 3 proc 4 proc 5 proc 7 proc 6
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Compositing
1
Simulation step (80 ms)
2
Data partition (50 ms)
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Simulation step … Display Display (240 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
1
Simulation step (80 ms)
2
Data partition (50 ms)
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Simulation step
1
Simulation step (80 ms) Simulation step (80 ms) Simulation step (80 ms) Simulation step …
2
Data partition (50 ms)
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Simulation step
1
Simulation step (80 ms) Simulation step (80 ms) Simulation step (80 ms) Simulation step … Data partition (50 ms)
2
Data partition (50 ms) Data partition (50 ms) Data partition
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Simulation step
1
Simulation step (80 ms) Simulation step (80 ms) Simulation step (80 ms) Simulation step … Data partition (50 ms)
2
Data partition (50 ms) Data partition (50 ms) Data partition OpenGL rendering OpenGL rendering (60 ms) OpenGL rendering (60 ms)
3
OpenGL rendering (60 ms) OpenGL rendering Compositing Compositing (50 ms) Compositing (50 ms) Compositing (50 ms)
4
Compositing (50 ms) Display Display (60 ms) Display (60 ms) Display (60 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Compositing
1
Simulation step (80 ms)
2
Data partition (50 ms)
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Simulation step … Display Display (240 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Simulation step
1
Simulation step (80 ms) Simulation step (80 ms) Simulation step (80 ms) Simulation step … Data partition (50 ms)
2
Data partition (50 ms) Data partition (50 ms) Data partition OpenGL rendering OpenGL rendering (60 ms) OpenGL rendering (60 ms)
3
OpenGL rendering (60 ms) OpenGL rendering Compositing Compositing (50 ms) Compositing (50 ms) Compositing (50 ms)
4
Compositing (50 ms) Display Display (60 ms) Display (60 ms) Display (60 ms) Display
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Compositing
1
Simulation step (80 ms)
2
Data partition (50 ms)
3
OpenGL rendering (60 ms)
4
Compositing (50 ms) Simulation step … Display Display (240 ms) Display
http://github.com/treecode/Bonsai
10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms 10 ms
Simulation step
1
Simulation step (80 ms) Simulation step (80 ms) Simulation step (80 ms) Simulation step … Data partition (50 ms)
2
Data partition (50 ms) Data partition (50 ms) Data partition OpenGL rendering OpenGL rendering (60 ms) OpenGL rendering (60 ms)
3
OpenGL rendering (60 ms) OpenGL rendering Compositing Compositing (50 ms) Compositing (50 ms) Compositing (50 ms)
4
Compositing (50 ms) Display Display (60 ms) Display (60 ms) Display (60 ms) Display
Leiden Observatory