Netherlands Institute for Radio Astronomy
Image-Domain Gridding on Accelerators
Bram Veenboer
Monday 26th March, 2018, GPU Technology Conference 2018, San Jose, USA ASTRON is part of the Netherlands Organisation for Scientific Research (NWO)
Image-Domain Gridding on Accelerators Bram Veenboer Monday 26 th - - PowerPoint PPT Presentation
Netherlands Institute for Radio Astronomy Image-Domain Gridding on Accelerators Bram Veenboer Monday 26 th March, 2018, GPU Technology Conference 2018, San Jose, USA ASTRON is part of the Netherlands Organisation for Scientific Research (NWO)
Netherlands Institute for Radio Astronomy
Monday 26th March, 2018, GPU Technology Conference 2018, San Jose, USA ASTRON is part of the Netherlands Organisation for Scientific Research (NWO)
Image credits: NRAO
1
2
3
Image credits: NRAO 4
5
6
7
8
gridder
baseline (pair of receivers) receiver
× I
correlator imager visibilities calibration sky-model sky-image imaging ionosphere i n c
i n g r a d i
a v e s
9
10
W-projection gridding
using convolution kernels
Image-Domain gridding
using subgrids grid: visibility: convolution: updated pixel:
channels time
visibilities Fourier grid gridder kernel visibilities image subgrids Fourier subgrids Fourier grid gridder kernel FFT adder For more details: Image-Domain Gridding on Graphics Processors, Bram Veenboer, Matthias Petschow and John. W Romein, IPDPS 2017 11
SKA1 Low, Australia SKA1 Mid, Africa
12
10 20 30 40 50 60 Haswell KNL Pascal Vega Runtime [seconds] gridder subgrid-ifft adder grid-fft splitter subgrid-fft degridder
20 40 60 80 100 120 140 160 180 200 Haswell KNL Pascal Vega Throughput [MVisibilities/s] gridding degridding
13
1 2 4 8 16 32 64 128 256 512 1024 0.1 1 10
gridder degridder gridder degridder
Operational intensity [Op/Byte] Performance [TOp/s] Haswell KNL Pascal Vega
14
15
1 2 4 8 16 32 64 128 256 512 1024 0.1 1 10
gridder degridder gridder degridder
Operational intensity [Op/Byte] Performance [TOp/s] Haswell KNL Pascal Vega
16
1 α = . . . ; 2 3 for c=1,. . . , ˜
C do // channel
4
Φ = cos (α) + i sin (α);
5 6
Re(pix11) += Re(vis11[c]) ∗ Re(Φ[c]);
7
Im(pix11) += Re(vis11[c]) ∗ Im(Φ[c]);
8
Re(pix11) −= Im(vis11[c]) ∗ Im(Φ[c]);
9
Im(pix11) += Im(vis11[c]) ∗ Re(Φ[c]);
10 11
// [... same for pix12, pix21 and pix22]
12 end
17
1 2 4 8 16 32 64 128 256 512 1024 0.1 1 10
gridder degridder gridder degridder
Operational intensity [Op/Byte] Performance [TOp/s] Haswell KNL Pascal Vega
18
1 4 1 2
1 2 4 8 0.1 1 10
gridder degridder
Operational intensity [Op/Byte] Performance [TOp/s] Pascal Vega
19
2 4 6 8 10 12 14 16 18 Haswell KNL Pascal Vega Energy consumption [kJ] gridder subgrid-ifft adder grid-fft splitter subgrid-fft degridder host 5 10 15 20 25 30 35 Haswell KNL Pascal Vega Energy efficiency [GFlop/W] gridder degridder
20
8 16 24 32 40 48 56 64 107 108 W-kernel size NW Throughput [Visibilities/s] IDG Pascal WPG Pascal AWPG Pascal
21
1024 2048 4096 8192 16384 32768 65536 180 200 220 240 Size [pixels2] Throughput [MVisibilities/s] GPU only, gridding GPU only, degridding
22
1024 2048 4096 8192 16384 32768 65536 180 200 220 240 Size [pixels2] Throughput [MVisibilities/s] Hybrid, gridding Hybrid, degridding GPU only, gridding GPU only, degridding
23
1024 2048 4096 8192 16384 32768 65536 180 200 220 240
tiling in adder/splitter
Size [pixels2] Throughput [MVisibilities/s] Unified, gridding Unified, degridding Hybrid, gridding Hybrid, degridding GPU only, gridding GPU only, degridding
24
SKA-1 Low SKA-1 Mid # receivers 512 133 # baselines 13,0816 8778 # channels 65,536 65.536 # polarizations 4 4 integration time 0.9 (s) 0.14 (s) data rate 8.3 (GVis/s) 9.53 (GVis/s)
25
More details: Image-Domain Gridding on Graphics Processors, Bram Veenboer, Matthias Petschow and John. W Romein, IPDPS 2017 Source available at: https://gitlab.com/astron-idg/idg 26