DUKE MIT-LL HPEC-2007
FFTs of Arbitrary Dimensions on GPUs
Xiaobai Sun and Nikos Pitsianis
Duke University
September 19, 2007
At High Performance Embedded Computing 2007, MIT-LL
Overview
Motivation
– FFTs of arbitrary dimensions and their applications
– Graphics processing units (GPUs)
[Figure labels: Remove Keystone; Spatially Variant Refocus; Motion Comp; Auto-Focus]
From S. Bellofiore and H. Schmitt at
[Figure labels: Polar Format; NUFFT]
Image degradation by traditional interpolation prior to FFT
– Loss of resolution
– Loss of data
[Figure: polar format processing followed by a 2-D FFT. Labels: Synthetic Aperture, Ground Plane, Slant Plane, CRP, Δθ, Range Samples, PRFs. Dimension labels: 1800, 1800, 2048, 2048. Ratios: 1:1 range, 1.25:1 azimuth.]
From S. Bellofiore and H. Schmitt at
GPU: Graphics Processing Unit
– Two complex numbers per pixel vector (4 floating-point numbers)
– Even columns in the front layers, odd columns in the back layers
– Making the best use of the architectural support
– Radix-2, radix-3 and mixed radices
– Up to a certain sub-array size
– 2D data partitioning in large data arrays
[Figure labels: Re(X), Im(X)]
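Two of the points above can be illustrated with a minimal CPU-side sketch in Python. It is not the authors' GPU code. The first part packs two complex numbers into one 4-float pixel, with an even column in the first two channels and the following odd column in the last two; reading "front" and "back" layers as those channel pairs is an assumption of this sketch. The second part is a plain recursive decimation-in-time FFT that handles lengths whose only prime factors are 2 and 3. The names `pack_columns`, `unpack_columns` and `fft_mixed` are hypothetical.

```python
import cmath


def pack_columns(x):
    """Pack a complex 2D array (list of rows) into 4-float pixels.

    Pixel (i, k) holds columns 2k and 2k+1 of row i as
    (Re even, Im even, Re odd, Im odd). The channel order is an
    assumption of this sketch, not taken from the slides.
    """
    if any(len(row) % 2 for row in x):
        raise ValueError("each row needs an even number of columns")
    return [[(row[2 * k].real, row[2 * k].imag,
              row[2 * k + 1].real, row[2 * k + 1].imag)
             for k in range(len(row) // 2)]
            for row in x]


def unpack_columns(pixels):
    """Inverse of pack_columns: rebuild the complex rows."""
    rows = []
    for pixel_row in pixels:
        row = []
        for re0, im0, re1, im1 in pixel_row:
            row += [complex(re0, im0), complex(re1, im1)]
        rows.append(row)
    return rows


def fft_mixed(x):
    """Recursive FFT for lengths of the form 2**a * 3**b."""
    n = len(x)
    if n == 1:
        return list(x)
    p = 2 if n % 2 == 0 else 3          # radix used at this level
    if n % p:
        raise ValueError("length must factor into 2s and 3s")
    m = n // p
    # p interleaved sub-sequences, each transformed recursively
    subs = [fft_mixed(x[r::p]) for r in range(p)]
    out = [0j] * n
    for k in range(m):
        # apply twiddle factors to the k-th output of each sub-transform
        t = [cmath.exp(-2j * cmath.pi * r * k / n) * subs[r][k]
             for r in range(p)]
        # p-point butterfly combining the twiddled values
        for q in range(p):
            out[k + q * m] = sum(cmath.exp(-2j * cmath.pi * r * q / p) * t[r]
                                 for r in range(p))
    return out
```

For example, `fft_mixed` accepts a length-12 input by applying radix 2 twice and radix 3 once, and `unpack_columns(pack_columns(x))` returns `x` for any array with an even number of columns.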
block size (low bits)
(bit reversal in high bits)
X(i, j) → X(R_m(i), R_n(j))
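The index mapping above can be sketched in Python, assuming that R_m and R_n denote bit reversal of m-bit row indices and n-bit column indices. The function `reverse_high_bits` shows one possible reading of the "low bits" and "high bits" labels, in which the low bits address a position inside a block and only the high bits are reversed; that reading, and all function names, are assumptions of this sketch.

```python
def bit_reverse(i, bits):
    """Reverse the lowest `bits` bits of the non-negative integer i."""
    r = 0
    for _ in range(bits):
        r = (r << 1) | (i & 1)   # move the lowest bit of i onto r
        i >>= 1
    return r


def reverse_high_bits(i, bits, low):
    """Keep the `low` lowest bits (offset inside a block) and
    bit-reverse the remaining `bits - low` high bits (block index)."""
    offset = i & ((1 << low) - 1)
    block = i >> low
    return (bit_reverse(block, bits - low) << low) | offset


def bit_reverse_2d(x):
    """Return y with y[i][j] = x[R_m(i)][R_n(j)] for a 2^m by 2^n array."""
    rows, cols = len(x), len(x[0])
    m, n = rows.bit_length() - 1, cols.bit_length() - 1
    if rows != 1 << m or cols != 1 << n:
        raise ValueError("both dimensions must be powers of two")
    return [[x[bit_reverse(i, m)][bit_reverse(j, n)] for j in range(cols)]
            for i in range(rows)]
```

Because bit reversal is its own inverse, applying `bit_reverse_2d` twice returns the original array.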
[Figure: 2D FFT Times. x-axis: log2 of data volume (18 to 23); y-axis: time in msec; series: Arithmetic, Bit Reversal.]
[Figure: timing breakdown. x-axis: log2 of data volume (18 to 23); y-axis: time in msec; series: Write to GPU, Arithmetic, Bit Reversal, Read from GPU.]
[Figure: 1D, 2D and 3D FFTs. x-axis: log2 of data volume (18 to 23); y-axis: compute time in msec; series: 1-D FFT, 2-D FFT, 3-D FFT.]
– Pre-calculated, partially calculated, or calculated on the fly
– Numerical behavior
– Data placement in main memory
– A sequence of successive FFTs
– IBM Cell
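The first two items in the list above, on pre-calculated, partially calculated and on-the-fly values and their numerical behavior, presumably concern the FFT twiddle factors; the slides do not say so explicitly, so that is an assumption here. Under that assumption, the Python sketch below compares a full lookup table, a two-level "partial" table that forms each factor as a product of a coarse and a fine entry, and an on-the-fly recurrence that multiplies repeatedly by the first factor. All function names are hypothetical, and the recurrence is only one of several possible on-the-fly schemes.

```python
import cmath


def twiddle_direct(n, k):
    """Reference value exp(-2*pi*i*k/n)."""
    return cmath.exp(-2j * cmath.pi * k / n)


def twiddle_table(n):
    """Fully pre-calculated: one stored entry per factor."""
    return [twiddle_direct(n, k) for k in range(n)]


def twiddle_partial(n, s):
    """Partially calculated: n/s coarse entries and s fine entries,
    combined with one complex multiplication per factor."""
    if n % s:
        raise ValueError("s must divide n")
    coarse = [twiddle_direct(n, a * s) for a in range(n // s)]
    fine = [twiddle_direct(n, b) for b in range(s)]
    return [coarse[k // s] * fine[k % s] for k in range(n)]


def twiddle_recurrence(n):
    """On the fly: w_(k+1) = w_k * w_1; no table, but rounding
    errors accumulate along the recurrence."""
    w1 = twiddle_direct(n, 1)
    out, w = [], 1 + 0j
    for _ in range(n):
        out.append(w)
        w *= w1
    return out


def max_error(approx, n):
    """Largest absolute deviation from the reference values."""
    return max(abs(approx[k] - twiddle_direct(n, k)) for k in range(n))


if __name__ == "__main__":
    n = 1024
    print("table     ", max_error(twiddle_table(n), n))
    print("partial   ", max_error(twiddle_partial(n, 32), n))
    print("recurrence", max_error(twiddle_recurrence(n), n))
```

The trade-off is between storage and arithmetic: the full table needs n entries, the two-level table needs n/s + s entries plus one multiplication per factor, and the recurrence needs no table but is the least accurate of the three.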