CS 294-73 Software Engineering for Scientific Computing

    CS 294-73   Software Engineering for Scientific Computing   Lecture 11: Fourier transforms; inheritance.  

The Fourier Transform Given a function , we can define its Fourier transform and its inverse: This is useful in a number of settings: approximation theory, solving partial differential equations, signal processing. 2 9/28/2017 CS294-73 Lecture 11

The Fourier Transform Examples: • If f is a (q+1) – times differentiable, periodic function, then • If f is a smooth function, then Taken together, these two examples provide a mechanism for computing very accurate approximations to f and its derivatives. 3 9/28/2017 CS294-73 Lecture 11

The Fourier Transform For dimensions greater than 1, can apply the 1D transform in each direction separately. Then estimates and identities from the 1D case apply to the partial sums and to derivatives: 4 9/28/2017 CS294-73 Lecture 11

The Discrete Fourier Transform Given a discrete set of function values on an equally-spaced grid, e.g., we can define its discrete Fourier transform (DFT): We could have used any contiguous set of wave numbers k ∈ [ k 0 , . . . , k 0 + N − 1] W ( k ) = W ( k + N ) ( z N = 1) and get back our original series using . 5 9/28/2017 CS294-73 Lecture 11

The Discrete Fourier Transform The set of vectors form an orthonormal basis for : So the DFT is invertible, and the inverse given by The above relationship is a discrete analogue of Fourier expansion of a function described earlier. 6 9/28/2017 CS294-73 Lecture 11

The Discrete Fourier Transform A discrete Fourier transform has analogous properties to the continuous one. • Interpolation: if where F is (q+1)-times differentiable, then 7 9/28/2017 CS294-73 Lecture 11

The Discrete Fourier Transform • Diagonalizing difference operators: 8 9/28/2017 CS294-73 Lecture 11

The Discrete Fourier Transform • All of this extends to multiple dimensions by applying it one dimension at a time: 9 9/28/2017 CS294-73 Lecture 11

The Discrete Fourier Transform A straightforward implementation of costs O( N 2 ) operations. • Each inner product is O( N ), and there are N of them; • Computing the sum is also O( N) operations for each j , and there are N of those. • The DFT is multiplication an NxN orthogonal matrix. 10 9/28/2017 CS294-73 Lecture 11

Fast Fourier Transform (Cooley and Tukey, 1965) We also have which allows us to obtain from the above sums. It also allows us to compute , k > N/2 from So the number of flops to compute is 2N, given that you have 11 9/28/2017 CS294-73 Lecture 11

Fast Fourier Transform Can continue this process until computing 2 M-1 sets of . At each level, the computational cost is (4 multiplies + 6 adds) x N/2 (the factors z k are precomputed once, at a cost of O(N)). So the total number of flops is O(M N) = O(N log N). (actually, 5N log 2 N). (NB: only one complex multiply per pair). (a + i *b)*(c+ i *d) =a*c-b*d + i *(a*d+b*c) This leads to the following implementation using recursion. 12 9/28/2017 CS294-73 Lecture 11

Fast Fourier Transform - Recursive Implementation void FFT::applyFFT(vector<complex<double> >& a_fhat, const vector<complex<double> >& a_f, int a_level) { if (level > 1){ vector<complex<double> > fEven = evenPoints(a_f); vector<complex<double> > fOdd = oddPoints(a_f); int N = a_f.size(); a_fhat.resize(N); Lots of copying, setting up vectors, … vector<complex<double> > fHatHalfEven,fHatHalfOdd; applyFFT(fEven,fHatHalfEven,level-1); Lots of function calls - O(2 m ) them, many of which do very little work. applyFFT(fOdd,fHatHalfOdd,level-1); // loop to compute fHatEven[k] + z^k*FhatOdd[k] } else fHat[0] = a_f[0] + a_f[1]; fHat[1] = a_f[0] - a_f[1]}; // z = -1 at level 1. }; (We’ll defer a deeper dive into recursion to a later time). 13 9/28/2017 CS294-73 Lecture 11

std::complex (2) Complex type. This is an STL type, templated on the type of the real and imaginary parts ( float, double, int) . A complex number is stored as a pair (realpart,imaginarypart) , in contiguous memory locations. To use it, you need the header #include <complex> Using namespace std: ... complex<int> c(1,2); complex<int> csq = c*c; complex<float> c(1.,2.); complex<float> csq = c*c; 14 9/28/2017 CS294-73 Lecture 11

Fast Fourier Transform Let’s look a little more closely at what is going on… F N/ 2 ( E f ) k = F N/ 4 ( E 2 f ) k + z k F N/ 4 ( OE f ) k F N/ 2 ( E f ) k + N/ 4 = F N/ 4 ( E 2 f ) k − z k F N/ 4 ( OE f ) k F N/ 2 ( O f ) k = F N/ 4 ( EO f ) k + z k F N/ 4 ( O 2 f ) k F N/ 2 ( O f ) k + N/ 4 = F N/ 4 ( EO f ) k − z k F N/ 4 ( O 2 f ) k z = e 2 πι (2 /N ) 15 9/28/2017 CS294-73 Lecture 11

Fast Fourier Transform F N/ 4 ( E 2 f ) k = F N/ 8 ( E 3 f ) k + z k F N/ 8 ( OE 2 f ) k F N/ 4 ( E 2 f ) k + N/ 8 = F N/ 8 ( E 3 f ) k − z k F N/ 8 ( OE 2 f ) k F N/ 4 ( EO f ) k = F N/ 8 ( E 2 O f ) k + z k F N/ 8 ( OEO f ) k F N/ 4 ( EO f ) k + N/ 8 = F N/ 8 ( E 2 O f ) k − z k F N/ 8 ( OEO f ) k F N/ 4 ( OE f ) k = F N/ 8 ( EOE f ) k + z k F N/ 8 ( O 2 E f ) k F N/ 4 ( OE f ) k + N/ 8 = F N/ 8 ( EOE f ) k − z k F N/ 8 ( O 2 E f ) k F N/ 4 ( O 2 f ) k = F N/ 8 ( EO 2 f ) k + z k F N/ 8 ( O 3 f ) k F N/ 4 ( O 2 f ) k + N/ 8 = F N/ 8 ( EO 2 f ) k − z k F N/ 8 ( O 3 f ) k z = e 2 πι (4 /N ) 9/28/2017 CS294-73 Lecture 11 16

Update-in-place Cooley-Tukey Implementation Even: bit = 0 000 001 010 011 100 101 110 111 Odd: bit = 1 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 17 9/28/2017 CS294-73 Lecture 11

Update-in-place Cooley-Tukey Implementation Let’s change how we store the function values in memory, based on reversing the order of the binary digits: . Sorted based on reversed- ordered indices 0 4 2 6 1 5 3 7 Values for reverse-ordered indices 000 100 010 110 001 101 011 111 Original Ordering 000 001 010 011 100 101 110 111 000 100 010 110 001 101 011 111 000 001 010 011 100 101 110 111 000 100 010 110 001 101 011 111 000 001 010 011 100 101 110 111 000 100 010 110 001 101 011 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 000 001 010 011 100 101 110 111 No recursion - expressed as a loop instead. Entirely update in place, with a pair of complex numbers as temporaries. Memory locality ? 18 9/28/2017 CS294-73 Lecture 11

Split-radix formulation An FFT of size N = N 1 N 2 can be expressed as a composition of two sets of FFTs, one set of length N 1 , the other set of length N 2 , with a set of N 1 N 2 multiplications in between N 1 − 1 N 2 − 1 f j 2 N 1 + j 1 z ( j 2 N 1 + j 1 ) k ˆ X X f k 2 + N 2 k 1 = , k = k 2 + N 2 k 1 , N = N 1 N 2 N j 1 =0 j 2 =0 N 1 − 1 N 2 − 1 f j 2 N 1 + j 1 z j 2 N 1 ( k 2 + N 2 k 1 ) X X z j 1 k = N N j 1 =0 j 2 =0 N 1 − 1 ⇣ N 2 − 1 ⌘ X X f j 2 N 1 + j 1 z j 2 k 2 z j 1 k ( z N 2 N = z N 1 , z N 1 N 2 = = 1) N 2 N N j 1 =0 j 2 =0 N 1 − 1 ⇣⇣ N 2 − 1 ⌘ ⌘ ( z j 1 ( k 2 + k 1 N 2 ) X X z j 1 k 1 f j 2 N 1 + j 1 z j 2 k 2 z j 1 k 2 = z j 1 k 2 z j 1 k 1 = N 1 ) N 1 N 2 N N N j 1 =0 j 2 =0 Duhamel and Vetterli, “Fast Fourier Transforms: a Tutorial Review and the State of the Art”, Signal Processing 19, p. 259-299 (1990). 19 9/28/2017 CS294-73 Lecture 11

CS 294-73 Software Engineering for Scientific Computing - PowerPoint PPT Presentation

CS 294-73 Software Engineering for Scientific Computing Lecture 11: Fourier transforms; inheritance. The Fourier Transform Given a function , we can define its Fourier transform and its

CS 294-73 Software Engineering for Scientific Computing Lecture 5: More

CS 294-73 Software Engineering for Scientific Computing Lecture 6: Git, homework #1,

CS 294-73 Software Engineering for Scientific Computing Lecture 10:Dense Linear

CS 294-73 Software Engineering for Scientific Computing Lecture 13: Particle

CS 294-73 Software Engineering for Scientific Computing Lecture 9: Performance on

CS 294-73 Software Engineering for Scientific Computing Lecture 3

CS 294-73 Software Engineering for Scientific Computing Lecture 18: Performance

CS 294-73 Software Engineering for Scientific Computing Lecture 4:

CS 294-73 Software Engineering for Scientific Computing Lecture 14: Development

CS 294-73 Software Engineering for Scientific Computing Lecture 14: PPPM for

CS 294-73 Software Engineering for Scientific Computing Lecture 7: STL

CS 294-73 Software Engineering for Scientific Computing Lecture

CS 294-73 Software Engineering for Scientific Computing Lecture 15: Development

CS 294-73 Software Engineering for Scientific Computing Lecture 8:

CS 294-73 Software Engineering for Scientific Computing

Concepts and Algorithms of Scientific and Visual Computing Fast Fourier Transform CS448J,

The Fast Fourier Transform (FFT) A top 10 Algorithm* Rubin H Landau Sally Haerer,

Polynomials and Fast Fourier Transform (FFT) Polynomials n-1 a i x i a polynomial of degree n-1

PQL: A Purely-Declarative Java Extension for Parallel Programming Christoph Reichenbach 1 , 2 ,

How to Write Fast Numerical Code Spring 2011 Lecture 21 Instructor: Markus Pschel TA: Georg

5.6 Convolution and FFT Fast Fourier Transform: Applications Applications. Optics,

Lecture 23: Fourier Transform, Convolution Theorem, and Linear Dynamical Systems April 28, 2016.

From Fourier to Koopman Spectral Methods for Long-term Time Series Prediction arXiv:2004.00574