Parameter Tuning of a Hybrid Treecode-FMM on GPUs
Rio Yokota, Lorena Barba Department of Mechanical Engineering, Boston University
Saturday, June 4, 2011
Parameter Tuning of a Hybrid Treecode-FMM on GPUs Rio Yokota, Lorena - - PowerPoint PPT Presentation
Parameter Tuning of a Hybrid Treecode-FMM on GPUs Rio Yokota, Lorena Barba Department of Mechanical Engineering, Boston University Saturday, June 4, 2011 Previous Calculations N=3x10 9 : 6 sec (Yokota & Barba) N=3x10 9 : 20 sec 40 TFlops
Saturday, June 4, 2011
Saturday, June 4, 2011
Saturday, June 4, 2011
* j
i
do i = 1,N ff = 0 do j = 1,N ff = ff+1/ ( x( i )-x( j ) ) end do f( i ) = ff end do do k = 1,p gg = 0 do j = 1,N gg = gg+( x( j ) - xs )**( k-1 ) end do g(k) = gg end do do i = 1,N ff = 0 do k = 1,p ff = ff+( x( i )-xs )**( -k )*g( k-1 ) end do end do
N
N
p−1
N
p−1
xi−x∗
gives
Saturday, June 4, 2011
Saturday, June 4, 2011
Saturday, June 4, 2011
Saturday, June 4, 2011
Saturday, June 4, 2011
Saturday, June 4, 2011
Saturday, June 4, 2011
1 2 4 8 16 32 64 128 256 512 50 100 150 200 250 300 350 400 Nprocs time x Nprocs [s] 1 2 4 8 16 32 64 128 256 512 50 100 150 200 250 300 350 400 Nprocs time x Nprocs [s]
tree construction mpisendp2p mpisendm2l P2Pkernel P2Mkernel M2Mkernel M2Lkernel L2Lkernel L2Pkernel
Saturday, June 4, 2011
! "# #$% #&!' & $ (& ($ #& #$ "& )*+,-./01/2.03-44-4/567849 :;+-/54-39 <03=>/-?=>*=@;0A BCC/-?=>*=@;0A C7D/30++*A;3=@;0A 678/30++*A;3=@;0A :.--/30A4@.*3@;0A
Saturday, June 4, 2011
and FMMs
performance must be compared to other treecodes and FMMs
Saturday, June 4, 2011