SLIDE 1 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Space-Time Methods for PDEs, RICAM Linz, November 10, 2016
aˇ s, M. Merta, J. Zapletal, and A. Veit Vˇ SB–Technical University of Ostrava, Czech Rep. University of Chicago industry email: dalibor.lukas@vsb.cz
SLIDE 2 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Outline
- Parallel fast BEM and applications
- Boundary integral formulation of sound-hard scattering
- Time-domain boundary element method
- Parallelization, preconditioning, numerical experiments
- Conclusion, outlook, references
SLIDE 3 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Outline
- Parallel fast BEM and applications
- Boundary integral formulation of sound-hard scattering
- Time-domain boundary element method
- Parallelization, preconditioning, numerical experiments
- Conclusion, outlook, references
SLIDE 4 Parallel fast BEM and applications
Laplace equation in an unbounded domain Ω ⊂ R3 lipschitz domain −△u( x) = 0,
γN u(x) := du
dn(x) = g(x),
x ∈ Γ := ∂Ω |u( x)| = O(1/| x|), | x| → ∞. Representation formula ∀ x ∈ Ω : u( x) = −
γNu(y) G( x, y) dS(y)
x|)
+
u(y) γN,yG( x, y) dS(y)
x|)
, where G( x, y) :=
1 4π| x−y|.
An indirect method Find an auxiliary double-layer density φ : Γ → R such that γN
φ(y) γN,yG(x, y) dS(y) = g(x), x ∈ Γ u( x) =
φ(y) γN,yG( x, y) dS(y), x ∈ Ωe.
SLIDE 5 Parallel fast BEM and applications
Shape optimization of a DC electromagnet, FEM-BEM coupling
Ωi: ferromagnetic yoke Ωi Ωe: air Ωe: air Ωe
J
coil Ωm sample Ωo: focusing optics Γ: boundary
L., Postava, ˇ Zivotsk´ y: J Magn Magn Mater ’10, Math Comput Simulat ’12
SLIDE 6
Parallel fast BEM and applications
Acoustics of a railway wheel H-matrices, ACA/FMM
SLIDE 7
Parallel fast BEM and applications
Parallel fast BEM using cyclic graph decompositions
1 2 3 4 5 6 7 8 9 10 11 12 rotate G0 G1 1 1 2 2 3 3 4 4 5 5 6 6 7 7 8 8 9 9 10 10 11 11 12 12
L., Kov´ aˇ r, Kov´ aˇ rov´ a, Merta: Numer Alg ’15 Solution to the system of 2.7M DOFs on 273 cores in 16 minutes.
SLIDE 8 Parallel fast BEM and applications
- Elmg. forming of plates with Fraunhofer IWU Chemnitz, FEM-BEM
0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05 −0.015 −0.01 −0.005 0.005 0.01 0.015 0.02 r [m] z [m] Time 3e−05 s, Thalf 6e−05 s, Jmax 9.48183e+06 A/m2, Hmax 61.1986 A/m, Fmax 720.694 N/m3.
SLIDE 9 Parallel fast BEM and applications
Structural health monitoring of aircrafts with Honeywell Int. using 3d anisotropic mixed elements (TD-NNS) by [Pechstein (Sinwel), Sch¨
piezo-actuator crack piezo-sensor
SLIDE 10 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Outline
- Parallel fast BEM and applications
- Boundary integral formulation of sound-hard scattering
- Time-domain boundary element method
- Parallelization, preconditioning, numerical experiments
- Conclusion, outlook, references
SLIDE 11
Boundary integral formulation of sound-hard scattering
Sound-hard scattering Given the scatterer Ω and the causal incident wave uinc (satisfying ¨ u − △u = 0), we look for the scattered field u: uinc u Γ Ω ¨ u − △u = 0 in Ωe × [0, T], u(., 0) = ˙ u(., 0) = 0 in Ωe, ∂u ∂n = −∂uinc ∂n on Γ × [0, T].
SLIDE 12 Boundary integral formulation of sound-hard scattering
Double-layer indirect boundary integral method We search for u in the form of the retarded double-layer potential u(x, t) = − 1 4π
n(y) · (x − y) x − y
x − y2 + ˙ φ(y, t − x − y) x − y
which satisfies the wave equation and the initial conditions. It remains to fulfill the Neumann boundary condition lim
Ω∋ x→x∈Γ n(x) · ∇ xu(
x, t)
= g(x, t) on Γ × [0, T], where g := −∂uinc
∂n .
SLIDE 13 Boundary integral formulation of sound-hard scattering
Weak boundary integral formulation [Bamberger, HaDuong ’86] Find φ ∈ V such that a(ξ, φ) = b(ξ) ∀ξ ∈ V, where a(ξ, φ) := T
4πx − y ˙ ξ(x, t) ¨ φ(y, t − x − y) + curlΓ ˙ ξ(x, t) · curlΓφ(y, t − x − y) 4πx − y
b(ξ) := T
g(x, t) ˙ ξ(x, t) dS(x) dt.
SLIDE 14 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Outline
- Parallel fast BEM and applications
- Boundary integral formulation of sound-hard scattering
- Time-domain boundary element method
- Parallelization, preconditioning, numerical experiments
- Conclusion, outlook, references
SLIDE 15 Time-domain boundary element method
Discrete ansatz Replace V by a finite-dimensional subspace V h,∆t spanned by the tensor-product of N temporal and M spatial basis functions: φh,∆t(x, t) :=
N
M
αj
l ϕj(x) bl(t).
We arrive at the (N M) × (N M) block linear system A1,1 . . . A1,N . . . ... . . . AN,1 . . . AN,N α1 . . . αN = b1 . . . bN , where (Ak,l)i,j := a (ϕi(x) bk(t), ϕj(y) bl(t)) , (bk)i := b (ϕi(x) bk(t)) , (αl)j := αj
l .
SLIDE 16 Time-domain boundary element method
Matrix: a deeper look (Ak,l)i,j =
n(x) · n(y) 4πx − y ϕi(x) ϕj(y)
=:Ψk,l(x−y)
˙ bk(t)¨ bl(t − x − y) dt dS(y) dS(x) +
curlΓϕi(x) · curlΓϕj(y) 4πx − y T ˙ bk(t) bl(t − x − y) dt
Ψk,l(x−y)
dS(y) dS(x), Piecewise smooth time-ansatz expensive quadrature due to nontrivial intersection of the light cone supp Ψk,l, supp Ψk,l with supp ϕi × supp ϕj. [El Gharib ’99], [Stephan, Maischak, Ostermann ’08]
SLIDE 17 Time-domain boundary element method
C∞-smooth (partition of unity) temporal basis [Sauter, Veit ’12, ’14]
0.5 1.5 2.5 0.2 0.4 0.6 0.8
t [s]
b0 b1 b2 b3
- allows for Sauter-Schwab quadrature
- ver
supp ϕi × supp ϕj
- and for higher-order approximation in
time. (Ak,l)i,j =
n(x) · n(y) 4πx − y ϕi(x) ϕj(y)
=:Ψk,l(x−y)∈C∞(R)
˙ bk(t)¨ bl(t − x − y) dt dS(y) dS(x) +
curlΓϕi(x) · curlΓϕj(y) 4πx − y T ˙ bk(t) bl(t − x − y) dt
Ψk,l(x−y)∈C∞(R)
dS(y) dS(x), To accelerate the assembly, Ψ and Ψ are replaced by piecewise Chebyshev interpolants.
SLIDE 18 Time-domain boundary element method
Convergence of φh,∆t(x, .) − φanalytical(x, .)L2(0,T) on the sphere [Veit ’12] Ω the unit sphere, φanalytical a spherical harmonic function, x ∈ Γ 1st-order time-basis functions 2nd-order time-basis functions
10
1
10
2
10
−3
10
−2
10
−1
1/N
Number of timesteps Error
5 10 20 40 10
−5
10
−4
10
−3
1/N 2
Number of timesteps Error
SLIDE 19
Time-domain boundary element method
Matrix structure The matrix is sparse and it has a block-Hessenberg structure.
SLIDE 20 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Outline
- Parallel fast BEM and applications
- Boundary integral formulation of sound-hard scattering
- Time-domain boundary element method
- Parallelization, preconditioning, numerical experiments
- Conclusion, outlook, references
SLIDE 21 Parallelization, preconditioning, numerical experiments
Parallel implementation
- For equidistant time stepping only the blue
parts have to be assembled.
- We employ a hybrid MPI-OpenMP model:
– pairs
triangles corresponding to nonzero entries are evenly distributed to MPI nodes, – on each node the assembly (quadrature) is performed using OpenMP.
- We use up to 64 Intel Xeon E5 nodes (1024
cores) of the cluster Anselm, Vˇ SB-TU Os- trava.
SLIDE 22 Parallelization, preconditioning, numerical experiments
Parallel implementation We distribute blocks among MPI processes (nodes) as follows: Each process does the following:
- 1. Precompute the sparsity pattern of the block.
- 2. Distribute pairs of elements among computational nodes using MPI.
- 3. On each computational node assemble its contribution to the block in a shared
memory using OpenMP.
- 4. Gather the data on the MPI rank(s) owning the block.
SLIDE 23
Parallelization, preconditioning, numerical experiments
Weak parallel scalability of the assembly Submarine surface decomposed into 5604 triangles, 40 time steps, 80 time DOFs
SLIDE 24 Parallelization, preconditioning, numerical experiments
Preconditioning To accelerate FGMRES iterations we approxi- mate the upper Hessenberg matrix by an alge- braic multilevel preconditioner: A := AI,I AI,II AII,I AII,II
AI,I AII,I AII,II
I,I and A−1 II,II are approximated by a
small fixed number of FGMRES iterations even- tually preconditioned the same again.
SLIDE 25
Parallelization, preconditioning, numerical experiments
Numerical experiments, ball, 162 nodes, 1st-order in time
SLIDE 26
Parallelization, preconditioning, numerical experiments
Numerical experiments, ball, 162 nodes, 2nd-order in time
SLIDE 27
Parallelization, preconditioning, numerical experiments
Numerical experiments, submarine 2804 nodes, 40 time steps, 1st-order in time. GMRES(500): 9636 iters., 1607 s FGMRES(500,1(40)): 243 iters., 962 s
SLIDE 28 Parallelization, preconditioning, numerical experiments
- M. Merta, J. Zapletal et al.: BEM4I library
SLIDE 29 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Outline
- Parallel fast BEM and applications
- Boundary integral formulation of sound-hard scattering
- Time-domain boundary element method
- Parallelization, preconditioning, numerical experiments
- Conclusion, outlook, references
SLIDE 30 Parallel Time-Domain Boundary Element Method for 3-Dimensional Wave Equation
Conclusion, outlook Time-domain BEM for 3d wave equation, adaptivity in time, parallely scalable assembly and postprocessing, → mapping properties of the operator, preconditioning, → adaptivity in time, → extension to the elastic wave equation. References
- Analysis: Bamberger, Ha Duong, Math. Meth. Appl. Sci. ’86
- Numerics: Sauter, Veit, Numer. Math. ’14
- Parallelization: Veit, Merta, Zapletal, L., Int. J. Numer. Meth. Eng. ’16
- Parallel fast BEM: L., Kov´
aˇ r, Kov´ aˇ rov´ a, Merta, Numer. Alg. ’15