Multilevel Summation Method for Calculating Electrostatic - PowerPoint PPT Presentation

Multilevel Summation Method for Calculating Electrostatic Interactions in NAMD David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign http://www.ks.uiuc.edu/~dhardy/ 15th Annual Workshop on Charm++ and its Applications April 17-19, 2017 NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Molecular Dynamics Integrate Newton’s equations of motion: d 2 r i ( t ) = �r i U ( ~ for billions of time steps! R ) m i dt 2 ~ Coulomb potential NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Motivation for multilevel summation method (MSM) • Need to accurately represent electrostatic interactions - long-range, requires fast method • Usually done using PME (particle-mesh Ewald) • PME has two shortcomings - requires periodic boundary conditions - poses bottleneck to parallel scalability • MSM overcomes both shortcomings! NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Best features of MSM • Supports periodic boundaries and also supports : - non-periodic boundaries (e.g. protein folding in water droplet) - semi-periodic boundaries (e.g. membrane channel) • Offers better parallel scaling through hierarchical structure (does not need FFT) • Arithmetic intensity and localized memory access well suited to modern hardware (CPU vector instructions and GPUs) • Produces smooth forces for stable dynamics • Extends to other pairwise interactions (e.g. dispersion) • Algorithm has linear time complexity NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Comparing MSM with PME PME MSM scattered across grid highly localized Memory Access (depicting FFT in 1D) (depicting convolution in 2D) Parallel many-to-many tree-like Communication (matrix transpose) (reduction and expansion) Bisection Bandwidth ⇣ ( N/P ) 2 / 3 ⌘ ⇣ N/P 2 / 3 ⌘ fixed O O width on 3D torus (Blue Waters)

MSM essential ideas • Splitting the interaction kernel • Interpolating the slowly varying kernels from grids • Nesting the approximation between levels Splitting Interpolating . . . . . . 2 h -grid + 1/ r Nesting = h -grid + atoms r a 2 a grid spacings h , 2h , 4h , ... cutoff distances a , 2a , 4a , ... NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Splitting the interaction kernel (i) | r 0 − r | � 1 = k 0 ( r , r 0 ) + k 1 ( r , r 0 ) + · · · + k L ( r , r 0 ) In one dimension, unparameterized, in terms of function : γ 1 ρ = γ 0 ( ρ ) + 1 2 γ 1 (1 2 ρ ) + · · · + 1 2 L γ L ( 1 2 L ρ ) γ 0 ( ρ ) = (1 / ρ ) − γ ( ρ ) , γ l ( ρ ) = 2 γ (2 ρ ) − γ ( ρ ) , l = 1 , 2 , . . . , L − 1 , γ L ( ρ ) = 2 γ (2 ρ ) 1 2 l a γ l ( r 2 l a ) parameterized by cutoff value a k l ( r , r 0 ) = NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Splitting the interaction kernel (ii) For interpolation with degree piecewise polynomials we p − 1 want splitting with continuity: C p − 1 ( τ p ( ρ 2 ) , for 0 ≤ ρ ≤ 1 , γ ( ρ ) = 1 / ρ , for ρ ≥ 1 s − 1 / 2 = 1 − 1 2( s − 1) + 3 8( s − 1) 2 − 5 16( s − 1) 3 + · · · = τ p ( s ) + O (( s − 1) p ) ⇣ d p Z 1 ⌘ 2 Optimal in the sense that it minimizes for γ ( ρ ) d ρ p γ ( ρ ) d ρ 0 NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Interpolating kernels on grids X X φ l m ( r ) k l ( r l m , r l n ) φ l I l k l ( r , r 0 ) = n ( r 0 ) , l = 1 , 2 , . . . , L m n where is interpolation operator and I ⇣ x − x l ⇣ y − y l ⇣ z − z l ⌘ ⌘ ⌘ φ l m m m m ( r ) = Φ Φ Φ 2 l − 1 h 2 l − 1 h 2 l − 1 h is piecewise polynomial of degree with stencil size Φ p − 1 p and is the finest grid spacing h Nesting the approximation between grid levels: ✓ �⌘◆ ⇣ � k ( r , r 0 ) ≈ ( r , r 0 ) k 0 + I 1 k 1 + I 2 k 2 + · · · I L � 1 ( k L � 1 + I L k L ) · · · NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

MSM computation exact interpolated = + force short-range long-range part part Computational Steps 4 h -grid long-range prolongation restriction 2 h -grid parts prolongation restriction h -grid interactions anterpolation interpolation short-range interactions positions forces charges NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

NAMD hybrid decomposition for short-range Kale, et al., J. Comp. Phys. 151 :283-312, 1999 • Decompose atoms spatially into patches • Decompose work into concurrent compute objects • Compute objects facilitate iterative, measurement-based load balancing NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

MSM Grid Interactions • Potential summed from grid point charges within cutoff • Uniform spacing enables distance-based interactions to be precomputed as stencil of “weights” • Weights at each level are identical up to scaling factor (!) • Calculate grid potential as 3D convolution of weights with charges X e l k l ( r l m , r l n ) q l m = l = 1 , 2 , . . . , L n , n Cutoff radius Sphere of grid point charges Accumulate potential NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

MSM decomposition for grid interactions Hybrid spatial-work decomposition, similar to short-range • Grids of charge and potential are decomposed into blocks • Interactions between blocks are separately scheduled as block computes • Need only charges to calculate potentials, send in one direction NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

MSM use of Charm++ • 3D chare arrays of grid blocks, one per level - Performs restriction and prolongation to 2h cover - Sends charges up and then to block computes - Receives partial potentials from above and also from block computes • 1D chare array of block computes • Associate an object with each NAMD patch to perform anterpolation and interpolation NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Some Charm++ coding paradigms part of an MSM block class MsmBlock { public : void add_charge_from_below(const Grid<float>& qh) { my_qh += qh; // qh is a subgrid of my_qh if (++cnt_recv_charge == max_recv_charge) { compute_restriction(); // calculate my_q2h_cover from my_qh send_charge_up(); // send my_q2h_cover send_charge_across(); // send my_qh } } }; Most compelling use I’ve ever seen for class MsmBlockChare : multiple inheritance in scientific computing! public MsmBlock, public CBase_MsmBlockChare { // communication wrapper for MsmBlock }; NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Static load balancing • Distribute grid blocks evenly among nodes • Assign block computes to sender or receiver node (trying to minimize inter-node communication) • Each node distributes the block compute objects evenly among cores NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Optimizing the critical path • Highest message priority assigned to restrictions going up the hierarchy, then block computes and prolongations going from the top down 4 h -grid long-range prolongation restriction 2 h -grid parts prolongation restriction h -grid interactions anterpolation interpolation short-range interactions positions forces charges NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

MSM scaling results Strong scaling ~92K-atom ApoA1 on Cray XE6 Blue Waters hardware Hardy, et al. , J. Chem. Theory Comput. 11 :766-779, 2015 NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Recent MSM advances • B-spline interpolation - improves accuracy by an order of magnitude for the same computational effort - caveat: more expensive to calculate stencils • CPU vectorization - improves single core performance - caveat: requires extensive data reorganization NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Clustering grid points Enables use of grid stencil matrix CPU vector instructions (AVX/FMA) * * * * * * * * += * * * * * * * * vector of potential Cluster into vector of charge 8-point cubes single precision Shows about 7x improvement over non-vector version NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

B-spline interpolation • Basis set for splines • Interpolation with p-1 degree splines gives pth order accuracy • Smallest possible local support of p • Continuity is C(p-2) • B-splines provide nested interpolation : a coarse level B-spline is exactly represented by finer level B-splines NIH Center for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/

Multilevel Summation Method for Calculating Electrostatic - PowerPoint PPT Presentation

Multilevel Summation Method for Calculating Electrostatic Interactions in NAMD David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

ACCURATE FLOATING-POINT SUMMATION IN CUB URI VERNER Summer intern OUTLINE Who needs accurate

Electrostatic Speakers Electrostatic Speakers Natalie Grace-Hansen Natalie Grace-Hansen John

Dipole Assisted Dipole Assisted Inertial Electrostatic Inertial Electrostatic Confinement

Income ome A Appr pproach Three methods to determine: Summation Method (build-up method)

Figure 1a: Multilevel Visualization of Market Power Cases in Tree Format Figure 1b: Multilevel

Agenda 1. More on multilevel R formulas 2. Generalized Multilevel Models 3. GMLM in R 1 More

Multilevel Krylov Methods Deflation Deflation, DD, MG Reinhard Nabben Multilevel Krylov

Calculating distributions Chung-chieh Shan Indiana University 2018-09-21 Calculating

Calculating Derivatives There are two types of formulas for calculating derivatives, which we may

Calculating Derivatives There are two types of formulas for calculating derivatives, which we may

Upwind Summation By Parts Methods for Large Scale Elastic Wave Equation ICERM, Brown University

Problem of random summation and its role in risk aggregation models Gregory Temnov School of

Calculating A k using Fulmers Method Rasheen Alexander, Katie Huston, Thomas Le, Camera Whicker

Multilevel Cooling Systems August Hale May 16, 2014 August Hale Multilevel Cooling Systems May

The Challenge of The Challenge of Multilevel Security Multilevel Security Rick Smith, Ph.D.,

Introduction to Multilevel Analysis Introduction to Multilevel Analysis PSYC 575 PSYC 575 Mark

Model reduction of wave propagation via phase-preconditioned rational Krylov subspaces Delft

Higher-Rank Fields, Currents, and Higher-Spin Holography arXiv:1312.6673 O.A.Gelfond, M.V. +

Model Checking Continuous-Time Markov Chains Joost-Pieter Katoen Software Modeling and

How I Built This: Putting Together Your Marketing Tech Stack Moderator: Kevin Peranio, Chief

Submodular Maximization in a Data Streaming Setting Amit Chakrabarti Dartmouth College Hanover,

Everything you wanted to know about VAMP but were afraid to ask Brooke Husic Stanford/FU Berlin

Emory University MBA Program Trip to Sydney, Australia & Auckland, New Zealand February -

Device Mainlining KS Lightning Talk October, 2015 Tim Bird Senior Software Engineer, Sony

Multilevel Summation Method for Calculating Electrostatic - PowerPoint PPT Presentation

Multilevel Summation Method for Calculating Electrostatic Interactions in NAMD David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

ACCURATE FLOATING-POINT SUMMATION IN CUB URI VERNER Summer intern OUTLINE Who needs accurate

Electrostatic Speakers Electrostatic Speakers Natalie Grace-Hansen Natalie Grace-Hansen John

Dipole Assisted Dipole Assisted Inertial Electrostatic Inertial Electrostatic Confinement

Income ome A Appr pproach Three methods to determine: Summation Method (build-up method)

Figure 1a: Multilevel Visualization of Market Power Cases in Tree Format Figure 1b: Multilevel

Agenda 1. More on multilevel R formulas 2. Generalized Multilevel Models 3. GMLM in R 1 More

Multilevel Krylov Methods Deflation Deflation, DD, MG Reinhard Nabben Multilevel Krylov

Calculating distributions Chung-chieh Shan Indiana University 2018-09-21 Calculating

Calculating Derivatives There are two types of formulas for calculating derivatives, which we may

Calculating Derivatives There are two types of formulas for calculating derivatives, which we may

Upwind Summation By Parts Methods for Large Scale Elastic Wave Equation ICERM, Brown University

Problem of random summation and its role in risk aggregation models Gregory Temnov School of

Calculating A k using Fulmers Method Rasheen Alexander, Katie Huston, Thomas Le, Camera Whicker

Multilevel Cooling Systems August Hale May 16, 2014 August Hale Multilevel Cooling Systems May

The Challenge of The Challenge of Multilevel Security Multilevel Security Rick Smith, Ph.D.,

Introduction to Multilevel Analysis Introduction to Multilevel Analysis PSYC 575 PSYC 575 Mark

Model reduction of wave propagation via phase-preconditioned rational Krylov subspaces Delft

Higher-Rank Fields, Currents, and Higher-Spin Holography arXiv:1312.6673 O.A.Gelfond, M.V. +

Model Checking Continuous-Time Markov Chains Joost-Pieter Katoen Software Modeling and

How I Built This: Putting Together Your Marketing Tech Stack Moderator: Kevin Peranio, Chief

Submodular Maximization in a Data Streaming Setting Amit Chakrabarti Dartmouth College Hanover,

Everything you wanted to know about VAMP but were afraid to ask Brooke Husic Stanford/FU Berlin

Emory University MBA Program Trip to Sydney, Australia &amp; Auckland, New Zealand February -

Device Mainlining KS Lightning Talk October, 2015 Tim Bird Senior Software Engineer, Sony

Emory University MBA Program Trip to Sydney, Australia & Auckland, New Zealand February -