Experiences and Challenges Scaling PFLOTRAN, a PETSc-based Code For - PowerPoint PPT Presentation

Experiences and Challenges Scaling PFLOTRAN, a PETSc-based Code For Implicit Solution of Subsurface Reactive Flow Problems, Towards the Petascale on Cray XT systems Richard Tran Mills Computational Earth Sciences Group Computer Science and Mathematics Division Oak Ridge National Laboratory + Lots of other people Managed by UT-Battelle for the Department of Energy

Introduction Funded by SciDAC-II project, “Modeling Multiscale-Multiphase-Multicomponent • Subsurface Reactive Flows using Advanced Computing”, involving subsurface scientists, applied mathematicians, and computer scientists at several institutions: – LANL: Peter Lichtner (PI), Chuan Lu, Bobby Philip, David Moulton – ORNL: Richard Mills – ANL: Barry Smith – PNNL: Glenn Hammond – U. Illinois: Al Valocchi Also collaborating with SciDAC PERI: • – NC State: G. (Kumar) Mahinthakumar, Vamsi Sripathi – ORNL: Pat Worley Project goals: • – Develop a next-generation code (PFLOTRAN) for simulation of multiscale, multiphase, multicomponent flow and reactive transport in porous media. – Apply it to field-scale studies of (among others) • Geologic CO2 sequestration, • Radionuclide migration at Hanford site, Nevada Test Site 2 Managed by UT-Battelle for the Department of Energy Presentation_name

Motivating example -- Hanford 300 Area Mo At the 300 area, U(VI) plumes continue to exceed drinking standards. • Calculations predicted cleanup by natural attenuation years ago! • Due to long in-ground residence times, U(VI) is present in complex, microscopic • inter-grain fractures, secondary grain coatings, and micro-porous aggregates. (Zachara et al., 2005). Models assuming constant K d (ratio of sorbed mass to mass in solution) do not • account for slow release of U(VI) from sediment grain interiors through mineral dissolution and diffusion along tortuous pathways. In fact, the K d approach implies behavior entirely contrary to observations! • We must accurately incorporate millimeter scale effects over a domain • measuring approximately 2000 x 1200 x 50 meters! 3 Managed by UT-Battelle for the Department of Energy Presentation_name

Fundamental challenge: • Need to capture millimeter-scale (or smaller) processes within kilometer scale domains! (Similar variations in time scales.) • Discretizing 2km x 1 km x 500 m domain onto cubic millimeter grid means 10^18 computational nodes! • Address the problem via – Massively parallel computing • Continuing development of PFLOTRAN code – Multi-continuum (“sub-grid”) models • Multiplies total degrees of freedom in primary continuum by number of nodes in sub-continuum – Adaptive mesh refinement • Allows front tracking • Introduce multi-continuum models only where needed 4 Managed by UT-Battelle for the Department of Energy Presentation_name

Outline • Subsurface flow and reactive transport • Numerical discretization of governing eqns. • Parallel implementation (solvers, code arch.) • Computation phase performance • I/O performance • Future directions 5 Managed by UT-Battelle for the Department of Energy Presentation_name

Porous media flow • Continuity equation (mass conservation) • Darcy’s law in place of momentum eqn. 6 Managed by UT-Battelle for the Department of Energy Presentation_name

Porous media flow • Continuity equation (mass conservation) • Darcy’s law in place of momentum eqn. q = Q A = � K � h 7 Managed by UT-Battelle for the Department of Energy Presentation_name

PFLOTRAN governing equations Mass Conservation: Flow Equations � � ) + �� q � � � X i � � � s � D i [ ] = Q i � � � � X i � � � t ( � s � � � X i q � = � kk � � ( p � � W � � � gz ) p � = p � � p c , �� µ � Energy Conservation Equation � [ ] + �� [ ] = Q e s � � � U � + (1 � � ) � r c r T q � � � H � � � � T � t � � � � � Multicomponent Reactive Transport Equations � [ ] + �� [ ] = � v jm I m � s � � + Q j � t � � � � � � j � � m Total Concentration Total Solute Flux � = � � l C j � + � = ( � �� s � D � � + q � ) � j � � v ji C i � j � j � i Mineral Mass Transfer Equation �� m � t = V m I m � + = 1 � m � m 8 Managed by UT-Battelle for the Department of Energy Presentation_name

PFLOTRAN governing equations Mass Conservation: Flow Equations � � ) + �� q � � � X i � � � s � D i [ ] = Q i � � � � X i � � � t ( � s � � � X i q � = � kk � � ( p � � W � � � gz ) p � = p � � p c , �� µ � Darcy’s law Energy Conservation Equation (homogenized momentum eq.) � [ ] + �� [ ] = Q e s � � � U � + (1 � � ) � r c r T q � � � H � � � � T � t � � � � � Multicomponent Reactive Transport Equations � [ ] + �� [ ] = � v jm I m � s � � + Q j � t � � � � � � j � � m Total Concentration Total Solute Flux � = � � l C j � + � = ( � �� s � D � � + q � ) � j � � v ji C i � j � j � i Mineral Mass Transfer Equation �� m � t = V m I m � + = 1 � m � m 9 Managed by UT-Battelle for the Department of Energy Presentation_name

PFLOTRAN governing equations Mass Conservation: Flow Equations � � ) + �� q � � � X i � � � s � D i [ ] = Q i � � � � X i � � � t ( � s � � � X i q � = � kk � � ( p � � W � � � gz ) p � = p � � p c , �� µ � Relative permeability depends on saturation -- introduces nonlinearity Energy Conservation Equation � [ ] + �� [ ] = Q e s � � � U � + (1 � � ) � r c r T q � � � H � � � � T � t � � � � � Multicomponent Reactive Transport Equations � [ ] + �� [ ] = � v jm I m � s � � + Q j � t � � � � � � j � � m Total Concentration Total Solute Flux � = � � l C j � + � = ( � �� s � D � � + q � ) � j � � v ji C i � j � j � i Nonlinear function of the concentrations Mineral Mass Transfer Equation of primary chemical components �� m � t = V m I m � + = 1 � m � m 10 Managed by UT-Battelle for the Department of Energy Presentation_name

Integrated finite-volume discretization � A Form of governing F = qpX �� Dp � X � t + �� F = S equation: Integrated finite-volume discretization ) V n k + 1 � A n ( k R n = A n F n � A n � n � S n V n � t + � n Discretized residual equation: n � X n � X n ' F nn ' = ( q � ) nn ' X nn ' � ( � D � ) nn ' d n + d n ' i i = � R n i � x � i + 1 = � R n i J n � J n � (Inexact) Newton iteration: � n n n i � x � n � n 11 Managed by UT-Battelle for the Department of Energy Presentation_name

IFV solver time-step • At each time step: ) V n k + 1 � A n ( k R n = A n F n � A n � n � S n V n – Calculate residual � t + � n n � • For each node, calculate accumulation term • For each connection, calculate flux term X n � X n ' F nn ' = ( q � ) nn ' X nn ' � ( � D � ) nn ' d n + d n ' • Calculate source-sink term for appropriate connections i i = � R n – Calculate Jacobian J n � n i � x � • Via finite differences n • …or analytically (analogous to residual calculation) – Solve linear system i � x � i + 1 = � R n i J n � � n n n � 12 Managed by UT-Battelle for the Department of Energy Presentation_name

Outline • Subsurface flow and reactive transport • Numerical discretization of governing eqns. • Parallel implementation (solvers, code arch.) • Computation phase performance • I/O performance • Future directions 13 Managed by UT-Battelle for the Department of Energy Presentation_name

Domain-decomposition parallelism • PFLOTRAN parallelism comes from domain decomposition. • Each processor responsible for nodes in one subdomain. (Plus associated vector, matrix entries) • Accumulation terms are easy: Each processor calculates terms for the nodes it owns. • Flux terms are not! – Must calculate fluxes across subdomain boundaries. – Scatter/gather of ghost nodes required ( halo exchanges ) 14 Managed by UT-Battelle for the Department of Energy Presentation_name

Solving the linear systems • Linear system solve often accounts for > 90% of time. • Linear systems are large (N=total degrees of freedom) and sparse (very few nonzeros) • Do not want to do Gaussian elimination (LU factorization): – Fill-in can eliminate sparsity (unmanageable memory cost) – Extremely difficult to parallelize! • In general, cannot do row k+1 w/o first doing row k � x x x x � � x x x x � � � � � x x x x x x • • � � � � x x x � � � x x x • • � LU factorization � � � � x x x x • • � � � � x x x x � � x • • • x x x � � � � � � x x x x x x • � � � � x x � � x x • � � � � � � x x x � � x x x • • • • • � � 15 Managed by UT-Battelle for the Department of Energy Presentation_name

Experiences and Challenges Scaling PFLOTRAN, a PETSc-based Code For - PowerPoint PPT Presentation

Experiences and Challenges Scaling PFLOTRAN, a PETSc-based Code For Implicit Solution of Subsurface Reactive Flow Problems, Towards the Petascale on Cray XT systems Richard Tran Mills Computational Earth Sciences Group Computer Science and

Algebraic multigrid in PETSc Mark Adams Lawrence Berkeley National Laboratory PETSc user

Nonlinear Preconditioning in PETSc Matthew Knepley PETSc Team Computation Institute

To thread or not to thread? Why PETSc favors MPI-only Plenary Discussion PETSc User Meeting 2016

Fluid Interface Detection with PETSc and DONLP2 PETSc User Meeting Vienna 2016 Poster Session

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software

Outline Scaling Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large

Progress with PETSc on Manycore and GPU-based Systems on the Path to Exascale Richard Tran Mills

Understanding and Tuning Performance in PETSc (on emerging manycore, GPGPU, and traditional

Design, Implementation and Applications of PETSc-MUMPS Inteface Hong Zhang Computer Science,

The P ortable E xtensible T oolkit for S cientific C omputing Toby Isaac (building on slides from

A massivelly parallel multigrid solver using PETSc for unstructured meshes on Tier0

Analysis of Scaling Algorithms for Matrix & Operator Scaling Contents Scaling Algorithms

Effectively Scaling Effectively Scaling up/universalizing exclusive up/universalizing exclusive

Scaling From simple models to rich strategies PPPLab Day, November 30th Scaling: recent

Outline Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large Principles of

Using AmgX to Accelerate PETSc- Based CFD Codes Pi-Yueh Chuang pychuang@gwu.edu George

www.Novotechsoftware.com Soil void ratio Soil density sity Soil permeab meabil ility ity

Fast and Wild: Bootstrap Inference in Stata Using boottest David Roodman James G. MacKinnon Open

HAND Diagnostic Issues HAND Diagnostic Issues Addressing Mental Addressing Mental Health Health

Roger H. Peters, PhD Travis Parker, MS, LIMHP, CPC December 4, 2019 Hosted by SAMHSAs GAINS

Gradual Program Verification (with Implicit Dynamic Frames) Johannes Bader , Karlsruhe Institute of

Cross ring data move 1. Segmentation based protection breaks 2. Kernel level actual data move

Many stories, two basic plots. common Fun with Gaussian distributions: average A few billion

Structural Health Monitoring Mechanical and impact damage

Experiences and Challenges Scaling PFLOTRAN, a PETSc-based Code For - PowerPoint PPT Presentation

Experiences and Challenges Scaling PFLOTRAN, a PETSc-based Code For Implicit Solution of Subsurface Reactive Flow Problems, Towards the Petascale on Cray XT systems Richard Tran Mills Computational Earth Sciences Group Computer Science and

Algebraic multigrid in PETSc Mark Adams Lawrence Berkeley National Laboratory PETSc user

Nonlinear Preconditioning in PETSc Matthew Knepley PETSc Team Computation Institute

To thread or not to thread? Why PETSc favors MPI-only Plenary Discussion PETSc User Meeting 2016

Fluid Interface Detection with PETSc and DONLP2 PETSc User Meeting Vienna 2016 Poster Session

UP UP AND OUT: SCALING SOFTWARE WITH AKKA Jonas Bonr CTO Typesafe @jboner Scaling software

Outline Scaling Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large

Progress with PETSc on Manycore and GPU-based Systems on the Path to Exascale Richard Tran Mills

Understanding and Tuning Performance in PETSc (on emerging manycore, GPGPU, and traditional

Design, Implementation and Applications of PETSc-MUMPS Inteface Hong Zhang Computer Science,

The P ortable E xtensible T oolkit for S cientific C omputing Toby Isaac (building on slides from

A massivelly parallel multigrid solver using PETSc for unstructured meshes on Tier0

Analysis of Scaling Algorithms for Matrix &amp; Operator Scaling Contents Scaling Algorithms

Effectively Scaling Effectively Scaling up/universalizing exclusive up/universalizing exclusive

Scaling From simple models to rich strategies PPPLab Day, November 30th Scaling: recent

Outline Scalinga Plenitude of Power Laws Scaling-at-large Scaling-at-large Principles of

Using AmgX to Accelerate PETSc- Based CFD Codes Pi-Yueh Chuang pychuang@gwu.edu George

www.Novotechsoftware.com Soil void ratio Soil density sity Soil permeab meabil ility ity

Fast and Wild: Bootstrap Inference in Stata Using boottest David Roodman James G. MacKinnon Open

HAND Diagnostic Issues HAND Diagnostic Issues Addressing Mental Addressing Mental Health Health

Roger H. Peters, PhD Travis Parker, MS, LIMHP, CPC December 4, 2019 Hosted by SAMHSAs GAINS

Gradual Program Verification (with Implicit Dynamic Frames) Johannes Bader , Karlsruhe Institute of

Cross ring data move 1. Segmentation based protection breaks 2. Kernel level actual data move

Many stories, two basic plots. common Fun with Gaussian distributions: average A few billion

Structural Health Monitoring Mechanical and impact damage

Analysis of Scaling Algorithms for Matrix & Operator Scaling Contents Scaling Algorithms