Optimization of the Poisson Operator Optimization of the Poisson - PowerPoint PPT Presentation

Optimization of the Poisson Operator Optimization of the Poisson Operator in Chombo in Chombo Razvan Carbunescu, Meriem Ben Salah Razvan Carbunescu, Meriem Ben Salah and and Andrew Gearhart Andrew Gearhart Research supported by Microsoft (Award #024263) and Intel (Award #024894) funding and by matching funding by U.C. Discovery (Award #DIG07-10227)

Administrivia : : The emu is watching you The emu is watching you Administrivia

Outline Outline - Introduction Introduction - - Why the Poisson operator? - Why the Poisson operator? - What is Chombo? - What is Chombo? - Theoretical Background Theoretical Background - - Targeted Architectures Targeted Architectures - - Implementation - Implementation - Challenges Challenges -

Outline Outline - Results Results - - Serial Implementation - Serial Implementation - pthreads - pthreads - OpenMP - OpenMP - GPU (GTX280) - GPU (GTX280) - Future Work - Future Work

Introduction : Why the Poisson Operator? : Why the Poisson Operator? Introduction - The Poisson operator is a - The Poisson operator is a key component of the Poisson key component of the Poisson equation equation - A Poisson solution is the - A Poisson solution is the Figure: Flow Evolution startup after Poisson solve first step toward solving first step toward solving incompressible Navier-Stokes equations for fluid flow incompressible Navier-Stokes equations for fluid flow - Parlab Health Application - Parlab Health Application - Modeling blood flow through cerebral - Modeling blood flow through cerebral arteries arteries Figure: Particle-In-Cell simulation from LBL using Chombo

Introduction : What is Chombo? : What is Chombo? Introduction - Developed and distributed by the Applied - Developed and distributed by the Applied Numerical Algorithms Group of Lawrence Numerical Algorithms Group of Lawrence Berkeley National Lab Berkeley National Lab - a framework to implement finite difference - a framework to implement finite difference methods for the solution of PDEs on block methods for the solution of PDEs on block structured, adaptively refined grids. structured, adaptively refined grids. - Chombo provides elliptic and time-dependent modules, as well - Chombo provides elliptic and time-dependent modules, as well as support for standardized self-describing file formats. as support for standardized self-describing file formats. - Chombo is architecture and operating system independent. - Chombo is architecture and operating system independent.

Introduction : What is Chombo? : What is Chombo? Introduction - For parallel platforms, Chombo provides - For parallel platforms, Chombo provides a distributed memory implementation a distributed memory implementation using the Message Passing Interface (MPI) using the Message Passing Interface (MPI) library library - This begs the question: - This begs the question:

Introduction : What is Chombo? : What is Chombo? Introduction - For parallel platforms, Chombo provides - For parallel platforms, Chombo provides a distributed memory implementation a distributed memory implementation using the Message Passing Interface (MPI) using the Message Passing Interface (MPI) library library - This begs the question: - This begs the question: Is a distributed memory implementation Is a distributed memory implementation always the most efficient? always the most efficient?

Introduction : What is Chombo? : What is Chombo? Introduction - For parallel platforms, Chombo provides - For parallel platforms, Chombo provides a distributed memory implementation a distributed memory implementation using the Message Passing Interface (MPI) using the Message Passing Interface (MPI) library library - This begs the question: - This begs the question: Is a distributed memory implementation Is a distributed memory implementation always the most efficient? always the most efficient? - One of the major components of Chombo is a collection of - One of the major components of Chombo is a collection of Multigrid (MG) solvers for discretized elliptic problems, including Multigrid (MG) solvers for discretized elliptic problems, including Poisson's Equation Poisson's Equation - This portion of the Chombo suite has been modified to explore the - This portion of the Chombo suite has been modified to explore the above question above question

Introduction : Strategy and Goals : Strategy and Goals Introduction - Determine whether we can improve Chombo performance - Determine whether we can improve Chombo performance through the use of locality and faster access time to on-chip through the use of locality and faster access time to on-chip cores cores - Via different models of parallel execution and specialized - Via different models of parallel execution and specialized hardware (ie. the GPU) hardware (ie. the GPU) - Identify critical “crossover” points where one algorithm becomes - Identify critical “crossover” points where one algorithm becomes more efficient (if they exist) more efficient (if they exist) - Hopefully foster the creation of heterogeneous systems that - Hopefully foster the creation of heterogeneous systems that automatically adapt execution to utilize distributed/shared automatically adapt execution to utilize distributed/shared memory or the GPU to enhance performance memory or the GPU to enhance performance

Theoretical Background Theoretical Background - The Poisson operator, aka the Laplacian, is a second order - The Poisson operator, aka the Laplacian, is a second order elliptic differential operator and defined in an n-dimensional elliptic differential operator and defined in an n-dimensional Cartesian space by: Cartesian space by: - The Poisson operator appears in the definition of the Helmholtz - The Poisson operator appears in the definition of the Helmholtz differential equation: differential equation: - The Helmholtz differential equation reduces to the - The Helmholtz differential equation reduces to the Poisson equation: Poisson equation:

Theoretical Background Theoretical Background - The definition of appropriate boundary conditions, Dirichlet or - The definition of appropriate boundary conditions, Dirichlet or Neumann, allows for the solution of the Poisson problem Neumann, allows for the solution of the Poisson problem - A numerical solution requires the discretization of the continuous - A numerical solution requires the discretization of the continuous Poisson’s equation, e.g. by the standard centered-difference Poisson’s equation, e.g. by the standard centered-difference approximation, as well as a discrete handling of the boundary approximation, as well as a discrete handling of the boundary conditions conditions - The discrete Poisson operator, the focus of this project, is given - The discrete Poisson operator, the focus of this project, is given by the following stencil: by the following stencil:

Targeted Architectures Targeted Architectures Existing Implementations: Existing Implementations: - Chombo’s implementation is currently tuned for distributed - Chombo’s implementation is currently tuned for distributed memory with the domain being decomposed into small bins (32^3 memory with the domain being decomposed into small bins (32^3 elements for 3D) and individual bin computations are allocated elements for 3D) and individual bin computations are allocated for execution via serial f77 codes for execution via serial f77 codes - Because of the small size of - Because of the small size of bins there is a small amount bins there is a small amount computational intensity to computational intensity to use a threaded shared use a threaded shared memory implementation or memory implementation or to hide the cost of GPU to hide the cost of GPU memory transfers memory transfers

Targeted Architectures Targeted Architectures - Our interest: - Our interest: - Could Chombo be optimized for different models of parallel - Could Chombo be optimized for different models of parallel computation if the bin sizes were increased? computation if the bin sizes were increased? - We would like to implement shared memory computation - We would like to implement shared memory computation models utilizing: models utilizing: - OpenMP - OpenMP - Pthreads - Pthreads - lightweight threads may - lightweight threads may perform well with small perform well with small bins bins

Optimization of the Poisson Operator Optimization of the Poisson - PowerPoint PPT Presentation

Optimization of the Poisson Operator Optimization of the Poisson Operator in Chombo in Chombo Razvan Carbunescu, Meriem Ben Salah Razvan Carbunescu, Meriem Ben Salah and and Andrew Gearhart Andrew Gearhart Research supported by Microsoft

Poisson Distribution: Review Poisson Over Time Let B 1 Poisson( ) be the number of bikes

Randomness in Computing L ECTURE 14 Last time Poisson distribution Poisson approximation

15. Poisson Processes In Lecture 4, we introduced Poisson arrivals as the limiting behavior of

Simulating events: the Poisson process Michel Bierlaire michel.bierlaire@epfl.ch Transport and

Poisson Regression Models for Count Data Outline Review Introduction to Poisson

X-RAY SPECTRAL WORKSHOP 2019 POISSON STATISTICS WITH BACKGROUNDS Background measurement Image

Workshop 10.6a: Poisson regression Murray Logan 12 Sep 2016 Section 1 Poisson regression

Shuffle algebra perspective on operator valued probability theory 30 mars 2020 1/25 Operator

Poisson Approximation for Two Scan Statistics with Rates of Convergence Xiao Fang (Joint work

Poisson homology, D-modules on Poisson varieties, and complex singularities Pavel Etingof (MIT)

Punt: T , T , T are mutually independent . How about T , the number of

The Poisson Arrival Process CS 70, Summer 2019 Bonus Lecture, 8/14/19 1 / 22 Poisson

Poisson Point Processes Will Perkins April 23, 2013 The Poisson Process Say you run a website

PPMLHDFE: Fast Poisson Estimation with High Dimensional Fixed Effects Paulo Guimares 2020

Poisson cohomology of multidimensional Dubrovin-Novikov Poisson structures and their normal

Detecting changes in Detecting changes in the rate the rate of a of a Poisson process

MDS 3.0 - Section Q Participation in Assessment and Goal Setting Do You Want To Speak To

Occupancy Grid based Urban Localization Using Weighted Point Cloud Lindong Guo, Ming Yang, Bing

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds Francis Engelmann*

Patent Infringement: Proving Royalty Damages Amid Increased Court Scrutiny Use of Licenses, the

Application of Green's Function to Application of Green's Function to Analysis of Grounding

Modellingof nonwoven materials Stepan V. Lomov, Amit Rawal Department Metallurgy and Materials

Error Estimation and Parameter Dependence of the Calculation of the Fast Ion Distribution

Source: U. S. Department of Energy, Energy Information Agency, Crude Oil Proved Reserves,