Fast Dynamic Load Balancing for Extreme Scale Systems
Cameron W. Smith, Gerrett Diamond, M.S. Shephard Computation Research Center (SCOREC) Rensselaer Polytechnic Institute
Outline:
Fast Dynamic Load Balancing for Extreme Scale Systems Cameron W. - - PowerPoint PPT Presentation
Fast Dynamic Load Balancing for Extreme Scale Systems Cameron W. Smith, Gerrett Diamond, M.S. Shephard Computation Research Center (SCOREC) Rensselaer Polytechnic Institute Outline: n Some comments on our tools for parallel unstructured mesh
Outline:
PUMI Services:
2 layers of read only copies Geometric model Partition model Distributed mesh
mesh region mesh face mesh edge mesh vertex region region or face region, face or edge region, face, edge, or vertex GEOMETRIC DOMAIN ENTITIES MESH ADJACENCIES
Entity migration Communication links
Fully parallel operating on distributed meshes
A scalable workflow requires effective component coupling
5
SCOREC unstructured mesh technologies:
6
At scale found graph and geometric based methods either
Original partition improvement work focused on using mesh
Goal of current EnGPar developments is generalization
Multiple tools needed to maintain partition quality at scale
Partitioning 1.6B element mesh from 128K to
Partitioning 12.9B element mesh from 128K (< 7% imb)
Employ an N-graph in the development of EnGPar
The N-graph, when considering multiple criteria, is:
Order of migration controlled by graph distance calculations Two steps to determine “Distance from Center”
Edges at part boundaries operated on to drive migration:
This ordering results in removing disconnected components faster and
EnGPar based on more standard graph operations than ParMA
Continuing developments:
Applications using unstructured meshes exhibit
To achieve the best performance require:
N-graph construction includes
Tests run on billion element mesh
Resulting imbalances after running
Accounting for multiple entities
Results close to ParMA what was
Overset grids
Non-conforming adaptive FV grids
Unstructured mesh particle in cell for fusion
FV application (FUN3D)
N-Graph construction includes:
3.6 million element mesh
1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 128 256 512 1024
EnGPar vtximb EnGPar edgeimb ParMETIS vtximb ParMETIS edgeimb
CODES simulates running an MPI application
The main CODES units are logical
The N-graph construction includes:
Acknowledgements