OSPRI: An Optimized One-Sided Communication Runtime for Leadership-Class Machines
Jeff Hammond
Argonne Leadership Computing Facility
11 October 2012
Jeff Hammond PGAS12
OSPRI: An Optimized One-Sided Communication Runtime for - - PowerPoint PPT Presentation
OSPRI: An Optimized One-Sided Communication Runtime for Leadership-Class Machines Jeff Hammond Argonne Leadership Computing Facility 11 October 2012 Jeff Hammond PGAS12 Overview Motivating application: NWChem, which uses Global Arrays
Argonne Leadership Computing Facility
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
1 translate global indices to rank plus local indices 2 issue remote GetS operations to each rank 3 data arrives at initiator from each target rank 4 local buffer assembled
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
128 256 512 1024 2048 4096 8192 16384 32768 Number of Cores 10 100 600 Walltime [s] Ideal Scaling Unsorted Data Presorted Data
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12
Jeff Hammond PGAS12