- T. Hoefler : Optimizing Communication on Blue Waters
Optimizing Communication on Blue Waters Torsten Hoefler
PRAC Workshop, Oct. 19th 2010
Hottest Optimizations on Blue Waters
Serial optimizations (e.g., Vectorization)
mostly serial
conceptually simple
not clearly defined yet
Platform or System Model (Hardware, Middleware)
Application Model (Algorithm, Structure)
Performance Model
fundamental laws of physics
lattice field theories (QCD & Beyond Standard Model)
comparison with results from experimental programs in high energy & nuclear physics
MR testbed
pack overheads >10% pack time
Darray (HPF-like distributed arrays)
implementation (many unexplored possibilities!)
Fourier Transform and Conjugate Gradient using MPI Datatypes”
1. Type_struct for complex numbers
2. Type_contiguous for blocks
3. Type_vector for stride
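As a rough illustration of how these three constructors nest, the sketch below models MPI-style type maps in plain Python. It does not call MPI. The helper names (`type_struct`, `type_contiguous`, `type_vector`), the 8-byte double, and the block and stride sizes are all assumptions chosen for illustration, and alignment padding is ignored.

```python
# Minimal model of MPI derived-datatype type maps (byte offsets only).
# A "type" is an (offsets, extent) pair: the byte offsets of its basic
# elements, and the span it occupies when repeated back to back.
# Illustrative only: no MPI calls, no alignment padding.

DOUBLE = ([0], 8)  # assumed: one 8-byte double at offset 0


def type_struct(blocklengths, displacements, types):
    """Model of a struct type: blocks of (possibly different) types
    placed at explicit byte displacements."""
    offsets = []
    end = 0
    for count, disp, (t_offsets, t_extent) in zip(
            blocklengths, displacements, types):
        for i in range(count):
            offsets.extend(disp + i * t_extent + o for o in t_offsets)
        end = max(end, disp + count * t_extent)
    return (offsets, end)


def type_vector(count, blocklength, stride, oldtype):
    """Model of a vector type: `count` blocks of `blocklength` elements,
    with block starts `stride` elements apart (stride in units of oldtype)."""
    t_offsets, t_extent = oldtype
    offsets = []
    for b in range(count):
        for i in range(blocklength):
            base = (b * stride + i) * t_extent
            offsets.extend(base + o for o in t_offsets)
    extent = ((count - 1) * stride + blocklength) * t_extent
    return (offsets, extent)


def type_contiguous(count, oldtype):
    """Model of a contiguous type: `count` back-to-back copies of oldtype."""
    return type_vector(1, count, count, oldtype)


# 1. Struct for one complex number: real and imaginary doubles.
complex_t = type_struct([1, 1], [0, 8], [DOUBLE, DOUBLE])

# 2. Contiguous block of 2 complex numbers (block size is an assumption).
block_t = type_contiguous(2, complex_t)

# 3. Vector of 3 such blocks, each starting 4 blocks after the previous
#    one (count and stride are assumptions).
strided_t = type_vector(3, 1, 4, block_t)
```

In an actual MPI program the corresponding calls would be `MPI_Type_create_struct`, `MPI_Type_contiguous`, and `MPI_Type_vector`, each followed by `MPI_Type_commit` before use. The model only shows which bytes the nested type selects, which is what lets the library send strided data without a manual pack step.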
Reproducible peak at P=192
Scaling stops w/o datatypes
DDTs increase scalability
Source: B. Arimilli et al. “The PERCS High-Performance Interconnect”
connected
somewhat persistent!
Technique
up to 80%
computes mapping in ~0.8s for 1024 cores
need to improve
Datatype benchmarks: http://www.unixer.de/research/datatypes/
Jeongnim Kim, William Kramer, Marc Snir