SLIDE 42 A Spectrum of Tool Challenges
- Research
— develop new techniques for measurement, analysis, and presentation
– challenges of emerging systems
    increasing scale of systems (e.g., Sequoia)
    heterogeneity (e.g., host + accelerator (MIC or GPU); AMD Fusion)
    exploding growth of threading: MIC supports 200+ threads
– blame shifting: identify causes rather than symptoms
– analyzing asynchronous activities
– measure and analyze all facets of application performance
    CPU, accelerator, intra- and inter-node data movement and synchronization, I/O, interaction with HW, interaction with other jobs, interaction with system software
– provide higher-level insight, diagnosis, and guidance
- Community leadership and engagement
— OS support for tools
– past successes: BG/P CNK and Cray CNL kernels; BG/Q spec; Linux kernel; PMU device drivers
– today: interfaces for observing communication, I/O network issues, data access latency
— standards committees: today - OpenMP tools API; future - OpenCL tools API?
— vendor engagement: NVIDIA, Intel, IBM
— outreach with workshops
- Software
— development: new instructions for new CPUs; new programming models for heterogeneous systems
— maintenance
— deployment & user support