Onkar Patil1, Saurabh Hukerikar2, Frank Mueller1, Christian Engelmann2
- 1Dept. of Computer Science, North Carolina State University
2Computer Science and Mathematics Division, Oak Ridge National Laboratory
Exploring Use-cases for Non-Volatile Memories in support of HPC - - PowerPoint PPT Presentation
Exploring Use-cases for Non-Volatile Memories in support of HPC Resilience Onkar Patil 1 , Saurabh Hukerikar 2 , Frank Mueller 1 , Christian Engelmann 2 1 Dept. of Computer Science, North Carolina State University 2 Computer Science and Mathematics
Onkar Patil1, Saurabh Hukerikar2, Frank Mueller1, Christian Engelmann2
2Computer Science and Mathematics Division, Oak Ridge National Laboratory
architecture
APPLICATION
STATIC DATA STRUCTURES DYNAMIC DATA STRUCTURES
DRAM NVM APPLICATION
STATIC DATA STRUCTURES DYNAMIC DATA STRUCTURES
DRAM NVM APPLICATION
STATIC DATA STRUCTURES DYNAMIC DATA STRUCTURES
DRAM NVM
Data Versioning NVM-based Main Memory Application-directed Checkpointing
SSD from 100GB to 256GB
elements
0.001 0.01 0.1 1 10 4 8 16 GFLOPS Nodes
GFLOPS in node scaling for StarDGEMM
DRAM PMEM_ONLY PMEM_CPY PMEM_VER
0.01 0.1 1 10 100 4 8 16 Time(sec) Nodes
Execution times in node scaling for StarDGEMM
DRAM PMEM_ONLY PMEM_CPY PMEM_VER
0.0001 0.001 0.01 0.1 1 10 1000 2000 3000 GFLOPS
GFLOPS for problem size scaling in StarDGEMM
DRAM PMEM_ONL Y
0.01 0.1 1 10 100 1000 10000 1000 2000 3000 Time(sec)
Execution time for problem size scaling in StarDGEMM
DRAM PMEM_ONLY PMEM_CPY PMEM_VER
resilience of the system
Evaluating Performance of Burst Buffer Models for Real-Application Workloads in HPC Systems
Harsh Khetawat Frank Mueller Christopher Zimmer
Introduction
becoming bottleneck
– Checkpoint/Restart I/O – Staging – Write-through cache for parallel FS
Burst Buffers on Cori
Placement
– Co-located with compute nodes (Summit) – Co-located with I/O nodes (Cori) – Separate set of nodes
– Capability – I/O models, staging, etc. – Predictability – Impact on shared resources, runtime variability – Economic – Infrastructure reuse, cost of storage device
– Choice of network topology
Idea
– CODES simulation suite – Real-world I/O traces (Darshan) – Full multi-tenant system with mixed workloads (capability/capacity) – Supports network topologies – Local & external storage models
architectures
Conclusion
– Burst buffer placement – Network topology – Performance of striping across burst buffers – Overhead of resilience schemes
– Simulate specific workloads – Determine best fit
Thank You