HPC-SIG Ecosystem Validation
- Jan. 14 2019
HPC-SIG Ecosystem Validation Jan. 14 2019 Baptiste Gerondeau - - PowerPoint PPT Presentation
HPC-SIG Ecosystem Validation Jan. 14 2019 Baptiste Gerondeau Renato Golin HPC-SIG Lab and Validation Matrix Aggregate machines in the same infrastructure, and validate their performance using a Validation Matrix Validation Matrix must be
For more info visit
linaro.org/hpc
For more info visit
linaro.org/hpc
For more info visit
linaro.org/hpc
Identifying the different dimensions
A Vertical Slice of the Stack Principal dimensions : ➔ Application ➔ HPC environment stack ➔ Machine provisioning
OpenHPC’s testsuite
For more info visit
linaro.org/hpc
Identifying the different dimensions
The Stack from the Lab’s point of view Machine provisioning : ➔ Network configuration ➔ Kernel ➔ OS ➔ HPC Stack
provisioning
For more info visit
linaro.org/hpc
Identifying the different dimensions
Provisioning Method Variations Multiple ways to provision : ➔ Warewulf Stateless (VNFS) ➔ Warewulf Stateful (OS image) ➔ Ansible
For more info visit
linaro.org/hpc
Identifying the different dimensions
Different Network Layouts
head node only
For more info visit
linaro.org/hpc
Identifying the different dimensions
Different Kernels
Platform
platforms in the process of being upstreamed
For more info visit
linaro.org/hpc
Identifying the different dimensions
Different Operating Systems
For more info visit
linaro.org/hpc
Abstractions, and the user’s environment
Abstracting Network Variations
For more info visit
linaro.org/hpc
Abstractions, and the user’s environment
Abstracting Provisioning Variations
For more info visit
linaro.org/hpc
Abstractions, and the user’s environment
Abstracting Environment Variations
For more info visit
linaro.org/hpc
Abstractions, and the user’s environment
Accounting for extra HPC services
(additional hardware)
For more info visit
linaro.org/hpc
What the User sees, configures
The Lab’s Interface ➔ Choose Application ❖ Lab picks default configuration ❖ User fine tunes configuration
For more info visit
linaro.org/hpc
Cluster Deployment
For more info visit
linaro.org/hpc
Distributed Applications Enablement
For more info visit
linaro.org/hpc
Toolchain Benchmarking
For more info visit
linaro.org/hpc
Library Enablement and Enhancement
For more info visit
linaro.org/hpc
○ We have multiple vendors available ○ On a standardised infrastructure
For more info visit
linaro.org/hpc
○ We have multiple vendors available ○ On a standardised infrastructure
○ OpenHPC Ansible recipes (with the OpenHPC community) ○ SDI (MrP, Jenkins, Ansible) helping members to replicate our work ○ Community CI (OpenHPC test-suite, MPI MTT, OpenMP tests, OpenBLAS CI)
For more info visit
linaro.org/hpc
○ We have multiple vendors available ○ On a standardised infrastructure
○ OpenHPC Ansible recipes (with the OpenHPC community) ○ SDI (MrP, Jenkins, Ansible) helping members to replicate our work ○ Community CI (OpenHPC test-suite, MPI MTT, OpenMP tests, OpenBLAS CI)
○ Internal tests and benchmarks (via Jenkins, no infrastructure knowledge needed) ○ Testing new packages, libraries, compilers (comparison jobs, CI results, statistic analysis)
For more info visit
linaro.org/hpc
○ We have multiple vendors available ○ On a standardised infrastructure
○ OpenHPC Ansible recipes (with the OpenHPC community) ○ SDI (MrP, Jenkins, Ansible) helping members to replicate our work ○ Community CI (OpenHPC test-suite, MPI MTT, OpenMP tests, OpenBLAS CI)
○ Internal tests and benchmarks (via Jenkins, no infrastructure knowledge needed) ○ Testing new packages, libraries, compilers (comparison jobs, CI results, statistic analysis)