ANALYZING PERFORMANCE OF CONTAINERIZED CLIMATE MODELS IN SINGULARITY - - PowerPoint PPT Presentation

analyzing performance of containerized climate models in
SMART_READER_LITE
LIVE PREVIEW

ANALYZING PERFORMANCE OF CONTAINERIZED CLIMATE MODELS IN SINGULARITY - - PowerPoint PPT Presentation

ANALYZING PERFORMANCE OF CONTAINERIZED CLIMATE MODELS IN SINGULARITY ON A SUPERCOMPUTER COMPUTER SCIENCE / IT LAURA CARRIERE, CODE 606.2 NOAH OLLER SMITH, PURDUE UNIVERSITY INTRODUCTION AND BACKGROUND Software is currently deployed in HPC


slide-1
SLIDE 1

ANALYZING PERFORMANCE OF CONTAINERIZED CLIMATE MODELS IN SINGULARITY ON A SUPERCOMPUTER

COMPUTER SCIENCE / IT

LAURA CARRIERE, CODE 606.2 NOAH OLLER SMITH, PURDUE UNIVERSITY

slide-2
SLIDE 2

INTRODUCTION AND BACKGROUND

  • Software is currently deployed in HPC

via modules and package managers

  • Containers provide easy software

deployment and different environments

  • Singularity vs Docker - Singularity is

made for HPC and can make use of docker resources.

This is Discover, a supercomputing cluster with over 129,000 cores for NCCS!

slide-3
SLIDE 3

QUESTION / INVESTIGATION

  • Testing to see whether a containerized version of a

part of GEOS (Goddard Earth Observing System) can compete with a non-containerized version

  • Mainly looking at performance impact on

containerized versions

  • GEOSgcm and GEOSfvdycore were containerized
  • GEOSfvdycore was containerized for testing
slide-4
SLIDE 4

METHODS AND APPROACH

  • Had to learn usage of Singularity & Slurm – Used

Docker and PBS before NASA

  • Created multiple containers for testing, including

Debian 10, Fedora 33, and CentOS 7

  • Before containerizing GEOSfvdycore, the base

libraries had to be containerized first

slide-5
SLIDE 5

METHODS AND APPROACH

  • Two parts to a run: Actually setting up the

experiment and then running.

  • Wanted to see longer and shorter compute times.
  • Horizontal resolution of experiments was changed to

get a better idea of efficiency

  • Each container and the non-containerized version got

3 runs on a single node for longer and shorter compute times

slide-6
SLIDE 6

RESULTS AND GRAPHS

64 70 71 82.7 10 20 30 40 50 60 70 80 90 Fedora 33 No Container Debian 10 CentOS 7 AVERAGE TIME (MINUTES) OS TYPE

OS Type vs. Average Time for Longer Computations

8.2 8.2 9.1 10.5 2 4 6 8 10 12 Fedora 33 No Container Debian 10 CentOS 7 AVERAGE TIME (MINUTES) OS TYPE

OS Type vs. Average Time for Shorter Computations

slide-7
SLIDE 7

DISCUSSIONS AND CONCLUSIONS

  • Tests definitely indicate that containers can compete with non-containerized

versions

  • Containers could have done well from having up-to-date software libraries

that GEOSfvdycore depends upon compared to Discover

  • Topics to move forward on this project would include minimizing the container

size and improving how models can be executed

slide-8
SLIDE 8

REFERENCES AND ACKNOWLEDGEMENTS

  • Singularity documentation https://sylabs.io/docs/
  • Slurm documentation https://slurm.schedmd.com/documentation.html
  • MPI Tutorial https://mpitutorial.com/
  • GEOS-5 wiki https://geos5.org/wiki/index.php?title=GEOS-

5_Earth_System_Modeling_and_Data_Assimilation

  • Special thanks to Laura Carriere, Matt Thompson, Kenneth Peck, Jordan

Caraballo-Vega, and Bruce Pfaff