HPC-AI CONVERGENCE Workshop on the Convergence of ML & HPC - PowerPoint PPT Presentation

NVIDIA TO ACCELERATE THE HPC-AI CONVERGENCE Workshop on the Convergence of ML & HPC Gunter Roeth gunterr@nvidia.com March 2020

GRAND CHALLENGES REQUIRE MASSIVE COMPUTING REINVENTING THE LI-ION BATTERY UNDERSTANDING HIV’S STRUCTURE CLOUD RESOLVING CLIMATE SIMULATIONS 3M Node Hours | 7 Days on Titan 100M Node Hours | 840 Days on Piz Daint 10M node Hours |16 Days on BlueWaters 2

TOP500 EFFECTS 100 PFLOPS 10 PFLOPS 1 PFLOPS 100 TFLOPS 10 TFLOPS 1 TFLOPS 100 GFLOPS All #1 #500 3

SOMETHING NEW: AI + HPC = REVOLUTION 4

INGREDIENTS: BIG DATA 5

BIG DATA IN SCIENCE Big Science ingests/outputs Big Data Johns Hopkins Square Kilometer Array Large Hadron Collider Turbulence Database NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. 6

AI WORKFLOW FOR HPC DATA NEW DATA TRAINING SET REGRESSION SET SIMULATION TRAINING REGRESSION TESTING INFERENCE (FP64/FP32) (FP32/FP16) (FP16/INT8) (FP16/INT8) ERRORS 7

THE CONVERGENCE OF HPC * AI Integrating the Third and Fourth Pillars of Scientific Discovery HPC AI 40+ years of algorithms New algorithms and models based on first principles with potential to increase theory model size and accuracy Dramatically Improves Accuracy and /or Time-to-Solution at Large Scale Commercially Improve or validate the Understanding Climate/weather Clinically viable viable fusion Standard Model of cosmological dark forecasts with ultra- precision medicine energy Physics energy and matter high fidelity

AI FOR HPC Transformative Tool To Accelerate The Pace of Scientific Innovation 5,000X Faster 300,000X Faster 90% accuracy 33% Faster Process LIGO Signal Predict Molecular Energetics Fusion Sustainment Track Neutrinos Understanding Universe Drug Discovery Clean Energy Particle Physics 70% accuracy 11% higher accuracy Weeks to 10 milliseconds 14X Faster Score Protein Ligand Monitor Earth’s Vital Analyze Gravitational Lensing Generate Bose-Einstein Drug Discovery Climate Astrophysics Condensate (Physics) Improves Accuracy Accelerates Time to Solution Enabling realization of full scientific potential Unlocking the use of science in exciting new ways 9

INTELLIGENT HPC DL Driving Future HPC Breakthroughs From Trained networks as solvers • calendar • Super-resolution of coarse simulations time to real Low- and mixed-precision • time? Simulation for training, network in production • Pre- Post- Simulation processing processing • Select/classify/augment/ • Analyze/reduce/augment distribute input data output data • Control job parameters • Act on output data 10

THE SHAPE OF AI SUPERCOMPUTING 11

VOLTA TENSOR CORE GPU FUELS WORLD'S FASTEST SUPERCOMPUTER Fused HPC and AI Computing In a Unified Platform Genomics (CoMet) 150X World’s First Exascale Run Finding Genes-to-disease Connection Over Titan Same accuracy as FP64 w/ Tensor Core 50X Quantum Chemistry (QMCPack) Simulate New Materials High-Temperature Semiconductors Over Titan Summit Supercomputer AI: 3 Exaflops Oakridge National Labs HPC: 122 Petaflops Measured performance: Summit node vs Titan node 12

AI: A NEW COMPUTING PARADIGM 1000X GPU-Computing perf 10 7 by 1.5X per year 2025 10 6 1.1X per year 10 5 10 4 √ 10 3 1.5X per year 10 2 Single-threaded perf 1980 1990 2000 2010 2020 Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten New plot and data collected for 2010-2015 by K. Rupp 13

NVIDIA DL AND HPC – JOINTLY SOLVE NEW PROBLEMS, BETTER 14

AI SUPERCOMPUTING IS HERE Extending The Reach of HPC By Combining Computational & Data Science Clean Energy Drug Discovery Turbulent Flow Molecular Dynamics “ What’s happening?” “Is there cancer?” Monitoring Climate “Next move?” Understanding Universe Change Structural Analysis N-body Simulation “What does she mean?” COMPUTATIONAL SCIENCE DATA SCIENCE COMPUTATIONAL & DATA SCIENCE S8242 – DL for Computational Science, Jeff Adie & Yang Juntao Presented ~20 Success Stories of DL in Computational Science (GTC on-demand: http://on-demand-gtc.gputechconf.com)

Computational Chemistry 16

AI Quantum Breakthrough Background Developing a new drug costs $2.5B and takes 10-15 years. Quantum chemistry (QC) simulations are important to accurately screen millions of potential drugs to a few most promising drug candidates. Challenge QC simulation is computationally expensive so researchers use approximations, compromising on accuracy. To screen 10M drug candidates, it takes 5 years to compute on CPUs. Solution Researchers at the University of Florida and the University of North Carolina leveraged GPU deep learning to develop ANAKIN-ME, to reproduce molecular energy surfaces with super speed (microseconds versus several minutes), extremely high (DFT) accuracy, and at 1-10/millionths of the cost of current computational methods. Essentially the DL model is trained to learn Hamiltonian of the Schrodinger equation. Impact Faster, more accurate screening at far lower cost 55 17

NEURAL NETWORK MODEL APPROACH Training set: ~20M DFT data points. Molecules with 1 to 8 atoms from GDB database 18

Computational Mechanics 20

Deep Learning for Solid Mechanics FEA UPDATED WITH NEURAL NETWORK FEA trained deep neural network for surrogate modelling of estimated stress distribution. Deepvirtuality, a spinoff from Volkswagen Data:Lab under Nvidia Inception Program has demonstrate with their software aimed for a quicker prediction of structural data. An demonstration of Structure Born Noise of a V12 Engine with Deepvirtuality Torsional Frequencies of a Car Body by Deepvirtuallity 21

Eulerian Fluid Simulation Approximating PDE solutions NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. 22 “Accelerating Eulerian Fluid Simulation With Convolutional Networks” , Thompson et al., 2016

SimRes at SC19 in Denver Physics Informed NN 23

EXAMPLES OF PINN Vortex induced vibrations problem of flow past a circular cylinder. (eta) 24 Incompressible Navier-Stokes equations

Train first Data Driven Networks (DDNN) 25

AUTOMOTIVE AERODYNAMICS Inference Training 26

DATA DRIVEN METHODS Pros & Cons - Need to generate a lot of Simulations (accuracy dependent on the simulation code) - No Physics Awareness; Generalizability may be limited - Not very efficient for Complex 3D Geometries/Curved Surfaces - Interpolation/Extrapolation Errors + Not dependent on Physics 27 27

Use Physics Informed NN (PINNs) 28

PHYSICS INFORMED NEURAL NETS: ARCHITECTURE A Neural Network Architecture for Computational Mechanics/Physics problems ❑ Point Cloud for 3D Geometries & Meshes (Fixed/Moving, Deforming, Structured & Unstructured) ❑ Physics Driven & Physics Aware Networks (respects the governing PDEs, Multi-disciplinary) ❑ Performance optimized for GPU tensor cores PINN - Physics Informed Neural Point Cloud representation of 29 29 Networks Computational Domain & Data on 3D Geometries

PHYSICS DRIVEN METHODS Special Considerations Problem Modeling: ❑ o Complex Geometries Sampling Insensitivity ❑ Network Architecture: ❑ Faithfully represent the Physics with initial & boundary conditions o Architectural Requirements for n th order derivatives o Loss Convergence Acceleration o Activation Functions o Gradients & Discontinuities o Global vs. Local o 30 30

Results of Physics Informed NN (PINNs) 32

STEADY STATE: 2D LID DRIVEN CAVITY SIMNET VS. OPENFOAM U velocity difference = 0.2% V velocity difference = 0.4% 33

HEAT SINK: A MULTI-PHYSICS PROBLEM Heat Sink – * Temperatures to not exceed the design criteria Objectives – * Similar accuracy as the Solver * Geometry representation with Point Clouds * Multiple simultaneous parametrized & unparametrized geometries Physics involved – CFD & Heat Transfer Ansys IcePack used for Simulation (** we kindly acknowledge Ansys’s support **) 34

NETWORK ARCHITECTURE Multi-Physics Neural Networks Multi-Physics PDEs CFD (with turbulence) – 2 nd Order PDE CFD Heat Transfer in Solids & Fluid (turbulent) Fluid-Solid Interface Conditions Heat Transfer Temperature in Fluid Heat Flux PINN Network Architecture Heat Transfer 10 layers for non-Physics Informed Network 10 x 2n layers for n th order PDEs in Solid 50-500 neurons per layer Swish Activation Function 35

HEATSINK DESIGN OPTIMIZATION Physics Informed Neural Net for Coupled CFD-Heat Transfer Problems 37

Earth Science 38

Deep Learning for Climate Modeling ANOMALY DETECTION IN CLIMATE DATA Identifying “extreme” weather events in multi -decadal datasets with 5-layered Convolutional Neural Network. Reaching 99.98% of detection accuracy. (Kim et al, 2017) Dataset: Visualization of historic cyclones from JWTC hurricane report from 1979 to 2016 Systemic framework for detection and localization of extreme climate event 39

HPC-AI CONVERGENCE Workshop on the Convergence of ML & HPC - PowerPoint PPT Presentation

NVIDIA TO ACCELERATE THE HPC-AI CONVERGENCE Workshop on the Convergence of ML & HPC Gunter Roeth gunterr@nvidia.com March 2020 GRAND CHALLENGES REQUIRE MASSIVE COMPUTING REINVENTING THE LI-ION BATTERY UNDERSTANDING HIVS STRUCTURE

HPC @ SAO S.G. Korzennik - SAO HPC Analyst hpc@cfa February 2013 SGK ( hpc@cfa ) HPC @ SAO

Uni.lu HPC School 2020 PS6: HPC Containers: Singularity Uni.lu High Performance Computing (HPC)

The HPC Skill Tree A Brief Overview Kai Himstedt On Behalf of the HPC-CF Board BoF:

Whats new in HPC? Gregory Bauer To keep up-to-date on HPC HPC Guru -

UL HPC School 2017[bis] PS1: Getting Started on the UL HPC platform UL High Performance

UL HPC School 2017 PS5: Advanced Scheduling with SLURM and OAR on UL HPC clusters UL High

UL HPC School 2017 PS1: Getting Started on the UL HPC platform UL High Performance Computing

The convergence of HPC and BigData What does it mean for HPC sysadmins? damienfrancois FOSDEM

ML HPC: Optimizing Optimizers for Optimization Workshop on the Convergence of ML & HPC

HPC and Cloud Convergence: What about HPC and Edge? Panel at HPBDC 19 by Dhabaleswar K. (DK)

CONTAINERS DEMOCRATIZE HPC CJ Newburn, Principal Architect for HPC, NVIDIA GTC19 S9525 -

Computer Security Summer Scholars 2016 Ma7 Vander Werf HPC System Administrator Security in HPC

Building a Grid System for HPC HPC on Grid High Performance Computing (HPC): Use of computer

HPC IN EUROPE Organisation of public HPC resources Context Focus on publicly-funded HPC

HPC platforms @ UL Overview (as of 2013) and Usage http://hpc.uni.lu S. Varrette, PhD.

HPC platforms @ UL Overview (as of 2013) and Usage http://hpc.uni.lu S. Varrette, H. Cartiaux

Commission Overview The Commission was co chaired by two former governors, Mike Leavitt of

Continuum Percolation and Duality with Hard-Particle Systems Across Dimensions Salvatore Torquato

Surgery of the Nasal Valve Scott B. Roofe, MD, FACS Facial Plastic and Reconstructive Surgery

Speech Signal Representations Berlin Chen 2004 References: 1. X. Huang et. al., Spoken Language

Reducing Health Inequalities through Innovation and System Change Paul Bollom, Interim Chief

Compute First Networking: Distributed Computing meets ICN Micha Krl 1 , Spyridon Mastorakis 2

Water The H 2 O molecule bonding: usually we draw two lone

Formal Approaches to Safe Software Development for Medical Devices by Alena Simalatsar

Sambuz

Useful Links

Newsletter

Mail Us

HPC-AI CONVERGENCE Workshop on the Convergence of ML & HPC - PowerPoint PPT Presentation

NVIDIA TO ACCELERATE THE HPC-AI CONVERGENCE Workshop on the Convergence of ML & HPC Gunter Roeth gunterr@nvidia.com March 2020 GRAND CHALLENGES REQUIRE MASSIVE COMPUTING REINVENTING THE LI-ION BATTERY UNDERSTANDING HIVS STRUCTURE

HPC @ SAO S.G. Korzennik - SAO HPC Analyst hpc@cfa February 2013 SGK ( hpc@cfa ) HPC @ SAO

Uni.lu HPC School 2020 PS6: HPC Containers: Singularity Uni.lu High Performance Computing (HPC)

The HPC Skill Tree A Brief Overview Kai Himstedt On Behalf of the HPC-CF Board BoF:

Whats new in HPC? Gregory Bauer To keep up-to-date on HPC HPC Guru -

UL HPC School 2017[bis] PS1: Getting Started on the UL HPC platform UL High Performance

UL HPC School 2017 PS5: Advanced Scheduling with SLURM and OAR on UL HPC clusters UL High

UL HPC School 2017 PS1: Getting Started on the UL HPC platform UL High Performance Computing

The convergence of HPC and BigData What does it mean for HPC sysadmins? damienfrancois FOSDEM

ML HPC: Optimizing Optimizers for Optimization Workshop on the Convergence of ML &amp; HPC

HPC and Cloud Convergence: What about HPC and Edge? Panel at HPBDC 19 by Dhabaleswar K. (DK)

CONTAINERS DEMOCRATIZE HPC CJ Newburn, Principal Architect for HPC, NVIDIA GTC19 S9525 -

Computer Security Summer Scholars 2016 Ma7 Vander Werf HPC System Administrator Security in HPC

Building a Grid System for HPC HPC on Grid High Performance Computing (HPC): Use of computer

HPC IN EUROPE Organisation of public HPC resources Context Focus on publicly-funded HPC

HPC platforms @ UL Overview (as of 2013) and Usage http://hpc.uni.lu S. Varrette, PhD.

HPC platforms @ UL Overview (as of 2013) and Usage http://hpc.uni.lu S. Varrette, H. Cartiaux

Commission Overview The Commission was co chaired by two former governors, Mike Leavitt of

Continuum Percolation and Duality with Hard-Particle Systems Across Dimensions Salvatore Torquato

Surgery of the Nasal Valve Scott B. Roofe, MD, FACS Facial Plastic and Reconstructive Surgery

Speech Signal Representations Berlin Chen 2004 References: 1. X. Huang et. al., Spoken Language

Reducing Health Inequalities through Innovation and System Change Paul Bollom, Interim Chief

Compute First Networking: Distributed Computing meets ICN Micha Krl 1 , Spyridon Mastorakis 2

Water The H 2 O molecule bonding: usually we draw two lone

Formal Approaches to Safe Software Development for Medical Devices by Alena Simalatsar

Sambuz

Useful Links

Newsletter

Mail Us

ML HPC: Optimizing Optimizers for Optimization Workshop on the Convergence of ML & HPC