MIXED PRECISION TRAINING by Michael O'Connor (PowerPoint PPT Presentation)

SLIDE 1

MIXED PRECISION TRAINING

Michael O'Connor

SLIDE 2

MIXED PRECISION

What is the benefit?

Using mixed precision and Volta, your networks can be:
1. 3-4x faster
2. Lighter on memory consumption and bandwidth pressure
3. Just as powerful, with no architecture change

SLIDE 3

A MIXED PRECISION SOLUTION

"Master" weights in FP32 -> fixes imprecise weight updates
Loss (gradient) scaling -> fixes gradient underflow
Accumulate to FP32 (Tensor Cores) -> maintains precision
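The first problem above (imprecise weight updates) can be reproduced numerically. A minimal NumPy sketch, not from the slides, showing why pure-FP16 updates lose small increments while an FP32 master copy keeps them:

```python
import numpy as np

# FP16 has an 11-bit significand, so the spacing between adjacent values
# near 1.0 is 2**-10 (~0.000977). Any update smaller than half that
# spacing rounds away entirely.
w_fp16 = np.float16(1.0)
update = np.float16(1e-4)            # a typical lr * gradient magnitude
print(w_fp16 + update == w_fp16)     # True: the FP16 update is lost

# An FP32 "master" copy of the same weight preserves the update.
w_fp32 = np.float32(1.0)
print(w_fp32 + np.float32(1e-4) == w_fp32)  # False: the update sticks
```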

SLIDE 4

MIXED SOLUTION: FP32 MASTER WEIGHTS

FP32 Master Weights -> (copy) -> FP16 Weights -> (forward pass) -> FP16 Loss -> FP16 Gradients -> (copy) -> FP32 Master Gradients -> (apply) -> FP32 Master Weights
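The flow above can be sketched in a few lines of NumPy. This is a toy illustration with made-up names and a toy one-layer model, not NVIDIA's implementation:

```python
import numpy as np

# Sketch of the master-weights loop:
# FP32 master weights -> copy -> FP16 weights -> forward -> FP16 loss/grads
# -> copy grads back to FP32 -> apply the update to the FP32 master copy.
rng = np.random.default_rng(0)
master_w = rng.normal(size=4).astype(np.float32)  # FP32 master weights
x = rng.normal(size=4).astype(np.float16)         # toy input
lr = np.float32(0.01)

for step in range(3):
    w16 = master_w.astype(np.float16)     # copy: FP32 -> FP16
    # forward pass in FP16 (toy model: loss = 0.5 * (w . x)**2)
    y = np.dot(w16, x)
    grads16 = (y * x).astype(np.float16)  # FP16 gradients dloss/dw
    grads32 = grads16.astype(np.float32)  # copy: FP16 -> FP32
    master_w -= lr * grads32              # apply update in FP32

print(master_w.dtype)  # float32: updates never round away in FP16
```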

SLIDE 5

GRADIENTS RANGE OFFSET
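The range offset the title refers to can be demonstrated directly: many gradient magnitudes fall below FP16's smallest subnormal (~6e-8) and flush to zero, while multiplying by a scale factor shifts them into representable range. A NumPy sketch:

```python
import numpy as np

# A gradient magnitude well below FP16's smallest subnormal (~5.96e-8)
# flushes to zero when stored in half precision.
g = 1e-8                      # a plausible small FP32 gradient value
print(np.float16(g))          # 0.0: underflow, the gradient vanishes

# Scaling by a constant offsets the gradient range into FP16 territory.
scale = 1024.0
print(np.float16(g * scale))  # nonzero: representable and recoverable
```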

SLIDE 6

MIXED PRECISION TRAINING

FP32 Master Weights -> (copy) -> FP16 Weights -> (forward pass) -> FP32 Loss -> (loss scaling) -> Scaled FP32 Loss -> Scaled FP16 Gradients -> Scaled FP32 Gradients -> (remove scale, +clip, etc.) -> FP32 Gradients -> (apply) -> FP32 Master Weights
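A minimal NumPy sketch of this loss-scaling flow, using a toy scalar weight and illustrative names (not NVIDIA's code): scale the loss before backprop so the FP16 gradients don't underflow, then remove the scale in FP32 before applying the update.

```python
import numpy as np

loss_scale = np.float32(1024.0)
master_w = np.float32(0.5)            # FP32 master weight (scalar toy)
lr = np.float32(0.1)

true_grad = np.float32(2e-8)          # would underflow in FP16
unscaled = np.float16(true_grad)      # 0.0: gradient lost without scaling

# Scaling the loss scales every gradient by the same factor, so the
# FP16 gradient survives; the scale is removed again in FP32.
scaled_grad16 = np.float16(true_grad * loss_scale)
grad32 = scaled_grad16.astype(np.float32) / loss_scale  # remove scale
master_w -= lr * grad32               # apply update to the master weight

print(unscaled == 0.0, grad32 > 0.0)  # True True
```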

SLIDE 7

NVCAFFE V0.16 TRAINING ALEXNET

[Chart: images per second training AlexNet on a single P100 GPU, batch size = 128, June 2016 through May 2017. Throughput rises from 1265 img/s (starting point, nvCaffe 0.15) to 2568 img/s.]

Optimizations along the way:
  • Parallelize I/O decode & deserialize
  • CPU affinity
  • Improved algo selection
  • Fused weight update
  • Parallel AllReduce
  • Balance memory alloc (btw. I/O & conv w.s.)
SLIDE 8

RESNET-50 FP32 PERFORMANCE

[Chart: ResNet-50 images per second at 1, 2, 4, and 8 GPUs for Caffe, Caffe2, TensorFlow, MXNet, Torch, CNTK, and Chainer.]

4/30/2017: DGX-1 with batch size = 64 per GPU. Chainer numbers are preliminary.

SLIDE 9

RESNET-50 MIXED PRECISION AND FP32

[Chart: ResNet-50 images per second at 1, 2, 4, and 8 GPUs, comparing MXNet FP32 (GTC 2017), MXNet FP32 (GTC 2018), and MXNet mixed precision (GTC 2018).]

SLIDE 10

INFORMATION SOURCES

Where to learn about mixed precision training:

  • CE8130 - Connect with the Experts: Deep Learning Training for Volta Tensor Cores (Tue 2 PM)
  • S8923 - Training Neural Networks with Mixed Precision: Theory and Practice (Wed 2 PM)
  • S81012 - Training Neural Networks with Mixed Precision: Real Examples (Thu 9 AM)
  • CE8162 - Connect with the Experts: Deep Learning Training for Volta Tensor Cores (Thu 2 PM)
  • Mixed-Precision Training of Deep Neural Networks (NVIDIA Developer Blog)
  • Training with Mixed Precision (NVIDIA User Guide)

SLIDE 11