

SLIDE 1

TRAINING NEURAL NETWORKS ON THE EDGE

Navjot Kukreja, Alena Shilova

SLIDE 2

Also: Olivier Beaumont, Jan Huckelheim, Nicola Ferrier, Paul Hovland, Gerard Gorman

SLIDE 3

BACKGROUND

SLIDE 4

Typical data flow pattern for adjoint problems

SLIDE 5

Memory consumption during an adjoint problem

SLIDE 6

Checkpointing (Revolve)

SLIDE 7

Setup

Legend: forward step; executing forward step; saved forward step; reverse step; executing reverse step; reverse step completed.
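A minimal sketch of the scheme these frames animate, assuming hypothetical single-step operators `forward` and `reverse` and a uniform checkpoint spacing `k` (the actual Revolve schedule places checkpoints non-uniformly to minimise recomputation):

```python
# Checkpoint-and-recompute for a forward/reverse (adjoint) sweep.
def forward(state):
    return state + 1            # stand-in for one forward step

def reverse(state, adjoint):
    return adjoint + state      # stand-in for one reverse step

n_steps, k = 12, 4              # total steps; checkpoint every k steps
state, checkpoints = 0, {0: 0}

# Forward sweep: save only every k-th state instead of all of them.
for t in range(n_steps):
    state = forward(state)
    if (t + 1) % k == 0:
        checkpoints[t + 1] = state

# Reverse sweep: restore the nearest earlier checkpoint, recompute the
# missing forward steps, then execute the reverse step.
adjoint = 0
for t in reversed(range(n_steps)):
    state = checkpoints[(t // k) * k]
    for _ in range(t - (t // k) * k):
        state = forward(state)
    adjoint = reverse(state, adjoint)
```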

SLIDE 8

Where else do we see the same data-access pattern?

VGGNet
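Backpropagation through a deep network such as VGGNet touches activations in exactly this forward-then-reverse order. A small illustration with hooks on a toy stand-in model (not VGG itself) makes the pattern visible:

```python
import torch
import torch.nn as nn

# Toy stand-in for a deep sequential network: the backward pass visits
# layers in the reverse of the order in which the forward pass stored
# their activations, the same data-access pattern as an adjoint problem.
net = nn.Sequential(nn.Linear(8, 8), nn.ReLU(), nn.Linear(8, 8), nn.ReLU())

for idx, layer in enumerate(net):
    layer.register_forward_hook(
        lambda m, i, o, idx=idx: print(f"forward  layer {idx}"))
    layer.register_full_backward_hook(
        lambda m, gi, go, idx=idx: print(f"backward layer {idx}"))

net(torch.randn(1, 8)).sum().backward()
# prints: forward 0..3, then backward 3..0
```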

SLIDE 9

ARRAY OF THINGS

SLIDE 10

SLIDE 11

WAGGLE PAYLOAD COMPUTER

ODROID XU4, based on the Samsung Exynos 5422 CPU: four A15 cores and four A7 cores; Mali-T628 MP6 GPU with OpenCL support; 2 GB LPDDR3 RAM; attached flash storage.

SLIDE 12

VIEWPOINT PROBLEM

SLIDE 13

SLIDE 14

SLIDE 15

STUDENT-TEACHER MODEL

SLIDE 16

CHALLENGES

Network (not a challenge)
Storage (not a challenge)
Computation (not necessarily a challenge)
Memory!

SLIDE 17

MEMORY REQUIRED TO TRAIN RESNET

SLIDE 18

Memory required (MB) for image size $224 \times 224$

SLIDE 19

Memory required (MB) for batch size 1

SLIDE 20

Memory required (GB) for batch size 8

SLIDE 21

CHECKPOINTING

SLIDE 22

PyTorch:
fast-evolving Python package widely applied in deep learning
uses Tensors as its basic class; Tensors are similar to NumPy arrays but can also be used on a GPU
dynamically defines the computational graph of the model
designed to be memory efficient: it ships with a checkpointing strategy
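A minimal usage sketch of that built-in strategy, `torch.utils.checkpoint.checkpoint_sequential` (layer sizes and the segment count below are illustrative):

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint_sequential

# Toy sequential model: with checkpointing, only the segment-boundary
# activations are kept during the forward pass; the interior activations
# are recomputed on demand during the backward pass.
model = nn.Sequential(*[nn.Sequential(nn.Linear(256, 256), nn.ReLU())
                        for _ in range(8)])

x = torch.randn(4, 256, requires_grad=True)
out = checkpoint_sequential(model, segments=2, input=x)
out.sum().backward()
```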

SLIDE 23

Checkpoint sequential: number of segments = 2

SLIDE 24

Checkpoint sequential: number of segments = 2

SLIDE 25

Checkpoint sequential: number of segments = 2

SLIDE 26

Checkpoint sequential: number of segments = 2

SLIDE 27

Checkpoint sequential: number of segments = 2

$$ \mbox{Memory} = s - 1 + \bigl(l - \left\lfloor l/s \right\rfloor (s - 1) \bigr). $$
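A quick numeric check of this count (in units of stored activations), for illustrative values of $l$ layers and $s$ segments:

```python
import math

def peak_activations(l, s):
    # s - 1 checkpointed segment boundaries, plus the activations of the
    # segment that is recomputed in full during the backward pass.
    return s - 1 + (l - math.floor(l / s) * (s - 1))

print(peak_activations(l=16, s=2))  # 1 + (16 - 8)  = 9
print(peak_activations(l=16, s=4))  # 3 + (16 - 12) = 7
```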

SLIDE 28

Revolve: dynamic programming

$$\mbox{Opt}[\ell, 1] = \frac{\ell (\ell + 1)}{2} u_f + (\ell + 1) u_b$$
$$\mbox{Opt}[1, c] = u_f + 2 u_b$$
$$\mbox{Opt}[\ell, c] = \min_{1 \leq i \leq \ell - 1} \bigl( i\, u_f + \mbox{Opt}[\ell - i, c - 1] + \mbox{Opt}[i - 1, c] \bigr)$$
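A direct transcription of the recurrence into memoised Python; a sketch assuming unit step costs $u_f$, $u_b$ and taking $\mbox{Opt}[0, c] = u_b$, the base case implied by the $(\ell + 1) u_b$ term above:

```python
from functools import lru_cache

u_f, u_b = 1.0, 1.0   # assumed costs of one forward / one backward step

@lru_cache(maxsize=None)
def opt(l, c):
    if l == 0:
        return u_b    # reversing an empty chain: one backward step
    if c == 1:        # a single checkpoint slot: rerun from the start
        return l * (l + 1) / 2 * u_f + (l + 1) * u_b
    if l == 1:
        return u_f + 2 * u_b
    # Place a checkpoint after i forward steps; recurse on the two parts
    # with c - 1 and c checkpoint slots respectively.
    return min(i * u_f + opt(l - i, c - 1) + opt(i - 1, c)
               for i in range(1, l))

print(opt(20, 3))     # minimal cost to reverse 20 steps with 3 checkpoints
```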

SLIDE 29

Comparison of Checkpoint sequential and Revolve

Batch Size: $1$, Image Size: $224 \times 224$

SLIDE 30

SLIDE 31

Batch Size: $8$, Image Size: $224 \times 224$

SLIDE 32

Batch Size: $1$, Image Size: $500 \times 500$

SLIDE 33

Batch Size: $8$, Image Size: $500 \times 500$

SLIDE 34

PRACTICAL IMPLEMENTATION AND CONCLUDING REMARKS

SLIDE 35

THANK YOU