IMPLEMENTATION OF A PARALLEL BATCH TRAINING ALGORITHM FOR DEEP NEURAL NETWORK


  1. IMPLEMENTATION OF A PARALLEL BATCH TRAINING ALGORITHM FOR DEEP NEURAL NETWORK
     Yuping Lin
     iFLYTEK Laboratory for Neural Computing and Machine Learning
     Department of Electrical Engineering and Computer Science
     York University, Toronto
     November 10, 2015

  2. OUTLINE
     - Review
     - Neural Network Representation
     - Sequential Trainer
     - Concurrent Trainer
       - Dividing tasks
       - Collecting statistics
       - Using monitor
       - Using thread pool
     - Testing
     - Future Work

  3. REVIEW -- NEURAL NETWORK TRAINING
     Forward phase:
     $a_k^{(m+1)} = G\Big(\sum_j x_{jk} \cdot a_j^{(m)} + c_k\Big)$
     where $G(y)$ is the nonlinear activation function.
     (Slide figure: a feed-forward network, input layer at the bottom, output layer at the top.)
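
As a concrete illustration of the forward phase, here is a minimal Java sketch (Java being the deck's implementation language, per the java.lang.Runnable reference later on). The class and method names, and the choice of a sigmoid for $G$, are assumptions for illustration only, not the presentation's actual code.

```java
// Minimal sketch of the forward phase for one layer:
// a_k^(m+1) = G( sum_j x_jk * a_j^(m) + c_k ), here with a sigmoid as G.
class ForwardSketch {
    static double[] forwardLayer(double[] a, double[][] x, double[] c) {
        double[] next = new double[c.length];
        for (int k = 0; k < c.length; k++) {
            double sum = c[k];                  // bias c_k
            for (int j = 0; j < a.length; j++)
                sum += x[j][k] * a[j];          // x_jk * a_j^(m)
            next[k] = sigmoid(sum);             // a_k^(m+1) = G(...)
        }
        return next;
    }

    static double sigmoid(double y) { return 1.0 / (1.0 + Math.exp(-y)); }
}
```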

  4. REVIEW -- NEURAL NETWORK TRAINING
     Error back propagation:
     $\varepsilon_l^{(out)} = a_l^{(out)} - U_l$
     $\varepsilon_j^{(m)} = G'\big(a_j^{(m)}\big) \cdot \sum_k x_{jk} \cdot \varepsilon_k^{(m+1)}$
     where $G'(y)$ is the derivative of the activation function and $U$ is the desired output vector.
     Weight updating:
     $\Delta x_{jk} = a_j^{(m)} \cdot \varepsilon_k^{(m+1)}$
     $x_{jk} = x_{jk} - \delta \cdot \Delta x_{jk}$
     where $\delta$ is the learning rate.
     (Slide figure: errors propagating from the output layer back to the input layer.)
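
A matching sketch of the backward phase, again assuming a sigmoid activation, for which the derivative can be expressed via the activation value as $G' = a(1-a)$. All names are hypothetical.

```java
// Sketch of error back propagation and weight update for one layer.
class BackpropSketch {
    // Output-layer error: eps_l^(out) = a_l^(out) - U_l (U is the desired output).
    static double[] outputError(double[] a, double[] U) {
        double[] eps = new double[a.length];
        for (int l = 0; l < a.length; l++) eps[l] = a[l] - U[l];
        return eps;
    }

    // Hidden-layer error: eps_j^(m) = G'(a_j^(m)) * sum_k x_jk * eps_k^(m+1).
    static double[] hiddenError(double[] a, double[][] x, double[] epsNext) {
        double[] eps = new double[a.length];
        for (int j = 0; j < a.length; j++) {
            double sum = 0.0;
            for (int k = 0; k < epsNext.length; k++) sum += x[j][k] * epsNext[k];
            eps[j] = a[j] * (1.0 - a[j]) * sum;  // sigmoid derivative: G' = a * (1 - a)
        }
        return eps;
    }

    // Weight update: x_jk = x_jk - delta * (a_j^(m) * eps_k^(m+1)).
    static void updateWeights(double[][] x, double[] a, double[] epsNext, double delta) {
        for (int j = 0; j < a.length; j++)
            for (int k = 0; k < epsNext.length; k++)
                x[j][k] -= delta * a[j] * epsNext[k];
    }
}
```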

  5. REVIEW -- SEQUENTIAL TRAINING VS. CONCURRENT TRAINING
     (Slide diagram: sequential training shows a single trainer performing the weight updates; concurrent training shows a master trainer that sends training data to several worker trainers, collects their statistics, and then performs the weight update.)

  6. BROAD VIEW OF THE IMPLEMENTATION
     There are 3 major components in our implementation:
     - The neural network representation: a package of classes that form a neural network.
     - Sequential trainers: classes that implement the sequential training algorithm.
     - Concurrent trainers: classes that implement the concurrent training algorithm.

  7. NEURAL NETWORK REPRESENTATION

  8. NEURAL NETWORK REPRESENTATION
     - Main class that represents a multi-layer perceptron.
     - Has attributes representing the components of a neural network:
       - Weights
       - Biases
       - Activation functions
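
A minimal sketch of what such a main class might look like; the field names and the ActivationFunction interface are assumptions for illustration, and WeightMatrix is sketched under the next slide.

```java
// Hypothetical sketch of the main network class and its attributes.
class NeuralNetwork {
    WeightMatrix[] weights;            // one weight matrix between consecutive layers
    double[][] biases;                 // biases[m][k]: bias c_k feeding layer m+1
    ActivationFunction[] activations;  // the nonlinear function G used at each layer

    interface ActivationFunction {
        double apply(double y);        // G(y)
        double derivative(double y);   // G'(y), needed for back propagation
    }
}
```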

  9. NEURAL NETWORK REPRESENTATION
     - Represents a weight matrix.
     - Supports forward/backward multiplications, as sketched below.
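
A hypothetical sketch of such a weight-matrix class: forward multiplication serves the forward phase, backward multiplication serves back propagation. Names are illustrative only.

```java
// Weight matrix supporting both multiplication directions.
class WeightMatrix {
    final double[][] x;  // x[j][k]: weight between unit j and unit k

    WeightMatrix(double[][] x) { this.x = x; }

    // Forward multiplication (forward phase): out_k = sum_j x_jk * a_j.
    double[] forwardMultiply(double[] a) {
        double[] out = new double[x[0].length];
        for (int j = 0; j < x.length; j++)
            for (int k = 0; k < out.length; k++)
                out[k] += x[j][k] * a[j];
        return out;
    }

    // Backward multiplication (back propagation): out_j = sum_k x_jk * eps_k.
    double[] backwardMultiply(double[] eps) {
        double[] out = new double[x.length];
        for (int j = 0; j < out.length; j++)
            for (int k = 0; k < eps.length; k++)
                out[j] += x[j][k] * eps[k];
        return out;
    }
}
```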

  10. THE SEQUENTIAL TRAINER

  11. THE SEQUENTIAL TRAINER
      Training is divided into 3 layers (see the skeleton below):
      - train() for the whole training process
      - trainEpoch() for the training of each epoch
      - trainBatch() for the training of each mini-batch
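
A possible skeleton of the three layers; the signatures are hypothetical, but the train()/trainEpoch()/trainBatch() decomposition follows the slide.

```java
// Sequential trainer skeleton: epochs -> mini-batches -> per-batch update.
class SequentialTrainer {
    void train(double[][] data, double[][] targets, int epochs, int batchSize) {
        for (int e = 0; e < epochs; e++) trainEpoch(data, targets, batchSize);
    }

    void trainEpoch(double[][] data, double[][] targets, int batchSize) {
        for (int start = 0; start < data.length; start += batchSize) {
            int end = Math.min(start + batchSize, data.length);
            trainBatch(data, targets, start, end);
        }
    }

    void trainBatch(double[][] data, double[][] targets, int start, int end) {
        // Forward phase, back propagation and weight update over samples [start, end).
    }
}
```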

  12. THE CONCURRENT TRAINER

  13. THE CONCURRENT TRAINER
      - Similar to the sequential trainer.
      - Keeps references to the monitor object and a thread pool.
      - Concurrency occurs within the trainBatch() method, as sketched below.
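
A hypothetical sketch of how trainBatch() might fan the mini-batch out to the pool and wait on the monitor; TrainingTask and TaskMonitor are sketched under the following slides.

```java
import java.util.concurrent.ExecutorService;

// trainBatch() splits the mini-batch among worker tasks, submits them to the
// shared thread pool, and blocks on the monitor until all tasks report back.
class ConcurrentTrainer {
    private final ExecutorService pool;
    private final TaskMonitor monitor;

    ConcurrentTrainer(ExecutorService pool, TaskMonitor monitor) {
        this.pool = pool;
        this.monitor = monitor;
    }

    void trainBatch(double[][] batch, double[][] targets, int numTasks)
            throws InterruptedException {
        monitor.reset(numTasks);
        int chunk = (batch.length + numTasks - 1) / numTasks;  // ceiling division
        for (int t = 0; t < numTasks; t++) {
            int start = Math.min(t * chunk, batch.length);
            int end = Math.min(start + chunk, batch.length);
            pool.submit(new TrainingTask(batch, targets, start, end, monitor));
        }
        monitor.awaitAll();  // wait until every task has called taskDone()
        // ...then apply the accumulated update statistics to the weights...
    }
}
```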

  14. THE CONCURRENT TRAINER -- DIVIDING TASKS
      - Implements the java.lang.Runnable interface.
      - Represents a training task for a worker thread.
      - Keeps references to the monitor object and to the global shared variables for the update statistics.
      A sketch of such a task follows.
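
A hypothetical sketch of the worker task. Only the Runnable interface is confirmed by the slide; the fields and constructor are assumptions matching the trainBatch() sketch above.

```java
// Worker task: computes update statistics for its slice of the mini-batch,
// accumulating them locally (the merge step is shown on the next slide).
class TrainingTask implements Runnable {
    private final double[][] batch, targets;
    private final int start, end;        // this task's slice of the mini-batch
    private final TaskMonitor monitor;

    TrainingTask(double[][] batch, double[][] targets,
                 int start, int end, TaskMonitor monitor) {
        this.batch = batch;
        this.targets = targets;
        this.start = start;
        this.end = end;
        this.monitor = monitor;
    }

    @Override
    public void run() {
        // For each sample in [start, end): forward phase, back propagation,
        // and accumulation of update statistics into a task-local buffer.
        // ...merge the local statistics into the shared globals (next slide)...
        monitor.taskDone();  // report completion to the master thread
    }
}
```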

  15. THE CONCURRENT TRAINER -- COLLECTING STATISTICS
      - The update statistics are accumulated locally within each task.
      - They are then merged into the shared global variables when the task finishes.
      - This merge needs synchronization: synchronized blocks, compare-and-set, etc. (see the sketch below).
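
The slide names both synchronized blocks and compare-and-set; here is a hypothetical sketch of each option. Since Java has no atomic double, the compare-and-set variant loops on the long bit pattern, a standard idiom.

```java
import java.util.concurrent.atomic.AtomicLong;

// Option 1: guard the whole merge with a synchronized block.
class SharedStatistics {
    private final double[] gradient;  // shared global accumulator
    private final Object lock = new Object();

    SharedStatistics(int size) { gradient = new double[size]; }

    void merge(double[] local) {
        synchronized (lock) {
            for (int i = 0; i < gradient.length; i++) gradient[i] += local[i];
        }
    }
}

// Option 2: lock-free accumulation of a double via a compare-and-set loop.
class AtomicDoubleAccumulator {
    private final AtomicLong bits = new AtomicLong(Double.doubleToLongBits(0.0));

    void add(double v) {
        long prev, next;
        do {
            prev = bits.get();
            next = Double.doubleToLongBits(Double.longBitsToDouble(prev) + v);
        } while (!bits.compareAndSet(prev, next));  // retry if another thread raced us
    }

    double get() { return Double.longBitsToDouble(bits.get()); }
}
```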

  16. THE CONCURRENT TRAINER -- USING MONITOR
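
No text from this slide survived extraction beyond the title. As a labeled assumption, a monitor for coordinating mini-batch completion could be a simple barrier built on Java's intrinsic wait/notify mechanism:

```java
// Completion monitor: workers call taskDone(), the master blocks in awaitAll().
class TaskMonitor {
    private int pending;  // tasks not yet finished

    synchronized void reset(int numTasks) { pending = numTasks; }

    // Called by each worker task when it finishes.
    synchronized void taskDone() {
        pending--;
        if (pending == 0) notifyAll();  // wake the waiting master thread
    }

    // Called by the master; blocks until all tasks have reported in.
    synchronized void awaitAll() throws InterruptedException {
        while (pending > 0) wait();
    }
}
```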

  17. THE CONCURRENT TRAINER -- USING THREAD POOL
      - Repeatedly creating and destroying threads can waste a lot of resources and time.
      - A fixed-size thread pool can be pre-defined to avoid this problem, as shown below.
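
For example, using the standard java.util.concurrent API (the class name here is hypothetical; one thread per available core is a common sizing choice, not one stated on the slide):

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class TrainingPool {
    // Create the fixed-size pool once and reuse it for every mini-batch;
    // shut it down only when training is finished.
    static ExecutorService create() {
        return Executors.newFixedThreadPool(Runtime.getRuntime().availableProcessors());
    }
}
```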

  18. TESTING
      - The concurrent training algorithm only parallelizes the computations over data samples within each mini-batch.
      - The computed update statistics should therefore be the same for both the sequential and concurrent algorithms.
      - We define the concurrent implementation as correct if the model trained by the concurrent trainer is equivalent to the same model trained by the sequential trainer.
      - Two models are considered equivalent if the differences between all their weights and biases are within some small error ζ (see the sketch below).
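
The equivalence check itself is straightforward; a sketch, assuming the weights and biases of each model are flattened into arrays:

```java
// Two models are equivalent if every weight and bias differs by at most zeta.
class EquivalenceCheck {
    static boolean equivalent(double[] a, double[] b, double zeta) {
        if (a.length != b.length) return false;
        for (int i = 0; i < a.length; i++)
            if (Math.abs(a[i] - b[i]) > zeta) return false;
        return true;
    }
}
```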

  19. TESTING
      We ran 100 comparison tests, and in all of them the two models were considered equivalent.

  20. FUTURE WORK
      Run the sequential and the concurrent algorithms on a multicore machine to measure how much training time is reduced by the concurrent algorithm.
