Neural Networks - I Henrik I Christensen Robotics & Intelligent - PowerPoint PPT Presentation

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Networks - I Henrik I Christensen Robotics & Intelligent Machines @ GT Georgia Institute of Technology, Atlanta, GA 30332-0280 hic@cc.gatech.edu Henrik I Christensen (RIM@GT) Neural Networks 1 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Introduction 1 Neural Networks - Architecture 2 Network Training 3 Small Example - ZIP Codes 4 Summary 5 Henrik I Christensen (RIM@GT) Neural Networks 2 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Introduction Initial motivation for design from modelling of neural systems Perceptrons emerged about same time as we started to have real neural data Studies of functional specialization in the brain Henrik I Christensen (RIM@GT) Neural Networks 3 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neurons - the motivation Henrik I Christensen (RIM@GT) Neural Networks 4 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Code Example Henrik I Christensen (RIM@GT) Neural Networks 5 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Outline Outline of ANN architecture Formulation of the criteria function Optimization of weights Example from image analysis Next time: Bayesian Neural Networks Henrik I Christensen (RIM@GT) Neural Networks 6 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Data Process w. Two-Layer Neural Network wTx wTz h(.) σ (x) Henrik I Christensen (RIM@GT) Neural Networks 8 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Net Architecture as a Graph hidden units z M w (1) w (2) MD KM x D y K inputs outputs y 1 x 1 w (2) z 1 10 x 0 z 0 Henrik I Christensen (RIM@GT) Neural Networks 9 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Network Equations Consider an input layer D w (1) � a j = ji x i i =0 where w j 0 and x 0 represent the bias weight / term The activation, a j , is mapped by an activation function z j = h ( a j ) which typically is a Sigmoid or tanh The output is considered the hidden activations Output unit activations are computed, similarly M w (2) � a k = kj z j j =0 Henrik I Christensen (RIM@GT) Neural Networks 10 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Networks - A few more details The full system is then � D  � M w (2) w (1) � � y k ( x , w ) = σ kj h ji x i   j =0 i =0 The information is flowing “forward” through the system Naming is sometimes complicated! 3-layer network single-hidden-layer network two-layer network (input/output) Henrik I Christensen (RIM@GT) Neural Networks 11 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Training Neural Networks For optimization we consider the error function: N � || y ( x n , w ) − t n || 2 E ( w ) = n =1 The optimization is similar to earlier searches Objective ∇ E ( w ) = 0 Due to non-linearity closed form solution is a challenge Newton-Raphson type solutions are possible ∆ w = − H − 1 ∇ E w Often an iterated solution is realistic w ( τ +1) = w ( τ ) − η ∇ E ( w ( τ ) ) Henrik I Christensen (RIM@GT) Neural Networks 13 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Error Backpropagation Consider the error composed of parts N � E ( w ) = E n ( w ) n =1 Considering errors by parts we get � y k = w ki x i i with the error E n = 1 � ( y nk − t nk ) 2 2 k the associated gradient is ∂ E n = ( y nj − t nj ) x ni ∂ w ji Henrik I Christensen (RIM@GT) Neural Networks 14 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Computing gradients Given � a j = w ji z i i and z j = h ( a j ) The gradient is (using chain rule) ∂ E n = ∂ E n ∂ a j ∂ w ji ∂ a j ∂ w ji We already know ∂ E n = ( y k − t j ) = δ j ∂ a j and ∂ a j = z i ∂ w ji Henrik I Christensen (RIM@GT) Neural Networks 15 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Updating of weights Updating backwards in the systems z i δ k δ j w ji w kj z j δ 1 Error Propagation � δ j = h ′ ( a j ) w kj δ k k Henrik I Christensen (RIM@GT) Neural Networks 16 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Update Algorithm 1 Enter a training sample x n , propagate and compare to expected value t n , y ( x n ) 2 Evaluate δ k at all outputs 3 Backpropagate δ to correct hidden unit weights 4 Evaluate derivatives to correct input level weights Henrik I Christensen (RIM@GT) Neural Networks 17 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Issues related to training of networks The Sigmoid is “linear” at 0 so random values around 0 is a good start. Be aware that training a network too much could result in over fitting There can be multiple hidden layers Henrik I Christensen (RIM@GT) Neural Networks 18 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Small Example From (Le Cun 1989) on state of the art of ANN’s for recognition Recognition of handwritten characters has been widely studied Still considered an important benchmark for new recognition methods Henrik I Christensen (RIM@GT) Neural Networks 20 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary ZIP code data Data normalized to 16x16 pixels 320 digits in training set and 160 digits in test set Henrik I Christensen (RIM@GT) Neural Networks 21 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Different types of networks No hidden layer - pure 1 level regression 1 hidden layer with 12 hidden units - fully connected 2 hidden layers and local connectivity 2 hidden layers, locally connected and weight sharing 2 hidden layers, locally connected and 2 level weight sharing Henrik I Christensen (RIM@GT) Neural Networks 22 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Example - Net Architectures Henrik I Christensen (RIM@GT) Neural Networks 23 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Example Results Henrik I Christensen (RIM@GT) Neural Networks 24 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Example - Summary Careful design of network architectures is important Neural Networks offer a rich variety of solutions Later results have shown improved performance with SVN’s Henrik I Christensen (RIM@GT) Neural Networks 25 / 27

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Summary Neural networks are general approximators Useful both for regression and discrimination Some would term them - “self-parameterized lookup tables” There is a rich community engaged in design of systems Rich variety of optimization techniques Henrik I Christensen (RIM@GT) Neural Networks 27 / 27

Neural Networks - I Henrik I Christensen Robotics & Intelligent - PowerPoint PPT Presentation

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary Neural Networks - I Henrik I Christensen Robotics & Intelligent Machines @ GT Georgia Institute of Technology, Atlanta, GA 30332-0280

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural

Neural Networks and Handwriting Recognition Background Neural Networks Neural Network Steven

Neural Networks Neural networks arise from attempts to model Neural Networks human/animal

Sequential Data with Neural Networks Recurrent Neural Networks Sequential input / output Greg

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

CHAPTER II III I CHAPTER Neural Networks as Neural Networks as Associative Memory

Convolutional Neural Networks Convolutional neural networks One of the major kinds of ANNs in use

Neural Networks 0. Logistics Spring 2019 1 Neural Networks are taking over! Neural networks

Neural Networks and their Application to Go Neural Networks Learning Blackjack Theory Training

Neural Networks 1. Introduction Fall 2017 Neural Networks are taking over! Neural networks

Neural Networks Neural Net Basics Dan Klein, John DeNero UC Berkeley Slides adapted from Greg

Relaxation and Hopfield Networks Neural Networks Neural Networks - Hopfield 1 Bibliography

Neural Networks 1. Introduction Spring 2020 1 Neural Networks are taking over! Neural

Introduction to Artificial Intelligence Neural Networks - Deep Learning for NLP Janyl Jumadinova

Neural Networks 1. Introduction Spring 2019 1 Neural Networks are taking over! Neural

Neural network applications ALVINN (Pomerleau, mid 1990s) Autonomous Land Vehicle in Neural

Principles of neural network design Francois Belletti, CS294 RISE Human brains as metaphors of

Theia: Networking for Ultra- Dense Data Centers meg walraed-sullivan,

Geomaterial Characterization Using Electrical Properties (EP) EP of geomaterials are their

Overarching Architecture for Mobile and Wireless Networks S Baydere, T ElBatt, K Harras, P

Computer Networks Zizhan Zheng Spring 2018 1 Outline Administrative trivias What Is

CSci 4211: Introduction to Computer Networks Time: Monday and Wednesday 2:30 to 3:45 pm

Data management in Wireless Sensor Networks (WSN) Giuseppe Amato ISTI-CNR