Training Neural Networks with Local Error Signals - Arild Nøkland - PowerPoint PPT Presentation



SLIDE 1

Training Neural Networks with Local Error Signals

Arild Nøkland, Lars H. Eidnes

SLIDE 2

Local learning

  • Typically we train neural networks by backpropagating errors from the loss function back through the layers.
  • It is hard to explain how the brain could do this.
    • Backward locking, weight symmetry, and other problems.
  • There would be massive practical benefits if you could avoid this:
    • No need to keep activations in memory.
    • Easy parallelization: put each layer on its own GPU and train all layers at the same time.

SLIDE 3

Training each layer on its own works!

Results on more datasets later.

SLIDE 4

The approach

Train each layer with two sub-networks, each with its own loss function
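The per-layer setup can be sketched roughly as follows. This is a minimal numpy sketch, not the paper's implementation: it assumes fully connected layers (the paper uses convolutional sub-networks), and the class name, loss weighting `beta`, and all shapes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(p, y_onehot):
    return -np.mean(np.sum(y_onehot * np.log(p + 1e-12), axis=1))

def cosine_sim_matrix(x):
    x = x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), 1e-8)
    return x @ x.T

class LocalLayer:
    """One hidden layer with its own two local heads: a prediction
    sub-network scored with cross entropy, and a similarity sub-network
    scored against the label similarity matrix."""

    def __init__(self, d_in, d_hidden, n_classes):
        self.W = rng.normal(0.0, 0.1, (d_in, d_hidden))            # layer weights
        self.W_pred = rng.normal(0.0, 0.1, (d_hidden, n_classes))  # pred head
        self.W_sim = rng.normal(0.0, 0.1, (d_hidden, d_hidden))    # sim head

    def forward(self, x):
        return np.maximum(x @ self.W, 0.0)  # ReLU

    def local_losses(self, x, y_onehot, beta=0.99):
        h = self.forward(x)
        pred_loss = cross_entropy(softmax(h @ self.W_pred), y_onehot)
        sim_loss = np.mean((cosine_sim_matrix(h @ self.W_sim)
                            - cosine_sim_matrix(y_onehot)) ** 2)
        # Weighted combination of the two losses; the weighting is an assumption.
        return (1.0 - beta) * pred_loss + beta * sim_loss

# Each layer passes its activation forward as if it were fresh input data;
# no error signal ever travels backwards between layers.
```

Stacking layers then just means feeding `layer.forward(x)` into the next `LocalLayer`; each layer's parameters are updated only from its own `local_losses`.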

SLIDE 5

Similarity matching loss

Intuition: Want things from the same class to have similar representations. Measure similarity with a matrix of cosine similarities.
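Concretely, the loss compares the cosine-similarity matrix of the activations with that of the one-hot labels. A numpy sketch (the function names are ours, not the paper's):

```python
import numpy as np

def cosine_similarity_matrix(x):
    """Pairwise cosine similarities between the rows of a batch."""
    x = x.reshape(x.shape[0], -1)  # flatten each example
    x = x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), 1e-8)
    return x @ x.T

def similarity_matching_loss(activations, labels_onehot):
    """Squared error between the activation similarity matrix and the
    label similarity matrix. Label similarities are 1 for same-class
    pairs and 0 for different-class pairs, so same-class examples are
    pushed towards similar representations."""
    S_h = cosine_similarity_matrix(activations)
    S_y = cosine_similarity_matrix(labels_onehot)
    return np.mean((S_h - S_y) ** 2)
```

The loss is zero exactly when same-class activations are perfectly aligned and different-class activations are orthogonal.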

SLIDE 6

Results

SLIDE 7

Results

SLIDE 8

Results

SLIDE 9

Optimization vs generalization

  • Back-prop has the fastest and lowest drop in training error
  • Local learning is competitive with back-prop in terms of test error
  • Local learning is a good regularizer
  • But: both the pred- and sim-losses help optimization in a complementary way

SLIDE 10

Sim-loss + global backprop

SLIDE 11

Results, back-prop free version

  • Still have 1-step backprop. To remove it:
    • Remove the conv2d before the sim-loss
    • Use feedback alignment [Lillicrap et al., 2014] through the linear layer before the pred-loss
    • Also: use a random projection of the labels

SLIDE 12

Summary

  • We train each layer on its own, without global backprop
  • We use two loss functions:
    • Standard cross-entropy loss
    • A similarity matching loss
      • Squared error on similarity matrices
      • Wants similar activations for things of the same class
  • Works well on VGG-like networks

SLIDE 13

Intriguing questions

  • We’ve just prodded the space of local loss functions, and stumbled across something that helps a lot. Is there more to be found in this space?
  • Can we better understand how layers interact when they are trained on their own? I.e., why does this work?
  • Does something like this happen in the brain?