SLIDE 1
Modular Neural Networks
CPSC 533
Franco Lee, Ian Ko

Modular Neural Networks: What is it?
Different models of neural networks combined into a single system. Each single network is made into a module that can be freely interchanged.
SLIDE 2
SLIDE 3
Agenda
1) Issues leading to the development of modular neural networks
2) Problems in Neural Network Modeling
3) Cascade Correlation
   - characteristics
   - algorithm
   - mathematical background
   - examples
SLIDE 4
Issues Leading to Modular Networks
-> Reducing Model Complexity
-> Incorporating Knowledge
-> Data Fusion and Prediction Averaging
-> Combination of Techniques
-> Learning Different Tasks Simultaneously
-> Robustness and Incrementality
SLIDE 5
Problems in Neural Network Modeling:
-> The selection of the appropriate number of hidden units
-> Inefficiencies of Back-Propagation:
   1) Slow Learning
   2) The Moving Target Problem
SLIDE 6
Problems in Neural Network Modeling:
Selection of hidden units:
(figure courtesy of the Neural Nets Using Back-propagation presentation, CPSC 533)
SLIDE 7
Problems in Neural Network Modeling:
Slow Learning: When training a network with back-propagation, all input weights into the hidden units must be re-adjusted to minimize the residual error.
SLIDE 8
Problems in Neural Network Modeling:
Moving Target Problem:
Each unit within the network is trying to evolve into a feature detector, but the problem it sees at its inputs is constantly changing. This leaves all the hidden units in a chaotic state, and it takes a long time for them to settle down.
SLIDE 9
Problems in Neural Network Modeling:
Moving Target Problem:
The Herd Effect: Suppose we have a number of hidden units and two tasks to solve. The units cannot communicate with one another, so each must decide independently which task to tackle. If one task generates a larger error signal, all the units tend to work on that task and ignore the other. Once it is solved, all the units move to the second task, and the first problem re-appears.
SLIDE 10
Cascade Correlation (CC): Characteristics
-> supervised learning algorithm
   - its performance is evaluated via an external source
-> a network that determines its own size and topology
   - starts with only an input and an output layer
   - builds a minimal multi-layer network by creating its own hidden layers
SLIDE 11
Cascade Correlation (CC): Characteristics
-> recruits new units according to the residual approximation error
   - trains and adds hidden units one by one to tackle new tasks, hence “Cascade”
   - the “Correlation” between each new unit’s output and the residual error is maximized
   - the input weights going into the new hidden unit are then frozen (fixed)
SLIDE 12
Cascade Correlation (CC)
-> CC combines two ideas:
   - the cascade architecture:
     hidden units are added one at a time, and each unit’s input weights are frozen after training
   - the learning algorithm:
     trains and installs each new hidden unit
SLIDE 13
CC Algorithm
-> start with a minimal network consisting of an input and an output layer
-> train the network with a learning algorithm (e.g. gradient descent, simulated annealing)
-> train until no significant error reduction can be measured
-> add a new hidden unit to reduce the residual error
SLIDE 14
CC Algorithm
-> hidden units are added one by one; each new unit is connected to all input units and to every pre-existing hidden unit
-> freeze all incoming weights of the new hidden unit
-> repeat until the desired performance is reached
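A minimal sketch of this loop in Python/NumPy for a single-output regression task. This is an illustration under assumed details (plain gradient descent instead of quickprop, a single candidate unit instead of a candidate pool, toy sine data), not Fahlman and Lebiere's reference implementation:

    import numpy as np

    rng = np.random.default_rng(0)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def train_output(H, y, lr=0.1, epochs=2000):
        # linear output weights over inputs + frozen hidden activations (MSE descent)
        w = np.zeros(H.shape[1])
        for _ in range(epochs):
            w -= lr * H.T @ (H @ w - y) / len(y)
        return w

    def train_candidate(H, residual, lr=0.5, epochs=2000):
        # gradient ascent on S = |sum_p (V_p - mean(V)) * (E_p - mean(E))|
        w = rng.normal(scale=0.1, size=H.shape[1])
        Ec = residual - residual.mean()      # residual is fixed while the candidate trains
        for _ in range(epochs):
            V = sigmoid(H @ w)
            sign = np.sign((V - V.mean()) @ Ec)
            dV = sign * Ec * V * (1.0 - V)   # dS/d(pre-activation), via the chain rule
            w += lr * H.T @ dV / len(residual)
        return w                             # frozen once the unit is installed

    # toy data: learn y = sin(x)
    x = np.linspace(-np.pi, np.pi, 100)
    y = np.sin(x)
    H = np.column_stack([x, np.ones_like(x)])          # input + bias

    for _ in range(5):                                 # grow up to 5 hidden units
        w_out = train_output(H, y)
        residual = y - H @ w_out
        if np.mean(residual ** 2) < 1e-3:              # "no significant error" threshold (assumed)
            break
        w_cand = train_candidate(H, residual)
        H = np.column_stack([H, sigmoid(H @ w_cand)])  # cascade: new unit sees all earlier units

    w_out = train_output(H, y)
    print("final MSE:", np.mean((H @ w_out - y) ** 2))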
SLIDE 15
Cascade Correlation - Diagram
SLIDE 16
CC Mathematical Background
We want to maximize S, the sum over all output units of the magnitude of the correlation between the candidate unit's value and the residual output error. Maximizing S leads to the creation of very powerful, well-organized feature detectors (the hidden units).
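Written out, this is the quantity defined in Fahlman and Lebiere's cascade-correlation paper (reference 3 below):

    S = \sum_{o} \Big| \sum_{p} (V_p - \bar{V}) (E_{p,o} - \bar{E}_o) \Big|

where V_p is the candidate unit's value on training pattern p, E_{p,o} is the residual error at output unit o for pattern p, and the bars denote averages over all training patterns.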
SLIDE 17
Example: Speech Recognition
The difficulties with speech recognition:
-> deciphering different phonetic sounds
-> everyone has a different voice!
SLIDE 18
Example: Speech Recognition
A simple example:
Designing a network which can classify speech data into one of 10 different phonemes.
SLIDE 19
Example: Speech Recognition
-> train 10 hidden units separately, one per phoneme, then put them together and train the output unit
-> adding a new phoneme: train new hidden units for this phoneme, add them to the network, and then retrain only the output layer
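A minimal NumPy sketch of this modular pattern. The data is a random stand-in for real speech frames, and each "module" is a single sigmoid unit (both assumptions); the point is only the workflow: train modules independently, freeze them, and retrain just the shared output layer:

    import numpy as np

    rng = np.random.default_rng(0)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def train_module(X, t, lr=0.5, epochs=500):
        # one sigmoid unit trained one-vs-rest to fire for a single phoneme
        w = rng.normal(scale=0.1, size=X.shape[1])
        for _ in range(epochs):
            h = sigmoid(X @ w)
            w -= lr * X.T @ ((h - t) * h * (1.0 - h)) / len(t)
        return w                                   # frozen from here on

    def train_output(H, Y, lr=0.5, epochs=500):
        # softmax output layer on top of the frozen module activations
        W = np.zeros((H.shape[1], Y.shape[1]))
        for _ in range(epochs):
            Z = H @ W
            P = np.exp(Z - Z.max(axis=1, keepdims=True))
            P /= P.sum(axis=1, keepdims=True)
            W -= lr * H.T @ (P - Y) / len(H)
        return W

    # toy stand-in for speech frames: 200 samples, 8 features, 10 phonemes
    X = rng.normal(size=(200, 8))
    labels = rng.integers(0, 10, size=200)
    Y = np.eye(10)[labels]

    modules = [train_module(X, (labels == k).astype(float)) for k in range(10)]
    H = sigmoid(X @ np.column_stack(modules))      # frozen module outputs
    W_out = train_output(H, Y)

    # to add an 11th phoneme: train one new module on the new data, append its
    # (frozen) output column to H, and retrain only W_out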
SLIDE 20
Example: Two-Spirals Problem
A classic benchmark for back-propagation algorithms, because it is an extremely hard problem to solve.
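For reference, a sketch that generates the standard two-spirals dataset. The constants (97 points per spiral, three full turns, radius decaying from 6.5) are the commonly cited Lang and Witbrock values, not something given on this slide:

    import numpy as np

    def two_spirals(n=97):
        i = np.arange(n)
        angle = i * np.pi / 16.0                   # 3 full turns over 97 points
        radius = 6.5 * (104 - i) / 104.0           # radius shrinks toward the origin
        x, y = radius * np.sin(angle), radius * np.cos(angle)
        X = np.vstack([np.column_stack([x, y]),    # spiral 1
                       np.column_stack([-x, -y])]) # spiral 2 (point reflection)
        labels = np.concatenate([np.zeros(n), np.ones(n)])
        return X, labels

    X, labels = two_spirals()
    print(X.shape, labels.shape)                   # (194, 2) (194,)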
SLIDE 21
Example: Two-Spirals Problem
SLIDE 22
Example: Two-Spirals Problem
SLIDE 23
Cascade Correlation (CC)
Advantages:
-> reduces learning time
-> transparent
-> creates a structured network
SLIDE 24
Cascade Correlation (CC)
Disadvantages:
-> can lead to specialization to just the training set (overfitting)
SLIDE 25
Cascade Correlation References
1. Rojas, R. (1996). Neural Networks: A Systematic Introduction. Springer-Verlag, Berlin Heidelberg.
2. http://www.mass.u-bordeaux2.fr/~corsini/SNNS_Manual/node164.html
3. ftp://archive.cis.ohio-state.edu/pub/neuroprose/fahlman.cascor-