

SLIDE 1

Evolving Neural Networks

This lecture is based on Xin Yao’s tutorial slides “From Evolving Single Neural Networks to Evolving Ensembles”, presented at CEC 2007. Literature: Evolving Artificial Neural Networks, Xin Yao, Proceedings of the IEEE, Volume 87, Issue 9, Sep 1999, pages 1423-1447, http://www.cs.bham.ac.uk/~xin/papers/published_iproc_sep99.pdf. A good Internet article: Evolutionary Neural Networks: Design Methodologies

SLIDE 2

Evolving Neural Networks (cont.)

  • Learning and evolution are two fundamental forms of adaptation
  • Simulated evolution can be introduced into neural networks at different levels. Idea: integration of neural and evolutionary computation
  • for evolving neural network connection weights
  • for evolving neural network architectures
  • for evolving neural network learning rules

SLIDE 3

What Is Evolutionary Computation?

  • It is the study of computational systems which use ideas and draw inspiration from natural evolution
  • One of the principles borrowed is survival of the fittest
  • Evolutionary computation techniques can be used in optimization, learning, and design
  • Evolutionary computation techniques do not require rich domain knowledge to use. However, domain knowledge can be incorporated into them

SLIDE 4

A Simple Evolutionary Algorithm (EA)

  • 1. Generate the initial population P(0) at random, and set i = 0
  • 2. REPEAT
     (a) Evaluate the fitness of each individual in P(i)
     (b) Select parents from P(i) based on their fitness in P(i)
     (c) Generate offspring from the parents using crossover and mutation to form P(i + 1)
     (d) i = i + 1
  • 3. UNTIL halting criteria are satisfied
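The steps above can be sketched in code. This is a minimal illustration, not from the slides: it assumes a toy "OneMax" fitness function (count the 1-bits in a binary string), tournament selection, one-point crossover, and bit-flip mutation; all parameter values are arbitrary choices.

```python
import random

GENOME_LEN, POP_SIZE, GENERATIONS = 20, 30, 50

def fitness(ind):
    return sum(ind)  # "OneMax": number of 1-bits (illustrative fitness)

def select(pop):
    # Tournament selection: the fitter of two randomly sampled individuals
    return max(random.sample(pop, 2), key=fitness)

def crossover(p1, p2):
    cut = random.randrange(1, GENOME_LEN)  # one-point crossover
    return p1[:cut] + p2[cut:]

def mutate(ind, rate=0.05):
    return [b ^ 1 if random.random() < rate else b for b in ind]

# 1. Generate the initial population P(0) at random
pop = [[random.randint(0, 1) for _ in range(GENOME_LEN)] for _ in range(POP_SIZE)]
# 2. REPEAT until the halting criterion (a generation budget) is met:
#    (a) evaluate fitness, (b) select parents, (c) produce offspring
for i in range(GENERATIONS):
    pop = [mutate(crossover(select(pop), select(pop))) for _ in range(POP_SIZE)]

best = max(pop, key=fitness)
print(fitness(best))  # typically close to GENOME_LEN
```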

SLIDE 5

Evolving Connection Weights

  • Recall: supervised learning in neural networks was formulated as minimization of an error function (sum-of-squares error, cross-entropy error)
  • Gradient-based optimization techniques are often used, but
  • they require the error function to be differentiable
  • they are sensitive to initial conditions
  • they get trapped in local minima
  • they can converge slowly (if no acceleration tricks are used)
  • EAs are good at dealing with complex and nondifferentiable functions. They are robust and less sensitive to initial conditions

SLIDE 6

Encoding Connection Weights

Before evolution, an encoding scheme (genotypic representation) is often needed to represent neural networks. There are different methods for representing weights.

  • Binary representation
  • Real number representation
  • Hybrid method
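The first two schemes can be illustrated as follows; the function names and the 8-bit fixed-point field over [-4, 4) are our own illustrative choices, not from the slides.

```python
# Illustrative sketch of two weight-encoding schemes for a genotype
# holding three connection weights.
weights = [0.5, -1.25, 3.0]

# Binary representation: each weight quantized to an 8-bit field in [-4, 4)
def encode_binary(ws, bits=8, lo=-4.0, hi=4.0):
    step = (hi - lo) / (2 ** bits)
    return "".join(format(int((w - lo) / step), f"0{bits}b") for w in ws)

def decode_binary(genome, bits=8, lo=-4.0, hi=4.0):
    step = (hi - lo) / (2 ** bits)
    fields = [genome[i:i + bits] for i in range(0, len(genome), bits)]
    return [lo + int(f, 2) * step for f in fields]

# Real-number representation: the genotype simply *is* the weight vector,
# so decoding is the identity and mutation can be a Gaussian perturbation.
genome_real = list(weights)

g = encode_binary(weights)
print(g)                 # a 24-bit string (3 weights x 8 bits)
print(decode_binary(g))  # quantized weights, within one step of the originals
```

The binary scheme trades precision for a fixed-length bitstring that standard crossover and mutation operators can act on directly.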

SLIDE 7

The Evolution of Connection Weights

Assumption: the neural network architecture is fixed during the evolution

  • 1. Decode each individual (genotype) into a set of connection weights (phenotype)
  • 2. Evaluate the fitness (error) of each individual
  • 3. Reproduce a number of children for each individual in the current generation according to its fitness
  • 4. Apply genetic operators, such as crossover and mutation, to each child individual generated above and obtain the next generation
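A sketch of this cycle for a fixed 2-2-1 architecture on the XOR task. This is an assumed toy setup: the real-number genotype of 9 weights, Gaussian mutation, and truncation selection are illustrative choices, not the slides' prescribed method.

```python
import math
import random

XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

def decode(genome):
    # Genotype -> phenotype: split the flat vector into per-unit weights
    w_hidden = [genome[0:3], genome[3:6]]  # 2 hidden units: w1, w2, bias
    w_out = genome[6:9]                    # output unit: h1, h2, bias
    return w_hidden, w_out

def error(genome):
    w_hidden, w_out = decode(genome)
    sse = 0.0
    for (x1, x2), t in XOR:
        h = [math.tanh(w[0] * x1 + w[1] * x2 + w[2]) for w in w_hidden]
        y = math.tanh(w_out[0] * h[0] + w_out[1] * h[1] + w_out[2])
        sse += (y - t) ** 2
    return sse

def fitness(genome):
    return 1.0 / (1.0 + error(genome))  # lower error -> higher fitness

pop = [[random.uniform(-1, 1) for _ in range(9)] for _ in range(20)]
for gen in range(100):
    pop.sort(key=fitness, reverse=True)
    parents = pop[:10]                        # truncation selection
    children = [[w + random.gauss(0, 0.2) for w in random.choice(parents)]
                for _ in range(10)]           # Gaussian mutation only
    pop = parents + children

best = max(pop, key=fitness)
print(error(best))  # sum-of-squares error on XOR, typically shrinking
```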

SLIDE 8

Permutation Problem

Two genotypes that encode the same network: hidden units c and d, together with their 4-bit weight fields (decoding to the weights 2, 3, 4, 7, 10), are simply swapped.

Genotype 1 (rows = from-unit, columns = to-unit, 4-bit binary weights):

      a     b     c     d     e
  a  0000  0000  0100  0010  0000
  b  0000  0000  1010  0000  0000
  c  0000  0000  0000  0000  0111
  d  0000  0000  0000  0000  0011
  e  0000  0000  0000  0000  0000

Genotype 2 (same phenotype, with c and d exchanged):

      a     b     c     d     e
  a  0000  0000  0010  0100  0000
  b  0000  0000  0000  1010  0000
  c  0000  0000  0000  0000  0011
  d  0000  0000  0000  0000  0111
  e  0000  0000  0000  0000  0000

Caused by the many-to-one mapping from genotypes to phenotypes.
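The effect can be checked directly. The tanh hidden activations, linear output, and input values below are our own illustrative assumptions; the weights are the decoded 4-bit values.

```python
import math

# Demo of the permutation problem: two genotypes that differ only by
# swapping hidden units c and d encode the same phenotype network.
# Units a, b are inputs; c, d are hidden; e is the output.
G1 = {("a", "c"): 4, ("a", "d"): 2, ("b", "c"): 10,
      ("c", "e"): 7, ("d", "e"): 3}

# Swap c and d everywhere: a different genotype, the same network
swap = {"a": "a", "b": "b", "c": "d", "d": "c", "e": "e"}
G2 = {(swap[i], swap[j]): w for (i, j), w in G1.items()}
assert G1 != G2  # distinct genotypes

def forward(G, xa, xb):
    """Feedforward pass: tanh hidden units, linear output unit."""
    h = {u: math.tanh(G.get(("a", u), 0) * xa + G.get(("b", u), 0) * xb)
         for u in ("c", "d")}
    return sum(G.get((u, "e"), 0) * h[u] for u in ("c", "d"))

# Both genotypes compute the identical input-output mapping
print(abs(forward(G1, 0.5, -1.0) - forward(G2, 0.5, -1.0)) < 1e-12)  # True
```

This is exactly why crossover is problematic here: recombining two such genotypes can destroy functioning hidden units even though the parents are behaviorally identical.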

SLIDE 9

Discussions on Evolutionary Training

  • Evolutionary training is attractive because it can handle complex and nondifferentiable error surfaces better. It is particularly appealing when gradient information is unavailable or very costly to obtain or estimate
  • Evolutionary training may be slow for some problems in comparison with fast gradient descent algorithms. However, it performs a global search rather than the purely local search of gradient descent
  • For some problems, evolutionary training is significantly faster and more reliable than gradient descent algorithms

SLIDE 10

Hybrid Training

  • Global search techniques always face a conflict between exploitation and exploration
  • EAs are not very good at fine-tuned local search, although they are good at global search
  • Efficiency of evolutionary training can be improved significantly by incorporating a local search procedure into the evolution, i.e., combining the EA's global search ability with local search's fine-tuning ability
  • Other alternative: use EAs to search for a near-optimal set of connection weights and then use a local search algorithm to fine-tune the weights
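The second alternative can be sketched as follows, with simple hill climbing standing in for the local search (a gradient method would be more typical) and a toy quadratic error surface; all settings here are illustrative.

```python
import random

def sphere(ws):
    return sum(w * w for w in ws)  # toy error surface, minimum at the origin

def evolve(dim=5, pop_size=20, gens=30):
    """Coarse global search: truncation selection + Gaussian mutation."""
    pop = [[random.uniform(-5, 5) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(gens):
        pop.sort(key=sphere)
        elite = pop[:pop_size // 2]
        pop = elite + [[w + random.gauss(0, 0.5) for w in random.choice(elite)]
                       for _ in range(pop_size // 2)]
    return min(pop, key=sphere)  # near-optimal, but not fine-tuned

def local_search(ws, steps=200, sigma=0.05):
    """Fine-tuning: small perturbations, accept only improvements."""
    best = ws[:]
    for _ in range(steps):
        cand = [w + random.gauss(0, sigma) for w in best]
        if sphere(cand) < sphere(best):
            best = cand
    return best

coarse = evolve()
fine = local_search(coarse)
print(sphere(coarse), ">=", sphere(fine))  # fine-tuning never hurts here
```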

SLIDE 11

Evolving NN Architectures

  • Neural networks have mostly been designed manually
  • Design of a near-optimal neural network architecture can be formulated as a search problem in the architecture space
  • Evolutionary algorithms are good candidates for searching this space

SLIDE 12

Evolving NN Architectures: Common Practice

[Diagram: the evolutionary cycle for architectures. A population of individuals (encoded NN architectures) undergoes selection; the selected individuals produce a new generation via crossover, mutation, and replacement; each encoded architecture is decoded into an NN architecture and trained to obtain the fitness values of the individuals.]

SLIDE 13

Typical Cycle of Evolving NN Architectures

  • 1. Decode each individual in the current generation into an architecture
  • 2. Train each neural network with the decoded architecture, starting from different sets of random initial connection weights
  • 3. Calculate the fitness of each individual
  • 4. Reproduce a number of children for each individual in the current generation based on its fitness
  • 5. Apply variation operators to the children generated above and obtain the next generation

SLIDE 14

Fitness Evaluation

Fitness is a measure of the quality of the found solution. The fitness measure can be based on

  • Training error
  • Testing error
  • Training/testing error and network complexity
  • based on the number of connections
  • based on some information criterion, such as AIC or MDL
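A complexity-penalized fitness might look like this; the formula and the penalty weight `lam` are our illustrative choices, loosely in the spirit of such criteria, not a definition from the slides.

```python
# Illustrative fitness combining training error with network complexity:
# smaller error and fewer connections both raise the fitness.
def fitness(train_error, n_connections, lam=0.01):
    return 1.0 / (1.0 + train_error + lam * n_connections)

# A large net with slightly lower error vs. a much smaller net
dense = fitness(train_error=0.10, n_connections=120)
sparse = fitness(train_error=0.12, n_connections=20)
print(sparse > dense)  # True: the smaller network wins despite higher error
```

The penalty steers the search toward parsimonious architectures, which tend to generalize better than networks that merely minimize training error.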

SLIDE 15

Direct Encoding Scheme

  • In the direct encoding scheme, each connection in an architecture is directly specified by its binary representation
  • An N × N matrix C can represent an architecture with N units, where Cij indicates the presence or absence of the connection from unit i to unit j
  • Each such matrix has a direct one-to-one mapping to the corresponding architecture. The binary string representing an architecture is just the concatenation of the rows (or columns) of the matrix.

SLIDE 16

Feedforward Network Encoding (Example)

        1 1 1 1 1 1         0110 101 01 1 1 2 3 4 5

  • Direct encoding scheme

is simple and straightforward to implement

  • Found solutions are tight

and contains interesting designs that no one has hit upon so far

  • It does not scale well for

large architectures
