Artificial Neural Networks
CS 486/686: Introduction to Artificial Intelligence
1
Artificial Neural Networks CS 486/686: Introduction to Artificial - - PowerPoint PPT Presentation
Artificial Neural Networks CS 486/686: Introduction to Artificial Intelligence 1 Introduction Machine learning algorithms can be viewed as approximations of functions that describe the data In practice, the relationships between input and
CS 486/686: Introduction to Artificial Intelligence
1
Machine learning algorithms can be viewed as approximations
In practice, the relationships between input and output can be extremely complex. We want to:
2
Idea: The humans can often learn complex relationships very well. Maybe we can simulate human learning?
3
4
between firing neuron's axon and receiving neuron's dendrite
transfer across specific synaptic junctions.
5
very early models of the neuron.
used theoretical neuroscience, not machine learning
6
7
Link~ Synapse Weight ~ Efficiency Input Fun.~ Dendrite Activation Fun.~ Soma Output = Fire or not
Output
Input Links Activation Function Input Function Output Links
a0 = 1 aj = g(inj) aj g inj wi,j w0,j
Bias Weight
ai
from i to j
8
have a linear equation)
right amounts
9
concepts)
10
It is possible to construct a universal set of logic gates using the neurons described (McCulloch and Pitts 1943)
11
It is possible to construct a universal set of logic gates using the neurons described (McCulloch and Pitts 1943)
12
13
14
15
16
Can learn only linear separators
17
Learning means adjusting the weights
How do we measure loss of fidelity?
18
X
k
1 2(yk − (hW (x))k)2
19
20
21
22
just use Gradient Descent, as before.
have a problem: What is y?
23
error in the output layer.
proportionate to the connection strength.
24
layer, and update the weights .
updated, and update it.
25
networks with more than one hidden layer
hidden layer to approximate any continuous function, if you use multiple layers you typically need less units
26
27
28
2n-2 hidden layers
29
How do you train them?
30
ImageNet Large Scale Visual Recognition Challenge
valued inputs, and/or noisy (e.g. sensor data)
model)
understand the mapping
31
in many-layered networks.
many neurons are needed?
predictions outside of the range of values it was trained on)
32