

Class #19: Neural Networks

Machine Learning (COMP 135): M. Allen, 30 March 2020


Neural Learning Methods

} An obvious source of biological inspiration for learning research: the brain

} The work of McCulloch and Pitts (1943), a precursor to the perceptron, began as research into how we could precisely model the neuron and the network of connections that allow animals (like us) to learn

} These networks are used as classifiers: given an input, they label that input with a classification, or a distribution over possible classifications


The Basic Neuron Model

} A neuron gets input from a set of other neurons, or from the problem input, and computes the function g

} The output aj is either passed along to another set of neurons, or is used as the final output for the learning problem itself


[Figure: the basic neuron model. Input links carry activations ai, each weighted by wi,j; a fixed dummy input a0 = 1 carries the bias weight w0,j. The input function Σ produces inj, and the activation function g produces the output aj = g(inj), which is passed along the output links. Source: Russell & Norvig, AI: A Modern Approach (Prentice Hall, 2010)]


Input Bias Weights

} Each input ai to neuron j is given a weight wi,j

} Each neuron is treated as having a fixed dummy input, a0 = 1

} The input function is then the weighted linear sum:


$$\mathrm{in}_j = \sum_{i=0}^{n} w_{i,j}\, a_i = w_{0,j}\, a_0 + w_{1,j}\, a_1 + w_{2,j}\, a_2 + \cdots + w_{n,j}\, a_n = w_{0,j} + w_{1,j}\, a_1 + w_{2,j}\, a_2 + \cdots + w_{n,j}\, a_n$$
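As a concrete illustration, here is a minimal sketch of the input function in Python (NumPy assumed; the array values are made up for the example):

```python
import numpy as np

def input_function(a, w):
    """Weighted input in_j for one neuron: both vectors include the
    dummy entry a_0 = 1 and its bias weight w_0,j at index 0."""
    return np.dot(w, a)

# Two real inputs plus the fixed dummy input a_0 = 1.
a = np.array([1.0, 0.5, -2.0])   # [a_0, a_1, a_2]
w = np.array([0.1, 0.8, 0.3])    # [w_0,j, w_1,j, w_2,j]
in_j = input_function(a, w)      # 0.1 + 0.8*0.5 + 0.3*(-2.0) = -0.1
```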



We’ve Seen This Before!

} The weighted linear sum of inputs, with the dummy a0 = 1, is just a form of the dot product that our classifiers have been using all along

} Remember that the “neuron” here is just another way of looking at the perceptron idea we already discussed


$$\mathrm{in}_j = \sum_{i=0}^{n} w_{i,j}\, a_i = w_{0,j} + w_{1,j}\, a_1 + w_{2,j}\, a_2 + \cdots + w_{n,j}\, a_n = \mathbf{w}_j \cdot \mathbf{a}$$


Neuron Output Functions

} While the inputs to any neuron are treated in a linear fashion, the output function g need not be linear

} The power of neural nets comes from the fact that we can combine large numbers of neurons together to compute any function (linear or not) that we choose



The Perceptron Threshold Function

} One possible function is the binary threshold, which is suitable for “firm” classification problems, and causes the neuron to activate based on a simple binary function:


$$g(\mathrm{in}_j) = \begin{cases} 1 & \text{if } \mathrm{in}_j \ge 0 \\ 0 & \text{otherwise} \end{cases}$$
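A minimal sketch of this threshold activation in Python (the function name is just illustrative):

```python
def threshold(in_j):
    """Binary threshold activation: fire (1) when the weighted input is
    non-negative, stay off (0) otherwise."""
    return 1 if in_j >= 0 else 0
```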


The Sigmoid Activation Function

} A function that has been more often used in neural networks is the logistic (also known as the sigmoid), as seen before

} This gives us a “soft” value, which we can often interpret as the probability of belonging to some output class


$$g(\mathrm{in}_j) = \frac{1}{1 + e^{-\mathrm{in}_j}}$$
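A sketch of the logistic activation in Python (NumPy assumed):

```python
import numpy as np

def sigmoid(in_j):
    """Logistic (sigmoid) activation: 1 / (1 + e^(-in_j)), always in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-in_j))

sigmoid(0.0)   # 0.5
sigmoid(4.0)   # ~0.982, which we can read as a class probability
```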



Power of Perceptron Networks

} A single-layer network combines a linear function of the input weights with the non-linear output function

} If we threshold the output, we have a boolean (1/0) function

} This is sufficient to compute numerous linear functions


x1 OR x2:
x1 x2 | y
 0  0 | 0
 0  1 | 1
 1  0 | 1
 1  1 | 1

x1 AND x2:
x1 x2 | y
 0  0 | 0
 0  1 | 0
 1  0 | 0
 1  1 | 1


Power of Perceptron Networks

} A single-layer network with inputs for the variables (x1, x2), and a bias term (x0 == 1), can compute the OR of its inputs

} Threshold: (y == 1) if the weighted sum (S >= 0); else (y == 0)

} What weights can we apply to the three inputs to produce OR?

} One answer: -0.5 + x1 + x2


x1 OR x2:
x1 x2 | y
 0  0 | 0
 0  1 | 1
 1  0 | 1
 1  1 | 1

[Figure: a single threshold unit with inputs 1 (bias), x1, and x2, and output y]


Power of Perceptron Networks

} What about the AND function instead?

} One answer: -1.5 + x1 + x2


x1 AND x2:
x1 x2 | y
 0  0 | 0
 0  1 | 0
 1  0 | 0
 1  1 | 1

[Figure: the same single threshold unit, with inputs 1 (bias), x1, and x2, and output y]
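A quick check, in Python, that the weights given on these two slides reproduce OR and AND when run through the binary threshold (a minimal sketch; the helper name is illustrative):

```python
def perceptron(w0, w1, w2, x1, x2):
    """Single threshold unit: bias weight w0 on the dummy input, plus w1, w2."""
    s = w0 + w1 * x1 + w2 * x2
    return 1 if s >= 0 else 0

for x1 in (0, 1):
    for x2 in (0, 1):
        y_or  = perceptron(-0.5, 1.0, 1.0, x1, x2)   # -0.5 + x1 + x2
        y_and = perceptron(-1.5, 1.0, 1.0, x1, x2)   # -1.5 + x1 + x2
        print(x1, x2, y_or, y_and)
# OR fires on every row except (0, 0); AND fires only on (1, 1).
```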


Linear Separation with Perceptron Networks

} We can think of binary functions as dividing the (x1, x2) plane

} The ability to express such a function is analogous to the ability to linearly separate the data into such regions


[Figure: the four points of the x1 OR x2 truth table plotted in the (x1, x2) plane; the y = 0 point can be separated from the y = 1 points by a single straight line]



Linear Separation with Perceptron Networks

} We can think of binary functions as dividing the (x1, x2) plane

} The ability to express such a function is analogous to the ability to linearly separate the data into such regions


[Figure: the four points of the x1 AND x2 truth table plotted in the (x1, x2) plane; again the y = 0 points can be separated from the y = 1 point by a single straight line]


Functions with Non-Linear Boundaries

} There are some functions that cannot be expressed using a single layer of linear weighted inputs and a non-linear output

} Again, this is analogous to the inability to linearly separate data in some cases


x1 XOR x2:
x1 x2 | y
 0  0 | 0
 0  1 | 1
 1  0 | 1
 1  1 | 0

[Figure: the four points of the x1 XOR x2 truth table plotted in the (x1, x2) plane; no single straight line separates the y = 0 points from the y = 1 points]


MLPs for Non-Linear Boundaries

} Neural networks gain expressive power because they can have more than one layer

} A multi-layer perceptron has one or more hidden layers between input and output

} Each hidden node applies a non-linear activation function, producing output that it sends along to the next layer

} In such cases, much more complex functions are possible, corresponding to non-linear decision boundaries (as in the current homework assignment)


[Figure: a two-layer network for XOR — inputs 1 (bias), x1, and x2 feed hidden units h1 and h2, which in turn feed the output y; its decision boundary in the (x1, x2) plane is non-linear]
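A minimal sketch of such a two-layer threshold network computing XOR; the weights here are hand-picked for illustration (they are not given on the slide), using h1 = OR, h2 = AND, and y = (h1 AND NOT h2):

```python
def threshold(s):
    return 1 if s >= 0 else 0

def xor_mlp(x1, x2):
    """Two hidden threshold units feed one output threshold unit."""
    h1 = threshold(-0.5 + x1 + x2)    # OR of the inputs
    h2 = threshold(-1.5 + x1 + x2)    # AND of the inputs
    return threshold(-0.5 + h1 - h2)  # fires when OR is on but AND is off

[xor_mlp(x1, x2) for x1 in (0, 1) for x2 in (0, 1)]   # [0, 1, 1, 0]
```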


Review: Properties of the Sigmoid Function

} The Sigmoid takes its name from the shape of its plot

} Its output always has a value in the range 0 ≤ g(inj) ≤ 1

} The function is everywhere differentiable, and has a derivative that is easy to calculate, which turns out to be useful for learning:


$$g(\mathrm{in}_j) = \frac{1}{1 + e^{-\mathrm{in}_j}}$$

[Figure: plot of the sigmoid, rising smoothly from 0 toward 1, with value 0.5 at inj = 0]

$$g'(\mathrm{in}_j) = g(\mathrm{in}_j)\,\bigl(1 - g(\mathrm{in}_j)\bigr)$$
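A quick numerical sanity check of this identity in Python (NumPy assumed; the test point and step size are arbitrary):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

x = 0.3
analytic = sigmoid(x) * (1.0 - sigmoid(x))              # g(x)(1 - g(x))
h = 1e-6
numeric = (sigmoid(x + h) - sigmoid(x - h)) / (2 * h)   # central difference
print(analytic, numeric)   # the two values agree up to tiny floating-point error
```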



Do We Always Use the Logistic Sigmoid?

} While historically popular, the logistic function is not always used in modern neural network research

} There are many other functions that can be, and are, used

} Some models even use combinations of different functions on different layers of the network

} Often, the logistic is used at the final layer only, where it is sometimes called a softmax (probability) function

} In our presentation, we will assume the logistic, but the overall details of the key algorithm do not change if we use something else

} In general, we want a function that is:

1. Non-linear: allowing for more complex outputs

2. Differentiable: standard back-propagation algorithms for learning in the networks use gradient-based approaches, and require access to the derivative of the function


Other Popular Activation Functions


} The rectifier (or “ramp”) function is popular for many modern applications

} A unit using the rectifier is known as a rectified linear unit (ReLU)

} The Softplus function is a smooth approximation to the rectifier

[Figure: plots of the ReLU and Softplus functions; Softplus smoothly approximates the ramp shape of ReLU]

$$\mathrm{ReLU}(x) = \max(0, x)$$

$$\mathrm{Softplus}(x) = \ln(1 + e^x)$$
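Both are one-liners in Python (NumPy assumed):

```python
import numpy as np

def relu(x):
    """Rectifier: max(0, x), applied elementwise."""
    return np.maximum(0.0, x)

def softplus(x):
    """Smooth approximation to the rectifier: ln(1 + e^x)."""
    return np.log1p(np.exp(x))

xs = np.array([-2.0, 0.0, 2.0])
relu(xs)       # [0.   , 0.   , 2.   ]
softplus(xs)   # [~0.127, ~0.693, ~2.127]
```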


Other Popular Activation Functions


} The ReLU function has the derivative shown below

} For many purposes, the undefined value of the derivative at x = 0 is simply set arbitrarily (say, to 0.5)


$$\frac{\delta\,\mathrm{ReLU}}{\delta x}(x) = \begin{cases} 0 & \text{if } x < 0 \\ 1 & \text{if } x > 0 \\ \text{undefined} & \text{if } x = 0 \end{cases}$$

} Alternatively, if using the Softplus approximation, we have a well-defined derivative everywhere:

$$\frac{\delta\,\mathrm{Softplus}}{\delta x}(x) = \frac{1}{1 + e^{-x}}$$

The derivative of Softplus is the Sigmoid Logistic!
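A sketch of both derivatives in Python (NumPy assumed; setting the undefined point to 0.5 follows the arbitrary convention mentioned above):

```python
import numpy as np

def relu_grad(x, at_zero=0.5):
    """Derivative of the rectifier: 0 for x < 0, 1 for x > 0; the undefined
    value at x = 0 is filled in arbitrarily (here 0.5)."""
    return np.where(x < 0, 0.0, np.where(x > 0, 1.0, at_zero))

def softplus_grad(x):
    """Derivative of Softplus: the logistic sigmoid, defined everywhere."""
    return 1.0 / (1.0 + np.exp(-x))
```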


Activation Functions Everywhere!

Function             f(x)                             f'(x)
Logistic             1 / (1 + e^{-x})                 f(x)(1 - f(x))
ReLU                 max(0, x)                        0 if x < 0; 1 if x > 0; undefined at x = 0
Softplus             ln(1 + e^x)                      1 / (1 + e^{-x})
Hyperbolic tangent   (1 - e^{-2x}) / (1 + e^{-2x})    1 - f(x)^2
Gaussian             e^{-x^2 / 2}                     -x f(x)
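For reference, the same table can be expressed in Python (NumPy assumed; the dictionary layout is just one way to organize it, and the ReLU derivative at 0 is arbitrarily set to 0 here):

```python
import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

# name -> (activation f, derivative f'), matching the table above
ACTIVATIONS = {
    "logistic": (sigmoid,                       lambda x: sigmoid(x) * (1 - sigmoid(x))),
    "relu":     (lambda x: np.maximum(0, x),    lambda x: np.where(x > 0, 1.0, 0.0)),
    "softplus": (lambda x: np.log1p(np.exp(x)), sigmoid),
    "tanh":     (np.tanh,                       lambda x: 1 - np.tanh(x) ** 2),
    "gaussian": (lambda x: np.exp(-x**2 / 2),   lambda x: -x * np.exp(-x**2 / 2)),
}
```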



Choosing Activation Functions


} Functions have different pros and cons:

1. Sigmoid: historically popular, less so today
   } Susceptible to saturation: very large weights, tiny gradients
   } Not zero-centered, which is sometimes inconvenient
   } More popular as an output probability function (softmax)

2. Hyperbolic tangent
   } Can saturate like the sigmoid, but is zero-centered

3. ReLU/Softplus: most popular function in modern uses
   } ReLU is susceptible to “dying” neurons (these do not contribute to output in any real way)
   } Sensitive to learning rate
   } Softplus sometimes preferred, due to its smoothness


This Week & Next

} Topics: Neural Networks

} Homework 04 (last one!)
   } Out Monday, 30 March
   } Due Monday, 13 April, 5:00 PM Eastern

} Office Hours:
   } Virtual office hours with Zoom links can be found on the class Piazza and Canvas sites
