SLIDE 1

Deep Learning Basics Lecture 1: Feedforward

Princeton University COS 495 Instructor: Yingyu Liang

SLIDE 2

Motivation I: representation learning

SLIDE 3

Machine learning 1-2-3

  • Collect data and extract features
  • Build model: choose hypothesis class 𝓗 and loss function β„“
  • Optimization: minimize the empirical loss
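
A minimal sketch of this 1-2-3 recipe, assuming a linear hypothesis class and squared loss; the toy data, step size, and plain gradient-descent loop below are illustrative assumptions, not part of the slides:

    import numpy as np

    # 1. Collect data and extract features (here: random toy data, identity features)
    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 3))                   # 100 examples, 3 features phi(x)
    y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=100)

    # 2. Build model: hypothesis class = linear predictors w^T phi(x), loss = squared error
    def empirical_loss(w):
        return np.mean((X @ w - y) ** 2)

    # 3. Optimization: minimize the empirical loss by plain gradient descent
    w = np.zeros(3)
    for _ in range(500):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w -= 0.1 * grad

    print(empirical_loss(w))                        # should end up near the noise level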
SLIDE 4

Features

[Pipeline: input image x β†’ extract features Ο†(x), e.g., a color histogram over the red, green, and blue channels β†’ build hypothesis y = w^T Ο†(x)]

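A sketch of this fixed-feature pipeline, assuming the image is an HΓ—WΓ—3 RGB array; the color_histogram helper, the bin count, and the random weights are illustrative assumptions:

    import numpy as np

    def color_histogram(image, bins=8):
        """phi(x): concatenate a histogram of each of the R, G, B channels."""
        hists = [np.histogram(image[..., c], bins=bins, range=(0, 256), density=True)[0]
                 for c in range(3)]
        return np.concatenate(hists)                 # feature vector of length 3 * bins

    # Build hypothesis y = w^T phi(x) on top of the fixed features
    image = np.random.randint(0, 256, size=(32, 32, 3))   # toy "image"
    phi = color_histogram(image)
    w = np.random.randn(phi.size)                          # weights to be learned
    y = w @ phi
    print(y)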
SLIDE 5

Features: part of the model

𝑧 = π‘₯π‘ˆπœš 𝑦

build hypothesis

Linear model Nonlinear model

SLIDE 6

Example: Polynomial kernel SVM

y = sign(w^T Ο†(x) + b), with a fixed feature map Ο†(x)

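A short sketch using scikit-learn's SVC with a polynomial kernel, which corresponds to a fixed (implicit) polynomial feature map Ο†(x); the toy data, degree, and coef0 below are illustrative assumptions:

    import numpy as np
    from sklearn.svm import SVC

    # Toy binary classification data with a nonlinear decision boundary
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 2))
    y = np.sign(X[:, 0] ** 2 + X[:, 1] ** 2 - 1)

    # Polynomial kernel SVM: phi(x) is fixed, only w and b are learned
    clf = SVC(kernel="poly", degree=2, coef0=1.0)
    clf.fit(X, y)
    print(clf.score(X, y))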
SLIDE 7

Motivation: representation learning

  • Why don’t we also learn Ο†(x)?

[Diagram: y = w^T Ο†(x), where both the feature map Ο†(x) and the weights w are learned]

SLIDE 8

Feedforward networks

  • View each dimension of Ο†(x) as something to be learned

[Diagram: input x β†’ learned features Ο†(x) β†’ output y = w^T Ο†(x)]

SLIDE 9

Feedforward networks

  • Linear functions Ο†_i(x) = ΞΈ_i^T x don’t work: need some nonlinearity (see the check below)

[Diagram: input x β†’ features Ο†(x) β†’ output y = w^T Ο†(x)]

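A quick numerical check of why purely linear features add nothing: if Ο†(x) = Θx for some matrix Θ, then w^T Ο†(x) = (Θ^T w)^T x is still just a linear function of x. The matrices below are arbitrary illustrative values:

    import numpy as np

    rng = np.random.default_rng(0)
    Theta = rng.normal(size=(4, 3))        # linear "features" phi(x) = Theta @ x
    w = rng.normal(size=4)
    x = rng.normal(size=3)

    y_two_step = w @ (Theta @ x)           # w^T phi(x) with linear phi
    y_one_step = (Theta.T @ w) @ x         # the same model collapsed to one linear map
    print(np.allclose(y_two_step, y_one_step))   # True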
SLIDE 10

Feedforward networks

  • Typically, set Ο†_i(x) = r(ΞΈ_i^T x) where r(Β·) is some nonlinear function (see the sketch below)

[Diagram: input x β†’ features Ο†(x) β†’ output y = w^T Ο†(x)]

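A minimal sketch of this one-hidden-layer network in NumPy, assuming tanh as the nonlinearity r(Β·); all sizes and the random weights are illustrative:

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=5)                 # input
    Theta = rng.normal(size=(10, 5))       # rows are theta_i, one per learned feature
    w = rng.normal(size=10)

    phi = np.tanh(Theta @ x)               # phi_i(x) = r(theta_i^T x), here r = tanh
    y = w @ phi                            # y = w^T phi(x)
    print(y)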
SLIDE 11

Feedforward deep networks

  • What if we go deeper? (see the sketch below)

[Diagram: input x β†’ hidden layers h^1, h^2, …, h^L β†’ output y]

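Going deeper repeats the same pattern: each layer applies a nonlinearity to a linear map of the previous layer. A sketch, with the layer widths, tanh, and random weights all chosen purely for illustration:

    import numpy as np

    rng = np.random.default_rng(0)
    widths = [5, 8, 8, 8, 1]                # input, h^1, h^2, h^3, output
    Ws = [rng.normal(size=(m, n)) for n, m in zip(widths[:-1], widths[1:])]

    h = rng.normal(size=widths[0])          # input x
    for W in Ws[:-1]:
        h = np.tanh(W @ h)                  # hidden layers: h^(k) = r(W^(k) h^(k-1))
    y = Ws[-1] @ h                          # linear output layer
    print(y)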
SLIDE 12

Figure from Deep learning, by Goodfellow, Bengio, Courville. Dark boxes are things to be learned.

SLIDE 13

Motivation II: neurons

SLIDE 14

Motivation: neurons

Figure from Wikipedia

SLIDE 15

Motivation: abstract neuron model

  • Neuron activated when the correlation between the input and a pattern w exceeds some threshold b
  • y = threshold(w^T x βˆ’ b)
  • Or y = r(w^T x βˆ’ b)
  • r(Β·) is called the activation function (see the sketch below)

[Diagram: inputs x_1, x_2, …, x_d β†’ output y]

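A tiny sketch of this abstract neuron; the pattern w, threshold b, input, and the choice of tanh as the smooth activation are all illustrative:

    import numpy as np

    w = np.array([0.5, -1.0, 2.0])     # pattern the neuron responds to
    b = 0.3                            # activation threshold
    x = np.array([1.0, 0.2, 0.4])      # input

    y_hard = float(w @ x - b >= 0)     # y = threshold(w^T x - b)
    y_soft = np.tanh(w @ x - b)        # y = r(w^T x - b) with a smooth activation
    print(y_hard, y_soft)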
SLIDE 16

Motivation: artificial neural networks

SLIDE 17

Motivation: artificial neural networks

  • Put into layers: feedforward deep networks

[Diagram: input x β†’ hidden layers h^1, h^2, …, h^L β†’ output y]

SLIDE 18

Components in Feedforward networks

SLIDE 19

Components

  • Representations:
    • Input
    • Hidden variables
  • Layers/weights:
    • Hidden layers
    • Output layer
SLIDE 20

Components

[Diagram: input x β†’ first layer β†’ hidden variables h^1, h^2, …, h^L β†’ output layer β†’ y]

SLIDE 21

Input

  • Represented as a vector
  • Sometimes requires some preprocessing (see the sketch below), e.g.,
    • Subtract mean
    • Normalize to [-1,1]


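A sketch of these two preprocessing steps on a toy data matrix; the shapes and the per-feature min-max scheme for mapping to [-1,1] are illustrative assumptions:

    import numpy as np

    X = np.random.default_rng(0).normal(loc=5.0, scale=2.0, size=(100, 3))

    # Subtract mean (per feature)
    X_centered = X - X.mean(axis=0)

    # Normalize to [-1, 1] (per feature, via min-max scaling)
    lo, hi = X.min(axis=0), X.max(axis=0)
    X_scaled = 2 * (X - lo) / (hi - lo) - 1
    print(X_scaled.min(axis=0), X_scaled.max(axis=0))   # ~[-1, -1, -1] and [1, 1, 1]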
SLIDE 22

Output layers

  • Regression: y = w^T h + b
  • Linear units: no nonlinearity

[Diagram: last hidden layer h β†’ output layer β†’ y]

SLIDE 23

Output layers

  • Multi-dimensional regression: y = W^T h + b
  • Linear units: no nonlinearity

[Diagram: last hidden layer h β†’ output layer β†’ y]

SLIDE 24

Output layers

  • Binary classification: y = Οƒ(w^T h + b)
  • Corresponds to using logistic regression on h

[Diagram: last hidden layer h β†’ output layer β†’ y]

SLIDE 25

Output layers

  • Multi-class classification: y = softmax(z) where z = W^T h + b
  • Corresponds to using multi-class logistic regression on h (see the sketch below)

[Diagram: last hidden layer h β†’ output layer z β†’ y]

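A sketch of the four output-layer choices from this and the previous three slides; the random weights and the last hidden layer h are chosen purely for illustration:

    import numpy as np

    rng = np.random.default_rng(0)
    h = rng.normal(size=6)                        # last hidden layer

    # Regression: y = w^T h + b (linear unit)
    w, b = rng.normal(size=6), 0.1
    y_reg = w @ h + b

    # Multi-dimensional regression: y = W^T h + b (linear units)
    W, b_vec = rng.normal(size=(6, 3)), rng.normal(size=3)
    y_multireg = W.T @ h + b_vec

    # Binary classification: y = sigmoid(w^T h + b)
    y_binary = 1.0 / (1.0 + np.exp(-(w @ h + b)))

    # Multi-class classification: y = softmax(z), z = W^T h + b
    z = W.T @ h + b_vec
    y_multiclass = np.exp(z - z.max()) / np.exp(z - z.max()).sum()
    print(y_reg, y_multireg, y_binary, y_multiclass)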
SLIDE 26

Hidden layers

  • Each neuron takes a weighted linear combination of the previous layer
  • So it can be thought of as outputting one value for the next layer

[Diagram: layer h^j β†’ layer h^(j+1)]

SLIDE 27

Hidden layers

  • y = r(w^T x + b)
  • Typical activation functions r (see the sketch below)
    • Threshold: t(z) = 𝕀[z β‰₯ 0]
    • Sigmoid: Οƒ(z) = 1/(1 + exp(βˆ’z))
    • Tanh: tanh(z) = 2Οƒ(2z) βˆ’ 1

[Diagram: input x β†’ r(Β·) β†’ output y]

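A sketch of these three activation functions in NumPy; the test values are arbitrary:

    import numpy as np

    def threshold(z):
        return (z >= 0).astype(float)          # t(z) = 1[z >= 0]

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))        # sigma(z)

    def tanh(z):
        return 2 * sigmoid(2 * z) - 1          # same as np.tanh(z)

    z = np.array([-2.0, 0.0, 3.0])
    print(threshold(z), sigmoid(z), tanh(z))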
SLIDE 28

Hidden layers

  • Problem: saturation (see the numeric check below)

[Plot: a sigmoid-like activation r(Β·) flattens out for large |z|; in these saturated regions the gradient is too small. Figure borrowed from Pattern Recognition and Machine Learning, Bishop]

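A quick numeric check of the saturation problem, using the sigmoid from the previous slide: its derivative Οƒ(z)(1 βˆ’ Οƒ(z)) is nearly zero for inputs far from 0 (the sample values are arbitrary):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    z = np.array([0.0, 2.0, 5.0, 10.0])
    grad = sigmoid(z) * (1 - sigmoid(z))   # derivative of the sigmoid
    print(grad)   # ~[0.25, 0.105, 0.0066, 0.000045]: the gradient vanishes in saturation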
SLIDE 29

Hidden layers

  • Activation function ReLU (rectified linear unit)
  • ReLU(z) = max{z, 0}

Figure from Deep learning, by Goodfellow, Bengio, Courville.

SLIDE 30

Hidden layers

  • Activation function ReLU (rectified linear unit)
  • ReLU(z) = max{z, 0} (see the sketch below)

[Plot: ReLU(z); gradient 0 for z < 0, gradient 1 for z > 0]

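A sketch of ReLU and its (sub)gradient, matching the two regions in the plot above; the sample inputs are arbitrary:

    import numpy as np

    def relu(z):
        return np.maximum(z, 0.0)              # ReLU(z) = max{z, 0}

    def relu_grad(z):
        return (z > 0).astype(float)           # gradient: 0 for z < 0, 1 for z > 0

    z = np.array([-3.0, -0.5, 0.5, 3.0])
    print(relu(z), relu_grad(z))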
SLIDE 31

Hidden layers

  • Generalizations of ReLU: gReLU(z) = max{z, 0} + Ξ± min{z, 0} (see the sketch below)
    • Leaky-ReLU(z) = max{z, 0} + 0.01 min{z, 0}
    • Parametric-ReLU(z): Ξ± learnable

[Plot: gReLU(z) vs. z]
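
A sketch of the generalized ReLU family: Ξ± = 0.01 gives Leaky-ReLU, and making Ξ± a learned parameter gives Parametric-ReLU (the sample inputs and the third Ξ± value are arbitrary):

    import numpy as np

    def g_relu(z, alpha):
        return np.maximum(z, 0.0) + alpha * np.minimum(z, 0.0)

    z = np.array([-3.0, -0.5, 0.5, 3.0])
    print(g_relu(z, alpha=0.0))     # plain ReLU
    print(g_relu(z, alpha=0.01))    # Leaky-ReLU
    print(g_relu(z, alpha=0.2))     # Parametric-ReLU with a (hypothetical) learned alpha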