Introduction to Machine Learning: Multilayer Perceptron. Barnabás Póczos (PowerPoint PPT presentation)


SLIDE 1

Introduction to Machine Learning

Multilayer Perceptron

Barnabás Póczos

SLIDE 2

The Multilayer Perceptron

SLIDE 3

Multilayer Perceptron
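As a minimal sketch of what a multilayer perceptron computes, here is a layer-by-layer forward pass in plain Python. The 2-3-1 architecture and all weights below are made-up illustrations, not values from the slides:

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def mlp_forward(x, layers):
    """layers: list of (weight_matrix, bias_vector) pairs, one per layer."""
    h = x
    for W, b in layers:
        # each unit: sigmoid of the weighted sum of the previous layer plus bias
        h = [sigmoid(sum(w_ij * h_i for w_ij, h_i in zip(row, h)) + b_j)
             for row, b_j in zip(W, b)]
    return h

# A 2-3-1 network with arbitrary example weights:
layers = [
    ([[0.5, -0.5], [1.0, 1.0], [-1.0, 0.5]], [0.0, 0.1, -0.1]),  # hidden layer
    ([[1.0, -1.0, 0.5]], [0.2]),                                 # output layer
]
y = mlp_forward([1.0, 0.0], layers)
```

Each layer is just an affine map followed by an elementwise nonlinearity; stacking layers is what gives the MLP its representational power.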

SLIDE 4

ALVINN: AN AUTONOMOUS LAND VEHICLE IN A NEURAL NETWORK

Dean A. Pomerleau, Carnegie Mellon University, 1989

Training: using a simulated road generator

SLIDE 5

Gradient Descent

We want to solve:

SLIDE 6

Starting Point

SLIDE 7

Starting Point

SLIDE 8

Fixed step size can be too big

SLIDE 9

Fixed step size can be too small
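The step-size behavior on these slides can be reproduced with a minimal sketch. The quadratic objective f(w) = (w - 3)^2 and the particular step sizes below are made-up illustrations, not from the slides:

```python
# Gradient descent on f(w) = (w - 3)^2, whose gradient is f'(w) = 2*(w - 3).
def gradient_descent(step_size, w0=0.0, iters=50):
    w = w0
    for _ in range(iters):
        grad = 2.0 * (w - 3.0)
        w = w - step_size * grad
    return w

w_good  = gradient_descent(0.1)    # converges close to the minimum at w = 3
w_small = gradient_descent(0.001)  # too small: barely moves in 50 steps
w_big   = gradient_descent(1.1)    # too big: overshoots and diverges
```

Each update multiplies the error (w - 3) by the factor (1 - 2*step_size), so the iteration converges exactly when that factor has magnitude below 1.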

SLIDE 10

SLIDE 11

SLIDE 12

Character Recognition with MLP

Matlab: appcr1

SLIDE 13

The network

Noise-free input: 26 different letters of size 7x5

SLIDE 14

Noisy inputs

SLIDE 15

Matlab MLP Training

% Create MLP
hiddenlayers = [10, 25];
net1 = feedforwardnet(hiddenlayers);
net1 = configure(net1, X, T);
% View
view(net1);
% Train
net1 = train(net1, X, T);
% Test
Y1 = net1(Xtest);

SLIDE 16

Prediction errors

▪ Network 1 was trained on clean images.
▪ Network 2 was trained on noisy images: 30 noisy copies of each letter are created.

SLIDE 17

The Backpropagation Algorithm

SLIDE 18

Multilayer Perceptron

SLIDE 19

The gradient of the error

SLIDE 20

Notation

SLIDE 21

Some observations

SLIDE 22

The backpropagated error

SLIDE 23

The backpropagated error

Lemma

SLIDE 24

The backpropagated error

Therefore,

SLIDE 25

The backpropagation algorithm
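A minimal sketch of the algorithm, assuming one hidden layer, sigmoid units, and squared error E = 0.5*(y - t)^2. This particular setup and notation are assumptions for illustration, not the slides' exact formulation:

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def forward(x, W1, b1, W2, b2):
    # hidden activations, then scalar output
    h = [sigmoid(sum(W1[j][i] * x[i] for i in range(len(x))) + b1[j])
         for j in range(len(b1))]
    y = sigmoid(sum(W2[j] * h[j] for j in range(len(h))) + b2)
    return h, y

def backprop(x, t, W1, b1, W2, b2):
    """Return gradients of E = 0.5*(y - t)^2 w.r.t. all parameters."""
    h, y = forward(x, W1, b1, W2, b2)
    # output-layer error term, using sigmoid'(a) = y*(1 - y)
    delta_out = (y - t) * y * (1.0 - y)
    gW2 = [delta_out * h[j] for j in range(len(h))]
    gb2 = delta_out
    # backpropagated error for each hidden unit
    delta_h = [delta_out * W2[j] * h[j] * (1.0 - h[j]) for j in range(len(h))]
    gW1 = [[delta_h[j] * x[i] for i in range(len(x))] for j in range(len(h))]
    gb1 = delta_h
    return gW1, gb1, gW2, gb2
```

The key point of the lemma above is visible in `delta_h`: each hidden unit's error is the output error weighted by the connecting weight and scaled by the unit's own activation derivative.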

SLIDE 26

SLIDE 27

SLIDE 28

SLIDE 29

SLIDE 30

SLIDE 31

SLIDE 32

SLIDE 33

What functions can multilayer perceptrons represent?

SLIDE 34

What functions can multilayer perceptrons represent?

Perceptrons cannot represent the XOR function (shown here in its negated form, XNOR):

f(0,0)=1, f(1,1)=1, f(0,1)=0, f(1,0)=0
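A one-hidden-layer network with threshold units does represent XOR. Here is a minimal sketch using the textbook OR/NAND/AND construction; the weights are chosen by hand for illustration, not taken from the slides:

```python
def step(a):
    # threshold (Heaviside) unit
    return 1 if a >= 0 else 0

def perceptron_xor(x1, x2):
    # Hidden layer: an OR unit and a NAND unit
    h_or = step(x1 + x2 - 0.5)
    h_nand = step(-x1 - x2 + 1.5)
    # Output: AND of the two hidden units gives XOR
    return step(h_or + h_nand - 1.5)
```

No single threshold unit can separate these four points, but two layers can; `1 - perceptron_xor(x1, x2)` gives the negated (XNOR) variant.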

SLIDE 35

Hilbert’s 13th Problem

1902: Hilbert’s list of the 23 “most important” problems in mathematics.

The 13th problem: “Solve the 7th-degree equation using continuous functions of two parameters.” Conjecture: it can’t be solved.

Related conjecture: Let f be a function of 3 arguments (e.g., the root of the 7th-degree equation as a function of its three coefficients). Prove that f cannot be rewritten as a composition of finitely many functions of two arguments. Another rewritten form: prove that there is a nonlinear continuous function of three variables that cannot be decomposed into finitely many functions of two variables.

SLIDE 36

Function decompositions

f(x,y,z) = Φ1(ψ1(x), ψ2(y)) + Φ2(c1·ψ3(y) + c2·ψ4(z), x)

[Network diagram: inputs x, y, z feed the units ψ1…ψ4; their outputs (with weights c1, c2 and a Σ node) feed Φ1 and Φ2, and a final Σ node outputs f(x,y,z).]

SLIDE 37

1957: Arnold disproves Hilbert’s conjecture (the Kolmogorov-Arnold representation theorem).

Function decompositions

SLIDE 38

Function decompositions

Corollary:

Issues: This statement is not constructive.

SLIDE 39

Universal Approximators

Kurt Hornik, Maxwell Stinchcombe and Halbert White: “Multilayer feedforward networks are universal approximators”, Neural Networks, Vol. 2(3), 359-366, 1989

Definition: ΣN(g): neural networks with 1 hidden layer and activation function g.

Theorem: these networks are universal approximators: they can approximate any continuous function on a compact set arbitrarily well.

SLIDE 40

Universal Approximators

Theorem (Blum & Li, 1991)

Definition:

Formal statement:

SLIDE 41

Proof

GOAL: approximate f by a finite sum of simple indicator (step) functions.

Integral approximation in 1-dim: partition the domain at points xi and approximate f by a step function that is constant on each interval.

Integral approximation in 2-dim: analogously, partition the domain into polygons Xi.

SLIDE 42

Proof

The indicator function of the polygon Xi can be learned by this neural network: output 1 if x is in Xi, and 0 otherwise.

GOAL: the weighted linear combination of these indicator functions will be a good approximation of the original function f.
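This construction can be sketched in one dimension with threshold units: two units detect the interval ends, an AND unit forms the interval's indicator, and a weighted sum of such indicators approximates f. The interval width, target function, and evaluation point below are made-up illustrations:

```python
def step(a):
    # threshold (Heaviside) unit
    return 1.0 if a >= 0 else 0.0

def interval_indicator(x, lo, hi):
    # Two threshold units detect x >= lo and x <= hi; an AND unit combines them,
    # giving the indicator function of the interval [lo, hi].
    return step(step(x - lo) + step(hi - x) - 1.5)

def approximate(f, x, lo=0.0, hi=1.0, n=100):
    # One-hidden-layer step network: a weighted sum of n interval indicators,
    # each weighted by f evaluated at the interval's left endpoint.
    width = (hi - lo) / n
    total = 0.0
    for i in range(n):
        a = lo + i * width
        total += f(a) * interval_indicator(x, a, a + width)
    return total
```

Shrinking the intervals (larger n) tightens the approximation, which is exactly the Riemann-sum idea behind the proof sketch.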

SLIDE 43

Proof

The resulting linear system of equations (for the combination weights) can also be solved.

slide-44
SLIDE 44

44

Thanks for your attention!