Non-Local Manifold Parzen Windows Yoshua Bengio, Hugo Larochelle and - PowerPoint PPT Presentation

Non-Local Manifold Parzen Windows Non-Local Manifold Parzen Windows Yoshua Bengio, Hugo Larochelle and Pascal Vincent D´ epartement d’informatique et de recherche op´ erationnelle Universit´ e de Montr´ eal July 15 th , 2005

Non-Local Manifold Parzen Windows Plan Introduction 1 Local vs Non-Local learning 2 Experiments and Results 3 Conclusion 4

Non-Local Manifold Parzen Windows Introduction Plan Introduction 1 Local vs Non-Local learning 2 Experiments and Results 3 Conclusion 4

Non-Local Manifold Parzen Windows Introduction About this talk... What : density estimation of high dimensional continuous data, lying on a lower dimensional manifold

Non-Local Manifold Parzen Windows Introduction About this talk... What : density estimation of high dimensional continuous data, lying on a lower dimensional manifold How : using the Manifold Parzen Windows model learning the model’s parameters with a neural network

Non-Local Manifold Parzen Windows Introduction About this talk... What : density estimation of high dimensional continuous data, lying on a lower dimensional manifold How : using the Manifold Parzen Windows model learning the model’s parameters with a neural network Why :

Non-Local Manifold Parzen Windows Introduction About this talk... What : density estimation of high dimensional continuous data, lying on a lower dimensional manifold How : using the Manifold Parzen Windows model learning the model’s parameters with a neural network Why : ... because my supervisor wants me to work on that

Non-Local Manifold Parzen Windows Introduction About this talk... What : density estimation of high dimensional continuous data, lying on a lower dimensional manifold How : using the Manifold Parzen Windows model learning the model’s parameters with a neural network Why : ... because my supervisor wants me to work on that ... to publish papers

Non-Local Manifold Parzen Windows Introduction About this talk... What : density estimation of high dimensional continuous data, lying on a lower dimensional manifold How : using the Manifold Parzen Windows model learning the model’s parameters with a neural network Why : ... because my supervisor wants me to work on that ... to publish papers but mostly to use and make a point about non-local learning

Non-Local Manifold Parzen Windows Introduction Manifold Parzen Windows (Vincent and Bengio, 2003) Extension of the Parzen Windows model (mixture of spherical Gaussians, centered on the training points) The Gaussians are parametrized so that most of the density is situated on the underlying manifold F IG .: Parzen Windows vs Manifold Parzen Windows

Non-Local Manifold Parzen Windows Introduction Manifold Parzen Windows (Vincent and Bengio, 2003) Density estimator : n p ( x ) = 1 � N ( x ; µ ( x t ) , Σ( x t )) n t = 1

Non-Local Manifold Parzen Windows Introduction Manifold Parzen Windows (Vincent and Bengio, 2003) Density estimator : n p ( x ) = 1 � N ( x ; µ ( x t ) , Σ( x t )) n t = 1 Parametrization : d Σ( x t ) = σ 2 � noise ( x t ) I + s j ( x t ) v j ( x t ) v j ( x t ) ′ j = 1

Non-Local Manifold Parzen Windows Introduction Manifold Parzen Windows (Vincent and Bengio, 2003) Density estimator : n p ( x ) = 1 � N ( x ; µ ( x t ) , Σ( x t )) n t = 1 Parametrization : d Σ( x t ) = σ 2 � noise ( x t ) I + s j ( x t ) v j ( x t ) v j ( x t ) ′ j = 1 Training : µ ( x t ) = x t is fixed for each x t , use principal eigenvalues ( s j ( x t ) ) and eigenvectors ( v j ( x t ) ) of k nearest neighbors covariance matrix σ noise ( x t ) is an hyper-parameter

Non-Local Manifold Parzen Windows Introduction Non-Local Manifold Parzen Windows In Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are stored in memory for every training point x t

Non-Local Manifold Parzen Windows Introduction Non-Local Manifold Parzen Windows In Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are stored in memory for every training point x t In Non-Local Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are functions of x t , modeled by a neural network

Non-Local Manifold Parzen Windows Introduction Non-Local Manifold Parzen Windows In Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are stored in memory for every training point x t In Non-Local Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are functions of x t , modeled by a neural network The neural network can capture global information about the underlying manifold, and share it among all training points

Non-Local Manifold Parzen Windows Introduction Non-Local Manifold Parzen Windows In Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are stored in memory for every training point x t In Non-Local Manifold Parzen Windows, µ ( x t ) , σ noise ( x t ) , s j ( x t ) and v j ( x t ) are functions of x t , modeled by a neural network The neural network can capture global information about the underlying manifold, and share it among all training points The neural network is trained using stochastic gradient descent on the average negative log-likelihood of the training set

Non-Local Manifold Parzen Windows Local vs Non-Local learning Plan Introduction 1 Local vs Non-Local learning 2 Experiments and Results 3 Conclusion 4

Non-Local Manifold Parzen Windows Local vs Non-Local learning Informal definitions What is local learning : A learning algorithm is said to be local if it uses mostly nearby points of x to make a prediction at x Examples : k nearest neighbors, SVM, most popular dimensionality reduction algorithms, Manifold Parzen Windows

Non-Local Manifold Parzen Windows Local vs Non-Local learning Informal definitions What is local learning : A learning algorithm is said to be local if it uses mostly nearby points of x to make a prediction at x Examples : k nearest neighbors, SVM, most popular dimensionality reduction algorithms, Manifold Parzen Windows What is non-local learning : A learning algorithm is said to be non-local if it is able to use information from training points far from x to generalize at x

Non-Local Manifold Parzen Windows Local vs Non-Local learning Toy example We are trying to learn a density using this training set : 0.6 0.4 0.2 0 −0.2 −0.4 −0.4 −0.2 0 0.2 0.4 0.6 F IG .: Samples from a spiral distribution

Non-Local Manifold Parzen Windows Local vs Non-Local learning Toy example We are trying to learn a density using this training set : 0.6 0.4 0.2 0 −0.2 −0.4 −0.4 −0.2 0 0.2 0.4 0.6 F IG .: Samples from a spiral distribution Let’s train a Manifold Parzen Windows model, and look at the first principal direction of variance of the training point gaussians

Non-Local Manifold Parzen Windows Local vs Non-Local learning Toy example Because the training of Manifold Parzen Windows uses only local information, some of the principal directions of variance of badly estimated. F IG .: On the left : training points. On the right : first principal direction of variance.

Non-Local Manifold Parzen Windows Local vs Non-Local learning Real life examples A lot of large scale, real life problems are likely to befenit from non-local learning : Vision : the pixels at a certain position from very different images share the same properties with respect to certain transformations (e.g. translation, rotation) ; Natural Language Processing : words that are very different in some aspect still usually share a lot of properties (e.g. two nouns, even if they have very different meanings, will still obey to the same grammatical rules)

Non-Local Manifold Parzen Windows Experiments and Results Plan Introduction 1 Local vs Non-Local learning 2 Experiments and Results 3 Conclusion 4

Non-Local Manifold Parzen Windows Experiments and Results Toy 2D data experiments Sinusoidal distribution Spiral distribution 1.5 0.6 1 0.4 0.5 0.2 0 0 −0.5 −0.2 −0.4 −1 −1.5 −5 −4 −3 −2 −1 0 1 2 3 4 5 −0.4 −0.2 0 0.2 0.4 0.6 1.5 0.6 1 0.4 0.5 0.2 0 0 −0.5 −0.2 −1 −0.4 −1.5 −5 −4 −3 −2 −1 0 1 2 3 4 5 −0.4 −0.2 0 0.2 0.4 0.6

Non-Local Manifold Parzen Windows Experiments and Results Toy 2D data experiments Results : Algorithm sinus spiral Non-Local MP 1.144 -1.346 Manifold Parzen 1.345 -0.914 Gauss Mix Full 1.567 -0.857 Parzen Windows 1.841 -0.487 T AB .: Average out-of-sample negative log-likelihood on two toy problems, for Non-Local Manifold Parzen, a Gaussian mixture with full covariance, Manifold Parzen and Parzen Windows. The non-local algorithm dominates all the others.

Non-Local Manifold Parzen Windows Experiments and Results Toy 2D data experiments F IG .: From left to right, top to bottom, densities learned by Non-Local Manifold Parzen, a Gaussian mixture with full covariance, Manifold Parzen and Parzen Windows.

Non-Local Manifold Parzen Windows Yoshua Bengio, Hugo Larochelle and - PowerPoint PPT Presentation

Non-Local Manifold Parzen Windows Non-Local Manifold Parzen Windows Yoshua Bengio, Hugo Larochelle and Pascal Vincent D epartement dinformatique et de recherche op erationnelle Universit e de Montr eal July 15 th , 2005 Non-Local

Simplifying mixtures of Parzen windows GRETSI 2011, Bordeaux, France Olivier Schwander Frank

Linear Manifold Clustering Robert Haralick and Rave Harpaz Outline Background The linear

n -dimensional manifold M with T := TM n -dimensional manifold M with T := TM T n -dimensional

Windows Not just for houses Windows 1-10 Windows Server Essentially a jacked up windows 8 box

Platform Convergence Journey Windows Embedded Standard 7 Windows Embedded Standard 8 Converged

Windows 8 Heap Internals Windows 8 Heap Internals Windows 8 Heap Internals INTRODUCTION Windows 8

1. 2. 3. 1. 2. 3. Windows 10 IoT Core Universal Windows Platform (UWP) Microsoft Azure v7

Windows Not Just For Houses Everyone Uses Windows! Versions of Windows 10 There are multiple

Manifold Learning: Applications in Neuroimaging Robin Wolz 23/09/2011 Overview Manifold

Module 1 Overview of Windows 10 Module Overview Introduction to Windows 10 Implementing

Lecture 6: Non-Parametric Methods Parzen Estimation Dr. Chengjiang Long Computer Vision

Manifold-driven spirals and rings Lia Athanassoula LAM, Marseille Lia Athanassoula Manifold

Manifold Regularization Lorenzo Rosasco 9.520 Class 10 March 6, 2011 L. Rosasco Manifold

Charting the Right Manifold: Manifold Mixup for Few-Shot Learning Puneet Mangla 1,2* , Mayank

Manifold Construction and Parameterization for Nonlinear Manifold-Based Model Reduction Chenjie

Game Bot Identification Game Bot Identification based on Manifold Learning based on Manifold

Non-parametric Methods Selim Aksoy Bilkent University Department of Computer Engineering

Non-parametric Methods Selim Aksoy Bilkent University Department of Computer Engineering

Robust Data Processing in What If We Only Have . . . the Presence of Uncertainty What If We Only

Architecture-Based Software Reliability Estimation: Problem Space, Challenges and Strategies

Lecture 3 Homework Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear

CSCE 970 Lecture 2: most probable class Bayesian-Based Classifiers Given M classes 1 , . .

Revealing the secrets of success Theoretical efficiency of side-channel distinguishers Annelie

Generative adversarial networks Ian Jean Mehdi Goodfellow Pouget-Abadie Mirza David Bing