Principal Component Ananalysis 4-8-2016 PCA: the setting - PowerPoint PPT Presentation

Principal Component Ananalysis 4-8-2016

PCA: the setting Unsupervised learning ● Unlabeled data Dimensionality reduction ● Simplify the data representation

Change of basis examples so far Support vector machines ● Data that's not linearly separable in the standard basis may be (approximately) linearly separable in a transformed basis. ● The kernel trick sometimes lets us work with high-dimensional bases. Approximate Q-learning ● When the state space is too large for Q-learning, we may be able to extract features that summarize the state space well. ● We then learn values as a linear function of the transformed representation.

Change of basis in PCA This looks like the change of basis from linear algebra. ● PCA performs an affine transformation of the original basis. ○ Affine ≣ linear plus a constant The goal: ● find a new basis where most of the variance in the data is along the axes. ● Hopefully only a small subset of the new axes will be important.

PCA change of basis illustrated

PCA: step one First step: center the data. ● From each dimension, subtract the mean value of that dimension. ● This is the "plus a constant" part, afterwards we'll perform a linear transformation. ● The centroid is now a vector of zeros.

PCA: step two The hard part: find an orthogonal basis that's a linear transformation of the original, where the variance in the data is explained by as few dimensions as possible. ● Orthogonal basis: all axes are perpendicular. ● Linear transformation of a basis: rotate (m - 1 angles) ● Explaining the variance: data varies a lot along some axes, but much less along others.

PCA: step three Last step: reduce the dimension. ● Sort the dimensions of the new basis by how much the data varies. ● Throw away some of the less-important dimensions. ○ Could keep a specific number of dimensions. ○ Could keep all dimensions with variance above some threshold. ● This results in a projection into the subspace of the remaining axes.

Computing PCA: step two ● Construct the covariance matrix. ○ m x m (m is the number of dimensions) matrix. ○ Diagonal entries give variance along each dimension. ○ Off-diagonal entries give cross-dimension covariance. ● Perform eigenvalue decomposition on the covariance matrix. ○ Compute the eigenvectors/eigenvalues of the covariance matrix. ○ Use the eigenvectors as the new basis.

Covariance matrix example X T data 4 8 -2 X x 0 x 1 x 2 x 3 x 4 C = ⅕ (X)(X T ) 3 0 6 4 3 -4 1 2 4 3 -4 1 2 -4 -1 -7 7.8 3.2 8 8 0 -1 -2 -5 8 0 -1 -2 -5 1 -2 6 3.2 18.8 -1.2 -2 6 -7 6 -3 -2 6 -7 6 -3 2 -5 -3 8 -1.2 26.8

Linear algebra review: eigenvectors Eigenvectors are vectors that the matrix doesn’t rotate. If X is a matrix, and v is a vector, then v is an eigenvector of x iff there is some constant λ, such that: Xv = λv λ, the amount by which X stretches the eigenvector is the eigenvalue.

Linear algebra review: eigenvalue decomposition If the matrix (X)(X T ) has eigenvectors with eigenvalues for i ∈ {1, …, m}, then the following vectors form an orthonormal basis: The key point: computing the eigenvectors of the covariance matrix gives us the optimal (linear) basis for explaining the variance in our data. Sorting by eigenvalue tells us the relative importance of each dimension.

PCA change of basis illustrated

When does PCA fail?

Exam questions Topics coming later today. Lectures since the last exam: machine learning intro Q-learning decision trees approximate Q-learning perceptrons MCTS for MDPs backpropagation POMDPs analyzing backprop particle filters naive Bayes hierarchical clustering k nearest neighbors EM, k-means, and GNG support vector machines principal component analysis value iteration

Principal Component Ananalysis 4-8-2016 PCA: the setting - PowerPoint PPT Presentation

Principal Component Ananalysis 4-8-2016 PCA: the setting Unsupervised learning Unlabeled data Dimensionality reduction Simplify the data representation Change of basis examples so far Support vector machines Data that's not

Continuous Latent Variables Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 12 Principal Component

Functional Principal Component Analysis May 14, 2018 Empirical Principal Component FPC for the

Section 1 Principal Component Analysis 1 / 16 Principal Component Analysis ST 810-006

Functional components Notification component Application received Refuse ? Notification

WIO IOSAP Project Budget Nairobi Convention WIO IOSAP Budget per Project Component COMPONENT

Principal Component Analysis Powerpoint Presentation What is multivariate analysis? Summarizing

Principal component analysis Ingo Blechschmidt December 17th, 2014 Kleine Bayessche AG

Component selection 1 (c) 2020 A.J.M. Montagne Component selection + - + - + - 2 (c)

For use in AIM Awards centres Component Level: Level Three Component Guided Learning Hours: 21

For use in AIM Awards centres Component Level: Level Three Component Guided Learning Hours: 28

CS530L lab component of lab component of CS530L Security Systems course Security

Principal Component Analysis http://setosa.io/ev/principal- Food consumption in the UK

CS475/CS675 Lecture 23: July 19, 2016 Principal Component Analysis, Eigenfaces CS475/CS675 (c)

Introduction to Principal Component Analysis and Indepedent Component Analysis Tristan A. Hearn

Hebbian Learning, Hebbian Learning Principal Component Analysis, and Independent Component

Chapter 5 Singular value decomposition and principal component analysis In A Practical Approach to

w s o

Optimal control of the cylinder wake flow using Proper Orthogonal Decomposition (POD) Michel

Anne Bracy CS 3410 Computer Science Cornell University The slides are the product of many

Multiple-Environment Markov Decision Processes: Efficient Analysis and Applications ICAPS 2020

Solving large scale eigenvalue problems Lecture 8, April 18, 2018: Krylov spaces

Important Scientific Presentation Jonathan Doe Department of Electrical Engineering University

Topological order from quantum loops and nets Paul Fendley It has proved to be quite tricky to T

SVD and Low-rank Approximation Lecture 23 April 18, 2019 Chandra (UIUC) CS498ABD 1 Spring