
SLIDE 1

Principal Component Analysis in a Linear Algebraic View

by Anna Orosz under the mentorship of Jakob Hansen Directed Reading Program at the University of Pennsylvania

April 30th, 2020

SLIDE 2

Principal Component Analysis as a Transformation

  • invented in 1901 by Karl Pearson
  • rotation of the data from one coordinate system to another
  • Goal: dimension reduction of multidimensional datasets

SLIDE 3

Fitting the Best Ellipsoid on the data

  • multidimensional data:
    ○ rows: sample values
    ○ columns: measured variables
  • fitting a p-dimensional ellipsoid to the data
  • each axis of the ellipsoid represents a principal component
  • the small axes represent small variances
SLIDE 4

Computing PCA through the EVD of the covariance matrix

  • original data matrix is Y
    ○ subtract the column means from each data point
    ○ X is the shifted version of Y with column-wise zero empirical mean
  • the covariance matrix is XᵀX (up to a 1/(n−1) scaling factor)
  • the first component's direction is computed by maximizing the variance:

    w₁ = arg max‖w‖=1 ‖Xw‖² = arg max‖w‖=1 wᵀ(XᵀX)w

  • the other components are computed by iterating this
    ○ with the help of Gram-Schmidt orthogonalization

  1. calculate the data covariance matrix of the original data
  2. perform eigenvalue decomposition (EVD) on the covariance matrix
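The two numbered steps above can be sketched in NumPy; the toy data matrix Y below is a hypothetical stand-in for the slides' dataset:

```python
import numpy as np

# hypothetical toy data matrix Y: rows = samples, columns = variables
rng = np.random.default_rng(0)
Y = rng.normal(size=(100, 3))

# shift to column-wise zero empirical mean
X = Y - Y.mean(axis=0)

# step 1: empirical covariance matrix (X^T X with 1/(n-1) scaling)
C = X.T @ X / (X.shape[0] - 1)

# step 2: eigenvalue decomposition; eigh is used because C is symmetric
eigvals, W = np.linalg.eigh(C)

# sort eigenpairs by decreasing eigenvalue (decreasing variance)
order = np.argsort(eigvals)[::-1]
eigvals, W = eigvals[order], W[:, order]
```

Sorting matters because `eigh` returns eigenvalues in ascending order, while PCA orders components by decreasing explained variance.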

SLIDE 5

Result of computing PCA using EVD

  • this way we obtain a matrix W
    ○ W is orthonormal
  • the result is T = X*W
    ○ W is a p-by-p matrix of weights
    ○ columns: eigenvectors of XᵀX
  • the last few columns of T can be omitted when the majority of the
    variance can be explained using the first few columns
    ○ dimension reduction
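A minimal sketch of the projection and truncation, continuing the EVD approach (the data and the choice k = 2 are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))
X = X - X.mean(axis=0)                 # column-wise zero empirical mean

# W: columns are eigenvectors of X^T X, sorted by decreasing eigenvalue
eigvals, W = np.linalg.eigh(X.T @ X)
W = W[:, np.argsort(eigvals)[::-1]]

T = X @ W                              # full score matrix T = X*W (p columns)

# omit the last columns, keeping only the first k -> dimension reduction
k = 2
T_reduced = T[:, :k]
```

Because the eigenvectors are sorted by decreasing eigenvalue, the columns of T have decreasing variance, so dropping the last columns discards the least variance.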

SLIDE 6

Another Computational Method: Singular Value Decomposition

  • factorization of a real or complex matrix
  • given an m×n matrix M, the SVD yields M = U Σ Vᵀ
    ○ U is an m×m unitary matrix (a rotation or reflection)
    ○ Σ is an m×n rectangular diagonal matrix
    ○ Vᵀ is an n×n unitary matrix
  • the diagonal entries σᵢ = Σᵢᵢ of Σ are non-negative numbers
    ○ known as the singular values of M
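The factorization can be checked directly in NumPy on a small example matrix (chosen here only for illustration):

```python
import numpy as np

M = np.array([[3.0, 1.0],
              [1.0, 3.0],
              [0.0, 2.0]])             # a 3x2 real matrix

U, s, Vt = np.linalg.svd(M)            # s holds the singular values sigma_i
Sigma = np.zeros((3, 2))               # rectangular diagonal m x n matrix
np.fill_diagonal(Sigma, s)

reconstructed = U @ Sigma @ Vt         # M = U Sigma V^T
```

`np.linalg.svd` returns the singular values already sorted in non-increasing order, and U and Vᵀ come back as square orthogonal matrices, matching the definition above.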

SLIDE 7

Computing Principal Component Analysis using Singular Value Decomposition

  • SVD of the data matrix X: X = UΣWᵀ
  • substituting into T = X*W gives T = UΣWᵀW = UΣ (the polar decomposition of T)
    → NO need to determine the covariance matrix
  • more numerically stable than using EVD on the covariance matrix
  • the primary method for computing PCA in practice
    ○ (unless only a handful of components are required)
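A sketch of the SVD route, verifying on hypothetical data that the scores T = UΣ agree with the projection X*W, without ever forming XᵀX:

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(50, 4))
X = X - X.mean(axis=0)                 # centered data; no covariance matrix needed

# thin SVD of the data matrix: X = U Sigma W^T
U, s, Wt = np.linalg.svd(X, full_matrices=False)

T = U * s                              # scores T = U Sigma (broadcast over columns)
```

Since WᵀW = I, multiplying X by W on the right cancels the Wᵀ factor of the SVD and leaves UΣ, which is exactly the score matrix computed above.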

SLIDE 8

Why/why not use Principal Component Analysis?

Pros

  • reflects our intuitions about the data
  • allows estimating probabilities in high-dimensional data
  • monumental reduction in the size of the data
    ○ faster processing
    ○ smaller storage

Cons

  • cubic time to compute
    ○ expensive for huge datasets
  • only for continuous variables
  • assumes linearity of the data
  • catastrophic for fine-grained tasks
    ○ outliers, interesting special cases
SLIDE 9

Applications of Principal Component Analysis

  • quantitative finance
    ○ risk management of interest-rate derivative portfolios
  • eigenfaces
    ○ facial recognition
  • image compression
  • countless other applications
    ○ for example in neuroscience, medical data correlation, etc.