principal component of explained variance
play

Principal Component of Explained Variance High-Dimensional - PowerPoint PPT Presentation

Principal Component of Explained Variance High-Dimensional Estimation and Inference Max Turgeon February 14th, 2020 University of Manitoba Departments of Statistics and Computer Science 1/47 Introduction In modern statistics, we often


  1. Principal Component of Explained Variance High-Dimensional Estimation and Inference Max Turgeon February 14th, 2020 University of Manitoba Departments of Statistics and Computer Science 1/47

  2. Introduction • In modern statistics, we often encounter multivariate data of • In biomedical sciences (e.g. neuroimaging, genomics), pattern recognition, text recognition, fjnance, etc. • We are often faced with the following problem: and how do we identify which variables drive the association? 2/47 large dimension ( p > n ). • Given two multivariate datasets W = ( W 1 , . . . , W p ) and Z = ( Z 1 , . . . , Z q ) , how do we test for global association,

  3. Introduction • The matrix B of regression parameters controls the global association and the contribution of each components of Z . • Regularized regression can also be used to detect sparse signals (e.g. Lasso, SCAD). • However, this framework can be cumbersome when W has dimension greater than one, especially when we have heterogeneous variable types (e.g. continuous and categorical). 3/47 • Regression: E ( W | Z ) = B T Z .

  4. Motivating Examples

  5. Motivating Examples The next examples have the following in common: We have a (possibly high-dimensional) multivariate vector Y and a set of covariates X . We are interested in low dimensional representations of Y that summarise the relationship between Y and X . 4/47

  6. Motivating Example #1 • Digit recognition : A famous example in machine learning coming from Le Cun et al. (1990). pixels), where the goal is to automatically identify the digit. • Y is the set of gray scale values for each pixel, and X is the digit to which the image corresponds • We would like to extract lower-dimensional features to use for predicting the digit. 5/47 • Consists of 28 × 28 gray scale images of digits (i.e. 784

  7. Motivating Example #2 • Data from 340 participants from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) load in 96 brain regions the (binary) disease status. 6/47 • Brain imaging was employed to assess amyloid- β (A β ) protein • Y is the set of A β load values for each brain region, and X is

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend