STAT 209 Dimensionality Reduction, November 26, 2019, Colin Reimer Dawson



STAT 209: Dimensionality Reduction. November 26, 2019. Colin Reimer Dawson
Outline: Dimensionality Reduction

High Dimensional Data
● Modern datasets often have huge numbers of variables
● E.g., images, biomarker data, measurements at fine-grained time points, social networks, product preferences
● Clustering can be a useful way to find “groups” of similar observations
● However, distance measures have some strange properties in high dimensions
● Can be useful to try to extract a few dimensions that carry most of the “signal”

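One of the "strange properties" of distance in high dimensions can be seen in a small simulation. The sketch below is not from the slides: the uniform data, the sample size, and the helper name `relative_spread` are all assumptions made for illustration. It draws random points in a unit cube and measures how much the pairwise Euclidean distances vary relative to their average.

```r
# Illustrative simulation (an assumption, not part of the original slides):
# pairwise distances between random points become relatively more alike
# as the number of variables grows.
set.seed(209)

relative_spread <- function(p, n = 200) {
  X <- matrix(runif(n * p), nrow = n)  # n points, uniform in the p-dimensional unit cube
  d <- as.numeric(dist(X))             # all pairwise Euclidean distances
  sd(d) / mean(d)                      # spread of the distances relative to their mean
}

dims <- c(2, 10, 100, 1000)
spread <- sapply(dims, relative_spread)
data.frame(dimensions = dims, relative_spread = round(spread, 3))
```

The relative spread shrinks as the dimension grows, so "near" and "far" neighbors become harder to tell apart. That is one reason clustering on raw high-dimensional data can behave poorly.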
Images Have Many Variables (but maybe only a few meaningful “features”)
High dimensional inputs: comprehensible when arranged this way...
“Eigenfaces”
Finding the “Main Direction” of Variation
[Figure: scatterplot of QuizCentered (vertical axis) against MidtermCentered (horizontal axis), both axes running from −20 to 20]

Finding the “Eigen-features”
library(dplyr)     # select(), mutate(), %>%
library(magrittr)  # extract2()

## Here I am pulling out the perpendicular directions in (Midterm,Quiz)
## space that align with the ellipse on the scatterplot.
## If you know some linear algebra:
## These are the eigenvectors of the covariance matrix
directions <- select(Scores, Midterm, Quiz) %>% cov() %>% eigen()
directions %>% extract2("vectors") %>% round(digits = 2)
      [,1]  [,2]
[1,] -0.97  0.24
[2,] -0.24 -0.97

## Creating two new variables that are a weighted sum and weighted
## difference of the midterm and quiz score, with weights chosen so
## that the new variables are uncorrelated.
## (V1 uses the first eigenvector with its sign flipped; an eigenvector
## is only defined up to sign, so the direction is the same.)
Scores_augmented <- mutate(Scores,
                           V1 = 0.97 * Midterm + 0.24 * Quiz,
                           V2 = 0.24 * Midterm - 0.97 * Quiz)
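The claim that these weights produce uncorrelated variables can be checked numerically. The sketch below is a hedged illustration: the `Scores` data are not included in the slides, so it simulates a stand-in data frame (an assumption), uses base R rather than the slide's dplyr pipeline, and uses the unrounded eigenvector entries as weights. It also compares the result with `prcomp()`, which is a swapped-in check and not something the slides show.

```r
# Simulated stand-in for Scores (an assumption; the real data are not in the slides)
set.seed(209)
midterm <- rnorm(100, sd = 10)
Scores <- data.frame(Midterm = midterm,
                     Quiz = 0.25 * midterm + rnorm(100, sd = 3))

# Eigenvectors of the covariance matrix, as on the slide
directions <- eigen(cov(Scores[, c("Midterm", "Quiz")]))
w <- directions$vectors

# Project each observation onto the two eigen-directions
X <- as.matrix(Scores[, c("Midterm", "Quiz")])
V <- X %*% w
colnames(V) <- c("V1", "V2")

# The new variables are uncorrelated (zero up to floating-point error) ...
cor_V1_V2 <- cor(V[, "V1"], V[, "V2"])

# ... and their variances are the eigenvalues of the covariance matrix
var_V <- apply(V, 2, var)

# prcomp() recovers the same directions, possibly with flipped signs
pca <- prcomp(Scores[, c("Midterm", "Quiz")])
same_directions <- all.equal(abs(unname(pca$rotation)), abs(w))
```

Because the first eigenvalue is the larger one, `V1` carries most of the variation. Keeping only `V1` is the simplest form of dimensionality reduction for this two-variable example.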
