Exploratory Analysis - Dimensionality Reduction
Davide Bacciu - PowerPoint PPT Presentation


SLIDE 1

Exploratory Analysis Dimensionality Reduction

Davide Bacciu

Computational Intelligence & Machine Learning Group Dipartimento di Informatica Università di Pisa bacciu@di.unipi.it

Introduzione all'Intelligenza Artificiale (Introduction to Artificial Intelligence) - A.Y. 2012/2013

SLIDE 2

Lecture Outline

1. Exploratory Analysis
2. Dimensionality Reduction
   - Curse of Dimensionality
   - General View
3. Feature Extraction
   - Finding Linear Projections
   - Principal Component Analysis
   - Applications and Advanced Issues
4. Conclusion

SLIDE 3

Drowning in complex data

Slide credit goes to Percy Liang (Lawrence Berkeley National Laboratory)

SLIDE 4

Exploratory Data Analysis (EDA)

Discover structure in data

- Find unknown patterns in the data that cannot be predicted using current expert knowledge
- Formulate new hypotheses about the causes of the observed phenomena

A mix of graphical and quantitative techniques

- Visualization
- Finding informative attributes in the data
- Finding natural groups in the data

Interdisciplinary approach

- Computer graphics
- Machine learning
- Data mining
- Statistics

SLIDE 5

A Machine Learning Perspective

Often an unsupervised learning task

- Dimensionality reduction
  - Feature extraction
  - Feature selection
- Clustering

Tackles

- Large datasets...
- ...as well as high-dimensional data with small sample sizes

Exploits tools and models beyond statistics

- E.g. non-parametric neural models

SLIDE 6

Finding Natural Groups in DNA Microarray

S.L. Pomeroy et al. (2002). Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature, 415, 436-442.

SLIDE 7

Finding Informative Genes

S.L. Pomeroy et al. (2002). Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature, 415, 436-442.

SLIDE 8

The Curse of Dimensionality

If the data lies in a high-dimensional space, then an enormous amount of data is required to learn a model.

Curse of Dimensionality (Bellman, 1961) - Some problems become intractable as the number of variables increases

- Huge amount of training data required
- Too many model parameters (complexity)

Given a fixed number of training samples, the predictive power reduces as sample dimensionality increases (Hughes effect, 1968)

SLIDE 9

A Simple Combinatorial Example (I)

- A toy 1-dimensional classification task with 3 classes
- Classes cannot be separated well: let's add another feature...
- Better class separation, but still errors. What if we add another feature?

SLIDE 10

A Simple Combinatorial Example (II)

- Classes are well separated
- Exponential growth in the complexity of the learned model with increasing dimensionality
- Exponential growth in the number of examples required to maintain a given sampling density
  - 3 samples per bin in 1-D ⇒ 81 samples in 3-D to keep the same density
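
To make this combinatorial growth concrete, here is a minimal sketch assuming 3 bins per axis and 3 samples per bin, matching the toy example:

```python
# Samples needed to keep a fixed sampling density as dimensionality grows.
# Assumes 3 bins per axis and 3 samples per bin (the toy example above).
bins_per_axis = 3
samples_per_bin = 3

for D in (1, 2, 3):
    n_bins = bins_per_axis ** D           # bins grow exponentially with D
    n_samples = n_bins * samples_per_bin  # samples needed for constant density
    print(f"D={D}: {n_bins} bins, {n_samples} samples")
# D=1: 3 bins, 9 samples
# D=2: 9 bins, 27 samples
# D=3: 27 bins, 81 samples
```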

SLIDE 11

Intrinsic Dimension

The intrinsic dimension of data is the minimum number of independent parameters needed to account for the observed properties of the data. Data might live on a lower-dimensional surface (fold) than expected.

SLIDE 12

What is the Intrinsic Dimension?

- Might not be an easy question to answer...
- It may increase due to noise
- A data fold needs to be unfolded to reveal its intrinsic dimension

SLIDE 13

Informative Vs Uninformative Features

- Data can be made of several dimensions that are either unimportant or comprise only noise
- Irrelevant information might distract the learning model
- Learning resources (memory) are wasted to represent irrelevant portions of the input space

Dimensionality reduction aims at automatically finding a lower-dimensional representation of high-dimensional data

- Counteracts the curse of dimensionality
- Reduces the effect of unimportant attributes

SLIDE 14

Why Dimensionality Reduction?

Data Visualization

- Projecting high-dimensional data onto a 2D/3D screen space
- Preserving topological relationships
- E.g. visualize semantically related textual documents

Data Compression

- Reducing storage requirements
- Reducing complexity
- E.g. stopword removal

Feature ranking and selection

- Identifying informative bits of information
- Noise reduction
- E.g. identify words correlated with document topics

SLIDE 15

Flavors of Dimensionality Reduction

Feature Extraction - Create a lower-dimensional representation of $x \in \mathbb{R}^D$ by combining the existing features with a given function $f : \mathbb{R}^D \to \mathbb{R}^{D'}$:

$$x = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_D \end{bmatrix} \xrightarrow{\;y = f(x)\;} y = \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_{D'} \end{bmatrix}$$

Feature Selection - Choose a $D'$-dimensional subset of all the features (possibly the most informative):

$$x = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_D \end{bmatrix} \xrightarrow{\;\text{select } i_1, \ldots, i_{D'}\;} y = \begin{bmatrix} x_{i_1} \\ x_{i_2} \\ \vdots \\ x_{i_{D'}} \end{bmatrix}$$
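
A minimal NumPy sketch of the two flavors; the projection matrix and the selected indices here are made-up illustrations, not quantities from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=5)       # a single sample, D = 5

# Feature extraction: combine all features through a map f; here a linear one.
W = rng.normal(size=(2, 5))  # hypothetical projection matrix, D' = 2
y_extracted = W @ x          # 2 new features, each mixing all 5 originals

# Feature selection: keep a subset of the original features unchanged.
idx = [0, 3]                 # hypothetical indices of the most informative features
y_selected = x[idx]          # 2 of the original features, untouched
```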

SLIDE 16

A Unique Formalization

Definition (Dimensionality Reduction) - Given an input feature space $x \in \mathbb{R}^D$, find a mapping $f : \mathbb{R}^D \to \mathbb{R}^{D'}$ such that $D' < D$ and $y = f(x)$ preserves most of the informative content in $x$.

Often the mapping $f(x)$ is chosen as a linear function $y = Wx$

- $y$ is a linear projection of $x$
- $W \in \mathbb{R}^{D' \times D}$ is the matrix of linear coefficients

$$\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_{D'} \end{bmatrix} = \begin{bmatrix} w_{11} & w_{12} & \ldots & w_{1D} \\ w_{21} & w_{22} & \ldots & w_{2D} \\ \vdots & \vdots & \ddots & \vdots \\ w_{D'1} & w_{D'2} & \ldots & w_{D'D} \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_D \end{bmatrix}$$

SLIDE 17

Unsupervised Vs Supervised Dimensionality Reduction

The linear/nonlinear map y = f(x) is learned from the data based on an error function that we seek to minimize

Signal representation (Unsupervised)

- The goal is to represent the samples accurately in a lower-dimensional space
- Principal Component Analysis (PCA)

Classification (Supervised)

- The goal is to enhance the class-discriminatory information in the lower-dimensional space
- Linear Discriminant Analysis (LDA)

SLIDE 18

Feature Extraction

Objective - Create a lower-dimensional representation of $x \in \mathbb{R}^D$ by combining the existing features with a given function $f : \mathbb{R}^D \to \mathbb{R}^{D'}$, while preserving as much information as possible:

$$x = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_D \end{bmatrix} \xrightarrow{\;y = f(x)\;} y = \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_{D'} \end{bmatrix}$$

where $D' \ll D$ and, for visualization, $D' = 2$ or $D' = 3$.

SLIDE 19

Linear Feature Extraction

Signal Representation (Unsupervised)

- Independent Component Analysis (ICA)
- Principal Component Analysis (PCA)
- Non-negative Matrix Factorization (NMF)

Classification (Supervised)

- Linear Discriminant Analysis (LDA)
- Canonical Correlation Analysis (CCA)
- Partial Least Squares (PLS)

We focus on unsupervised approaches exploiting linear mapping functions

SLIDE 20

Linear Methods Setup

Given $N$ samples $x_n \in \mathbb{R}^D$, define the input data as the matrix

$$X = \begin{bmatrix} | & & | \\ x_1 & \ldots & x_N \\ | & & | \end{bmatrix} \in \mathbb{R}^{D \times N}$$

Choose $D' \ll D$ projection directions $w_k$

$$W = \begin{bmatrix} | & & | \\ w_1 & \ldots & w_{D'} \\ | & & | \end{bmatrix} \in \mathbb{R}^{D \times D'}$$

Compute the projection of $x$ along each direction $w_k$ as

$$y = [y_1, \ldots, y_{D'}]^T = W^T x$$

Linear methods only differ in the criteria used for choosing $W$
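
A shape-level sketch of this setup in NumPy, with random data and directions purely to fix the conventions (samples as columns of X, directions as columns of W):

```python
import numpy as np

rng = np.random.default_rng(2)
D, D_prime, N = 10, 2, 100

X = rng.normal(size=(D, N))        # N samples as columns, each in R^D
W = rng.normal(size=(D, D_prime))  # D' projection directions as columns

Y = W.T @ X                        # projections: one D'-dim column per sample
assert Y.shape == (D_prime, N)
```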

SLIDE 21

Linear Projection - A Graphical Interpretation

(Figures: 3D samples projected onto a hyperplane generated by 2 projection directions; 2D projection of the input samples on the hyperplane)

SLIDE 22

Principal Component Analysis (PCA)

Orthogonal linear projection of high-dimensional data onto a low-dimensional subspace, preserving as much variance information as possible.

Objective

1. Minimize the projection error, i.e. the error of the reconstructed sample, $x_n - \tilde{x}_n$
2. Maximize the variance of the projected data $Y$

The good news is that both objectives are equivalent!

SLIDE 23

PCA - Two Operations

Encode - Project data onto the principal components:

$$y = W^T x, \qquad y_k = w_k^T x \ \text{ for the } k\text{-th component}$$

Decode - Reconstruct the projected data:

$$\tilde{x} = W y = \sum_{k=1}^{D'} y_k w_k$$
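
A minimal sketch of the two operations. It assumes the columns of W are orthonormal principal directions; here they come from the QR factorization of a random matrix, just for illustration (real PCA would take eigenvectors of the covariance):

```python
import numpy as np

rng = np.random.default_rng(3)
D, D_prime = 5, 2

# An orthonormal basis via QR; keep its first D' columns as stand-in components.
Q, _ = np.linalg.qr(rng.normal(size=(D, D)))
W = Q[:, :D_prime]

x = rng.normal(size=D)
y = W.T @ x    # encode: D' coefficients y_k = w_k^T x
x_rec = W @ y  # decode: equals sum_k y_k * w_k
assert np.allclose(x_rec, sum(y[k] * W[:, k] for k in range(D_prime)))
```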

SLIDE 24

PCA - Variance Maximization

Given $N$ samples $\{x_n\}_{n=1}^N$ with $x_n \in \mathbb{R}^D$

Goal - Project data into a $D' < D$ dimensional space such that the variance of the projected data is maximized. For simplicity, consider $D' = 1$:

- A single projection direction $w_1$
- Assume normalized vectors, $\|w_1\|^2 = 1$
  - Orthonormal basis from numerical analysis
  - Serves to select a single solution among the infinitely many $w$

SLIDE 25

Variance Maximization - Input Space

Compute the means of the input data $\{x_n\}_{n=1}^N$:

$$\bar{x} = \frac{1}{N} \sum_{n=1}^{N} x_n$$

Compute the covariance of the input data:

$$S = \frac{1}{N} \sum_{n=1}^{N} (x_n - \bar{x})(x_n - \bar{x})^T$$
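
These two quantities in NumPy, with the same 1/N normalization as above (random data, just for illustration):

```python
import numpy as np

rng = np.random.default_rng(4)
X = rng.normal(size=(3, 50))                 # D = 3 features, N = 50 samples as columns

x_bar = X.mean(axis=1, keepdims=True)        # empirical means, shape (D, 1)
Xc = X - x_bar                               # centered data
S = (Xc @ Xc.T) / X.shape[1]                 # covariance with the 1/N convention above
assert np.allclose(S, np.cov(X, bias=True))  # matches NumPy's biased estimator
```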

SLIDE 26

Variance Maximization - Projected Data

Compute the mean of the projected data as $w_1^T \bar{x}$.

Compute the variance of the projected data as

$$\frac{1}{N} \sum_{n=1}^{N} \left( w_1^T x_n - w_1^T \bar{x} \right)^2 = \frac{1}{N} \sum_{n=1}^{N} \left( w_1^T (x_n - \bar{x}) \right)^2 = \frac{1}{N} \sum_{n=1}^{N} w_1^T (x_n - \bar{x})(x_n - \bar{x})^T w_1 = w_1^T S w_1$$
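
A quick numerical check of this identity, using random data and a random unit-norm direction:

```python
import numpy as np

rng = np.random.default_rng(5)
X = rng.normal(size=(3, 200))                 # D = 3, N = 200 samples as columns
w1 = rng.normal(size=3)
w1 /= np.linalg.norm(w1)                      # normalize: ||w1|| = 1

x_bar = X.mean(axis=1)
S = np.cov(X, bias=True)                      # covariance with 1/N normalization

proj = w1 @ X                                 # w1^T x_n for every sample
var_proj = np.mean((proj - w1 @ x_bar) ** 2)  # variance of the projected data
assert np.isclose(var_proj, w1 @ S @ w1)      # equals w1^T S w1
```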

SLIDE 27

PCA - Variance Maximization Problem

Goal - Maximize the variance of the projected data:

$$L = \max_{w}\; w^T S w \quad \text{subject to the normalization constraint } \|w\|^2 = 1$$

How? Don't panic! No theoretical explanation here; for that you will need to take the Machine Learning course. Basically, it is an optimization problem that is solved by differentiating $L$ to find its maximum.

SLIDE 28

PCA - Variance Maximization Solution

For $D' = 1$, the solution is the first principal component $w = u_1$ such that

$$S u_1 = \lambda_1 u_1$$

where

- $\lambda_1 \in \mathbb{R}$ is the first eigenvalue of $S$ (i.e. the largest)
- $u_1 \in \mathbb{R}^D$ is the associated first eigenvector
- $\lambda_1$ is the variance of the projected data, i.e. $\lambda_1 = u_1^T S u_1$

Maximize the variance ⇒ choose the eigenvector $u$ with the largest associated eigenvalue $\lambda$
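
A small numerical confirmation on random data (note that np.linalg.eigh returns eigenvalues in ascending order for symmetric matrices):

```python
import numpy as np

rng = np.random.default_rng(6)
X = rng.normal(size=(3, 500))
S = np.cov(X, bias=True)

eigvals, eigvecs = np.linalg.eigh(S)
lam1, u1 = eigvals[-1], eigvecs[:, -1]  # largest eigenvalue and its eigenvector

assert np.allclose(S @ u1, lam1 * u1)   # S u1 = lambda1 u1
assert np.isclose(lam1, u1 @ S @ u1)    # lambda1 is the projected variance

# Sanity check: no random unit direction beats the first principal component.
for _ in range(1000):
    w = rng.normal(size=3)
    w /= np.linalg.norm(w)
    assert w @ S @ w <= lam1 + 1e-12
```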

SLIDE 29

PCA - More Principal Components

What if I want more than one projection direction ($D' > 1$)? Choose each new direction $w_k$ as one that

- Maximizes the variance of the projected data
- Is subject to the normalization constraint $\|w_k\|^2 = 1$
- Is orthogonal to those already selected, i.e. $w_1, \ldots, w_{k-1}$

The solution is in the eigenvectors of the input covariance $S$. The covariance $S$ of a $D$-dimensional input space has $D$ eigenvectors:

- The eigenvector $u_1$ of the largest eigenvalue $\lambda_1$ is the first principal component
- The eigenvector $u_2$ of the second-largest eigenvalue $\lambda_2$ is the second principal component
- The eigenvector $u_3$ of the third-largest eigenvalue $\lambda_3$ is the third principal component
- ...

SLIDE 30

PCA Solution - Eigenvalue Decomposition

The PCA solution reduces to finding the eigenvalue decomposition of the covariance matrix of the input data

$$S = U \Lambda U^T$$

where

- $U = [u_1, \ldots, u_D]$ is the $D \times D$ matrix of eigenvectors $u_k$
- $\Lambda$ is the $D \times D$ diagonal matrix whose diagonal element $\lambda_k$ is the $k$-th eigenvalue

A $D' < D$ dimensional projection space is created by choosing the $D'$ eigenvectors $\{u_k\}_{k=1}^{D'}$ corresponding to the $D'$ largest eigenvalues $\{\lambda_k\}_{k=1}^{D'}$.

SLIDE 31

Practical PCA (I)

Step 1: Organize Data - Put your $N$ samples into a $D \times N$ matrix $X$

Step 2: Compute Means - Calculate the empirical means of your data:

$$\bar{x} = \frac{1}{N} \sum_{n=1}^{N} x_n$$

Step 3: Preprocess Data - Subtract the means $\bar{x}$ from each input sample:

$$\bar{X} = X - \bar{x}$$

SLIDE 32

Practical PCA (Ia)

(Figures: input data; compute means; rescale data)

SLIDE 33

Practical PCA (II)

Step 4: Compute Covariance - Calculate the covariance of the input data:

$$S = \frac{1}{N} \bar{X} \bar{X}^T$$

Step 5: Eigenvalue Decomposition - Compute the eigenvalue decomposition of the covariance:

$$S = U \Lambda U^T$$

where $\Lambda = \mathrm{diag}(\lambda_1, \ldots, \lambda_D)$ and $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_D$. The eigenvalue decomposition can be obtained using standard vector algebra or numerical routines (e.g. Singular Value Decomposition (SVD)).

SLIDE 34

Practical PCA (III)

Step 6: Model Selection - Select $D' < D$ projection directions, associated with the first $D'$ eigenvalues, so as to maximize the amount of variance retained in the projection:

$$W = U_{D'} = \begin{bmatrix} | & & | \\ u_1 & \ldots & u_{D'} \\ | & & | \end{bmatrix} \in \mathbb{R}^{D \times D'}$$

Step 7: Encoding - Transform the normalized data $\bar{X}$ by projecting it onto the $D'$ principal components:

$$Y = W^T \bar{X}$$

where $Y \in \mathbb{R}^{D' \times N}$ is a compressed representation of the input data.
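
Putting Steps 1-7 together, a minimal NumPy sketch of the whole procedure (the data here is random, just to exercise the shapes):

```python
import numpy as np

rng = np.random.default_rng(7)
D, N, D_prime = 5, 200, 2

# Step 1: organize data as a D x N matrix (samples as columns)
X = rng.normal(size=(D, N))

# Steps 2-3: compute the empirical means and center the data
x_bar = X.mean(axis=1, keepdims=True)
Xc = X - x_bar

# Step 4: covariance of the centered data
S = (Xc @ Xc.T) / N

# Step 5: eigenvalue decomposition (eigh is ascending, so reverse the order)
eigvals, eigvecs = np.linalg.eigh(S)
eigvals, eigvecs = eigvals[::-1], eigvecs[:, ::-1]

# Step 6: keep the D' leading eigenvectors as projection directions
W = eigvecs[:, :D_prime]

# Step 7: encode - project the centered data onto the principal components
Y = W.T @ Xc
assert Y.shape == (D_prime, N)
```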

SLIDE 35

Practical PCA (IIIa)

(Figures: principal components; data projected onto the principal components' plane)

SLIDE 36

Projecting New Data

Given $N'$ new input samples $X' \in \mathbb{R}^{D \times N'}$, they can be projected into the reduced space by

1. Subtracting the means: $\bar{X}' = X' - \bar{x}$
2. Projecting onto the known principal components: $Y' = W^T \bar{X}'$
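
Continuing the sketch above, new samples reuse the stored mean and projection matrix:

```python
# New samples must be centered with the *training* mean, not their own.
X_new = rng.normal(size=(D, 10))  # N' = 10 new samples, stand-ins for real data
Y_new = W.T @ (X_new - x_bar)     # project with the already-learned W
assert Y_new.shape == (D_prime, 10)
```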

SLIDE 37

Application Example - Eigenfaces (I)

Each sample $x_n \in \mathbb{R}^D$ is a face picture with $D$ pixels. The value of the $d$-th feature $x_n(d)$ is the intensity level of the corresponding pixel.

M. Turk and A. Pentland (1991). Face recognition using eigenfaces. Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 586-591.
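
A sketch of how face images become PCA inputs; the 32x32 size and the random pixel values are assumptions purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(8)
H, W_img, N = 32, 32, 100               # hypothetical image size and dataset size
faces = rng.random(size=(N, H, W_img))  # stand-in for real face images

# Flatten each image into a D-dimensional column vector, D = H * W_img pixels.
X = faces.reshape(N, H * W_img).T       # data matrix, shape (D, N)

# After running PCA (as in the earlier sketch), each eigenvector u_k has D
# entries and can be reshaped back into an H x W_img image: an "eigenface".
# eigenface_k = eigvecs[:, k].reshape(H, W_img)
```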

SLIDE 38

Application Example - Eigenfaces (II)

What is a principal component here? An eigenface $u_k$: eigenvectors can be shown as images depicting primitive facial features.

SLIDE 39

Application Example - Eigenfaces (III)

We can easily visualize the reconstruction of an image projected onto its eigenfaces

SLIDE 40

How Many Principal Components?

Eigenvalues measure the fraction of variance captured by the projection and can be used to define a distortion measure. Suppose we have selected $K < D$ principal components; the resulting distortion is

$$J = \sum_{k=K+1}^{D} \lambda_k$$

that is, the proportion of variance neglected by the projection.
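
A common recipe building on this measure (a sketch, not from the slides): pick the smallest K whose retained variance exceeds a target fraction. The eigenvalues below are made-up numbers:

```python
import numpy as np

# Eigenvalues sorted in descending order, as produced in the PCA sketch above.
eigvals = np.array([4.0, 2.0, 1.0, 0.5, 0.3, 0.2])

retained = np.cumsum(eigvals) / eigvals.sum()  # variance fraction kept by K components
K = int(np.searchsorted(retained, 0.90) + 1)   # smallest K retaining >= 90%

distortion = eigvals[K:].sum()                 # J = sum of the neglected eigenvalues
print(K, retained[K - 1], distortion)          # 4, 0.9375, 0.5
```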

SLIDE 41

Is Variance That Informative?

SLIDE 42

Linear Discriminant Analysis

Adding supervised class information into the projection function: Linear Discriminant Analysis (LDA). Perform dimensionality reduction while preserving as much of the class-discriminatory information as possible:

- Maximum separation between the means of the projected classes
- Minimum variance within each projected class

SLIDE 43

Nonlinear Projections

To solve this problem (e.g. data lying on a rolled-up, nonlinear manifold), either you unfold the roll (manifold approaches) or you change the data representation (kernel methods).

SLIDE 44

Nonlinear Feature Extraction

Signal Representation (Unsupervised)

- Manifold learning algorithms, e.g. ISOMAP
- Kernel Principal Component Analysis (KPCA)

Classification (Supervised)

- Kernel Discriminant Analysis (KDA)
- Kernel Canonical Correlation Analysis (KCCA)

Kernels allow using a linear model for a nonlinear problem. A kernel induces a new space, by means of a nonlinear mapping, where the original linear operations can be performed. E.g. KPCA performs a linear PCA in the space created by the kernel rather than in the original data space.
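
A minimal KPCA sketch, assuming scikit-learn is available; the two-circles dataset and the RBF gamma value are illustrative choices (note that scikit-learn expects samples as rows, unlike the D x N convention above):

```python
from sklearn.datasets import make_circles
from sklearn.decomposition import PCA, KernelPCA

# Two concentric circles: not linearly separable in the original 2D space.
X, y = make_circles(n_samples=400, factor=0.3, noise=0.05, random_state=0)

# Linear PCA cannot "unfold" the circles; KPCA with an RBF kernel can.
Y_linear = PCA(n_components=2).fit_transform(X)
Y_kernel = KernelPCA(n_components=2, kernel="rbf", gamma=10.0).fit_transform(X)
print(Y_kernel.shape)  # (400, 2): a linear PCA performed in the kernel-induced space
```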

SLIDE 45

Take-home Messages

Exploratory data analysis

- Find new patterns in data
- Formulate new hypotheses

Two key concepts

- Curse of dimensionality - intractable problems
- Intrinsic dimension - data lies in a lower-dimensional space

Dimensionality Reduction

- Feature Extraction - create new features by combining input data
- Feature Selection - extract a subset of informative input dimensions

Linear feature extraction ⇒ $y = Wx$

- Models differ by the criteria used to choose $W$

SLIDE 46

Wrapping up PCA..

PCA is a linear transformation

- Defined by the matrix of eigenvectors $W$ of the data covariance $S$
- Preserves as much variance as possible, as measured by the eigenvalues $\Lambda$

A general linear transformation produces a rotation, translation and scaling of the space

- PCA rotates the data so that it is maximally decorrelated (orthonormal principal components)

PCA is linear

- It cannot fit curved surfaces well ⇒ nonlinear models

PCA does not account for class information

- ⇒ supervised models (LDA)