Multivariate analysis DAAG Chapter 12 Learning objectives In this - PowerPoint PPT Presentation

Multivariate analysis DAAG Chapter 12

Learning objectives In this section, we will learn some basic approaches to multivariate analysis. ◮ Principal components analysis ◮ What is principal components analysis? ◮ What does principal components analysis do? ◮ How can principal components analysis be used? ◮ Multi-dimensional scaling (MDS) ◮ What is a distance measure? ◮ What are Euclidean, Manhattan, Canberra distances? ◮ What does MDS do? ◮ How can MDS be used?

Multivariate analysis: Motivating problem Possum morphology data. 104 possums trapped at seven sites in Australia. ◮ sex ◮ age ◮ head length ◮ skull width ◮ total length ◮ tail length ◮ foot length ◮ ear conch length ◮ eye measurement ◮ chest girth ◮ belly girth How can we analyze these data to uncover the patterns that exist?

Plots of possum data B Cambarville Byrangery Bulburin ● Cambarville Byrangery Bulburin ● Bellbird Conondale Bellbird Conondale Whian Whian ● Allyn River Whian Whian Allyn River ● 55 50 55 50 ear conch ● ● ● ● ● ● length ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 45 ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 40 45 40 70 75 earconch 75 ● ● ● foot 70 ● ● length ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 65 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 60 65 ● ● ● ● ● ● ● ● ● ● 60 ● ● ● ● ● ● ● 38 40 42 42 ● ● ● ● ● ● ● ● ● ● ● 40 ● ● ● ● tail ● ● ● ● ● ● 38 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● length ● ● ● ● ● ● ● ● ● ● ● ● 36 ● ● ● ● footlgth ● ● ● ● ● ● taill 34 32 34 36 32 Scatter Plot Matrix

Principal components analysis For the possum data, we have 9 morphological measurements. ◮ This is a lot to visualize. ◮ Also, there is no “response” variable ◮ How can we uncover structure in these data? Principal components analysis creates new variables (components) using linear combinations of the existing variables. ◮ The first component is chosen to explain as much variation as possible ◮ Subsequent components are chosen in the same way ◮ Components are orthogonal

Principal components on possums Importance of components: Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 Standard deviation 6.800 5.033 2.6993 2.1601 1.7372 Proportion of Variance 0.498 0.273 0.0785 0.0503 0.0325 Cumulative Proportion 0.498 0.771 0.8495 0.8998 0.9323 Comp.6 Comp.7 Comp.8 Comp.9 Standard deviation 1.5989 1.2860 1.1111 0.91696 Proportion of Variance 0.0275 0.0178 0.0133 0.00906 Cumulative Proportion 0.9598 0.9776 0.9909 1.00000

Principal components on possums Loadings: Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 hdlngth 0.413 0.282 0.339 -0.185 0.695 skullw 0.296 0.269 0.540 -0.338 -0.519 totlngth 0.518 0.315 -0.648 -0.156 taill 0.251 -0.350 -0.194 footlgth 0.514 -0.468 -0.336 earconch 0.309 -0.650 0.249 eye chest 0.219 0.175 0.174 -0.177 belly 0.246 0.178 0.134 0.891 Comp.6 Comp.7 Comp.8 Comp.9 hdlngth 0.277 -0.184 skullw -0.276 0.259 0.112 totlngth -0.226 -0.145 0.336 taill 0.437 -0.753 0.106 footlgth 0.633 earconch -0.584 0.208 -0.172 eye 0.195 0.242 0.942 chest -0.189 -0.763 -0.404 0.267 belly -0.102 0.239 0.144

Principal components on possums Cambarville Whian Whian Conondale ● Bulburin Bellbird Byrangery ● Allyn River 10 2nd Principal Component ● ● ● ● ● 5 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 ● ● ● ● ● ● −5 −10 −15 −10 −5 0 5 10 15 1st Principal Component

Uses of principal components ◮ Description of patterns in high-dimensional data ◮ Direct interpretation of components ◮ Graphical display using components ◮ Grouping/clustering ◮ Transformation for subsequent statistical analysis ◮ Use components as explanatory variables in regression ◮ Good for summarizing the effects of many covariates ◮ Avoid problems with multicollinearity ◮ Use first component as response variable in regression

Multidimensional scaling We have seen how to use principal components analysis to display multivariate information in fewer dimensions. ◮ Principal components analysis is a specific version of a more general class of methods called multidimensional scaling (MDS) ◮ In MDS, we take multivariate data and display them in fewer dimensions, doing our best to maintain the distance between points ◮ Classical MDS with Euclidean distance is equivalent to the principal components representation ◮ However, we can extend the lower-dimensional representation in two ways: 1. Use a different distance (or dissimilarity ) metric. 2. Use a different criteria for ordination (display of objects).

Ordination methods ◮ Classical MDS ◮ Distances are treated as Euclidean. ◮ Find the lower-dimensional representation that best preserves distances. ◮ Sammon method ◮ Similar to classical MDS. ◮ Minimize weighted sum of squared differences between dissimilarities and representation distances. ◮ Weights are proportional to dissimilarities (more dissimilar = more weight). ◮ Kruskal’s non-metric MDS ◮ Dissimilarities are allowed a monotonic transformation ◮ Only the ranks of the dissimilarities matter � � i ( d i − r i ) 2 ◮ Minimize stress S = where � d 2 i ◮ d i are the input dissimilarities (transformed) ◮ r i are the output representation (Euclidean) distances

MDS example Data are for 47 swiss provinces circa 1888 (undergoing demographic transition). Variables are proportion of population (agricultural, education, religion, infant mortality,...). Swiss provincial data ca. 1888: Sammon Swiss provincial data ca. 1888: Kruskal ● ● ● ● ● ● ● ● ● ● ● ● 20 ● ● ● 20 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 ● ● 0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● −20 ● ● −20 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● −40 ● −40 −60 −60 ● ● −60 −40 −20 0 20 40 −60 −40 −20 0 20 40

Multivariate analysis DAAG Chapter 12 Learning objectives In this - PowerPoint PPT Presentation

Multivariate analysis DAAG Chapter 12 Learning objectives In this section, we will learn some basic approaches to multivariate analysis. Principal components analysis What is principal components analysis? What does principal

Outline Multivariate Data 1 Multivariate Parametric Methods Multivariate Normal Distribution 2

Multivariate t-distributions Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Reading multivariate data Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Multivariate Ordination Analyses: Principal Component Analysis Dilys Vela Tatiana Boza Tatiana

Multivariate Linear Regression Max Turgeon STAT 4690Applied Multivariate Analysis

Multivariate Normal Distribution Max Turgeon STAT 4690Applied Multivariate Analysis Building

Multivariate Analysis of Variance Max Turgeon STAT 4690Applied Multivariate Analysis Quick

Regression Diagnostics and the Forward Search 3. A Single Multivariate Sample Anthony Atkinson,

Robust Statistics Part 2: Multivariate location and scatter Peter Rousseeuw LARS-IASC School,

Advanced PHP Dr. Steven Bitner A/B and Multivariate testing Why use multivariate testing If

Multivariate normal distribution Surajit Ray Reader, University of Glasgow DataCamp

Multivariate Data Analysis in Omics Research Diverging Alternative Splicing Fingerprints

Principal Component Analysis Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Principal Component Analysis Powerpoint Presentation What is multivariate analysis? Summarizing

Analyzing spatial multivariate structures St ephane Dray Univ. Lyon 1 CARME 2011, Rennes

Multivariate Control Charts Stat 3570 28 Feb, 2013 1 / 13 Multivariate Control Charts In

Chocola late Fla lavor Ren enata Ja Januszewska Glo lobal R&D Sen ensory Manager

1 Relational Database Domains SNOOPYFAMILY

QQ ca cnn.com IT serves How connect to nearest edge TEJA CA 2 c DNS Based End If ACA

CS 4803 A block cipher E is a collection of functions from n bits to n bits. Each function is

Conversation is a Joint Activity (1) 1. Gracie: Oh yeah . . . and then Mr. and Mrs. Jones were

!"#$%&'(&)&#*&)'$%'+,&'-./0'10'!&2$314

RESPOSTA: ADIO/CAUSA/TEMPO/CONTRASTE/SOMA 2 RESOLUO DE QUESTES CONJUNCTIONS

COMMONWEALTH OF THE BAHAMAS COMMONWEALTH OF THE BAHAMAS MINISTRY OF PUBLIC WORKS & TRANSPORT

Sambuz

Useful Links

Newsletter

Mail Us

Multivariate analysis DAAG Chapter 12 Learning objectives In this - PowerPoint PPT Presentation

Multivariate analysis DAAG Chapter 12 Learning objectives In this section, we will learn some basic approaches to multivariate analysis. Principal components analysis What is principal components analysis? What does principal

Outline Multivariate Data 1 Multivariate Parametric Methods Multivariate Normal Distribution 2

Multivariate t-distributions Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Reading multivariate data Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Multivariate Ordination Analyses: Principal Component Analysis Dilys Vela Tatiana Boza Tatiana

Multivariate Linear Regression Max Turgeon STAT 4690Applied Multivariate Analysis

Multivariate Normal Distribution Max Turgeon STAT 4690Applied Multivariate Analysis Building

Multivariate Analysis of Variance Max Turgeon STAT 4690Applied Multivariate Analysis Quick

Regression Diagnostics and the Forward Search 3. A Single Multivariate Sample Anthony Atkinson,

Robust Statistics Part 2: Multivariate location and scatter Peter Rousseeuw LARS-IASC School,

Advanced PHP Dr. Steven Bitner A/B and Multivariate testing Why use multivariate testing If

Multivariate normal distribution Surajit Ray Reader, University of Glasgow DataCamp

Multivariate Data Analysis in Omics Research Diverging Alternative Splicing Fingerprints

Principal Component Analysis Surajit Ray Reader, University of Glasgow DataCamp Multivariate

Principal Component Analysis Powerpoint Presentation What is multivariate analysis? Summarizing

Analyzing spatial multivariate structures St ephane Dray Univ. Lyon 1 CARME 2011, Rennes

Multivariate Control Charts Stat 3570 28 Feb, 2013 1 / 13 Multivariate Control Charts In

Chocola late Fla lavor Ren enata Ja Januszewska Glo lobal R&amp;D Sen ensory Manager

1 Relational Database Domains SNOOPYFAMILY

QQ ca cnn.com IT serves How connect to nearest edge TEJA CA 2 c DNS Based End If ACA

CS 4803 A block cipher E is a collection of functions from n bits to n bits. Each function is

Conversation is a Joint Activity (1) 1. Gracie: Oh yeah . . . and then Mr. and Mrs. Jones were

!&quot;#$%&amp;'(&amp;)&amp;#*&amp;)'$%'+,&amp;'-./0'10'!&amp;2$314

RESPOSTA: ADIO/CAUSA/TEMPO/CONTRASTE/SOMA 2 RESOLUO DE QUESTES CONJUNCTIONS

COMMONWEALTH OF THE BAHAMAS COMMONWEALTH OF THE BAHAMAS MINISTRY OF PUBLIC WORKS &amp; TRANSPORT

Sambuz

Useful Links

Newsletter

Mail Us

Chocola late Fla lavor Ren enata Ja Januszewska Glo lobal R&D Sen ensory Manager

!"#$%&'(&)&#*&)'$%'+,&'-./0'10'!&2$314

COMMONWEALTH OF THE BAHAMAS COMMONWEALTH OF THE BAHAMAS MINISTRY OF PUBLIC WORKS & TRANSPORT