multivariate analysis of variance
play

Multivariate Analysis of Variance (MANOVA) Consider Univariate - PowerPoint PPT Presentation

Find a significant difference between groups Multivariate Analysis of Variance (MANOVA) Consider Univariate ANOVA Used when you have 3 or more samples C B A 508 514.25 583.25 727.5


  1. Find a significant difference between groups Multivariate Analysis of Variance (MANOVA)

  2. Consider Univariate ANOVA Used when you have 3 or more samples 𝑦 𝐡𝐢𝐷 C B A 508 514.25 583.25 727.5 𝑦 𝑑 𝑦 𝐢 𝑦 𝐡 𝐼 𝑝 : 𝜈 𝐡 = 𝜈 𝐢 = 𝜈 𝐷 𝐼 𝑏 : 𝜈 𝐡 β‰  𝜈 𝐢 β‰  𝜈 𝐷 The alternative could be true because all the means are different or just one of them is different than the others If we reject the null hypothesis we need to perform some further analysis to draw conclusions about which population means differ from the others and by how much

  3. Consider Univariate ANOVA Used when you have 3 or more samples 𝑦 𝐡𝐢𝐷 C B A SIGNAL NOISE 508 514.25 583.25 727.5 𝑦 𝑑 𝑦 𝐢 𝑦 𝐡 𝐺 = π‘‘π‘—π‘•π‘œπ‘π‘š 𝐺 = π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ π‘œπ‘π‘—π‘‘π‘“ π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ π‘œ π‘œ 𝑦 𝑗 βˆ’ 𝑦 𝐡𝑀𝑀 2 π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑗 𝑗 𝑗 π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ = π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ = βˆ— 𝑠 π‘œ π‘œ βˆ’ 1 A large F-value indicates a significant difference

  4. Consider Univariate ANOVA Used when you have 3 or more samples 𝑦 𝐡𝐢𝐷 C B A SIGNAL NOISE 508 514.25 583.25 727.5 𝑦 𝑑 𝑦 𝐢 𝑦 𝐡 βˆ— 4 = 727.5 βˆ’ 583.25 2 + 514.25 βˆ’ 583.25 2 + 508 βˆ’ 583.25 2 𝐡,𝐢,𝐷 𝑦 𝑗 βˆ’ 𝑦 𝐡𝐢𝐷 2 π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ = 𝑗 βˆ— 4 3 βˆ’ 1 2 π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ = πŸ•πŸ‘πŸ“πŸ•πŸ’. πŸ‘πŸ” π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ = 𝑀𝑏𝑠 𝐡 + 𝑀𝑏𝑠 𝐢 + 𝑀𝑏𝑠 = 891.6667 + 819.3333 + 305.5833 𝐷 3 3 π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ = πŸ•πŸ–πŸ‘. πŸπŸ˜πŸ“πŸ’ 𝐺 = π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ = 62463.25 One-way ANOVA in R: 672.1943 = πŸ˜πŸ‘. πŸ˜πŸ‘πŸ“πŸ’πŸ˜ anova(lm(YIELD~VARIETY))

  5. F-Distribution (family of distributions- shape is dependent on degrees of freedom) π‘‘π‘—π‘•π‘œπ‘π‘š < π‘œπ‘π‘—π‘‘π‘“ π‘‘π‘—π‘•π‘œπ‘π‘š > π‘œπ‘π‘—π‘‘π‘“ 𝐺 = π‘‘π‘—π‘•π‘œπ‘π‘š 𝐺 = π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ π‘œπ‘π‘—π‘‘π‘“ π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ Probability of observation π‘œ 𝑦 𝑗 βˆ’ 𝑦 𝐡𝑀𝑀 2 π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑐𝑓𝑒π‘₯π‘“π‘“π‘œ = 𝑗 βˆ— π‘œπ‘π‘π‘žπ‘’ π‘œ βˆ’ 1 π‘œ π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ π‘₯π‘—π‘’β„Žπ‘—π‘œ = π‘€π‘π‘ π‘—π‘π‘œπ‘‘π‘“ 𝑗 𝑗 π‘œ ∝= 0.05 ∞ In R: In R: qf ( p, 𝑒𝑔 1 , 𝑒𝑔 2 ) pf ( F, 𝑒𝑔 1 , 𝑒𝑔 2 ) P-value (percentiles, probabilities) Present 1-p-value 0 0.50 0.95 The larger the F-value the further into the tail – AND the smaller the probability that the calculated F- value was found by chance, MEANING there is a high probability that something is causing a significant difference between the groups

  6. Using DISCRIM to predict which group Problem : A new skull is found but we don’t know whether it belongs to homo erectus or homo habilis or if it’s a new group? Homo erectus Skull measurement Homo habilis Group centroid New find (unknown origin) Popular method in taxonomy and anthropology How predictions work: 1. Calculate group centroid 2. Find out which centroid is the closest position to the unknown data point New groups are defined when we find a significant difference between new find and predefined groups

  7. Multivariate Analysis of Variance (MANOVA) Is there a significant difference among groups based on multiple response variables? (e.g. ANOVA with multiple response variables) When we calculate a centroid of a group Skull measurement you build a probability distribution around the centroid for comparison You can the run repeated t-tests (with adjusted p-values for multiple comparisons) to compare the new data to the groups but MANOVA does it all for you in one shot! MANOVA in R: output=manova(responseMatrix~predictorMatrix) (stats package) Another lab on MANOVA for reference: Laura’s website, RENR 480, Lab 22

  8. Assumptions of (MANOVA) MANOVA is VERY sensitive to invalid assumptions and outliers Within groups we need to have: 1. Normality: Residuals have to be normally distributed 2. Homogeneity of variances: residuals need to have equal variances Need to meet the assumption in the univariate context to meet them for multivariate analyses You therefore first have to check each individual measurement (response variable) for normality and homogeneity e.g. By making boxplots or plotting ANOVA residuals for each variable

  9. Assumptions of (MANOVA) Boxplots in R (multiple plots): boxplot(ResponseVariable~Group) Generate boxplots for each response variable and assess shape & whiskers Frequency Left skewed Normal Right skewed Median negatively skewed perfectly symmetric positively skewed Represented as a Mean boxplot Mode Mode Bi-Modal Frequency Two different modes Not necessarily symmetric Mean Median

  10. Assumptions of (MANOVA) Residual plots in R (multiple plots): plot(lm(ResponseVariable~Group))(2 nd plot) Testing for Normality & Equal Variances – Residual Plots Predicted values β€’ NORMAL distribution: equal number of points along observed β€’ EQUAL variances: equal spread on either side of the mean predicted value =0 0 β€’ Good to go! Observed (original units) Predicted values β€’ NON-NORMAL distribution: unequal number of points along observed β€’ 0 EQUAL variances: equal spread on either side of the mean predicted value =0 β€’ Optional to fix Observed (original units) Predicted values β€’ NORMAL/NON NORMAL: look at histogram or test β€’ 0 UNEQUAL variances: cone shape – away from or towards zero β€’ This needs to be fixed for MANOVA (transformations) Observed (original units) Predicted values 0 β€’ OUTLIERS: points that deviate from the majority of data points β€’ This needs to be fixed for MANOVA (transformations or removal) Observed (original units)

  11. Assumptions of (MANOVA) If you violate the assumptions of MANOVA: 1. Transform your data (follow examples we will discuss on the board) 2. Use non-parametric options (e.g. perMANOVA Lab 6)

  12. Multivariate Analysis of Variance (MANOVA) - output You can see if there is a significant difference across all predictor variables using the Wilk’s MANOVA test statistic Or you can see if there is a significant difference among groups for each predictor variable separately P-value – the probability the observed difference between groups or larger is due to random chance Thus if p-value is small this means that something is having an effect on the groups causing the difference

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend