Data Science in the Wild Lecture 8: Advanced Experimental Analysis - PowerPoint PPT Presentation

Data Science in the Wild Lecture 8: Advanced Experimental Analysis Eran Toch Data Science in the Wild, Spring 2019 � 1

Types of Tests • Parametric vs. Non- Parametric • Difference vs. Correlation • Categorical vs. Differential • Number of samples Data Science in the Wild, Spring 2019 � 2

Agenda 1. Introduction 2. ANOVA 3. Post-hoc tests 4. Correlation tests 5. Sampling Data Science in the Wild, Spring 2019 � 3

Analysis of Variance - ANOVA Data Science in the Wild, Spring 2019 � 4

Why not t-tests? • Every time you conduct a t-test there is a chance that you will make a Type I error with a probability of α = 0.05 • By running two t-tests on the same data you will have increased your chance of making a mistake to about 0.1 • ANOVA controls for these errors, keeping the confidence level to 0.95 Data Science in the Wild, Spring 2019 � 5

Example If you are comparing 3 groups (A, B, C), than you can do a 3 total comparisons • A – B A – C B – C The experiment-wise error rate without any adjustments would be: • α e = 1 - (1- α ) c = 1 – (1-.05) 3 = 1 - .95 3 = 1 - 0.86 = .14 Data Science in the Wild, Spring 2019 � 6

ANOVA • ANOVA will tell us if one No alcohol 1 unit 2 units 5 units condition is significantly Frequency different to one or more of the others • But it won’t tell us which conditions are different Reaction time (ms) • We can compare one (or more) against one (or more) of the others Data Science in the Wild, Spring 2019 � 7

ANOVA • Analysis of variance (ANOVA) is used to determine whether groups of data are the same or different • It incorporates means and variances to determine its test statistics, called the F-ratio • What is the null hypothesis? • H 0: x 1 = x 2 = x 3 = x 4 = … x k (x - group mean, k - number of groups) Data Science in the Wild, Spring 2019 � 8

Conditions • The dependent variable is normally distributed in each group • Homogeneity of variances: • The variance in each group should be similar enough. • For example, using the Bartlett test   • Data type: The dependent variable must be interval or ratio (e.g., time or error rates) Data Science in the Wild, Spring 2019 � 9

Analysis of Variance (ANOVA) Area of non-overlap (hypothesis true) F-ratio = Area of overlap (hypothesis false) 1 unit 2 units 5 units 10 units Frequency No alcohol • Large “F” means significant differences • Large “F” means evidence in support of hypothesis • We need to calculate the size of all these Reaction time (ms) areas Data Science in the Wild, Spring 2019 � 10

F ratio SS MS = ‣ Mean square df ‣ MS error - the variance not accounted for by the variable MS within F = ‣ F ratio is a variance ratio or MS between ‘signal to noise’ ratio ‣ Large F means large Where: differences accounted for by MS - mean square the variable SS - sum of squares df - degrees of freedom Data Science in the Wild, Spring 2019 � 11

One-way Anova One-way ANOVA is used to determine whether there are any statistically significant • differences between the means of three or more independent groups Suits a simple between-subject design with one independent variable • Participant Condition Values 1 Mouse 1 2 Mouse 2 3 Mouse 2 4 Touch 5 5 Touch 6 6 Touch 5 7 Speech 2 8 Speech 1 Data Science in the Wild, Spring 2019 � 12

Model Y ij = µ +A i + 𝜁 ij An observation Y ij is given by the average performance of the users (µ), the • effect of the treatment (A i ) and an error for each participant and condition 𝜁 ij Our goal is to test if the hypothesis   • A 1 = A 2 = A 3 = A 4 = … A k = 0 is plausible Data Science in the Wild, Spring 2019 � 13

Calculation • Means: M mouse = (1 + 2 + 2) / 3 = 1.667 • M touch = (5 + 6 + 5) / 3 = 5.33 • M Speech = (1 + 2) / 2 = 1.5 • • The grand mean is calculated as follows: • µ ^ = (1 + 2 ++ 2 + 5 + 6 + 5 + 2 + 1) / 8 = 3 Data Science in the Wild, Spring 2019 � 14

Estimated Effect • The estimated effects, A ^i , are the difference between the estimated overall mean and the estimated treatment mean: A ^i = M i - µ ^ • Therefore, we get: • A mouse = 1.667 - 3 = -1.33 • A touch = 5.333 - 3 = 2.333 • A Speech = 1.5 - 3 = -1.5 Data Science in the Wild, Spring 2019 � 15

Degrees of Freedom • Calculating the degrees of freedom (just minus 1, actually) • df between = 3 -1 = 2 • df within = 8 - 1 = 7 Data Science in the Wild, Spring 2019 � 16

Sum of Squares • SS between : Sum of squares between conditions ∑ A ^i2 · #measures = (-1.33) 2 * 3 + (2.33) 2 * 3 + (1.4) 2 * 2 = 26.17 • SS within : Sum of squares within conditions ∑ i ∑ j (y ij - y i )2 = [(1-1.667) 2 + (2 - 1.6667) 2 + (2 - 1.6667) 2 ] + [0.667] + [0.5] = 1.83 Data Science in the Wild, Spring 2019 � 17

Calculating the Mean Square • MS = SS / df • MS between = SS between = 26.17 / 2 = 13.08 df between • MS within = SS within = 1.83 / 5 = 0.37 df within • F = MS between = 13.08 / 0.37 = 35.68 MS within Data Science in the Wild, Spring 2019 � 18

Interpretation • The F − value says us how far away we are from the hypothesis of indistinguishability between the error and the conditions (treatment) • A large F-value implies that the effect of the treatment (conditions) is relevant • We calculate the critical value for the level α = 5% with degrees of freedom 2 and 5. • p = 0.011 => We can reject the hypothesis that A mouse = A touch = A speech = 0 Data Science in the Wild, Spring 2019 � 19

Python Code from scipy import stats F, p = stats.f_oneway(d_data['ctrl'], d_data['trt1'], d_data['trt2']) Data Science in the Wild, Spring 2019 � 20

Factorial ANOVA Observation Gender Dosage Alertness 8 1 m a 2 m a 12 3 m a 13 Factorial ANOVA (two-way) • 4 m a 12 measures whether a combination 5 m b 6 of independent variables predict 6 m b 7 the value of a dependent variable 7 m b 23 8 m b 14 Suits between-group design, with • 9 f a 15 multiple conditions 10 f a 12 11 f a 22 12 f a 14 13 f b 15 14 f b 12 15 f b 18 16 f b 22 https://personality-project.org/r/r.guide.html Data Science in the Wild, Spring 2019 � 21

Python Code formula = 'len ~ C(supp) + C(dose) + C(supp):C(dose)' model = ols(formula, data).fit() aov_table = anova_lm(model, typ=2) from pyvttbl import DataFrame df=DataFrame() df.read_tbl(datafile) df['id'] = xrange(len(df['len'])) print(df.anova('len', sub='id', bfactors=['supp', 'dose'])) https://www.marsja.se/three-ways-to-carry-out-2-way-anova-with-python/ Data Science in the Wild, Spring 2019 � 22

Visualization Interaction Plot Data Science in the Wild, Spring 2019 � 23

Repeated Measure ANOVA Observation Subject Valence Recall In repeated measure ANOVA, we test • 1 Jim Neg 32 the same entity in several conditions 2 Jim Neu 15 One independent variable: one • 3 Jim Pos 45 way 4 Victor Neg 30 Several independent variables: • 5 Victor Neu 13 two way 6 Victor Pos 40 7 Faye Neg 26 Suits a within-subject study with • 8 Faye Neu 12 multiple conditions 9 Faye Pos 42 The design should be balanced: • 10 Ron Neg 22 without missing values in some 11 Ron Neu 10 conditions 12 Ron Pos 38 13 Jason Neg 29 aov = df.anova('rt', sub='Sub_id', wfactors=['condition']) 14 Jason Neu 8 print(aov) 15 Jason Pos 35 Data Science in the Wild, Spring 2019 � 24

Visual Representation Data Science in the Wild, Spring 2019 � 25

Kruskal-Wallis rank sum test We can use the Kruskal-Wallis rank sum test to compare the means of non-parametric groups 1 # Kruskal-Wallis H-test 2 from numpy.random import seed 3 from numpy.random import randn 4 from scipy.stats import kruskal 5 # seed the random number generator 6 seed(1) 7 # generate three independent samples 8 data1 = 5 * randn(100) + 50 9 data2 = 5 * randn(100) + 50 10 data3 = 5 * randn(100) + 52 11 # compare samples 12 stat, p = kruskal(data1, data2, data3) 13 print('Statistics=%.3f, p=%.3f' % (stat, p)) Data Science in the Wild, Spring 2019 � 26

Summary • ANOVA uses general analysis of variance to • F-value as the main inferential statistics • One way / two way / repeated measures Data Science in the Wild, Spring 2019 � 27

Post-Hoc Tests Data Science in the Wild, Spring 2019 � 28

Limits of ANOVA • Analysis of variance just tells us there is at least one level that is significantly different than the other • It does not tell us which level is different and how • t-tests would not keep the alpha level in the confidence interval Data Science in the Wild, Spring 2019 � 29

Types of Post-Hoc Tests • Fisher's least significant difference (LSD) • The Bonferroni procedure • Holm–Bonferroni method • Tukey's procedure • And many more… Data Science in the Wild, Spring 2019 � 30

Data Science in the Wild Lecture 8: Advanced Experimental Analysis - PowerPoint PPT Presentation

Data Science in the Wild Lecture 8: Advanced Experimental Analysis Eran Toch Data Science in the Wild, Spring 2019 1 Types of Tests Parametric vs. Non- Parametric Difference vs. Correlation Categorical vs. Differential

Wild Horse and Burro Roundtable Wild Horse and Burro Roundtable Wild Horse and Burro Roundtable

Data Science in the Wild Lecture 1: Introduction Eran Toch Data Science in the Wild, Spring 2019

Literacy Activity Wild Animal Habitat What is your favourite wild animal? Where do wild animals

Data Science in the Wild Lecture 6: Running Experiments Eran Toch Data Science in the Wild,

Data Science in the Wild Lecture 9: Sampling Eran Toch Data Science in the Wild, Spring 2019

Data Science in the Wild Lecture 12: Memory-Based Data Warehouses Eran Toch Data Science in the

ETC5512: Wild Caught Data ETC5512: Wild Caught Data Week 7 Week 7 Census and Election Data

ETC5512: Wild Caught Data ETC5512: Wild Caught Data Week 12 Week 12 The proper care and feeding

Data Science in the Wild Lecture 5: ETL - Extract, Transform, Load - 2 Eran Toch Data Science

Data Science in the Wild Lecture 14: Explaining Models Eran Toch Data Science in the Wild,

Data Science in the Wild Lecture 7: Analyzing Experiments Eran Toch Data Science in the Wild,

ETC5512: Wild Caught Data ETC5512: Wild Caught Data Week 1 Week 1 Data collection Lecturer:

Wild Horse Tourism in NM Wild Horse Tourism in NM How the Jicarilla Ranger District of the Carson

Sushi Gone Wild: Skit & Music Details for the flight of The Wild Sushi Adrienne Chan

Wild Atlantic Way Update 2016 Presented by Suzanne Trehy Client Services Manager The Wild

Physics 2D Lecture Slides Oct 13 Vivek Sharma UCSD Physics Quiz 2 : Wild Wild West got a Bit

Outline 1

Truncations of unitary matrices and Brownian bridges Alain Rouault (Laboratoire de

Generalization Error of Generalized Linear Models in High Dimensions Melika Emami 1 , Mojtaba

Steins Method for Matrix Concentration Lester Mackey Collaborators: Michael I. Jordan ,

Variational Model Selection for Sparse Gaussian Process Regression Michalis K. Titsias School of

Representing Images and Sounds Class 4. 3 Sep 2009 Instructor: Bhiksha Raj Representing an

Investigating the security properties of MACs based on stream ciphers Leonie Simpson, Mufeed Al

Modeling and Verification with SPIN Wishnu Prasetya wishnu@cs.uu.nl www.cs.uu.nl/docs/vakken/pv

Data Science in the Wild Lecture 8: Advanced Experimental Analysis - PowerPoint PPT Presentation

Data Science in the Wild Lecture 8: Advanced Experimental Analysis Eran Toch Data Science in the Wild, Spring 2019 1 Types of Tests Parametric vs. Non- Parametric Difference vs. Correlation Categorical vs. Differential

Wild Horse and Burro Roundtable Wild Horse and Burro Roundtable Wild Horse and Burro Roundtable

Data Science in the Wild Lecture 1: Introduction Eran Toch Data Science in the Wild, Spring 2019

Literacy Activity Wild Animal Habitat What is your favourite wild animal? Where do wild animals

Data Science in the Wild Lecture 6: Running Experiments Eran Toch Data Science in the Wild,

Data Science in the Wild Lecture 9: Sampling Eran Toch Data Science in the Wild, Spring 2019

Data Science in the Wild Lecture 12: Memory-Based Data Warehouses Eran Toch Data Science in the

ETC5512: Wild Caught Data ETC5512: Wild Caught Data Week 7 Week 7 Census and Election Data

ETC5512: Wild Caught Data ETC5512: Wild Caught Data Week 12 Week 12 The proper care and feeding

Data Science in the Wild Lecture 5: ETL - Extract, Transform, Load - 2 Eran Toch Data Science

Data Science in the Wild Lecture 14: Explaining Models Eran Toch Data Science in the Wild,

Data Science in the Wild Lecture 7: Analyzing Experiments Eran Toch Data Science in the Wild,

ETC5512: Wild Caught Data ETC5512: Wild Caught Data Week 1 Week 1 Data collection Lecturer:

Wild Horse Tourism in NM Wild Horse Tourism in NM How the Jicarilla Ranger District of the Carson

Sushi Gone Wild: Skit &amp; Music Details for the flight of The Wild Sushi Adrienne Chan

Wild Atlantic Way Update 2016 Presented by Suzanne Trehy Client Services Manager The Wild

Physics 2D Lecture Slides Oct 13 Vivek Sharma UCSD Physics Quiz 2 : Wild Wild West got a Bit

Outline 1

Truncations of unitary matrices and Brownian bridges Alain Rouault (Laboratoire de

Generalization Error of Generalized Linear Models in High Dimensions Melika Emami 1 , Mojtaba

Steins Method for Matrix Concentration Lester Mackey Collaborators: Michael I. Jordan ,

Variational Model Selection for Sparse Gaussian Process Regression Michalis K. Titsias School of

Representing Images and Sounds Class 4. 3 Sep 2009 Instructor: Bhiksha Raj Representing an

Investigating the security properties of MACs based on stream ciphers Leonie Simpson, Mufeed Al

Modeling and Verification with SPIN Wishnu Prasetya wishnu@cs.uu.nl www.cs.uu.nl/docs/vakken/pv

Sushi Gone Wild: Skit & Music Details for the flight of The Wild Sushi Adrienne Chan