Two-Sample Experimental Designs 707.031: Evaluation Methodology - - PowerPoint PPT Presentation

two sample experimental designs
SMART_READER_LITE
LIVE PREVIEW

Two-Sample Experimental Designs 707.031: Evaluation Methodology - - PowerPoint PPT Presentation

Two-Sample Experimental Designs 707.031: Evaluation Methodology Winter 2015/2016 Eduardo Veas Modelling 2 General statistical model Outcome = (model) / error 3 Calculating if a score will occur z scores: conversion to normal


slide-1
SLIDE 1

Two-Sample Experimental Designs

707.031: Evaluation Methodology Winter 2015/2016

Eduardo Veas

slide-2
SLIDE 2

Modelling

2

slide-3
SLIDE 3

General statistical model

Outcome = (model) / error

3

slide-4
SLIDE 4

Calculating if a score will occur

  • z scores: conversion to normal distribution with

a mean of 0 and sd =1.

  • z = (X - X’) / sd
  • z-scores: 1.96 cuts 2.5% off the top.
  • 2.58 cuts 0.5% off the top
  • 3.29 cuts 0.05% off the top

4

slide-5
SLIDE 5

Beyond the data

5

Advanced descriptive stats

slide-6
SLIDE 6

Beyond the data

6

slide-7
SLIDE 7

Advanced descriptive statistics

  • Goal:
  • establish relationships between circumstances

and behaviors

  • Fit these relationships into an orderly body of

knowledge

  • we want to say something about the world

7

slide-8
SLIDE 8

8

slide-9
SLIDE 9

Advanced descriptive statistics

  • sampling variation: different samples will have

means that differ from the population

  • sampling distribution: the frequency distribution
  • f the sample means from the same population
  • standard error= standard deviation of sample

means represents h ow well the sample represents the population

9

slide-10
SLIDE 10

Advanced descriptive statistics

  • standard error of the mean (SE): the standard

deviation of sample means

  • samples > 30
  • samples < 30 (t-distribution)

10

slide-11
SLIDE 11

Advance descriptive statistics

  • effect size: how important is the effect?
  • Cohen’s d
  • Pearson correlation coefficient r
  • .1 (small effect) explains 1% of the total

variance

  • .3 (med effect) explains 9% of the total variance
  • .5 (large effect) explains 25% of the total var.

11

slide-12
SLIDE 12

Comparing two means

12

slide-13
SLIDE 13

Comparing two means

13

slide-14
SLIDE 14

null hypothesis

H0: the difference between condition A and condition B can be attributed to chance

14

slide-15
SLIDE 15

t-test

  • compare whether a correlation coefficient is

different from 0

  • compare whether a regression coefficient is

different from 0

  • compare whether two group means are

different

15

slide-16
SLIDE 16

t-test experimental design

16

faster interaction input (multitouch) input (mouse+kbd) INDEPENDENT DEPENDENT H1: People will be faster at selecting and resizing a target with multitouch than with mouse+kbd Q: Are people actually faster with multitouch than with mouse and keyboard?

slide-17
SLIDE 17

t-test experimental design

  • one independent variable with two levels and a

measurable dependent variable

  • often we compare one condition of the

independent variable against a baseline condition

  • Is the movie scream 2 scarier than the original

scream? People who take 707.031 perform better experiments than those who don’t.

17

slide-18
SLIDE 18

t-test rationale

  • two samples are collected
  • their means can differ by little or lot
  • assumption: if the samples come from the same

population, their means will be roughly similar

  • procedure: compare difference between

collected sample means and expected sample means with the standard error

  • larger difference -> more confidence

18

slide-19
SLIDE 19

t-test rationale

19

t = Observed difference between sample means Expected difference between population means Estimate of the standard error

  • f the difference between two

sample means

slide-20
SLIDE 20

t-test experimental design

  • independent-means t-test: for independent

measures designs

  • dependent-means t-test: for repeated measures

desings

20

slide-21
SLIDE 21

independent t-test: assumptions

  • the distribution is normally distributed
  • data are measured at least at interval level
  • scores in different conditions are independent
  • homogeneity of variance *

21

slide-22
SLIDE 22

independent t-test equation

22

t = Observed difference between sample means Expected difference between population means Estimate of the standard error

  • f the difference between two

sample means

slide-23
SLIDE 23

independent t-test equation

23

slide-24
SLIDE 24

independent t-test reporting

An independent samples t-test indicated that the difference in anxiety experienced from real spiders (M=47.0, SE=3.18) and from a pictures of a spider (M=40.0, SE=2.68) was not significant t(21.39)=

  • 1.68, p>.05, albeit representing a medium sized

effect r=.34

24

slide-25
SLIDE 25

dependent t-test assumptions

  • the distribution is normally distributed
  • data are measured at least at interval level

25

slide-26
SLIDE 26

dependent t-test equation

26

t = Observed difference between sample means Expected difference between population means standard error

  • f the differences
slide-27
SLIDE 27

dependent t-test equation

27

slide-28
SLIDE 28

dependent t-test result

  • df = N-1
  • p = the exact probability that a value of t could
  • ccur if the null hypothesis were true.
  • t : when positive means that the first condition

had larger mean than the second

28

slide-29
SLIDE 29

dependent t-test reporting

A paired samples t-test revealed that on average, participants experienced significantly greater anxiety from real spiders (M=47.0, SE=3.18), than from pictures of spiders (M=40.0, SE=2.68), t(11)=2.47, r=.60.

29

slide-30
SLIDE 30

Non-parametric tests

30

When assumptions failed

slide-31
SLIDE 31

Mann-Whitney U / Wilcoxon summed rank

31

for independent measures

slide-32
SLIDE 32

Mann-Whitney U / Wilcoxon summed rank

  • #1: no difference
  • rank the data ignoring the groups
  • expect a similar number of high and low

values

  • #2: if there’s difference
  • ranking the data
  • expect higher ranks in one of the groups.

32

slide-33
SLIDE 33

Mann-Whitney U / Wilcoxon summed rank

  • sort all scores in ascending order
  • rank them (1~n)
  • tied ranks (where scores are the same) are

averaged

  • add ranks for each group
  • subtract the mean rank = N(N+1) / 2
  • W = sum of ranks - mean rank
  • calculate p-values (Monte Carlo or

approximation)

33

slide-34
SLIDE 34

Mann-Whitney U / Wilcoxon summed rank

34

slide-35
SLIDE 35

Mann-Whitney U / Wilcoxon summed rank

Depression levels in ecstasy users (Mdn 17.50) did not differ significantly from alcohol users (Mdn=16.00) the day after the drugs were taken, W=35.5, p=.286, r=-.25. However, by Wed, ecstasy users (Mdn=33.5) were significantly more depressed than alcohol users (Mdn=7.5), W=4, p<.001, r=-.78.

35

slide-36
SLIDE 36

Wilcoxon signed-rank test

36

for repeated measures

slide-37
SLIDE 37

Wilcoxon signed-rank test

  • based upon differences between scores in the

two conditions

  • rank the differences
  • if the difference is 0, data are excluded
  • get sum of negative and positive ranks for each

condition

37

slide-38
SLIDE 38

Wilcoxon signed-rank test

  • use mean (T) and SE to calculate significance
  • both are functions of the sample size
  • convert the test statistics to a z-score
  • if the values are bigger than 1.96, then the test is

significant at p<0.05

38

slide-39
SLIDE 39

Wilcoxon signed-rank test: reporting

A Wilcoxon signed rank test revealed that for ecstasy users, depression levels were significantly higher on Wed(Mdn=33.50) than on Sun (Mdn=17.50, p=.047, r=-.56). However, for alcohol users, the opposite was the opposite was true, depression levels were significantly lower on Wed(Mdn=7.50) than on Sun (Mdn=16.0, p=.012, r= -.45).

39

slide-40
SLIDE 40

Readings

  • Discovering statistics using R (Andy Field, Jeremy

Miles, Zoe Field)

40