Sta$s$cs & Experimental Design with R Barbara - PowerPoint PPT Presentation

Sta$s$cs ¡& ¡Experimental ¡Design ¡ with ¡R ¡ Barbara ¡Kitchenham ¡ Keele ¡University ¡ 1 ¡

Comparing ¡two ¡or ¡more ¡groups ¡ Part ¡5 ¡ 2 ¡

Aim ¡ • To ¡cover ¡standard ¡approaches ¡for ¡independent ¡ and ¡dependent ¡groups ¡ – For ¡two ¡groups ¡ • Student’s ¡“t” ¡test ¡(parametric) ¡ • Mann-‑Whitney ¡Wicoxon ¡(non-‑parametric) ¡ – For ¡mul$ple ¡groups ¡ • ANOVA ¡ • Kruskal-‑Wallis ¡ • To ¡introduce ¡more ¡modern ¡approaches ¡for ¡2 ¡and ¡ more ¡groups ¡ – Non-‑parametric ¡ – Robust ¡ 3 ¡

Student’s ¡“t” ¡ • Standard ¡classical ¡method ¡ • Two ¡independent ¡groups ¡ – Size ¡n 1 ¡and ¡n 2 ¡ ¡ – Some ¡measure ¡of ¡interest ¡x ij ¡ – i=1 ¡or ¡2 ¡specifying ¡group ¡ • j=1,… ¡n 1 ¡ if ¡i=1 ¡ • j=1,… ¡n 2 ¡ if ¡i=2 ¡ • Assump$ons ¡ – x ij ¡are ¡iid ¡ – x ij ¡ ~N(μ i ,σ 2 ) ¡ • H0: ¡μ 1 = ¡μ 2 , ¡H1: ¡μ 1 ≠ ¡μ 2 ¡| ¡μ 1 <μ 2 ¡| ¡μ 1 >μ 2 ¡ ¡ 4 ¡

Jus$fica$on ¡ • Normal ¡distribu$on ¡means: ¡ • Since ¡individual ¡ ¡x ij ¡independent ¡μ i ¡in ¡each ¡ group ¡are ¡independent ¡ • Variance ¡of ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡is ¡ • Es$mate ¡of ¡σ 2 ¡is ¡ • Under ¡null ¡hypothesis ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ – With ¡n 1 +n 2 -‑2 ¡degrees ¡of ¡freedom ¡ 5 ¡

Varia$on1 ¡ • Paired ¡values ¡ ¡ – n 1 =n 2 =n ¡ – Paired ¡values ¡are ¡not ¡independent ¡so ¡ – Difference ¡ • d j =x 1j -‑x 2j ¡ – Paired ¡values ¡reduces ¡variance ¡ • More ¡likely ¡to ¡find ¡a ¡significant ¡difference ¡ • Reason ¡why ¡repeat ¡measure ¡experiments ¡are ¡ considered ¡useful ¡ – Degrees ¡of ¡freedom=n-‑1 ¡ 6 ¡

Varia$on ¡2 ¡ • Variance ¡of ¡groups ¡differ ¡ – Welch’s ¡test ¡(default ¡in ¡R) ¡ – Changes ¡degrees ¡of ¡freedom ¡(ν) ¡ – where ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ 7 ¡

Problems ¡with ¡t-‑test ¡ • Mean ¡is ¡not ¡robust ¡ – Single ¡large ¡value ¡can ¡inflate ¡mean ¡ • Es$mate ¡of ¡variance ¡may ¡be ¡very ¡poor ¡ – If ¡there ¡are ¡outlier ¡values ¡that ¡inflate ¡mean ¡they ¡ will ¡also ¡inflate ¡variance ¡ – Es$mate ¡of ¡variance ¡is ¡not ¡robust ¡ • If ¡outliers ¡in ¡the ¡data ¡real ¡effects ¡may ¡not ¡be ¡ found ¡ – i.e. ¡power ¡of ¡t-‑test ¡is ¡low ¡if ¡there ¡are ¡outliers ¡ • In ¡the ¡presence ¡of ¡outliers, ¡the ¡outliers ¡may ¡ not ¡be ¡easily ¡detected ¡(i.e. ¡masked) ¡ 8 ¡

Mann-‑Whitney-‑Wilcoxon ¡test ¡ ) • Non-‑parametric ¡test ¡ – Used ¡very ¡frequently ¡in ¡SE ¡studies ¡because ¡ datasets ¡are ¡oren ¡not ¡Normal ¡ • Usually ¡es$mated ¡via ¡ranks ¡ – Values ¡measured ¡on ¡items ¡in ¡two ¡groups ¡ • Rank ¡values ¡across ¡all ¡values ¡ – Mann-‑Whitney ¡ – where ¡ – Wilcoxon, ¡W=Sum ¡of ¡ranks ¡from ¡G2 ¡ ¡ • W=U+n ¡(n+1)/2 ¡ 9 ¡

Tes$ng ¡process ¡ • Large ¡sample ¡approxima$on ¡ • Converts ¡into ¡standard ¡normal ¡deviate ¡ – E 0 (W)=n(m+n+1)/2 ¡ • Sum ¡of ¡all ¡ranks ¡=(n+m)×(n+m+1)/2 ¡ • Under ¡H 0 ¡ Propor$on ¡of ¡ranks ¡in ¡Group ¡2= ¡n/(n+m) ¡ – Var 0 (W)=mn(n+1)/12 ¡ – Standardized ¡(W)=[W-‑E 0 (W)]/[Var 0 ] 0.5 ¡ – For ¡ ¡U ¡ • E 0 (U)=mn/2 ¡ • Var 0 (U)=mn(m+n+1)/12 ¡ • R ¡func$on: ¡wilcox.test ¡reports ¡U ¡(but ¡says ¡W) ¡ 10 ¡

Problems ¡with ¡Mann-‑Whitney ¡ • Has ¡poor ¡power ¡if: ¡ – Ties ¡among ¡data ¡ ¡ – When ¡distribu$on ¡of ¡two ¡groups ¡differs, ¡uses ¡the ¡ wrong ¡standard ¡error ¡ • Alterna$ve ¡methods ¡available ¡ – Mann-‑Whitney ¡test ¡is ¡related ¡to ¡probability ¡(p) ¡ than ¡random ¡observa$on ¡from ¡group ¡1 ¡<random ¡ observa$on ¡from ¡group ¡2 ¡ • H0: ¡p=0.5 ¡ – Other ¡ ¡methods ¡based ¡on ¡this ¡viewpoint ¡ 11 ¡

Alterna$ve ¡“New” ¡Nonparametric ¡ Methods ¡ • Cliff’s ¡method ¡(1996) ¡ – p 1 =P(X I1 >X i2 ) ¡, ¡p 2 =P(X I1 =X i2 ), ¡p 3 =P(X I1 <X i2 ) ¡ – P=p 3 +0.5p 2 ¡ – δ=p 3 -‑p 1, ¡ H0: ¡δ=0 ¡ ¡giving ¡δ=1-‑2P ¡ • Brunner-‑Munzel ¡(2000) ¡ – When ¡$ed ¡values ¡average ¡rank ¡of ¡$ed ¡values ¡ • R ¡func$ons ¡in ¡WRS ¡package ¡ – Load ¡library ¡WRS ¡ 12 ¡

Advantages ¡of ¡New ¡methods ¡ • ¡ ¡ ¡ ¡ ¡ ¡provides ¡a ¡sensible ¡non-‑parametric ¡ effect ¡size ¡ • Have ¡well-‑defined ¡process ¡for ¡handling ¡ $ed ¡data ¡ • Version ¡of ¡both ¡Cliff ¡& ¡Brunner-‑Munzel ¡ available ¡for ¡ ¡three ¡or ¡more ¡groups ¡ – Although ¡tests ¡suggest ¡Cliff ¡is ¡slightly ¡be|er ¡ at ¡achieving ¡specified ¡alpha ¡level ¡ 13 ¡

Permuta$on ¡test ¡ • Useful ¡when ¡data ¡sets ¡are ¡small ¡ • Calculate ¡test ¡sta$s$c ¡based ¡on ¡actual ¡data ¡T 0 ¡ • Could ¡be ¡“t” ¡value, ¡the ¡Mann-‑Whitney ¡sta$s$cs ¡or ¡another ¡ test ¡sta$s$c ¡e.g. ¡sum ¡of ¡ranks ¡of ¡smallest ¡group ¡ • Resample ¡data ¡ without ¡replacement ¡ – Calculate ¡ ¡and ¡record ¡new ¡sum ¡(T 1 ) ¡ • Repeat ¡for ¡every ¡possible ¡way ¡of ¡arrangement ¡of ¡data ¡ • Arrange ¡T i ¡in ¡ascending ¡order ¡ • If ¡T 0 ¡fall ¡outside ¡the ¡middle ¡95% ¡of ¡values, ¡reject ¡ hypothesis ¡ • If ¡too ¡many ¡permuta$ons, ¡take ¡sample ¡ 14 ¡

R ¡Permuta$on ¡Test ¡facility ¡ • Load ¡packages ¡ – coin ¡& ¡lmPerm ¡ • library(coin) ¡ – For ¡t-‑test ¡ • oneway_test(y~A) ¡ ¡ – For ¡Wilcoxon ¡test ¡ • wilcox_test(y~A) ¡ – A ¡must ¡be ¡defined ¡as ¡a ¡factor ¡with ¡two ¡ levels ¡ ¡ 15 ¡

Other ¡robust ¡approaches ¡ • Use ¡differences ¡between ¡medians ¡and ¡standard ¡ error ¡of ¡medians, ¡then ¡ – where ¡c=(1-‑α/2) ¡quan$le ¡of ¡unit ¡normal ¡distribu$on ¡ – But ¡which ¡es$mate ¡of ¡SE ¡of ¡median? ¡ • Version ¡of ¡t-‑test ¡based ¡on ¡20% ¡trimmed ¡means ¡ – Allowing ¡for ¡unstable ¡variances ¡ – Yuen-‑Welch ¡method ¡available ¡in ¡R ¡package ¡WRS ¡ • Library(WRS) ¡ • yuen(y,x,tr=0.2,alpha=0.05) ¡ 16 ¡

Comparing ¡Two ¡Groups ¡ • From ¡COCOMO ¡dataset ¡ • Produc$vity ¡(KLoc/MM) ¡of ¡organic ¡ projects ¡that ¡used ¡different ¡amounts ¡of ¡ tool ¡support ¡ • GR1 ¡(Low): ¡{0.09, ¡0.13, ¡0.77,0.08, ¡0.20, ¡ 0.22, ¡0.12} ¡ • GR2 ¡(Average): ¡ {0.19,0.48,0.72,0.31,0.34,0.34,0.45,0.64, ¡ 0.35,0.56 ¡} ¡ 17 ¡

Productivity 0.1 0.3 0.5 0.7 Box ¡plot ¡ 1 18 ¡ 2

Are ¡groups ¡different? ¡ • Basic ¡sta$s$cs ¡ – Mean ¡G1=0.23 ¡(n 1 =7) ¡ – Mean ¡G2=0.4236 ¡(n 2 =11) ¡ – StDev1=0.2439 ¡ ¡ – StDev2=0.1622 ¡ – Median ¡G1=0.13 ¡ – Median ¡G2=0.35 ¡ 19 ¡

Difference ¡Test ¡Results ¡ • t-‑test, ¡t=2.0348, ¡df=16, ¡p=0.05879 ¡ • Welch ¡test, ¡t=1.8558, ¡df=9.406, ¡p= ¡0.09503, ¡ • Wilcoxon ¡rank ¡test ¡p=0.0204 ¡ • Yuen-‑Welch ¡test ¡for ¡trimmed ¡means ¡ – 20% ¡Trimmed ¡means ¡G1=0.152, ¡G2=0.4014 ¡ – p=0.0029, ¡df=9.3 ¡ • Cliff, ¡ ¡ ¡ ¡ ¡=0.8312, ¡CI ¡(0.46131, ¡0.9659), ¡p=0.081 ¡ • Brunner-‑Munzel, ¡ ¡ ¡ ¡ ¡=0.8312, ¡CI ¡(0.4894, ¡1.1729), ¡ p=0.056, ¡df=6.42 ¡ • Permuta$on ¡t-‑test, ¡z=1.8694, ¡p=0.062 ¡ • Permuta$on ¡Wilcoxon ¡test ¡,z=2.3095, ¡p=0.019 ¡

Robust ¡methods ¡plot ¡difference ¡ 2.0 1.5 1.0 0.5 -0.6 -0.4 -0.2 0.0 0.2 0.4 0.6 21 ¡

Sta$s$cs & Experimental Design with R Barbara - PowerPoint PPT Presentation

Sta$s$cs & Experimental Design with R Barbara Kitchenham Keele University 1 Comparing two or more groups Part 5 2 Aim To cover standard

Sta$s$cs Sta$s$cs Fourth Dimension of a Sta$s$cal Programmer

Basic Experimental Design Basic Concepts in Experimental Design Prof. Dr. Luc Duchateau Ghent

F orwa rd L ooking Sta te me nt Ce rta in o f the sta te me nts ma de in this Pre se nta tio

Experimental Design and Probability Introduction to course Robin Elahi Experimental Design and

Experimental Design in R Kaelen Medeiros Product Data Scientist at DataCamp DataCamp

Sta$s$cs & Experimental Design with R Barbara Kitchenham

Sta$s$cs & Experimental Design with R Barbara Kitchenham

2011 11 12 12 th th at t Sta tate te (3:18.02) :18.02) 2012 12 10 10 th th at t

STA STA 2Q 2Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1. .

STA STA 4Q 4Q19 19 & FY & FY19 19 An Analy lyst st Pre Presentat sentation ion

STA STA 1Q 1Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1.

Open Water Swimming Speaker: Dave Candler, STA President Qualifications STA Level 1 Award for

STA STA 1Q 1Q20 20 Pr Prese esentation ntation Opportu ortunity nity Day 5 June e 2020

STA Graduation 2019/20 STA Graduation Application https://forms.gle/tZsKJXUmbAQgcSn57 This google

263-2810: Advanced Compiler Design 2.0 Sta>c Single Assignment Form Thomas R. Gross Computer

WHAT WOULD TREX DO? From Experimental Design to Analysis, the TREX Approach EXPERIMENTAL DESIGN

Independent Evaluator for The Public Service Company of Colorados 2017 All-Source Solicitation

The Numerical Reproducibility Fair Trade: Facing the Concurrency Challenges at the Extreme Scale

STAT 113 Independent vs. Paired Samples Colin Reimer Dawson Oberlin College November 16, 2017

Independence Complexes of Finite Groups Casey Pinckney Research Advisors: Dr. Alexander Hulpke,

Oregon Public Schools Oregon Public Schools Di Disposition Best Practices Guide iti B t P ti

Comparing Groups 1 Two Independent Groups Parametric t.test(q1 ~ gender)

Siddhi Slides Private Limited https://www.indiamart.com/parmar-marketing/ Tech-Hydra is the

Audio is only available by conference call Please call: (866) 254-5938 Participant Access Code:

Sambuz

Useful Links

Newsletter

Mail Us

Sta$s$cs & Experimental Design with R Barbara - PowerPoint PPT Presentation

Sta$s$cs & Experimental Design with R Barbara Kitchenham Keele University 1 Comparing two or more groups Part 5 2 Aim To cover standard

Sta$s$cs Sta$s$cs Fourth Dimension of a Sta$s$cal Programmer

Basic Experimental Design Basic Concepts in Experimental Design Prof. Dr. Luc Duchateau Ghent

F orwa rd L ooking Sta te me nt Ce rta in o f the sta te me nts ma de in this Pre se nta tio

Experimental Design and Probability Introduction to course Robin Elahi Experimental Design and

Experimental Design in R Kaelen Medeiros Product Data Scientist at DataCamp DataCamp

Sta$s$cs &amp; Experimental Design with R Barbara Kitchenham

Sta$s$cs &amp; Experimental Design with R Barbara Kitchenham

2011 11 12 12 th th at t Sta tate te (3:18.02) :18.02) 2012 12 10 10 th th at t

STA STA 2Q 2Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1. .

STA STA 4Q 4Q19 19 &amp; FY &amp; FY19 19 An Analy lyst st Pre Presentat sentation ion

STA STA 1Q 1Q19 19 An Analyst lyst Pre Presentation entation 1 CO CONTENTS TENTS 1.

Open Water Swimming Speaker: Dave Candler, STA President Qualifications STA Level 1 Award for

STA STA 1Q 1Q20 20 Pr Prese esentation ntation Opportu ortunity nity Day 5 June e 2020

STA Graduation 2019/20 STA Graduation Application https://forms.gle/tZsKJXUmbAQgcSn57 This google

263-2810: Advanced Compiler Design 2.0 Sta&gt;c Single Assignment Form Thomas R. Gross Computer

WHAT WOULD TREX DO? From Experimental Design to Analysis, the TREX Approach EXPERIMENTAL DESIGN

Independent Evaluator for The Public Service Company of Colorados 2017 All-Source Solicitation

The Numerical Reproducibility Fair Trade: Facing the Concurrency Challenges at the Extreme Scale

STAT 113 Independent vs. Paired Samples Colin Reimer Dawson Oberlin College November 16, 2017

Independence Complexes of Finite Groups Casey Pinckney Research Advisors: Dr. Alexander Hulpke,

Oregon Public Schools Oregon Public Schools Di Disposition Best Practices Guide iti B t P ti

Comparing Groups 1 Two Independent Groups Parametric t.test(q1 ~ gender)

Siddhi Slides Private Limited https://www.indiamart.com/parmar-marketing/ Tech-Hydra is the

Audio is only available by conference call Please call: (866) 254-5938 Participant Access Code:

Sambuz

Useful Links

Newsletter

Mail Us

Sta$s$cs & Experimental Design with R Barbara Kitchenham

Sta$s$cs & Experimental Design with R Barbara Kitchenham

STA STA 4Q 4Q19 19 & FY & FY19 19 An Analy lyst st Pre Presentat sentation ion

263-2810: Advanced Compiler Design 2.0 Sta>c Single Assignment Form Thomas R. Gross Computer