STAT 113 Tests and Confidence Intervals Colin Reimer Dawson - PowerPoint PPT Presentation

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind STAT 113 Tests and Confidence Intervals Colin Reimer Dawson Oberlin College October 10th, 2016

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Reminders and Announcements • HW online, due Friday (but ok if you want to turn it in during break)

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Two-Tailed Tests Two-Tailed Test In a Two-Tailed Test , H 1 does not specify the direction (sign) of a difference/correlation/slope. So outcomes at either extreme count in its favor. The P -value therefore uses outcomes at or past the observed one, but also the symmetric outcomes on the other “tail” We should prefer two-tailed tests, unless only one side of the alternative is plausible a priori .

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind What is low enough? Significance level ( α ) We need to decide for ourselves, in advance of collecting data , what we will count as a “low enough” P -value to achieve statistical significance. This threshold is called the significance level of the test. (Notation: α )

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Making a Decision Reject H 0 or not? Compare P to α . (a) P ≥ α : Do not reject H 0 . (Data wouldn’t be that surprising if H 0 true. H 0 is “presumed innocent”.) (b) P < α : Reject H 0 . (Data would be too surprising if H 0 were true. Beyond a “reasonable doubt”.) We do not “accept H 0 ”. We “fail to reject” it. (Not enough evidence to decide)

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Types of Errors 2 × 2 table of possibilities. Is H 0 actually false (does the treatment actually work)? Did we reject H 0 (did we conclude that it works)? Action H 0 rejected H 0 not rejected True Discovery Missed Discovery H 0 is false Truth H 0 is true False Discovery No Error Table: Possible outcomes of a null hypothesis significance test

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Type I vs. Type II Errors • We can set α to whatever we want. The lower it is, the less often we make Type I Errors. • Tradeoff: Fewer Type I Errors → More Type II Errors.

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Type I vs. Type II Errors Decreasing α moves the rejection threshold out toward the tail of the H 0 distribution. 0.20 ● α = 0.15 , threshold = 8 ● ● 0.15 Probability ● 0.10 ● ● 0.05 ● ● ● 0.00 ● ● ● ● ● ● ● ● ● ● ● ● 0 5 10 15 20 Values Blue spikes: Distribution of outcomes if H 0 is true

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Type I vs. Type II Errors We retain H 0 when we do not exceed the threshold. But if H 1 is correct, this is a Type II Error. More stringent threshold → missed discoveries. 0.20 ● α = 0.15 , threshold = 8 ● ● ● 0.15 ● ● Probability ● ● ● 0.10 ● ● ● ● 0.05 ● ● ● ● ● ● ● 0.00 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 5 10 15 20 Values Blue spikes: Distribution of outcomes if H 0 is true Orange spikes: Distribution of outcomes for one possible parameter value under .

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Worksheet: Love is Blind, Continued

STAT 113 Tests and Confidence Intervals Colin Reimer Dawson - PowerPoint PPT Presentation

Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind STAT 113 Tests and Confidence Intervals Colin Reimer Dawson Oberlin College October 10th, 2016 Two-Tailed Tests and Stat. Significance Worksheet: Love is Blind Reminders and

STAT 113 Confidence Intervals Colin Reimer Dawson Oberlin College October 3, 2017 1 / 51

STAT 113 Bootstrap Confidence Intervals Colin Reimer Dawson Oberlin College 3 March 2017

Creating Confidence Intervals using Excel 2013 XL8A-V0R XL8A-V0R XL8A-V0R Create Confidence

Creating Confidence Intervals using Excel 2010 5/08/2015 V0M V0M V0M Create Confidence

Confidence Intervals for Normal Data 18.05 Spring 2014 Agenda Today Review of critical values

Confidence Intervals for Normal Data 18.05 Spring 2014 Agenda Today Review of critical values

Intro to Confidence Intervals SECTION 10.1 1 Confidence Intervals Slides.notebook December 22,

M5S1 - Confidence Intervals Professor Jarad Niemi STAT 226 - Iowa State University October 9,

I05 - Confidence intervals STAT 587 (Engineering) Iowa State University September 24, 2020

Confidence Intervals for Normal Data 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda

Confidence Intervals for Normal Data 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda

Confidence Intervals II 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda Polling:

Confidence Intervals II 18.05 Spring 2014 Agenda Polling: estimating in Bernoulli( ). CLT

Confidence Intervals II 18.05 Spring 2014 Agenda Polling: estimating in Bernoulli( ). CLT

More on the Cox PH model I. Confidence intervals and hypothesis tests Two methods for

Confidence intervals and power Applied Statistics and Experimental Design Chapter 4 Peter Hoff

Quantitative Evaluation Research Questions Quantitative Data Controlled Studies Experimental

Session 09: Hypothesis Testing Stats 60/Psych 10 Ismael Lemhadri Summer 2020 This time (and next

4: Significance Testing Machine Learning and Real-world Data Simone Teufel Computer Laboratory

Pairwise, Rigid Registration The ICP Algorithm and Its Variants 1 1 Correspondence Problem

Linear Models: Comparing Variables Stony Brook University CSE545, Fall 2017 Statistical

Testing 6.1 Specification testing Michel Bierlaire A short reminder on hypothesis testing

Sample Size Power, Sample Size, and the FDR How many observations do we need? Depends on

Last time: space curves and arc-length Recall the formula b | r ( t ) | dt . L = a

Sambuz

Useful Links

Newsletter

Mail Us