Hypothesis testing, p-values, Type I and Type II Errors (Statistics)

Hypothesis testing, p-values, Type I and Type II Errors

“Statistics are no substitute for judgment.” Henry Clay (US Senator)

Formal hypothesis testing

[Figure: a sample is drawn from each of two populations, A and B, and the sample mean heights are compared. Is this difference due to random chance?]

$H_0: \bar{x}_A = \bar{x}_B$
$H_1: \bar{x}_A \neq \bar{x}_B$

If the actual p < α, reject the null hypothesis ($H_0$) and accept the alternative hypothesis ($H_1$).

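How to convert between scales

[Figure: one distribution shown on three aligned scales.]
• Original units, centered on $\bar{x}$
• T-value (in standard errors): -3, -2, -1, 0, 1, 2, 3
• P-value (percentiles, probabilities): 0.001, 0.50, 0.999

Conversions:
• Original units to t-value: $t = (\text{value} - \bar{x}) / SE_{\bar{x}}$
• T-value to original units: $\text{value} = (t \cdot SE_{\bar{x}}) + \bar{x}$
• T-value to p-value: pt(t-value, df)
• α-level to critical t-value: qt(α, df)

Compare the test p-value with the α-level: a test p-value below the α-level is significant; otherwise it is not significant.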
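The pt(t-value, df) and qt(α, df) calls on this slide look like R's Student's t distribution functions. As a rough sketch of the same conversions, the Python below rebuilds them with the standard library only. The function bodies (Simpson's rule for pt, bisection for qt), the helper names to_t and to_original, and the example numbers are illustrative assumptions, not the presenter's code.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def pt(t, df, steps=2000):
    """Lower-tail probability P(T <= t), integrating the density from 0 to |t|
    with Simpson's rule (steps must be even)."""
    a = abs(t)
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if t >= 0 else 0.5 - area

def qt(p, df):
    """Quantile: the t-value whose lower-tail probability is p (bisection)."""
    lo, hi = -100.0, 100.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if pt(mid, df) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def to_t(value, mean, se):
    """Original units -> t-value (number of standard errors from the mean)."""
    return (value - mean) / se

def to_original(t, mean, se):
    """t-value -> original units."""
    return t * se + mean

if __name__ == "__main__":
    # Hypothetical numbers: sample mean 170 cm, standard error 2 cm, df = 10.
    mean, se, df, alpha = 170.0, 2.0, 10, 0.05
    t = to_t(175.0, mean, se)
    p_two_sided = 2 * (1 - pt(abs(t), df))
    t_crit = qt(1 - alpha / 2, df)
    print("t-value:", round(t, 3))
    print("two-sided p-value:", round(p_two_sided, 4))
    print("critical t at alpha:", round(t_crit, 3))
    print("critical value in cm:", round(to_original(t_crit, mean, se), 2))
```

In practice a statistics library would replace the hand-written pt and qt; they are spelled out here only so the three scales can be traced end to end.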

“Is this difference due to random chance?” In other words: “Is random chance a plausible explanation?”

[Figure: mean heights of samples A and B drawn from the population.]

P-value: the probability of observing a difference at least as large as the one observed if random chance alone were at work (i.e. if the null hypothesis were true).

Theory: We can never really prove whether the 2 samples are truly different or the same. We can only ask how often random chance would produce what we observe (or a greater difference).

How to interpret p-values:
• P-value = 0.05: “Yes, random chance would produce a difference this large 1 out of 20 times.”
• P-value = 0.01: “Yes, 1 out of 100 times.”

The less often random chance alone would produce the observed difference, the stronger the evidence that it is the result of an effect (what we test for).
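One way to make "is random chance a plausible explanation?" concrete is a permutation test: shuffle the group labels many times and count how often chance alone produces a difference in means at least as large as the observed one. This is a sketch of that idea, not the t-test the slides use; the function name, the heights, and the number of shuffles are all made up for illustration.

```python
import random
from statistics import mean

def permutation_p_value(a, b, n_shuffles=10_000, seed=0):
    """Share of random relabelings whose difference in means is at least
    as large as the observed difference (two-sided)."""
    rng = random.Random(seed)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    return hits / n_shuffles

if __name__ == "__main__":
    # Hypothetical heights (cm) for samples A and B.
    sample_a = [172, 168, 175, 171, 169, 174]
    sample_b = [178, 181, 176, 183, 179, 180]
    p = permutation_p_value(sample_a, sample_b)
    print("estimated p-value:", p)
```

A small result means random relabeling rarely reproduces a gap this large, which is the "1 out of N times" reading given above.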

| Decision | Null hypothesis is true | Alternative hypothesis is true |
|---|---|---|
| Fail to reject the null hypothesis | Correct decision | Incorrect decision: False Negative (Type II Error) |
| Reject the null hypothesis | Incorrect decision: False Positive (Type I Error) | Correct decision |

Type I Error: rejecting the null hypothesis ($H_0$) when it is actually true.
Type II Error: failing to reject the null hypothesis ($H_0$) when it is not true.

Remember: whether a p-value leads to rejection or acceptance (and therefore the chance you will make an error) depends on the arbitrary α-level you choose. Lowering the α-level will decrease the probability of making a Type I Error, but this increases the probability of making a Type II Error.

The α-level you choose is completely up to you (typically it is set at 0.05); however, it should be chosen with consideration of the consequences of making a Type I or a Type II Error. Based on your study, would you rather err on the side of false positives or false negatives?
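The trade-off between the two error types can be checked by simulation. The sketch below repeatedly draws one sample where the null hypothesis is true and one where it is false, then counts wrong decisions at a given α-level. It assumes a two-sided z-test with known standard deviation (simpler than the t-test in the slides), and the effect size, sample size, and function names are hypothetical.

```python
import random
from statistics import NormalDist

def z_test_p_value(sample, mu0=0.0, sigma=1.0):
    """Two-sided p-value for H0: population mean == mu0, with sigma known."""
    sample_mean = sum(sample) / len(sample)
    z = (sample_mean - mu0) / (sigma / len(sample) ** 0.5)
    return 2 * (1 - NormalDist().cdf(abs(z)))

def error_rates(alpha, true_effect=0.5, n=20, trials=2000, seed=1):
    """Return (Type I rate, Type II rate) estimated from simulated samples."""
    rng = random.Random(seed)
    type1 = type2 = 0
    for _ in range(trials):
        null_sample = [rng.gauss(0.0, 1.0) for _ in range(n)]            # H0 true
        effect_sample = [rng.gauss(true_effect, 1.0) for _ in range(n)]  # H0 false
        if z_test_p_value(null_sample) < alpha:
            type1 += 1  # rejected a true null: false positive
        if z_test_p_value(effect_sample) >= alpha:
            type2 += 1  # failed to reject a false null: false negative
    return type1 / trials, type2 / trials

if __name__ == "__main__":
    for alpha in (0.05, 0.01):
        t1, t2 = error_rates(alpha)
        print(f"alpha={alpha}: Type I rate={t1:.3f}, Type II rate={t2:.3f}")
```

Because the same seed is reused, both α-levels are judged on identical samples, so any change in the two rates comes from the threshold alone.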

Example: Will current forests adequately protect genetic resources under climate change?

$H_0$: Range of the current climate for the Birch Mountain Wildlands (BMW) protected area = Range of the BMW protected area under climate change
$H_a$: Range of the current climate for the BMW protected area ≠ Range of the BMW protected area under climate change

If we reject $H_0$: Climate ranges are different, therefore genetic resources are not adequately protected and new protected areas need to be created.

Consequences if I make a:
• Type I Error: Climates are actually the same and genetic resources are indeed adequately protected in the BMW protected area. We created new parks when we didn’t need to.
• Type II Error: Climates are different and genetic resources are vulnerable. We didn’t create new protected areas and we should have.

From an ecological standpoint it is better to make a Type I Error, but from an economic standpoint it is better to make a Type II Error. Which standpoint should I take?

Statistical Power

Power is your ability to reject the null hypothesis when it is false (i.e. your ability to detect an effect when there is one).

There are many ways to increase power:
1. Increase your sample size (sample more of the population). Given you are testing whether what you observed (or greater) is due to random chance, more data gives you a better understanding of what is truly happening within the population; therefore a larger sample size will decrease the probability of making a Type II Error.
2. Increase your alpha value (e.g. from 0.01 to 0.05). Watch for Type I Error!
3. Use a one-tailed test (when you know the direction of the expected effect).
4. Use a paired test (control and treatment are the same sample).
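The first three levers in the list above can be seen in a small power calculation. This sketch assumes a one-sample z-test with known standard deviation, where power has a closed form; the function name, effect size, and sample sizes are hypothetical, and the paired-test lever is not covered.

```python
from statistics import NormalDist

def z_test_power(effect, sigma, n, alpha=0.05, one_tailed=False):
    """Probability of rejecting H0 when the true mean differs from the
    null value by `effect` (z-test, sigma known)."""
    z = NormalDist()
    shift = abs(effect) / (sigma / n ** 0.5)  # true effect in standard errors
    if one_tailed:
        # Assumes the test is pointed in the direction of the true effect.
        return 1 - z.cdf(z.inv_cdf(1 - alpha) - shift)
    crit = z.inv_cdf(1 - alpha / 2)
    return (1 - z.cdf(crit - shift)) + z.cdf(-crit - shift)

if __name__ == "__main__":
    effect, sigma = 0.5, 1.0
    for n in (10, 20, 40, 80):
        print(f"n={n}: power={z_test_power(effect, sigma, n):.3f}")
    print("alpha=0.01:", round(z_test_power(effect, sigma, 20, alpha=0.01), 3))
    print("alpha=0.05:", round(z_test_power(effect, sigma, 20, alpha=0.05), 3))
    print("one-tailed:", round(z_test_power(effect, sigma, 20, one_tailed=True), 3))
```

Since the probability of a Type II Error is 1 minus power, every increase printed here is an equal decrease in the false negative rate.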
