An introduction to R: Basic statistics with R No emie Becker, - PowerPoint PPT Presentation

An introduction to R: Basic statistics with R No´ emie Becker, Sonja Grath & Dirk Metzler nbecker@bio.lmu.de - grath@bio.lmu.de Winter semester 2017-18

Theory of statistical tests 1 Student T test: reminder 2 T test in R 3 Power of a test 4 Questions for the exam 5

Theory of statistical tests Contents Theory of statistical tests 1 Student T test: reminder 2 T test in R 3 Power of a test 4 Questions for the exam 5

Theory of statistical tests A simple example You want to show that a treatment is effective.

Theory of statistical tests A simple example You want to show that a treatment is effective. You have data for 2 groups of patients with and without treatment.

Theory of statistical tests A simple example You want to show that a treatment is effective. You have data for 2 groups of patients with and without treatment. 80% patients with treatment recovered whereas only 30% patients without recovered.

Theory of statistical tests A simple example You want to show that a treatment is effective. You have data for 2 groups of patients with and without treatment. 80% patients with treatment recovered whereas only 30% patients without recovered. A pessimist would say that this just happened by chance. What do you do to convince the pessimist?

Theory of statistical tests A simple example You want to show that a treatment is effective. You have data for 2 groups of patients with and without treatment. 80% patients with treatment recovered whereas only 30% patients without recovered. A pessimist would say that this just happened by chance. What do you do to convince the pessimist? You assume he is right and you show that under this hypothesis the data would be very unlikely.

Theory of statistical tests In statistical words What you want to show is the alternative hypothesis H 1 . The pessimist (by chance) is the null hypothesis H 0 .

Theory of statistical tests In statistical words What you want to show is the alternative hypothesis H 1 . The pessimist (by chance) is the null hypothesis H 0 . Show that the observation and everything more ’extreme’ is sufficiently unlikely under this null hypothesis. Scientists have agreed that it suffices that this probability is at most 5%. This refutes the pessimist. Statistical language: We reject the null hypothesis on the significance level 5%.

Theory of statistical tests In statistical words What you want to show is the alternative hypothesis H 1 . The pessimist (by chance) is the null hypothesis H 0 . Show that the observation and everything more ’extreme’ is sufficiently unlikely under this null hypothesis. Scientists have agreed that it suffices that this probability is at most 5%. This refutes the pessimist. Statistical language: We reject the null hypothesis on the significance level 5%. p = P ( observation and everything more ’extreme’ / H 0 is true ) If the p value is over 5% you say you cannot reject the null hypothesis.

Theory of statistical tests Statistical tests in R There is a huge variety of statistical tests that you can perform in R. We will cover on example of test in this lecture.

Student T test: reminder Contents Theory of statistical tests 1 Student T test: reminder 2 T test in R 3 Power of a test 4 Questions for the exam 5

Student T test: reminder T test main idea We have measured a variable in one or two samples and we want to test: One-sample: test if the mean is equal to a certain value (often 0) Two-samples: test if the two means are different

Student T test: reminder T test main idea We have measured a variable in one or two samples and we want to test: One-sample: test if the mean is equal to a certain value (often 0) Two-samples: test if the two means are different In case of two samples there are several possibilities: The same individuals are measured twice (ex: before and after a treatment): paired T test Independent samples: unpaired T test

Student T test: reminder T test main idea We have measured a variable in one or two samples and we want to test: One-sample: test if the mean is equal to a certain value (often 0) Two-samples: test if the two means are different In case of two samples there are several possibilities: The same individuals are measured twice (ex: before and after a treatment): paired T test Independent samples: unpaired T test In case of unpaired there are several possibilities: Variance in the two samples is assumed equal Variance in the two samples is not assumed equal

Student T test: reminder More details about the One-sample T test Given : Observations X 1 , X 2 , . . . , X n

Student T test: reminder More details about the One-sample T test Given : Observations X 1 , X 2 , . . . , X n Null hypothesis H 0 : µ X = c (We test for a value c , usually c = 0) Level of significance: α (usually α = 5 % )

Student T test: reminder More details about the One-sample T test Given : Observations X 1 , X 2 , . . . , X n Null hypothesis H 0 : µ X = c (We test for a value c , usually c = 0) Level of significance: α (usually α = 5 % ) Test : t-Test Compute test statistic X − c t := s ( X ) / √ n

Student T test: reminder More details about the One-sample T test Given : Observations X 1 , X 2 , . . . , X n Null hypothesis H 0 : µ X = c (We test for a value c , usually c = 0) Level of significance: α (usually α = 5 % ) Test : t-Test Compute test statistic X − c t := s ( X ) / √ n p-value = Pr ( | T n − 1 | ≥ | t | ) ( n − 1 degrees of freedom) Reject null hypothesis, if p-value ≤ α

Student T test: reminder More details about the paired T test Given : paired observations ( Y 1 , Z 1 ) , ( Y 2 , Z 2 ) , . . . , ( Y n , Z n )

Student T test: reminder More details about the paired T test Given : paired observations ( Y 1 , Z 1 ) , ( Y 2 , Z 2 ) , . . . , ( Y n , Z n ) Null hypothesis H 0 : µ Y = µ Z Level of significance: α (usually α = 5 % )

Student T test: reminder More details about the paired T test Given : paired observations ( Y 1 , Z 1 ) , ( Y 2 , Z 2 ) , . . . , ( Y n , Z n ) Null hypothesis H 0 : µ Y = µ Z Level of significance: α (usually α = 5 % ) Test : paired t-Test (more precisely: two-sided paired t-Test) Compute the difference X := Y − Z Compute test statistic X t := s ( X ) / √ n

Student T test: reminder More details about the paired T test Given : paired observations ( Y 1 , Z 1 ) , ( Y 2 , Z 2 ) , . . . , ( Y n , Z n ) Null hypothesis H 0 : µ Y = µ Z Level of significance: α (usually α = 5 % ) Test : paired t-Test (more precisely: two-sided paired t-Test) Compute the difference X := Y − Z Compute test statistic X t := s ( X ) / √ n p-value = Pr ( | T n − 1 | ≥ | t | ) ( n − 1 degrees of freedom) Reject null hypothesis, if p-value ≤ α

Student T test: reminder More details about the unpaired T test with equal variance Given : unpaired observations X 1 , X 2 , . . . , X n andY 1 , Y 2 , . . . , Y m

Student T test: reminder More details about the unpaired T test with equal variance Given : unpaired observations X 1 , X 2 , . . . , X n andY 1 , Y 2 , . . . , Y m Null hypothesis H 0 : µ X = µ Y Level of significance: α (usually α = 5 % )

Student T test: reminder More details about the unpaired T test with equal variance Given : unpaired observations X 1 , X 2 , . . . , X n andY 1 , Y 2 , . . . , Y m Null hypothesis H 0 : µ X = µ Y Level of significance: α (usually α = 5 % ) Test : unpaired t-Test (more precisely: two-sided unpaired t-Test) Compute common variance of the sample p = ( n − 1 ) · s 2 X + ( m − 1 ) · s 2 s 2 Y m + n − 2 Compute test statistic X − Y t = � n + 1 1 s p · m

Student T test: reminder More details about the unpaired T test with equal variance Given : unpaired observations X 1 , X 2 , . . . , X n andY 1 , Y 2 , . . . , Y m Null hypothesis H 0 : µ X = µ Y Level of significance: α (usually α = 5 % ) Test : unpaired t-Test (more precisely: two-sided unpaired t-Test) Compute common variance of the sample p = ( n − 1 ) · s 2 X + ( m − 1 ) · s 2 s 2 Y m + n − 2 Compute test statistic X − Y t = � n + 1 1 s p · m p-value = Pr ( | T n + m − 2 | ≥ | t | ) ( n + m − 2 degrees of freedom) Reject null hypothesis, if p-value ≤ α

T test in R Contents Theory of statistical tests 1 Student T test: reminder 2 T test in R 3 Power of a test 4 Questions for the exam 5

T test in R Function t.test() The function used in R is called t.test() . ?t.test t.test(x, y = NULL, alternative = c("two.sided", "less", "greater"), mu = 0, paired = FALSE, var.equal = FALSE, conf.level = 0.95, ...)

T test in R Martian example Dataset containing height of martian of different colours. See the code on the R console.

T test in R Martian example Dataset containing height of martian of different colours. See the code on the R console. We cannot reject the null hypothesis. It was an unpaired test because the two samples are independent.

T test in R Shoe example Dataset containing wear of shoes of 2 materials A and B. The same persons have weared the two types of shoes and we have a measure of use of the shoes.

An introduction to R: Basic statistics with R No emie Becker, - PowerPoint PPT Presentation

An introduction to R: Basic statistics with R No emie Becker, Sonja Grath & Dirk Metzler nbecker@bio.lmu.de - grath@bio.lmu.de Winter semester 2017-18 Theory of statistical tests 1 Student T test: reminder 2 T test in R 3 Power of

Conference Report AI Lab NLP center Jiangtong Li Basic Statistics Basic Statistics Basic

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

1 Practical Information 2 Introduction to Statistics Per Bruun Brockhoff 3 Descriptive Statistics:

Statistics for Social Sciences I: Introduction to Statistics Introduction to Statistics

Areal statistics Barry Rowlingson Research Fellow DataCamp Spatial Statistics in R Borders

The Pulse monitors: Statistics Smartpods PULSE 1 - Improve Facility Efficiencies 2 - Increase

Quality Assurance in Official Statistics Directorate of Economics & Statistics, Planning

UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics

The Statistics Network The Statistics Network Statistics network Compute servers Desktop PCs

Order Statistics and Applications Rosemary Smith Introduction to Order Statistics Unordered

Exercise 1: Basic Input Exercise 1: Basic Input FLUKA Beginners Course Exercise 1: Basic Input

Introduction Metrics and Review of Basic Statistics Metrics CS 239 Why are we talking

Order Statistics and Pitman Closeness Katherine F. Davies Department of Statistics University of

Advanced Statistics Janette Walde janette.walde@uibk.ac.at Department of Statistics University

Introduction to Business Statistics Professor Jarad Niemi STAT 226 - Iowa State University

Statistics 1B Statistics 1B 1 (11) 0. Lecture 1. Introduction and probability review

Hypotheses testing, p-values, Type I and Type II Errors Statistics are not substitute for

Hypothesis Testing Recall that a point estimate of some parameter is its most plausible value, in

Statistical Power in Statistical Power in ANOVA ANOVA Rick Balkin Balkin, Ph.D., LPC , Ph.D.,

Primer on multiple testing Joshua Loftus July 23, 2015 One hypothesis, many kinds of errors We

A/B Testing: Avoiding Common Pitfalls Danielle Jabin Mrz 6, 2014 2 Make all the worlds

New approaches to error control in multiple testing Juliet Popper Shaffer Fourth Lehmann

+ Quantitative Statistics: Chi-Square ScWk 242 Session 7 Slides + Chi-Square Test of

Sta$s$cs & Experimental Design with R Barbara Kitchenham

An introduction to R: Basic statistics with R No emie Becker, - PowerPoint PPT Presentation

An introduction to R: Basic statistics with R No emie Becker, Sonja Grath & Dirk Metzler nbecker@bio.lmu.de - grath@bio.lmu.de Winter semester 2017-18 Theory of statistical tests 1 Student T test: reminder 2 T test in R 3 Power of

Conference Report AI Lab NLP center Jiangtong Li Basic Statistics Basic Statistics Basic

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

1 Practical Information 2 Introduction to Statistics Per Bruun Brockhoff 3 Descriptive Statistics:

Statistics for Social Sciences I: Introduction to Statistics Introduction to Statistics

Areal statistics Barry Rowlingson Research Fellow DataCamp Spatial Statistics in R Borders

The Pulse monitors: Statistics Smartpods PULSE 1 - Improve Facility Efficiencies 2 - Increase

Quality Assurance in Official Statistics Directorate of Economics &amp; Statistics, Planning

UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics UK Bleeding Disorder Statistics

The Statistics Network The Statistics Network Statistics network Compute servers Desktop PCs

Order Statistics and Applications Rosemary Smith Introduction to Order Statistics Unordered

Exercise 1: Basic Input Exercise 1: Basic Input FLUKA Beginners Course Exercise 1: Basic Input

Introduction Metrics and Review of Basic Statistics Metrics CS 239 Why are we talking

Order Statistics and Pitman Closeness Katherine F. Davies Department of Statistics University of

Advanced Statistics Janette Walde janette.walde@uibk.ac.at Department of Statistics University

Introduction to Business Statistics Professor Jarad Niemi STAT 226 - Iowa State University

Statistics 1B Statistics 1B 1 (11) 0. Lecture 1. Introduction and probability review

Hypotheses testing, p-values, Type I and Type II Errors Statistics are not substitute for

Hypothesis Testing Recall that a point estimate of some parameter is its most plausible value, in

Statistical Power in Statistical Power in ANOVA ANOVA Rick Balkin Balkin, Ph.D., LPC , Ph.D.,

Primer on multiple testing Joshua Loftus July 23, 2015 One hypothesis, many kinds of errors We

A/B Testing: Avoiding Common Pitfalls Danielle Jabin Mrz 6, 2014 2 Make all the worlds

New approaches to error control in multiple testing Juliet Popper Shaffer Fourth Lehmann

+ Quantitative Statistics: Chi-Square ScWk 242 Session 7 Slides + Chi-Square Test of

Sta$s$cs &amp; Experimental Design with R Barbara Kitchenham

Quality Assurance in Official Statistics Directorate of Economics & Statistics, Planning

Sta$s$cs & Experimental Design with R Barbara Kitchenham