Unit 3: Inferential Statistics for Continuous Data Statistics for - PowerPoint PPT Presentation

Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R – A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 2 1 Center for Mind/Brain Sciences (CIMeC) University of Trento, Italy 2 Corpus Linguistics Group Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 1 / 33

Outline Outline Inferential statistics Preliminaries One-sample tests Testing the mean Testing the variance Student’s t test Confidence intervals SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 2 / 33

Inferential statistics Preliminaries Outline Inferential statistics Preliminaries One-sample tests Testing the mean Testing the variance Student’s t test Confidence intervals SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 3 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Goal: infer (characteristics of) population distribution from small random sample, or test hypotheses about population ◮ problem: overwhelmingly infinite coice of possible distributions ◮ can estimate/test characteristics such as mean µ and s.d. σ ◮ but H 0 doesn’t determine a unique sampling distribution then ☞ parametric model, where the population distribution of a r.v. X is completely determined by a small set of parameters SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 4 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Goal: infer (characteristics of) population distribution from small random sample, or test hypotheses about population ◮ problem: overwhelmingly infinite coice of possible distributions ◮ can estimate/test characteristics such as mean µ and s.d. σ ◮ but H 0 doesn’t determine a unique sampling distribution then ☞ parametric model, where the population distribution of a r.v. X is completely determined by a small set of parameters ◮ In this session, we assume a Gaussian population distribution ◮ estimate/test parameters µ and σ of this distribution ◮ sometimes a scale transformation is necessary (e.g. lognormal) SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 4 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Goal: infer (characteristics of) population distribution from small random sample, or test hypotheses about population ◮ problem: overwhelmingly infinite coice of possible distributions ◮ can estimate/test characteristics such as mean µ and s.d. σ ◮ but H 0 doesn’t determine a unique sampling distribution then ☞ parametric model, where the population distribution of a r.v. X is completely determined by a small set of parameters ◮ In this session, we assume a Gaussian population distribution ◮ estimate/test parameters µ and σ of this distribution ◮ sometimes a scale transformation is necessary (e.g. lognormal) ◮ Nonparametric tests need fewer assumptions, but . . . ◮ cannot test hypotheses about µ and σ (instead: median m , IQR = inter-quartile range, etc.) ◮ more complicated and computationally expensive procedures ◮ correct interpretation of results often difficult SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 4 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data Rationale similar to binomial test for frequency data: measure observed statistic T in sample, which is compared against its expected value E 0 [ T ] ➜ if difference is large enough, reject H 0 SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 5 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data Rationale similar to binomial test for frequency data: measure observed statistic T in sample, which is compared against its expected value E 0 [ T ] ➜ if difference is large enough, reject H 0 ◮ Question 1: What is a suitable statistic? ◮ depends on null hypothesis H 0 ◮ large difference T − E 0 [ T ] should provide evidence against H 0 ◮ e.g. unbiased estimator for population parameter to be tested SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 5 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data Rationale similar to binomial test for frequency data: measure observed statistic T in sample, which is compared against its expected value E 0 [ T ] ➜ if difference is large enough, reject H 0 ◮ Question 1: What is a suitable statistic? ◮ depends on null hypothesis H 0 ◮ large difference T − E 0 [ T ] should provide evidence against H 0 ◮ e.g. unbiased estimator for population parameter to be tested ◮ Question 2: what is “large enough”? ◮ reject if difference is unlikely to arise by chance ◮ need to compute sampling distribution of T under H 0 SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 5 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Easy if statistic T has a Gaussian distribution T ∼ N ( µ, σ 2 ) ◮ µ and σ 2 are determined by null hypothesis H 0 ◮ reject H 0 at two-sided significance level α = . 05 if T < µ − 1 . 96 σ or T > µ + 1 . 96 σ σ σ g ( t ) 2 σ 2 σ µ t SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 6 / 33

Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Easy if statistic T has a Gaussian distribution T ∼ N ( µ, σ 2 ) ◮ µ and σ 2 are determined by null hypothesis H 0 ◮ reject H 0 at two-sided significance level α = . 05 if T < µ − 1 . 96 σ or T > µ + 1 . 96 σ ◮ This suggests a standardized z-score as a measure of extremeness: σ σ Z := T − µ g ( t ) σ 2 σ 2 σ ◮ Central range of sampling µ variation: | Z | ≤ 1 . 96 t SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 6 / 33

Inferential statistics Preliminaries Notation for random samples ◮ Random sample of n ≪ m = | Ω | items ◮ e.g. participants of survey, Wikipedia sample, . . . ◮ recall importance of completely random selection ◮ Sample described by observed values of r.v. X , Y , Z , . . . : x 1 , . . . , x n ; y 1 , . . . , y n ; z 1 , . . . , z n ☞ specific items ω 1 , . . . , ω n are irrelevant, we are only interested in their properties x i = X ( ω i ) , y i = Y ( ω i ) , etc. SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 7 / 33

Inferential statistics Preliminaries Notation for random samples ◮ Random sample of n ≪ m = | Ω | items ◮ e.g. participants of survey, Wikipedia sample, . . . ◮ recall importance of completely random selection ◮ Sample described by observed values of r.v. X , Y , Z , . . . : x 1 , . . . , x n ; y 1 , . . . , y n ; z 1 , . . . , z n ☞ specific items ω 1 , . . . , ω n are irrelevant, we are only interested in their properties x i = X ( ω i ) , y i = Y ( ω i ) , etc. ◮ Mathematically, x i , y i , z i are realisations of random variables X 1 , . . . , X n ; Y 1 , . . . , Y n ; Z 1 , . . . , Z n ◮ X 1 , . . . , X n are independent from each other and each one has the same distribution X i ∼ X ➜ i.i.d. random variables ☞ each random experiment now yields complete sample of size n SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 7 / 33

One-sample tests Testing the mean Outline Inferential statistics Preliminaries One-sample tests Testing the mean Testing the variance Student’s t test Confidence intervals SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 8 / 33

One-sample tests Testing the mean A simple test for the mean ◮ Consider simplest possible H 0 : a point hypothesis H 0 : µ = µ 0 , σ = σ 0 ☞ together with normality assumption, population distribution is completely determined ◮ How would you test whether µ = µ 0 is correct? SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 9 / 33

One-sample tests Testing the mean A simple test for the mean ◮ Consider simplest possible H 0 : a point hypothesis H 0 : µ = µ 0 , σ = σ 0 ☞ together with normality assumption, population distribution is completely determined ◮ How would you test whether µ = µ 0 is correct? ◮ An intuitive test statistic is the sample mean n x = 1 � ¯ x i with x ≈ µ 0 under H 0 ¯ n i = 1 ◮ Reject H 0 if difference ¯ x − µ 0 is sufficiently large ☞ need to work out sampling distribution of ¯ X SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 9 / 33

One-sample tests Testing the mean The sampling distribution of ¯ X ◮ The sample mean is also a random variable: X = 1 ¯ � � X 1 + · · · + X n n ◮ ¯ X is a sensible test statistic for µ because it is unbiased : � n � n n 1 = 1 E [ X i ] = 1 E [ ¯ � � � X ] = E X i µ = µ n n n i = 1 i = 1 i = 1 SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 10 / 33

Unit 3: Inferential Statistics for Continuous Data Statistics for - PowerPoint PPT Presentation

Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 2 1 Center for Mind/Brain Sciences (CIMeC) University of Trento, Italy 2 Corpus Linguistics Group

Inferential Statistics Inferential statistics are used to test

Descriptive Statistics Descriptive and Inferential Statistics Recall that statistical methods are

On Computational Thinking, Inferential Thinking and Big Data Michael I. Jordan University

Statistical Methods Statistical Methods Descriptive Inferential Statistics Statistics

towards an inferential lexicon of event selecting predicates for french Ingrid Falk and Fabienne

Validity-preservation properties of rules for combining inferential models combining

Introduction to Inferential Statistics Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine

HOUSING PROJECT 1 UNIT 4 UNIT 1 UNIT 6 UNIT 5 UNIT 3 UNIT 2 Application of the Concept

Inferential Statistics Concepts IN TR OD U C TION TO L IN E AR MOD E L IN G IN P YTH ON Jason

Why use R? Introduction to R: To perform inferential statistics (e.g., use a statistical

Inferential Statistics Chapters 6 &7

Calibrated Bayes, and Inferential Paradigm for Of7icial Statistics in the Era of Big Data Rod

Continuous Improvement Continuous Improvement Update on Continuous Improvement Process Update on

Inferential Statistics Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Unit Identifier Unit October 21, 2014 Unit Identifiers Unit Members Representing Name Email

Statistical Significance and Performance Measures l Just a brief review of confidence intervals

CSE 312 Spring 2015 More on parameter estimation Bias; and Confidence Intervals 57 Bias

CS 147: Computer Systems Performance Analysis Comparing Systems and Analyzing Alternatives 1 /

Poli 30D Political Inquiry Normal Curve & Confidence Intervals Shane Xinyang Xuan

Confidence Intervals & Z-Scores Statistics is the grammar of science. Karl Pearson

Point Estimates: Parameters I (7.1.1) Point Estimates: Parameters I (7.1.1) Point Estimates:

Gov 2000: 5. Estimation and Statistical Inference Matthew Blackwell Fall 2016 1 / 56 1. Point

Part 6. Confidence Interval Min Chen School of Computer Science and Engineering Seoul National

Unit 3: Inferential Statistics for Continuous Data Statistics for - PowerPoint PPT Presentation

Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 2 1 Center for Mind/Brain Sciences (CIMeC) University of Trento, Italy 2 Corpus Linguistics Group

Inferential Statistics Inferential statistics are used to test

Descriptive Statistics Descriptive and Inferential Statistics Recall that statistical methods are

On Computational Thinking, Inferential Thinking and Big Data Michael I. Jordan University

Statistical Methods Statistical Methods Descriptive Inferential Statistics Statistics

towards an inferential lexicon of event selecting predicates for french Ingrid Falk and Fabienne

Validity-preservation properties of rules for combining inferential models combining

Introduction to Inferential Statistics Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine

HOUSING PROJECT 1 UNIT 4 UNIT 1 UNIT 6 UNIT 5 UNIT 3 UNIT 2 Application of the Concept

Inferential Statistics Concepts IN TR OD U C TION TO L IN E AR MOD E L IN G IN P YTH ON Jason

Why use R? Introduction to R: To perform inferential statistics (e.g., use a statistical

Inferential Statistics Chapters 6 &amp;7

Calibrated Bayes, and Inferential Paradigm for Of7icial Statistics in the Era of Big Data Rod

Continuous Improvement Continuous Improvement Update on Continuous Improvement Process Update on

Inferential Statistics Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1

Official Statistics Matt Dray, Assistant Statistician Official Statistics 2 Official

Unit Identifier Unit October 21, 2014 Unit Identifiers Unit Members Representing Name Email

Statistical Significance and Performance Measures l Just a brief review of confidence intervals

CSE 312 Spring 2015 More on parameter estimation Bias; and Confidence Intervals 57 Bias

CS 147: Computer Systems Performance Analysis Comparing Systems and Analyzing Alternatives 1 /

Poli 30D Political Inquiry Normal Curve &amp; Confidence Intervals Shane Xinyang Xuan

Confidence Intervals &amp; Z-Scores Statistics is the grammar of science. Karl Pearson

Point Estimates: Parameters I (7.1.1) Point Estimates: Parameters I (7.1.1) Point Estimates:

Gov 2000: 5. Estimation and Statistical Inference Matthew Blackwell Fall 2016 1 / 56 1. Point

Part 6. Confidence Interval Min Chen School of Computer Science and Engineering Seoul National

Inferential Statistics Chapters 6 &7

Poli 30D Political Inquiry Normal Curve & Confidence Intervals Shane Xinyang Xuan

Confidence Intervals & Z-Scores Statistics is the grammar of science. Karl Pearson