unit 3 inferential statistics for continuous data
play

Unit 3: Inferential Statistics for Continuous Data Statistics for - PowerPoint PPT Presentation

Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 2 1 Center for Mind/Brain Sciences (CIMeC) University of Trento, Italy 2 Corpus Linguistics Group


  1. Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R – A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 2 1 Center for Mind/Brain Sciences (CIMeC) University of Trento, Italy 2 Corpus Linguistics Group Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 1 / 33

  2. Outline Outline Inferential statistics Preliminaries One-sample tests Testing the mean Testing the variance Student’s t test Confidence intervals SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 2 / 33

  3. Inferential statistics Preliminaries Outline Inferential statistics Preliminaries One-sample tests Testing the mean Testing the variance Student’s t test Confidence intervals SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 3 / 33

  4. Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Goal: infer (characteristics of) population distribution from small random sample, or test hypotheses about population ◮ problem: overwhelmingly infinite coice of possible distributions ◮ can estimate/test characteristics such as mean µ and s.d. σ ◮ but H 0 doesn’t determine a unique sampling distribution then ☞ parametric model, where the population distribution of a r.v. X is completely determined by a small set of parameters SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 4 / 33

  5. Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Goal: infer (characteristics of) population distribution from small random sample, or test hypotheses about population ◮ problem: overwhelmingly infinite coice of possible distributions ◮ can estimate/test characteristics such as mean µ and s.d. σ ◮ but H 0 doesn’t determine a unique sampling distribution then ☞ parametric model, where the population distribution of a r.v. X is completely determined by a small set of parameters ◮ In this session, we assume a Gaussian population distribution ◮ estimate/test parameters µ and σ of this distribution ◮ sometimes a scale transformation is necessary (e.g. lognormal) SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 4 / 33

  6. Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Goal: infer (characteristics of) population distribution from small random sample, or test hypotheses about population ◮ problem: overwhelmingly infinite coice of possible distributions ◮ can estimate/test characteristics such as mean µ and s.d. σ ◮ but H 0 doesn’t determine a unique sampling distribution then ☞ parametric model, where the population distribution of a r.v. X is completely determined by a small set of parameters ◮ In this session, we assume a Gaussian population distribution ◮ estimate/test parameters µ and σ of this distribution ◮ sometimes a scale transformation is necessary (e.g. lognormal) ◮ Nonparametric tests need fewer assumptions, but . . . ◮ cannot test hypotheses about µ and σ (instead: median m , IQR = inter-quartile range, etc.) ◮ more complicated and computationally expensive procedures ◮ correct interpretation of results often difficult SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 4 / 33

  7. Inferential statistics Preliminaries Inferential statistics for continuous data Rationale similar to binomial test for frequency data: measure observed statistic T in sample, which is compared against its expected value E 0 [ T ] ➜ if difference is large enough, reject H 0 SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 5 / 33

  8. Inferential statistics Preliminaries Inferential statistics for continuous data Rationale similar to binomial test for frequency data: measure observed statistic T in sample, which is compared against its expected value E 0 [ T ] ➜ if difference is large enough, reject H 0 ◮ Question 1: What is a suitable statistic? ◮ depends on null hypothesis H 0 ◮ large difference T − E 0 [ T ] should provide evidence against H 0 ◮ e.g. unbiased estimator for population parameter to be tested SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 5 / 33

  9. Inferential statistics Preliminaries Inferential statistics for continuous data Rationale similar to binomial test for frequency data: measure observed statistic T in sample, which is compared against its expected value E 0 [ T ] ➜ if difference is large enough, reject H 0 ◮ Question 1: What is a suitable statistic? ◮ depends on null hypothesis H 0 ◮ large difference T − E 0 [ T ] should provide evidence against H 0 ◮ e.g. unbiased estimator for population parameter to be tested ◮ Question 2: what is “large enough”? ◮ reject if difference is unlikely to arise by chance ◮ need to compute sampling distribution of T under H 0 SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 5 / 33

  10. Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Easy if statistic T has a Gaussian distribution T ∼ N ( µ, σ 2 ) ◮ µ and σ 2 are determined by null hypothesis H 0 ◮ reject H 0 at two-sided significance level α = . 05 if T < µ − 1 . 96 σ or T > µ + 1 . 96 σ σ σ g ( t ) 2 σ 2 σ µ t SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 6 / 33

  11. Inferential statistics Preliminaries Inferential statistics for continuous data ◮ Easy if statistic T has a Gaussian distribution T ∼ N ( µ, σ 2 ) ◮ µ and σ 2 are determined by null hypothesis H 0 ◮ reject H 0 at two-sided significance level α = . 05 if T < µ − 1 . 96 σ or T > µ + 1 . 96 σ ◮ This suggests a standardized z-score as a measure of extremeness: σ σ Z := T − µ g ( t ) σ 2 σ 2 σ ◮ Central range of sampling µ variation: | Z | ≤ 1 . 96 t SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 6 / 33

  12. Inferential statistics Preliminaries Notation for random samples ◮ Random sample of n ≪ m = | Ω | items ◮ e.g. participants of survey, Wikipedia sample, . . . ◮ recall importance of completely random selection ◮ Sample described by observed values of r.v. X , Y , Z , . . . : x 1 , . . . , x n ; y 1 , . . . , y n ; z 1 , . . . , z n ☞ specific items ω 1 , . . . , ω n are irrelevant, we are only interested in their properties x i = X ( ω i ) , y i = Y ( ω i ) , etc. SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 7 / 33

  13. Inferential statistics Preliminaries Notation for random samples ◮ Random sample of n ≪ m = | Ω | items ◮ e.g. participants of survey, Wikipedia sample, . . . ◮ recall importance of completely random selection ◮ Sample described by observed values of r.v. X , Y , Z , . . . : x 1 , . . . , x n ; y 1 , . . . , y n ; z 1 , . . . , z n ☞ specific items ω 1 , . . . , ω n are irrelevant, we are only interested in their properties x i = X ( ω i ) , y i = Y ( ω i ) , etc. ◮ Mathematically, x i , y i , z i are realisations of random variables X 1 , . . . , X n ; Y 1 , . . . , Y n ; Z 1 , . . . , Z n ◮ X 1 , . . . , X n are independent from each other and each one has the same distribution X i ∼ X ➜ i.i.d. random variables ☞ each random experiment now yields complete sample of size n SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 7 / 33

  14. One-sample tests Testing the mean Outline Inferential statistics Preliminaries One-sample tests Testing the mean Testing the variance Student’s t test Confidence intervals SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 8 / 33

  15. One-sample tests Testing the mean A simple test for the mean ◮ Consider simplest possible H 0 : a point hypothesis H 0 : µ = µ 0 , σ = σ 0 ☞ together with normality assumption, population distribution is completely determined ◮ How would you test whether µ = µ 0 is correct? SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 9 / 33

  16. One-sample tests Testing the mean A simple test for the mean ◮ Consider simplest possible H 0 : a point hypothesis H 0 : µ = µ 0 , σ = σ 0 ☞ together with normality assumption, population distribution is completely determined ◮ How would you test whether µ = µ 0 is correct? ◮ An intuitive test statistic is the sample mean n x = 1 � ¯ x i with x ≈ µ 0 under H 0 ¯ n i = 1 ◮ Reject H 0 if difference ¯ x − µ 0 is sufficiently large ☞ need to work out sampling distribution of ¯ X SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 9 / 33

  17. One-sample tests Testing the mean The sampling distribution of ¯ X ◮ The sample mean is also a random variable: X = 1 ¯ � � X 1 + · · · + X n n ◮ ¯ X is a sensible test statistic for µ because it is unbiased : � n � n n 1 = 1 E [ X i ] = 1 E [ ¯ � � � X ] = E X i µ = µ n n n i = 1 i = 1 i = 1 SIGIL (Baroni & Evert) 3b. Continuous Data: Inference sigil.r-forge.r-project.org 10 / 33

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend