Confidence Intervals II 18.05 Spring 2014 Agenda Polling: - PowerPoint PPT Presentation

Confidence Intervals II 18.05 Spring 2014

Agenda Polling: estimating θ in Bernoulli( θ ). CLT ⇒ large sample confidence intervals for the mean. Three views of confidence intervals. Constructing a confidence interval without normality: the exact binomial confidence interval for θ January 1, 2017 2 / 18

Polling confidence interval Also called a binomial proportion confidence interval Polling means sampling from a Bernoulli( θ ) distribution, i.e. data x 1 , . . . , x n Bernoulli( θ ). Consevative normal confidence interval for θ : 1 x ± z α/ 2 · 2 √ n � Proof uses the CLT and the observation σ = θ (1 − θ ) ≤ 1 / 2. Rule-of-thumb 95% confidence interval for θ : x ± 1 √ n (Reason: z 0 . 025 ≈ 2.) January 1, 2017 3 / 18

Binomial proportion confidence intervals Political polls often give a margin-of-error of ± 1 / √ n , i.e. they use the rule-of-thumb 95% confidence interval. There are many types of binomial proportion confidence intervals: http://en.wikipedia.org/wiki/Binomial_proportion_ confidence_interval January 1, 2017 4 / 18

Board question For a poll to find the proportion θ of people supporting X we know that a (1 − α ) confidence interval for θ is given by � � z x + z α/ 2 α/ 2 ¯ − 2 √ n , ¯ 2 √ . x n 1. How many people would you have to poll to have a margin of error of 0 . 01 with 95% confidence? (You can do this in your head.) 2. How many people would you have to poll to have a margin of error of 0 . 01 with 80% confidence. (You’ll want R or other calculator here.) 3. If n = 900, compute the 95% and 80% confidence intervals for θ . answer: See next slide. January 1, 2017 5 / 18

answer: 1. Need 1 / √ n = 0 . 01 So n = 10000. z α/ 2 qnorm(0.9) = 1 . 2816. So we need 2 √ 2. α = 0 . 2, so z α/ 2 = = . 01. n This gives n = 4106. 3. 95% interval: x ± 1 √ n = x ± 1 30 = x ± 0 . 0333 1 1 2 √ n = 80% interval: x ± z 0 . 1 x ± 1 . 2816 · 60 = x ± 0 . 021 . January 1, 2017 6 / 18

Concept question: overnight polling During the presidential election season, pollsters often do ‘overnight polls’ and report a ‘margin of error’ of about ± 5%. The number of people polled is in which of the following ranges? (a) 0 – 50 (b) 50 – 100 (c) 100 – 300 (d) 300 – 600 (e) 600 – 1000 Answer: 5% = 1/20. So 20 = √ n ⇒ n = 400. January 1, 2017 7 / 18

National Council on Public Polls: Press Release, Sept 1992 “The National Council on Public Polls expressed concern today about the current spate of overnight Presidential polls. [...] Overnight polls do a disservice to both the media and the research industry because of the considerable potential for the results to be misleading. The overnight interviewing period may well mean some methodological compromises, the most serious of which is..” ...what? “...the inability to make callbacks, resulting in samples that do not adequately represent such groups as single member households, younger people, and others who are apt to be out on any given night. As overnight polls often result in findings that are less reliable than those from more carefully conducted polls, if the media reports them, it should be with great caution.” http://www.ncpp.org/?q=node/42 January 1, 2017 8 / 18

Large sample confidence interval Data x 1 , . . . , x n independently drawn from a distribution that may not be normal but has finite mean and variance. A version of the central limit theorem says that large n , x ¯ − µ s / √ ≈ N(0 , 1) n i.e. the sampling distribution of the studentized mean is approximately standard normal: So for large n the (1 − α ) confidence interval for µ is approximately � � s x + s ¯ − √ n · z α/ 2 , ¯ √ n · x z α/ 2 This is called the large sample confidence interval. January 1, 2017 9 / 18

Review: confidence intervals for normal data Suppose the data x 1 , . . . , x n is drawn from N( µ, σ 2 ) Confidence level = 1 − α z confidence interval for the mean ( σ known) � x − z α/ 2 · σ x + z α/ 2 · σ � z α/ 2 · σ √ n , √ n or x ± √ n t confidence interval for the mean ( σ unknown) � x − t α/ 2 · s x + t α/ 2 · s � x ± t α/ 2 · s √ n √ n √ n , or χ 2 confidence interval for σ 2 � n − 1 n − 1 � s 2 , s 2 c α/ 2 c 1 − α/ 2 t and χ 2 have n − 1 degrees of freedom. January 1, 2017 10 / 18

Three views of confidence intervals View 1: Define/construct CI using a standardized point statistic. View 2: Define/construct CI based on hypothesis tests. View 3: Define CI as any interval statistic satisfying a formal mathematical property. January 1, 2017 11 / 18

View 1: Using a standardized point statistic Example. x . . . , x ∼ N( µ, σ 2 ), where σ is known. 1 n The standardized sample mean follows a standard normal distribution. z = x − µ σ/ √ n ∼ N(0 , 1) Therefore: P ( − z α/ 2 < x − µ σ/ √ < z α/ 2 | µ ) = 1 − α n Pivot to: P ( x − z α/ 2 · σ √ n < µ < x + z α/ 2 · σ √ n | ) = 1 − α µ This is the (1 − α ) confidence interval: x ± z α/ 2 · σ √ n Think of it as x ± error January 1, 2017 12 / 18

View 1: Other standardized statistics The t and χ 2 statistics fit this paradigm as well: t = x − µ s / √ n ∼ ( n − 1) t ( n − 1) s 2 X 2 2 ( n − 1) = ∼ χ σ 2 January 1, 2017 13 / 18

View 2: Using hypothesis tests Set up: Unknown parameter θ . Test statistic x . For any value θ 0 , we can run an NSHT with null hypothesis H 0 : θ = θ 0 at significance level α . Definition. Given x , the (1 − α ) confidence interval contains all θ 0 which are not rejected when they are the null hypothesis. Definition. A type 1 CI error occurs when the confidence interval does not contain the true value of θ . For a 1 − α confidence interval, the type 1 CI error rate is α . January 1, 2017 14 / 18

Board question: exact binomial confidence interval Use this table of binomial(8, θ ) probabilities to: 1 find the (two-sided) rejection region with significance level 0 . 10 for each value of θ . 2 Given x = 7, find the 90% confidence interval for θ . 3 Repeat for x = 4. θ/ x 0 1 2 3 4 5 6 7 8 .1 0.430 0.383 0.149 0.033 0.005 0.000 0.000 0.000 0.000 .3 0.058 0.198 0.296 0.254 0.136 0.047 0.010 0.001 0.000 .5 0.004 0.031 0.109 0.219 0.273 0.219 0.109 0.031 0.004 .7 0.000 0.001 0.010 0.047 0.136 0.254 0.296 0.198 0.058 .9 0.000 0.000 0.000 0.000 0.005 0.033 0.149 0.383 0.430 January 1, 2017 15 / 18

Solution For each θ , the non-rejection region is blue, the rejection region is red. In each row, the rejection region has probability at most α = 0 . 10. θ/ x 0 1 2 3 4 5 6 7 8 .1 0.430 0.383 0.149 0.033 0.005 0.000 0.000 0.000 0.000 .3 0.058 0.198 0.296 0.254 0.136 0.047 0.010 0.001 0.000 .5 0.004 0.031 0.109 0.219 0.273 0.219 0.109 0.031 0.004 .7 0.000 0.001 0.010 0.047 0.136 0.254 0.296 0.198 0.058 .9 0.000 0.000 0.000 0.000 0.005 0.033 0.149 0.383 0.430 For x = 7 the 90% confidence interval for p is [0 . 7 , 0 . 9]. These are the values of θ we wouldn’t reject as null hypotheses. They are the blue entries in the x = 7 column. For x = 4 the 90% confidence interval for p is [0 . 3 , 0 . 7]. January 1, 2017 16 / 18

View 3: Formal Recall: An interval statistic is an interval I x computed from data x . This is a random interval because x is random. Suppose x is drawn from f ( x | θ ) with unknown parameter θ . Definition: A (1 − α ) confidence interval for θ is an interval statistic I x such that P ( I x contains θ | θ ) = 1 − α for all possible values of θ (and hence for the true value of θ ). Note: equality in this equation is often relaxed to ≥ or ≈ . = : z , t , χ 2 ≥ : rule-of-thumb and exact binomial (polling) ≈ : large sample confidence interval January 1, 2017 17 / 18

MIT OpenCourseWare https://ocw.mit.edu 18.05 Introduction to Probability and Statistics Spring 2014 For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.

Confidence Intervals II 18.05 Spring 2014 Agenda Polling: - PowerPoint PPT Presentation

Confidence Intervals II 18.05 Spring 2014 Agenda Polling: estimating in Bernoulli( ). CLT large sample confidence intervals for the mean. Three views of confidence intervals. Constructing a confidence interval without normality: the

STAT 113 Confidence Intervals Colin Reimer Dawson Oberlin College October 3, 2017 1 / 51

STAT 113 Bootstrap Confidence Intervals Colin Reimer Dawson Oberlin College 3 March 2017

Creating Confidence Intervals using Excel 2013 XL8A-V0R XL8A-V0R XL8A-V0R Create Confidence

Creating Confidence Intervals using Excel 2010 5/08/2015 V0M V0M V0M Create Confidence

Confidence Intervals for Normal Data 18.05 Spring 2014 Agenda Today Review of critical values

Intro to Confidence Intervals SECTION 10.1 1 Confidence Intervals Slides.notebook December 22,

Confidence Intervals for Normal Data 18.05 Spring 2014 Agenda Today Review of critical values

Confidence Intervals for Normal Data 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda

Confidence Intervals for Normal Data 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda

Confidence Intervals II 18.05 Spring 2014 Agenda Polling: estimating in Bernoulli( ). CLT

Confidence Intervals II 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda Polling:

M5S1 - Confidence Intervals Professor Jarad Niemi STAT 226 - Iowa State University October 9,

Confidence intervals and power Applied Statistics and Experimental Design Chapter 4 Peter Hoff

I05 - Confidence intervals STAT 587 (Engineering) Iowa State University September 24, 2020

CS70: Jean Walrand: Lecture 29. Confidence? Confidence? Confidence is essential is many

Introductory Statistics Day 24 One sample means - Confidence Intervals 4.3 One sample

SeaTides Hokkaido North 13 MHz Pacific Ocean Tsugaru Strait CTD 13 MHz 13 MHz Honshu

TESTING THE VALIDITY OF THE SINGLE-SPIN APPROXIMATION IN INSPIRAL-MERGER-RINGDOWN WAVEFORMS

Testing general relativity with X-ray reflection spectroscopy of MCG-06-30-15 Ashutosh Tripathi

A new view of the X-ray Sky through the Virtual Observatory Janet Evans, Ian Evans, and the CSC

Estimating parameters 5.3 Confidence Intervals 5.4 Sample Variance Prof. Tesler Math 186

Expected limits and 3 lepton control region Andy Nelson, Daniel Whiteson: UC Irvine Expected

Condence Intervals and the t Distribution Cohen Chapter 6 EDUC/PSY 6600 It is common

Reliability Analysis for What Is the Accuracy . . . Aerospace Applications: New Approach: Main

Sambuz

Useful Links

Newsletter

Mail Us