statistics 300 elementary statistics
play

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 - PDF document

Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation Point Estimate Best single value to use Question What is the probability this estimate is the correct value? Parameter Estimation


  1. Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5 Parameter Estimation • Point Estimate – Best single value to use • Question – What is the probability this estimate is the correct value? Parameter Estimation • Question – What is the probability this estimate is the correct value? • Answer – zero : assuming “ x” is a continuous random variable – Example for Uniform Distribution 1

  2. If X ~ U[100,500] then • P(x = 300) = (300-300)/(500-100) • = 0 100 300 400 500 Parameter Estimation µ • Pop. mean – Sample mean x p • Pop. proportion ˆ – Sample proportion p σ • Pop. standard deviation – Sample standard deviation s Problem with Point Estimates • The unknown parameter ( µ , p, etc.) is not exactly equal to our sample-based point estimate. • So, how far away might it be? • An interval estimate answers this question. 2

  3. Confidence Interval • A range of values that contains the true value of the population parameter with a ... • Specified “level of confidence”. • [L(ower limit),U(pper limit)] Terminology • Confidence Level (a.k.a. Degree of Confidence) – expressed as a percent (%) • Critical Values (a.k.a. Confidence Coefficients) Terminology • “alpha” “ α ” = 1-Confidence – more about α in Chapter 7 • Critical values – express the confidence level 3

  4. Confidence Interval for µ lf σ is known (this is a rare situation) = ± x E  σ  = ⋅   E z α 2   n Confidence Interval for µ lf σ is known (this is a rare situation) if x ~N(?, σ ) σ   = ± ⋅   x z α   n 2 Why does the Confidence Interval for µ look like this ? σ   = ± ⋅   x z α   n 2 4

  5. σ µ x ~ N ( , ) n make an x value into a z - score. The general z - score expression is − µ ) = x ( z σ for x , µ µ is : unchanged x and σ σ is x n 5

  6. so a z - score based on x is − µ x = z σ n Using the Empirical Rule Make a probabilit y statement :     − µ ( x ) − < < =   P 2 2 95 % σ     n Normal Distribution 0 4 . 5 Relative likelihood 0 . 4 0 α 0 . 3 5 α         0 . 3 0     2 2 0 2 . 5 . 0 2 0 0 1 . 5 0 . 1 0 0 . 5 0 . 0 0 0 -3 -2 -1 0 1 2 3 Value of Observation 6

  7. Check out the “Confidence z-scores” on the WEB page. (In pdf format.) Use basic rules of algebra to rearrange the parts of this z-score. Manipulate the probabilit y statement :       ( ) σ σ − < − µ < =       P 2 x 2 0 . 95       n n 7

  8. Manipulate the probabilit y statement :       σ σ − − < − µ < − + =       P x 2 x 2 0 . 95       n n Confidence = 95% α = 1 - 95% = 5% α/2 = 2.5% = 0.025 Manipulate the probabilit y statement : multiply t hrough by (-1) and change the order of the terms       σ σ  − < µ < +  =     P x 2 x 2 0 . 95       n n Confidence = 95% α = 1 - 95% = 5% α/2 = 2.5% = 0.025 Confidence Interval for µ lf σ is not known (usual situation)   s = ± ⋅   x t α   n 2 8

  9. Sample Size Needed to Estimate µ within E, with Confidence = 1- α 2   ⋅ σ ˆ Z α   = 2 n   E   Components of Sample Size Formula when Estimating µ • Z α /2 reflects confidence level – standard normal distribution σ σ • ˆ is an estimate of , the standard deviation of the pop. • E is the acceptable “margin of error” when estimating µ Confidence Interval for p • The Binomial Distribution gives us a starting point for determining the distribution of the sample proportion : p ˆ x successes = = ˆ p n trials 9

  10. For Binomial “x” µ = np σ = npq For the Sample Proportion 1 ( ) x = = ˆ p x n n x is a random variable n is a constant Time Out for a Principle: µ If is the mean of X and “a” is a constant, what is the mean of aX? ⋅ µ . a Answer: 10

  11. Apply that Principle! • Let “a” be equal to “1/n”   1 X • so = = =   p ˆ aX X   n n µ = µ = • and a a ( np ) ˆ p x   1 = ⋅ =   np p   n Time Out for another Principle: σ 2 If is the variance of X and “a” x is a constant, what is the variance of aX? σ = a σ . 2 2 2 Answer: aX x Apply that Principle! • Let x be the binomial “x” • Its variance is npq = np(1-p), which is the square of is standard deviation 11

  12. Apply that Principle! • Let “a” be equal to “1/n”   1 X = = =   • so p ˆ aX X   n n ( ) σ = σ = 2 2 2 2 • and a 1 / n ( npq ) ˆ p X Apply that Principle! 2   1 pq ⋅ = = σ   2 npq p ˆ   n n and pq σ = ˆ p n When n is Large,   pq   µ = σ = ˆ p ~ N p ,    n  12

  13. What is a Large “n” in this situation? • Large enough so np > 5 • Large enough so n(1-p) > 5 • Examples: – (100)(0.04) = 4 (too small) – (1000)(0.01) = 10 (big enough) Now make a z-score − = ˆ p p z pq n And rearrange for a CI(p) Using the Empirical Rule Make a probability statement:     − ˆ p p   − < < = P 1.96 1.96 95%   pq     n 13

  14. Normal Distribution 0 4 . 5 Relative likelihood . 0 4 0 0 . 3 5  α   α      0 . 3 0     2 2 0 2 . 5 0 . 2 0 0 1 . 5 0 . 1 0 0 . 0 5 . 0 0 0 -3 -2 -1 0 1 2 3 Value of Observation Use basic rules of algebra to rearrange the parts of this z-score. Manipulate the probability statement: pq Step 1: Multiply through by : n   pq pq ( ) − < − < =   P 1.96 p ˆ p 1.96 0.95    n n  14

  15. Manipulate the probability statement: Step 2: Subract from all parts of the expression: ˆ p   pq pq − − < − < − + =   P p ˆ 1.96 p p ˆ 1.96 0.95    n n  Manipulate the probability statement: Step 3: Multiply through by -1: (remember to switch the directions of < >)   pq pq + > > − =   ˆ ˆ P p 1.96 p p 1.96 0.95    n n  Manipulate the probability statement: Step 4: Swap the left and right sides to put in conventional < < form: p   pq pq − < < + =   ˆ ˆ P p 1.96 p p 1.96 0.95   n n   15

  16. Confidence Interval for p (but the unknown p is in the formula. What can we do?) pq = ± ⋅ ˆ p z α n 2 Confidence Interval for p (substitute sample statistic for p) p ˆ q ˆ = ± 2 ⋅ ˆ p z α n Sample Size Needed to Estimate “p” within E, with Confid.=1- α   2 Z   α 2 ⋅ = ˆ ˆ n p q     2 E   16

  17. Components of Sample Size Formula when Estimating “p” • Z α /2 is based on α using the standard normal distribution • p and q are estimates of the population proportions of “successes” and “failures” • E is the acceptable “margin of error” when estimating µ Components of Sample Size Formula when Estimating “p” • p and q are estimates of the population proportions of “successes” and “failures” • Use relevant information to estimate p and q if available • Otherwise, use p = q = 0.5, so the product pq = 0.25 Confidence Interval for σ starts with this fact µ σ if ~ ( , ) x N then ( ) n − 2 1 s χ 2 ~ (chi square) σ 2 17

  18. What have we studied already that connects with Chi-square random values? ( ) n − 2 1 s χ 2 ~ (chi square) σ 2 ∑ ( ) − µ 2 x ( ) − n 1 ( ) ( ) − − 2 n 1 s n 1 = σ σ 2 2 ∑ ( ) − µ 2 x = σ 2 ( ) ( )   2 − µ 2  − µ  x x ∑ ∑   =   σ σ 2       ∑ = 2 a sum of squared z standard normal values 18

  19. Confidence Interval for σ ( ) − 2 n 1 s = LB χ 2 R ( ) − 2 n 1 s = UB χ 2 L 19

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend