4/19/2016 Chapter 6 Inference for categorical data 1.Quick Review - PowerPoint PPT Presentation

4/19/2016 Chapter 6 Inference for categorical data 1.Quick Review Huamei Dong 03/17/2016 Last class, we talked about the inference for a single proportion, that is, confidence (1)Confidence interval for one proportion interval and hypothesis test about one proportion. Today we will learn the inference for the difference of two proportions . 1. Quick Review 2. Sampling distribution of differences of two proportions 3. Confidence interval for p 1 -p 2 4. Hypothesis test when H 0 : p 1 =p 2 (3) For H 0 : p=0.5, H A : p>0.5 and significant level is (2) For H 0 : p=0.5, H A : p≠0.5 and significant level 0.05, is 0.05, Calculate Z value using your sample data. Find the p-value which is the right tail Area. If this right tail area is less than 0.05, we reject H 0. Calculate Z value using your sample data: Find the p-value which should be the area of two tails. If the two tails’ area is less than 0.05 ( or one tail’s area is less than 0.025 ) , we reject H 0. 1

4/19/2016 2. Sampling distribution of the difference of two proportions Sampling distribution of : (4) For H 0 : p=0.5, H A : p<0.5 and significant level Assume the true proportion for population 1 is , sample size from is 0.05, we population 1 is , the true proportion for population 2 is , sample size from population 2 is . (1) If samples from population 1 are independent of each other and (2) If samples from population 2 are independent of each other and (3) Samples from population 1 and samples from population 2 are independent. Calculate Z value using your sample data. Find the p-value which is the left tail Then the sampling distribution of is nearly normal with mean Area. If this left tail area is less than 0.05, we reject H 0. And standard error 3. Confidence interval for p 1 -p 2 4. Hypothesis test when H 0 : p 1 =p 2 Constructing a confidence interval for a proportion. If we do the hypothesis test when H 0: p 1 =p 2, we use (1) Verify the observations are independent and verify the success- failure condition using (2) If the condition are met, the sampling distribution of is nearly normal. (3) Standard error can be approximated by to verify the success-failure condition, that is to verify We also calculate standard error using , , that is (4) Confidence interval is 2

4/19/2016 Example1 There were 50 patients in the experiment who did not receive the blood thinner and 40 patients who did.(1) What is the observed survival rate in the control group? (2) And in the treatment group?(3) Provide a point estimate of the difference in survival proportion of the two groups: (4) Find 95% confidence interval for (5) Complete hypothesis test whether the blood thinner are helpful or harmful at significant level of 0.05. Standard error is Survived Died Total Control 11 39 50 Treatment 14 26 40 Total 25 65 90 So the confidence interval is Answer: (1) (2) (3) (4) To find 95% confidence interval, we need check the success-failure condition: We can just check the table to see if the successes are bigger or equal to 10, the failures are bigger or equal to 10. Or you can use (5) H 0 : p t =p c H A : p t ≠ p c Check the success-failure condition using If we use the Z table, we find the one side tail is 0.0853. So the p-value is 0.176. So we don’t reject H 0 . So the conditions is satisfied. Now we need estimate the standard error using Area is 0.0853 This area is 0.0853, too. Z=1.37 3

4/19/2016 Homework on 03/17/16: (due 03/24/16) 1. Using the data (breast_cancel.txt) (a) To calculate a 95% confidence interval for the difference between the proportion of women under 55 for those who underwent a radical mastectomy and the proportion of women under 55 for those who underwent a partial mastectomy accompanied by radiation therapy. (b) Test whether two proportions are equal at significance level of 0.05. 2. Using the data you sampled on 03/15/16 and the data your classmate sampled: (a) To calculate a 95% confidence interval for the difference between two proportions (b) Test whether two proportions are equal at significance level of 0.05. 4

4/19/2016 Chapter 6 Inference for categorical data 1.Quick Review - PowerPoint PPT Presentation

4/19/2016 Chapter 6 Inference for categorical data 1.Quick Review Huamei Dong 03/17/2016 Last class, we talked about the inference for a single proportion, that is, confidence (1)Confidence interval for one proportion interval and hypothesis

2/17/2016 1 2/17/2016 2 2/17/2016 3 2/17/2016 4 2/17/2016 5 2/17/2016 6 2/17/2016 7

(c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016

(c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016

(c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016

(c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016

(c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016

Q1 2016 Press Presentation | Page 1 | February 25, 2016 | May 3, 2016 Q1 2016 At a Glance

(c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016 Fabbian USA Corp. (c) 2016

MOTION 2014 AGM Agenda Approval 1 13/01/2017 2016 AGM 2016 AGM AGM SPONSORS add

2016 half year results 29 July 2016 1 Overview Highlights H1 2016 Outlook 2016

Tucson Fire Department 2016 Awards Presentation September 22, 2016: December 1, 2016: December 15,

Q2/H1 2016 IR Presentation | Page 1 | February 25, 2016 | August 4, 2016 Thomas Ebeling CEO

2016-2018 BUDGET DRAFT 4 January 18, 2016 PROPOSED 2016-2018 BUDGET AND BUSINESS PLAN 2016-2018

CONTENT MARKETING: THIS IS HOW WE DO IT 1 2/05/2016 2 2/05/2016 Audience Dreamer Advocate

Q2/H1 2016 Press Presentation | August 4, 2016 | | Page 1 1 2 3 4 Q2/H1 2016 At a Glance

Proposed Budget Public Hearing Presented: June 21, 2016 Table of Contents 2016-17

Chapter 21: The Accuracy of Percentages The Bootstrap When we do not know what is in the box, we

Parameters and confidence inter v als FOU N DATION S OF IN FE R E N C E Jo Hardin Instr u ctor

Part 6. Confidence Interval Min Chen School of Computer Science and Engineering Seoul National

Gov 2000: 5. Estimation and Statistical Inference Matthew Blackwell Fall 2016 1 / 56 1. Point

Lecture 21: Bootstrap and Permutation Tests The bootstrap Bootstrapping generally refers to

Probability and Statistics for Computer Science In sta(s(cs we apply probability to draw

Statistical Intervals Vive La Diffrence! William Q. Meeker Department of Statistics Center for

Confidence Interval for the Variance of a Normal Population Bernd Schr oder logo1 Bernd

Sambuz

Useful Links

Newsletter

Mail Us