SLIDE 1

Hypothesis Testing Part I

James J. Heckman, University of Chicago, Econ 312, Spring 2019

SLIDE 2
  • 1. A Brief Review of Hypothesis Testing and Its Uses

Common phrase: "Chicago economics tests models." What are valid tests?

  • Key distinction: ex ante inference (classical inference) vs. ex post inference (likelihood principle; Bayesian inference).

SLIDE 3
  • P values and pure significance tests (R.A. Fisher): focus on null hypothesis testing.

  • Neyman-Pearson tests: focus on null and alternative hypothesis testing.

  • Both involve an appeal to long-run trials. They adopt an ex ante position (justify a procedure by the number of times it is successful if used repeatedly).

SLIDE 4
  • 2. Pure Significance Tests

SLIDE 5
  • Focuses exclusively on the null hypothesis.
  • Let $(Y_1, \ldots, Y_N)$ be observations from a sample.
  • Let $t(Y_1, \ldots, Y_N)$ be a test statistic.
  • If
    1. we know the distribution of $t(Y)$ under $H_0$, and
    2. the larger the value of $t(Y)$, the more the evidence against $H_0$,
  • then

$$P_{\text{obs}} = \Pr(T \geq t_{\text{obs}} \mid H_0).$$

SLIDE 6
  • Then a low value of $P_{\text{obs}}$ (equivalently, a high value of $t_{\text{obs}}$) is evidence against the null hypothesis.
  • Observe that under the null the P value is a Uniform(0, 1) random variable.
  • For a random variable with a density (absolutely continuous with respect to Lebesgue measure), $Z = F_X(X)$ is uniform for any $X$, given that $F_X$ is continuous.
  • Prove this. It is automatic from the definition.
  • P value: the probability that a value of $T$ at least as extreme as $t_{\text{obs}}$ would occur given that $H_0$ is the true state of affairs.
  • The F test or t test for a regression coefficient is an example.
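The uniformity of the P value under $H_0$ is easy to check by simulation. A minimal sketch in Python (assuming NumPy and SciPy; the one-sided z test and the parameter values are illustrative choices, not from the notes):

```python
import numpy as np
from scipy import stats

# Sketch, not from the notes: under H0 the P value of a one-sided z test is Uniform(0, 1).
rng = np.random.default_rng(0)
T, n_reps = 50, 10_000
pvals = np.empty(n_reps)
for i in range(n_reps):
    x = rng.normal(loc=0.0, scale=1.0, size=T)   # data generated under H0: mu = 0, sigma = 1
    z = np.sqrt(T) * x.mean()                    # test statistic with known sigma = 1
    pvals[i] = 1 - stats.norm.cdf(z)             # P_obs = Pr(T >= t_obs | H0)

# Each decile of [0, 1] should contain roughly 10% of the simulated P values.
print(np.histogram(pvals, bins=10, range=(0, 1))[0] / n_reps)
```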

SLIDE 7
  • The higher the test statistic, the more likely we reject.
  • Ignores any evidence on alternatives.
  • R.A. Fisher liked this feature because it did not involve speculation about possibilities other than the one realized.
  • P values make an absolute statement about a model.
  • Questions to consider:
    1. How to construct a "best" test? Compare alternative tests. Any monotonic transformation of the "t" statistic produces the same P value.
    2. Pure significance tests depend on the sampling rule used to collect the data. This is not necessarily bad.
    3. How to pool across studies (or across coefficients)?

SLIDE 8

2.1 Bayesian vs. Frequentist vs. Classical Approach

SLIDE 9
  • ISSUES:
    1. In what sense and how well do significance levels or "P" values summarize evidence in favor of or against hypotheses?
    2. Do we always reject a null in a big enough sample? Meaningful hypothesis testing, Bayesian or classical, requires that "significance levels" decrease with sample size.
    3. Two views: β = 0 tests something meaningful vs. β = 0 is only an approximation and shouldn't be taken too seriously.

SLIDE 10

    4. How to quantify evidence about a model? (How to incorporate prior restrictions?) What is "strength of evidence"?
    5. How to account for model uncertainty: "fishing," etc.

  • First consider the basic Neyman-Pearson structure, then switch over to a Bayesian paradigm.

SLIDE 11
  • Useful to separate out:
    1. Decision problems.
    2. Acts of data description.
  • This is a topic of great controversy in statistics.

SLIDE 12
  • Question: In what sense does increasing sample size always lead to rejection of a hypothesis?
  • If the null is not exactly true, we get rejections (the power of the test → 1 for fixed significance level as the sample size increases).
  • Example to refresh your memory about Neyman-Pearson theory.
  • Take a one-tail normal test about a mean. What is the test?

$$H_0: \bar{X} \sim N\left(\mu_0, \sigma^2/T\right) \qquad H_A: \bar{X} \sim N\left(\mu_A, \sigma^2/T\right)$$

  • Assume $\sigma^2$ is known.

SLIDE 13
  • For any $c$ we get

$$\Pr\left(\frac{\bar{X} - \mu_0}{\sqrt{\sigma^2/T}} > \frac{c - \mu_0}{\sqrt{\sigma^2/T}}\right) = \alpha(c).$$

  • (We exploit the symmetry of the standard normal around the origin.)
  • For a fixed $\alpha$, we can solve for $c(\alpha)$:

$$c(\alpha) = \mu_0 - \frac{\sigma}{\sqrt{T}}\,\Phi^{-1}(\alpha).$$

SLIDE 14
  • Now what is the probability of rejecting the hypothesis under alternatives? (The power of a test.)
  • Let $\mu_A$ be the alternative value of $\mu$.
  • Fix $c$ to have a certain size. (Use the previous calculations.)

$$\Pr\left(\frac{\bar{X} - \mu_A}{\sqrt{\sigma^2/T}} > \frac{c - \mu_A}{\sqrt{\sigma^2/T}}\right)
  = \Pr\left(\frac{\bar{X} - \mu_A}{\sigma/\sqrt{T}} > \frac{\mu_0 - \mu_A - \frac{\sigma}{\sqrt{T}}\,\Phi^{-1}(\alpha)}{\sigma/\sqrt{T}}\right).$$

  • We are evaluating the probability of rejection when we allow $\mu_A$ to vary.

SLIDE 15
  • Thus

$$\Pr\left(\frac{\bar{X} - \mu_A}{\sigma/\sqrt{T}} > \frac{\mu_0 - \mu_A}{\sigma/\sqrt{T}} - \Phi^{-1}(\alpha)\right) = \alpha \quad \text{when } \mu_0 = \mu_A.$$

  • If $\mu_A > \mu_0$, this probability goes to one as $T \to \infty$.
  • This is a consistent test.
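A small numerical sketch of this consistency (Python, assuming SciPy; the values $\mu_0 = 0$, $\mu_A = 0.5$, $\sigma = 1$ are illustrative, not from the notes). It evaluates the power formula above at a fixed 5% size as $T$ grows, as in the power figure a few slides below:

```python
import numpy as np
from scipy import stats

# Power of the one-tail test of H0: mu = mu0 vs HA: mu = muA > mu0, known sigma.
# Critical value: c(alpha) = mu0 - (sigma / sqrt(T)) * Phi^{-1}(alpha).
def power(mu0, muA, sigma, T, alpha=0.05):
    c = mu0 - sigma / np.sqrt(T) * stats.norm.ppf(alpha)          # critical value
    return 1 - stats.norm.cdf((c - muA) / (sigma / np.sqrt(T)))   # Pr(Xbar > c | muA)

for T in (1, 2, 10, 100, 1000):
    print(T, round(power(mu0=0.0, muA=0.5, sigma=1.0, T=T), 4))
# Power rises toward 1 as T grows: the test is consistent.
```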

SLIDE 16
  • Now, suppose we seek to test $H_0: \mu_0 > k$.
  • Use the rule based on $\bar{X} > k$, for fixed $k$.
  • If $\mu_0$ is true:

$$\frac{\bar{X} - \mu_0}{\sigma/\sqrt{T}} > \frac{k - \mu_0}{\sigma/\sqrt{T}}.$$

  • The distribution of $\bar{X}$ becomes more and more concentrated at $\mu_0$ as $T$ grows.
  • In large samples we reject the null unless $\mu_0 = k$ exactly.

SLIDE 17

[Figure: power of the test as a function of the true value $\mu_A$ ($\alpha = 0.05$, $\sigma = 1$, $\mu_0 = 0$), showing $\Pr[\text{mean}(X_T) > c(\alpha)]$ for samples of size $T = 1, 2, 10, 100, 1000$.]

SLIDE 18
  • Parenthetical Note:
  • Observe that if we measure $X$ with the slightest error and the errors do not have mean zero, we always reject $H_0$ for $T$ big enough.

SLIDE 19

Design of Sample Size

  • Suppose that we fix the power $= \beta$.
  • Pick $c(\alpha)$.
  • What sample size produces the desired power?
  • We postulate the alternative $\mu_A = \mu_0 + \Delta$.

SLIDE 20

$$\Pr\left(\frac{\bar{X} - \mu_A}{\sigma/\sqrt{T}} > \frac{\mu_0 - \mu_A}{\sigma/\sqrt{T}} - \Phi^{-1}(\alpha)\right)
  = \Phi\left(\Phi^{-1}(\alpha) + \frac{\mu_A - \mu_0}{\sigma}\sqrt{T}\right) = \beta$$

$$\Phi^{-1}(\beta) = \Phi^{-1}(\alpha) + \frac{\mu_A - \mu_0}{\sigma}\sqrt{T}$$

$$\frac{\Phi^{-1}(\beta) - \Phi^{-1}(\alpha)}{\Delta/\sigma} = \sqrt{T}$$

SLIDE 21
  • Minimum $T$ needed to reject the null at the specified alternative.
  • Has power $\beta$ for "effect" size $\Delta/\sigma$.
  • Pick the sample size on this basis. (This is used in sample design; see the sketch below.)
  • What value of $\beta$ to use?
  • Observe that two investigators with the same $\alpha$ but different sample sizes $T$ have different power.
  • This is often ignored in empirical work.
  • Why not equalize the power of the tests across samples?
  • Why use the same size of test in all empirical work?
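A minimal sketch of this sample-size rule (Python, assuming SciPy; the default $\alpha$ and $\beta$ are illustrative choices): it solves the last display, $\sqrt{T} = [\Phi^{-1}(\beta) - \Phi^{-1}(\alpha)]/(\Delta/\sigma)$, for the smallest integer $T$.

```python
import numpy as np
from scipy import stats

# Minimum T giving power beta against mu_A = mu_0 + Delta at size alpha (known sigma).
def min_sample_size(delta_over_sigma, alpha=0.05, beta=0.80):
    sqrt_T = (stats.norm.ppf(beta) - stats.norm.ppf(alpha)) / delta_over_sigma
    return int(np.ceil(sqrt_T ** 2))

print(min_sample_size(0.2))   # small effect size -> large T
print(min_sample_size(0.5))   # moderate effect size -> smaller T
```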

SLIDE 22
  • 3. Alternative Approaches to Testing and Inference

SLIDE 23

3.1 Classical Hypothesis Testing

SLIDE 24

    1. Appeals to long-run frequencies.
    2. Designs an ex ante rule that on average works well; e.g., at a 5% significance level we make the error of rejecting a true null 5% of the time in repeated trials.
    3. Entails a hypothetical set of trials, and is based on a long-run justification.

SLIDE 25

    4. Consistency of an estimator is an example of this mindset. E.g.,

$$Y = X\beta + U, \qquad E(U \mid X) \neq 0,$$

so OLS is biased for $\beta$. Suppose we have an instrument $Z$ with $\operatorname{Cov}(Z, U) = 0$ and $\operatorname{Cov}(Z, X) \neq 0$:

$$\operatorname{plim}\hat{\beta}_{OLS} = \beta + \frac{\operatorname{Cov}(X, U)}{\operatorname{Var}(X)}, \qquad
  \operatorname{plim}\hat{\beta}_{IV} = \beta + \underbrace{\frac{\operatorname{Cov}(Z, U)}{\operatorname{Cov}(Z, X)}}_{=0} = \beta,$$

  • because $\operatorname{Cov}(Z, U) = 0$,
  • assuming $\operatorname{Cov}(Z, X) \neq 0$.

SLIDE 26
  • Another consistent estimator:
    1. Use OLS for the first $10^{100}$ observations.
    2. Then use IV.
  • Likely to have poor small-sample properties.
  • But on a long-run frequency justification, it's just fine.

SLIDE 27

3.2 Examples of Why Some People Get Very Unhappy about Classical Testing Procedures

Classical inference: ex ante. Likelihood and Bayesian statistics: ex post.

SLIDE 28

Example 1. (Sample size $T = 2$.) We observe $(X_1, X_2)$ with $X_1 \perp\!\!\!\perp X_2$ and

$$P_{\theta_0}(X = \theta_0 - 1) = P_{\theta_0}(X = \theta_0 + 1) = \tfrac{1}{2}.$$

  • One possible (smallest) confidence set for $\theta_0$ is

$$C(X_1, X_2) = \begin{cases} \tfrac{1}{2}(X_1 + X_2) & \text{if } X_1 \neq X_2, \\ X_1 - 1 & \text{if } X_1 = X_2. \end{cases}$$

SLIDE 29
  • Thus 75% of the time $C(X_1, X_2)$ contains $\theta_0$ (in 75% of repeated trials it covers $\theta_0$). (Verify this; a simulation sketch follows.)
  • Yet if $X_1 \neq X_2$, we are certain that the confidence set covers the true value: conditional on that event it is right 100% of the time.
  • Ex post, i.e., with inference conditional on the data, we get the exact value.
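A simulation sketch of these coverage claims (Python, assuming NumPy; the value $\theta_0 = 5$ and the replication count are arbitrary illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
theta0 = 5.0
n_reps = 100_000
x = theta0 + rng.choice([-1.0, 1.0], size=(n_reps, 2))   # X_i = theta0 +/- 1, each w.p. 1/2

# Confidence set from Example 1: midpoint if X1 != X2, else X1 - 1.
estimate = np.where(x[:, 0] != x[:, 1], x.mean(axis=1), x[:, 0] - 1.0)
differ = x[:, 0] != x[:, 1]
print("overall coverage:      ", np.mean(estimate == theta0))            # about 0.75
print("coverage | X1 != X2:   ", np.mean(estimate[differ] == theta0))    # exactly 1
print("coverage | X1 == X2:   ", np.mean(estimate[~differ] == theta0))   # about 0.5
```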

SLIDE 30

Example 2. (D.R. Cox)

    1. You have data, say on DNA from crime scenes.
    2. You can send the data to a New York or a California lab. Both labs seem equally good.
    3. Toss a coin to decide which lab analyzes the data.

  • Should the coin flip be accounted for in the design of the test statistic?

SLIDE 31

Example 3.

  • Test $H_0: \theta = -1$ vs. $H_A: \theta = 1$; $X \sim N(\theta, 0.25)$.
  • Consider the rejection region: reject if $X \geq 0$.
  • If we observe $X = 0$, we would have $\alpha = .0228$.
  • Size is .0228; power under the alternative is .9772. Consistent test; unbiased test. Looks good.
  • If we reverse the roles of null and alternative, it would also look good.

SLIDE 32

Example 3. $X \sim N(\mu, 0.25)$; $H_0: \mu = -1$; $H_A: \mu = 1$. Test: reject $H_0$ if $X \geq 0$.

[Figure: probability density functions $\phi((x - \mu_0)/\sigma)$ and $\phi((x - \mu_1)/\sigma)$ for $\mu_0 = -1$, $\mu_A = 1$, $\sigma^2 = 0.25$. In the case of 0 being observed: power $= .9772$, $\alpha = 0.0228$.]

SLIDE 33

Example 4. Likelihood Principle vs. Classical Inference

  • $X \in \{1, 2, 3\}$; we have two possible models (null and alternative): "0" and "1."

          X = 1    X = 2    X = 3
    P0    .009     .001     .99
    P1    .001     .989     .01

  • Consider the following test: accept $P_0$ when $X = 3$ and accept $P_1$ otherwise.
  • ($\alpha = .01$ and power $\beta = .99$: high power.)
  • Unbiased and consistent test.

SLIDE 34
  • If we observe $X = 1$ we reject $H_0$.
  • But the likelihood ratio in favor of "0" is $.009/.001 = 9$.
  • Likelihood principle: an alternative inferential criterion.
  • All of the sample information is in the likelihood.

SLIDE 35

Example 5. (Likelihood Principle)

          X = 1    X = 2    X = 3
    P0    .005     .005     .99
    P1    .0051    .9849    .01

  • Reject "0" when $X = 1, 2$.
  • Power = .99, size = .01.
  • Is it reasonable to pick "1" over "0" when $X = 1$ is observed? (The likelihood ratio, $.0051/.005 = 1.02$, does not strongly support that hypothesis.)

SLIDE 36

Example 6. (Lindley and Phillips, The American Statistician, August 1976): Irrelevance of Stopping Rules in the Likelihood Principle and in Bayesian Analysis.

  • Consider an experiment.
  • We draw 12 balls from an urn. The urn has an infinite number of balls.
  • $\theta$ = probability of black.
  • $(1 - \theta)$ = probability of red.

SLIDE 37
  • Null hypothesis: red and black are equally likely on each trial and trials are independent.

$$\Pr(X \text{ blacks in 12 draws}) = \binom{12}{X}\,\theta^X (1 - \theta)^{12 - X}.$$

  • Suppose that we draw 9 black balls and 3 red balls.
  • What is the evidence in support of the hypothesis that $\theta = \tfrac{1}{2}$?

SLIDE 38
  • Consider a critical region $X \in \{9, 10, 11, 12\}$ to reject the null $H_0: \theta = \tfrac{1}{2}$:

$$\alpha = \Pr\left(X \in \{9, 10, 11, 12\}\right)
  = \left[\binom{12}{9} + \binom{12}{10} + \binom{12}{11} + \binom{12}{12}\right]\left(\frac{1}{2}\right)^{12}
  = \frac{299}{4096} \approx 7.3\%.$$

  • We do not reject $H_0$ using $\alpha = .05$.
  • We would reject if we chose $\alpha = .10$.
  • This sampling distribution assumes that, e.g., 10 blacks and 2 reds is a possibility.
  • It is based on a counterfactual space of what else could occur and with what probability.

SLIDE 39
  • Consider an alternative sampling rule.
  • Draw balls until 3 red balls are observed and then stop.
  • So 10 blacks and 2 reds on a trial of 12 observations is not possible, as it was before.
  • The distribution of $X_2$ (the number of blacks in this experiment) is

$$\Pr(X_2 = x) = \binom{x + 2}{x}\,\theta^{x}(1 - \theta)^3.$$

  • Prove this (negative binomial).

SLIDE 40
  • Use the same rejection region $X_2 \in \{9, 10, 11, 12, 13, \ldots\}$; i.e., if $X_2 \geq 9$, reject.
  • Note:

$$\Pr(X_2 \in \{9, 10, 11, 12, 13, \ldots\}) = \frac{67}{2048} \approx 3.3\%.$$

  • Now "significant." Reject the null of $\theta = \tfrac{1}{2}$.
  • In both cases: 9 black and 3 red on a single trial.
  • They are the same for a Bayesian (will show below).
  • They have the same MLE, independent of the stopping rule.
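A sketch of the two tail probabilities (Python, assuming SciPy; the rejection regions are the ones above): the same data yield different P values under the two stopping rules.

```python
from scipy import stats

# Binomial sampling: 12 draws fixed in advance; reject if X >= 9 blacks.
p_binomial = 1 - stats.binom.cdf(8, n=12, p=0.5)        # Pr(X >= 9) = 299/4096, about 0.073

# Negative binomial sampling: draw until 3 reds; X2 = number of blacks; reject if X2 >= 9.
# scipy's nbinom counts "failures" (blacks) before the 3rd "success" (red), success prob 1/2.
p_negbinomial = 1 - stats.nbinom.cdf(8, n=3, p=0.5)     # Pr(X2 >= 9) = 67/2048, about 0.033

print(round(p_binomial, 4), round(p_negbinomial, 4))
```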

SLIDE 41
  • In computing P values and significance levels, you need to model what didn't occur.
  • This depends on the stopping rule and the hypothetical admissible sample space.

SLIDE 42

3.3 Likelihood Principle

  • All of the information is in the sample.
  • Look at the likelihood as the best summary of the sample.

SLIDE 43

Likelihood Approach

  • Recall from your previous lectures on asymptotics that under the regularity conditions $Q_T(\theta)$ is a valid criterion:

$$Q_T(\hat{\theta}) = Q(\theta_0) + \frac{1}{2}(\hat{\theta} - \theta_0)'\left.\frac{\partial^2 Q_T}{\partial\theta\,\partial\theta'}\right|_{\theta_0}(\hat{\theta} - \theta_0) + o_P(1),$$

because $\partial Q_T/\partial\theta = 0$ at $\hat{\theta}$ (the first-order condition). For the likelihood $L$:

$$Q_T = \frac{\ln L(\hat{\theta})}{T}, \qquad Q(\theta_0) = \frac{\ln L(\theta_0)}{T}.$$

SLIDE 44
  • In terms of the information matrix, for the likelihood case

$$Q_T(\hat{\theta}) = Q(\theta_0) - \frac{1}{2}(\hat{\theta} - \theta_0)'\,\mathcal{I}_{\theta_0}\,(\hat{\theta} - \theta_0) + o_P(1).$$

  • So we know that as $T \to \infty$, the normalized likelihood converges to a normal shape; e.g., the $N(\mu, \Sigma)$ density is

$$\frac{1}{(2\pi)^{k/2}\,|\Sigma|^{1/2}}\exp\left(-\frac{1}{2}(X - \mu)'\Sigma^{-1}(X - \mu)\right).$$

SLIDE 45
  • The log of the $N(\mu, \Sigma)$ density is

$$-\frac{k}{2}\ln(2\pi) - \frac{1}{2}\ln|\Sigma| - \frac{1}{2}(X - \mu)'\Sigma^{-1}(X - \mu).$$

  • So the likelihood is converging to a normal-looking criterion and has its mode at $\theta_0$.
  • The most likely value is at the MLE (the mode of the likelihood converges to $\theta_0$).
  • In the example of 9 black and 3 red, we have the same $\hat{\theta}$ for either stopping rule: the likelihood ignores constants.

SLIDE 46

Bayesian Principle

  • Use prior information in conjunction with sample information.
  • Place priors on parameters.
  • The Classical Method and the Likelihood Principle sharply separate parameters from data (random variables).
  • The Bayesian method does not.
  • All parameters are random variables.

SLIDE 47
  • The Bayesian and likelihood approaches both use the likelihood.
  • Likelihood: use data from the experiment.
  • Evidence concentrates on $\theta_0$.
  • For both Bayesian and likelihood-principle inference: irrelevance of stopping rules.
  • Bayesian: use data from the experiment plus a prior.
  • The Bayesian approach postulates a prior $p(\theta)$.
  • This is a probability density for $\theta$.

SLIDE 48
  • Compute using the posterior (Bayes' Theorem):

$$\underbrace{\pi(\theta \mid X)}_{\text{posterior}} = \tau\,\underbrace{L(\theta \mid X)}_{\text{likelihood}}\;\underbrace{p(\theta)}_{\text{prior}},$$

where $\tau$ is a constant defined so that the posterior integrates to 1.

  • We get the same posterior independent of constants (and therefore of the sampling rule).

SLIDE 49

De Finetti's Theorem:

  • Let $X_i$ denote a binary variable, $X_i \in \{0, 1\}$, $X_i$ i.i.d.
  • $\Pr(X_i = 1) = \theta$, $\Pr(X_i = 0) = 1 - \theta$.
  • Let $p(r, s)$ = probability of $r$ "1s" and $s$ "0s" in a total of $(r + s)$ draws.
  • If the series is exchangeable,

$$p(r, s) = \int_0^1 \binom{r + s}{r}\,\theta^r (1 - \theta)^s\,p(\theta)\,d\theta.$$

  • Therefore there exists a heterogeneity distribution, for some $p(\theta) \geq 0$ (this is just the standard Hausdorff moment problem).

SLIDE 50

Conjugate Priors

  • For this problem a natural "conjugate" prior is the Beta density

$$p(\theta) = \frac{\theta^{a-1}(1 - \theta)^{b-1}}{B(a, b)}, \qquad 0 \leq \theta \leq 1,$$

with $a = b = 1$ giving the uniform prior, and

$$E(\theta) = \frac{a}{a + b}.$$

[Slides 51–63: figures only.]

SLIDE 64

Bayesian Posterior Density

  • Posterior:

$$\pi(\theta \mid X) = \tau\,\underbrace{\theta^r (1 - \theta)^s}_{\text{likelihood}}\;\underbrace{\theta^{a-1}(1 - \theta)^{b-1}}_{\text{prior}},$$

where $X$ is the data and $\tau$ is a normalizing constant that makes the density integrate to one:

$$\tau\int_0^1 \theta^r (1 - \theta)^s\,\theta^{a-1}(1 - \theta)^{b-1}\,d\theta = 1.$$

  • Observe, crucially, that the normalizing constant is the same for both sampling rules we discussed in the red-ball and black-ball problem.

SLIDE 65
  • Why? Because we choose $\tau$ to make $\pi(\theta \mid X)$ integrate to one.
  • Mean of the posterior with prior parameters $(a, b)$:

$$E_{\text{posterior}}(\theta) = \frac{a + r}{(a + r) + (b + s)}.$$

  • Notice: the constants that played such a crucial role in the sampling distribution play no role here. They vanish in defining the constant $\tau$.

$$\text{mode of } \theta = \frac{a + r - 1}{(a + r - 1) + (b + s - 1)}.$$

  • The likelihood corresponds to $(r + s)$ trials with $r$ blacks and $s$ reds.
  • The prior corresponds to $(a + b - 2)$ trials with $(a - 1)$ blacks and $(b - 1)$ reds.
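A sketch of these posterior quantities for the 9-black, 3-red data (Python, assuming SciPy; the uniform prior $a = b = 1$ is an arbitrary illustrative choice). The posterior is the same under either stopping rule, since the kernels $\theta^r(1-\theta)^s$ differ only by constants that $\tau$ absorbs:

```python
from scipy import stats

a, b = 1, 1                     # uniform Beta prior
r, s = 9, 3                     # 9 blacks, 3 reds
posterior = stats.beta(a + r, b + s)
print("posterior mean:", posterior.mean())                          # (a+r)/(a+r+b+s) = 10/14
print("posterior mode:", (a + r - 1) / (a + r - 1 + b + s - 1))     # 9/12
print("Pr(theta > 1/2 | data):", 1 - posterior.cdf(0.5))
```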

SLIDE 66

Empirical Bayes Approach

  • Estimate the "prior."
  • Go to the Beta-binomial example:

$$p(r, s) = \int_0^1 \binom{r+s}{r}\,\theta^r(1 - \theta)^s\,\frac{\theta^{a-1}(1 - \theta)^{b-1}}{B(a, b)}\,d\theta.$$

  • Now $\theta$ is a heterogeneity parameter distributed $B(a, b)$.

$$= \binom{r+s}{r}\frac{B(a + r,\, b + s)}{B(a, b)}$$

SLIDE 67
  • Estimate $a$ and $b$ as parameters from a string of trials with $r$ blacks and $s$ reds observed. $\theta$ is a person-specific parameter.
  • A similar idea arises in the linear regression model $Y_i = X_i\beta_i + \varepsilon_i$.

SLIDE 68

Random Coefficient Regression

  • We can identify the means and variances of $\beta$.

$$Y_i = X_i\beta_i + \varepsilon_i, \qquad X_i \perp\!\!\!\perp (\beta_i, \varepsilon_i), \qquad \beta_i = \bar{\beta} + U_i, \qquad E(U_i U_i') = \Sigma_U.$$

  • Assume $\varepsilon_i \perp\!\!\!\perp \beta_i$ and $X_i \perp\!\!\!\perp \varepsilon_i$. Then

$$Y_i = X_i\bar{\beta} + \underbrace{(X_i U_i + \varepsilon_i)}_{\nu_i}, \qquad E\left(\nu_i^2 \mid X_i\right) = \sigma^2_\varepsilon + X_i\Sigma_U X_i'.$$

  • Use squared OLS residuals to identify $\Sigma_U$ given $X$ (see the sketch below).
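A simulation sketch of this two-step idea for a single regressor (Python, assuming NumPy; the parameter values are arbitrary illustrative choices): OLS of $Y$ on $X$ recovers $\bar{\beta}$, and a regression of squared residuals on $(1, x_i^2)$ recovers $\sigma^2_\varepsilon$ and $\sigma^2_u$.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000
beta_bar, sigma_u, sigma_eps = 2.0, 0.7, 1.0
x = rng.normal(size=n)
beta_i = beta_bar + sigma_u * rng.normal(size=n)        # random coefficient
y = x * beta_i + sigma_eps * rng.normal(size=n)

# Step 1: OLS of y on x identifies beta_bar.
beta_hat = (x @ y) / (x @ x)

# Step 2: regress squared residuals on (1, x^2) to identify sigma_eps^2 and sigma_u^2.
resid2 = (y - x * beta_hat) ** 2
Z = np.column_stack([np.ones(n), x ** 2])
sigma_eps2_hat, sigma_u2_hat = np.linalg.lstsq(Z, resid2, rcond=None)[0]
print(beta_hat, sigma_eps2_hat, sigma_u2_hat)           # roughly 2.0, 1.0, 0.49
```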

SLIDE 69
  • Notice: we can extend the model to allow $\beta_i = \Phi Z_i + U_i$ and identify $\Phi$ (a hierarchical model).

SLIDE 70
  • Digression: take the classical normal linear regression model

$$Y = X\beta + U, \qquad U \perp\!\!\!\perp X, \qquad E(UU') = \sigma^2 I, \qquad
  \hat{\beta}_{OLS} = (X'X)^{-1}X'Y, \qquad \operatorname{Var}(\hat{\beta}) = \sigma^2(X'X)^{-1}.$$

  • Assume $\sigma^2$ known. Take a conjugate prior on $\beta$:

$$\beta \sim N\left(\bar{\beta},\, \sigma^2 C^{-1}\right).$$

  • The posterior is normal:

$$\beta_{\text{posterior}} \sim N\left(\left(C + X'X\right)^{-1}\left(C\bar{\beta} + (X'X)\hat{\beta}\right),\; \sigma^2\left(C + X'X\right)^{-1}\right).$$

SLIDE 71
  • Thus, we can think of the prior as a sample of observations whose "$X'X$" matrix is $C$ and whose "sample" OLS estimate is $\bar{\beta}$.

SLIDE 72
  • Compare to

$$\begin{pmatrix} Y^* \\ Y \end{pmatrix} = \begin{pmatrix} X^* \\ X \end{pmatrix}\beta + \begin{pmatrix} U^* \\ U \end{pmatrix}.$$

  • OLS on the stacked system is

$$\left(X^{*\prime}X^* + X'X\right)^{-1}\left(X^{*\prime}X^*\,b^* + X'X\,b\right), \qquad
  b^* = (X^{*\prime}X^*)^{-1}X^{*\prime}Y^*, \quad b = (X'X)^{-1}X'Y.$$

  • (Prove this; a numerical check is sketched below.)
  • See, e.g., Robert, The Bayesian Choice, for the more general case where $\sigma^2$ is unknown (gamma prior).
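A numerical sketch of this equivalence (Python, assuming NumPy; the simulated design, prior mean, and prior precision matrix are arbitrary illustrative choices): the posterior mean equals OLS on the sample augmented with a "prior sample" whose cross-product matrix is $C$.

```python
import numpy as np

rng = np.random.default_rng(0)
T, k = 200, 3
X = rng.normal(size=(T, k))
beta_true = np.array([1.0, -0.5, 2.0])
y = X @ beta_true + rng.normal(size=T)            # sigma^2 = 1, known

beta_bar = np.zeros(k)                            # prior mean
C = 10.0 * np.eye(k)                              # prior precision matrix (times 1/sigma^2)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)      # OLS on the data
post_mean = np.linalg.solve(C + X.T @ X, C @ beta_bar + X.T @ X @ beta_hat)

# Equivalent stacked regression: choose X* with X*'X* = C and Y* = X* beta_bar.
X_star = np.linalg.cholesky(C).T
y_star = X_star @ beta_bar
Xs, ys = np.vstack([X_star, X]), np.concatenate([y_star, y])
stacked = np.linalg.solve(Xs.T @ Xs, Xs.T @ ys)

print(np.allclose(post_mean, stacked))            # True
```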

SLIDE 73
  • To compute the evidence on one hypothesis vs. another, use the posterior odds ratio:

$$\frac{\Pr(H_1 \mid X)}{\Pr(H_0 \mid X)} = \frac{\Pr(X \mid H_1)}{\Pr(X \mid H_0)}\cdot\frac{\Pr(H_1)}{\Pr(H_0)}.$$

  • Hypotheses are restrictions on the prior (e.g., different values of $(a, b)$).

SLIDE 74

SLIDE 75

Bayesian Testing: Point Null vs. Point Alternative

  • Think of a regression model: $Y = X\beta_1 + U_1$ vs. $Y = X\beta_0 + U_0$.
  • Two hypotheses: $H_1$, $H_0$.

$$\underbrace{\frac{\Pr(H_1 \mid Y)}{\Pr(H_0 \mid Y)}}_{\text{posterior odds ratio}}
  = \underbrace{\frac{\Pr(Y \mid H_1)}{\Pr(Y \mid H_0)}}_{\text{Bayes factor}}
    \;\underbrace{\frac{\Pr(H_1)}{\Pr(H_0)}}_{\text{prior odds ratio}}$$

SLIDE 76
  • "Predictive density":

$$f(Y \mid H_i) = \int_{\beta_i}\int_{\sigma^2_i}
  \underbrace{f\left(Y \mid H_i, \beta_i, \sigma^2_i\right)}_{\text{likelihood}}\;
  \underbrace{f\left(\beta_i, \sigma^2_i\right)}_{\text{prior density}}\,d\beta_i\,d\sigma^2_i.$$

SLIDE 77
  • The evidence supports the model with the higher posterior probability.
  • Example:

$$Y_i \sim N\left(\mu, \sigma^2\right), \qquad \bar{Y} \sim N\left(\mu, \sigma^2/T\right),$$

$$H_0: \mu_0 = 0,\ \sigma = 1 \qquad H_1: \mu_1 = 1,\ \sigma = 1,$$

so $H_0: \bar{Y} \sim N(0, 1/T)$ and $H_1: \bar{Y} \sim N(1, 1/T)$.

SLIDE 78
  • Typical Neyman-Pearson rule:

Reject $H_0$ if $\bar{Y} \geq c$; accept $H_0$ if $\bar{Y} < c$.

SLIDE 79
  • Type 1 and Type 2 errors:

$$\alpha(c) = \Pr\left(\bar{Y} > c \mid \mu = 0\right), \qquad \beta(c) = \Pr\left(\bar{Y} \leq c \mid \mu = 1\right).$$

  • Example: with $T = 1$ and $c = 0.5$, $\alpha = \beta \approx 0.31$ (show this).

SLIDE 80

Bayes Approach

$$\Pr\left(H_0 \mid \bar{Y}\right)
  = \frac{f\left(\bar{Y} \mid H_0\right)\Pr(H_0)}{f\left(\bar{Y}\right)}
  = \frac{f\left(\bar{Y} \mid H_0\right)\Pr(H_0)}{f\left(\bar{Y} \mid H_0\right)\Pr(H_0) + f\left(\bar{Y} \mid H_1\right)\Pr(H_1)}$$

$$\Pr\left(H_1 \mid \bar{Y}\right)
  = \frac{f\left(\bar{Y} \mid H_1\right)\Pr(H_1)}{f\left(\bar{Y} \mid H_0\right)\Pr(H_0) + f\left(\bar{Y} \mid H_1\right)\Pr(H_1)}$$

SLIDE 81

$$\frac{\Pr\left(H_0 \mid \bar{Y}\right)}{\Pr\left(H_1 \mid \bar{Y}\right)}
  = \frac{f\left(\bar{Y} \mid H_0\right)\Pr(H_0)}{f\left(\bar{Y} \mid H_1\right)\Pr(H_1)}
  = \exp\left\{\frac{1}{2}\left[-T\bar{Y}^2 + T\left(\bar{Y} - 1\right)^2\right]\right\}\frac{\Pr(H_0)}{\Pr(H_1)}$$

$$= \exp\left\{\frac{1}{2}\left[T\bar{Y}^2 - 2T\bar{Y} + T - T\bar{Y}^2\right]\right\}\frac{\Pr(H_0)}{\Pr(H_1)}
  = \exp\left\{\frac{1}{2}\left(T - 2T\bar{Y}\right)\right\}\frac{\Pr(H_0)}{\Pr(H_1)}$$

SLIDE 82
  • Recall $\sigma^2 = 1$ under the null and the alternative.

$$\ln\frac{\Pr\left(H_0 \mid \bar{Y}\right)}{\Pr\left(H_1 \mid \bar{Y}\right)}
  = \ln\frac{\Pr(H_0)}{\Pr(H_1)} + \frac{T}{2}\left(1 - 2\bar{Y}\right)$$

  • Accept $H_0$ when this is positive:

$$\frac{T}{2}\left(1 - 2\bar{Y}\right) + \ln\frac{\Pr(H_0)}{\Pr(H_1)} > 0
  \iff \frac{1}{2} + \frac{\ln\left[\Pr(H_0)/\Pr(H_1)\right]}{T} > \bar{Y}.$$

  • As $T$ gets big, the cutoff changes with sample size unless $\Pr(H_0) = \Pr(H_1) = \tfrac{1}{2}$.
  • Notice that this is different from the classical statistical rule of a fixed cutoff point.
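A minimal sketch of this cutoff (Python, assuming NumPy; the prior probability 0.75 is an arbitrary illustrative choice): the Bayes acceptance threshold drifts toward 1/2 as $T$ grows, while a classical test keeps a fixed critical value.

```python
import numpy as np

# Accept H0: mu = 0 (vs H1: mu = 1, sigma = 1) when Ybar < 1/2 + ln[Pr(H0)/Pr(H1)] / T.
def bayes_cutoff(T, prior_H0=0.75):
    return 0.5 + np.log(prior_H0 / (1 - prior_H0)) / T

for T in (5, 20, 100, 1000):
    print(T, round(bayes_cutoff(T), 4))
```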

SLIDE 83

Point Null vs. Composite Alternative

  • Same setup as in the previous case: $\bar{Y} \sim N\left(\mu, \sigma^2/T\right)$.
  • $H_0: \mu = 0$ vs. $H_A: \mu \neq 0$. $\sigma^2$ is unspecified, but common across models.

SLIDE 84
  • Turn the Bayes crank. Likelihood factor:

$$\frac{f_T\left(Y; \mu, \sigma^2 I\right)}{f_T\left(Y; 0, \sigma^2 I\right)}$$

  • Relative likelihoods:

$$LR = \exp\left\{\frac{T}{2\sigma^2}\left[\bar{Y}^2 - \left(\bar{Y} - \mu\right)^2\right]\right\}$$

SLIDE 85
  • What value of $\mu$ is best supported by the data?
  • Recall the likelihood approach (it focuses on the outcomes that are most likely):

$$LR = \exp\left\{\frac{T}{2\sigma^2}\,\mu\left(2\bar{Y} - \mu\right)\right\}$$

SLIDE 86

SLIDE 87
  • The P value approach uses absolute likelihood, not relative likelihood.
  • In what sense is it most likely? Likelihood approach:
  • Evaluate at the null of $\mu = 0$ and we get

$$L = \exp\left(-\frac{T}{2\sigma^2}\bar{Y}^2\right)
  \;\dot{=}\; 1 - \frac{T}{2\sigma^2}\bar{Y}^2
  = 1 - \frac{1}{2}\underbrace{\left(\frac{\bar{Y}}{\sqrt{\sigma^2/T}}\right)^2}_{t^2 \text{ for } \mu = 0}.$$

  • This is an expression of support for the hypothesis $\mu = 0$.
  • Thus a big "t" value leads to rejection of the null.
  • But this approach does not worry about the alternative.

SLIDE 88

Frequency Theory or Sampling Approach

  • Look at the sampling distributions of the model.
  • Test statistic $\bar{Y}$, centered at $\mu = 0$:

$$\alpha(c) = \Pr\left(\bar{Y} > c \mid \mu = 0\right);$$

e.g., if $\bar{Y} \geq 1.96\,\sigma/\sqrt{T}$ we reject.

  • P value: the knife-edge level for the value that occurred (the value that favors the null?). At any level less than p, the null hypothesis is not rejected.

SLIDE 89

SLIDE 90
  • Significance level: is what occurred unlikely?
  • Relative likelihood computes the evidence for one hypothesis relative to another (null vs. alternative).
  • Support for one hypothesis vs. support for another.
  • Suppose we allocate positive probability to the null.

SLIDE 91
  • Otherwise the probability of a point null would be 0.

$$P(\mu) = \begin{cases}
  \pi & \text{if } \mu = 0, \\[4pt]
  (1 - \pi)\,f_N\!\left(\mu \mid 0, (h^*)^{-1}\right), \quad \mu \sim N\!\left(0, \tfrac{1}{h^*}\right) & \text{if } \mu \neq 0.
\end{cases}$$

SLIDE 92
  • Point mass:

$$\frac{\Pr\left(H_1 \mid \bar{Y}\right)}{\Pr\left(H_0 \mid \bar{Y}\right)}
  = \frac{\displaystyle\int_{\mu \neq 0} f_N\!\left(\bar{Y} \mid \mu, \sigma^2/T\right)P(\mu)\,d\mu}{\pi\,f_N\!\left(\bar{Y} \mid \mu = 0, \sigma^2/T\right)}
  = \frac{1 - \pi}{\pi}\cdot
    \frac{\displaystyle\int \frac{1}{\sqrt{2\pi\sigma^2/T}}\exp\left(-\frac{T\left(\bar{Y} - \mu\right)^2}{2\sigma^2}\right)\sqrt{\frac{h^*}{2\pi}}\exp\left(-\frac{\mu^2 h^*}{2}\right)d\mu}
         {\displaystyle\frac{1}{\sqrt{2\pi\sigma^2/T}}\exp\left(-\frac{T\bar{Y}^2}{2\sigma^2}\right)}$$

SLIDE 93
  • Complete the square in the numerator and integrate out $\mu$.
  • Side manipulations: look at the numerator's exponent,

$$\exp\left\{-\frac{T}{2\sigma^2}\left(\bar{Y}^2 - 2\mu\bar{Y} + \mu^2\right) - \frac{\mu^2}{2}h^*\right\}.$$

SLIDE 94
  • Complete the square to reach

$$\exp\left\{-\frac{T}{2\sigma^2}\left(\bar{Y}^2 - 2\mu\bar{Y} + \mu^2\right) - \frac{\mu^2 h^*}{2}\right\}
  = \exp\left\{-\frac{T\bar{Y}^2}{2\sigma^2}\right\}
    \exp\left\{\frac{1}{2}\,\frac{\left(T\bar{Y}/\sigma^2\right)^2}{T/\sigma^2 + h^*}\right\}
    \exp\left\{-\frac{1}{2}\left(h^* + \frac{T}{\sigma^2}\right)
      \left[\mu - \frac{T\bar{Y}/\sigma^2}{T/\sigma^2 + h^*}\right]^2\right\}.$$

  • Integrating the last (Gaussian) factor over $\mu$ gives $\sqrt{2\pi}\,\left(h^* + T/\sigma^2\right)^{-1/2}$.

SLIDE 95
  • Then integrate out $\mu$ (using the conjugate normal prior) and, cancelling terms, we get

$$\frac{P\left(H_1 \mid \bar{Y}\right)}{P\left(H_0 \mid \bar{Y}\right)}
  = \frac{1 - \pi}{\pi}\,\sqrt{h^*}\left(h^* + \frac{T}{\sigma^2}\right)^{-1/2}
    \exp\left\{\frac{T\bar{Y}^2}{\sigma^2}\cdot\frac{1}{2}\cdot\frac{T/\sigma^2}{T/\sigma^2 + h^*}\right\}$$

$$= \frac{1 - \pi}{\pi}
   \underbrace{\left(1 + \frac{T}{h^*\sigma^2}\right)^{-1/2}
   \exp\left\{\left(\frac{\bar{Y}}{\sigma/\sqrt{T}}\right)^2\frac{1}{2}\,\frac{1}{1 + \sigma^2 h^*/T}\right\}}_{\text{Bayes factor}}$$

SLIDE 96

$$= \frac{1 - \pi}{\pi}\left(\frac{1}{1 + \frac{T}{h^*\sigma^2}}\right)^{1/2}
   \exp\left\{\frac{t^2}{2}\,\frac{1}{1 + \sigma^2 h^*/T}\right\}$$

  • Notice that the higher $\dfrac{\bar{Y}}{\sigma/\sqrt{T}} =$ "t", the more likely we are to reject $H_0$.
  • However, as $T \to \infty$ for fixed "t", we get

$$\frac{\Pr\left(H_1 \mid \bar{Y}\right)}{\Pr\left(H_0 \mid \bar{Y}\right)} \to 0.$$

  • Notice that "t" $= \sqrt{T}\,\dfrac{\bar{Y} - \mu}{\sigma}$ for $\mu = 0$; this is $O_P(1)$ under the null.
  • ∴ we support $H_0$ (the "Lindley Paradox").
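A numerical sketch of the paradox (Python, assuming NumPy; the prior precision $h^* = 1$, $\pi = 1/2$, and the fixed $t = 1.96$ are illustrative choices): the posterior odds above shrink toward zero as $T$ grows, even though a result with $t = 1.96$ is "significant at 5%" for every $T$.

```python
import numpy as np

# Posterior odds Pr(H1 | Ybar) / Pr(H0 | Ybar) for the point null, at a fixed "t" statistic.
def posterior_odds(t, T, pi=0.5, sigma2=1.0, h_star=1.0):
    bayes_factor = (1 + T / (h_star * sigma2)) ** -0.5 * np.exp(0.5 * t**2 / (1 + sigma2 * h_star / T))
    return (1 - pi) / pi * bayes_factor

for T in (10, 100, 1_000, 100_000):
    print(T, posterior_odds(t=1.96, T=T))
# The odds fall toward 0: with enough data the Bayesian ends up favoring the point null.
```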

SLIDE 97
  • Bayesians use the sample size to adjust the "critical region" or rejection region.
  • In the classical case, with α fixed, the power of the test goes to 1. (It overweights the null hypothesis.)

  • Issue: which weighting of α and β is better?
