Economics 850 James G. MacKinnon September, 2020 James G. - - PowerPoint PPT Presentation

economics 850
SMART_READER_LITE
LIVE PREVIEW

Economics 850 James G. MacKinnon September, 2020 James G. - - PowerPoint PPT Presentation

Economics 850 James G. MacKinnon September, 2020 James G. MacKinnon Economics 850 September, 2020 1 / 26 Introduction Introduction ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students. James G.


slide-1
SLIDE 1

Economics 850

James G. MacKinnon September, 2020

James G. MacKinnon Economics 850 September, 2020 1 / 26

slide-2
SLIDE 2

Introduction

Introduction

ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students.

James G. MacKinnon Economics 850 September, 2020 2 / 26

slide-3
SLIDE 3

Introduction

Introduction

ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students. It is assumed that all students have taken a serious masters-level econometrics course.

James G. MacKinnon Economics 850 September, 2020 2 / 26

slide-4
SLIDE 4

Introduction

Introduction

ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students. It is assumed that all students have taken a serious masters-level econometrics course. Familiarity with basic concepts of mathematical statistics would also be very helpful.

James G. MacKinnon Economics 850 September, 2020 2 / 26

slide-5
SLIDE 5

Introduction

Introduction

ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students. It is assumed that all students have taken a serious masters-level econometrics course. Familiarity with basic concepts of mathematical statistics would also be very helpful. No attempt will be made to develop asymptotic theory in a fully rigorous way. That will be done in ECON 851.

James G. MacKinnon Economics 850 September, 2020 2 / 26

slide-6
SLIDE 6

Introduction

Introduction

ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students. It is assumed that all students have taken a serious masters-level econometrics course. Familiarity with basic concepts of mathematical statistics would also be very helpful. No attempt will be made to develop asymptotic theory in a fully rigorous way. That will be done in ECON 851. Tuesdays 10:00–11:20, Thursdays 10:00–11:20.

James G. MacKinnon Economics 850 September, 2020 2 / 26

slide-7
SLIDE 7

Introduction

Introduction

ECON 850 is the first course of a two-course sequence in econometrics intended for Ph.D. students. It is assumed that all students have taken a serious masters-level econometrics course. Familiarity with basic concepts of mathematical statistics would also be very helpful. No attempt will be made to develop asymptotic theory in a fully rigorous way. That will be done in ECON 851. Tuesdays 10:00–11:20, Thursdays 10:00–11:20. Lectures will be given using Zoom (or maybe Teams). Which do people prefer? Are there features of either that we should use?

James G. MacKinnon Economics 850 September, 2020 2 / 26

slide-8
SLIDE 8

Introduction

The first two-thirds of the course will be based on the first six chapters of the incomplete second edition of Econometric Theory and Methods by R. Davidson and J. G. MacKinnon.

James G. MacKinnon Economics 850 September, 2020 3 / 26

slide-9
SLIDE 9

Introduction

The first two-thirds of the course will be based on the first six chapters of the incomplete second edition of Econometric Theory and Methods by R. Davidson and J. G. MacKinnon. I will provide every student with a PDF copy.

James G. MacKinnon Economics 850 September, 2020 3 / 26

slide-10
SLIDE 10

Introduction

The first two-thirds of the course will be based on the first six chapters of the incomplete second edition of Econometric Theory and Methods by R. Davidson and J. G. MacKinnon. I will provide every student with a PDF copy. The remainder of the course will use material from a few chapters

  • f the first edition, plus some material from Estimation and

Inference in Econometrics and some supplementary notes.

James G. MacKinnon Economics 850 September, 2020 3 / 26

slide-11
SLIDE 11

Introduction

The first two-thirds of the course will be based on the first six chapters of the incomplete second edition of Econometric Theory and Methods by R. Davidson and J. G. MacKinnon. I will provide every student with a PDF copy. The remainder of the course will use material from a few chapters

  • f the first edition, plus some material from Estimation and

Inference in Econometrics and some supplementary notes. It will be assumed that everyone is familiar with Stata and/or R. Assignments could probably also be done in a matrix language such as Matlab, Octave, or Ox, but it would be more work.

James G. MacKinnon Economics 850 September, 2020 3 / 26

slide-12
SLIDE 12

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-13
SLIDE 13

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-14
SLIDE 14

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-15
SLIDE 15

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-16
SLIDE 16

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-17
SLIDE 17

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-18
SLIDE 18

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

7

Alternative Covariance Matrices—Chapter 5 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-19
SLIDE 19

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

7

Alternative Covariance Matrices—Chapter 5 of ETM2.

8

Bootstrap Methods—Chapter 6 of ETM2.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-20
SLIDE 20

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

7

Alternative Covariance Matrices—Chapter 5 of ETM2.

8

Bootstrap Methods—Chapter 6 of ETM2.

9

Nonlinear Least Squares—Chapter 6 of ETM + supp.

James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-21
SLIDE 21

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

7

Alternative Covariance Matrices—Chapter 5 of ETM2.

8

Bootstrap Methods—Chapter 6 of ETM2.

9

Nonlinear Least Squares—Chapter 6 of ETM + supp.

10 Generalized Least Squares—Chapter 7 of ETM. James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-22
SLIDE 22

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

7

Alternative Covariance Matrices—Chapter 5 of ETM2.

8

Bootstrap Methods—Chapter 6 of ETM2.

9

Nonlinear Least Squares—Chapter 6 of ETM + supp.

10 Generalized Least Squares—Chapter 7 of ETM. 11 Instrumental Variables—Chapter 8 of ETM. James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-23
SLIDE 23

Introduction

Course Outline

1

Introductory material based on Chapter 1 of ETM2.

2

The Geometry of Least Squares—Chapter 2 of ETM2.

3

Basic Properties of OLS—Chapter 3 of ETM2.

4

Introduction to Asymptotic Theory—Chapter 3 of ETM2.

5

Hypothesis Testing—Chapter 4 of ETM2.

6

Confidence Intervals—Chapter 5 of ETM2.

7

Alternative Covariance Matrices—Chapter 5 of ETM2.

8

Bootstrap Methods—Chapter 6 of ETM2.

9

Nonlinear Least Squares—Chapter 6 of ETM + supp.

10 Generalized Least Squares—Chapter 7 of ETM. 11 Instrumental Variables—Chapter 8 of ETM. 12 Other topics—Bayesian Methods? Binary Response Models? James G. MacKinnon Economics 850 September, 2020 4 / 26

slide-24
SLIDE 24

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer.

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-25
SLIDE 25

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer. The final examination will be worth 70%. It will be open-book (but with limited time), because I do not know how to administer conventional examinations remotely.

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-26
SLIDE 26

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer. The final examination will be worth 70%. It will be open-book (but with limited time), because I do not know how to administer conventional examinations remotely. Just what the final examination will look like and how it will be administered is not yet known.

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-27
SLIDE 27

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer. The final examination will be worth 70%. It will be open-book (but with limited time), because I do not know how to administer conventional examinations remotely. Just what the final examination will look like and how it will be administered is not yet known. In the past, the assignments have been worth 20%, the midterm 20%, and the final exam 60%.

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-28
SLIDE 28

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer. The final examination will be worth 70%. It will be open-book (but with limited time), because I do not know how to administer conventional examinations remotely. Just what the final examination will look like and how it will be administered is not yet known. In the past, the assignments have been worth 20%, the midterm 20%, and the final exam 60%. The T.A. is Mehtab Hanzroh.

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-29
SLIDE 29

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer. The final examination will be worth 70%. It will be open-book (but with limited time), because I do not know how to administer conventional examinations remotely. Just what the final examination will look like and how it will be administered is not yet known. In the past, the assignments have been worth 20%, the midterm 20%, and the final exam 60%. The T.A. is Mehtab Hanzroh. Tutorials? Virtual office hours?

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-30
SLIDE 30

Introduction

There will be a number of assignments, which collectively will account for 30% of the final mark. These assignments will make extensive use of the computer. The final examination will be worth 70%. It will be open-book (but with limited time), because I do not know how to administer conventional examinations remotely. Just what the final examination will look like and how it will be administered is not yet known. In the past, the assignments have been worth 20%, the midterm 20%, and the final exam 60%. The T.A. is Mehtab Hanzroh. Tutorials? Virtual office hours?

http://qed.econ.queensu.ca/pub/faculty/mackinnon/econ850/

James G. MacKinnon Economics 850 September, 2020 5 / 26

slide-31
SLIDE 31

Some Properties of PDFs and CDFs

Some Properties of PDFs and CDFs

Random variables may be discrete (binary, counts) or continuous. A continuous random variable x can take on real values. The realized value of x is often denoted X.

James G. MacKinnon Economics 850 September, 2020 6 / 26

slide-32
SLIDE 32

Some Properties of PDFs and CDFs

Some Properties of PDFs and CDFs

Random variables may be discrete (binary, counts) or continuous. A continuous random variable x can take on real values. The realized value of x is often denoted X. The distribution of x is described by a cumulative distribution function, or CDF: F(x) = Pr(X ≤ x).

James G. MacKinnon Economics 850 September, 2020 6 / 26

slide-33
SLIDE 33

Some Properties of PDFs and CDFs

Some Properties of PDFs and CDFs

Random variables may be discrete (binary, counts) or continuous. A continuous random variable x can take on real values. The realized value of x is often denoted X. The distribution of x is described by a cumulative distribution function, or CDF: F(x) = Pr(X ≤ x). 0 ≤ F(x) ≤ 1.

James G. MacKinnon Economics 850 September, 2020 6 / 26

slide-34
SLIDE 34

Some Properties of PDFs and CDFs

Some Properties of PDFs and CDFs

Random variables may be discrete (binary, counts) or continuous. A continuous random variable x can take on real values. The realized value of x is often denoted X. The distribution of x is described by a cumulative distribution function, or CDF: F(x) = Pr(X ≤ x). 0 ≤ F(x) ≤ 1. F(x) tends to 0 as x → −∞. F(x) tends to 1 as x → +∞.

James G. MacKinnon Economics 850 September, 2020 6 / 26

slide-35
SLIDE 35

Some Properties of PDFs and CDFs

Some Properties of PDFs and CDFs

Random variables may be discrete (binary, counts) or continuous. A continuous random variable x can take on real values. The realized value of x is often denoted X. The distribution of x is described by a cumulative distribution function, or CDF: F(x) = Pr(X ≤ x). 0 ≤ F(x) ≤ 1. F(x) tends to 0 as x → −∞. F(x) tends to 1 as x → +∞. F(x) must be a weakly increasing function of x.

James G. MacKinnon Economics 850 September, 2020 6 / 26

slide-36
SLIDE 36

Some Properties of PDFs and CDFs

The probability that x = X is always zero.

James G. MacKinnon Economics 850 September, 2020 7 / 26

slide-37
SLIDE 37

Some Properties of PDFs and CDFs

The probability that x = X is always zero. If a < b, then Pr(X ≤ b) = Pr(X ≤ a) + Pr(a < X ≤ b).

James G. MacKinnon Economics 850 September, 2020 7 / 26

slide-38
SLIDE 38

Some Properties of PDFs and CDFs

The probability that x = X is always zero. If a < b, then Pr(X ≤ b) = Pr(X ≤ a) + Pr(a < X ≤ b). Therefore, Pr(a ≤ X ≤ b) = F(b) − F(a).

James G. MacKinnon Economics 850 September, 2020 7 / 26

slide-39
SLIDE 39

Some Properties of PDFs and CDFs

The probability that x = X is always zero. If a < b, then Pr(X ≤ b) = Pr(X ≤ a) + Pr(a < X ≤ b). Therefore, Pr(a ≤ X ≤ b) = F(b) − F(a). If b = a, then we get F(a) − F(a) = 0.

James G. MacKinnon Economics 850 September, 2020 7 / 26

slide-40
SLIDE 40

Some Properties of PDFs and CDFs

The probability that x = X is always zero. If a < b, then Pr(X ≤ b) = Pr(X ≤ a) + Pr(a < X ≤ b). Therefore, Pr(a ≤ X ≤ b) = F(b) − F(a). If b = a, then we get F(a) − F(a) = 0. The probability density function, or PDF, is just the derivative of the CDF: f(x) ≡ F′(x). Evidently,

−∞f(x) dx =

−∞F′(x) dx = F(∞) − F(−∞) = 1.

(1)

James G. MacKinnon Economics 850 September, 2020 7 / 26

slide-41
SLIDE 41

Some Properties of PDFs and CDFs

The probability that x = X is always zero. If a < b, then Pr(X ≤ b) = Pr(X ≤ a) + Pr(a < X ≤ b). Therefore, Pr(a ≤ X ≤ b) = F(b) − F(a). If b = a, then we get F(a) − F(a) = 0. The probability density function, or PDF, is just the derivative of the CDF: f(x) ≡ F′(x). Evidently,

−∞f(x) dx =

−∞F′(x) dx = F(∞) − F(−∞) = 1.

(1) More generally,

b

a f(x) dx = Pr(a ≤ X ≤ b) = F(b) − F(a).

(2)

James G. MacKinnon Economics 850 September, 2020 7 / 26

slide-42
SLIDE 42

Some Properties of PDFs and CDFs

It is evident that f(x) ≥ 0, because F(x) is non-decreasing.

James G. MacKinnon Economics 850 September, 2020 8 / 26

slide-43
SLIDE 43

Some Properties of PDFs and CDFs

It is evident that f(x) ≥ 0, because F(x) is non-decreasing. f(x) is not bounded above by unity, because the value of a PDF at a point x is not a probability.

James G. MacKinnon Economics 850 September, 2020 8 / 26

slide-44
SLIDE 44

Some Properties of PDFs and CDFs

It is evident that f(x) ≥ 0, because F(x) is non-decreasing. f(x) is not bounded above by unity, because the value of a PDF at a point x is not a probability. The PDF of the standard normal distribution is φ(x) = (2π)−1/2 exp − 1 2x2 . (3)

James G. MacKinnon Economics 850 September, 2020 8 / 26

slide-45
SLIDE 45

Some Properties of PDFs and CDFs

It is evident that f(x) ≥ 0, because F(x) is non-decreasing. f(x) is not bounded above by unity, because the value of a PDF at a point x is not a probability. The PDF of the standard normal distribution is φ(x) = (2π)−1/2 exp − 1 2x2 . (3) The CDF of the standard normal distribution is Φ(x) =

x

−∞ φ(y) dy.

(4) This has no closed-form solution.

James G. MacKinnon Economics 850 September, 2020 8 / 26

slide-46
SLIDE 46

Some Properties of PDFs and CDFs

It is evident that f(x) ≥ 0, because F(x) is non-decreasing. f(x) is not bounded above by unity, because the value of a PDF at a point x is not a probability. The PDF of the standard normal distribution is φ(x) = (2π)−1/2 exp − 1 2x2 . (3) The CDF of the standard normal distribution is Φ(x) =

x

−∞ φ(y) dy.

(4) This has no closed-form solution. The maximum of the PDF φ(x) is at x = 0, where the slope of the CDF Φ(x) is steepest.

James G. MacKinnon Economics 850 September, 2020 8 / 26

slide-47
SLIDE 47

Some Properties of PDFs and CDFs

−3 −2 −1 1 2 3 0.5 1.0

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

x Φ(x) Standard Normal CDF: −3 −2 −1 1 2 3 0.1 0.2 0.3 0.4

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . x

φ(x) Standard Normal PDF:

James G. MacKinnon Economics 850 September, 2020 9 / 26

slide-48
SLIDE 48

Some Properties of PDFs and CDFs

For a continuous random variable, µ ≡ E(x) ≡

−∞x f(x) dx.

(5) Since x can range from −∞ to ∞, this integral may well diverge.

James G. MacKinnon Economics 850 September, 2020 10 / 26

slide-49
SLIDE 49

Some Properties of PDFs and CDFs

For a continuous random variable, µ ≡ E(x) ≡

−∞x f(x) dx.

(5) Since x can range from −∞ to ∞, this integral may well diverge. The kth uncentered moment of x is mk(x) ≡

−∞xk f(x) dx.

(6)

James G. MacKinnon Economics 850 September, 2020 10 / 26

slide-50
SLIDE 50

Some Properties of PDFs and CDFs

For a continuous random variable, µ ≡ E(x) ≡

−∞x f(x) dx.

(5) Since x can range from −∞ to ∞, this integral may well diverge. The kth uncentered moment of x is mk(x) ≡

−∞xk f(x) dx.

(6) The kth central moment of the distribution of x is µk ≡ E(x − µ)k =

−∞(x − µ)k f(x) dx.

(7)

James G. MacKinnon Economics 850 September, 2020 10 / 26

slide-51
SLIDE 51

Some Properties of PDFs and CDFs

The second central moment is the variance, Var(x) = σ2. The square root of the variance, σ, is called the standard deviation.

James G. MacKinnon Economics 850 September, 2020 11 / 26

slide-52
SLIDE 52

Some Properties of PDFs and CDFs

The second central moment is the variance, Var(x) = σ2. The square root of the variance, σ, is called the standard deviation. Estimates of standard deviations of parameter estimates are called standard errors.

James G. MacKinnon Economics 850 September, 2020 11 / 26

slide-53
SLIDE 53

Some Properties of PDFs and CDFs

The second central moment is the variance, Var(x) = σ2. The square root of the variance, σ, is called the standard deviation. Estimates of standard deviations of parameter estimates are called standard errors. If ¯ x is the sample mean of xi, i = 1, . . . , N, then the sample standard deviation is s.d.(x) =

  • 1

N − 1

N

i=1

(xi − ¯ x)2

  • 1/2

. (8)

James G. MacKinnon Economics 850 September, 2020 11 / 26

slide-54
SLIDE 54

Some Properties of PDFs and CDFs

The second central moment is the variance, Var(x) = σ2. The square root of the variance, σ, is called the standard deviation. Estimates of standard deviations of parameter estimates are called standard errors. If ¯ x is the sample mean of xi, i = 1, . . . , N, then the sample standard deviation is s.d.(x) =

  • 1

N − 1

N

i=1

(xi − ¯ x)2

  • 1/2

. (8) Under the assumption that the xi are uncorrelated, s.e.(¯ x) = 1 √ N s.d.(x). (9)

James G. MacKinnon Economics 850 September, 2020 11 / 26

slide-55
SLIDE 55

Joint Distributions

Joint Distributions

A continuous, bivariate random variable (x1, x2) has the distribution function F(x1, x2) = Pr (X1 ≤ x1) ∩ (X2 ≤ x2)

  • .

(10) Thus the joint CDF F(x1, x2) is the joint probability that both X1 ≤ x1 and X2 ≤ x2.

James G. MacKinnon Economics 850 September, 2020 12 / 26

slide-56
SLIDE 56

Joint Distributions

Joint Distributions

A continuous, bivariate random variable (x1, x2) has the distribution function F(x1, x2) = Pr (X1 ≤ x1) ∩ (X2 ≤ x2)

  • .

(10) Thus the joint CDF F(x1, x2) is the joint probability that both X1 ≤ x1 and X2 ≤ x2. The joint density function is f(x1, x2) = ∂2F(x1, x2) ∂x1∂x2 . (11)

James G. MacKinnon Economics 850 September, 2020 12 / 26

slide-57
SLIDE 57

Joint Distributions

Joint Distributions

A continuous, bivariate random variable (x1, x2) has the distribution function F(x1, x2) = Pr (X1 ≤ x1) ∩ (X2 ≤ x2)

  • .

(10) Thus the joint CDF F(x1, x2) is the joint probability that both X1 ≤ x1 and X2 ≤ x2. The joint density function is f(x1, x2) = ∂2F(x1, x2) ∂x1∂x2 . (11) Like all densities, this joint PDF integrates to one:

−∞

−∞ f(x1, x2) dx1dx2 = 1.

(12)

James G. MacKinnon Economics 850 September, 2020 12 / 26

slide-58
SLIDE 58

Joint Distributions

The joint CDF is related to the joint PDF by F(x1, x2) =

x2

−∞

x1

−∞ f(y1, y2) dy1dy2.

(13)

James G. MacKinnon Economics 850 September, 2020 13 / 26

slide-59
SLIDE 59

Joint Distributions

The joint CDF is related to the joint PDF by F(x1, x2) =

x2

−∞

x1

−∞ f(y1, y2) dy1dy2.

(13) X1 and X2 are said to be independent if F(x1, x2) is the product of the marginal CDFs of x1 and x2: F(x1, x2) = F(x1, ∞)F(∞, x2) = F(x1)F(x2). (14)

James G. MacKinnon Economics 850 September, 2020 13 / 26

slide-60
SLIDE 60

Joint Distributions

The joint CDF is related to the joint PDF by F(x1, x2) =

x2

−∞

x1

−∞ f(y1, y2) dy1dy2.

(13) X1 and X2 are said to be independent if F(x1, x2) is the product of the marginal CDFs of x1 and x2: F(x1, x2) = F(x1, ∞)F(∞, x2) = F(x1)F(x2). (14) The marginal density of x1 is f(x1) =

−∞ f(x1, x2) dx2 = ∂F(x1, ∞)

∂x1 . (15) Thus f(x1) is obtained by integrating x2 out of the joint density.

James G. MacKinnon Economics 850 September, 2020 13 / 26

slide-61
SLIDE 61

Joint Distributions

If x1 and x2 are independent, so that (14) holds, then f(x1, x2) = f(x1) f(x2). (16)

James G. MacKinnon Economics 850 September, 2020 14 / 26

slide-62
SLIDE 62

Joint Distributions

If x1 and x2 are independent, so that (14) holds, then f(x1, x2) = f(x1) f(x2). (16) Suppose that A and B are any two events. Then Pr(A | B) and is defined implicitly by the equation Pr(A ∩ B) = Pr(B) Pr(A | B). (17) Evidently, Pr(B) = 0, since we cannot condition on B when Pr(B) = 0.

James G. MacKinnon Economics 850 September, 2020 14 / 26

slide-63
SLIDE 63

Joint Distributions

If x1 and x2 are independent, so that (14) holds, then f(x1, x2) = f(x1) f(x2). (16) Suppose that A and B are any two events. Then Pr(A | B) and is defined implicitly by the equation Pr(A ∩ B) = Pr(B) Pr(A | B). (17) Evidently, Pr(B) = 0, since we cannot condition on B when Pr(B) = 0. Equation (17) underlies all of Bayesian statistics. Pr(A ∩ B) = Pr(A) Pr(B | A) (18) is just (17) with A and B interchanged.

James G. MacKinnon Economics 850 September, 2020 14 / 26

slide-64
SLIDE 64

Joint Distributions

Equations (17) and (18) imply that Pr(A | B) = Pr(B | A) Pr(A) Pr(B) . (19)

James G. MacKinnon Economics 850 September, 2020 15 / 26

slide-65
SLIDE 65

Joint Distributions

Equations (17) and (18) imply that Pr(A | B) = Pr(B | A) Pr(A) Pr(B) . (19) For Bayesian estimation, the sample y plays the role of B, and the parameter vector θ plays the role of A. Thus we have f(θ| y) = f(y | θ) f(θ) f(y) , (20) where the f(·) denote densities. This is one version of Bayes’ Rule.

James G. MacKinnon Economics 850 September, 2020 15 / 26

slide-66
SLIDE 66

Joint Distributions

Equations (17) and (18) imply that Pr(A | B) = Pr(B | A) Pr(A) Pr(B) . (19) For Bayesian estimation, the sample y plays the role of B, and the parameter vector θ plays the role of A. Thus we have f(θ| y) = f(y | θ) f(θ) f(y) , (20) where the f(·) denote densities. This is one version of Bayes’ Rule. In words, the posterior density is equal to the likelihood times the prior density, divided by the unconditional density of y. If we ignore the denominator, then posterior ∝ prior × likelihood.

James G. MacKinnon Economics 850 September, 2020 15 / 26

slide-67
SLIDE 67

Joint Distributions

The conditional density, or conditional PDF, of x1 for a given value x2 is f(x1 | x2) = f(x1, x2) f(x2) . (21)

James G. MacKinnon Economics 850 September, 2020 16 / 26

slide-68
SLIDE 68

Joint Distributions

The conditional density, or conditional PDF, of x1 for a given value x2 is f(x1 | x2) = f(x1, x2) f(x2) . (21) If we let y denote x1 and x denote x2, then the conditional expectation of y given x is E(y | x) = h(x), (22) where h(x) could be any sort of function. It is a (deterministic) function that gives us E(y) for every possible value of x.

James G. MacKinnon Economics 850 September, 2020 16 / 26

slide-69
SLIDE 69

Joint Distributions

The conditional density, or conditional PDF, of x1 for a given value x2 is f(x1 | x2) = f(x1, x2) f(x2) . (21) If we let y denote x1 and x denote x2, then the conditional expectation of y given x is E(y | x) = h(x), (22) where h(x) could be any sort of function. It is a (deterministic) function that gives us E(y) for every possible value of x. A very simple example is the regression function E(y) = β1 + β2x. (23)

James G. MacKinnon Economics 850 September, 2020 16 / 26

slide-70
SLIDE 70

Joint Distributions

The Law of Iterated Expectations is very useful. It tells us that E

  • E(x1 | x2)

= E(x1). (24) In words, the unconditional expectation of x1 is equal to the expectation (using f(x2)) of the conditional expectation.

James G. MacKinnon Economics 850 September, 2020 17 / 26

slide-71
SLIDE 71

Joint Distributions

The Law of Iterated Expectations is very useful. It tells us that E

  • E(x1 | x2)

= E(x1). (24) In words, the unconditional expectation of x1 is equal to the expectation (using f(x2)) of the conditional expectation. Any deterministic function of a conditioning variable x2 is its own conditional expectation. Thus E(x2 | x2) = x2 and E(x2

2 | x2) = x2 2.

(25)

James G. MacKinnon Economics 850 September, 2020 17 / 26

slide-72
SLIDE 72

Joint Distributions

The Law of Iterated Expectations is very useful. It tells us that E

  • E(x1 | x2)

= E(x1). (24) In words, the unconditional expectation of x1 is equal to the expectation (using f(x2)) of the conditional expectation. Any deterministic function of a conditioning variable x2 is its own conditional expectation. Thus E(x2 | x2) = x2 and E(x2

2 | x2) = x2 2.

(25) Similarly, E

  • x1h(x2) | x2

= h(x2)E(x1 | x2) (26) for any deterministic function h(·).

James G. MacKinnon Economics 850 September, 2020 17 / 26

slide-73
SLIDE 73

Joint Distributions

An important special case arises when E(x1 | x2) = 0. In that case, for any function h(·), E(x1h(x2)) = 0, because E

  • x1h(x2)

= E

  • E(x1h(x2) | x2)
  • = E
  • h(x2)E(x1 | x2)
  • = E

(h(x2)0 = 0. (27)

James G. MacKinnon Economics 850 September, 2020 18 / 26

slide-74
SLIDE 74

Joint Distributions

An important special case arises when E(x1 | x2) = 0. In that case, for any function h(·), E(x1h(x2)) = 0, because E

  • x1h(x2)

= E

  • E(x1h(x2) | x2)
  • = E
  • h(x2)E(x1 | x2)
  • = E

(h(x2)0 = 0. (27) The first two equalities follow from (24) and (26). Since E(x1 | x2) = 0, the third equality then follows immediately.

James G. MacKinnon Economics 850 September, 2020 18 / 26

slide-75
SLIDE 75

The Specification of Regression Models

The Specification of Regression Models

Because E(ui | xi) = 0, E(yi | xi) = β1 + β2xi + E(ui | xi) = β1 + β2xi. (28)

James G. MacKinnon Economics 850 September, 2020 19 / 26

slide-76
SLIDE 76

The Specification of Regression Models

The Specification of Regression Models

Because E(ui | xi) = 0, E(yi | xi) = β1 + β2xi + E(ui | xi) = β1 + β2xi. (28) Suppose that we estimate the model (28) when in fact yi = β1 + β2xi + β3x2

i + vi.

(29)

James G. MacKinnon Economics 850 September, 2020 19 / 26

slide-77
SLIDE 77

The Specification of Regression Models

The Specification of Regression Models

Because E(ui | xi) = 0, E(yi | xi) = β1 + β2xi + E(ui | xi) = β1 + β2xi. (28) Suppose that we estimate the model (28) when in fact yi = β1 + β2xi + β3x2

i + vi.

(29) Then E(ui | xi) = E

  • β3x2

i + vi | xi

= β3x2

i ,

(30) which must be nonzero unless xi = 0.

James G. MacKinnon Economics 850 September, 2020 19 / 26

slide-78
SLIDE 78

The Specification of Regression Models

The Specification of Regression Models

Because E(ui | xi) = 0, E(yi | xi) = β1 + β2xi + E(ui | xi) = β1 + β2xi. (28) Suppose that we estimate the model (28) when in fact yi = β1 + β2xi + β3x2

i + vi.

(29) Then E(ui | xi) = E

  • β3x2

i + vi | xi

= β3x2

i ,

(30) which must be nonzero unless xi = 0. Should the sample be described as t = 1, . . . , T or i = 1, . . . , N?

James G. MacKinnon Economics 850 September, 2020 19 / 26

slide-79
SLIDE 79

The Specification of Regression Models

The Specification of Regression Models

Because E(ui | xi) = 0, E(yi | xi) = β1 + β2xi + E(ui | xi) = β1 + β2xi. (28) Suppose that we estimate the model (28) when in fact yi = β1 + β2xi + β3x2

i + vi.

(29) Then E(ui | xi) = E

  • β3x2

i + vi | xi

= β3x2

i ,

(30) which must be nonzero unless xi = 0. Should the sample be described as t = 1, . . . , T or i = 1, . . . , N? ETM mostly uses t = 1, . . . , n, but my slides will use i = 1, . . . , N except for time series.

James G. MacKinnon Economics 850 September, 2020 19 / 26

slide-80
SLIDE 80

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31)

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-81
SLIDE 81

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31) Exogenous and endogenous variables.

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-82
SLIDE 82

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31) Exogenous and endogenous variables. Disturbances rather than error terms.

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-83
SLIDE 83

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31) Exogenous and endogenous variables. Disturbances rather than error terms. These are often assumed to be independent and identically distributed, or IID.

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-84
SLIDE 84

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31) Exogenous and endogenous variables. Disturbances rather than error terms. These are often assumed to be independent and identically distributed, or IID. Serial correlation can arise when observations are ordered by

  • time. Then E(ut | us) = 0, perhaps only when |t − s| is small.

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-85
SLIDE 85

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31) Exogenous and endogenous variables. Disturbances rather than error terms. These are often assumed to be independent and identically distributed, or IID. Serial correlation can arise when observations are ordered by

  • time. Then E(ut | us) = 0, perhaps only when |t − s| is small.

Heteroskedasticity means that Var(ui) is not constant. It may depend on Xi, or it may depend on lagged values of Var(ui).

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-86
SLIDE 86

The Specification of Regression Models

Information set: The set of potential explanatory variables, denoted Ωi. It is what we condition on. Instead of (28), E(yi | Ωi) = β1 + β2xi. (31) Exogenous and endogenous variables. Disturbances rather than error terms. These are often assumed to be independent and identically distributed, or IID. Serial correlation can arise when observations are ordered by

  • time. Then E(ut | us) = 0, perhaps only when |t − s| is small.

Heteroskedasticity means that Var(ui) is not constant. It may depend on Xi, or it may depend on lagged values of Var(ui). Clustering implies that Cov(ugi ugj) = 0. Here the sample is divided into clusters indexed by g.

James G. MacKinnon Economics 850 September, 2020 20 / 26

slide-87
SLIDE 87

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28):

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-88
SLIDE 88

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-89
SLIDE 89

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-90
SLIDE 90

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification. Obtain the N values xi, i = 1, . . . , N, of the explanatory variable.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-91
SLIDE 91

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification. Obtain the N values xi, i = 1, . . . , N, of the explanatory variable. Evaluate β1 + β2xi for i = 1, . . . , N.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-92
SLIDE 92

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification. Obtain the N values xi, i = 1, . . . , N, of the explanatory variable. Evaluate β1 + β2xi for i = 1, . . . , N. Choose the distribution of the disturbances, if necessary specifying parameters such as mean and variance.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-93
SLIDE 93

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification. Obtain the N values xi, i = 1, . . . , N, of the explanatory variable. Evaluate β1 + β2xi for i = 1, . . . , N. Choose the distribution of the disturbances, if necessary specifying parameters such as mean and variance. Use a random-number generator, or RNG, to generate values

  • f ui.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-94
SLIDE 94

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification. Obtain the N values xi, i = 1, . . . , N, of the explanatory variable. Evaluate β1 + β2xi for i = 1, . . . , N. Choose the distribution of the disturbances, if necessary specifying parameters such as mean and variance. Use a random-number generator, or RNG, to generate values

  • f ui.

Form the simulated values yi by adding the disturbances to the values of the regression function.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-95
SLIDE 95

The Specification of Regression Models

Equation (31), by itself, is not a complete specification. If a model is completely specified, we can simulate it. For the regression model (28): Fix the sample size, N. Choose β1 and β2, the parameters of the deterministic specification. Obtain the N values xi, i = 1, . . . , N, of the explanatory variable. Evaluate β1 + β2xi for i = 1, . . . , N. Choose the distribution of the disturbances, if necessary specifying parameters such as mean and variance. Use a random-number generator, or RNG, to generate values

  • f ui.

Form the simulated values yi by adding the disturbances to the values of the regression function. For a dynamic model like yt = β1 + β2yt−1 + ut, the data need to be generated recursively.

James G. MacKinnon Economics 850 September, 2020 21 / 26

slide-96
SLIDE 96

The Specification of Regression Models

Alternative models for the mean of yi conditional on xi: yi = β1 + β2xi + β3x2

i + ui

(32) yi = γ1 + γ2 log xi + ui (33) yi = δ1 + δ2 1 xi + ui. (34)

James G. MacKinnon Economics 850 September, 2020 22 / 26

slide-97
SLIDE 97

The Specification of Regression Models

Alternative models for the mean of yi conditional on xi: yi = β1 + β2xi + β3x2

i + ui

(32) yi = γ1 + γ2 log xi + ui (33) yi = δ1 + δ2 1 xi + ui. (34) These are all linear models. A nonlinear (but rarely sensible) model is yi = eβ1xβ2

i2 xβ3 i3 + ui.

(35)

James G. MacKinnon Economics 850 September, 2020 22 / 26

slide-98
SLIDE 98

The Specification of Regression Models

Alternative models for the mean of yi conditional on xi: yi = β1 + β2xi + β3x2

i + ui

(32) yi = γ1 + γ2 log xi + ui (33) yi = δ1 + δ2 1 xi + ui. (34) These are all linear models. A nonlinear (but rarely sensible) model is yi = eβ1xβ2

i2 xβ3 i3 + ui.

(35) A better model is yi = eβ1xβ2

i2 xβ3 i3 evi.

(36)

James G. MacKinnon Economics 850 September, 2020 22 / 26

slide-99
SLIDE 99

The Specification of Regression Models

Alternative models for the mean of yi conditional on xi: yi = β1 + β2xi + β3x2

i + ui

(32) yi = γ1 + γ2 log xi + ui (33) yi = δ1 + δ2 1 xi + ui. (34) These are all linear models. A nonlinear (but rarely sensible) model is yi = eβ1xβ2

i2 xβ3 i3 + ui.

(35) A better model is yi = eβ1xβ2

i2 xβ3 i3 evi.

(36) If we take logarithms of both sides, we get log yi = β1 + β2 log xi2 + β3 log xi3 + vi, (37) which is a loglinear regression model.

James G. MacKinnon Economics 850 September, 2020 22 / 26

slide-100
SLIDE 100

Method-of-Moments Estimation

Method-of-Moments Estimation

The method of moments, or MM, replaces population quantities by sample analogs.

James G. MacKinnon Economics 850 September, 2020 23 / 26

slide-101
SLIDE 101

Method-of-Moments Estimation

Method-of-Moments Estimation

The method of moments, or MM, replaces population quantities by sample analogs. Suppose there is just one parameter (β1, the population mean) to

  • estimate. The sample mean of the disturbances is

1 N

N

i=1

ui = 1 N

N

i=1

(yi − β1). (38)

James G. MacKinnon Economics 850 September, 2020 23 / 26

slide-102
SLIDE 102

Method-of-Moments Estimation

Method-of-Moments Estimation

The method of moments, or MM, replaces population quantities by sample analogs. Suppose there is just one parameter (β1, the population mean) to

  • estimate. The sample mean of the disturbances is

1 N

N

i=1

ui = 1 N

N

i=1

(yi − β1). (38) Equating this to 0 yields 1 N

N

i=1

yi − β1 = 0. (39)

James G. MacKinnon Economics 850 September, 2020 23 / 26

slide-103
SLIDE 103

Method-of-Moments Estimation

Method-of-Moments Estimation

The method of moments, or MM, replaces population quantities by sample analogs. Suppose there is just one parameter (β1, the population mean) to

  • estimate. The sample mean of the disturbances is

1 N

N

i=1

ui = 1 N

N

i=1

(yi − β1). (38) Equating this to 0 yields 1 N

N

i=1

yi − β1 = 0. (39) The MM estimate ˆ β1 is just the mean of the observed values: ˆ β1 = 1 N

N

i=1

yi. (40)

James G. MacKinnon Economics 850 September, 2020 23 / 26

slide-104
SLIDE 104

Method-of-Moments Estimation

For a simple linear regression model with two parameters, (39) becomes 1 N

N

i=1

(yi − β1 − β2xi) = 0. (41) We need one more equation to solve for ˆ β1 and ˆ β2.

James G. MacKinnon Economics 850 September, 2020 24 / 26

slide-105
SLIDE 105

Method-of-Moments Estimation

For a simple linear regression model with two parameters, (39) becomes 1 N

N

i=1

(yi − β1 − β2xi) = 0. (41) We need one more equation to solve for ˆ β1 and ˆ β2. We use the fact that E(ui | xi) = 0. By the law of iterated expectations, E(xiui) = E

  • E(xiui | xi)

= E

  • xiE(ui | xi)

= 0. (42)

James G. MacKinnon Economics 850 September, 2020 24 / 26

slide-106
SLIDE 106

Method-of-Moments Estimation

For a simple linear regression model with two parameters, (39) becomes 1 N

N

i=1

(yi − β1 − β2xi) = 0. (41) We need one more equation to solve for ˆ β1 and ˆ β2. We use the fact that E(ui | xi) = 0. By the law of iterated expectations, E(xiui) = E

  • E(xiui | xi)

= E

  • xiE(ui | xi)

= 0. (42) Thus we can supplement (41) by the following equation: 1 N

N

i=1

xi(yi − β1 − β2xi) = 0. (43)

James G. MacKinnon Economics 850 September, 2020 24 / 26

slide-107
SLIDE 107

Method-of-Moments Estimation

Equations (41) and (43) can be written as β1 + 1 N

N

i=1

xi

  • β2 = 1

N

N

i=1

yi (44) 1 N

N

i=1

xi

  • β1 +

1 N

N

i=1

x2

i

  • β2 = 1

N

N

i=1

xiyi. (45)

James G. MacKinnon Economics 850 September, 2020 25 / 26

slide-108
SLIDE 108

Method-of-Moments Estimation

Equations (41) and (43) can be written as β1 + 1 N

N

i=1

xi

  • β2 = 1

N

N

i=1

yi (44) 1 N

N

i=1

xi

  • β1 +

1 N

N

i=1

x2

i

  • β2 = 1

N

N

i=1

xiyi. (45) After multiplication by N, these equations become

  • N

∑N

i=1 xi

∑N

i=1 xi

∑N

i=1 x2 i

β1 β2

  • =
  • ∑N

i=1 yi

∑N

i=1 xiyi

  • .

(46)

James G. MacKinnon Economics 850 September, 2020 25 / 26

slide-109
SLIDE 109

Method-of-Moments Estimation

Equations (41) and (43) can be written as β1 + 1 N

N

i=1

xi

  • β2 = 1

N

N

i=1

yi (44) 1 N

N

i=1

xi

  • β1 +

1 N

N

i=1

x2

i

  • β2 = 1

N

N

i=1

xiyi. (45) After multiplication by N, these equations become

  • N

∑N

i=1 xi

∑N

i=1 xi

∑N

i=1 x2 i

β1 β2

  • =
  • ∑N

i=1 yi

∑N

i=1 xiyi

  • .

(46) But (46) is just a special case of X⊤Xβ = X⊤y. (47)

James G. MacKinnon Economics 850 September, 2020 25 / 26

slide-110
SLIDE 110

Method-of-Moments Estimation

Thus we obtain the famous formula for the ordinary least squares, or OLS, estimator: ˆ β = (X⊤X)−1X⊤y. (48)

James G. MacKinnon Economics 850 September, 2020 26 / 26

slide-111
SLIDE 111

Method-of-Moments Estimation

Thus we obtain the famous formula for the ordinary least squares, or OLS, estimator: ˆ β = (X⊤X)−1X⊤y. (48) In general, of course, there are k moment conditions, one for each regressor: X⊤(y − Xβ) = 0. (49) Here we treat the constant term as a column of 1s within X.

James G. MacKinnon Economics 850 September, 2020 26 / 26

slide-112
SLIDE 112

Method-of-Moments Estimation

Thus we obtain the famous formula for the ordinary least squares, or OLS, estimator: ˆ β = (X⊤X)−1X⊤y. (48) In general, of course, there are k moment conditions, one for each regressor: X⊤(y − Xβ) = 0. (49) Here we treat the constant term as a column of 1s within X. We could also obtain ˆ β by minimizing the sum of squared residuals SSR(β) = (y − Xβ)⊤(y − Xβ) =

N

i=1

(yi − Xiβ)2. (50)

James G. MacKinnon Economics 850 September, 2020 26 / 26

slide-113
SLIDE 113

Method-of-Moments Estimation

Thus we obtain the famous formula for the ordinary least squares, or OLS, estimator: ˆ β = (X⊤X)−1X⊤y. (48) In general, of course, there are k moment conditions, one for each regressor: X⊤(y − Xβ) = 0. (49) Here we treat the constant term as a column of 1s within X. We could also obtain ˆ β by minimizing the sum of squared residuals SSR(β) = (y − Xβ)⊤(y − Xβ) =

N

i=1

(yi − Xiβ)2. (50) The first-order conditions are −2X⊤(y − Xβ) = 0. (51)

James G. MacKinnon Economics 850 September, 2020 26 / 26