

SLIDE 1

The BLP Method of Demand Curve Estimation in Industrial Organization

9 March 2006 Eric Rasmusen 1

SLIDE 2

IDEAS USED

1. Instrumental variables. We use instruments to correct for the endogeneity of prices, the classic problem in estimating supply and demand.

2. Product characteristics. We look at the effect of characteristics on demand, and then build up to products that have particular levels of the characteristics. Going from 50 products to 6 characteristics drastically reduces the number of parameters to be estimated.

3. Consumer and product characteristics interact. This is what is going on when consumer marginal utilities are allowed to depend on consumer characteristics. It makes the pattern of consumers substituting from one product to another more sensible.

4. Structural estimation. We do not just look at conditional correlations of the relevant variables with a disturbance term tacked on to account for the imperfect fit of the regression equation. Instead, we start with a model in which individuals maximize their payoffs by choice of actions, and the model itself includes the disturbance term that will later show up in the regression.

5. The contraction mapping. A contraction mapping is used to estimate the parameters that are averaged across consumers, an otherwise difficult optimization problem.

6. Separating linear and nonlinear estimation problems. The estimation is divided into one part that uses a search algorithm to numerically estimate the parameters that enter nonlinearly, and a second part that uses an analytic formula to estimate the parameters that enter linearly.

7. The method of moments. The generalized method of moments is used to estimate the other parameters.

SLIDE 3

The Generalized Method of Moments

Suppose we want to estimate

y = x1β1 + x2β2 + ε,  (1)

where we observe y, x1, and x2, but not ε, though we know that ε has a mean of zero. We assume that the x's and the unobservable disturbances ε are uncorrelated: the two "moment conditions," which we can write as

M1 : E(x1′ε) = 0,
M2 : E(x2′ε) = 0,  (2)

or

EM1 = 0, EM2 = 0.  (3)

Note that x1 is a T × 1 vector, but M1 = x1′ε is 1 × 1.

The sum of squares of the moment expressions (the M's that equal zero in the moment conditions) is

(M1 M2)′(M1 M2).  (4)

Think of M1 as a random variable, made up from the T random variables ε in the T observations. The expected value of M1 is zero, by assumption, but in our sample its realization might be positive or negative, because its variance is not zero. The vector (M1 M2) is 2 × 1, so the sum of squared moments is 1 × 1.

SLIDE 4

Another way to write the problem is to choose β̂ to minimize M′M. If M = X′ε, we will find

β̂ = argmin_β ε̂′XX′ε̂.  (5)

Thus, we minimize the function f(β̂):

f(β̂) = ε̂′XX′ε̂ = (y − Xβ̂)′XX′(y − Xβ̂)
     = y′XX′y − β̂′X′XX′y − y′XX′Xβ̂ + β̂′X′XX′Xβ̂.  (6)

We can differentiate this with respect to β̂ to get the first-order condition

f′(β̂) = −2X′XX′y + 2X′XX′Xβ̂ = 0
       = 2X′X(−X′y + X′Xβ̂) = 0,  (7)

in which case

β̂ = (X′X)⁻¹X′y  (8)

and we have the OLS estimator.
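The claim can be checked numerically. Below is a minimal sketch with simulated data: the closed-form OLS estimator (8) is computed, and the first-order condition (7) is verified to hold at it. All variable names and data are illustrative, not from the slides.

```python
import numpy as np

# A minimal sketch of slide 4's claim: with M = X'e, minimizing M'M in beta
# leads back to the OLS formula (8). All data here are simulated.
rng = np.random.default_rng(0)
T = 200
X = np.column_stack([rng.normal(size=T), rng.normal(size=T)])  # columns x1, x2
beta_true = np.array([1.5, -0.7])
y = X @ beta_true + rng.normal(size=T)

# The closed-form minimizer of (y - Xb)'XX'(y - Xb): beta_hat = (X'X)^{-1}X'y.
beta_ols = np.linalg.solve(X.T @ X, X.T @ y)

# Check the first-order condition (7): 2X'X(-X'y + X'X b) = 0 at beta_ols.
grad = 2 * X.T @ X @ (X.T @ X @ beta_ols - X.T @ y)
print(np.max(np.abs(grad)))  # numerically zero
```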

SLIDE 5

We might also know that the x's and the disturbances are independent:

E(ε|x1, x2) = 0.  (9)

We want to use all available information, for efficient estimation, so we would like to use that independence information. It will turn out to be useful information if the variance of ε depends on X, though not otherwise.

Independence gives us lots of other potential moment conditions. Here are a couple:

E((x1²)′ε) = E(M3) = 0,
E((x2 ∗ x1)′ε) = E(M4) = 0.  (10)

Some of these conditions are more reliable than others, so we'd like to weight them when we use them. Since M3 and M4 are random variables, they have variances. So let's weight them by the inverse of their variances; more precisely, by the inverse of their variance-covariance matrix, since they also have cross-correlations. Call the variance-covariance matrix of all the moment conditions Φ(M). We can estimate that matrix consistently by running a preliminary consistent regression such as OLS and making use of the residuals. This weighting scheme has been shown to be optimal (see Hansen [1982]).

We minimize the weighted square of the moment conditions by choice of the parameters β̂:

(M1 M2 M3 M4)′(Φ(M))⁻¹(M1 M2 M3 M4).  (11)
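The recipe above can be sketched in a few lines. This is an illustration with simulated data, not the slides' application: the four moments of (10) are built from preliminary OLS residuals, Φ(M) is estimated with the per-observation outer-product estimator (one standard choice, assumed here), and the weighted objective (11) is formed.

```python
import numpy as np

# A minimal sketch of slide 5's weighting idea: build the four moments of
# (10) from preliminary OLS residuals, and weight by the inverse of their
# estimated variance-covariance matrix Phi(M). Simulated data; the
# per-observation outer-product estimate of Phi(M) is an assumed choice.
rng = np.random.default_rng(7)
T = 1000
x1, x2 = rng.normal(size=T), rng.normal(size=T)
eps = rng.normal(size=T) * (1 + 0.5 * x1 ** 2)   # heteroskedastic disturbance
y = 1.0 * x1 + 2.0 * x2 + eps

X = np.column_stack([x1, x2])
beta_ols = np.linalg.solve(X.T @ X, X.T @ y)     # preliminary consistent fit
e = y - X @ beta_ols                             # residuals

G = np.column_stack([x1, x2, x1 ** 2, x2 * x1])  # variables behind M1..M4
g = G * e[:, None]                               # per-observation moment terms
M = g.mean(axis=0)                               # sample moment vector
Phi = g.T @ g / T                                # estimated Var-Cov of moments
weighted = M @ np.linalg.inv(Phi) @ M            # the objective in (11)
print(weighted)                                  # nonnegative scalar objective
```

Note that M1 and M2 are exactly zero at the OLS estimate, by the normal equations; only the extra moments M3 and M4 contribute to the objective here.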

SLIDE 6

The weighting matrix is crucial. OLS uses the most obviously useful information. We can throw in lots and lots of other moment conditions using the independence assumption, but they will contain less and less new information. Adding extra information is always good in itself, but in finite samples the new "information," the result of random chance, could well cause more harm than good.

In such a case, we wouldn't want to weight the less important moment conditions, which might have higher variance, as much as the basic exogeneity ones. Consider the moment condition M5:

E((x2³ ∗ x1⁵)′ε) = EM5 = 0.  (12)

That moment condition doesn't add a lot of information, and it could have a big variance not reflected in the consistent estimate of Φ(M) that we happen to obtain from our finite sample.

We have now gotten something like generalized least squares, GLS, from the generalized method of moments. I did not demonstrate it, but Φ(M) will turn out to be an estimate of the variance-covariance matrix of ε. It is not the same as other estimates used in GLS, because it depends on exactly which moment conditions are used, but it is consistent. We have a correction for heteroskedasticity, which is something we need for estimation of the BLP problem.

Notice that this means that GMM can be useful even though:
(a) this is a linear estimation problem, not a nonlinear one;
(b) no explanatory variables are endogenous, so this is not an instrumental variables problem.

SLIDE 7

(Hall 1996) Suppose one of our basic moment conditions fails: E(x2′ε) ≠ 0, because x2 is endogenous, and we have lost our moment conditions M2 and M4. What we need is a new basic moment condition that will enable us to estimate β2; that is, we need an instrument correlated with x2 but not with ε.

Suppose we do have a number of such conditions, a set of variables z1 and z2. We can use our old conditions M1 and M3, and we'll add a couple of others too, ending up with this set:

E(x1′ε) = 0,
E((x1²)′ε) = 0,
E(z1′ε) = 0,  (13)

and

E(z2′ε) = 0,
E((z1 ∗ x1)′ε) = 0,
E((z1 ∗ z2)′ε) = 0.  (14)

We will abbreviate these six moment conditions as

E(Z′ε) = E(M) = 0,  (15)

where the matrix Z includes separate columns for the original variable x1, its square x1², the simple instruments z1 and z2, and the interaction instruments z1 ∗ x1 and z1 ∗ z2.

Let's suppose also, for the moment, that we have the ex ante information that the disturbances are independent of each other and of Z, so there is no heteroskedasticity. Then the weighting matrix is

Φ(M) = Var(M) = Var(Z′ε) = E(Z′εε′Z) − E(Z′ε)E(ε′Z) = E(Z′(Iσ²)Z) − 0 = σ²Z′Z.  (16)

SLIDE 8

The GMM estimator solves the problem of choosing the parameters β̂2SLS to minimize

f(β̂2SLS) = ε̂2SLS′Z(σ²Z′Z)⁻¹Z′ε̂2SLS = (y − Xβ̂2SLS)′Z(σ²Z′Z)⁻¹Z′(y − Xβ̂2SLS).  (17)

We can differentiate this with respect to β̂2SLS to get the first-order condition

f′(β̂2SLS) = −2X′Z(σ²Z′Z)⁻¹Z′(y − Xβ̂2SLS) = 0,  (18)

which solves to

β̂2SLS = [X′Z(Z′Z)⁻¹Z′X]⁻¹X′Z(Z′Z)⁻¹Z′y.  (19)

(Note that the σ² cancels out.) This estimator is both the GMM estimator and the 2SLS (two-stage least squares) estimator.
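Formula (19) can be sketched directly. The data below are simulated for illustration: x2 is made endogenous through a shock shared with the disturbance, z1 and z2 are valid instruments, and the exogenous x1 serves as its own instrument.

```python
import numpy as np

# A minimal sketch of estimator (19) on simulated data. x2 is endogenous
# because it shares a shock with the disturbance; z1, z2 are instruments.
rng = np.random.default_rng(1)
T = 5000
z1, z2 = rng.normal(size=T), rng.normal(size=T)
shock = rng.normal(size=T)
x1 = rng.normal(size=T)
x2 = z1 + z2 + shock                         # endogenous regressor
eps = shock + rng.normal(size=T)             # correlated with x2, not x1
y = 1.0 * x1 + 2.0 * x2 + eps

X = np.column_stack([x1, x2])
Z = np.column_stack([x1, z1, z2])

# beta = [X'Z (Z'Z)^{-1} Z'X]^{-1} X'Z (Z'Z)^{-1} Z'y, equation (19);
# the sigma^2 in (17) cancels out of the solution.
A = X.T @ Z @ np.linalg.solve(Z.T @ Z, Z.T)
beta_2sls = np.linalg.solve(A @ X, A @ y)
print(beta_2sls)  # near the true (1.0, 2.0); OLS would overstate the x2 term
```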

SLIDE 9

GMM and 2SLS are equivalent when the disturbances are independently distributed, though if there were heteroskedasticity they would become different, because GMM would use the weighting matrix (Φ(M))⁻¹, which would not be the same as (Z′Z)⁻¹. 2SLS could be improved upon with heteroskedasticity corrections, however, in the same way as OLS can be improved.

Notice that this is the 2SLS estimator, rather than the simpler IV estimator that is computed directly as

β̂IV = [Z′X]⁻¹Z′y.  (20)

Two-stage least squares and IV are the same if the number of instruments is the same as the number of parameters to be estimated, but otherwise the formula in (20) cannot be used, because when X is T × J and Z is T × K, Z′X is K × J, which is not square and cannot be inverted. What 2SLS is doing differently from IV is projecting X onto Z with the projection matrix Z(Z′Z)⁻¹Z′ to generate a square matrix that can be inverted. GMM does something similar, but with the weighting matrix (Φ(M))⁻¹ in place of (Z′Z)⁻¹.
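The exactly-identified case can be checked numerically. This sketch uses simulated data with two instruments for two parameters (K = J), and verifies that the simple IV formula and the 2SLS formula (19) coincide.

```python
import numpy as np

# A check of slide 9's claim: with as many instruments as parameters
# (K = J), the simple IV estimator (Z'X)^{-1} Z'y equals 2SLS.
# Simulated data for illustration only.
rng = np.random.default_rng(2)
T = 1000
Z = rng.normal(size=(T, 2))                            # two instruments
X = Z @ np.array([[1.0, 0.3], [0.2, 1.0]]) + rng.normal(size=(T, 2))
y = X @ np.array([0.5, -1.0]) + rng.normal(size=T)

beta_iv = np.linalg.solve(Z.T @ X, Z.T @ y)            # simple IV, K = J

P = Z @ np.linalg.inv(Z.T @ Z) @ Z.T                   # projection onto Z
beta_2sls = np.linalg.solve(X.T @ P @ X, X.T @ P @ y)  # equation (19)
print(np.max(np.abs(beta_iv - beta_2sls)))             # numerically zero
```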

SLIDE 10

We have so far solved for β̂ analytically, but that is not an essential part of GMM. The parameters β might enter the problem nonlinearly, in which case minimizing the moment expression could be done using some kind of search algorithm. For example, suppose our theory is that

y = x1^β1 + β1β2x2 + ε,  (21)

and our moment conditions are

M1 = E(x1′ε) = 0,
M2 = E(x2′ε) = 0,
M3 = E((x1 ∗ x2)′ε) = 0.  (22)

We could then search over values of β1 and β2 to minimize the moment expression

M̂′(Φ(M))⁻¹M̂, where M̂ = Z′(y − x1^β1 − β1β2x2) and Z = (x1 x2 x1 ∗ x2),  (23)

and where we would have to also estimate Φ(M) during some part of the search.
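The search in (23) can be sketched on simulated data. A plain grid search with an identity weight stands in here for a proper search algorithm and for the consistent estimate of Φ(M); those choices are simplifications for illustration, not part of the slides' method.

```python
import numpy as np

# A minimal sketch of the nonlinear case (21)-(23): the parameters enter
# nonlinearly, so the moment expression is minimized by search rather than
# by a formula. Simulated data; a grid search with identity weight stands
# in for a proper search algorithm and for Phi(M)^{-1}.
rng = np.random.default_rng(3)
T = 2000
x1 = rng.uniform(0.5, 2.0, size=T)
x2 = rng.normal(size=T)
y = x1 ** 1.3 + 1.3 * 0.8 * x2 + rng.normal(scale=0.1, size=T)  # true (1.3, 0.8)

Z = np.column_stack([x1, x2, x1 * x2])       # the three moments in (22)

def objective(b1, b2):
    resid = y - x1 ** b1 - b1 * b2 * x2      # residual implied by (21)
    M = Z.T @ resid / T                      # sample moment vector
    return M @ M                             # identity-weighted M'M

grid1 = np.linspace(0.8, 1.8, 101)
grid2 = np.linspace(0.3, 1.3, 101)
b1_hat, b2_hat = min((objective(a, b), (a, b)) for a in grid1 for b in grid2)[1]
print(b1_hat, b2_hat)  # near the true values 1.3 and 0.8
```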

SLIDE 11

Returning to random-coefficients logit, if our assumption on the population is that¹

E(zm′ω(θ∗)) = 0, m = 1, . . . , M,  (8N)

then the GMM estimator is

θ̂ = argmin_θ ω(θ)′ZΦ⁻¹Z′ω(θ),  (9N)

where Φ is a consistent estimator of E(Z′εε′Z).

The method of moments, like ordinary least squares but unlike maximum likelihood, does not require us to know the distribution of the disturbances. In our demand estimation, though, we will still have to use the assumption that the εijt follow the extreme-value distribution, because we need it to calculate the market shares aggregated across consumer types, whether by plain logit or random-coefficients logit.

¹I think there is a typo in Nevo here, on page 531: zm should replace Zm in equation (8N).

SLIDE 12

Combining Logit and GMM

Before starting the estimation, one must find instruments Z for any endogenous x's.

(-1) Select arbitrary values for (δ, Π, Σ) as a starting point. Recall that δ is a vector of the mean utility from each of the products, and that (Π, Σ) are the parameters showing how observed and unobserved consumer characteristics and product characteristics interact to generate utility.

(0) Draw random values for (νi, Di) for i = 1, . . . , ns from the distributions P∗ν(ν) and P̂∗D(D) for a sample of size ns; the bigger you pick ns, the more accurate your estimate will be.

(1) Using the starting values and the random values, and using the assumption that the εijt follow the extreme-value distribution, approximate the integral for market share that results from aggregating across i by the following "smooth simulator":

sjt = (1/ns) Σ_{i=1}^{ns} sijt
    = (1/ns) Σ_{i=1}^{ns} e^[δjt + Σ_{k=1}^{6} xjt^k(σk νi^k + πk1 Di1 + · · · + πk4 Di4)] / (1 + Σ_{m=1}^{50} e^[δmt + Σ_{k=1}^{6} xmt^k(σk νi^k + πk1 Di1 + · · · + πk4 Di4)]),  (11N)

where (νi¹, . . . , νi⁶) and (Di1, . . . , Di4) for i = 1, . . . , ns are the random draws from the previous step.

Thus, in step (1) we obtain predicted market shares for given values of the individual consumer parameters (Π, Σ) and for given values of the mean utilities δ.
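The smooth simulator (11N) is a short computation once the draws are in hand. The sketch below uses the slide's dimensions (6 characteristics, 4 demographics, 50 products) but simulates every input; the variable names sigma, pi, nu, and Dem are stand-ins for the slide's σ, π, νi, and Di.

```python
import numpy as np

# A minimal sketch of the smooth simulator (11N) for one market t:
# average the logit shares over ns random consumer draws. Dimensions
# follow the slide (6 characteristics, 4 demographics, 50 products);
# all inputs here are simulated placeholders.
rng = np.random.default_rng(4)
J, K, D, ns = 50, 6, 4, 500
x = rng.normal(size=(J, K))                   # product characteristics x_jt
delta = rng.normal(scale=0.5, size=J)         # mean utilities delta_jt
sigma = np.abs(rng.normal(scale=0.2, size=K)) # sigma_k
pi = rng.normal(scale=0.1, size=(K, D))       # pi_kd
nu = rng.normal(size=(ns, K))                 # draws nu_i from P*_nu
Dem = rng.normal(size=(ns, D))                # draws D_i from P-hat*_D

# mu[i, j] = sum_k x_jt^k (sigma_k nu_i^k + pi_k1 D_i1 + ... + pi_k4 D_i4)
mu = (sigma * nu + Dem @ pi.T) @ x.T
num = np.exp(delta + mu)                      # numerator of s_ijt
s_i = num / (1.0 + num.sum(axis=1, keepdims=True))  # logit share per draw
s = s_i.mean(axis=0)                          # average over draws: s_jt
print(s.sum())  # inside shares sum to less than one (outside good remains)
```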

SLIDE 13

(2) Use the following contraction mapping, which, a bit surprisingly, converges. Keeping (Π, Σ) fixed at their starting points, find values of δ by the following iterative process:

δ·t^(h+1) = δ·t^h + (ln(S·t) − ln(s·t)),  (12N)

where S·t is the observed market share and s·t is the predicted market share from step (1), computed using δ·t^h. Start with the arbitrary δ0 of step (-1). If the observed and predicted market shares are equal, then δ·t^(h+1) = δ·t^h and the series has converged. In practice, keep iterating until (ln(S·t) − ln(s·t)) is small enough for you to be satisfied with its accuracy.

(2.5) Pick some starting values for (α, β), the parameters common to all consumers.

(3a) Start to figure out the value of the moment expression, using the starting values and your δ estimate. First, calculate the error term ωjt:

ωjt = δjt − (αpjt + xjtβ).  (13N)

(3b) Second, calculate the value of the moment expression,

ω′ZΦ⁻¹Z′ω.  (24)

You need a weighting matrix Φ⁻¹ to do this, which is supposed to be

Φ⁻¹ = (E(Z′ωω′Z))⁻¹.  (25)

In estimation we use a consistent estimator of Φ⁻¹. Until step (4c), just use Φ⁻¹ = (Z′Z)⁻¹ as a starting point.
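The iteration (12N) can be sketched for a single market. To keep the illustration short, the predicted shares below come from plain logit rather than the full random-coefficients simulator of step (1); the update rule itself is unchanged, and all inputs are simulated.

```python
import numpy as np

# A minimal sketch of the contraction (12N) for one market. Plain logit
# shares stand in for the random-coefficients simulator of step (1);
# the iterative update is exactly the one on the slide.
rng = np.random.default_rng(5)
J = 10
delta_true = rng.normal(scale=0.5, size=J)

def shares(delta):
    num = np.exp(delta)
    return num / (1.0 + num.sum())            # logit shares with outside good

S = shares(delta_true)                        # "observed" market shares

delta = np.zeros(J)                           # arbitrary delta^0, step (-1)
for _ in range(2000):
    step = np.log(S) - np.log(shares(delta))  # ln(S) - ln(s(delta^h))
    delta = delta + step                      # delta^{h+1}, equation (12N)
    if np.max(np.abs(step)) < 1e-12:          # "small enough" stopping rule
        break

print(np.max(np.abs(delta - delta_true)))     # the mapping recovers delta
```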

SLIDE 14

(4a) Find an estimate of the parameters that are common to all consumers, (α, β), using the linear GMM estimator:

(α̂, β̂) = (X′ZΦ⁻¹Z′X)⁻¹X′ZΦ⁻¹Z′δ.  (26)

Note that this is a linear estimator that can be found by multiplying various matrices, without any need for a minimization search algorithm. Separating out the parameters that can be estimated linearly from the parameters that require a search algorithm is why we use all these steps instead of simply setting up the moment expression and then using a minimization algorithm to find parameter values that minimize it. Searching takes the computer a lot longer than multiplying matrices, and it is less reliable in finding the true minimum, or, indeed, in converging to any solution at all.

(4b) Find the value of the moment expression, (24).

(4c) Estimate the weighting matrix Φ = Z′ω̂ω̂′Z using the ω̂ found by applying the new estimates (α̂, β̂) to equation (13N):

ω̂jt = δjt − (α̂pjt + xjtβ̂).  (27)

(4d) Use a search algorithm to find new values for (Π, Σ). Take the new values and return to step (1). Keep iterating, searching for parameter estimates that minimize the moment expression (24), until the value of the moment expression is close enough to zero.

Nevo notes that you could then iterate between estimating the parameters (step 4a) and estimating the weighting matrix (step 4c). Both methods are consistent, and neither has more attractive theoretical properties, so it is acceptable to skip step (4c) after the first iteration.
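Step (4a) is pure matrix algebra, as the sketch below shows with the standard linear GMM formula (α̂, β̂) = (X′ZΦ⁻¹Z′X)⁻¹X′ZΦ⁻¹Z′δ and the starting weight Φ⁻¹ = (Z′Z)⁻¹ from step (3b). Everything is simulated, and for simplicity price is exogenous in the fake data, which the real application would not assume.

```python
import numpy as np

# A minimal sketch of step (4a): the linear GMM formula
# (alpha, beta) = (X'Z Phi^{-1} Z'X)^{-1} X'Z Phi^{-1} Z' delta,
# with the starting weight Phi^{-1} = (Z'Z)^{-1}. All data are simulated,
# and price is exogenous here purely for illustration.
rng = np.random.default_rng(6)
n = 400
p = rng.uniform(1, 3, size=n)                 # prices p_jt
x = rng.normal(size=(n, 2))                   # characteristics x_jt
X = np.column_stack([p, x])
theta_true = np.array([-2.0, 1.0, 0.5])       # (alpha, beta1, beta2)
delta = X @ theta_true + rng.normal(scale=0.1, size=n)  # mean utilities + omega

Z = np.column_stack([p, x, p * x[:, 0]])      # instrument matrix (assumed)
Phi_inv = np.linalg.inv(Z.T @ Z)              # starting weight from step (3b)

A = X.T @ Z @ Phi_inv @ Z.T                   # X'Z Phi^{-1} Z'
theta_hat = np.linalg.solve(A @ X, A @ delta) # no search needed: just algebra
print(theta_hat)  # near (-2.0, 1.0, 0.5)
```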
