Estimating Single-Agent Dynamic Models Paul T. Scott New York - PowerPoint PPT Presentation

Estimating Single-Agent Dynamic Models Paul T. Scott New York University PhD Empirical IO Fall 2017 1 / 34

Introduction Why dynamic estimation? External validity ◮ Famous example: Hendel and Nevo’s (2006) estimation of laundry detergent demand 2 / 34

Introduction Why dynamic estimation? External validity ◮ Famous example: Hendel and Nevo’s (2006) estimation of laundry detergent demand ◮ The long-run demand elasticity for laundry detergent might be zero (or very close) 2 / 34

Introduction Why dynamic estimation? External validity ◮ Famous example: Hendel and Nevo’s (2006) estimation of laundry detergent demand ◮ The long-run demand elasticity for laundry detergent might be zero (or very close) ◮ If detergent goes on sale periodically, we might see a nonzero short-run elasticity (perhaps even a large one) as customers might purchase during the sales and store the detergent. 2 / 34

Introduction Why dynamic estimation? External validity ◮ Famous example: Hendel and Nevo’s (2006) estimation of laundry detergent demand ◮ The long-run demand elasticity for laundry detergent might be zero (or very close) ◮ If detergent goes on sale periodically, we might see a nonzero short-run elasticity (perhaps even a large one) as customers might purchase during the sales and store the detergent. ◮ Dynamic estimation typically involves estimating the primitives of decision makers’ objective functions. We might estimate the model using short-run variation, but once we know the decision maker’s objective function, we could simulate a response to long-run variation. 2 / 34

Introduction Why are dynamics difficult? ◮ The computational burden of solving dynamic problems blows up as the state space gets large. With standard dynamic estimation techniques, this is especially problematic, for estimation may involve solving the dynamic problem many times. ◮ Serially correlated unobservables and unobserved heterogeneity (easy to confuse with state dependence) ◮ Modeling expectations ◮ Solving for equilibria, multiplicity (dynamic games) 3 / 34

Introduction Outline ◮ Introduction to dynamic estimation: Rust (1987) ◮ Conditional choice probabilities: Hotz and Miller (1993) ◮ Euler equation estimation: Scott (2014) 4 / 34

Rust (1987) and NFP estimation "Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher" John Rust (1987) 5 / 34

Rust (1987) and NFP estimation The “application” ◮ The decision maker decides whether replace bus engines or not, minimizing expected discounted cost ◮ The trade-off: engine replacement is costly, but with increased use, the probability of a very costly breakdown increases ◮ Single agent setting: prices are exogenous, no externalities across buses 6 / 34

Rust (1987) and NFP estimation Model, part I ◮ state variable: x t is the bus engine’s mileage ◮ For computational reasons, Rust discretizes the state space into 90 intervals. ◮ Action i t ∈ { 0 , 1 } , where ◮ i t = 1 - replace the engine, ◮ i t = 0 - keep the engine and perform normal maintenance. 7 / 34

Rust (1987) and NFP estimation Model, part II ◮ per-period profit function: � − c ( x t , θ 1 ) + ε t (0) if i t = 0 π ( i t , x t , θ 1 ) = − ( RC − c (0 , θ 1 )) + ε t (1) if i t = 1 where ◮ c ( x t , θ 1 ) - regular maintenance costs (including expected breakdown costs), ◮ RC - the net costs of replacing an engine, ◮ ε - payoff shocks. ◮ x t is observable to both agent and econometrician, but ε is only observable to the agent. ◮ ε is necessary for a coherent model, for sometimes we observe the agent making different decisions for the same value of x . 8 / 34

Rust (1987) and NFP estimation Model, part III ◮ Can define value function using Bellman equation: V θ ( x t , ε t ) = max [ π ( i , x t , θ ) + β EV θ ( x t , ε t , i t )] i where ˆ EV θ ( x t , ε t , i t ) = V θ ( y , η ) p ( dy , d η | x t , ε t , i t , θ 2 , θ 3 ) 9 / 34

Rust (1987) and NFP estimation Parameters ◮ θ 1 - parameters of cost function ◮ θ 2 - parameters of distribution of ε (these will be assumed/normalized away) ◮ θ 3 - parameters of x -state transition function ◮ RC - replacement cost ◮ discount factor β will be imputed (more on this later) 10 / 34

Rust (1987) and NFP estimation Conditional Independence Conditional Independence Assumption The transition density of the controlled process { x t , ε t } factors as: p ( x t +1 , ε t +1 | x t , ε t , i t , θ 2 , θ 3 ) = q ( ε t +1 | x t +1 , θ 2 ) p ( x t +1 | x t , i t , θ 3 ) ◮ CI assumption is very powerful: it means we don’t have to treat ε t as a state variable, which would be very difficult since it’s unobserved. ◮ While it is possible to allow the distribution of ε t +1 to depend on x t +1 , authors (including Rust) typically assume that any conditionally independent error terms are also identically distributed over time. 11 / 34

Rust (1987) and NFP estimation Theorem 1 preview ◮ Assumption CI has two powerful implications: ◮ We can write EV θ ( x t , i t ) instead of EV θ ( x t , ε t , i t ), ◮ We can consider a Bellman equation for V θ ( x t ), which is computationally simpler than the Bellman equation for V θ ( x t , ε t ). 12 / 34

Rust (1987) and NFP estimation Theorem 1 Theorem 1 Given CI, ∂ P ( i | x , θ ) = ∂π ( x , i , θ 1 ) W ( π ( x , θ 1 ) + β EV θ ( x ) | x , θ 2 ) and EV θ is the unique fixed point of the contraction mapping: ˆ EV θ ( x , i ) = W ( π ( y , θ 1 ) + β EV θ ( y ) | y , θ 2 ) p ( dy | x , i , θ 3 ) y where ◮ P ( i | x , θ ) is the probability of action i conditional on state x ◮ W ( ·| x , θ 2 ) is the surplus function: ˆ W ( v | x , θ 2 ) ≡ max [ v ( i ) + ε ( i )] q ( d ε | x , θ 2 ) i ε 13 / 34

Rust (1987) and NFP estimation Theorem 1 example: logit shocks ◮ v θ ( x , i ) ≡ π ( x , i , θ 1 ) + β EV θ ( x , i ) – the conditional value function . ◮ Suppose that ε ( i ) is distributed independenly across i with Pr ( ε ( i ) ≤ ε 0 ) = e − e − ε 0 – logit shocks. Then, max i [ v ( x , i ) + ε ( i )] � i e − ε ( i ) e − e − ε ( i ) d ε ´ W ( v ( x )) = ln ( � = i exp ( v ( x , i ))) + γ where γ ≈ . 577216 is Euler’s gamma. ◮ It is then easy to derive expressions for conditional choice probabilities: exp ( v θ ( x , i )) P ( i | x , θ ) = � i ′ exp ( v θ ( x , i ′ )) ◮ The conditional value function plays the same role as a static utility function when computing choice probabilities. 14 / 34

Rust (1987) and NFP estimation Some details ◮ He assumes ε is i.i.d with an extreme value type 1 distribution, and normalizes its mean to 0 and variance to π 2 / 6 (i.e., the case on the previous slide). ◮ Transitions on observable state: p ( x t +1 − x t = 0 | , x t , i t , θ 3 ) = θ 30 p ( x t +1 − x t = 1 | , x t , i t , θ 3 ) = θ 31 p ( x t +1 − x t = 2 | , x t , i t , θ 3 ) = 1 − θ 30 − θ 31 ◮ He tries several different specifications for the cost function and favors a linear form: c ( x , θ 1 ) = θ 11 x . 15 / 34

Rust (1987) and NFP estimation Nested Fixed Point Estimation ◮ Rust first considers a case with a closed-form expression for the value function, but this calls for restrictive assumptions on how mileage evolves. His nested fixed point estimation approach, however, is applicable quite generally. ◮ Basic idea: to evaluate objective function (likelihood) at a given θ , we should solve the value function for that θ 16 / 34

Rust (1987) and NFP estimation Nested Fixed Point Estimation Steps: 1. Impute a value of the discount factor β 2. Estimate θ 3 – the transition function for x – which can be done without the behavioral model 3. Inner loop: search over ( θ 1 , RC ) to maximize likelihood function. When evaluating the likelihood function for each candidate value of ( θ 1 , RC ): 3.1 Find the fixed point of the the Bellman equation for ( β, θ 1 , θ 3 , RC ). Iteration would work, but Rust uses a faster approach. 3.2 Using expression for conditional choice probabilities, evaluate likelihood: � T P ( i t | x t , θ ) p ( x t | x t − 1 , i t − 1 , θ 3 ) t =1 17 / 34

Rust (1987) and NFP estimation Estimates 18 / 34

Rust (1987) and NFP estimation Discount factor ◮ While Rust finds a better fit for β = . 9999 than β = 0, he finds that high levels of β basically lead to the same level of the likelihood function. ◮ Furthermore, the discount factor is non-parametrically non-identified. Note: He loses ability to reject β = 0 for more flexible cost function specifications. 19 / 34

Rust (1987) and NFP estimation Discount factor 20 / 34

Rust (1987) and NFP estimation Application 21 / 34

Hotz and Miller (1993) and CCPs "Conditional Choice Probabilities and the Estimation of Dynamic Models" Hotz and Miller (1993) 22 / 34

Estimating Single-Agent Dynamic Models Paul T. Scott New York - PowerPoint PPT Presentation

Estimating Single-Agent Dynamic Models Paul T. Scott New York University PhD Empirical IO Fall 2017 1 / 34 Introduction Why dynamic estimation? External validity Famous example: Hendel and Nevos (2006) estimation of laundry detergent

Estimating Variance under Estimating Mean . . . Interval and Fuzzy Estimating Variance . . .

Estimating Estimating Covariance . . . Statistical Characteristics Estimating . . . Proof of

Overview Multi-Agent Systems Introduction to multi-agent systems and agent societies Agent

An Agent Architecture An Agent Architecture An Agent Architecture An Agent Architecture for

S S S S calable calable Agent calable calable Agent Agent Plat forms Agent Plat forms

Agent-Based Systems Agent communication Speech act theory Michael Rovatsos Agent

Planning III-A: Planning III-A: Estimating Software Size - Estimating Software Size -

Estimating Frequency Moments Estimating F 0 Algorithm Correctness Further Anil Maheshwari

Estimating Frequency Moments Moments Estimating F 0 Algorithm Correctness Anil Maheshwari

The Player Agent The Player Agent Are they the most important league official right now? right

Rational Agents (Ch. 2) Rational agent An agent/robot must be able to perceive and interact with

Agent-Based Systems Michael Rovatsos mrovatso@inf.ed.ac.uk Lecture 6 Agent Communication 1

Dynamic agent architecture for Dynamic agent architecture for open e- -commerce commerce open

Agent-Based Systems Agent: autonomous Learning for Agent-Based Systems Environment: fully,

Estimating Relative Expression Mark Voorhies 4/6/2011 Mark Voorhies Estimating Relative

Single agent or multiple agents Many domains are characterized by multiple agents rather than a

AppsPlayground: Automatic Security Analysis of Smartphone Applications Vaibhav Rastogi , Yan Chen,

Nonuniform Timeslicing of Dynamic Graphs Based on Visual Complexity Yong Wang 1 , Daniel

IFC Inside: Retrofitting Languages with Dynamic Information Flow Control Stefan Heule, Deian

Dynamic Programming Dr. Mattox Beckman University of Illinois at Urbana-Champaign Department of

What about other storage reclamation schemes? Memory management options Manual /explicit

C ATCH T ARTAN : Representing and Summarizing Dynamic Multicontextual Behaviors Meng Jiang

Treatment Regime (iii) 11/07/2007 Discussion for today * * ( ,

Dynamic Memory Review: automa3c variables Automa'c variable :