Bayesian Estimation of Input‐Output Tables for Russia
Oleg Lugovoy (EDF, RANE) Andrey Polbin (RANE) Vladimir Potashnikov (RANE) WIOD Conference April 24, 2012 Groningen
Bayesian Estimation of InputOutput Tables for Russia Oleg Lugovoy - - PowerPoint PPT Presentation
Bayesian Estimation of InputOutput Tables for Russia Oleg Lugovoy (EDF, RANE) Andrey Polbin (RANE) Vladimir Potashnikov (RANE) WIOD Conference April 24, 2012 Groningen Outline Motivation Objectives Bayesian methods: a short
Oleg Lugovoy (EDF, RANE) Andrey Polbin (RANE) Vladimir Potashnikov (RANE) WIOD Conference April 24, 2012 Groningen
– Bayesian vs. RAS & Entropy: MC experiment – Updating IOT for Russia
– Incorporate uncertainties into IOT estimates – Apply Bayesian framework for IOT updating
– Full density profile estimates with covariates for Russian IOT (OKONH, OKVED 2001‐2010, 15, 23 and 79 activities)
to estimate, update, disaggregate, get their best estimate. Each procedure involves assumptions, which draw results. However, there are a number of ways to do that, and it is not always straightforward to prefer one assumption to another, based on available information.
be crucial for an analysis that applies the estimated IOT as an input data.
uncertainties into estimation process. Assigning probability distributions for unknown parameters it allows to trace the link from assumptions to economic analysis results.
‐ unknown parameters ‐ data ‐ prior distribution of parameters ‐ likelihood function ‐ Posterior distribution (combination of information from prior and data)
parameters, we will consider them to be random variables.
inferences about the parameters.
incorporated to the estimates using priors.
uncertain parameters which comes from two sources: the prior distribution and the observed data.
can be derived
posterior distribution, sampling methods applied
Monte Carlo Markov Chains (MCMC) method with Gibbs or Metropolis‐Hasting algorithm
– application of MCMC to sample elements of A‐
constrains.
, ,
i j j i j i
Y = AX (A – matrix is unknown) to Bz = Y* (z – vector is unknown, standard problem in linear algebra) where z – vectorized matrix A Y* ‐ combined Y and a_hat vectors therefore we have to sample z in form: where z‐tilde is a particular solution, F – fundamental matrix, psi – stochastic component.
(1) (1)
z z F ξ = +
– 1. Artificial data: MC experiment – 2. Historical data: IOT‐2003 (OKONH 23x23)
– 3. Historical USE‐2006 (OKVED 15x15) – 4. Forecasting USE 2007‐2010 (OKVED 15x15)
A‐matrices 4x4 for six years
Entropy, assuming we know A5, Y6 and X6
A1 …A5, Y6 and X6 (estimating standard deviation for A elements based on A1 …A4 , and assigning the information to priors)
1 2 6
, ,.., A A A
6
A
– i.i.d. – Stationary AR(1) – Random walk
2 , , ,
( , ), 3
t i j i j i j
a N m i σ ≤
1 2 , , , , , ,
(1 ) , (0, ), 3
t t t t i j i j i j i j i j i j
a m a N i ρ ρ ε ε σ
−
= − + + ≤
1 2 , , , , ,
, (0, ), 3
t t t t i j i j i j i j i j
a a N i ε ε σ
−
= + ≤
closer to true matrix
69.0% 67.1% 74.1% 72.4% 78.1% 76.2% MAPE 67.7% 66.1% 73.1% 71.2% 77.8% 76.0% MAE 63.0% 62.0% 67.8% 67.3% 73.2% 72.2% RMSE RAS Entropy RAS Entropy RAS Entropy Random-Walk AR(1) process Independent process
( )
2 4 4 1 1
ˆ 1/16
ij ij i j
RMSE a a
= =
= −
∑ ∑
4 4 1 1
ˆ 1/16
ij ij i j
MAE a a
= =
= −
∑ ∑
4 4 1 1
ˆ 1/16
ij ij i j ij
a a MAPE a
= =
− =
∑ ∑
0.01 0.02 0.03 0.04 0.05 0.06 0.01 0.02 0.03 0.04 0.05 0.06 RMSE in RAS Method RMSE in Bayesian Method
type) definition of activities
unknown
Entropy”
Experiment: Updating IOT for Russia (cont)
where RMSPE - root of mean squared percentage error
RMSE MAE MAPE RMSPE Bayes 0.0074 0.0029 0.1844 0.4502 RAS 0.0067 0.0026 0.1728 0.4604 Entropy 0.0065 0.0026 0.1797 0.4552 2 1 1
ˆ 1/ ( * )
m n ij ij i j ij
a a RMSPE m n a
= =
− =
∑ ∑
Experiment: Updating IOT for Russia (cont)
Estimation of USE matrix in OKVED (NACE) definition of activities
A – unknown input‐output matrix Y – known intermediate demand vector X – known output vector What inference can we make toward A, when no any other information is available?
What inference can we make toward A, when X and Y are known
Let’s sample N variants of matrix A, satisfying to the constrains: Y = AX, given X и Y, using MCMC, non‐informative priors:
aij ~ uniform(0,1)
Sampling USE‐2006 table; N = 45000
Estimated А‐matrix (blue) vs. true value (red)
zero – for activities with relatively small
columns sum (<1). If one of a value is relatively large, others should tend to zero.
Distribution of pair‐wise correlation coefficients
A‐matrix elements means that constraining
affect other.
coefficients: A(D,D), specifying ”tight” prior for it.
Sampling with “tight” prior for A(D,D)
Comparison of results with true values: no constrains
Comparison of results with true values: constrained А(D,D)
prior year 2007 year 2008 year 2009 year 2010
incorporate uncertainties in data into estimation process
and estimating IOT
distribution of the estimated parameters might be used as an input information for sensitivity analysis on a stage of implementation of the analysis.
involving data from National Accounts and
current and constant prices).
79.
apolbin@gmail.com potashnikov.vu@gmail.com