Analysis of Economic Data with Multiscale Spatio-Temporal Models - - PowerPoint PPT Presentation

analysis of economic data with multiscale spatio temporal
SMART_READER_LITE
LIVE PREVIEW

Analysis of Economic Data with Multiscale Spatio-Temporal Models - - PowerPoint PPT Presentation

Analysis of Economic Data with Multiscale Spatio-Temporal Models Marco A. R. Ferreira (University of Missouri - Columbia) Adelmo Bertolde (Federal University of Esp rito Santo, Brazil) Scott Holan (University of Missouri, Columbia) Outline


slide-1
SLIDE 1

Analysis of Economic Data with Multiscale Spatio-Temporal Models

Marco A. R. Ferreira (University of Missouri - Columbia) Adelmo Bertolde (Federal University of Esp´ ırito Santo, Brazil) Scott Holan (University of Missouri, Columbia)

slide-2
SLIDE 2

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-3
SLIDE 3

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-4
SLIDE 4

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Esp´ ırito Santo: Log of agriculture production per county

Observed - 1990 Estimated -1990

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-5
SLIDE 5

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Esp´ ırito Santo: Log of agriculture production per county

Observed - 1993 Estimated - 1993

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-6
SLIDE 6

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Esp´ ırito Santo: Log of agriculture production per county

Observed - 1996 Estimated - 1996

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-7
SLIDE 7

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Esp´ ırito Santo: Log of agriculture production per county

Observed - 1999 Estimated - 1999

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-8
SLIDE 8

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Esp´ ırito Santo: Log of agriculture production per county

Observed - 2002 Estimated - 2002

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-9
SLIDE 9

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Esp´ ırito Santo: Log of agriculture production per county

Observed - 2005 Estimated - 2005

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-10
SLIDE 10

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-11
SLIDE 11

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Some background

◮ Many processes of interest are naturally spatio-temporal. ◮ Frequently, quantities related to these processes are available

as areal data.

◮ These processes may often be considered at several different

levels of spatial resolution.

◮ Related work on dynamic spatio-temporal multiscale

modeling: Berliner, Wikle and Milliff (1999), Johannesson, Cressie and Huang (2007).

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-12
SLIDE 12

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Data Structure

Here, the region of interest is divided in geographic subregions or blocks, and the data may be averages or sums over these subregions. Each state in Brazil is divided into counties, microregions and macroregions; counties are then grouped into microregions, which are then grouped into macroregions, according to their socioeconomic similarity. Thus, our analysis considers three levels

  • f resolution: county, microregion, and macroregion.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-13
SLIDE 13

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Geopolitical organization

(a) (b) (c)

Figure: Geopolitical organization of Esp´ ırito Santo State by (a) counties, (b) microregions, and (c) macroregions.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-14
SLIDE 14

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-15
SLIDE 15

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Multiscale factorization

At each time point we decompose the data into empirical multiscale coefficients using the spatial multiscale modeling framework of Kolaczyk and Huang (2001). See also Chapter 9 of Ferreira and Lee (2007). Interest lies in agricultural production observed at the county level, which we assume is the Lth level of resolution (i.e. the finest level

  • f resolution), on a partition of a domain S ⊂ R2.

For the jth county, let yLj, µLj = E(yLj), and σ2

Lj = V (yLj)

respectively denote agricultural production, its latent expected value and variance.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-16
SLIDE 16

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Let Dlj be the set of descendants of subregion (l, j). The aggregated measurements at the lth level of resolution are recursively defined by ylj =

  • (l+1,j′)∈Dlj

yl+1,j′. Analogously, the aggregated mean process is defined by µlj =

  • (l+1,j′)∈Dlj

µl+1,j′. Assuming conditional independence, σ2

lj =

  • (l+1,j′)∈Dlj

σ2

l+1,j′.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-17
SLIDE 17

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Then yDlj

  • ylj, µL, σ2

L ∼ N(νljylj + θlj, Ωlj),

with νlj = σ2

Dlj/σ2 lj,

θlj = µDlj − νljµlj, and Ωlj = ΣDlj − σ−2

lj σ2 Dlj

  • σ2

Dlj

′ .

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-18
SLIDE 18

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Consider θe

lj = yDlj − νljylj,

which is an empirical estimate of θlj. Then θe

lj|ylj, µL, σ2 L ∼ N(θlj, Ωlj),

which is a singular Gaussian distribution (Anderson, 1984).

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-19
SLIDE 19

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-20
SLIDE 20

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Exploratory Multiscale Data Analysis

Macroregion 1 Disaggregated Empirical total by microregion multiscale coefficient

1990 1995 2000 2005 200 400 600 800 year 1990 1995 2000 2005 200 400 600 800 year

Micro−region 1 Micro−region 2 Micro−region 3 Micro−region 4 Micro−region 5

1990 1995 2000 2005 −300 −200 −100 100 200 300 year

Micro−region 1 Micro−region 2 Micro−region 3 Micro−region 4 Micro−region 5

Esp´ ırito Santo: Agriculture production of Macroregion 1.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-21
SLIDE 21

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Macroregion 2 Disaggregated Empirical total by microregion multiscale coefficient

1990 1995 2000 2005 200 400 600 800 year 1990 1995 2000 2005 200 400 600 800 year

Micro−region 6 Micro−region 7

1990 1995 2000 2005 −300 −200 −100 100 200 300 year

Micro−region 6 Micro−region 7

Esp´ ırito Santo: Agriculture production of Macroregion 2.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-22
SLIDE 22

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Macroregion 3 Disaggregated Empirical total by microregion multiscale coefficient

1990 1995 2000 2005 200 400 600 800 year 1990 1995 2000 2005 200 400 600 800 year

Micro−region 8 Micro−region 9 Micro−region 10

1990 1995 2000 2005 −300 −200 −100 100 200 300 year

Micro−region 8 Micro−region 9 Micro−region 10

Esp´ ırito Santo: Agriculture production of Macroregion 3.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-23
SLIDE 23

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Macroregion 4 Disaggregated Empirical total by microregion multiscale coefficient

1990 1995 2000 2005 200 400 600 800 year 1990 1995 2000 2005 200 400 600 800 year

Micro−region 11 Micro−region 12

1990 1995 2000 2005 −300 −200 −100 100 200 300 year

Micro−region 11 Micro−region 12

Esp´ ırito Santo: Agriculture production of Macroregion 4.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-24
SLIDE 24

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-25
SLIDE 25

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

The multiscale spatio-temporal model

Observation equation: ytL = µtL + vtL, vtL ∼ N(0, ΣL) where ΣL = diag(σ2

L1, . . . , σ2 LnL).

Multiscale decomposition of the observation equation: yt1k | µt1k ∼ N(µt1k, σ2

1k)

θe

tlj | θtlj ∼ N(θtlj, Ωlj)

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-26
SLIDE 26

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

System equations: µt1k = µt−1,1k + wt1k, wt1k ∼ N(0, ξkσ2

1k)

θtlj = θt−1,lj + ωtlj, ωtlj ∼ N(0, ψljΩlj)

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-27
SLIDE 27

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Priors

µ01k|D0 ∼ N(m01k, c01kσ2

1k),

θ0lj|D0 ∼ N(m0lj, C0ljΩlj), ξk ∼ IG(0.5τk, 0.5κk), ψlj ∼ IG(0.5̺lj, 0.5ςlj),

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-28
SLIDE 28

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-29
SLIDE 29

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Empirical Bayes estimation of νlj and Ωlj

νlj: vector of relative volatilities of the descendants of (l, j), Ωlj: singular covariance matrix of the empirical multiscale coefficient of subregion (l, j) In order to obtain an initial estimate of σ2

Lj, we perform a

univariate time series analysis for each county using first-order dynamic linear models (West and Harrison, 1997). These analyses yield estimates ˜ σ2

Lj.

We estimate νlj and Ωlj by ˜ νlj = ˜ σ2

Dlj/˜

σ2

lj,

˜ Ωlj = ˜ ΣDlj − ˜ σ−2

lj .

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-30
SLIDE 30

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-31
SLIDE 31

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Posterior exploration

Let θ•lj = (θ′

0lj, . . . , θ′ Tlj)′,

θt•j = (θ′

t1j, . . . , θ′ tLj)′,

θ••• = (θ′

  • 11, . . . , θ′
  • 1n1, θ′
  • 21, . . . , θ′
  • 2n2, . . . , θ′
  • L1, . . . , θ′
  • LnL)′,

with analogous definitions for the other quantities in the model. It can be shown that, given σ2

  • , ξ•, and ψ••, the vectors

µ•11, . . . , µ•1n1, θ•11, . . . , θ•1n1, . . . , θ•L1, . . . , θ•LnL, are conditionally independent a posteriori.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-32
SLIDE 32

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Gibbs sampler

◮ µ•1k : Forward Filter Backward Sampler (FFBS) (Carter and

Kohn, 1994; Fruhwirth-Schnatter,1994).

◮ ξk|µ•1k, σ2 1k, DT ∼ IG (0.5τ ∗ k , 0.5κ∗ k) , where τ ∗ k = τk + T and

κ∗

k = κk + σ−2 1k

T

t=1(µt1k − µt−1,1k)2. ◮ ψlj|θ•lj, DT ∼ IG(0.5̺∗ lj, 0.5ς∗ lj), where ̺∗ lj = ̺lj + T(mlj − 1)

and ς∗

lj = ςlj + T t=1(θtlj − θt−1,lj)′Ω− lj (θtlj − θt−1,lj), where

Ω−

lj is a generalized inverse of Ωlj. ◮ θ•lj: Singular FFBS.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-33
SLIDE 33

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Singular FFBS

  • 1. Use the Kalman filter to obtain the mean and covariance

matrix of f (θ1lj|σ2, ψlj, D1), . . . , f (θTlj|σ2, ψlj, DT):

◮ posterior at t − 1: θt−1,lj|Dt−1 ∼ N (mt−1,lj, Ct−1,ljΩlj) ; ◮ prior at t: θtlj|Dt−1 ∼ N (atlj, RtljΩlj) , where atlj = mt−1,lj

and Rtlj = Ct−1,lj + ψlj;

◮ posterior at t: θtlj|Dt ∼ N (mtlj, CtljΩlj) , where

Ctlj = (1 + R−1

tlj )−1 and mtlj = Ctlj

  • θe

tlj + R−1 tlj atlj

  • .
  • 2. Simulate θTlj from θTlj|σ2, ψlj, DT ∼ N(mTlj, CTljΩlj).
  • 3. Recursively simulate θtlj, t = T − 1, . . . , 0, from

θtlj|θt+1,lj, . . . , θTlj, DT ≡ θtlj|θt+1,lj, Dt ∼ N(htlj, HtljΩlj), where Htlj =

  • C −1

tlj + ψ−1 lj

−1 and htlj = Htlj

  • C −1

tlj mtlj + ψ−1 lj θt+1,lj

  • .

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-34
SLIDE 34

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Reconstruction of the latent mean process

One of the main interests of any multiscale analysis is the estimation of the latent mean process at each scale of resolution. From the gth draw from the posterior distribution, we can recursively compute the corresponding latent mean process at each level of resolution using the equation µ(g)

t,Dlj = θ(g) tlj + νtljµ(g) tlj ,

proceeding from the coarsest to the finest resolution level. With these draws, we can then compute the posterior mean, standard deviation and credible intervals for the latent mean process.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-35
SLIDE 35

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-36
SLIDE 36

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Marginal posterior densities for the signal-to-noise ratio ξk

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

ξ1 ξ2

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

ξ3 ξ4

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-37
SLIDE 37

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Marginal posterior densities for the signal-to-noise ratio ψ1j

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

ψ11 ψ12

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0

ψ13 ψ14

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-38
SLIDE 38

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Mean process at coarse level

1990 1995 2000 2005 200 400 600 800 year 1990 1995 2000 2005 200 400 600 800 year

µt11 µt12

1990 1995 2000 2005 200 400 600 800 year 1990 1995 2000 2005 200 400 600 800 year

µt13 µt14

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-39
SLIDE 39

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Multiscale coefficient for Macroregion 1

1990 1995 2000 2005 −300 −200 −100 100 year 1990 1995 2000 2005 −300 −200 −100 100 year 1990 1995 2000 2005 −300 −200 −100 100 year

θt111 θt112 θt113

1990 1995 2000 2005 −300 −200 −100 100 year 1990 1995 2000 2005 −300 −200 −100 100 year

θt114 θt115

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-40
SLIDE 40

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Observed agriculture production and estimated mean

1993 1997 2001 2005 Observed Estimated

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira

slide-41
SLIDE 41

Outline

Motivation Introduction Multiscale factorization Exploratory Multiscale Data Analysis The multiscale spatio-temporal model Empirical Bayes estimation Posterior exploration Agricultural Production in Esp´ ırito Santo Conclusions

slide-42
SLIDE 42

Motivation Introduction Factorization Exploratory Analysis Model EB estimation MCMC Application Conclusions

Conclusions

◮ New multiscale spatio-temporal model for areal data. ◮ Dynamic multiscale coefficients. ◮ Efficient Bayesian estimation. ◮ Potential to be used with massive datasets.

Gaussian Multiscale Spatio-temporal Models Marco A. R. Ferreira