likelihood based estimation model selection and
play

Likelihood-based estimation, model selection, and forecasting of - PowerPoint PPT Presentation

Likelihood-based estimation, model selection, and forecasting of integer-valued trawl processes Almut E. D. Veraart Imperial College London New Results on Time Series and their Statistical Applications CIRM Luminy, 14-18 September 2020 1 / 25


  1. Likelihood-based estimation, model selection, and forecasting of integer-valued trawl processes Almut E. D. Veraart Imperial College London New Results on Time Series and their Statistical Applications CIRM Luminy, 14-18 September 2020 1 / 25

  2. Collaborators This is joint work with ➤ Mikkel Bennedsen (Aarhus University) ➤ Asger Lunde (Aarhus University) ➤ Neil Shephard (Harvard University) 2 / 25

  3. Introduction ➤ Time series of counts appear in various applications: Medical science, epidemiology, meteorology, network modelling, actuarial science, econometrics and finance. ➤ Count data: Non-negative and integer-valued, and often over-dispersed (i.e. variance > mean). ➤ Recently the class of (integer-valued) trawl (IVT) processes has been introduced as a flexible model, see Barndorff-Nielsen et al. (2014) for the univariate and Veraart (2019) for the multivariate case. Aim of the project ➠ Improve the estimation method for IVT processes (likelihood-based rather than moment-based); ➠ Tailor model selection tools to the IVT class; ➠ (Probabilistic) forecasting of IVT processes; 3 / 25

  4. A very short and incomplete review of the literature ➤ Recent surveys & some new developments: Cameron & Trivedi (1998), Kedem & Fokianos (2002),Cui & Lund (2009); Davis et al. (1999); Davis & Wu (2009); Jung & Tremayne (2011); McKenzie (2003); Weiß (2008), Karlis (2016), Fokianos (2016). ➤ Literature on count data is spread across different disciplines. ➤ Overall, two predominant modelling approaches: ➠ Discrete autoregressive moving-average (DARMA) models introduced by Jacobs & Lewis (1978a,b). ➠ Models obtained from thinning operations going back to the influential work of Steutel & van Harn (1979), e.g. INAR(MA), see e.g. Pedeli et al. (2015). ➤ Further models: Regression type models (typically based on generalised linear models, see e.g. Fokianos (2016)), also Fokianos et al. (2020); state-space and Bayesian approaches. Our approach: ➤ Use ”trawling” for modelling counts. ➤ This is a continuous-time framework based on the idea of ”thinning” points. 4 / 25

  5. Introduction What is trawling...? A first ”definition” “Trawling is a method of fishing that involves pulling a fishing net through the water behind one or more boats. The net that is used for trawling is called a trawl.” (Wikipedia) 5 / 25

  6. Theoretical framework Definition of trawl process We define a stationary integer-valued trawl (IVT) process ( X t ) t ≥ 0 by � X t = L ( A t ) = R × R I A ( x , s − t ) L ( dx , ds ) . ➤ L is the integer-valued, homogeneous L´ evy basis on [ 0 , 1 ] × R : ➠ L ( dx , ds ) : = � ∞ − ∞ yN ( dy , dx , ds ) , ( x , s ) ∈ [ 0 , 1 ] × R . ➠ N is a homogeneous Poisson random measure on Z × [ 0 , 1 ] × R with compensator η ⊗ Leb ⊗ Leb , i.e. E ( N ( dy , dx , ds )) = η ( dy ) dxds , where η is evy measure satisfying � ∞ a L´ − ∞ min ( 1 , | y | ) η ( dy ) < ∞ . ➤ A Borel set A t = A + ( 0 , t ) with A = A 0 ⊆ [ 0 , 1 ] × ( − ∞ , 0 ] and Leb ( A ) < ∞ is called the trawl. ➠ Typically, we choose A to be of the form A = { ( x , s ) : s ≤ 0 , 0 ≤ x ≤ d ( s ) } , where d : ( − ∞ , 0 ] �→ [ 0 , 1 ] is continuous and Leb ( A ) < ∞ . 6 / 25

  7. Example Poisson-Exponential trawl 1 0.8 0.6 0.4 0.2 0 10 11 12 13 14 15 16 17 18 19 15 15 10 10 5 5 0 0 10 10 11 11 12 12 13 13 14 14 15 15 16 16 17 17 18 18 19 19 7 / 25

  8. Example Negative binomial-Exponential trawl 1 0.8 0.6 0.4 0.2 0 10 11 12 13 14 15 16 17 18 19 25 25 20 20 15 15 10 10 5 5 0 0 10 10 11 11 12 12 13 13 14 14 15 15 16 16 17 17 18 18 19 19 8 / 25

  9. Some key properties of IVT processes Cumulants ➤ The IVT process is stationary and infinitely divisible. ➤ The IVT process is mixing ⇒ weakly mixing ⇒ ergodic. ➤ The cumulant (log-characteristic) function of a trawl process is, for θ ∈ R , given by C X t ( θ ) = C L ( A t ) ( θ ) = Leb ( A ) C L ′ ( θ ) , where the random variable L ′ (called the L´ evy seed) associated with L satisfies � � e i θ y − 1 � E [ exp ( i θ L ′ )] = exp ( C L ′ ( θ )) , C L ′ ( θ ) = η ( dy ) . with ➠ I.e. to any infinitely divisible integer-valued law π , say, there exists a stationary integer-valued trawl process having π as its one-dimensional marginal law. ➤ The autocorrelation function is given by . = Cor ( Y t , Y t + h ) = Leb ( A ∩ A h ) ρ ( h ) . for h > 0 . , Leb ( A ) 9 / 25

  10. Examples Modelling the marginal distribution Example 1 (Poissonian L´ evy seed) Let L ′ ∼ Poisson ( ν ) . Then X t ∼ Poisson ( ν Leb ( A )) , i.e., for all t ≥ 0, P ( X t = k ) = ( ν Leb ( A )) k e − ν Leb ( A ) / k ! , k = 0 , 1 , 2 , . . . . Example 2 (Negative Binomial L´ evy seed) Let L ′ ∼ NB ( m , p ) for m > 0 , p ∈ [ 0 , 1 ] . Then X t ∼ NB ( m Leb ( A ) , p ) , i.e., for all t ≥ 0, P ( X t = k ) = Γ ( Leb ( A ) m + k ) k ! Γ ( Leb ( A ) m ) ( 1 − p ) Leb ( A ) m p k , k = 0 , 1 , 2 , . . . , where Γ ( z ) = � ∞ 0 x z − 1 e − x dx for z > 0 is the Γ -function. 10 / 25

  11. Examples Modelling the trawl function/correlation structure ➤ Recall the typical choice for the trawl: A = A 0 = { ( x , s ) : s ≤ 0 , 0 ≤ x ≤ d ( s ) } , A t = A + ( 0 , t ) . ➤ Restrict attention to a class of superposition trawls : � ∞ e λ s π ( d λ ) , d ( s ) : = s ≤ 0 , 0 where π is a probability measure on R + . ➤ For h ≥ 0, the acf is given by � ∞ h d ( − s ) ds ρ ( h ) : = Cor ( L ( A t + h ) , L ( A t )) = Leb ( A h ∩ A ) = � ∞ 0 d ( − s ) ds . Leb ( A ) 11 / 25

  12. Examples Modelling the trawl function/correlation structure ➤ Exponential trawl function: Let λ > 0 and π ( dx ) = δ λ ( dx ) , then d ( s ) = e λ s for s ≤ 0 and ρ ( h ) = Cor ( X t + h , X t ) = exp ( − λ h ) , h ≥ 0 . ➤ Inverse Gaussian trawl function: Letting π be given by the inverse Gaussian distribution π ( dx ) = ( γ / δ ) 1 / 2 � − 1 � 2 ( δ 2 x − 1 + γ 2 x ) 2 K 1 / 2 ( δγ ) x − 1 / 2 exp dx , where K ν ( · ) is the modified Bessel function of the third kind and γ , δ ≥ 0 with both not zero simultaneously. Then � � − 1 / 2 � � �� � 1 − 2 s 1 − 2 s d ( s ) = 1 − s ≤ 0 , exp δγ , γ 2 γ 2 � � � ρ ( h ) = Cor ( X t + h , X t ) = exp δγ ( 1 − 1 + 2 h / γ 2 ) h ≥ 0 . , 12 / 25

  13. Examples Modelling the trawl function/correlation structure ➤ Gamma trawl function: Let π have the Γ ( 1 + H , α ) density, 1 Γ ( 1 + H ) α 1 + H λ H e − λα dx , π ( dx ) = where α > 0 and H > 0 . � − ( H + 1 ) 1 − s � d ( s ) = s ≤ 0 , , α and � − H ρ ( h ) = Cor ( X t + h , X t ) = Leb ( A h ∩ A ) � 1 + h = . Leb ( A ) α Note that in this case � ∞ � ∞ if H ∈ ( 0 , 1 ] , ρ ( h ) dh = α if H > 1 , 0 H − 1 i.e. the trawl process has long memory for H ∈ ( 0 , 1 ] . 13 / 25

  14. Estimation From method of moments to composite likelihood ➤ Suppose we have n ∈ N observations of the IVT process X , x 1 , . . . , x n , on an equidistant grid of size ∆ = T / n . ➤ Define n − h CL ( h ) ( θ ; x ) : = ∏ f ( x i + h , x i ; θ ) , h ≥ 1 . i = 1 ➤ Let Θ be a compact parameter space such that the true parameter vector, θ 0 , lies in the interior of Θ . ➤ Construct the composite likelihood function, for H ⊆ { 1 , 2 , . . . , n − 1 } , n − h L H CL ( θ ; x ) : = ∏ CL ( h ) ( θ ; x ) = ∏ ∏ f ( x i + h , x i ; θ ) . h ∈H h ∈H i = 1 ➤ The maximum composite likelihood (MCL) estimator of θ is defined as θ CL : = arg max ˆ θ ∈ Θ l H CL ( θ ; x ) , where l H CL ( θ ; x ) : = log L H CL ( θ ; x ) is the log composite likelihood function. 14 / 25

  15. Pairwise likelihood The general case and a simulation-based approach ➤ The joint probability mass function of two observations x i + h and x i is � � f ( x i + h , x i ; θ ) : = P θ X ( i + h ) ∆ = x i + h , X i ∆ = x i ∞ � � ∑ = P θ L ( A ( i + h ) ∆ \ A i ∆ ) = x i + h − c c = − ∞ � � · P θ L ( A i ∆ \ A ( i + h ) ∆ ) = x i − c � � · P θ L ( A ( i + h ) ∆ ∩ A i ∆ ) = c . evy basis L is positive, i.e. η ( y ) = 0 for y < 0. Then we ➤ Suppose the L´ min { x i + h , x i } can replace ∑ ∞ c = − ∞ by ∑ in the above formula. c = 0 ➤ Let t , s ≥ 0, choose C ∈ N and let c ( j ) ∼ L ( A t ∩ A s ) , j = 1 , 2 , . . . , C , be an iid sample. A simulation based unbiased estimator of f ( x t , x s ; θ ) is C f ( x t , x s ; θ ) = 1 ˆ P θ ( L ( A t \ A s ) = x t − c ( j ) ) P θ ( L ( A s \ A t ) = x s − c ( j ) ) . ∑ C j = 1 15 / 25

  16. MCL outperforms GMM for IVTs 2 2 m p 1.5 1.5 1 1 0.5 0.5 0 0 0 2000 4000 6000 8000 0 2000 4000 6000 8000 2 2 m p 1.5 1.5 1 1 0.5 0.5 0 0 0 2000 4000 6000 8000 0 2000 4000 6000 8000 2 2 m H p 1.5 1.5 H 1 1 0.5 0.5 0 0 0 2000 4000 6000 8000 0 2000 4000 6000 8000 RMSE of the MCL estimator divided by the RMSE of the GMM estimator. 16 / 25

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend