Modeling Periodicity in Point Processes Nan Shao and Keh-Shin Lii - - PowerPoint PPT Presentation

modeling periodicity in point processes
SMART_READER_LITE
LIVE PREVIEW

Modeling Periodicity in Point Processes Nan Shao and Keh-Shin Lii - - PowerPoint PPT Presentation

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Modeling Periodicity in Point Processes Nan Shao and Keh-Shin Lii Department of Statistics University of California, Riverside 10/07/09


slide-1
SLIDE 1

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Modeling Periodicity in Point Processes

Nan Shao and Keh-Shin Lii

Department of Statistics University of California, Riverside

10/07/09

Nan Shao and Keh-Shin Lii Modeling Periodicity in Point Processes

slide-2
SLIDE 2

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Outline

Introduction Motivation and Basic Concept of Point Process A Brief Review and Our Model Assumptions and Main Results Assumptions Estimation of the Parameters Prediction Simulation Computational Issue and Further Research

2 / 49

slide-3
SLIDE 3

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Point Process

Point processes arise when events occur at random times.

◮ Initiation of phone calls at a service center. ◮ Earthquake in certain area. ◮ The arrival of ambulance at ER. ◮ Stock transactions.

3 / 49

slide-4
SLIDE 4

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Periodic Point Processes

◮ Stock transaction with daily cycle. ◮ Weekly effect in phone calls at customer center. ◮ Imoto et al. (1999) discussed the periodic pattern of seismicity

  • bserved in central Japan.

4 / 49

slide-5
SLIDE 5

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Definition of Point Process

◮ Let {ti, i = 1, 2, . . .} be a sequence of nonnegative random variables on

a probability space (Ω, F, P) with 0 ≤ ti ≤ ti+1, then the sequence {ti} is called a point process on [0, ∞].

◮ If there is no multiple occurrence, namely, ti < ti+1 for any i, the

process is called a simple point process. We will restrict out attention to simple point process.

◮ Let the sequence {t1, t2, . . .} be a point process. Let N(s, t) be the

number of points in time interval (s, t], and denote N(t) = N(0, t). Then N(t) is called a counting process.

5 / 49

slide-6
SLIDE 6

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Intensity Function

The intensity function λ(t) of a point process, also called the mean rate of

  • ccurrence in Cox and Isham (1980), is defined as

λ(t) = lim

δ→0+

P{N(t, t + δ) > 0} δ = lim

δ→0+

E{N(t, t + δ)} δ . So E{dN(t)} = λ(t)dt. It describes the first-order moment property of the unconditional counting measure.

6 / 49

slide-7
SLIDE 7

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

time intensity 10 20 30 40 50 60 1 2 3

Figure: Solid line: the intensity function of a non-homogeneous Poisson process λ(t) = cos(

π 4 √ 3t) + 0.5 cos( π 3 √ 2t + π 4 ) + 1.6. Triangle points: the occurrence time

  • f events.

7 / 49

slide-8
SLIDE 8

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

A Brief Review

◮ Lewis (1970, 1972) established the estimation and detection of a cyclic

varying rate of a non-homogeneous Poisson process when the frequency is known a priori. The rate function took the following form λ(t) = A exp{ρ cos(ωt + φ)}. Vere-Jones (1982) proposed a method to estimate the frequency ω in above model and established its asymptotic property.

◮ Helmers et al. (2003) constructed a consistent kernel-type

nonparametric estimate of the intensity function of a cyclic Poisson process where the intensity function is cyclic with only one period and the period is unknown.

8 / 49

slide-9
SLIDE 9

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Almost Periodic Point Processes

◮ General idea on ‘almost periodic’: any particular configuration that

  • ccurs once may recur not exactly, but within some accuracy.

◮ A sequence which appears to be ‘chaotic’ and ‘uninformative’ is a

superstition of several periodic components and it is almost periodic.

◮ Almost periodic point process is much more general than the periodic

  • ne.

20 40 60 80 100 0.0 0.5 1.0 1.5 2.0 2.5 3.0 t intensity

Figure: Almost periodic function λ(t) = cos(

π 4 √ 3t) + 0.5 cos( π 3 √ 2t + π 4 ) + 1.6.

9 / 49

slide-10
SLIDE 10

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Our Model

The intensity function we consider is of the form λ(t) =

K

  • i=1

Ak cos(ωkt + φk) + B, (1) where Ak, B, ωk, φk are unknown parameters with A1 > A2 > · · · > AK > 0, K

k=1 Ak < B, 0 ≤ φk < 2π and ωk > 0, k = 1, . . . , K.

10 / 49

slide-11
SLIDE 11

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

Periodogram

Periodogram in point processes is defined by Bartlett (1963) as follows IT(ω) = dT(ω)dT(ω), where (0, T) is the observation interval, and dT(ω) is the finite Fourier transform, defined as dT(ω) = 1 √ 2πT T e−iωtdN(t) = 1 √ 2πT

N(t)

  • j=1

e−iωtj.

11 / 49

slide-12
SLIDE 12

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Motivation and Basic Concept of Point Process A Brief Review and Our Model

1 2 3 4 5 5 10 15 ω periodogram

Figure: The periodogram of a non-homogeneous Poisson process with intensity function λ(t) = cos(

π 4 √ 3t) + 0.5 cos( π 3 √ 2t + π 4 ) + 1.6. The process consists 778

  • bservations with observation length T = 500.

12 / 49

slide-13
SLIDE 13

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Assumptions

◮ Assumption 1

N(t), t ≥ 0, is a non-homogeneous Poisson process with a periodic or almost periodic intensity function (1), observed over the time interval [0, T], and the number of periodic components K is given.

◮ Minimum separation:

Assumption 2 T mink=k′(|ωk − ωk′|) → ∞, as T → ∞.

◮ Searching range:

Assumption 3 O(Tδ′−1) ≤ ωk ≤ ΩT, k = 1, . . . , K, where 0 < δ′ < 1 and ΩT is the upper bound, possibly determined by

  • bservations on the process in the interval (0, T) with

E(ΩT) = O(T1−δ), δ > 0.

13 / 49

slide-14
SLIDE 14

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Estimation of the Frequencies

Let ω = (ω1, . . . , ωK)′, we determine ˆ ωT as frequencies corresponding to the K largest local maxima of the periodogram in the range defined in Assumption 3 and the corresponding frequencies are well separated where their shortest distance cannot decrease sufficiently faster than O(T−1) as T → ∞.

1 2 3 4 5 5 10 15 ω periodogram

Figure: The periodogram of a non-homogeneous Poisson process with intensity function λ(t) = cos(

π 4 √ 3t) + 0.5 cos( π 3 √ 2t + π 4 ) + 1.6. The process consists 778

  • bservations with observation length T = 500.

14 / 49

slide-15
SLIDE 15

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

1 2 3 4 5 5 10 15 ω periodogram [ ]

15 / 49

slide-16
SLIDE 16

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

1 2 3 4 5 5 10 15 ω periodogram [ ]

16 / 49

slide-17
SLIDE 17

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Super Efficiency of ˆ ωT

Proposition 1 Under Assumptions 1, 2 and 3, ˆ ωT is a consistent estimate

  • f ω, and

(ˆ ωk,T − ωk) = o(T−1), (a.s.), k = 1, . . . , K.

17 / 49

slide-18
SLIDE 18

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Asymptotic Normality of ˆ ωT

Theorem 1 Under Assumptions 1, 2 and 3, T

3 2 (ˆ

ωT − ω) is asymptotically normally distributed as T → ∞, with mean 0, and variance-covariance

lim

T→∞ cov(T

3 2 (ˆ

ωk,T − ωk), T

3 2 (ˆ

ωk′,T − ω′

k)) =

12 AkAk′

  • 2Bδk,k′ +

K

  • j=1

Aj×

  • − cos(φj − φk − φk′)δj,k+k′ + cos(φj − φk + φk′)δj,k−k′ + cos(φj + φk − φk′)δj,k′−k
  • ,

where k, k′ = 1, . . . , K. In particular, the variance is lim

T→∞ var(T

3 2 (ˆ

ωk,T − ωk)) = 12 A2

k

  • 2B −

K

  • j=1

Aj cos(φj − 2φk)δj,k+k

  • .

Here δk,k′ = I{ωk = ωk′} δj,k+k′ = I{ωj = ωk + ωk′}, δj,k−k′ = I{ωj = ωk − ωk′}, δj,k′−k = I{ωj = ωk′ − ωk},

18 / 49

slide-19
SLIDE 19

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Estimation of the Amplitudes and Phases

Define ˆ Ak,T and ˆ φk,T as the estimates of Ak and φk by ˆ A2

k,T = (8π/T)IT(ˆ

ωk,T), so ˆ Ak,T =

  • ˆ

A2

k,T,

and tan ˆ φk,T = −T−1 T sin ˆ ωk,TtdN(t)

  • T−1

T cos ˆ ωk,TtdN(t), so ˆ φk,T = arctan tan ˆ φk,T, where k = 1, . . . , K.

19 / 49

slide-20
SLIDE 20

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Asymptotic Normality of ˆ Ak and ˆ φk

Theorem 2 Under Assumptions 1, 2 and 3, T

1 2 (ˆ

AT − A) and T

1 2 ( ˆ

φT − φ) are asymptotically normally distributed as T → ∞, with mean 0, and variance-covariance

lim

T→∞ cov(T

1 2 (ˆ

Ak,T − Ak), T

1 2 (ˆ

Ak′,T − Ak′)) = 2Bδk,k′ +

K

  • j=1

Aj×

  • cos(φj − φk − φk′)δj,k+k′ + cos(φj − φk + φk′)δj,k−k′ + cos(φj + φk − φk′)δj,k′−k
  • ,

lim

T→∞ cov(T

1 2 (ˆ

φk,T − φk), T

1 2 (ˆ

φk′,T − φk′)) = 4 AkAk′

  • 2Bδk,k′ +

K

  • j=1

Aj×

  • − cos(φj − φk − φk′)δj,k+k′ + cos(φj − φk + φk′)δj,k−k′ + cos(φj + φk − φk′)δj,k′−k
  • .

where k, k′ = 1, . . . , K.

20 / 49

slide-21
SLIDE 21

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

In particular, the variances are

lim

T→∞ var(T

1 2 (ˆ

Ak,T − Ak)) = 2B +

K

  • j=1

Aj cos(φj − 2φk)δj,k+k, lim

T→∞ var(T

1 2 (ˆ

φk,T − φk)) = 4 A2

k

  • 2B −

K

  • j=1

Aj cos(φj − 2φk)δj,k+k

  • .

21 / 49

slide-22
SLIDE 22

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Estimation of the Baseline Constant

Define ˆ BT as ˆ BT = max(

K

  • k=1

ˆ Ak,T, N(T)/T). Let ε = B − K

k=1 Ak > 0. We show that N(T)/T − K k=1 ˆ

Ak,T → ε > 0 in probability as T → ∞. So for large T, ˆ B has the same asymptotic property as N(T)/T. Theorem 3 Under Assumptions 1 and 2, T

1 2 (ˆ

BT − B) is asymptotically normally distributed as T → ∞, with mean 0, and variance B.

22 / 49

slide-23
SLIDE 23

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

Covariance of the Parameter Estimates

Theorem 4 Under Assumptions 1, 2 and 3, T

3 2 (ˆ

ωT − ω), T

1 2 (ˆ

AT − A), T

1 2 ( ˆ

φT − φ), and T

1 2 (ˆ

BT − B) are jointly normally distributed as T → ∞, with mean 0, and variance-covariance

lim

T→∞ cov(T

3 2 (ˆ

ωk,T − ωk), T

1 2 (ˆ

φk′,T − φk′)) = 6 AkAk′

K

  • j=1

Aj×

  • cos(φj − φk − φk′)δj,k+k′ − cos(φj − φk + φk′)δj,k−k′ − cos(φj + φk − φk′)δj,k′−k
  • ,

lim

T→∞ cov(T

1 2 (ˆ

Ak,T − Ak), T

1 2 (ˆ

φk′,T − φk′)) = 1 Ak′

K

  • j=1

Aj×

  • sin(φj − φk − φk′)δj,k+k′ − sin(φj − φk + φk′)δj,k−k′ + sin(φj + φk − φk′)δj,k′−k
  • ,

lim

T→∞ cov(T

1 2 (ˆ

Ak,T − Ak), T

1 2 (ˆ

BT − B)) = B,

23 / 49

slide-24
SLIDE 24

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research Assumptions Estimation of the Parameters

lim

T→∞ cov(T

3 2 (ˆ

ωk,T − ωk), T

1 2 (ˆ

Ak′,T − Ak′)) = 0, lim

T→∞ cov(T

3 2 (ˆ

ωk,T − ωk), T

1 2 (ˆ

BT − B)) = 0, lim

T→∞ cov(T

1 2 (ˆ

φk,T − φk), T

1 2 (ˆ

BT − B)) = 0,

where k, k′ = 1, . . . , K.

24 / 49

slide-25
SLIDE 25

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Prediction

The one-step prediction ˆ Tn+1 is defined to minimize the mean squared error, and is given by ˆ Tn+1 = E(Tn+1|Tn = tn, . . . , T1 = t1) = tn + ∞

tn

e−Λ(s)+Λ(tn)ds, n ≥ 1, where Λ(t) = t

0 λ(s)ds. And its mean squared error νn is given by

νn = E(Tn+1 − ˆ Tn+1)2 = ETn

  • 2

Tn

(s − Tn)e−Λ(s)+Λ(Tn)ds − ∞

Tn

e−Λ(s)+Λ(Tn)ds 2 , n ≥ 1.

25 / 49

slide-26
SLIDE 26

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Note: Λ(Tn) ∼ Gamma(α = n, β = 1). The calculation of the MSE νn is carried out by Monte-Carlo integration with the following two steps.

  • 1. Generate a large sample of Ψn = Λ(Tn) ∼ Gamma(α = n, β = 1). Solve

for Tn, namely, tn is the unique root of Λ(tn) − ψn = 0.

  • 2. Calculate 2

tn (s − tn)e−Λ(s)+Λ(tn)ds − [

tn e−Λ(s)+Λ(tn)ds]2, and

average over tn. We obtain νn.

26 / 49

slide-27
SLIDE 27

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Simulation and Estimation

◮ Simulate 100 independent non-homogeneous Poisson process with

intensity function λ(t) = cos(

π 4 √ 3t) + 0.5 cos( π 3 √ 2t + π 4 ) + 1.6 by

thinning.

◮ Cut off each process at T = 500. The sample size in each replicate

ranges from 717 to 866.

◮ Table: sample means and standard errors of the parameter estimates.

ω1 ω2 A1 A2 φ1 φ2 B True mean 0.45345 0.74048 1 0.5 0.78540 1.6 Sample mean 0.45374 0.74116 1.01223 0.50671

  • 0.07806

0.59168 1.60616 Asymptotic sd 0.00055 0.00111 0.08000 0.08000 0.16000 0.32000 0.05657 Sample sd 0.00052 0.00112 0.07507 0.07838 0.16499 0.33445 0.05613

The sample means are close to the true means, and the sample standard deviations are close to the asymptotic standard deviations. So are the sample covariances.

27 / 49

slide-28
SLIDE 28

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

More Simulations

◮ We consider four different periodic or almost periodic intensity

functions of non-homogeneous Poisson processes. They are Case 1: λ(t) = 1.6 + cos π 4 √ 3 t

  • + 0.5 cos

π 3 √ 2 t + π 4

  • ,

Case 2: λ(t) =

  • 3.1 + 3 cos

π 3 √ 2 t

  • ,

Case 3: λ(t) = 0.1 + 0.5Mod[t, 2π], Case 4: λ(t) = 1.3 exp

  • cos

π 3 √ 2 t + π 4

  • .

◮ We also conduct the ‘out-of-sample’ one-step-ahead prediction using

the estimated intensity function, and compare the MSE with the MSE under homogeneous Poisson process model by taking their ratio, namely 1 100

100

  • i=1

(ti

n+1 −ˆ

ti

n+1)2 1

100

100

  • i=1

(ti

n+1 −˜

ti

n+1)2.

28 / 49

slide-29
SLIDE 29

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Case 1: λ(t) = 1.6 + cos( π

4 √ 3t) + 0.5 cos( π 3 √ 2t + π 4 ).

The number of data points used for estimation in 100 replicates ranges from 717 to 866. The observation length T = 500.

10 20 30 40 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 t intensity

Figure: Solid line: true intensity function. Dashed line: estimated intensity function from one replicate.

29 / 49

slide-30
SLIDE 30

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

10 20 30 40 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 t intensity

Figure: Dark solid line: true intensity function. Light solid lines: estimated intensity function from 100 replicates.

30 / 49

slide-31
SLIDE 31

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

900 910 920 930 940 950 0.6 0.7 0.8 0.9 1.0 1.1 1.2 n ratio of MSEs

Figure: The ‘out-of-sample’ one-step-ahead prediction is carried out for the 901th to 950th data points. Solid line: ratio of MSE under our model and MSE under the homogeneous Poisson process model. The averaged reduction in MSE is 19.1%.

31 / 49

slide-32
SLIDE 32

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Case 2: λ(t) =

  • 3.1 + 3 cos( π

3 √ 2t).

The number of data points used for estimation in 100 replicates ranges from 749 to 872. The observation length T = 500.

10 20 30 40 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 t intensity

Figure: Solid line: true intensity function. Dashed line: estimated intensity function from one replicate.

32 / 49

slide-33
SLIDE 33

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

10 20 30 40 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 t intensity

Figure: Dark solid line: true intensity function. Light solid lines: estimated intensity function from 100 replicates.

33 / 49

slide-34
SLIDE 34

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

900 910 920 930 940 950 0.7 0.8 0.9 1.0 1.1 n ratio of MSEs

Figure: The ‘out-of-sample’ one-step-ahead prediction is carried out for the 901th to 950th data points. Solid line: ratio of MSE under our model and MSE under the homogeneous Poisson process model. The averaged reduction in MSE is 11.2%.

34 / 49

slide-35
SLIDE 35

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Case 3: λ(t) = 0.1 + 0.5Mod[t, 2π]. The number of data points used for estimation in 100 replicates ranges from 733 to 906. The observation length T = 500.

5 10 15 20 25 30 1 2 3 4 t intensity

Figure: Solid line: true intensity function. Dashed line: estimated intensity function from one replicate.

35 / 49

slide-36
SLIDE 36

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

5 10 15 20 25 30 1 2 3 4 t intensity

Figure: Dark solid line: true intensity function. Light solid lines: estimated intensity function from 100 replicates.

36 / 49

slide-37
SLIDE 37

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

900 910 920 930 940 950 0.75 0.80 0.85 0.90 0.95 1.00 1.05 n ratio of MSEs

Figure: The ‘out-of-sample’ one-step-ahead prediction is carried out for the 901th to 950th data points. Solid line: ratio of MSE under our model and MSE under the homogeneous Poisson process model. The averaged reduction in MSE is 9.6%.

37 / 49

slide-38
SLIDE 38

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Case 4: λ(t) = 1.3 exp{cos( π

3 √ 2t + π 4 )}.

The number of data points used for estimation in 100 replicates ranges from 733 to 906. The observation length T = 500.

10 20 30 40 1 2 3 4 t intensity

Figure: Solid line: true intensity function. Dashed line: estimated intensity function from one replicate.

38 / 49

slide-39
SLIDE 39

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

10 20 30 40 1 2 3 4 t intensity

Figure: Dark solid line: true intensity function. Light solid lines: estimated intensity function from 100 replicates.

39 / 49

slide-40
SLIDE 40

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

900 910 920 930 940 950 0.5 0.6 0.7 0.8 0.9 1.0 n ratio of MSEs

Figure: The ‘out-of-sample’ one-step-ahead prediction is carried out for the 901th to 950th data points. Solid line: ratio of MSE under our model and MSE under the homogeneous Poisson process model. The averaged reduction in MSE is 20.7%.

40 / 49

slide-41
SLIDE 41

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Computational Issues

◮ The frequency estimates ˆ

ωT have to be located accurately.

◮ If the frequency estimates are not within o(T−1) of the corresponding

true frequencies, the amplitude and phase estimates are not consistent.

◮ The usual optimization algorithms do not work in here.

0.40 0.42 0.44 0.46 0.48 0.50 0.52 5 10 15 ω periodogram

Figure: Periodogram at ω ∈ [ω1 − 10π/T, ω1 + 10π/T]. It has many local maxima and local minima.

41 / 49

slide-42
SLIDE 42

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

◮ Initially search the periodogram on a grid mesh with the mesh length

  • (T−1), such as 2πT−3/2, and determine the initial values

corresponding to the K largest ordinates subject to the minimum separation condition, and then do a more refined search in the neighborhood of the initial values.

◮ Note that the fast Fourier transform cannot be used in calculating the

periodogram of a point process because the points {tj : tj ≤ T} are not equally spaced.

42 / 49

slide-43
SLIDE 43

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Two problems:

◮ How to choose the minimum separation for a particular set of data? ◮ How to determine the neighborhood of the initial values in the more

refined search?

43 / 49

slide-44
SLIDE 44

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Problem 1

◮ The choice of minimum separation is O(T−1+β) with β > 0, we can

use O(T− 1

2 ), but it may be too large.

◮ The periodogram may go down to the noise level when it is outside

ˆ ωk,T ± 6π/T.

0.4 0.5 0.6 0.7 0.8 5 10 15 ω periodogram

[ ] [ ] 44 / 49

slide-45
SLIDE 45

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

◮ In practice, we determine ˆ

ω1,T by maximizing the periodogram, and may determine ˆ ω2,T by maximizing the periodogram outside ˆ ω1,T ± 6π/T, and so on.

◮ The minimum separation can also be determined by prior knowledge of

how far apart the frequencies are. The suggestion of 6π/T here may be the smallest choice since we assume the true frequencies to be well separated with minimum distance greater than O(T−1).

45 / 49

slide-46
SLIDE 46

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Problem 2

◮ The initial search takes on a grid mesh with mesh length o(T−1). ◮ The initial value in the search for ˆ

ωk,T must fall in [ωk − 2π/T, ωk + 2π/T].

◮ The refined search may take place in a narrower neighborhood than

[ωk − 2π/T, ωk + 2π/T]; the suggested length of the narrower neighborhood is T−5/4 log T.

46 / 49

slide-47
SLIDE 47

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Open Problems

◮ Determine the number of periodic terms K in the model. Possible

solution: model selection criterion.

◮ Relax Assumption 1 that the process is non-homogeneous Poisson

process.

◮ Construct and study the process with periodic or almost periodic

correlation.

◮ ...

47 / 49

slide-48
SLIDE 48

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Questions? Suggestions? Datasets?

48 / 49

slide-49
SLIDE 49

Introduction Assumptions and Main Results Prediction Simulation Computational Issue and Further Research

Thank You!

49 / 49