Statistical aspects of determinantal point processes Jesper Mller , - - PowerPoint PPT Presentation

statistical aspects of determinantal point processes
SMART_READER_LITE
LIVE PREVIEW

Statistical aspects of determinantal point processes Jesper Mller , - - PowerPoint PPT Presentation

Introduction Definition Simulation Parametric models Inference Statistical aspects of determinantal point processes Jesper Mller , Department of Mathematical Sciences, Aalborg University. Joint work with Fr ed eric Lavancier,


slide-1
SLIDE 1

Introduction Definition Simulation Parametric models Inference

Statistical aspects of determinantal point processes

Jesper Møller, Department of Mathematical Sciences, Aalborg University. Joint work with Fr´ ed´ eric Lavancier, Laboratoire de Math´ ematiques Jean Leray, Nantes, and Ege Rubak, Department of Mathematical Sciences, Aalborg University.

May 6, 2012

slide-2
SLIDE 2

Introduction Definition Simulation Parametric models Inference

1 Introduction 2 Definition, existence and basic properties 3 Simulation 4 Parametric models 5 Inference

slide-3
SLIDE 3

Introduction Definition Simulation Parametric models Inference

Introduction

Determinantal point processes (DPP) form a class of

repulsive point processes.

They were introduced in their general form by O. Macchi

in 1975 to model fermions (i.e. particules with repulsion) in quantum mechanics.

Particular cases include the law of the eigenvalues of

certain random matrices (Gaussian Unitary Ensemble, Ginibre Ensemble,...)

Most theoretical studies have been published in the 2000’s. The statistical aspects have so far been largely unexplored.

slide-4
SLIDE 4

Introduction Definition Simulation Parametric models Inference

Examples

Poisson DPP DPP with stronger repulsion

slide-5
SLIDE 5

Introduction Definition Simulation Parametric models Inference

Statistical motivation

Do DPP’s constitute a tractable and flexible class of models for repulsive point processes?

slide-6
SLIDE 6

Introduction Definition Simulation Parametric models Inference

Statistical motivation

Do DPP’s constitute a tractable and flexible class of models for repulsive point processes? − → Answer: YES.

slide-7
SLIDE 7

Introduction Definition Simulation Parametric models Inference

Statistical motivation

Do DPP’s constitute a tractable and flexible class of models for repulsive point processes? − → Answer: YES. In fact:

DPP’s can be easily simulated. There are closed form expressions for the moments. There is a closed form expression for the density of a DPP

  • n any bounded set.

Inference is feasible, including likelihood inference.

slide-8
SLIDE 8

Introduction Definition Simulation Parametric models Inference

Statistical motivation

Do DPP’s constitute a tractable and flexible class of models for repulsive point processes? − → Answer: YES. In fact:

DPP’s can be easily simulated. There are closed form expressions for the moments. There is a closed form expression for the density of a DPP

  • n any bounded set.

Inference is feasible, including likelihood inference.

These properties are unusual for Gibbs point processes which are commonly used to model inhibition (e.g. the Strauss process).

slide-9
SLIDE 9

Introduction Definition Simulation Parametric models Inference

1 Introduction 2 Definition, existence and basic properties 3 Simulation 4 Parametric models 5 Inference

slide-10
SLIDE 10

Introduction Definition Simulation Parametric models Inference

Notation

We view a spatial point process X on Rd as a random

locally finite subset of Rd.

slide-11
SLIDE 11

Introduction Definition Simulation Parametric models Inference

Notation

We view a spatial point process X on Rd as a random

locally finite subset of Rd.

For any borel set B ⊆ Rd, XB = X ∩ B.

slide-12
SLIDE 12

Introduction Definition Simulation Parametric models Inference

Notation

We view a spatial point process X on Rd as a random

locally finite subset of Rd.

For any borel set B ⊆ Rd, XB = X ∩ B. For any integer n > 0, denote ρ(n) the n’th order product

density function of X. Intuitively, ρ(n)(x1, . . . , xn) dx1 · · · dxn is the probability that for each i = 1, . . . , n, X has a point in a region around xi of volume dxi.

slide-13
SLIDE 13

Introduction Definition Simulation Parametric models Inference

Notation

We view a spatial point process X on Rd as a random

locally finite subset of Rd.

For any borel set B ⊆ Rd, XB = X ∩ B. For any integer n > 0, denote ρ(n) the n’th order product

density function of X. Intuitively, ρ(n)(x1, . . . , xn) dx1 · · · dxn is the probability that for each i = 1, . . . , n, X has a point in a region around xi of volume dxi. In particular ρ = ρ(1) is the intensity function.

slide-14
SLIDE 14

Introduction Definition Simulation Parametric models Inference

Definition of a determinantal point process

For any function C : Rd × Rd → C, denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex.: [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .
slide-15
SLIDE 15

Introduction Definition Simulation Parametric models Inference

Definition of a determinantal point process

For any function C : Rd × Rd → C, denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex.: [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .

Definition X is a determinantal point process with kernel C, denoted X ∼ DPP(C), if its product density functions satisfy ρ(n)(x1, . . . , xn) = det[C](x1, . . . , xn), n = 1, 2, . . .

slide-16
SLIDE 16

Introduction Definition Simulation Parametric models Inference

Definition of a determinantal point process

For any function C : Rd × Rd → C, denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex.: [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .

Definition X is a determinantal point process with kernel C, denoted X ∼ DPP(C), if its product density functions satisfy ρ(n)(x1, . . . , xn) = det[C](x1, . . . , xn), n = 1, 2, . . . The Poisson process with intensity ρ(x) is the special case where C(x, x) = ρ(x) and C(x, y) = 0 if x = y.

slide-17
SLIDE 17

Introduction Definition Simulation Parametric models Inference

Definition of a determinantal point process

For any function C : Rd × Rd → C, denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex.: [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .

Definition X is a determinantal point process with kernel C, denoted X ∼ DPP(C), if its product density functions satisfy ρ(n)(x1, . . . , xn) = det[C](x1, . . . , xn), n = 1, 2, . . . The Poisson process with intensity ρ(x) is the special case where C(x, x) = ρ(x) and C(x, y) = 0 if x = y. For existence, conditions on the kernel C are mandatory, e.g. C must satisfy: for all x1, . . . , xn, det[C](x1, . . . , xn) ≥ 0.

slide-18
SLIDE 18

Introduction Definition Simulation Parametric models Inference

First properties

From the definition, if C is continuous,

ρ(n)(x1, . . . , xn) ≈ 0 whenever xi ≈ xj for some i = j, = ⇒ the points of X repel each other.

slide-19
SLIDE 19

Introduction Definition Simulation Parametric models Inference

First properties

From the definition, if C is continuous,

ρ(n)(x1, . . . , xn) ≈ 0 whenever xi ≈ xj for some i = j, = ⇒ the points of X repel each other.

The intensity of X is ρ(x) = C(x, x).

slide-20
SLIDE 20

Introduction Definition Simulation Parametric models Inference

First properties

From the definition, if C is continuous,

ρ(n)(x1, . . . , xn) ≈ 0 whenever xi ≈ xj for some i = j, = ⇒ the points of X repel each other.

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness).

slide-21
SLIDE 21

Introduction Definition Simulation Parametric models Inference

First properties

From the definition, if C is continuous,

ρ(n)(x1, . . . , xn) ≈ 0 whenever xi ≈ xj for some i = j, = ⇒ the points of X repel each other.

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness). If X ∼ DPP(C), then XB ∼ DPPB(CB)

slide-22
SLIDE 22

Introduction Definition Simulation Parametric models Inference

First properties

From the definition, if C is continuous,

ρ(n)(x1, . . . , xn) ≈ 0 whenever xi ≈ xj for some i = j, = ⇒ the points of X repel each other.

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness). If X ∼ DPP(C), then XB ∼ DPPB(CB) Any smooth transformation or independent thinning of a

DPP is still a DPP with explicit given kernel.

slide-23
SLIDE 23

Introduction Definition Simulation Parametric models Inference

First properties

From the definition, if C is continuous,

ρ(n)(x1, . . . , xn) ≈ 0 whenever xi ≈ xj for some i = j, = ⇒ the points of X repel each other.

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness). If X ∼ DPP(C), then XB ∼ DPPB(CB) Any smooth transformation or independent thinning of a

DPP is still a DPP with explicit given kernel.

There exists at most one DPP(C).

slide-24
SLIDE 24

Introduction Definition Simulation Parametric models Inference

Existence

In all that follows we assume (C1) C is a continuous complex covariance function.

slide-25
SLIDE 25

Introduction Definition Simulation Parametric models Inference

Existence

In all that follows we assume (C1) C is a continuous complex covariance function. By Mercer’s theorem, for any compact set S ⊂ Rd, C restricted to S × S, denoted CS, has a spectral representation, CS(x, y) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, where λS

k ≥ 0 and

  • S φS

k (x)φS l (x) dx = 1{k=l}.

slide-26
SLIDE 26

Introduction Definition Simulation Parametric models Inference

Existence

In all that follows we assume (C1) C is a continuous complex covariance function. By Mercer’s theorem, for any compact set S ⊂ Rd, C restricted to S × S, denoted CS, has a spectral representation, CS(x, y) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, where λS

k ≥ 0 and

  • S φS

k (x)φS l (x) dx = 1{k=l}.

Theorem (Macchi, 1975; Hough et al., 2009; our paper) Under (C1), existence of DPP(C) is equivalent to that λS

k ≤ 1 for all compact S ⊂ Rd and all k.

slide-27
SLIDE 27

Introduction Definition Simulation Parametric models Inference

Density on a compact set S

Let XS ∼ DPPS(CS) with S ⊂ Rd compact. Recall that CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

slide-28
SLIDE 28

Introduction Definition Simulation Parametric models Inference

Density on a compact set S

Let XS ∼ DPPS(CS) with S ⊂ Rd compact. Recall that CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

Theorem (Macchi, 1975) Assuming λS

k < 1, for all k, then XS is absolutely continuous

with respect to the homogeneous Poisson process on S with unit intensity, and has density f({x1, . . . , xn}) = exp(|S| − D) det[ ˜ C](x1, . . . , xn), where D = − ∞

k=1 log(1 − λS k ) and ˜

C : S × S → C is given by ˜ C(x, y) =

  • k=1

λS

k

1 − λS

k

φS

k (x)φS k (y)

slide-29
SLIDE 29

Introduction Definition Simulation Parametric models Inference

1 Introduction 2 Definition, existence and basic properties 3 Simulation 4 Parametric models 5 Inference

slide-30
SLIDE 30

Introduction Definition Simulation Parametric models Inference

Let XS ∼ DPPS(CS) where S ⊂ Rd is compact. We want to simulate XS. Recall that CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

slide-31
SLIDE 31

Introduction Definition Simulation Parametric models Inference

Let XS ∼ DPPS(CS) where S ⊂ Rd is compact. We want to simulate XS. Recall that CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

Theorem (Hough et al., 2006) For k ∈ N, let Bk be independent Bernoulli r.v.’s with means λS

k . Define

K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Then DPPS(CS) d = DPPS(K).

slide-32
SLIDE 32

Introduction Definition Simulation Parametric models Inference

So simulating XS is equivalent to simulate DPPS(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S.

slide-33
SLIDE 33

Introduction Definition Simulation Parametric models Inference

So simulating XS is equivalent to simulate DPPS(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Let M = max{k ≥ 0; Bk = 0}. Note that M is a.s. finite, since λS

k =

  • S C(x, x) dx < ∞.
slide-34
SLIDE 34

Introduction Definition Simulation Parametric models Inference

So simulating XS is equivalent to simulate DPPS(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Let M = max{k ≥ 0; Bk = 0}. Note that M is a.s. finite, since λS

k =

  • S C(x, x) dx < ∞.

1 simulate a realization M = m (by the inversion method);

slide-35
SLIDE 35

Introduction Definition Simulation Parametric models Inference

So simulating XS is equivalent to simulate DPPS(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Let M = max{k ≥ 0; Bk = 0}. Note that M is a.s. finite, since λS

k =

  • S C(x, x) dx < ∞.

1 simulate a realization M = m (by the inversion method); 2 generate the Bernoulli variables B1, . . . , Bm−1 (these are

independent of {M = n});

slide-36
SLIDE 36

Introduction Definition Simulation Parametric models Inference

So simulating XS is equivalent to simulate DPPS(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Let M = max{k ≥ 0; Bk = 0}. Note that M is a.s. finite, since λS

k =

  • S C(x, x) dx < ∞.

1 simulate a realization M = m (by the inversion method); 2 generate the Bernoulli variables B1, . . . , Bm−1 (these are

independent of {M = n});

3 simulate the point process DPPS(K) given B1, . . . , BM and

M = m.

slide-37
SLIDE 37

Introduction Definition Simulation Parametric models Inference

So simulating XS is equivalent to simulate DPPS(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Let M = max{k ≥ 0; Bk = 0}. Note that M is a.s. finite, since λS

k =

  • S C(x, x) dx < ∞.

1 simulate a realization M = m (by the inversion method); 2 generate the Bernoulli variables B1, . . . , Bm−1 (these are

independent of {M = n});

3 simulate the point process DPPS(K) given B1, . . . , BM and

M = m. In step 3, the kernel K becomes a projection kernel, and w.l.o.g. K(x, y) =

n

  • k=1

φS

k (x)φS k (y)

where n = #{1 ≤ k ≤ M : Bk = 1}.

slide-38
SLIDE 38

Introduction Definition Simulation Parametric models Inference

Simulation of determinantal projection processes

Denoting v(x) = (φS

1 (x), . . . , φS n(x))T , we have

K(x, y) =

n

  • k=1

φS

k (x)φS k (y) = v(y)∗v(x)

slide-39
SLIDE 39

Introduction Definition Simulation Parametric models Inference

Simulation of determinantal projection processes

Denoting v(x) = (φS

1 (x), . . . , φS n(x))T , we have

K(x, y) =

n

  • k=1

φS

k (x)φS k (y) = v(y)∗v(x)

The point process DPPS(K) has a.s. n points (X1, . . . , Xn) that can be simulated by the following Gram-Schmidt procedure:

slide-40
SLIDE 40

Introduction Definition Simulation Parametric models Inference

Simulation of determinantal projection processes

Denoting v(x) = (φS

1 (x), . . . , φS n(x))T , we have

K(x, y) =

n

  • k=1

φS

k (x)φS k (y) = v(y)∗v(x)

The point process DPPS(K) has a.s. n points (X1, . . . , Xn) that can be simulated by the following Gram-Schmidt procedure: sample Xn from the distribution with density pn(x) = v(x)2/n. set e1 = v(Xn)/v(Xn). for i = (n − 1) to 1 do sample Xi from the distribution (given Xi+1, . . . , Xn) : pi(x) = 1 i  v(x)2 −

n−i

  • j=1

|e∗

jv(x)|2

  , x ∈ S set wi = v(Xi) − n−i

j=1

  • e∗

jv(Xi)

  • ej, en−i+1 = wi/wi
slide-41
SLIDE 41

Introduction Definition Simulation Parametric models Inference

Simulation of determinantal projection processes

Denoting v(x) = (φS

1 (x), . . . , φS n(x))T , we have

K(x, y) =

n

  • k=1

φS

k (x)φS k (y) = v(y)∗v(x)

The point process DPPS(K) has a.s. n points (X1, . . . , Xn) that can be simulated by the following Gram-Schmidt procedure: sample Xn from the distribution with density pn(x) = v(x)2/n. set e1 = v(Xn)/v(Xn). for i = (n − 1) to 1 do sample Xi from the distribution (given Xi+1, . . . , Xn) : pi(x) = 1 i  v(x)2 −

n−i

  • j=1

|e∗

jv(x)|2

  , x ∈ S set wi = v(Xi) − n−i

j=1

  • e∗

jv(Xi)

  • ej, en−i+1 = wi/wi

Theorem {X1, . . . , Xn} generated as above has distribution DPPS(K).

slide-42
SLIDE 42

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

Example: Let S = [−1/2, 1/2]2 and φk(x) = e2πik·x, k ∈ Z2, x ∈ S, for a set of indices k1, . . . , kn in Z2. So the projection kernel writes K(x, y) =

n

  • j=1

e2πikj·(x−y) and XS ∼ DPPS(K) is homogeneous and has a.s. n points.

slide-43
SLIDE 43

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

Step 1. The first point is sampled uniformly on S

slide-44
SLIDE 44

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

Step 1. The first point is sampled uniformly on S Step 2. The next point is sampled w.r.t. the following density :

0.2 0.4 0.6 0.8 1

slide-45
SLIDE 45

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

Step 3. The next point is sampled w.r.t. the following density :

0.2 0.4 0.6 0.8 1

slide-46
SLIDE 46

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

etc.

0.2 0.4 0.6 0.8 1

slide-47
SLIDE 47

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

etc.

0.2 0.4 0.6 0.8 1

slide-48
SLIDE 48

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

etc.

0.1 0.2 0.3 0.4 0.5 0.6

slide-49
SLIDE 49

Introduction Definition Simulation Parametric models Inference

Illustration of simulation algorithm

etc.

0.05 0.1 0.15

slide-50
SLIDE 50

Introduction Definition Simulation Parametric models Inference

1 Introduction 2 Definition, existence and basic properties 3 Simulation 4 Parametric models 5 Inference

slide-51
SLIDE 51

Introduction Definition Simulation Parametric models Inference

Stationary models

We focus on a kernel C of the form C(x, y) = C0(x − y), x, y ∈ Rd. (C1) C0 is a continuous covariance function Moreover, if C0 ∈ L2(Rd) we can define its Fourier transform ϕ(x) =

  • C0(t)e−2πix·t dt,

x ∈ Rd.

slide-52
SLIDE 52

Introduction Definition Simulation Parametric models Inference

Stationary models

We focus on a kernel C of the form C(x, y) = C0(x − y), x, y ∈ Rd. (C1) C0 is a continuous covariance function Moreover, if C0 ∈ L2(Rd) we can define its Fourier transform ϕ(x) =

  • C0(t)e−2πix·t dt,

x ∈ Rd. Theorem Under (C1), if C0 ∈ L2(Rd), then existence of DPP(C0) is equivalent to ϕ ≤ 1.

slide-53
SLIDE 53

Introduction Definition Simulation Parametric models Inference

Stationary models

We focus on a kernel C of the form C(x, y) = C0(x − y), x, y ∈ Rd. (C1) C0 is a continuous covariance function Moreover, if C0 ∈ L2(Rd) we can define its Fourier transform ϕ(x) =

  • C0(t)e−2πix·t dt,

x ∈ Rd. Theorem Under (C1), if C0 ∈ L2(Rd), then existence of DPP(C0) is equivalent to ϕ ≤ 1. To construct parametric families of DPP : Consider parametric families of C0 and rescale so that ϕ ≤ 1. → This will induce a bound on the parameter space.

slide-54
SLIDE 54

Introduction Definition Simulation Parametric models Inference

Several parametric families of covariance function are available, with closed form expressions for their Fourier transform.

For d = 2, the circular covariance function with range α is given by

C0(x) = ρ 2 π

  • arccos(x/α) − x/α
  • 1 − (x/α)2
  • 1x<α.

DPP(C0) exists iff ϕ ≤ 1 ⇔ ρα2 ≤ 4/π. ⇒ Tradeoff between the intensity ρ and the range of repulsion α.

slide-55
SLIDE 55

Introduction Definition Simulation Parametric models Inference

Several parametric families of covariance function are available, with closed form expressions for their Fourier transform.

For d = 2, the circular covariance function with range α is given by

C0(x) = ρ 2 π

  • arccos(x/α) − x/α
  • 1 − (x/α)2
  • 1x<α.

DPP(C0) exists iff ϕ ≤ 1 ⇔ ρα2 ≤ 4/π. ⇒ Tradeoff between the intensity ρ and the range of repulsion α.

Whittle-Mat´

ern (includes Exponential and Gaussian) : C0(x) = ρ 21−ν Γ(ν) x/ανKν(x/α), x ∈ Rd. DPP(C0) exists iff ρ ≤

Γ(ν) Γ(ν+d/2)(2√πα)d .

slide-56
SLIDE 56

Introduction Definition Simulation Parametric models Inference

Several parametric families of covariance function are available, with closed form expressions for their Fourier transform.

For d = 2, the circular covariance function with range α is given by

C0(x) = ρ 2 π

  • arccos(x/α) − x/α
  • 1 − (x/α)2
  • 1x<α.

DPP(C0) exists iff ϕ ≤ 1 ⇔ ρα2 ≤ 4/π. ⇒ Tradeoff between the intensity ρ and the range of repulsion α.

Whittle-Mat´

ern (includes Exponential and Gaussian) : C0(x) = ρ 21−ν Γ(ν) x/ανKν(x/α), x ∈ Rd. DPP(C0) exists iff ρ ≤

Γ(ν) Γ(ν+d/2)(2√πα)d . Generalized Cauchy

C0(x) = ρ (1 + x/α2)ν+d/2 , x ∈ Rd. DPP(C0) exists iff ρ ≤

Γ(ν+d/2) Γ(ν)(√πα)d .

slide-57
SLIDE 57

Introduction Definition Simulation Parametric models Inference

Pair correlation functions of DPP(C0) for previous models : In blue : C0 is the circular covariance function. In red : C0 is Whittle-Mat´ ern, for different values of ν In green : C0 is generalized Cauchy, for different values of ν The parameter α is chosen such that the range of corr. ≈ 1.

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 r = |x − y| g(r)

slide-58
SLIDE 58

Introduction Definition Simulation Parametric models Inference

Spectral approach

Specify a parametric class of integrable functions

ϕθ : Rd → [0, 1] (spectral densities).

This is all we need for having a well-defined DDP. Is convenient for simulation and for (approximate) density

calculations as seen later.

slide-59
SLIDE 59

Introduction Definition Simulation Parametric models Inference

Spectral approach

Specify a parametric class of integrable functions

ϕθ : Rd → [0, 1] (spectral densities).

This is all we need for having a well-defined DDP. Is convenient for simulation and for (approximate) density

calculations as seen later.

Example: power exponential spectral model:

ϕρ,ν,α(x) = ρΓ(d/2 + 1)ναd dπd/2Γ(d/ν) exp (−αxν) with ρ > 0, ν > 0, 0 < α < αmax(ρ, ν) :=

  • 2πd/2Γ(d/ν + 1)

ρΓ(d/2) 1/d .

slide-60
SLIDE 60

Introduction Definition Simulation Parametric models Inference

Power exponential spectral model: (isotropic) spectral densities and pair correlation functions

5 10 15 0.0 0.2 0.4 0.6 0.8 1.0

ν = 1 ν = 2 (Gauss) ν = 3 ν = 5 ν = 10 ν = ∞

0.00 0.05 0.10 0.15 0.20 0.0 0.2 0.4 0.6 0.8 1.0

ν = 1 ν = 2 (Gauss) ν = 3 ν = 5 ν = 10 ν = ∞

Spectral densities Pair correlation functions

slide-61
SLIDE 61

Introduction Definition Simulation Parametric models Inference

Approximation of stationary models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

slide-62
SLIDE 62

Introduction Definition Simulation Parametric models Inference

Approximation of stationary models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

  • Consider the unit box S = [− 1

2, 1 2]d and the Fourier expansion

C0(y − x) =

  • k∈Zd

cke2πik·(y−x), y − x ∈ S. The Fourier coefficients are ck =

  • S

C0(u)e−2πik·u du ≈

  • Rd C0(u)e−2πik·u du = ϕ(k)

which is a good approximation if C0(u) ≈ 0 for |u| > 1

2.

slide-63
SLIDE 63

Introduction Definition Simulation Parametric models Inference

Approximation of stationary models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

  • Consider the unit box S = [− 1

2, 1 2]d and the Fourier expansion

C0(y − x) =

  • k∈Zd

cke2πik·(y−x), y − x ∈ S. The Fourier coefficients are ck =

  • S

C0(u)e−2πik·u du ≈

  • Rd C0(u)e−2πik·u du = ϕ(k)

which is a good approximation if C0(u) ≈ 0 for |u| > 1

2.

  • Example: For the circular covariance, this is true whenever ρ > 5.
slide-64
SLIDE 64

Introduction Definition Simulation Parametric models Inference

Approximation of stationary models

The approximation of DPP(C0) on S is then DPPS(Capp,0) with Capp,0(x − y) =

  • k∈Zd

ϕ(k)e2πi(x−y)·k, x, y ∈ S, where ϕ is the Fourier transform of C0.

slide-65
SLIDE 65

Introduction Definition Simulation Parametric models Inference

Approximation of stationary models

The approximation of DPP(C0) on S is then DPPS(Capp,0) with Capp,0(x − y) =

  • k∈Zd

ϕ(k)e2πi(x−y)·k, x, y ∈ S, where ϕ is the Fourier transform of C0. This approximation allows us

to simulate DPP(C0) on S; to compute the (approximated) density of DPP(C0) on S.

slide-66
SLIDE 66

Introduction Definition Simulation Parametric models Inference

1 Introduction 2 Definition, existence and basic properties 3 Simulation 4 Parametric models 5 Inference

slide-67
SLIDE 67

Introduction Definition Simulation Parametric models Inference

Consider a stationary and isotropic parametric DPP(C), i.e. C(x, y) = C0(x − y) = ρRα(x − y), with Rα(0) = 1. The first and second moments are easily deduced:

The intensity is ρ. The pair correlation function is

g(x, y) = g0(x − y) = 1 − R2

α(x − y).

Ripley’s K-function is easily expressible in terms of Rα: if

d = 2, Kα(r) := 2π r tg0(t) dt = πr2 − 2π r t|Rα(t)|2 dt.

slide-68
SLIDE 68

Introduction Definition Simulation Parametric models Inference

Inference

Parameter estimation can be conducted as follows.

1 Estimate ρ by #{obs. points}/area of obs. window. 2 Estimate α

either by minimum contrast estimator (MCE): ˆ α = argminα rmax

  • K(r) −
  • Kα(r)
  • 2

dr

  • r by maximum likelihood estimator: given ˆ

ρ, the likelihood is deduced from the kernel approximation.

slide-69
SLIDE 69

Introduction Definition Simulation Parametric models Inference

Two model examples

Exponential model with ρ = 200

and α = 0.014: C0(x) = ρ exp(−x/α).

Gaussian model with

ρ = 200 and α = 0.02: C0(x) = ρ exp(−x/α2).

0.00 0.02 0.04 0.06 0.08 0.10 0.0 0.2 0.4 0.6 0.8 1.0

  • Exponential model

Gaussian model

− Solid lines: theoretical pair correlation function

  • Circles: pair correlation from the approximated kernel
slide-70
SLIDE 70

Introduction Definition Simulation Parametric models Inference

Samples from the Gaussian model on [0, 1]2:

  • Samples from the exponential model on [0, 1]2:
slide-71
SLIDE 71

Introduction Definition Simulation Parametric models Inference

Estimation of α from 200 realisations

  • MCE

MLE 0.005 0.015 0.025 0.035 MCE MLE 0.000 0.010 0.020 0.030

Gaussian model Exponential model

slide-72
SLIDE 72

Introduction Definition Simulation Parametric models Inference

Example: 134 Norwegian pine trees observed in a 56 × 38 m region

  • Møller and Waagpetersen (2004): a five parameter multiscale

process is fitted using elaborate MCMC MLE methods. Here we fit a more parsimonious DPP models.

slide-73
SLIDE 73

Introduction Definition Simulation Parametric models Inference

First,

Whittle-Mat´

ern model;

Cauchy model; Gaussian model: the best fit, but plots of summary

statistics indicate a lack of fit.

slide-74
SLIDE 74

Introduction Definition Simulation Parametric models Inference

First,

Whittle-Mat´

ern model;

Cauchy model; Gaussian model: the best fit, but plots of summary

statistics indicate a lack of fit. Second,

power exponential spectral model: provides a much better

fit, with ˆ ν = 10, ˆ α = 6.36 ≈ αmax = 6.77 i.e. close to the “most repulsive possible stationary DPP”.

slide-75
SLIDE 75

Introduction Definition Simulation Parametric models Inference

2 4 6 8 −1.2 −0.8 −0.4 0.0

Gauss Power exp. Data

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 0.0 0.2 0.4 0.6 0.8

Gauss Power exp. Data

0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.2 0.4 0.6 0.8

Gauss Power exp. Data

0.0 0.5 1.0 1.5 2.0 2.5 1 2 3 4

Gauss Power exp. Data

Clockwise from top left: L(r) − r; G(r); F(r); J(r). Simulated 2.5% and 97.5% envelopes are based on 4000 realizations of the fitted Gaussian model resp. power exponential spectral model.

slide-76
SLIDE 76

Introduction Definition Simulation Parametric models Inference

Conclusions

  • DPP’s provide flexible parametric models of repulsive point

processes.

slide-77
SLIDE 77

Introduction Definition Simulation Parametric models Inference

Conclusions

  • DPP’s provide flexible parametric models of repulsive point

processes.

  • DPP’s possess the following appealing properties:

Easily simulated. Closed form expressions for the moments. Closed form expression for the density of a DPP on any

bounded set.

Inference is feasible, including likelihood inference.

slide-78
SLIDE 78

Introduction Definition Simulation Parametric models Inference

Conclusions

  • DPP’s provide flexible parametric models of repulsive point

processes.

  • DPP’s possess the following appealing properties:

Easily simulated. Closed form expressions for the moments. Closed form expression for the density of a DPP on any

bounded set.

Inference is feasible, including likelihood inference.

⇒ Promising alternative to repulsive Gibbs point processes.

slide-79
SLIDE 79

Introduction Definition Simulation Parametric models Inference

References

Hough, J. B., M. Krishnapur, Y. Peres, and B. Vir` ag (2006). Determinantal processes and independence. Probability Surveys 3, 206–229. Macchi, O. (1975). The coincidence approach to stochastic point processes. Advances in Applied Probability 7, 83–122. McCullagh, P. and J. Møller (2006). The permanental process. Advances in Applied Probability 38, 873–888. Scardicchio, A., C. Zachary, and S. Torquato (2009). Statistical properties of determinantal point processes in high-dimensional Euclidean spaces. Physical Review E 79(4). Shirai, T. and Y. Takahashi (2003). Random point fields associated with certain Fredholm determinants. I. Fermion, Poisson and boson point processes. Journal of Functional Analysis 2, 414–463. Soshnikov, A. (2000). Determinantal random point fields. Russian Math. Surveys 55, 923–975.