Determinantal point process models and statistical inference Fr ed - - PowerPoint PPT Presentation

determinantal point process models and statistical
SMART_READER_LITE
LIVE PREVIEW

Determinantal point process models and statistical inference Fr ed - - PowerPoint PPT Presentation

Introduction Definition Stationary models Approximation Inference Conclusion Determinantal point process models and statistical inference Fr ed eric Lavancier , Laboratoire de Math ematiques Jean Leray, Nantes (France) Joint work


slide-1
SLIDE 1

Introduction Definition Stationary models Approximation Inference Conclusion

Determinantal point process models and statistical inference

Fr´ ed´ eric Lavancier, Laboratoire de Math´ ematiques Jean Leray, Nantes (France) Joint work with Jesper Møller (Aalborg University, Danemark) and Ege Rubak (Aalborg University, Danemark). Some recent results with Christophe Biscio (Nantes).

slide-2
SLIDE 2

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-3
SLIDE 3

Introduction Definition Stationary models Approximation Inference Conclusion

Examples of point pattern datasets

  • (a) Spanish towns
  • ●●
  • (b) Kidney cells of

two types (hamster)

  • ● ●
  • ● ●
  • ● ●
  • ● ●
  • (c) Japanese pines
slide-4
SLIDE 4

Introduction Definition Stationary models Approximation Inference Conclusion

First look at DPPs

Determinantal point processes (DPPs): a class of repulsive (or regular, or inhibitive) point processes. Some examples :

  • Poisson

DPP DPP with stronger repulsion

slide-5
SLIDE 5

Introduction Definition Stationary models Approximation Inference Conclusion

First look at DPPs

Determinantal point processes (DPPs): a class of repulsive (or regular, or inhibitive) point processes. Some examples :

  • Poisson

DPP DPP with stronger repulsion Statistical motivation : Do DPPs constitute a tractable and flexible class of models for repulsive point processes?

slide-6
SLIDE 6

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

Gibbs point processes: The usual class when modelling repulsiveness (e.g. Strauss model, Area interaction model).

slide-7
SLIDE 7

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

Gibbs point processes: The usual class when modelling repulsiveness (e.g. Strauss model, Area interaction model). In general:

moments are not expressible in closed form; likelihoods involve intractable normalizing constants; elaborate McMC methods are needed for simulations and

approximate likelihood inference;

for infinite Gibbs point processes defined on Rd, ‘things’

become rather complicated (existence and uniqueness)

slide-8
SLIDE 8

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

DPPs possess a number of appealing properties:

slide-9
SLIDE 9

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

DPPs possess a number of appealing properties: (a) simple conditions for existence;

slide-10
SLIDE 10

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

DPPs possess a number of appealing properties: (a) simple conditions for existence; (b) all orders of moments are known;

slide-11
SLIDE 11

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

DPPs possess a number of appealing properties: (a) simple conditions for existence; (b) all orders of moments are known; (c) the density of the DPP restricted to any compact set S ⊂ Rd is expressible on closed form;

slide-12
SLIDE 12

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

DPPs possess a number of appealing properties: (a) simple conditions for existence; (b) all orders of moments are known; (c) the density of the DPP restricted to any compact set S ⊂ Rd is expressible on closed form; (d) the DPP on any compact set can easily be simulated;

slide-13
SLIDE 13

Introduction Definition Stationary models Approximation Inference Conclusion

Gibbs point processes vs DPPs

DPPs possess a number of appealing properties: (a) simple conditions for existence; (b) all orders of moments are known; (c) the density of the DPP restricted to any compact set S ⊂ Rd is expressible on closed form; (d) the DPP on any compact set can easily be simulated; (e) parametric models are available, and inference can be done by MLEs or using the moments.

slide-14
SLIDE 14

Introduction Definition Stationary models Approximation Inference Conclusion

Background

slide-15
SLIDE 15

Introduction Definition Stationary models Approximation Inference Conclusion

Background

DPPs were introduced in their general form by O. Macchi

in 1975 to model fermions in quantum mechanics.

Particular cases include the law of the eigenvalues of

certain random matrices (Gaussian Unitary Ensemble, Ginibre Ensemble,...)

Most theoretical studies have been published in the 2000’s. Recent growing interest for statistical applications (in

machine learning, telecommunications, forestry, biology,...)

slide-16
SLIDE 16

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-17
SLIDE 17

Introduction Definition Stationary models Approximation Inference Conclusion

Notation

X : spatial point process on Rd

slide-18
SLIDE 18

Introduction Definition Stationary models Approximation Inference Conclusion

Notation

X : spatial point process on Rd For any borel set B ⊆ Rd, XB = X ∩ B.

slide-19
SLIDE 19

Introduction Definition Stationary models Approximation Inference Conclusion

Notation

X : spatial point process on Rd For any borel set B ⊆ Rd, XB = X ∩ B. For any integer n > 0, denote ρ(n) the n’th order product density

function of X. Intuitively, ρ(n)(x1, . . . , xn) dx1 · · · dxn is the probability that for each i = 1, . . . , n, X has a point in a region around xi of volume dxi.

slide-20
SLIDE 20

Introduction Definition Stationary models Approximation Inference Conclusion

Notation

X : spatial point process on Rd For any borel set B ⊆ Rd, XB = X ∩ B. For any integer n > 0, denote ρ(n) the n’th order product density

function of X. Intuitively, ρ(n)(x1, . . . , xn) dx1 · · · dxn is the probability that for each i = 1, . . . , n, X has a point in a region around xi of volume dxi. In particular ρ = ρ(1) is the intensity function.

slide-21
SLIDE 21

Introduction Definition Stationary models Approximation Inference Conclusion

Definition of a determinantal point process

Let C be a function from Rd × Rd → C. Denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex : [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .
slide-22
SLIDE 22

Introduction Definition Stationary models Approximation Inference Conclusion

Definition of a determinantal point process

Let C be a function from Rd × Rd → C. Denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex : [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .

Definition X is a determinantal point process with kernel C, denoted X ∼ DPP(C), if its product density functions satisfy ρ(n)(x1, . . . , xn) = det[C](x1, . . . , xn), n = 1, 2, . . .

slide-23
SLIDE 23

Introduction Definition Stationary models Approximation Inference Conclusion

Definition of a determinantal point process

Let C be a function from Rd × Rd → C. Denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex : [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .

Definition X is a determinantal point process with kernel C, denoted X ∼ DPP(C), if its product density functions satisfy ρ(n)(x1, . . . , xn) = det[C](x1, . . . , xn), n = 1, 2, . . . The Poisson process with intensity ρ(x) is the special case where C(x, x) = ρ(x) and C(x, y) = 0 if x = y.

slide-24
SLIDE 24

Introduction Definition Stationary models Approximation Inference Conclusion

Definition of a determinantal point process

Let C be a function from Rd × Rd → C. Denote [C](x1, . . . , xn) the n × n matrix with entries C(xi, xj). Ex : [C](x1) = C(x1, x1) [C](x1, x2) = C(x1, x1) C(x1, x2) C(x2, x1) C(x2, x2)

  • .

Definition X is a determinantal point process with kernel C, denoted X ∼ DPP(C), if its product density functions satisfy ρ(n)(x1, . . . , xn) = det[C](x1, . . . , xn), n = 1, 2, . . . The Poisson process with intensity ρ(x) is the special case where C(x, x) = ρ(x) and C(x, y) = 0 if x = y. Some conditions on C are necessary for existence (see later), at least C must satisfy : for all x1, . . . , xn, det[C](x1, . . . , xn) ≥ 0.

slide-25
SLIDE 25

Introduction Definition Stationary models Approximation Inference Conclusion

First properties (if X ∼ DPP(C) exists)

slide-26
SLIDE 26

Introduction Definition Stationary models Approximation Inference Conclusion

First properties (if X ∼ DPP(C) exists)

The intensity of X is ρ(x) = C(x, x).

slide-27
SLIDE 27

Introduction Definition Stationary models Approximation Inference Conclusion

First properties (if X ∼ DPP(C) exists)

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = det[C](x, y) C(x, x)C(y, y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness) if C is Hermitian.

slide-28
SLIDE 28

Introduction Definition Stationary models Approximation Inference Conclusion

First properties (if X ∼ DPP(C) exists)

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = det[C](x, y) C(x, x)C(y, y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness) if C is Hermitian. If X ∼ DPP(C), then XB ∼ DPP(CB)

slide-29
SLIDE 29

Introduction Definition Stationary models Approximation Inference Conclusion

First properties (if X ∼ DPP(C) exists)

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = det[C](x, y) C(x, x)C(y, y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness) if C is Hermitian. If X ∼ DPP(C), then XB ∼ DPP(CB) Any smooth transformation or independent thinning of a DPP is

still a DPP with explicit given kernel.

slide-30
SLIDE 30

Introduction Definition Stationary models Approximation Inference Conclusion

First properties (if X ∼ DPP(C) exists)

The intensity of X is ρ(x) = C(x, x). The pair correlation function is

g(x, y) := ρ(2)(x, y) ρ(x)ρ(y) = det[C](x, y) C(x, x)C(y, y) = 1 − C(x, y)C(y, x) C(x, x)C(y, y)

Thus g ≤ 1 (i.e. repulsiveness) if C is Hermitian. If X ∼ DPP(C), then XB ∼ DPP(CB) Any smooth transformation or independent thinning of a DPP is

still a DPP with explicit given kernel.

Given a kernel C, there exists at most one DPP(C).

slide-31
SLIDE 31

Introduction Definition Stationary models Approximation Inference Conclusion

Existence

C must be non-negative definite. Henceforth assume (C1) C is a continuous complex covariance function.

slide-32
SLIDE 32

Introduction Definition Stationary models Approximation Inference Conclusion

Existence

C must be non-negative definite. Henceforth assume (C1) C is a continuous complex covariance function. By Mercer’s theorem, for any compact set S ⊂ Rd, C restricted to S × S, denoted CS, has a spectral representation, CS(x, y) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, where λS

k ≥ 0 and

  • S φS

k (x)φS l (x) dx = 1{k=l}.

slide-33
SLIDE 33

Introduction Definition Stationary models Approximation Inference Conclusion

Existence

C must be non-negative definite. Henceforth assume (C1) C is a continuous complex covariance function. By Mercer’s theorem, for any compact set S ⊂ Rd, C restricted to S × S, denoted CS, has a spectral representation, CS(x, y) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, where λS

k ≥ 0 and

  • S φS

k (x)φS l (x) dx = 1{k=l}.

Theorem (Macchi, 1975) Under (C1), existence of DPP(C) is equivalent to : λS

k ≤ 1 for all compact S ⊂ Rd and all k.

slide-34
SLIDE 34

Introduction Definition Stationary models Approximation Inference Conclusion

Density on a compact set S

Let X ∼ DPP(C) and S ⊂ Rd be any compact set. Recall that CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

Theorem (Macchi (1975)) If λS

k < 1 ∀k, then XS ≪ Poisson(S, 1), with density

f({x1, . . . , xn}) = exp(|S| − D) det[ ˜ C](x1, . . . , xn), where D = − ∞

k=1 log(1 − λS k ) and ˜

C : S × S → C is given by ˜ C(x, y) =

  • k=1

λS

k

1 − λS

k

φS

k (x)φS k (y)

slide-35
SLIDE 35

Introduction Definition Stationary models Approximation Inference Conclusion

Simulation

Let X ∼ DPP(C). We want to simulate XS for S ⊂ Rd compact. Recall that XS ∼ DPP(CS) with CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

slide-36
SLIDE 36

Introduction Definition Stationary models Approximation Inference Conclusion

Simulation

Let X ∼ DPP(C). We want to simulate XS for S ⊂ Rd compact. Recall that XS ∼ DPP(CS) with CS(x, y) = ∞

k=1 λS k φS k (x)φS k (y).

Theorem (Hough et al. (2006)) For k ∈ N, let Bk be independent Bernoulli r.v. with mean λS

k .

Define K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S. Then DPP(CS) d = DPP(K).

slide-37
SLIDE 37

Introduction Definition Stationary models Approximation Inference Conclusion

So simulating XS is equivalent to simulate DPP(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S.

slide-38
SLIDE 38

Introduction Definition Stationary models Approximation Inference Conclusion

So simulating XS is equivalent to simulate DPP(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S.

1 Simulate M := sup{k ≥ 0 : Bk = 0} (by the inversion

method).

slide-39
SLIDE 39

Introduction Definition Stationary models Approximation Inference Conclusion

So simulating XS is equivalent to simulate DPP(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S.

1 Simulate M := sup{k ≥ 0 : Bk = 0} (by the inversion

method).

2 Given M = m, generate B1, . . . , Bm−1

slide-40
SLIDE 40

Introduction Definition Stationary models Approximation Inference Conclusion

So simulating XS is equivalent to simulate DPP(K) with K(x, y) =

  • k=1

BkφS

k (x)φS k (y),

(x, y) ∈ S × S.

1 Simulate M := sup{k ≥ 0 : Bk = 0} (by the inversion

method).

2 Given M = m, generate B1, . . . , Bm−1 3 simulate DPP(K) given B1, . . . , BM and M = m.

The kernel K becomes w.l.o.g. K(x, y) =

n

  • k=1

φS

k (x)φS k (y)

and DPP(K) is a determinantal projection process.

slide-41
SLIDE 41

Introduction Definition Stationary models Approximation Inference Conclusion

Simulation of a determinantal projection process: K(x, y) =

n

  • k=1

φS

k (x)φS k (y) = v(y)∗v(x),

v(x) = (φS

1 (x), . . . , φS n(x))T .

DPP(K) has a.s. n points (X1, . . . , Xn) that can be generated by the following Gram-Schmidt procedure sample Xn from the density pn(x) = v(x)2/n; set e1 = v(Xn)/v(Xn); for i = (n − 1) to 1 do sample Xi from the density (given Xi+1, . . . , Xn) : pi(x) = 1 i  v(x)2 −

n−i

  • j=1

|e∗

jv(x)|2

  , x ∈ S set wi = v(Xi) − n−i

j=1

  • e∗

jv(Xi)

  • ej, en−i+1 = wi/wi
slide-42
SLIDE 42

Introduction Definition Stationary models Approximation Inference Conclusion

Summary

Therefore, given a kernel C :

condition for existence of DPP(C) are known* all moments of DPP(C) are explicitly known the density of DPP(C) on any compact set is known* DPP(C) can be easily and quickly simulated on any

compact set* * if the spectral representation of CS is known on any S (see later for an approximation).

slide-43
SLIDE 43

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-44
SLIDE 44

Introduction Definition Stationary models Approximation Inference Conclusion

Stationary models

Consider a stationary kernel : C(x, y) = C0(x − y), x, y ∈ Rd.

slide-45
SLIDE 45

Introduction Definition Stationary models Approximation Inference Conclusion

Stationary models

Consider a stationary kernel : C(x, y) = C0(x − y), x, y ∈ Rd. Recall (C1): C0 is a continuous covariance function. Moreover, if C0 ∈ L2(Rd) we can define its Fourier transform ϕ(x) =

  • C0(t)e−2πix·t dt,

x ∈ Rd.

slide-46
SLIDE 46

Introduction Definition Stationary models Approximation Inference Conclusion

Stationary models

Consider a stationary kernel : C(x, y) = C0(x − y), x, y ∈ Rd. Recall (C1): C0 is a continuous covariance function. Moreover, if C0 ∈ L2(Rd) we can define its Fourier transform ϕ(x) =

  • C0(t)e−2πix·t dt,

x ∈ Rd. Theorem Under (C1), if C0 ∈ L2(Rd), then existence of DPP(C0) is equivalent to ϕ ≤ 1.

slide-47
SLIDE 47

Introduction Definition Stationary models Approximation Inference Conclusion

Stationary models

Consider a stationary kernel : C(x, y) = C0(x − y), x, y ∈ Rd. Recall (C1): C0 is a continuous covariance function. Moreover, if C0 ∈ L2(Rd) we can define its Fourier transform ϕ(x) =

  • C0(t)e−2πix·t dt,

x ∈ Rd. Theorem Under (C1), if C0 ∈ L2(Rd), then existence of DPP(C0) is equivalent to ϕ ≤ 1. To construct parametric families of DPP : Consider parametric families of C0 and rescale so that ϕ ≤ 1. → This will induce restriction on the parameter space.

slide-48
SLIDE 48

Introduction Definition Stationary models Approximation Inference Conclusion

Several parametric families of covariance functions are available, with closed form expressions for their Fourier transform.

slide-49
SLIDE 49

Introduction Definition Stationary models Approximation Inference Conclusion

Several parametric families of covariance functions are available, with closed form expressions for their Fourier transform.

For d = 2, the circular covariance function with range α is given by

C0(x) = ρ 2 π

  • arccos(x/α) − x/α
  • 1 − (x/α)2
  • 1x<α.

DPP(C0) exists iff ϕ ≤ 1 ⇔ ρα2 ≤ 4/π. ⇒ Tradeoff between the intensity ρ and the range of repulsion α.

slide-50
SLIDE 50

Introduction Definition Stationary models Approximation Inference Conclusion

Several parametric families of covariance functions are available, with closed form expressions for their Fourier transform.

For d = 2, the circular covariance function with range α is given by

C0(x) = ρ 2 π

  • arccos(x/α) − x/α
  • 1 − (x/α)2
  • 1x<α.

DPP(C0) exists iff ϕ ≤ 1 ⇔ ρα2 ≤ 4/π. ⇒ Tradeoff between the intensity ρ and the range of repulsion α.

Whittle-Mat´

ern (includes Exponential and Gaussian) : C0(x) = ρ 21−ν Γ(ν) x/ανKν(x/α), x ∈ Rd, DPP(C0) exists iff ρ ≤

Γ(ν) Γ(ν+d/2)(2√πα)d

slide-51
SLIDE 51

Introduction Definition Stationary models Approximation Inference Conclusion

Several parametric families of covariance functions are available, with closed form expressions for their Fourier transform.

For d = 2, the circular covariance function with range α is given by

C0(x) = ρ 2 π

  • arccos(x/α) − x/α
  • 1 − (x/α)2
  • 1x<α.

DPP(C0) exists iff ϕ ≤ 1 ⇔ ρα2 ≤ 4/π. ⇒ Tradeoff between the intensity ρ and the range of repulsion α.

Whittle-Mat´

ern (includes Exponential and Gaussian) : C0(x) = ρ 21−ν Γ(ν) x/ανKν(x/α), x ∈ Rd, DPP(C0) exists iff ρ ≤

Γ(ν) Γ(ν+d/2)(2√πα)d Generalized Cauchy

C0(x) = ρ (1 + x/α2)ν+d/2 , x ∈ Rd. DPP(C0) exists iff ρ ≤

Γ(ν+d/2) Γ(ν)(√πα)d

slide-52
SLIDE 52

Introduction Definition Stationary models Approximation Inference Conclusion

Pair correlation functions of DPP(C0) for previous models when ρ = 1 and α = αmax(ν): In blue : C0 is the circular covariance function. In red : C0 is Whittle-Mat´ ern, for different values of ν In green : C0 is generalized Cauchy, for different values of ν

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 r = |x − y| g(r)

slide-53
SLIDE 53

Introduction Definition Stationary models Approximation Inference Conclusion

Are the previous parametric families ”rich enough”? How repulsive a stationary DPP can be?

  • Less repulsive DPP = Poisson Point Process.
  • What is the most repulsive DPP?

We introduce criteria of repulsiveness based on the pair

correlation function g.

slide-54
SLIDE 54

Introduction Definition Stationary models Approximation Inference Conclusion

How repulsive a stationary DPP can be?

slide-55
SLIDE 55

Introduction Definition Stationary models Approximation Inference Conclusion

How repulsive a stationary DPP can be?

Criteria of repulsiveness based on the pair correlation function.

0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 Attractive zone Repulsive zone

slide-56
SLIDE 56

Introduction Definition Stationary models Approximation Inference Conclusion

How repulsive a stationary DPP can be?

Criteria of repulsiveness based on the pair correlation function.

0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 Attractive zone Repulsive zone

Let X and Y be two DPPs with the same intensity ρ and pcf gX, gY . Definition

  • X is globally more repulsive than Y if ”gX has a larger blue zone”,

i.e.

  • (1 − gX) ≥
  • (1 − gY ).
slide-57
SLIDE 57

Introduction Definition Stationary models Approximation Inference Conclusion

How repulsive a stationary DPP can be?

Criteria of repulsiveness based on the pair correlation function.

0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 Attractive zone Repulsive zone

Let X and Y be two DPPs with the same intensity ρ and pcf gX, gY . Definition

  • X is globally more repulsive than Y if ”gX has a larger blue zone”,

i.e.

  • (1 − gX) ≥
  • (1 − gY ).
  • X is locally more repulsive than Y if ”gX is more flat near 0”,

i.e. gX(0) = gY (0) = 0, ∇gX(0) = ∇gY (0) = 0 and ∆gX(0) ≤ ∆gY (0). Note that for a hardcore point process: g(0) = ∇g(0) = ∆g(0) = 0.

slide-58
SLIDE 58

Introduction Definition Stationary models Approximation Inference Conclusion

Let Jν be the Bessel function of the first kind with order ν. Proposition There exists a unique DPP that is both the most globally and the most locally repulsive DPP with intensity ρ. Its kernel in dimension d = 2 is given by: C0(x) = √ρ J1(2√πρ||x||)

√π||x||

and ϕ(x) = 1||x||2≤ρπ−1

slide-59
SLIDE 59

Introduction Definition Stationary models Approximation Inference Conclusion

Let Jν be the Bessel function of the first kind with order ν. Proposition There exists a unique DPP that is both the most globally and the most locally repulsive DPP with intensity ρ. Its kernel in dimension d = 2 is given by: C0(x) = √ρ J1(2√πρ||x||)

√π||x||

and ϕ(x) = 1||x||2≤ρπ−1

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 r = |x − y| g(r)

slide-60
SLIDE 60

Introduction Definition Stationary models Approximation Inference Conclusion

New parametric families of kernels

slide-61
SLIDE 61

Introduction Definition Stationary models Approximation Inference Conclusion

New parametric families of kernels

  • Laguerre-Gauss family, for n ∈ N and 0 < α ≤ αmax(d, ρ, n)

C0(x) ∝ L

d 2

n−1

1 n

  • x

α

  • 2

e− 1

n x α 2

ϕ(x) ∝ e−n(παx)2 n−1

  • k=0

(π√nαx)2k k! − → covers all possible degrees of repulsiveness.

0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 r = |x − y| g(r)

slide-62
SLIDE 62

Introduction Definition Stationary models Approximation Inference Conclusion

  • Generalized-Sinc family, for σ ≥ 0, 0 < α ≤ αmax(d, ρ, σ)

C0(x) ∝ J σ+d

2

  • 2 x

α

  • σ+d

2

  • 2 x

α

  • σ+d

2

σ+d

2

ϕ(x) ∝

  • 1 − 2(παx)2

σ + d σ

2

+

− → covers all possible degrees of repulsiveness. − → ϕ is compactly supported.

0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 r = |x − y| g(r)

slide-63
SLIDE 63

Introduction Definition Stationary models Approximation Inference Conclusion

A mixing property for stationary DPPs

A point process is said to be Brillinger mixing if for any k ≥ 2, its k-th reduced factorial cumulant moment measure has a finite total variation. Theorem Let C0 ∈ L2(Rd) be a continuous covariance function such that ϕ ≤ 1, then DPP(C0) is Brillinger mixing. This mixing property is a first step towards asymptotic statistics for DPPs.

slide-64
SLIDE 64

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-65
SLIDE 65

Introduction Definition Stationary models Approximation Inference Conclusion

Approximation of stationary DPP’s models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

slide-66
SLIDE 66

Introduction Definition Stationary models Approximation Inference Conclusion

Approximation of stationary DPP’s models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

  • Consider w.l.g. the unit box S = [− 1

2, 1 2]d and the Fourier expansion

C0(y − x) =

  • k∈Zd

cke2πik·(y−x), y − x ∈ S.

slide-67
SLIDE 67

Introduction Definition Stationary models Approximation Inference Conclusion

Approximation of stationary DPP’s models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

  • Consider w.l.g. the unit box S = [− 1

2, 1 2]d and the Fourier expansion

C0(y − x) =

  • k∈Zd

cke2πik·(y−x), y − x ∈ S. The Fourier coefficients are ck =

  • S

C0(u)e−2πik·u du

slide-68
SLIDE 68

Introduction Definition Stationary models Approximation Inference Conclusion

Approximation of stationary DPP’s models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

  • Consider w.l.g. the unit box S = [− 1

2, 1 2]d and the Fourier expansion

C0(y − x) =

  • k∈Zd

cke2πik·(y−x), y − x ∈ S. The Fourier coefficients are ck =

  • S

C0(u)e−2πik·u du ≈

  • Rd C0(u)e−2πik·u du = ϕ(k)

which is a good approximation if C0(u) ≈ 0 for |u| > 1

2.

slide-69
SLIDE 69

Introduction Definition Stationary models Approximation Inference Conclusion

Approximation of stationary DPP’s models

Consider a stationary kernel C0 and X ∼ DPP(C0).

  • The simulation and the density of XS requires the expansion

CS(x, y) = C0(y − x) =

  • k=1

λS

k φS k (x)φS k (y),

(x, y) ∈ S × S, but in general λS

k and φS k are not expressible on closed form.

  • Consider w.l.g. the unit box S = [− 1

2, 1 2]d and the Fourier expansion

C0(y − x) =

  • k∈Zd

cke2πik·(y−x), y − x ∈ S. The Fourier coefficients are ck =

  • S

C0(u)e−2πik·u du ≈

  • Rd C0(u)e−2πik·u du = ϕ(k)

which is a good approximation if C0(u) ≈ 0 for |u| > 1

2.

  • Example: For the circular covariance, this is true whenever ρ|S| > 5.
slide-70
SLIDE 70

Introduction Definition Stationary models Approximation Inference Conclusion

Approximation of stationary models

So, DPP(C0) on S can be approximated by DPP(Capp,0) with Capp,0(y − x) =

  • k∈Zd

ϕ(k)e2πi(y−x)·k, x, y ∈ S, where ϕ is the Fourier transform of C0. This kernel approximation allows us

to simulate DPP(C0) on S, by simulating DPP(Capp,0) to compute the (approximated) density of DPP(C0) on S.

Turns out to be a very good approximation in most cases.

slide-71
SLIDE 71

Introduction Definition Stationary models Approximation Inference Conclusion

Exemples of approximations

− Solid lines : theoretical pair correlation function

  • Circles : pair correlation from the approximated kernel

0.00 0.01 0.02 0.03 0.04 0.05 0.0 0.2 0.4 0.6 0.8 1.0

  • ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ● ●
  • ● ●
  • ● ● ● ● ● ● ● ●

ν = 0.25, ρmax = 324 ν = 0.50, ρmax = 337 ν = 1.00, ρmax = 329 ν = 2.00, ρmax = 315 ν = ∞, ρmax = 293

0.00 0.01 0.02 0.03 0.04 0.05 0.0 0.2 0.4 0.6 0.8 1.0

  • ● ● ● ● ● ● ● ● ● ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ● ●
  • ● ● ● ● ● ● ● ● ● ●
  • ● ●
  • ● ● ● ● ● ● ● ●

ν = 0.50, ρmax = 232 ν = 1.00, ρmax = 275 ν = 2.00, ρmax = 294 ν = 4.00, ρmax = 298 ν = ∞, ρmax = 293

Whittle-Mat´ ern Generalized Cauchy

slide-72
SLIDE 72

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-73
SLIDE 73

Introduction Definition Stationary models Approximation Inference Conclusion

Consider a stationary and isotropic parametric DPP(C0), with C0(x − y) = ρRψ(x − y), where Rψ(0) = 1 and ψ is some parameter

slide-74
SLIDE 74

Introduction Definition Stationary models Approximation Inference Conclusion

Consider a stationary and isotropic parametric DPP(C0), with C0(x − y) = ρRψ(x − y), where Rψ(0) = 1 and ψ is some parameter First and second moments are easily deduced:

ρ = intensity. Pair correlation function:

g(x, y) = 1 − Rψ(x − y)2.

slide-75
SLIDE 75

Introduction Definition Stationary models Approximation Inference Conclusion

Consider a stationary and isotropic parametric DPP(C0), with C0(x − y) = ρRψ(x − y), where Rψ(0) = 1 and ψ is some parameter First and second moments are easily deduced:

ρ = intensity. Pair correlation function:

g(x, y) = 1 − Rψ(x − y)2.

Ripley’s K-function is given in terms of Rψ:

If e.g. d = 2, Kψ(r) :=

  • x≤r
  • 1 − Rψ(x)2

dx = πr2−2π r tRψ(t)2 dt.

slide-76
SLIDE 76

Introduction Definition Stationary models Approximation Inference Conclusion

Parameter estimation

When C0(x − y) = ρRψ(x − y) estimate

1 using the kernel approximation for the likelihood

(ρ, ψ) by MLE

  • r ρ by #{obs. points}/[area of obs. window]

and ψ by MLE;

2 or using the moments

ρ by #{obs. points}/[area of obs. window] and ψ by minimum contrast estimation e.g. ˆ ψ = argminψ rmax

  • K(r) −
  • Kψ(r)
  • 2

dr → Consistent and asymptotically normal

slide-77
SLIDE 77

Introduction Definition Stationary models Approximation Inference Conclusion

Non homogeneous case

A point process is second-order intensity-reweighted stationary if

its intensity ρ(x) is not constant its pcf is invariant by translation : g(x, y) = g0(x − y)

For a DPP, this means that the kernel can be written: C(x, y) =

  • ρ(x)Rθ(x − y)
  • ρ(y)

where ρ(x) = C(x, x) and Rθ(0) = 1.

slide-78
SLIDE 78

Introduction Definition Stationary models Approximation Inference Conclusion

Non homogeneous case

A point process is second-order intensity-reweighted stationary if

its intensity ρ(x) is not constant its pcf is invariant by translation : g(x, y) = g0(x − y)

For a DPP, this means that the kernel can be written: C(x, y) =

  • ρ(x)Rθ(x − y)
  • ρ(y)

where ρ(x) = C(x, x) and Rθ(0) = 1. Existence : if ρ(x) ≤ ρmax and if DPP(ρmaxRθ(x − y)) is well-defined, then DPP(C) is an independent thinning with retention probability ρ(x)/ρmax, so it is well defined.

slide-79
SLIDE 79

Introduction Definition Stationary models Approximation Inference Conclusion

Non homogeneous case

A point process is second-order intensity-reweighted stationary if

its intensity ρ(x) is not constant its pcf is invariant by translation : g(x, y) = g0(x − y)

For a DPP, this means that the kernel can be written: C(x, y) =

  • ρ(x)Rθ(x − y)
  • ρ(y)

where ρ(x) = C(x, x) and Rθ(0) = 1. Existence : if ρ(x) ≤ ρmax and if DPP(ρmaxRθ(x − y)) is well-defined, then DPP(C) is an independent thinning with retention probability ρ(x)/ρmax, so it is well defined. Inference : 1- Fit a parametric model to ρ(x) (e.g. depending on covariables), by Poisson likelihood. 2- Estimate θ by minimum contrast method based e.g. on gθ(x, y) = 1 − Rθ(x − y)2 or on Kθ(r).

slide-80
SLIDE 80

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-81
SLIDE 81

Introduction Definition Stationary models Approximation Inference Conclusion

Spanish Towns dataset

Ripley (1988) fitted a 4 parameters Strauss hardcore model. We fit a DPP(ρRα,ν) model with 3 parameters:

  • ρ : intensity
  • Rα,ν : Whittle-Mat´

ern correlation function (2 parameters)

  • 2

4 6 8 10 −1.5 −1.0 −0.5 0.0

Strauss Matern Data

1 2 3 0.0 0.2 0.4 0.6 0.8

Strauss Matern Data

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 0.0 0.2 0.4 0.6 0.8

Strauss Matern Data

0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 1 2 3 4 5 6

Strauss Matern Data

slide-82
SLIDE 82

Introduction Definition Stationary models Approximation Inference Conclusion

Hamster cells dataset

  • ●●
  • X1 (o) : 266 dividing (living) cells

X2 (+) : 77 pyknotic (dying) cells. Are pyknotic cells a random labelling? If so and if X1 ∪ X2 is a DPP, then X1 and X2 are DPPs with the same kernel, up to the intensities. We fit DPP(ρRα), where Rα : Gaussian kernel, to

  • X1 ∪ X2 : ˆ

α = 0.0181

  • X1 : ˆ

α1 = 0.0188

  • X2 : ˆ

α2 = 0.0082 We test H0 : α = α1 = α2 by simulations based on the statistic T = |ˆ α − ˆ α1||ˆ α − ˆ α2|. The p-value is 0.55 and the assumption of random labelling is not rejected.

slide-83
SLIDE 83

Introduction Definition Stationary models Approximation Inference Conclusion

Japanese pines dataset

A non homogeneous DPP(

  • ρ(x)R(x − y)
  • ρ(y)) is fitted.
  • log ρ(x) is a cubic polynomial in x, estimated by Poisson likelihood.
  • R is a Gaussian kernel, estimated by contrast method based on g.

1 2 3 4

  • ● ●
  • ● ●●
  • ● ● ●
  • ● ●●● ●
  • ● ●
  • 0.0

0.2 0.4 0.6 0.8 1.0 0.0 0.4 0.8 1.2 r ginhom(r) Empirical Gaussian

0.0 0.5 1.0 1.5 2.0 2.5 −0.15 −0.10 −0.05 0.00 0.05 0.10 0.15 0.20

r Linhom(r) − r Pointwise Rank Data

0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.2 0.4 0.6 0.8 1.0

r Ginhom(r) Pointwise Rank Data

0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.2 0.4 0.6 0.8 1.0

r Finhom(r) Pointwise Rank Data

slide-84
SLIDE 84

Introduction Definition Stationary models Approximation Inference Conclusion

1 Introduction 2 Definition, existence and basic properties 3 Parametric stationary models 4 Approximation and modelling of the eigen expansion 5 Inference 6 Examples of real data 7 Conclusion and references

slide-85
SLIDE 85

Introduction Definition Stationary models Approximation Inference Conclusion

Conclusion

  • DPPs provide flexible parametric models of repulsive point

processes.

slide-86
SLIDE 86

Introduction Definition Stationary models Approximation Inference Conclusion

Conclusion

  • DPPs provide flexible parametric models of repulsive point

processes.

  • DPPs possess appealing properties:

Easily and quickly simulated Closed form expressions for all moments. Closed form expression for the density on any bounded set. Parametric models are available Inference is feasible, including likelihood inference.

  • Parametric models, inference methods and so on are available

in R (developed by E. Rubak) in the package spatstat.

slide-87
SLIDE 87

Introduction Definition Stationary models Approximation Inference Conclusion

References

Macchi, O. (1975). The coincidence approach to stochastic point processes. Advances in Applied Probability 7, 83–122. Soshnikov, A. (2000). Determinantal random point fields. Russian Math. Surveys 55, 923–975. Shirai, T. and Y. Takahashi (2003). Random point fields associated with certain Fredholm determinants. I. Fermion, Poisson and boson point processes. Journal of Functional Analysis 2, 414–463. Hough, J. B., M. Krishnapur, Y. Peres, and B. Vir` ag (2006). Determinantal processes and independence. Probability Surveys 3, 206–229.

  • F. Lavancier, J. Møller, and E. Rubak (2015).

Determinantal point process models and statistical inference. JRSS B 77, 853–877.

  • C. Biscio and F. Lavancier.

Quantifying repulsiveness of determinantal point processes. to appear in Bernoulli.