Multivariate Extreme Value models Michel Bierlaire - - PowerPoint PPT Presentation

multivariate extreme value models
SMART_READER_LITE
LIVE PREVIEW

Multivariate Extreme Value models Michel Bierlaire - - PowerPoint PPT Presentation

Multivariate Extreme Value models Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility LAboratory Multivariate Extreme Value models p. 1/44 Logit Random utility: U in = V in + in in is i.i.d. EV (Extreme Value)


slide-1
SLIDE 1

Multivariate Extreme Value models

Michel Bierlaire

michel.bierlaire@epfl.ch

Transport and Mobility LAboratory

Multivariate Extreme Value models – p. 1/44

slide-2
SLIDE 2

Logit

  • Random utility:

Uin = Vin + εin

  • εin is i.i.d. EV (Extreme Value) distributed
  • εin is the maximum of many r.v. capturing unobservable

attributes, measurement and specification errors.

  • Key assumption: Independence

Multivariate Extreme Value models – p. 2/44

slide-3
SLIDE 3

Relax the independence assumption

  

U1n

. . .

UJn

   =   

V1n

. . .

VJn

   +   

ε1n

. . .

εJn

   that is

Un = Vn + εn

and εn is a vector of random variables. Assumption about the random term: multivariate distribution

Multivariate Extreme Value models – p. 3/44

slide-4
SLIDE 4

Relax the independence assumption

A multivariate random variable ε is represented by a density function

f(ε1, . . . , εJ)

and

P(ε ≤ x) =

x1

−∞

· · ·

xJ

−∞

f(ε)dεJ . . . dε1

where x ∈ RJ is a J × 1 vector of constants.

Multivariate Extreme Value models – p. 4/44

slide-5
SLIDE 5

Probit model

  • Multivariate normal variable N(µ, Σ)
  • µ ∈ RJ
  • Σ ∈ RJ×J, definite positive
  • Density function:

f(ε) = (2π)− J

2 |Σ|− 1 2 e− 1 2 (ε−µ)T Σ−1(ε−µ)

Multivariate Extreme Value models – p. 5/44

slide-6
SLIDE 6

Probit model

Example: trinomial model

U1 = V1 + ε1 U2 = V2 + ε2 U3 = V3 + ε3

and ε ∼ N(0, Σ). We have P(2) = P(Ui − U2 ≤ 0

i = 1, 2, 3) U1 − U2 = V1 − V2 + ε1 − ε2 U3 − U2 = V3 − V2 + ε3 − ε2

Multivariate Extreme Value models – p. 6/44

slide-7
SLIDE 7

Probit model

Matrix notation with

∆2 =

  • 1

−1 −1 1

  • ∆2U =
  • U1 − U2

U3 − U2

  • ∼ N(∆2V, ∆2Σ∆T

2 )

Multivariate Extreme Value models – p. 7/44

slide-8
SLIDE 8

Probit model

In general, we have

∆iU ∼ N(∆iV, ∆iΣ∆T

i )

and P(i) =

P(∆iU ≤ 0) =

−∞

· · ·

−∞

f(∆iε)d(∆iε)1 . . . d(∆iε)J−1

with

f(∆iε) = (2π)− J

2 |∆iΣ∆T

i |− 1

2 e− 1 2 (∆iε−∆iV )T (∆iΣ∆T i )−1(∆iε−∆iV )

Multivariate Extreme Value models – p. 8/44

slide-9
SLIDE 9

Probit model

  • The integral of the density function has no closed form
  • In high dimensions, numerical integration is computationally

infeasible

  • Therefore, the probit model with more than 5 alternatives is

very difficult to use in practice

Multivariate Extreme Value models – p. 9/44

slide-10
SLIDE 10

Relax the independence assumption

Assume that εn is a multivariate random variable with

  • CDF: Fεn(ξ1, . . . , ξJn)
  • pdf: fεn(ξ1, . . . , ξJn) =

∂JnF ∂ξ1···∂ξJn (ξ1, . . . , ξJn).

  • The choice probability is

Pn(1) = Pr(V2n + ε2n ≤ V1n + ε1n, . . . , VJn + εJn ≤ V1n + ε1n),

  • r

Pn(1) = Pr(ε2n − ε1n ≤ V1n − V2n, . . . , εJn − ε1n ≤ V1n − VJn).

Multivariate Extreme Value models – p. 10/44

slide-11
SLIDE 11

Relax the independence assumption

Change of variables:

ξ1n = ε1n, ξin = εin − ε1n, i = 2, . . . , Jn,

that is        

ξ1n ξ2n

. . .

ξ(Jn−1)n ξJnn

       

=

       

1 · · · −1 1 · · ·

. . .

−1 · · · 1 −1 · · · 1

               

ε1n ε2n

. . .

ε(Jn−1)n εJnn

       

.

and

Pn(1) = Pr(ξ2n ≤ V1n − V2n, . . . , ξJnn ≤ V1n − VJnn).

Multivariate Extreme Value models – p. 11/44

slide-12
SLIDE 12

Relax the independence assumption

Pn(1) = Pr(ξ2n ≤ V1n − V2n, . . . , ξJnn ≤ V1n − VJnn).

  • Only Jn − 1 inequalities.
  • ξ1n can take any value.
  • Choice probability = CDF of (ξ2n, . . . , ξJnn) evaluated at

(V1n − V2n, . . . , V1n − VJnn). Pn(1) = Fξ1n,ξ2n,...,ξJn (+∞, V1n − V2n, . . . , V1n − VJnn) =

+∞

ξ1=−∞

V1n−V2n

ξ2=−∞

· · ·

V1n−VJnn

ξJn=−∞

fξ1n,ξ2n,...,ξJn (ξ1, ξ2, . . . , ξJn)dξ,

Multivariate Extreme Value models – p. 12/44

slide-13
SLIDE 13

Relax the independence assumption

Pn(1)

+∞

ξ1=−∞

V1n−V2n

ξ2=−∞

· · ·

V1n−VJnn

ξJn=−∞

fξ1n,ξ2n,...,ξJn (ξ1, ξ2, . . . , ξJn)dξ.

  • Change of variables: determinant 1.
  • pdf of (ξ1n, . . . , ξJnn) = pdf of (ε1n, . . . , εJnn)

Pn(1) =

+∞

ε1=−∞

V1n−V2n+ε1

ε2=−∞

· · ·

V1n−VJnn+ε1

εJn=−∞

fε1n,ε2n,...,εJn (ε1, ε2, . . . , εJn)dε.

  • r

Pn(1) =

+∞

ε1=−∞

∂Fε1n,ε2n,...,εJn ∂ε1 (ε1, V1n−V2n+ε1, . . . , V1n−VJnn+ε1)dε1.

Multivariate Extreme Value models – p. 13/44

slide-14
SLIDE 14

Multivariate Extreme Value model

  • εn = (ε1n, . . . , εJn) follows a multivariate extreme value

distribution if it has the CDF:

Fεn(ε1n, . . . , εJn) = e−G(e−ε1n,...,e−εJn),

where G : RJn

+ → R+ is a positive function with positive

arguments.

  • To be a valid CDF, it must verify the following properties.
  • (i) the limit property

Fεn(ε1n, . . . , −∞, . . . , εJn) = 0,

  • r

G(y1n, . . . , +∞, . . . , yJn) = +∞.

Multivariate Extreme Value models – p. 14/44

slide-15
SLIDE 15

Multivariate Extreme Value model

  • (ii) the zero property

Fεn(+∞, . . . , +∞) = 1.

  • r

G(0, . . . , 0) = 0.

  • (iii) the strong alternating sign property:
  • Any partial derivative of Fεn defines a density function of a

marginal distribution.

  • To be a valid density function, it has to be non negative.
  • For any set of

Jn ≤ Jn distinct indices i1, . . . , i

Jn,

JnFεn

∂εi1n · · · ∂εi

Jnn

(ε1n, . . . , εJnn) ≥ 0.

Multivariate Extreme Value models – p. 15/44

slide-16
SLIDE 16

Multivariate Extreme Value model

Fεn(ε1n, . . . , εJn) = e−G(e−ε1n,...,e−εJn),

  • (iii) the strong alternating sign property (ctd).
  • The right-hand side changes sign each time it is

differentiated.

  • To obtain a non negative sign, G must also change sign

each time it is differentiated.

  • For any set of

Jn distinct indices i1, . . . , i

Jn,

(−1)

Jn−1Gi1,...,i

Jn ≥ 0.

Multivariate Extreme Value models – p. 16/44

slide-17
SLIDE 17

Multivariate Extreme Value model

We need another property: homogeneity.

  • A function G is homogeneous of degree µ, or µ-homogeneous,

if

G(αy) = αµG(y), ∀α > 0 and y ∈ RJ

+.

  • It will imply two results:
  • the marginals are univariate extreme value distributions,
  • the choice model has a closed form.

Multivariate Extreme Value models – p. 17/44

slide-18
SLIDE 18

Multivariate Extreme Value model

  • ith marginal distribution:

Fεn(+∞, . . . , +∞, εin, +∞, . . . , +∞) = e−G(0,...,0,e−εin,0,...,0).

  • If G is µ-homogeneous, we have

G(0, . . . , 0, e−εin, 0, . . . , 0) = e−µεinG(0, . . . , 0, 1, 0, . . . , 0),

  • r equivalently,

G(0, . . . , 0, e−εin, 0, . . . , 0) = e−µεin+log G(0,...,0,1,0,...,0),

  • Define log G(0, . . . , 0, 1, 0, . . . , 0) = µη, so that

Fεn(+∞, . . . , +∞, εin, +∞, . . . , +∞) = exp

  • −e−µ(εin−η)

.

Multivariate Extreme Value models – p. 18/44

slide-19
SLIDE 19

Multivariate Extreme Value model

Fεn(ε1n, . . . , εJn) = e−G(e−ε1n,...,e−εJn), Fεn(+∞, . . . , +∞, εin, +∞, . . . , +∞) = exp

  • −e−µ(εin−η)

.

  • Four properties (actually, three).
  • Valid CDF.
  • Marginals: univariate extreme value distribution.
  • We have a multivariate extreme value distribution.

Multivariate Extreme Value models – p. 19/44

slide-20
SLIDE 20

MEV: choice model

Fεn(ε1n, . . . , εJn) = e−G(e−ε1n,...,e−εJn), Pn(i) =

+∞

ε=−∞

∂Fε1n,ε2n,...,εJn ∂εi (. . . , Vin−V(i−1)n+ε, ε, Vin−V(i+1)n+ε, . . .)dε.

As G is µ-homogeneous, Gi = ∂G/∂yi is µ − 1-homogeneous and

∂Fε1n,ε2n,...,εJn ∂εi (. . . , Vin − V(i−1)n + ε, ε, Vin − V(i+1)n + ε, . . .) = e−εGi(. . . , e−Vin+V(i−1)n−ε, e−ε, e−Vin+V(i+1)n−ε, . . .) exp

  • −G(. . . , e−Vin+V(i−1)n−ε, e−ε, e−Vin+V(i+1)n−ε, . . .)
  • = e−εe−(µ−1)εe−(µ−1)VinGi(. . . , eV(i−1)n, eVin, eV(i+1)n, . . .)

exp

  • −e−µεe−µVinG(. . . , eV(i−1)n, eVin, eV(i+1)n, . . .)
  • .

Multivariate Extreme Value models – p. 20/44

slide-21
SLIDE 21

MEV: choice model

We now denote

eV =

  • . . . , eV(i−1)n, eVin, eV(i+1)n, . . .
  • ,

and simplify the terms to obtain

∂Fε1n,ε2n,...,εJn ∂εi (. . . , Vin − V(i−1)n + ε, ε, Vin − V(i+1)n + ε, . . .) = e−µεe−µVineVinGi(eV ) exp

  • −e−µεe−µVinG(eV )
  • .

Therefore,

Pn(i) = e−µVineVinGi(eV )

+∞

ε=−∞

e−µε exp

  • −e−µεe−µVinG(eV )
  • dε.

Multivariate Extreme Value models – p. 21/44

slide-22
SLIDE 22

MEV: choice model

Pn(i) = e−µVineVinGi(eV )

+∞

ε=−∞

e−µε exp

  • −e−µεe−µVinG(eV )
  • dε.

Define t = − exp(−µε), so that dt = µ exp(−µε)dε:

Pn(i) = e−µVineVinGi(eV ) 1 µ

ε=−∞

exp

  • te−µVinG(eV )
  • dt,

which simplifies to

Pn(i) = eVinGi(eV ) µG(eV ) .

From Euler’s theorem:

Pn(i) = eVin+log Gi(eV )

  • j eVjn+log Gj(eV ) .

Multivariate Extreme Value models – p. 22/44

slide-23
SLIDE 23

MEV: choice model

The multivariate extreme value model:

Pn(i) = eVin+log Gi(eV )

  • j eVjn+log Gj(eV ) .

where Gi = ∂G/∂yi, and G verifies

  • (i) the limit property: G(y1n, . . . , +∞, . . . , yJn) = +∞.
  • (ii) the strong alternating sign property: for any set of

Jn distinct

indices i1, . . . , i

Jn,

(−1)

Jn−1Gi1,...,i

Jn ≥ 0.

  • (iii) the homogeneity property:

G(αy) = αµG(y), ∀α > 0 and y ∈ RJ

+.

Multivariate Extreme Value models – p. 23/44

slide-24
SLIDE 24

MEV vs GEV

  • McFadden introduces the General Extreme Value model (GEV)
  • In statistics, a Generalized Extreme Value distribution

(Jenkinson, 1955) is a univariate distribution with CDF

FX(x) =

      

e−(1+ξ((x−µ)/σ))−1/ξ) −∞ < x ≤ µ − σ/ξ for ξ < 0 µ − σ/ξ ≤ x < ∞ for ξ > 0 e−e−(x−µ)/σ −∞ < x < ∞ for ξ = 0

  • ξ = 0

Type 1 EV distribution

ξ > 0

Type 2 EV distribution

ξ < 0

Type 3 EV distribution

Multivariate Extreme Value models – p. 24/44

slide-25
SLIDE 25

MEV models

Example: G(y) = J

i=1 yµ i

  • 1. G(αy) =

J

  • i=1

(αyi)µ = αµ

J

  • i=1

i = αµG(y)

2.

lim

yi→+∞ G(y) = +∞, i = 1, . . . , J

  • 3. ∂G

∂yi = µyµ−1

i

and

∂2G ∂yi∂yj = 0 G complies with the theory

Multivariate Extreme Value models – p. 25/44

slide-26
SLIDE 26

MEV models

Example: G(y) = J

i=1 yµ i

F(ε1, . . . , εJ) = e−G(e−ε1,...,e−εJ ) = e− J

i=1 e−µεi

=

J

i=1 e−e−µεi

Product of i.i.d EV Logit Model

Multivariate Extreme Value models – p. 26/44

slide-27
SLIDE 27

MEV models

Example: G(eV1, . . . , eVJ) = J

i=1 eµVi

P(i) = eVi+ln Gi(eV1,...,eVJ )

  • j∈C eVj+ln Gj(eV1,...,eVJ ) with Gi(x) = µxµ−1

i

eVi+ln Gi(eV1,...,eVJ ) = eVi+ln µ+(µ−1) ln eVi = eln µ+µVi P(i) = eln µ+µVi

  • j∈C eln µ+µVj =

eµVi

  • j∈C eµVj

Multivariate Extreme Value models – p. 27/44

slide-28
SLIDE 28

MEV models

Example: G(eV1, . . . , eVJ) = J

i=1 eµVi

VC =

1 µ

  • ln G(eV1, . . . , eVJ) + γ
  • =

1 µ ln J

  • i=1

eµVi + γ µ

Remember the NL formulation?

Multivariate Extreme Value models – p. 28/44

slide-29
SLIDE 29

MEV models

Example: Nested logit

G(y) =

M

  • m=1

Jm

  • i=1

yµm

i

  • µ

µm

  • 1. G(αy) =

M

  • m=1

Jm

  • i=1

(αyi)µm

  • µ

µm

= αµ

M

  • m=1

Jm

  • i=1

yµm

i

  • µ

µm

2.

lim

yi→+∞ G(y) = +∞, i = 1, . . . , J

Multivariate Extreme Value models – p. 29/44

slide-30
SLIDE 30

MEV models

Example: G(y) = M

m=1

Jm

i=1 yµm i

  • µ

µm

3.

∂G ∂yi = µ µm µmyµm−1

i

Jm

  • i=1

yµm

i

  • µ

µm −1

≥ 0 ∂2G ∂yiyj = µµmyµm−1

i

yµm−1

j

( µ µm − 1)

Jm

  • i=1

yµm

i

  • µ

µm −2

≤ 0

Multivariate Extreme Value models – p. 30/44

slide-31
SLIDE 31

MEV models

  • The logit model is a MEV model
  • The nested logit model is also a MEV model

G(y) =

M

  • m=1

Jm

  • i=1

yµm

i

  • µ

µm

  • If

µ µm ≤ 1, then G complies with the theory

  • Are there other such models?

Multivariate Extreme Value models – p. 31/44

slide-32
SLIDE 32

Cross-Nested logit model

  • MEV model with

G(y1, . . . , yJ) =

M

  • m=1
  • j

(αjm

1/µyj)µm

  • µ

µm

,

with

µ µm ≤ 1, αjm ≥ 0, and ∀j, ∃m s.t. αjm > 0

  • Generalization of the nested-logit model

Multivariate Extreme Value models – p. 32/44

slide-33
SLIDE 33

Nested Logit Model

⑦ ⑦ ⑦ ⑦ ⑦ ⑦ ⑦ ⑦

Bus Train Car Ped. Bike Public Private

❅ ❅ ❅

❅ ❅ ❅

❅ ❅ ❅ ❅ ❅ ❅ ❅

Multivariate Extreme Value models – p. 33/44

slide-34
SLIDE 34

Nested Logit Model

⑦ ⑦ ⑦ ⑦ ⑦ ⑦ ⑦ ⑦

Bus Train Car Ped. Bike Motorized Unmotorized

❅ ❅ ❅ P P P P P P P P P P P P ❅ ❅ ❅ ❅

❅ ❅ ❅ ❅ ❅ ❅ ❅

Multivariate Extreme Value models – p. 34/44

slide-35
SLIDE 35

Cross-Nested Logit Model

⑦ ⑦ ⑦ ⑦ ⑦ ⑦ ⑦ ⑦

Bus Train Car Ped. Bike Nest 1 Nest 2

❅ ❅ ❅

  • P

P P P P P P P P P P P ❅ ❅ ❅ ❅

❅ ❅ ❅ ❅ ❅ ❅ ❅

Multivariate Extreme Value models – p. 35/44

slide-36
SLIDE 36

Cross-Nested Logit Model

P(i|C) =

M

  • m=1
  • j∈C αµm/µ

jm

eµmVj

  • µ

µm

M

n=1

  • j∈C αµn/µ

jn

eµnVj

µ

µn

αµm/µ

im

eµmVi

  • j∈C αµm/µ

jm

eµmVj .

which can nicely be interpreted as

P(i|C) =

  • m

P(m|C)P(i|m).

Multivariate Extreme Value models – p. 36/44

slide-37
SLIDE 37

MEV models

  • Provide a great deal of flexibility
  • Require significant imagination
  • Require heavy proofs

Multivariate Extreme Value models – p. 37/44

slide-38
SLIDE 38

Network MEV

Daly & Bierlaire (2006) Motivations:

  • Extension of the tree representation for Nested Logit
  • Investigate new MEV models
  • Provide the proof once for all

Multivariate Extreme Value models – p. 38/44

slide-39
SLIDE 39

Network MEV

Let (V, E) be a network with link parameters α(i,j) ≥ 0 Assumptions:

  • 1. No circuit.
  • 2. One node without predecessor: root.
  • 3. J nodes without successor: alternatives.
  • 4. For each node vi, there exists at least one path from the root to

vi such that P

k=1 α(ik−1,ik) > 0.

Multivariate Extreme Value models – p. 39/44

slide-40
SLIDE 40

Network MEV

✡ ✡ ✡ ✡ ✡ ✡ ✢ ❏ ❏ ❏ ❏ ❏ ❏ ❫ ✡ ✡ ✡ ✡ ✡ ✡ ✢ ❏ ❏ ❏ ❏ ❏ ❏ ❫ ❍❍❍❍❍❍❍❍❍❍❍ ❍ ❥ ❏ ❏ ❏ ❏ ❏ ❏ ❫ ✡ ✡ ✡ ✡ ✡ ✡ ✢ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✙ ② ② ② ② ② ② v1 v2 v3 v4 v5 v0

Multivariate Extreme Value models – p. 40/44

slide-41
SLIDE 41

Network MEV

For each node vi, we define

◮ a set of indices Ii ⊆ {1, . . . , J} of Ji relevant alternatives, ◮ a homogeneous function Gi : RJi −

→ R, and

◮ a parameter µi.

Recursive definition of Ii:

  • Ii = {i} for alternatives,
  • Ii =

j∈succ(i) Ij for all other nodes.

Multivariate Extreme Value models – p. 41/44

slide-42
SLIDE 42

Network MEV

Recursive definition of Gi: For alternatives:

Gi :

R −

→ R : Gi(yi) = yµi

i

i = 1, . . . , J

For all others:

Gi :

RJi −

→ R : Gi(y) =

  • j∈succ(i)

α(i,j)Gj(y)

µi µj

Theorem If all Gj(y) are MEV generating functions, so is Gi

Multivariate Extreme Value models – p. 42/44

slide-43
SLIDE 43

Network MEV

Example: Cross-Nested Logit

G =

  • m
  • j∈C

αjmyµm

j

  • µ

µm

✡ ✡ ✡ ✡ ✡ ✡ ✢ ❏ ❏ ❏ ❏ ❏ ❏ ❫ ✡ ✡ ✡ ✡ ✡ ✡ ✢ ❏ ❏ ❏ ❏ ❏ ❏ ❫ ❍❍❍❍❍❍❍❍❍❍❍ ❍ ❥ ❏ ❏ ❏ ❏ ❏ ❏ ❫ ✡ ✡ ✡ ✡ ✡ ✡ ✢ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✙ ② ② ② ② ② ② yµ1

1

yµ2

2

yµ3

3

α51yµ5

1 + α52yµ5 2 + α53yµ5 3

  • i=4,5 α0i (αi1yµi

1 + αi2yµi 2 + αi3yµi 3 )

µ0 µi

Multivariate Extreme Value models – p. 43/44

slide-44
SLIDE 44

Summary

  • Need to relax the independence assumption
  • Probit
  • MEV family
  • CNL

Multivariate Extreme Value models – p. 44/44