Stochastic Filtering by Projection The Example of the Quadratic - - PowerPoint PPT Presentation

stochastic filtering by projection
SMART_READER_LITE
LIVE PREVIEW

Stochastic Filtering by Projection The Example of the Quadratic - - PowerPoint PPT Presentation

Stochastic Filtering by Projection Stochastic Filtering by Projection The Example of the Quadratic Sensor John Armstrong (Kings College London) collaboration with Damiano Brigo (Imperial College) GSI2013 Stochastic Filtering by Projection


slide-1
SLIDE 1

Stochastic Filtering by Projection

Stochastic Filtering by Projection

The Example of the Quadratic Sensor John Armstrong (King’s College London) collaboration with Damiano Brigo (Imperial College) GSI2013

slide-2
SLIDE 2

Stochastic Filtering by Projection Stochastic Filtering

Motivation

Estimate the current state of a stochastic system from imperfect measurements

slide-3
SLIDE 3

Stochastic Filtering by Projection Stochastic Filtering

Motivation

Estimate the current state of a stochastic system from imperfect measurements

◮ Estimate the position of a car

slide-4
SLIDE 4

Stochastic Filtering by Projection Stochastic Filtering

Motivation

Estimate the current state of a stochastic system from imperfect measurements

◮ Estimate the position of a car ◮ Estimate the volatility of a stock from option prices

slide-5
SLIDE 5

Stochastic Filtering by Projection Stochastic Filtering

Motivation

Estimate the current state of a stochastic system from imperfect measurements

◮ Estimate the position of a car ◮ Estimate the volatility of a stock from option prices ◮ Applications in weather forecasting, oil extraction ...

slide-6
SLIDE 6

Stochastic Filtering by Projection Stochastic Filtering

Motivation

Estimate the current state of a stochastic system from imperfect measurements

◮ Estimate the position of a car ◮ Estimate the volatility of a stock from option prices ◮ Applications in weather forecasting, oil extraction ...

The calculation should be performed online.

slide-7
SLIDE 7

Stochastic Filtering by Projection Stochastic Filtering

Mathematical formulation

dXt = ft(Xt) dt + σt(Xt) dWt, X0, dYt = bt(Xt) dt + dVt, Y0 = 0 .

◮ Xt is a process representing the state. ◮ Yt is a process representing the measurement. ◮ Wt and Vt are independent Wiener processes.

slide-8
SLIDE 8

Stochastic Filtering by Projection Stochastic Filtering

Mathematical formulation

dXt = ft(Xt) dt + σt(Xt) dWt, X0, dYt = bt(Xt) dt + dVt, Y0 = 0 .

◮ Xt is a process representing the state. ◮ Yt is a process representing the measurement. ◮ Wt and Vt are independent Wiener processes.

Question

What is the probability distribution for Xt given the values of Yt up to time t?

slide-9
SLIDE 9

Stochastic Filtering by Projection Stochastic Filtering

The Kushner–Stratonovich equation

With sufficient regularity and bounds, one can show that the probability density pt satisfies: dpt = L∗

t ptdt + pt[bt − Ept{bt}][dYt − Ept{bt}dt] .

where:

L∗ = −ft ∂ ∂x + 1 2at ∂ ∂x2 is the backward diffusion operator

◮ aT t a = σ and a is a square root of σ. ◮ Ept denotes expectation with respect to pt.

slide-10
SLIDE 10

Stochastic Filtering by Projection Stochastic Filtering

The Kushner–Stratonovich equation

With sufficient regularity and bounds, one can show that the probability density pt satisfies: dpt = L∗

t ptdt + pt[bt − Ept{bt}][dYt − Ept{bt}dt] .

where:

L∗ = −ft ∂ ∂x + 1 2at ∂ ∂x2 is the backward diffusion operator

◮ aT t a = σ and a is a square root of σ. ◮ Ept denotes expectation with respect to pt.

Question

How can we efficiently approximate solutions to the infinite dimensional Kushner–Stratonovich equation?

slide-11
SLIDE 11

Stochastic Filtering by Projection The geometric idea

The geometric idea

◮ Choose a submanifold of the space of probability distributions

so that points in the manifold can approximate pt well.

slide-12
SLIDE 12

Stochastic Filtering by Projection The geometric idea

The geometric idea

◮ Choose a submanifold of the space of probability distributions

so that points in the manifold can approximate pt well.

◮ View the partial differential equation as defining a stochastic

vector field.

slide-13
SLIDE 13

Stochastic Filtering by Projection The geometric idea

The geometric idea

◮ Choose a submanifold of the space of probability distributions

so that points in the manifold can approximate pt well.

◮ View the partial differential equation as defining a stochastic

vector field.

◮ Use projection to restrict the vector field to the tangent space.

slide-14
SLIDE 14

Stochastic Filtering by Projection The geometric idea

The geometric idea

◮ Choose a submanifold of the space of probability distributions

so that points in the manifold can approximate pt well.

◮ View the partial differential equation as defining a stochastic

vector field.

◮ Use projection to restrict the vector field to the tangent space. ◮ Solve the resulting finite dimensional stochastic differential

equation.

slide-15
SLIDE 15

Stochastic Filtering by Projection Choosing the submanifold

The linear problem

If:

◮ the coefficient functions a, b and f in the problem are all linear ◮ p0, which represents the prior probability distribution for the

state, is a Gaussian then

slide-16
SLIDE 16

Stochastic Filtering by Projection Choosing the submanifold

The linear problem

If:

◮ the coefficient functions a, b and f in the problem are all linear ◮ p0, which represents the prior probability distribution for the

state, is a Gaussian then

◮ pt is always a Gaussian ◮ The mean and standard deviation of pt follow a finite

dimensional SDE. This is called the Kalman filter.

slide-17
SLIDE 17

Stochastic Filtering by Projection Choosing the submanifold

The linear problem

If:

◮ the coefficient functions a, b and f in the problem are all linear ◮ p0, which represents the prior probability distribution for the

state, is a Gaussian then

◮ pt is always a Gaussian ◮ The mean and standard deviation of pt follow a finite

dimensional SDE. This is called the Kalman filter. One can linearize any filtering problem at each point in time to

  • btain the Extended Kalman filter.
slide-18
SLIDE 18

Stochastic Filtering by Projection Choosing the submanifold

Two important families

For multi modal problems, project onto one of the following families:

slide-19
SLIDE 19

Stochastic Filtering by Projection Choosing the submanifold

Two important families

For multi modal problems, project onto one of the following families:

◮ A mixture of m Gaussian distributions:

pt(x) =

  • i

λie(x−µi)/2σ2

i ◮ λi ≥ 0.

i λi = 1.

◮ Gives rise to a 3m − 1 dimensional family.

slide-20
SLIDE 20

Stochastic Filtering by Projection Choosing the submanifold

Two important families

For multi modal problems, project onto one of the following families:

◮ A mixture of m Gaussian distributions:

pt(x) =

  • i

λie(x−µi)/2σ2

i ◮ λi ≥ 0.

i λi = 1.

◮ Gives rise to a 3m − 1 dimensional family.

◮ The exponential family

pt(x) = exp(a0 + a1x + a2x2 + . . . a2nx2n)

◮ a2n < 0 ◮ Gives rise to a 2n dimensional family.

slide-21
SLIDE 21

Stochastic Filtering by Projection Projecting the equations The choice of metric

Choice of metric for the projection

Need to choose a Hilbert space structure on the space of probability distributions (more precisely some enveloping space).

slide-22
SLIDE 22

Stochastic Filtering by Projection Projecting the equations The choice of metric

Choice of metric for the projection

Need to choose a Hilbert space structure on the space of probability distributions (more precisely some enveloping space).

◮ The Hellinger metric.

◮ Theoretical advantage of coordinate independence ◮ Works well with exponential families (Brigo) ◮ Meaningful for problems where density p does not exist. ◮ Requires numerical approximation of integrals to implement.

slide-23
SLIDE 23

Stochastic Filtering by Projection Projecting the equations The choice of metric

Choice of metric for the projection

Need to choose a Hilbert space structure on the space of probability distributions (more precisely some enveloping space).

◮ The Hellinger metric.

◮ Theoretical advantage of coordinate independence ◮ Works well with exponential families (Brigo) ◮ Meaningful for problems where density p does not exist. ◮ Requires numerical approximation of integrals to implement.

◮ The direct L2 metric.

◮ Works well with mixture families. ◮ All integrals that occur can be calculated analytically.

slide-24
SLIDE 24

Stochastic Filtering by Projection Projecting the equations Projecting SDE’s

Understanding stochastic differential equations

A stochastic differential equation such as: dXt = ft(Xt) dt + σt(Xt) dWt is shorthand for an integral equation such as: XT = T ft(Xt) dt + T σt(Xt) dWt where the right hand integral is defined by the Ito integral: T f (t) dWt = lim

n→∞ ∞

  • i=1

f (ti)(Wti+1 − Wti).

slide-25
SLIDE 25

Stochastic Filtering by Projection Projecting the equations Projecting SDE’s

The Stratonovich integral

◮ Take the Ito integral:

T f (t) dWt = lim

n→∞ ∞

  • i=1

f (ti)(Wti+1 − Wti). and change the point where you evaluate the integrand T f (t) ◦ dWt = lim

n→∞ ∞

  • i=1

f (ti + ti+1 2 )(Wti+1 − Wti). to get the Stratonvich integral. Hence you can define Stratonovich SDE’s.

slide-26
SLIDE 26

Stochastic Filtering by Projection Projecting the equations Projecting SDE’s

The Stratonovich integral

◮ Take the Ito integral:

T f (t) dWt = lim

n→∞ ∞

  • i=1

f (ti)(Wti+1 − Wti). and change the point where you evaluate the integrand T f (t) ◦ dWt = lim

n→∞ ∞

  • i=1

f (ti + ti+1 2 )(Wti+1 − Wti). to get the Stratonvich integral. Hence you can define Stratonovich SDE’s.

◮ The difference between the two integrals is an ordinary

  • integral. This allows you to convert between the two

formulations.

◮ Ito SDE’s model causality more naturally ◮ Stratonovich SDE’s transform like vector fields.

slide-27
SLIDE 27

Stochastic Filtering by Projection Projecting the equations Projecting SDE’s

A recipe for projecting SDE’s

To project an SDE onto a submanifold parameterized by θ = (θ1, θ2, . . . , θn):

◮ Write the SDE as an SDE with vector coefficients in

Stratonovich form.

◮ Project all the coefficients onto the tangent space. ◮ Equate both sides of the projected equations to get an SDE

for the θi.

slide-28
SLIDE 28

Stochastic Filtering by Projection Projecting the equations Projecting SDE’s

A recipe for projecting SDE’s

To project an SDE onto a submanifold parameterized by θ = (θ1, θ2, . . . , θn):

◮ Write the SDE as an SDE with vector coefficients in

Stratonovich form.

◮ Project all the coefficients onto the tangent space. ◮ Equate both sides of the projected equations to get an SDE

for the θi. Since Stratonovich SDE’s transform like vector fields, this recipe is invariant of the parameterization.

slide-29
SLIDE 29

Stochastic Filtering by Projection Projecting the equations Projecting SDE’s

The projected equations

The end result for the case of L2 projection is: dθi =

m

  • j=1

hij p(θ), Lvjdt − γ0(p(θ)), vjdt + γ1(p(θ)), vj ◦ dY

  • .

Where:

◮ The vj = ∂p ∂θj give a basis for the tangent space ◮ hij and hij are the Riemannian metric tensor vi, vj. ◮ γ0 t (p) := 1 2 [|bt|2 − Ep{|bt|2}] ◮ γ1 t (p) := [bt − Ep{bt}]p ◮ ·, · is the L2 inner product.

Note that the inner products and expectations give rise to integrals. We can compute these analytically for the normal mixture family.

slide-30
SLIDE 30

Stochastic Filtering by Projection Solving the SDE’s

Solving the finite system of SDE’s

◮ Approximate the differential equation as a difference equation

and solve numerically.

◮ This is more delicate for stochastic equations than ordinary

  • nes. See Kloeden and Platen. We use the

Stratonovich–Heun cheme.

◮ Note that the resulting difference equation will depend upon

the choice of parameterization of the submanifold. Choose coordinates φ : Rn − → M so that φ is defined on all of Rn.

slide-31
SLIDE 31

Stochastic Filtering by Projection Numerical example

The quadratic sensor

dXt = dWt dYt = X 2 + dVt .

slide-32
SLIDE 32

Stochastic Filtering by Projection Numerical example

The quadratic sensor

dXt = dWt dYt = X 2 + dVt .

◮ We do not receive any information on the sign of X. ◮ We expect that once X has hit the origin, p will be

approximately symmetrical.

◮ We expect a bimodal distribution

slide-33
SLIDE 33

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 0 Projection Exact Extended Kalman Exponential

slide-34
SLIDE 34

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 1 Projection Exact Extended Kalman Exponential

slide-35
SLIDE 35

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 2 Projection Exact Extended Kalman Exponential

slide-36
SLIDE 36

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 3 Projection Exact Extended Kalman Exponential

slide-37
SLIDE 37

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 4 Projection Exact Extended Kalman Exponential

slide-38
SLIDE 38

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 5 Projection Exact Extended Kalman Exponential

slide-39
SLIDE 39

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 6 Projection Exact Extended Kalman Exponential

slide-40
SLIDE 40

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 7 Projection Exact Extended Kalman Exponential

slide-41
SLIDE 41

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 8 Projection Exact Extended Kalman Exponential

slide-42
SLIDE 42

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 9 Projection Exact Extended Kalman Exponential

slide-43
SLIDE 43

Stochastic Filtering by Projection Numerical example

Simulation for the Quadratic Sensor

0.2 0.4 0.6 0.8 1

  • 8
  • 6
  • 4
  • 2

2 4 6 8 X Distribution at time 10 Projection Exact Extended Kalman Exponential

slide-44
SLIDE 44

Stochastic Filtering by Projection Numerical example

L2 residuals for the quadratic sensor

0.1 0.2 0.3 0.4 0.5 0.6 0.7 2 4 6 8 10 Time Residuals Projection Residual (L2 norm) Extended Kalman Residual (L2 norm) Hellinger Projection Residual (L2 norm)

slide-45
SLIDE 45

Stochastic Filtering by Projection Numerical example

L´ evy residuals for the quadratic sensor

0.02 0.04 0.06 0.08 0.1 0.12 0.14 1 2 3 4 5 6 7 8 9 10 Time ProkhorovResiduals Prokhorov Residual (L2NM) Prokhorov Residual (HE) Best possible residual (3Deltas)

slide-46
SLIDE 46

Stochastic Filtering by Projection Conclusions

Conclusions

◮ Projection methods allow us to approximate the solution to

nonlinear problems with surprising accuracy using only low dimensional manifolds.

◮ This conclusion holds for a variety of projection metrics and

manifolds.

◮ L2 projection of normal mixtures is particularly promising since

all integrals can be computed analytically.