Particle Methods for Rare Event Monte Carlo Paul Dupuis Division of - - PowerPoint PPT Presentation

particle methods for rare event monte carlo
SMART_READER_LITE
LIVE PREVIEW

Particle Methods for Rare Event Monte Carlo Paul Dupuis Division of - - PowerPoint PPT Presentation

Particle Methods for Rare Event Monte Carlo Paul Dupuis Division of Applied Mathematics Brown University (Thomas Dean (Oxford)) ASEAS, Arlington, VA March, 2009 Paul Dupuis (Brown University) March, 2009 Particle methods for rare event


slide-1
SLIDE 1

Particle Methods for Rare Event Monte Carlo

Paul Dupuis

Division of Applied Mathematics Brown University (Thomas Dean (Oxford)) ASEAS, Arlington, VA

March, 2009

Paul Dupuis (Brown University) March, 2009

slide-2
SLIDE 2

Particle methods for rare event Monte Carlo

The use of branching processes to estimate small probabilities

Paul Dupuis (Brown University) March, 2009

slide-3
SLIDE 3

Particle methods for rare event Monte Carlo

The use of branching processes to estimate small probabilities Summary:

Paul Dupuis (Brown University) March, 2009

slide-4
SLIDE 4

Particle methods for rare event Monte Carlo

The use of branching processes to estimate small probabilities Summary: The design of such schemes was (until recently) poorly understood.

Paul Dupuis (Brown University) March, 2009

slide-5
SLIDE 5

Particle methods for rare event Monte Carlo

The use of branching processes to estimate small probabilities Summary: The design of such schemes was (until recently) poorly understood. Design should be based on subsolutions to an associated HJB equation.

Paul Dupuis (Brown University) March, 2009

slide-6
SLIDE 6

Particle methods for rare event Monte Carlo

The use of branching processes to estimate small probabilities Summary: The design of such schemes was (until recently) poorly understood. Design should be based on subsolutions to an associated HJB equation. Obtain necessary and su¢cient conditions for asymptotically optimal performance.

Paul Dupuis (Brown University) March, 2009

slide-7
SLIDE 7

Example: A tandem queue with server slow-down (Ethernet control)

Paul Dupuis (Brown University) March, 2009

slide-8
SLIDE 8

Example: A tandem queue with server slow-down (Ethernet control)

Paul Dupuis (Brown University) March, 2009

slide-9
SLIDE 9

Example: A tandem queue with server slow-down (Ethernet control)

pn = P fQ2 exceeds n before Q = (0; 0)jQ(0) = (1; 0)g :

Paul Dupuis (Brown University) March, 2009

slide-10
SLIDE 10

Example: A tandem queue with server slow-down (Ethernet control)

pn = P fQ2 exceeds n before Q = (0; 0)jQ(0) = (1; 0)g : Also, analogous non-Markovian model.

Paul Dupuis (Brown University) March, 2009

slide-11
SLIDE 11

Model problem and large deviation scaling

As a general Markov model one can consider iid random vector …elds

  • vi(y); y 2 Rd

, and the process X n

i+1 = X n i + 1

nvi(X n

i );

X n

0 = x:

Paul Dupuis (Brown University) March, 2009

slide-12
SLIDE 12

Model problem and large deviation scaling

As a general Markov model one can consider iid random vector …elds

  • vi(y); y 2 Rd

, and the process X n

i+1 = X n i + 1

nvi(X n

i );

X n

0 = x:

De…ne H(y; ) = log E exp h; vi(y)i ; L(y; ) = sup

2Rd [h; i H(y; )]

X n(i=n) = X n

i ;

piecewise linear interpolation for t 6= i=n:

Paul Dupuis (Brown University) March, 2009

slide-13
SLIDE 13

Model problem and large deviation scaling (cont’d)

Under conditions fX n()g satis…es a Large Deviation Principle with rate function IT () = Z T L(; _ )dt if is AC and (0) = x, and IT () = 1 else.

Paul Dupuis (Brown University) March, 2009

slide-14
SLIDE 14

Model problem and large deviation scaling (cont’d)

Under conditions fX n()g satis…es a Large Deviation Principle with rate function IT () = Z T L(; _ )dt if is AC and (0) = x, and IT () = 1 else. Heuristically, for T < 1, given , small > 0 and large n P ( sup

0tT

kX n(t) (t)k ) enIT ():

Paul Dupuis (Brown University) March, 2009

slide-15
SLIDE 15

Model problem and large deviation scaling (cont’d)

Let C = f trajectories that hit B prior to A g. To estimate: pn(x) = P fX n 2 CjX n(0) = xg :

Paul Dupuis (Brown University) March, 2009

slide-16
SLIDE 16

Model problem and large deviation scaling (cont’d)

Under mild conditions: 1 n log pn(x) ! inf fIT () : enters B prior to A before T; T < 1g = (x):

Paul Dupuis (Brown University) March, 2009

slide-17
SLIDE 17

Some estimation generalities

1

For standard Monte Carlo we average iid copies of 1fX n2C g. One needs K en samples for bounded relative error [std dev/pn(x)].

Paul Dupuis (Brown University) March, 2009

slide-18
SLIDE 18

Some estimation generalities

1

For standard Monte Carlo we average iid copies of 1fX n2C g. One needs K en samples for bounded relative error [std dev/pn(x)].

2

Alternative approach: construct iid random variables n

1; : : : ; n K with

En

1 = pn(x) and use the unbiased estimator

^ qn;K (x) : = n

1 + + n K

K :

Paul Dupuis (Brown University) March, 2009

slide-19
SLIDE 19

Some estimation generalities

1

For standard Monte Carlo we average iid copies of 1fX n2C g. One needs K en samples for bounded relative error [std dev/pn(x)].

2

Alternative approach: construct iid random variables n

1; : : : ; n K with

En

1 = pn(x) and use the unbiased estimator

^ qn;K (x) : = n

1 + + n K

K :

3

Performance determined by variance of n

1, and since unbiased by

E (n

1)2.

Paul Dupuis (Brown University) March, 2009

slide-20
SLIDE 20

Some estimation generalities

1

For standard Monte Carlo we average iid copies of 1fX n2C g. One needs K en samples for bounded relative error [std dev/pn(x)].

2

Alternative approach: construct iid random variables n

1; : : : ; n K with

En

1 = pn(x) and use the unbiased estimator

^ qn;K (x) : = n

1 + + n K

K :

3

Performance determined by variance of n

1, and since unbiased by

E (n

1)2.

4

By Jensen’s inequality 1 n log E (n

1)2 2

n log En

1 = 2

n log pn(x) ! 2(x):

Paul Dupuis (Brown University) March, 2009

slide-21
SLIDE 21

Some estimation generalities

1

For standard Monte Carlo we average iid copies of 1fX n2C g. One needs K en samples for bounded relative error [std dev/pn(x)].

2

Alternative approach: construct iid random variables n

1; : : : ; n K with

En

1 = pn(x) and use the unbiased estimator

^ qn;K (x) : = n

1 + + n K

K :

3

Performance determined by variance of n

1, and since unbiased by

E (n

1)2.

4

By Jensen’s inequality 1 n log E (n

1)2 2

n log En

1 = 2

n log pn(x) ! 2(x):

5

An estimator is called asymptotically e¢cient if lim inf

n!1 1

n log E (n

1)2 2(x):

Paul Dupuis (Brown University) March, 2009

slide-22
SLIDE 22

Splitting type schemes

Pure branching methods (also called multi-level splitting)

Paul Dupuis (Brown University) March, 2009

slide-23
SLIDE 23

Splitting type schemes

Pure branching methods (also called multi-level splitting) Branching with killing [RESTART, DPR]

Paul Dupuis (Brown University) March, 2009

slide-24
SLIDE 24

Splitting type schemes

Pure branching methods (also called multi-level splitting) Branching with killing [RESTART, DPR] Interacting particle systems (Del Moral et. al.)

Paul Dupuis (Brown University) March, 2009

slide-25
SLIDE 25

Construction of splitting estimators

Pure branching. A certain number [proportional to n] of splitting thresholds C n

r are de…ned which enhance migration, e.g.,

Paul Dupuis (Brown University) March, 2009

slide-26
SLIDE 26

Construction of splitting estimators

Pure branching. A certain number [proportional to n] of splitting thresholds C n

r are de…ned which enhance migration, e.g.,

A single particle is started at x that follows the same law as X n, but branches into a number of independent copies each time a new level is reached.

Paul Dupuis (Brown University) March, 2009

slide-27
SLIDE 27

Construction of splitting estimators (cont’d)

Paul Dupuis (Brown University) March, 2009

slide-28
SLIDE 28

Construction of splitting estimators (cont’d)

The number of new particles M can be random (though independent of past data), and a multiplicative weight wi is assigned to the ith descendent, where E

M

X

i=1

wi = 1:

Paul Dupuis (Brown University) March, 2009

slide-29
SLIDE 29

Construction of splitting estimators (cont’d)

Evolution continues until every particle has reached either A or B. Let Mn

x

= total number of particles generated X n

j (t)

= trajectory of jth particle, W n

j

= product of weights assigned to j along path

Paul Dupuis (Brown University) March, 2009

slide-30
SLIDE 30

Construction of splitting estimators (cont’d)

Evolution continues until every particle has reached either A or B. Let Mn

x

= total number of particles generated X n

j (t)

= trajectory of jth particle, W n

j

= product of weights assigned to j along path Then n =

M n

x

X

j=1

1fX n

j 2CgW n

j

Paul Dupuis (Brown University) March, 2009

slide-31
SLIDE 31

Subsolutions for branching processes

Now consider the asymptotic rate of decay as a function of y: (y) = lim

n!1 1

n log pn(y) = inf fIT () : (0) = y; enters B prior to A before T; T < 1g :

Paul Dupuis (Brown University) March, 2009

slide-32
SLIDE 32

Subsolutions for branching processes

Now consider the asymptotic rate of decay as a function of y: (y) = lim

n!1 1

n log pn(y) = inf fIT () : (0) = y; enters B prior to A before T; T < 1g : Let H(y; ) = H(y; ) [recall H(y; ) = log E exp h; vi(y)i ]:

Paul Dupuis (Brown University) March, 2009

slide-33
SLIDE 33

Subsolutions for branching processes (cont’d)

(y) is a weak-sense solution to the PDE

Paul Dupuis (Brown University) March, 2009

slide-34
SLIDE 34

Subsolutions for branching processes (cont’d)

Subsolutions should satisfy (in the viscosity sense)

Paul Dupuis (Brown University) March, 2009

slide-35
SLIDE 35

Statement of results

Implementation and Performance for Pure Splitting [analogous results for RESTART, etc.]:

Paul Dupuis (Brown University) March, 2009

slide-36
SLIDE 36

Statement of results

Implementation and Performance for Pure Splitting [analogous results for RESTART, etc.]: Consider a continuous function W and suppose splitting levels are the level sets fW (y) i log EM=ng, where EM is the mean number of particles per split.

Paul Dupuis (Brown University) March, 2009

slide-37
SLIDE 37

Statement of results

Implementation and Performance for Pure Splitting [analogous results for RESTART, etc.]: Consider a continuous function W and suppose splitting levels are the level sets fW (y) i log EM=ng, where EM is the mean number of particles per split. Then the number of particles needed to construct a single sample n

1

grows subexponentially if and only if W is a viscosity subsolution.

Paul Dupuis (Brown University) March, 2009

slide-38
SLIDE 38

Statement of results

Implementation and Performance for Pure Splitting [analogous results for RESTART, etc.]: Consider a continuous function W and suppose splitting levels are the level sets fW (y) i log EM=ng, where EM is the mean number of particles per split. Then the number of particles needed to construct a single sample n

1

grows subexponentially if and only if W is a viscosity subsolution. Given u = EM, consider particular scheme that randomizes between buc and buc + 1 and uses weights wi = 1=u.

Paul Dupuis (Brown University) March, 2009

slide-39
SLIDE 39

Statement of results

Implementation and Performance for Pure Splitting [analogous results for RESTART, etc.]: Consider a continuous function W and suppose splitting levels are the level sets fW (y) i log EM=ng, where EM is the mean number of particles per split. Then the number of particles needed to construct a single sample n

1

grows subexponentially if and only if W is a viscosity subsolution. Given u = EM, consider particular scheme that randomizes between buc and buc + 1 and uses weights wi = 1=u. Then lim inf

n!1 1

n log E (n

1)2 W (x) + (x):

Paul Dupuis (Brown University) March, 2009

slide-40
SLIDE 40

Remarks

Subsolutions for interesting models (networks with feedback, non-Markovian systems, serve-the-longer discipline, server-slowdown dynamics, open/closed networks, path-dependent events) known. When available the Freidlin-Ventsel quasipotential can be used to construct subsolutions with optimal value. Subsolutions for importance sampling must be at least piecewise classical sense.

Paul Dupuis (Brown University) March, 2009

slide-41
SLIDE 41

References

Splitting for rare event simulation: A large deviations approach to design and analysis (T. Dean and D.), Stochastic Processes and their Applications, 119, (2009), 562–587. A generalized DPR algorithm for rare event simulation (T. Dean and D.), submitted to Annals of OR.

Paul Dupuis (Brown University) March, 2009