MDP Algorithms for Portfolio Optimization Problems in pure Jump - - PowerPoint PPT Presentation

mdp algorithms for portfolio optimization problems in
SMART_READER_LITE
LIVE PREVIEW

MDP Algorithms for Portfolio Optimization Problems in pure Jump - - PowerPoint PPT Presentation

1 MDP Algorithms for Portfolio Optimization Problems in pure Jump Markets Nicole B auerle (joint work with U. Rieder) Universit at Karlsruhe (TH) Research University founded 1825 Linz, October 2008 2 Overview Model and


slide-1
SLIDE 1

1

MDP Algorithms for Portfolio Optimization Problems in pure Jump Markets

Nicole B¨ auerle

(joint work with U. Rieder)

Universit¨ at Karlsruhe (TH)

Research University • founded 1825

Linz, October 2008

slide-2
SLIDE 2

2

Overview

  • Model and optimization problem
  • Solution via discrete-time MDPs
  • Solution method and computational aspects
  • Model extensions

Financial optimization problems October 2008 Nicole B¨ auerle

slide-3
SLIDE 3

3

Model and optimization problem

Financial optimization problems October 2008 Nicole B¨ auerle

slide-4
SLIDE 4

4

The market model

Suppose we have a financial market with one bond and d risky assets whose prices evolve as follows

  • Price process (S0

t) of the bond:

S0

t := ert,

r ≥ 0.

  • Price processes (Sk

t ) of the risky assets k = 1, . . . , d:

dSk

t = Sk t−

  • µkdt + dCk

t

  • where µk ∈ I

R are constants and Ct := Nt

n=1 Yn with N = (Nt) a

Poisson process with rate λ > 0 and iid random vectors (Yn) with values in (−1, ∞)d and distribution Q where I EYn < ∞.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-5
SLIDE 5

5

The price processes of the risky assets

If we denote by 0 := T0 < T1 < T2 < . . . the jump time points of the Poisson process and if t ∈ [Tn, Tn+1), then for k = 1, . . . , d Sk

t = Sk Tn exp

  • µk(t − Tn)
  • .

At the time of a jump we have Sk

Tn − Sk Tn− = Sk Tn−Y k n .

In what follows we denote St := (S1

t, . . . , Sd t ).

(St) is a so-called Piecewise Deterministic Markov Process (PDMP).

Financial optimization problems October 2008 Nicole B¨ auerle

slide-6
SLIDE 6

6

A typical sample path

Simulated stock price in the PDMP model.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-7
SLIDE 7

7

Portfolios and self-financing strategies

Portfolio strategy: (Ft)-predictable stochastic process (πt) with values in U := {u ∈ I Rd | u ≥ 0, u · e ≤ 1} where πt = (π1

t, . . . , πd t ) gives the fractions of wealth invested in the risky

assets at time t. 1 − πt · e is the fraction invested in the bond. The equation for the wealth process (Xπ

t ) is then:

dXπ

t = Xπ t

  • r + πt · (µ − re)dt + πtdCt
  • .

Financial optimization problems October 2008 Nicole B¨ auerle

slide-8
SLIDE 8

8

The optimization problem

Let U : (0, ∞) → I R+ be an increasing, concave utility function and define for a portfolio strategy π and t ∈ [0, T ], x > 0: Vπ(t, x) := I Et,xU(Xπ

T ).

V (t, x) := sup

π Vπ(t, x).

Obviously Vπ(T, x) := U(x) = V (T, x). Davis (1993), Norberg (2003), Sch¨ al (2004,2005), Kirch and Runggaldier (2005), Jacobsen (2006), B. and Rieder (2008)

Financial optimization problems October 2008 Nicole B¨ auerle

slide-9
SLIDE 9

9

Solution via discrete-time MDPs

Financial optimization problems October 2008 Nicole B¨ auerle

slide-10
SLIDE 10

10

Solution via discrete-time MDPs: The model

  • State space: E = [0, T ] × (0, ∞). A state (t, x) gives the jump time

point t and the wealth x of the process directly after the jump.

  • Action space: A := {α : [0, T ] → U measurable}. For α ∈ A we define

the movement of the wealth between jumps by

φα

t (x) := x exp

„Z t r + αs · (µ − re)ds « .

  • Transition probability:

q

  • B | t, x, α
  • := λ

T −t e−λs

  • 1B
  • t + s, φα

s (x)

  • 1 + αs · y
  • Q(dy)ds
  • One-stage reward function: r
  • t, x, α
  • := e−λ(T −t)U
  • φα

T −t(x)

  • .

Financial optimization problems October 2008 Nicole B¨ auerle

slide-11
SLIDE 11

11

Solution via discrete-time MDPs

A sequence (fn) with fn ∈ F := {f : E → A measurable} is called Markov policy. The expected reward of such a Markov policy is given by J(fn)(t, x) := I E(fn)

t,x

  • k=0

r

  • Tk, Xk, fk(Tk, Xk)
  • ,

(t, x) ∈ E. Define J(t, x) := sup(fn) J(fn)(t, x), for (t, x) ∈ E. Theorem 1: We have V (t, x) = J(t, x), for (t, x) ∈ E and the optimal portfolio strategies ”coincide”.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-12
SLIDE 12

12

Some important operators

We define the following operators L, Tf, T which act on I M := {v : E → I R+ | v is measurable}: (Lv)(t, x, α) := e−λ(T −t)U

  • φα

T −t(x)

  • +
  • v(s, y)q(ds, dy | t, x, α).

(Tfv)(t, x) := Lv(t, x, f(t, x)), (t, x) ∈ E, f ∈ F. (T v)(t, x) = sup

α∈A

Lv(t, x, α). From MDP theory it follows that J(fn) = lim

n→∞ Tf0 . . . Tfn−10

Jf := J(f) = lim

n→∞ T n f 0

Jf = TfJf.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-13
SLIDE 13

13

A peculiar norm

Let b(t, x) := eβ(T −t)(1 + x), (t, x) ∈ E for β ≥ 0. Next we introduce the weighted supremum norm · b on I M by vb := sup

(t,x)∈E

v(t, x) b(t, x) and I Bb := {v ∈ I M | vb < ∞}. Finally, we define the set I Mc := {v ∈ I Bb | v is continuous, v(t, x) is concave and increasing in x, decreasing in t}.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-14
SLIDE 14

14

Properties of T and Tf

Theorem 2: It holds that a) T : I Mc → I Mc. b) For v, w ∈ I Bb and f ∈ F we have Tfv − Tfwb ≤ cβv − wb T v − T wb ≤ cβv − wb. with cβ :=

λ(1+¯ y) β+λ− ¯ µ

  • 1 − e−T (β+λ− ¯

µ)

. Thus the operators Tf, T are contracting if cβ < 1 which is the case if β is large enough.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-15
SLIDE 15

15

Characterization of the value function

Theorem 3: a) The value function V is the unique fixed point of T in I Mc. b) For J0 ∈ I Mc it holds that V − T nJ0b ≤ cn

β

1 − cβ T J0 − J0b. c) There exists an optimal stationary portfolio strategy π, i.e. there exists an f ∈ F such that πt = f(Tn, Xn)(t − Tn) for t ∈ [Tn, Tn+1).

Financial optimization problems October 2008 Nicole B¨ auerle

slide-16
SLIDE 16

16

Solution method and computational aspects

Financial optimization problems October 2008 Nicole B¨ auerle

slide-17
SLIDE 17

17

Howard’s policy improvement algorithm

Theorem 4: Howard’s policy improvement algorithm works for the portfolio

  • problem. More precisely, let f, g ∈ F .

a) If for some subset E0 ⊂ E

g(t, x) ∈ D(t, x, f) := {α ∈ A | LJf(t, x, α) > Jf(t, x)}, (t, x) ∈ E0 g(t, x) = f(t, x), (t, x) / ∈ E0

then Jg ≥ Jf and Jg(t, x) > Jf(t, x) for (t, x) ∈ E0. b) If D(t, x, f) = ∅ for all (t, x) ∈ E then Jf = V , i.e. the decision rule f defines the optimal stationary portfolio strategy. Application: Suppose r = µi and U is continuously differentiable and U ′(x + u · Y )Y is integrable for all x > 0 and u small. Then ”invest all the money in the bond” is optimal if and only if I EY ≤ 0.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-18
SLIDE 18

18

Approximating the utility function

Let U (n), U be utility functions and denote by V (n), V the corresponding value functions and by A∗

n(t, x) := {α ∈ A : T (n)V (n)(t, x) = L(n)V (n)(t, x, α)}

A∗(t, x) := {α ∈ A : T V (t, x) = LV (t, x, α)}. Moreover, let us denote by LsA∗

n(t, x)

:= {α ∈ A; α is an accumulation point of a sequence (αn) with αn ∈ A∗

n(t, x) ∀n ∈ I

N} the upper limit of the set sequence

  • A∗

n(t, x)

  • .

Jouini and Napp (2004), Sch¨ al (1975)

Financial optimization problems October 2008 Nicole B¨ auerle

slide-19
SLIDE 19

19

Approximating the utility function

Theorem 5: a) If U and ˜ U are two utility functions with corresponding value functions V and ˜ V , then V − ˜ V b ≤ U − ˜ Ub eT ¯

µ

1 − cβ . b) Let

  • U (n)

be a sequence of utility functions with limn→∞ U (n)−Ub = 0. Then limn→∞ V (n) − V b = 0 and we get that ∅ = LsA∗

n(t, x) ⊂ A∗(t, x)

for all (t, x) ∈ E, i.e. in particular, the limit f ∗ of a sequence of decision rules (f ∗

n) with f ∗ n(t, x) ∈ A∗ n(t, x) for all (t, x) ∈ E defines an optimal

portfolio strategy for the model with utility function U.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-20
SLIDE 20

20

State space discretization

Choose a grid G ⊂ E and define the grid operator TG on I Mc by TGv(t, x) :=

  • T v(t, x),

for (t, x) ∈ G linear interpolation, else and bG : E → I R+ by a linear interpolation of b. For v ∈ I Mc:

vG := sup

(t,x)∈E

v(t, x) bG(t, x) ≤ vb.

Theorem 6: Suppose that cG < 1. Then it holds for J0 ∈ I Mc that V − T n

G J0G ≤

1 1 − cG

  • cn

GTGJ0 − J0G + m(h)

  • where m(h) := V − TGV G → 0 if the mesh size tends to zero.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-21
SLIDE 21

21

Numerical example

One stock (d = 1): µ = r = 0. Relative jump distribution: Q(dy) =

  • pλ+e−λ+ydy

, y ≥ 0 (1 − p)λ−(y + 1)λ−−1 , −1 < y < 0 p = 0.5, λ+ = λ− = 1. Utility function U(x) = 1 γxγ, γ = 0.5. Horizon T = 1 Mesh size: h = 0.01

Financial optimization problems October 2008 Nicole B¨ auerle

slide-22
SLIDE 22

22

Numerical example

2 4 6 8 10 12 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 x

Upper solid line: V (0, x). Upper dotted line: J1(0, x). Lower solid line: V (1, x) = U(x).

Financial optimization problems October 2008 Nicole B¨ auerle

slide-23
SLIDE 23

23

Model extensions

Financial optimization problems October 2008 Nicole B¨ auerle

slide-24
SLIDE 24

24

Model extensions: Dynamic risk constraints

Next we impose an additional dynamic risk constraint of the form U(x) = {u ∈ U | ρ “ Xπ

T1 − x

” = uxρ(Y ) ≤ 0.1} where ρ is AVaR. Solid line: Value function without con- straints. Dotted line: Utility function. Dashed line: First iteration of the grid

  • perator with risk constraints.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-25
SLIDE 25

25

Model extensions: Partial information

Suppose we do not know the jump intensity Λ ∈ {λ1, . . . , λm}. Define pk(t) := I P

  • Λ = λk | FS

t

  • ,

ˆ λt := λkpk(t) = I E

  • Λ | FS

t

  • .

Filter equation: pk(t) = pk(0) + t pk(s−) λk − ˆ λs− ˆ λs−

ηs where ˆ ηt = Nt − t ˆ λsds. The filter becomes part of the state space, i.e. the value function is of the form V (t, x, p). The optimization problem can be solved as before by reduction to a discrete time MDP.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-26
SLIDE 26

26

Comparison: Full - Partial Information

Let U(x) = 1

γxγ with 0 < γ < 1, d = 1 and Q = δy0 with y0 < 0.

Here, the optimal fractions of wealth invested in the stock u∗

λ (full

information), u∗(t, p) (partial information) do not depend on the wealth

  • itself. Let λ = (λ1, . . . λm).

Theorem 7: The optimal fraction u∗(t, p) invested in the stock has the following property for all (t, p) ∈ [0, T ] × P: u∗

λ′p ≤ u∗(t, p).

B¨ auerle and Rieder (2007)

Financial optimization problems October 2008 Nicole B¨ auerle

slide-27
SLIDE 27

27

References

[1] B¨ auerle, N., Rieder, U. (2008) : MDP algorithms for portfolio optimization problems in pure jump markets. Preprint. [2] Davis, M.H.A. (1993) : Markov models and optimization. Chapman & Hall. [3] Jacobsen, M. (2006) : Point process theory and applications. Birkh¨ auser. [4] Kirch, M., Runggaldier, W. (2005) : Efficient hedging when asset prices follow a geometric Poisson process with unknown intensity . SIAM J. Control Optim., 43:1174-1195. [5] Norberg, R. (2003) : The Markov chain market. Astin Bulletin, 33:265-287. [6] Sch¨ al, M. (2005) : Control of ruin probabilities by discrete-time investments, Math. Methods Oper. Res., 62:141–158.

Financial optimization problems October 2008 Nicole B¨ auerle

slide-28
SLIDE 28

28

Thank you for your attention!

Financial optimization problems October 2008 Nicole B¨ auerle