Dual Approximate Dynamic Programming for Large Scale Hydro Valleys - - PowerPoint PPT Presentation

dual approximate dynamic programming for large scale
SMART_READER_LITE
LIVE PREVIEW

Dual Approximate Dynamic Programming for Large Scale Hydro Valleys - - PowerPoint PPT Presentation

Dams management problem DADP in a nutshell Numerical experiments Dual Approximate Dynamic Programming for Large Scale Hydro Valleys P. Carpentier, J-P. Chancelier, V. Lecl` ere, F. Pacaud supported by the FMJH Program Gaspard Monge for


slide-1
SLIDE 1

Dams management problem DADP in a nutshell Numerical experiments

Dual Approximate Dynamic Programming for Large Scale Hydro Valleys

  • P. Carpentier, J-P. Chancelier, V. Lecl`

ere, F. Pacaud

supported by the FMJH Program Gaspard Monge for Optimization.

ENSTA ParisTech and ENPC ParisTech, France

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

1 / 27

slide-2
SLIDE 2

Dams management problem DADP in a nutshell Numerical experiments

Motivation

Electricity production management for hydro valleys 1 year time horizon:

compute each month the “values of water” (Bellman functions)

stochastic framework:

rain, market prices

large-scale valley:

5 dams and more

We wish to remain within the scope of Dynamic Programming.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

2 / 27

slide-3
SLIDE 3

Dams management problem DADP in a nutshell Numerical experiments

How to push the curse of dimensionality limits?

Aggregation methods

fast to run method require some homogeneity between units

Stochastic Dual Dynamic Programming (SDDP)

efficient method for this kind of problems strong assumptions (convexity, linearity)

Dual Approximate Dynamic Programming (DADP)

spatial decomposition method complexity almost linear in the number of dams approximation methods in the stochastic framework

This talk: present numerical results for large-scale hydro valleys using DADP, and comparison with DP and SDDP.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

3 / 27

slide-4
SLIDE 4

Dams management problem DADP in a nutshell Numerical experiments

Lecture outline

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

4 / 27

slide-5
SLIDE 5

Dams management problem DADP in a nutshell Numerical experiments Hydro valley modeling Optimization problem

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

5 / 27

slide-6
SLIDE 6

Dams management problem DADP in a nutshell Numerical experiments Hydro valley modeling Optimization problem

Operating scheme

x3 t Dam 1 Dam 2 Dam 3 a1 t x1 t u1 t a2 t u2 t x2 t a3 t u3 t

ui

t : water turbinated by dam i at time t,

xi

t : water volume of dam i at time t,

ai

t : water inflow at dam i at time t,

pi

t : water price at dam i at time t,

Randomness: wi

t = (ai t, pi t) ,

wt = (w1

t , . . . , wN t ).

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

6 / 27

slide-7
SLIDE 7

Dams management problem DADP in a nutshell Numerical experiments Hydro valley modeling Optimization problem

Dynamics and cost functions

Dam i xi t ai t si t zi t ui t ⊕ zi+1 t

Dam dynamics: xi

t+1 = f i t (xi t, ui t, wi t, zi t) ,

= xi

t−ui t+ai t+zi t−si t ,

zi+1

t

being the outflow of dam i: zi+1

t

= gi

t(xi t, ui t, wi t, zi t) ,

= ui

t+ max

  • 0,xi

t−ui t+ai t+zi t−xi

  • si

t

. We assume the Hazard-Decision information structure (ui

t is chosen

  • nce wi

t is observed), so that ui ≤ ui t ≤ min

  • ui, xi

t + ai t + zi t − xi

. Gain at time t < T: Li

t(xi t, ui t, wi t, zi t) = pi

tui t−ǫ(ui t)2.

Final gain at time T: K i xi

T

  • = −ai min{0,xi

T −

xi}2.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

7 / 27

slide-8
SLIDE 8

Dams management problem DADP in a nutshell Numerical experiments Hydro valley modeling Optimization problem

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

8 / 27

slide-9
SLIDE 9

Dams management problem DADP in a nutshell Numerical experiments Hydro valley modeling Optimization problem

Stochastic optimization problem

The global optimization problem reads: max

(X,U,Z) E

N

  • i=1

T−1

  • t=0

Li

t

  • Xi

t, Ui t, Wi t, Zi t

  • + K i

Xi

T

  • ,

subject to: Xi

t+1 = f i t (Xi t, Ui t, Wi t, Zi t) , ∀i , ∀t ,

σ

  • Ui

t

  • ⊂ σ
  • W0, . . . , Wt
  • ,

∀i , ∀t , Zi+1

t

= gi

t(Xi t, Ui t, Wi t, Zi t) , ∀i , ∀t .

  • Assumption. Noises W0, . . . , WT−1 are independent over time.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

9 / 27

slide-10
SLIDE 10

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

10 / 27

slide-11
SLIDE 11

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

Price decomposition

Dualize the coupling constraints Zi+1

t

= gi

t(Xi t, Ui t, Wi t, Zi t).

Note that the associated multiplier Λi+1

t

is a random variable. Minimize the dual problem (using a gradient-like algorithm).

Dam i xi t ai t si t zi t ui t zi+1 t Dam i + 1

At iteration k, the duality term: Λi+1,(k)

t

·

  • Zi+1

t

−g i

t(Xi t, Ui t, Wi t, Zi t)

  • ,

is the difference of two terms:

Λi+1,(k)

t

· Zi+1

t

dam i +1, Λi+1,(k)

t

· g i

t

  • · · ·
  • dam i.

Dam by dam decomposition for the maximization in (X, U, Z) at Λi+1,(k)

t

fixed.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

11 / 27

slide-12
SLIDE 12

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

12 / 27

slide-13
SLIDE 13

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

DADP core idea

The i-th subproblem writes:

max

Ui,Zi,Xi E

T−1

  • t=0
  • Li

t

  • Xi

t, Ui t, Wi t, Zi t

  • + Λi,(k)

t

· Zi

t

− Λi+1,(k)

t

· g i

t

  • Xi

t, Ui t, Wi t, Zi t

  • + K i

Xi

T

  • ,

but Λi,(k)

t

depends on the whole past of noises (W0, . . . , Wt). . . The core idea of DADP is to replace the constraint Zi+1

t

− g i

t(Xi t, Ui t, Wi t, Zi t) = 0 by its

conditional expectation with respect to Yi

t:

E

  • Zi+1

t

− g i

t(Xi t, Ui t, Wi t, Zi t)

  • Yi

t

  • = 0 ,

where (Yi

0, . . . , Yi T−1) is a “well-chosen” information process.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

13 / 27

slide-14
SLIDE 14

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

DADP core idea

The i-th subproblem writes:

max

Ui,Zi,Xi E

T−1

  • t=0
  • Li

t

  • Xi

t, Ui t, Wi t, Zi t

  • + Λi,(k)

t

· Zi

t

− Λi+1,(k)

t

· g i

t

  • Xi

t, Ui t, Wi t, Zi t

  • + K i

Xi

T

  • ,

but Λi,(k)

t

depends on the whole past of noises (W0, . . . , Wt). . . The core idea of DADP is to replace the constraint Zi+1

t

− g i

t(Xi t, Ui t, Wi t, Zi t) = 0 by its

conditional expectation with respect to Yi

t:

E

  • Zi+1

t

− g i

t(Xi t, Ui t, Wi t, Zi t)

  • Yi

t

  • = 0 ,

where (Yi

0, . . . , Yi T−1) is a “well-chosen” information process.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

13 / 27

slide-15
SLIDE 15

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

Subproblems in DADP

DADP thus consists of a constraint relaxation. This constraint relaxation is equivalent to replace the original multiplier Λi,(k)

t

by its conditional expectation E

  • Λi,(k)

t

  • Yi−1

t

.

The expression of the i-th subproblem becomes:

max

Ui,Zi,Xi E

T−1

  • t=0
  • Li

t

  • Xi

t, Ui t, Wi t, Zi t

  • + E
  • Λi,(k)

t

  • Yi−1

t

  • · Zi

t

− E

  • Λi+1,(k)

t

  • Yi

t

  • · g i

t

  • Xi

t, Ui t, Wi t, Zi t

  • + K i

Xi

T

  • .

If each process Yi follows a dynamical equation, DP applies.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

14 / 27

slide-16
SLIDE 16

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

A crude relaxation: Yi

t ≡ cste

1 The multipliers Λi,(k)

t

appear only in the subproblems by means of their expectations E

  • Λi,(k)

t

  • , so that each

subproblem involves a 1-dimensional state variable.

2 For the gradient algorithm, the coordination task reduces to:

E

  • Λi,(k+1)

t

  • = E
  • Λi,(k)

t

  • + ρtE
  • Zi+1,(k)

t

− g i

t

  • Xi,(k)

t

, Ui,(k)

t

, Wi

t, Zi,(k) t

  • .

3 The constraints taken into account by DADP are in fact:

E

  • Zi+1

t

− g i

t

  • Xi

t, Ui t, Wi t, Zi t

  • = 0 .

The DADP solutions do not satisfy the initial constraints: need to use an heuristic method to regain admissibility.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

15 / 27

slide-17
SLIDE 17

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

How to regain admissible policies?

We have computed N local Bellman functions V i

t at each time t,

each depending on a single state variable xi, whereas we need

  • ne global Bellman function Vt depending on the global state
  • x1, . . . , xN

in order to design the decisions at time t. Heuristic procedure: form the following global Bellman function:

  • Vt
  • x1, . . . , xN

=

N

  • i=1

V i

t

  • xi

, and solve at each time t the one-step DP problem: max

u,z N

  • i=1

Li

t

  • xi, ui, wi

t, zi

+ Vt+1

  • x1

t+1, . . . , xN t+1

  • ,

with xi

t+1 = f i t

  • xi, ui, wi

t, zi

and zi+1 = gi

t(xi, ui, wi t, zi) .

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

16 / 27

slide-18
SLIDE 18

Dams management problem DADP in a nutshell Numerical experiments Spatial decomposition Constraint weakening

How to regain admissible policies?

We have computed N local Bellman functions V i

t at each time t,

each depending on a single state variable xi, whereas we need

  • ne global Bellman function Vt depending on the global state
  • x1, . . . , xN

in order to design the decisions at time t. Heuristic procedure: form the following global Bellman function:

  • Vt
  • x1, . . . , xN

=

N

  • i=1

V i

t

  • xi

, and solve at each time t the one-step DP problem: max

u,z N

  • i=1

Li

t

  • xi, ui, wi

t, zi

+ Vt+1

  • x1

t+1, . . . , xN t+1

  • ,

with xi

t+1 = f i t

  • xi, ui, wi

t, zi

and zi+1 = gi

t(xi, ui, wi t, zi) .

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

16 / 27

slide-19
SLIDE 19

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

17 / 27

slide-20
SLIDE 20

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Full optimization and simulation process

Optimization

Apply DADP and compute the cost-to-go functions V i

t .

Form the approximated global Bellman functions Vt.

Simulation

Draw a large number of noise scenarios. Compute the control values along each scenario by solving the one-step DP problems involving the Vt’s, thus satisfying all the constraints of the initial problem: payoff value for each scenario, state and control trajectories. Evaluate the quality of the solution: mean payoff,. . .

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

18 / 27

slide-21
SLIDE 21

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Academic case studies of increasing complexity

dam 2 dam 1 dam 3 dam 4

Discretization T 12 X 41 U 6 W 10

4-Dams

dam 1 dam 2 dam 3 dam 5 dam 6 dam 4

6-Dams

dam 5 dam 3 dam 4 dam 2 dam 6 dam 7 dam 8 dam 5 dam 3 dam 1

8-Dams

dam 5 dam 3 dam 4 dam 2 dam 6 dam 5 dam 3 dam 7 dam 1 dam 8 dam 9 dam 10

10-Dams

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

19 / 27

slide-22
SLIDE 22

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Optimal values and computational times

Valley 4-Dams 6-Dams 8-Dams 10-Dams DP CPU time 1.6 10 3’ ∼ 10 8’ ∼ ∞ ∼ ∞ DP value 3743 N.A. N.A. N.A. SDDP value 3742 7026 11834 17069 SDDP CPU time 5’ 7’ 9’ 50’ Valley 4-Dams 6-Dams 8-Dams 10-Dams Table: Results obtained by DP and SDDP1 Valley 4-Dams 6-Dams 8-Dams 10-Dams DADP CPU time 7’ 12’ 17’ 24’ DADP value 3667 6816 11573 16760 Gap with SDDP −2.0% −3.0% −2.2% −1.8% Table: Results obtained by DADP “Expectation”

1SDDP method is an alternative to DP, pushing away the dimension barrier. SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

20 / 27

slide-23
SLIDE 23

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Optimal values and computational times

Valley 4-Dams 6-Dams 8-Dams 10-Dams DP CPU time 1.6 10 3’ ∼ 10 8’ ∼ ∞ ∼ ∞ DP value 3743 N.A. N.A. N.A. SDDP value 3742 7026 11834 17069 SDDP CPU time 5’ 7’ 9’ 50’ Valley 4-Dams 6-Dams 8-Dams 10-Dams Table: Results obtained by DP and SDDP1 Valley 4-Dams 6-Dams 8-Dams 10-Dams DADP CPU time 7’ 12’ 17’ 24’ DADP value 3667 6816 11573 16760 Gap with SDDP −2.0% −3.0% −2.2% −1.8% Table: Results obtained by DADP “Expectation”

1SDDP method is an alternative to DP, pushing away the dimension barrier. SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

20 / 27

slide-24
SLIDE 24

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Optimal values and computational times

Valley 4-Dams 6-Dams 8-Dams 10-Dams DP CPU time 1.6 10 3’ ∼ 10 8’ ∼ ∞ ∼ ∞ DP value 3743 N.A. N.A. N.A. SDDP value 3742 7026 11834 17069 SDDP CPU time 5’ 7’ 9’ 50’ Valley 4-Dams 6-Dams 8-Dams 10-Dams Table: Results obtained by DP and SDDP1 Valley 4-Dams 6-Dams 8-Dams 10-Dams DADP CPU time 7’ 12’ 17’ 24’ DADP value 3667 6816 11573 16760 Gap with SDDP −2.0% −3.0% −2.2% −1.8% Table: Results obtained by DADP “Expectation”

1SDDP method is an alternative to DP, pushing away the dimension barrier. SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

20 / 27

slide-25
SLIDE 25

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

4-Dams valley in detail: DADP convergence

Multipliers convergence (dam1↔dam2 and dam2↔dam3)

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

21 / 27

slide-26
SLIDE 26

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

4-Dams valley in detail: payoff distributions

DP payoff distribution DADP payoff distribution

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

22 / 27

slide-27
SLIDE 27

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

CPU time summary

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

23 / 27

slide-28
SLIDE 28

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

1

Dams management problem Hydro valley modeling Optimization problem

2

DADP in a nutshell Spatial decomposition Constraint weakening

3

Numerical experiments Academic examples More realistic examples

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

24 / 27

slide-29
SLIDE 29

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Two “true” valleys

Soulcem Gnioure Izourt Auzat Sabart

Discretization T 12, W 10 realistic grids for U and X

Vicdessos

Chastang Bort Mareges Aigle Sablier

Dordogne

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

25 / 27

slide-30
SLIDE 30

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Results

Valley Vicdessos Dordogne SDDP CPU time 10’ 17’ SDDP value 2244 22136 Table: Results obtained by SDDP Valley Vicdessos Dordogne DADP CPU time 10’ 155’ DADP value 2237 21499 Gap with SDDP

d

−0.3% −2.8% Table: Results obtained by DADP “Expectation”

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

26 / 27

slide-31
SLIDE 31

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Results

Valley Vicdessos Dordogne SDDP CPU time 10’ 17’ SDDP value 2244 22136 Table: Results obtained by SDDP Valley Vicdessos Dordogne DADP CPU time 10’ 155’ DADP value 2237 21499 Gap with SDDP

d

−0.3% −2.8% Table: Results obtained by DADP “Expectation”

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

26 / 27

slide-32
SLIDE 32

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

Conclusions and perspectives

Conclusions for DADP Fast numerical convergence of the method. Near-optimal results even when using a “crude” relaxation. Method that can be used for very large valleys General perspectives Apply to more complex topologies (smart grids). Connection with other decomposition methods. Theoretical study.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

27 / 27

slide-33
SLIDE 33

Dams management problem DADP in a nutshell Numerical experiments Academic examples More realistic examples

  • P. Carpentier et G. Cohen.

D´ ecomposition-coordination en optimisation d´ eterministe et stochastique.

En pr´ eparation, Springer, 2016.

  • P. Girardeau.

R´ esolution de grands probl` emes en optimisation stochastique dynamique.

Th` ese de doctorat, Universit´ e Paris-Est, 2010. J.-C. Alais.

Risque et optimisation pour le management d’´ energies.

Th` ese de doctorat, Universit´ e Paris-Est, 2013.

  • V. Lecl`

ere.

Contributions aux m´ ethodes de d´ ecomposition en optimisation stochastique.

Th` ese de doctorat, Universit´ e Paris-Est, 2014.

  • K. Barty, P. Carpentier, G. Cohen and P. Girardeau,

Price decomposition in large-scale stochastic optimal control.

arXiv, math.OC 1012.2092, 2010.

SOWG DADP applied to large scale hydro valleys

  • Sept. 20, 2017

27 / 27