SLIDE 1 Robust control of a risk-sensitive performance measure
Paul Dupuis
Division of Applied Mathematics Brown University
- R. Atar, A. Budhiraja, R. Wu
ICERM, June 2019
SLIDE 2
Three settings for robust optimization/control
SLIDE 3
Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures
SLIDE 4 Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures Deterministic uncertain systems subject to “disturbances”, with
- rdinary performance measures
SLIDE 5 Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures Deterministic uncertain systems subject to “disturbances”, with
- rdinary performance measures
Stochastic uncertain systems with rare event perfomance measures
SLIDE 6 Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures Deterministic uncertain systems subject to “disturbances”, with
- rdinary performance measures
Stochastic uncertain systems with rare event perfomance measures Historically second predates …rst:
SLIDE 7 Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures Deterministic uncertain systems subject to “disturbances”, with
- rdinary performance measures
Stochastic uncertain systems with rare event perfomance measures Historically second predates …rst: Linear and nonlinear H1 control (Zames, 1981, Glover and Doyle, 1988, Helton and James, 1999)
SLIDE 8 Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures Deterministic uncertain systems subject to “disturbances”, with
- rdinary performance measures
Stochastic uncertain systems with rare event perfomance measures Historically second predates …rst: Linear and nonlinear H1 control (Zames, 1981, Glover and Doyle, 1988, Helton and James, 1999) Robust properties of risk-sensitive control (Jacobson, 1973, D, James and Petersen, 2000, Hansen and Sargent, 2001 & 2008)
SLIDE 9 Three settings for robust optimization/control
Stochastic uncertain systems with ordinary performance measures Deterministic uncertain systems subject to “disturbances”, with
- rdinary performance measures
Stochastic uncertain systems with rare event perfomance measures Historically second predates …rst: Linear and nonlinear H1 control (Zames, 1981, Glover and Doyle, 1988, Helton and James, 1999) Robust properties of risk-sensitive control (Jacobson, 1973, D, James and Petersen, 2000, Hansen and Sargent, 2001 & 2008) Current work (Atar, Budhiraja, D and Wu, see also D, Katsoulakis, Pantazis and Rey-Bellet 2018)
SLIDE 10
Stochastic uncertain systems with ordinary performance
Elements of the framework Probability models, on space S, often a path space P = nominal (computational, design) vs Q = true (impractical)
SLIDE 11
Stochastic uncertain systems with ordinary performance
Elements of the framework Probability models, on space S, often a path space P = nominal (computational, design) vs Q = true (impractical) Performance measures, for f : S ! R EQ[f ] = EQ[f (X)]
SLIDE 12
Stochastic uncertain systems with ordinary performance
Elements of the framework Probability models, on space S, often a path space P = nominal (computational, design) vs Q = true (impractical) Performance measures, for f : S ! R EQ[f ] = EQ[f (X)] Here f may combine a cost with dynamics that take random variables under Q (or P) into the system state: f (w) = Z T c(G[w](t))dt; G : W ! X; dX(t) = b(X(t))dt + dW (t):
SLIDE 13 Stochastic uncertain systems with ordinary performance
Elements of the framework A notion of distance between models, here taken to be relative entropy, aka Kullback-Leibler divergence: R (Q kP ) =
dP
R
S log
dQ
dP (s)
if Q P 1 else. De…nes neighborhoods of P via fQ : R (Q kP ) rg.
SLIDE 14 Stochastic uncertain systems with ordinary performance
Elements of the framework A notion of distance between models, here taken to be relative entropy, aka Kullback-Leibler divergence: R (Q kP ) =
dP
R
S log
dQ
dP (s)
if Q P 1 else. De…nes neighborhoods of P via fQ : R (Q kP ) rg. R ( k) is jointly convex and lsc, R (Q kP ) 0 and = 0 i¤ Q = P.
SLIDE 15 Stochastic uncertain systems with ordinary performance
Elements of the framework A notion of distance between models, here taken to be relative entropy, aka Kullback-Leibler divergence: R (Q kP ) =
dP
R
S log
dQ
dP (s)
if Q P 1 else. De…nes neighborhoods of P via fQ : R (Q kP ) rg. R ( k) is jointly convex and lsc, R (Q kP ) 0 and = 0 i¤ Q = P. Optimality (tightest bounds with respect to neighborhoods). This automatically introduces nonlinearity, akin to Legendre transform. E.g., if performance measure EQ[f ], Lagrange multipliers lead to quantities like P(; f ) = sup
Q
[EQ[f ] R (Q kP )] :
SLIDE 16
Stochastic uncertain systems with ordinary performance
Elements of the framework The mapping f : S ! R may include parameter 2 A to optimize f = f. Then may want to solve problems like min
2A
max
Q:R(QkP )r EQ[f]:
SLIDE 17
Stochastic uncertain systems with ordinary performance
Elements of the framework The mapping f : S ! R may include parameter 2 A to optimize f = f. Then may want to solve problems like min
2A
max
Q:R(QkP )r EQ[f]:
In a dynamical setting also consider optimal control under model uncertainty, and often with Q and P measures on the “driving noise.”
SLIDE 18
Stochastic uncertain systems with ordinary performance
Elements of the framework The mapping f : S ! R may include parameter 2 A to optimize f = f. Then may want to solve problems like min
2A
max
Q:R(QkP )r EQ[f]:
In a dynamical setting also consider optimal control under model uncertainty, and often with Q and P measures on the “driving noise.” Key is the variational formula relating QoI under Q with functional of P is log EP h ecf i = sup
QP
[cEQ[f ] R (Q kP )] :
SLIDE 19
Stochastic uncertain systems with ordinary performance
Elements of the framework The mapping f : S ! R may include parameter 2 A to optimize f = f. Then may want to solve problems like min
2A
max
Q:R(QkP )r EQ[f]:
In a dynamical setting also consider optimal control under model uncertainty, and often with Q and P measures on the “driving noise.” Key is the variational formula relating QoI under Q with functional of P is log EP h ecf i = sup
QP
[cEQ[f ] R (Q kP )] : Hence whenever Q P, cEQ[f ] R (Q kP ) + log EP h ecf i : Minimizing Q is dQ = ecf dP= R ecf dP.
SLIDE 20 Stochastic uncertain systems with ordinary performance
Example: how parts come together Suppose f = f with 2 A and we want to solve “optimally robust
- ptimization”: with r > 0 …xed
min
2A
max
Q:R(QkP )r EQ[f]:
SLIDE 21 Stochastic uncertain systems with ordinary performance
Example: how parts come together Suppose f = f with 2 A and we want to solve “optimally robust
- ptimization”: with r > 0 …xed
min
2A
max
Q:R(QkP )r EQ[f]:
Then using Lagrange multipliers ( = 1=c) min
2A
Q
min
c>0
c [r R (Q kP )]
2A
c>0 max Q
c R (Q kP )
c r
2A min c>0
1 c
h ecfi : Final problem phrased purely in terms of the design model, with nice properties in c.
SLIDE 22 Stochastic uncertain systems with ordinary performance
If for some …xed performance requirement B < 1 we …nd r such that min
2A min c>0
1 c
h ecfi = B:
SLIDE 23 Stochastic uncertain systems with ordinary performance
If for some …xed performance requirement B < 1 we …nd r such that min
2A min c>0
1 c
h ecfi = B: Then with the minimizer EQ[f] B for all Q : R (Q kP ) r, and r is largest possible value.
SLIDE 24 Stochastic uncertain systems with ordinary performance
Special case: uncertain model aspects of Jacobson’s LEQG. In the 70s Jacobson introduced the linear/exponential/quadratic/Gaussian formulation of control design. Here choose m(; ) to minimize in Sc(x0) = inf
m E
Z T (hX(s); QX(s)i + hu(s); Ru(s)i) ds
- with u(s) = m(X(s); s) and
dX(s) = AX(s)ds + Bu(s)ds + CdW (s); X(0) = x0:
SLIDE 25 Stochastic uncertain systems with ordinary performance
Special case: uncertain model aspects of Jacobson’s LEQG. In the 70s Jacobson introduced the linear/exponential/quadratic/Gaussian formulation of control design. Here choose m(; ) to minimize in Sc(x0) = inf
m E
Z T (hX(s); QX(s)i + hu(s); Ru(s)i) ds
- with u(s) = m(X(s); s) and
dX(s) = AX(s)ds + Bu(s)ds + CdW (s); X(0) = x0: For optimal feedback control m a PDE argument gives 1 c log Sc(x0) = sup
v E
Z T X(s); Q X(s)
u(s); R u(s)i
c Z T kv(s)k2 ds
where sup over progressively measurable v and d X(s) = A X(s)ds + B u(s)ds + Cv(s)ds + CdW (s); X(0) = x0:
SLIDE 26 Stochastic uncertain systems with ordinary performance
Thus for any v E Z T X(s); Q X(s)
u(s); R u(s)i
1 c E Z T kv(s)k2 ds + 1 c log Sc(x0): Can use v to represent model error [e.g., if Ax should be Ax + Ca(x) take v(s) = a( X(s))].
SLIDE 27 Deterministic uncertain systems with ordinary performance
H1-control (state space formulation, adapted to context). A completely deterministic approach uses _ (s) = A(s) + Bu(s) + Cv(s); (0) = x0; with u(s) = m((s); s) and v(s) : [0; T] ! Rk a “disturbance.” The control m(; ) is chosen to minimize in V (x0) = inf
m(;) sup v
Z T
u(s); R u(s)i 1 2c kv(s)k2
SLIDE 28 Deterministic uncertain systems with ordinary performance
H1-control (state space formulation, adapted to context). A completely deterministic approach uses _ (s) = A(s) + Bu(s) + Cv(s); (0) = x0; with u(s) = m((s); s) and v(s) : [0; T] ! Rk a “disturbance.” The control m(; ) is chosen to minimize in V (x0) = inf
m(;) sup v
Z T
u(s); R u(s)i 1 2c kv(s)k2
If m is a minimizer, then for any disturbance v Z T (h(s); Q(s)i + hu(s); Ru(s)i) ds Z T 1 2c kv(s)k2 ds + V (x0): Here an original motivation was that v could represent model error [e.g., if Ax should be Ax + Ca(x) use v(s) = a((s))].
SLIDE 29
Stochastic uncertain systems with rare event perfomance
The variational bound based on relative entropy cEQ[f ] R (Q kP ) + log EP h ecf i is not useful when EQ[f ] is determined by rare events (e.g., escape probability under the true).
SLIDE 30 Stochastic uncertain systems with rare event perfomance
The variational bound based on relative entropy cEQ[f ] R (Q kP ) + log EP h ecf i is not useful when EQ[f ] is determined by rare events (e.g., escape probability under the true). What is a good replacement for 1 c log EP h ecf i = sup
QP
c R (Q kP )
SLIDE 31
Rare event performance measures and Rényi divergence
Recent variational formula relates risk-sensitive QoI and Rényi divergence.
SLIDE 32 Rare event performance measures and Rényi divergence
Recent variational formula relates risk-sensitive QoI and Rényi divergence. Let 0 < < . Then 1 log EP h ef i = sup
QP
1 log EQ h ef i
R
where for mutually absolutely continuous P; Q and > 1 R(Q kP ) = 1 ( 1) log Z
S
dQ dP 1 dQ:
SLIDE 33 Rare event performance measures and Rényi divergence
Recent variational formula relates risk-sensitive QoI and Rényi divergence. Let 0 < < . Then 1 log EP h ef i = sup
QP
1 log EQ h ef i
R
where for mutually absolutely continuous P; Q and > 1 R(Q kP ) = 1 ( 1) log Z
S
dQ dP 1 dQ: As # 0 recover relative entropy formula. Bounds on risk-sensitive QoI for various Q at level in terms of one at level in terms of design: 1 log EQ h ef i 1 log EP h ef i + 1 R
SLIDE 34 Rare event performance measures and Rényi divergence
Some qualitative properties of Rényi divergence: Bounds independent of underlying probability space (data processing inequality) A chain rule for product measures, but not for Markov measures However, bounds still scale with meaningful limits large time/system size (Rényi rate), even for Markov measures Quantity one would optimize over in robust design (here ) appears also in R
- (Q kP ). Complicates formulation of robust optimization
SLIDE 35
Rare event performance measures and Rényi divergence
Fix a class of models Q (e.g., fQ : R1(Q kP ) rg) and de…ne g() = supfR(Q kP ) : Q 2 Qg; 2 (1; 1):
SLIDE 36 Rare event performance measures and Rényi divergence
Fix a class of models Q (e.g., fQ : R1(Q kP ) rg) and de…ne g() = supfR(Q kP ) : Q 2 Qg; 2 (1; 1):
Theorem
Under integrability conditions on f , sup
Q2Q
1 log EQef inf
F(; );
F(; ) = 2 4 g
log EP h ef i 3 5 ; and the in…mum over is a convex minimization problem.
SLIDE 37 Rare event performance measures and Rényi divergence
Fix a class of models Q (e.g., fQ : R1(Q kP ) rg) and de…ne g() = supfR(Q kP ) : Q 2 Qg; 2 (1; 1):
Theorem
Under integrability conditions on f , sup
Q2Q
1 log EQef inf
F(; );
F(; ) = 2 4 g
log EP h ef i 3 5 ; and the in…mum over is a convex minimization problem. As with ordinary performance measures, there is an optimization/control
- generalization. Also, the bounds scale properly with time, and one can
consider in…nite time problem with Rényi divergence rate.
SLIDE 38
Rare event performance measures and Rényi divergence
Example: Optimal optimization/control of tail behavior with model uncertainty Design model: Arrivals are Poisson with rates (intensities) i, service (when allocated to i) are exponential with mean 1=i, and control is which class to serve. Signi…cant criticism of the model: exponential interarrival and (especially) service times.
SLIDE 39 Rare event performance measures and Rényi divergence
Let Xi(t) be queue length at time t under some control, X n(t) = 1
nX(nt),
and consider as tail-type performance measure EPe Pd
i=1 ciX n i (T ): On the risk-sensitive cost for a Markovian multiclass queue with priority, Atar,
Goswami, Shwarz, 2014.
SLIDE 40 Rare event performance measures and Rényi divergence
Let Xi(t) be queue length at time t under some control, X n(t) = 1
nX(nt),
and consider as tail-type performance measure EPe Pd
i=1 ciX n i (T ):
Then when n ! 1 one can show optimal to allocate service time to solve min ( d X
i=1
[ieci ii(1 eci )]+ : i 0;
d
X
i=1
i = 1 ) : This can be implemented via prioritize service according to largest i(1 eci ); a risk-sensitive analogue of c rule. But what if not P?
On the risk-sensitive cost for a Markovian multiclass queue with priority, Atar,
Goswami, Shwarz, 2014.
SLIDE 41
Rare event performance measures and Rényi divergence
We consider the robust problem with non-exponential service/interarrival distributions, e.g., hazard rate.
SLIDE 42
Rare event performance measures and Rényi divergence
We consider the robust problem with non-exponential service/interarrival distributions, e.g., hazard rate. True model: Let hi;1 and hi;2 denote hazard rates for times between arrivals and services for class i, and assume ai;1 hi;1() i bi;1; ai;2 hi;2() i bi;2; and let Q be corresponding family of models.
SLIDE 43 Rare event performance measures and Rényi divergence
We consider the robust problem with non-exponential service/interarrival distributions, e.g., hazard rate. True model: Let hi;1 and hi;2 denote hazard rates for times between arrivals and services for class i, and assume ai;1 hi;1() i bi;1; ai;2 hi;2() i bi;2; and let Q be corresponding family of models. Then for Q 2 Q we have R(Q[0;nT ]
with the constraint tight for some such Q, and g0() =
d
X
i=1
[k(ai;1) _ k(bi;1)] i +
d
X
i=1
[k(ai;2) _ k(bi;2)] i; with k(x) = x x + 1 ( 1) :
SLIDE 44 Rare event performance measures and Rényi divergence
For min/max optimum with regard to Q, should solve min 8 < : g0
X
i=1
[ieci ii(1 eci )]+ : i 0;
d
X
i=1
i = 1 9 = ; ; where min is over and fig.
SLIDE 45 Summary
Risk-sensitive control and relative entropy give a useful approach to certain problems of optimization under model uncertainty for ordinary costs. Costs based on rare events require a di¤erent approach, and we propose a related one based on risk-sensitive control and Renyi divergence. Initial applications are to control of queuing models to handle, among
- ther things, old complaints regarding service time distributions.
Tightness of the bounds, in the sense that there is a model within Q for which the bounds give equality, has been established for some circumstances (e.g. > 0 small), but is an area that needs more investigation.
SLIDE 46 References
Jacobson’s LQEG control problem: Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic di¤erential games, D.H. Jacobson, IEEE Trans. on Auto. Control, 18, (1973), 124–131. Start of H1 control: Feedback and optimal sensitivity: model reference transformations, multaplicative seminorms, and approximate inverses, G. Zames, IEEE
- Trans. Auto. Control, 26, (1981), 301–320.
State space formulation of H1 control: State space formulae for all stabilizing controllers that satisfy an H1 norm bound and relations to risk sensitivity, K. Glover and J. C. Doyle, Systems Control Lett., 11, (1988), 167–172. Extension to nonlinear systems: Extending H1 Control to Nonlinear Systems: Control of Nonlinear Systems to Achieve Performance Objectives, J. W. Helton and M. R. James, (1999), SIAM.
SLIDE 47 References
Paper that considers di¤usions with uncertain drift, and makes connections with H1 control: Robust properties of risk–sensitive control (D, James and Petersen),
- Math. of Control, Signals and Systems, 13, (2000), pp. 318–332.
Solution for particular classes of models including uncertain linear/quadratic: Minimax optimal control of stochastic uncertain systems with relative entropy constraints (Petersen, James and D), IEEE Trans. on Auto. Control., 45, (2000), pp. 398–412. Applications to economics: Robust control and model uncertainty (Hansen and Sargent), The American Economic Review, 91, (2001), pp. 60-66. Robustness, (Hansen and Sargent), Wiley, 2008.
SLIDE 48 References
Rare event papers: Robust bounds on risk-sensitive functionals via Rényi divergence, (R. Atar, K. Chowdhary and D), SIAM/ASA J. Uncertainty Quanti…cation, 3, (2015), 18–33. Sensitivity analysis for rare events based on Rényi divergence (D, M.A. Katsoulakis, Y. Pantazis and L. Rey-Bellet), to appear in Ann.
Robust bounds and optimization of tail properties of queueing models via Rényi divergence, R. Atar, A. Budhiraja, D. R. Wu, preprint.