Plans Are Worthless but Options Estimating L 0 Planning Is - - PowerPoint PPT Presentation

plans are worthless but
SMART_READER_LITE
LIVE PREVIEW

Plans Are Worthless but Options Estimating L 0 Planning Is - - PowerPoint PPT Presentation

Eisenhowers Observation Rational Decision . . . In These Terms, What . . . Plans Are Worthless but Options Estimating L 0 Planning Is Everything: Estimating L 1 Conclusions A Theoretical Explanation Acknowledgments of Eisenhowers


slide-1
SLIDE 1

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 1 of 14 Go Back Full Screen Close Quit

Plans Are Worthless but Planning Is Everything: A Theoretical Explanation

  • f Eisenhower’s Observation

Angel F. Garcia Contreras, Martine Ceberio, and Vladik Kreinovich

Department of Computer Science University of Texas at El Paso 500 W. University El Paso, Texas 79968, USA afgarciacontreras@miners.utep.edu, mceberio@utep.edu@utep.edu, vladik@utep.edu

slide-2
SLIDE 2

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 2 of 14 Go Back Full Screen Close Quit

1. Eisenhower’s Observation

  • Dwight D. Eisenhower was:

– the Supreme Commander of the Allied Expedi- tionary Forces in Europe during WW2 – and later the US President.

  • He emphasized that his war experience taught him that

“plans are worthless, but planning is everything”.

  • At first glance, this sounds paradoxical: if plans are

worthless, why bother with planning at all?

  • In this paper, we show that this Eisenhower’s observa-

tion has a meaning: – while following the original plan in constantly changing circumstances is often not a good idea, – the existence of a pre-computed original plan en- ables us to produce an almost-optimal strategy.

slide-3
SLIDE 3

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 3 of 14 Go Back Full Screen Close Quit

2. Rational Decision Making: a Brief Reminder

  • According to decision making theory:

– decisions by a rational decision maker – can be described as maximize the value a certain function known as utility.

  • E.g., in financial situations, when a company needs to

make a decision, the overall profit can be used as utility.

  • To describe a possible action x, we usually need to

describe the values of several quantities x1, . . . , xn.

  • E.g.,

a decision about a plant involves selecting amounts xi of manufactured gadgets of different type.

  • Similarly, we need several quantities a1, . . . , am to de-

scribe a situation.

  • Let u(x, a) denote the utility that results from perform-

ing action x in situation a.

slide-4
SLIDE 4

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 4 of 14 Go Back Full Screen Close Quit

3. In These Terms, What Is Planning

  • Let

a describe the original situation.

  • Based on this situation, we come up with an action

x that maximizes the corresponding utility: u( x, a) = max

x

u(x, a).

  • Computing this optimal action

x is what we usually call planning.

  • When we need to start acting, the situation may have

changed to a = a.

  • Let us denote the corresponding change by ∆a

def

= a− a, then a = a + ∆a.

slide-5
SLIDE 5

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 5 of 14 Go Back Full Screen Close Quit

4. Options

  • One possibility is to simply ignore the change, and ap-

ply the original plan x to the new situation a = a+∆a.

  • This plan is, in general, not optimal for the new situ-

ation.

  • The actually optimal plan is xopt for which

u(xopt, a + ∆a) = max

x

u(x, a + ∆a).

  • In comparison with the optimal plan, we lose the

amount L0

def

= u(xopt, a + ∆a) − u( x, a + ∆a).

  • Why cannot we just find the optimal solution for the

new situation?

  • Optimization is NP-hard, so, it is not possible to find

the exact optimum in reasonable time.

slide-6
SLIDE 6

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 6 of 14 Go Back Full Screen Close Quit

5. Options (cont-d)

  • What we can do is:

– try to use some feasible algorithm – e.g., solving a system of linear equations, – to modify the plan x into x + ∆x.

  • Due to NP-hardness, this feasibly modified plan is, in

general, not optimal.

  • We hope that the resulting loss L1

def

= u(xopt, a+∆a)− u( x + ∆x, a + ∆a) is much smaller than L0.

  • In this paper, we show that indeed L1 ≪ L0; so:

– even if L0 is so large that the original plan is worth- less, – the modified plan may leads to a reasonably small loss L1 ≪ L0.

  • This explains Eisenhower’s observation.
slide-7
SLIDE 7

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 7 of 14 Go Back Full Screen Close Quit

6. Estimating L0

  • We assume that the difference ∆a is reasonably small.
  • So, the corresponding difference in action ∆xopt

def

= xopt − x is also small.

  • We can therefore expand L0 in Taylor series and keep
  • nly terms linear and quadratic in ∆x:

L0 = u(xopt, a + ∆a) − u(xopt − ∆xopt, a + ∆a) =

n

  • i=1

∂u ∂xi (xopt, a + ∆a) · ∆xopt

i +

1 2·

n

  • i=1

n

  • i′=1

∂2u ∂xi∂xi′ (xopt, a+∆a)·∆xopt

i

·∆xopt

i′ +o((∆a)2).

  • By definition, the action xopt maximizes u(x,

a + ∆a).

  • Thus, we have ∂u

∂xi (xopt, a + ∆a) = 0.

slide-8
SLIDE 8

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 8 of 14 Go Back Full Screen Close Quit

7. Estimating L0 (cont-d)

  • So, the above expression for L0 takes the simplified

form L0 = 1 2·

n

  • i=1

n

  • i′=1

∂2u ∂xi∂xi′ (xopt, a+∆a)·∆xopt

i ·∆xopt i′ +o((∆a)2).

  • ∆xopt

i

can be estimated from the condition: ∂u ∂xi (xopt, a + ∆a) = ∂u ∂xi ( x + ∆xopt, a + ∆) = 0.

  • For a =

a, u is max when x = x, so ∂u ∂xi ( x, a) = 0.

  • Expanding the equation in Taylor series in ∆xi and

∆aj and taking ∂u ∂xi ( x, a) = 0 into account, we get:

n

  • i′=1

∂2u ∂xi∂xi′ ( x, a)·∆xopt

i′ + m

  • j=1

∂2u ∂xi∂aj ( x, a)·∆aj+o(∆x, ∆a) = 0.

slide-9
SLIDE 9

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 9 of 14 Go Back Full Screen Close Quit

8. Estimating L0 (final)

  • Thus, the first approximation ∆xi to the values ∆xopt

i

satisfies a system of linear equations:

n

  • i′=1

∂2u ∂xi∂xi′ ( x, a) · ∆xj = −

m

  • j=1

∂2u ∂xi∂aj ( x, a) · ∆aj.

  • A solution to a system of linear equations is a linear

combination of the right-hand sides.

  • Thus, the values ∆xi are a linear function of ∆aj.
  • Substituting these linear expressions into the formula

for L0, we conclude that L0 is quadratic in ∆aj: L0 =

m

  • j=1

m

  • j′=1

kjj′ · ∆aj · ∆aj′ + o((∆a)2) for some kjj′.

slide-10
SLIDE 10

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 10 of 14 Go Back Full Screen Close Quit

9. Estimating L1

  • The 1st approximation ∆x to the difference ∆xopt can

be obtained by solving a system of linear equations.

  • How much do we lose if we use xlin =

x + ∆x?

  • Here, ∆xopt = ∆x+δx, where δx is of 2nd order in ∆x

and ∆a: δx = O((∆a)2).

  • The loss L1 of using xlin = xopt − δx instead of xopt is:

L1 = u(xopt, a + ∆a) − u(xlin, a + ∆a) = u(xopt, a + ∆a) − u(xopt − δx, a + ∆a).

  • If we expand this expression in δx, we get:

L1 =

n

  • i=1

∂u ∂xi (xopt, a + ∆a) · δxi+ 1 2 ·

n

  • i=1

n

  • i′=1

∂2u ∂xi∂xi′ (xopt, a + ∆a) · δxi · δxi′ + o((δx)2).

slide-11
SLIDE 11

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 11 of 14 Go Back Full Screen Close Quit

10. Estimating L1 (cont-d)

  • Since xopt is the action that, for a =

a+∆a, maximizes utility, we get ∂u ∂xi (xopt, a + ∆a) = 0.

  • Thus, the expression for L1 gets a simplified form

L1 = 1 2·

n

  • i−1

n

  • i′=1

∂2u ∂xi∂xi′ (xopt, a+∆a)·δxi·δxi′+o((δx)2).

  • We know that the values δxi are quadratic in ∆a.
  • Thus, we conclude that for the modified action, the loss

L1 is a 4-th order function of ∆aj: L1 =

m

  • j=1

m

  • j′=1

m

  • j′′=1

m

  • j′′′=1

kjj′j′′j′′′·∆aj·∆aj′·∆aj′′·∆aj′′′+o((∆a)5).

slide-12
SLIDE 12

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 12 of 14 Go Back Full Screen Close Quit

11. Conclusions

  • We conclude that:

– the loss L0 related to using the original plan is quadratic in ∆a, while – the loss L1 related to using a feasibly modified plan is of 4th order in terms of ∆a.

  • For small ∆a, we have L1 ∼ (∆a)4 ≪ L0 ∼ (∆a)2.
  • Let ε > 0 be the maximum loss that we tolerate.
  • Since L1 ≪ L0, we have three possible cases:

(1) ε < L1, (2) L1 ≤ ε ≤ L0, and (3) L0 < ε.

  • In the 1st case, even the modified action does not help.
  • In the 3rd case, the change in the situation is so small

that it is Ok to use the original plan x.

slide-13
SLIDE 13

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 13 of 14 Go Back Full Screen Close Quit

12. Conclusions (cont-d)

  • In the second case, we have exactly the Eisenhower

situation: – if we use the original plan x, the resulting loss L0 much larger than we can tolerate; – in this sense, the original plan is worthless; – on the other hand, if we feasible modify the original plan into xlin, then we get an acceptable action.

  • So, we indeed get a theoretical justification of Eisen-

hower’s observation.

slide-14
SLIDE 14

Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Options Estimating L0 Estimating L1 Conclusions Acknowledgments Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 14 of 14 Go Back Full Screen Close Quit

13. Acknowledgments

  • This work was supported in part by the National Sci-

ence Foundation grants:

  • HRD-0734825 and HRD-1242122 (Cyber-ShARE

Center of Excellence) and

  • DUE-0926721, and
  • by an award from Prudential Foundation.