plans are worthless but
play

Plans Are Worthless but Options Estimating L 0 Planning Is - PowerPoint PPT Presentation

Eisenhowers Observation Rational Decision . . . In These Terms, What . . . Plans Are Worthless but Options Estimating L 0 Planning Is Everything: Estimating L 1 Conclusions A Theoretical Explanation Acknowledgments of Eisenhowers


  1. Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . Plans Are Worthless but Options Estimating L 0 Planning Is Everything: Estimating L 1 Conclusions A Theoretical Explanation Acknowledgments of Eisenhower’s Observation Home Page Title Page Angel F. Garcia Contreras, Martine Ceberio, and Vladik Kreinovich ◭◭ ◮◮ ◭ ◮ Department of Computer Science University of Texas at El Paso Page 1 of 14 500 W. University El Paso, Texas 79968, USA Go Back afgarciacontreras@miners.utep.edu, Full Screen mceberio@utep.edu@utep.edu, vladik@utep.edu Close Quit

  2. Eisenhower’s Observation 1. Eisenhower’s Observation Rational Decision . . . In These Terms, What . . . • Dwight D. Eisenhower was: Options – the Supreme Commander of the Allied Expedi- Estimating L 0 tionary Forces in Europe during WW2 Estimating L 1 – and later the US President. Conclusions Acknowledgments • He emphasized that his war experience taught him that Home Page “plans are worthless, but planning is everything”. Title Page • At first glance, this sounds paradoxical: if plans are worthless, why bother with planning at all? ◭◭ ◮◮ ◭ ◮ • In this paper, we show that this Eisenhower’s observa- tion has a meaning: Page 2 of 14 – while following the original plan in constantly Go Back changing circumstances is often not a good idea, Full Screen – the existence of a pre-computed original plan en- Close ables us to produce an almost-optimal strategy. Quit

  3. Eisenhower’s Observation 2. Rational Decision Making: a Brief Reminder Rational Decision . . . In These Terms, What . . . • According to decision making theory: Options – decisions by a rational decision maker Estimating L 0 – can be described as maximize the value a certain Estimating L 1 function known as utility. Conclusions Acknowledgments • E.g., in financial situations, when a company needs to Home Page make a decision, the overall profit can be used as utility. Title Page • To describe a possible action x , we usually need to describe the values of several quantities x 1 , . . . , x n . ◭◭ ◮◮ • E.g., a decision about a plant involves selecting ◭ ◮ amounts x i of manufactured gadgets of different type. Page 3 of 14 • Similarly, we need several quantities a 1 , . . . , a m to de- Go Back scribe a situation. Full Screen • Let u ( x, a ) denote the utility that results from perform- Close ing action x in situation a . Quit

  4. Eisenhower’s Observation 3. In These Terms, What Is Planning Rational Decision . . . In These Terms, What . . . • Let � a describe the original situation. Options • Based on this situation, we come up with an action � x Estimating L 0 that maximizes the corresponding utility: Estimating L 1 Conclusions u ( � x, � a ) = max u ( x, � a ) . x Acknowledgments Home Page • Computing this optimal action � x is what we usually call planning . Title Page ◭◭ ◮◮ • When we need to start acting, the situation may have changed to a � = � a . ◭ ◮ def • Let us denote the corresponding change by ∆ a = a − � a , Page 4 of 14 then a = � a + ∆ a . Go Back Full Screen Close Quit

  5. Eisenhower’s Observation 4. Options Rational Decision . . . In These Terms, What . . . • One possibility is to simply ignore the change, and ap- Options ply the original plan � x to the new situation a = � a +∆ a . Estimating L 0 • This plan is, in general, not optimal for the new situ- Estimating L 1 ation. Conclusions • The actually optimal plan is x opt for which Acknowledgments Home Page u ( x opt , � a + ∆ a ) = max u ( x, � a + ∆ a ) . Title Page x ◭◭ ◮◮ • In comparison with the optimal plan, we lose the def = u ( x opt , � amount L 0 a + ∆ a ) − u ( � x, � a + ∆ a ) . ◭ ◮ • Why cannot we just find the optimal solution for the Page 5 of 14 new situation? Go Back • Optimization is NP-hard, so, it is not possible to find Full Screen the exact optimum in reasonable time. Close Quit

  6. Eisenhower’s Observation 5. Options (cont-d) Rational Decision . . . In These Terms, What . . . • What we can do is: Options – try to use some feasible algorithm – e.g., solving a Estimating L 0 system of linear equations, Estimating L 1 – to modify the plan � x into � x + ∆ x . Conclusions • Due to NP-hardness, this feasibly modified plan is, in Acknowledgments Home Page general, not optimal. def Title Page = u ( x opt , � • We hope that the resulting loss L 1 a +∆ a ) − u ( � x + ∆ x, � a + ∆ a ) is much smaller than L 0 . ◭◭ ◮◮ • In this paper, we show that indeed L 1 ≪ L 0 ; so: ◭ ◮ – even if L 0 is so large that the original plan is worth- Page 6 of 14 less, Go Back – the modified plan may leads to a reasonably small Full Screen loss L 1 ≪ L 0 . Close • This explains Eisenhower’s observation. Quit

  7. Eisenhower’s Observation 6. Estimating L 0 Rational Decision . . . In These Terms, What . . . • We assume that the difference ∆ a is reasonably small. Options def • So, the corresponding difference in action ∆ x opt = Estimating L 0 x opt − � x is also small. Estimating L 1 Conclusions • We can therefore expand L 0 in Taylor series and keep Acknowledgments only terms linear and quadratic in ∆ x : Home Page a + ∆ a ) − u ( x opt − ∆ x opt , � L 0 = u ( x opt , � a + ∆ a ) = Title Page n � ∂u a + ∆ a ) · ∆ x opt ( x opt , � i + ◭◭ ◮◮ ∂x i i =1 ◭ ◮ n n � � ∂ 2 u 1 a +∆ a ) · ∆ x opt · ∆ x opt ∂x i ∂x i ′ ( x opt , � i ′ + o ((∆ a ) 2 ) . 2 · Page 7 of 14 i i =1 i ′ =1 Go Back • By definition, the action x opt maximizes u ( x, � a + ∆ a ). Full Screen • Thus, we have ∂u ( x opt , � a + ∆ a ) = 0. Close ∂x i Quit

  8. Eisenhower’s Observation 7. Estimating L 0 (cont-d) Rational Decision . . . In These Terms, What . . . • So, the above expression for L 0 takes the simplified Options form Estimating L 0 n n � � ∂ 2 u L 0 = 1 a +∆ a ) · ∆ x opt i · ∆ x opt ∂x i ∂x i ′ ( x opt , � i ′ + o ((∆ a ) 2 ) . Estimating L 1 2 · Conclusions i =1 i ′ =1 Acknowledgments • ∆ x opt can be estimated from the condition: i Home Page ∂u a + ∆ a ) = ∂u ( x opt , � x + ∆ x opt , � ( � a + ∆) = 0 . Title Page ∂x i ∂x i ◭◭ ◮◮ x , so ∂u • For a = � a , u is max when x = � ( � a ) = 0. x, � ◭ ◮ ∂x i • Expanding the equation in Taylor series in ∆ x i and Page 8 of 14 ∆ a j and taking ∂u ( � a ) = 0 into account, we get: x, � Go Back ∂x i Full Screen � n � m ∂ 2 u ∂ 2 u a ) · ∆ x opt ∂x i ∂x i ′ ( � i ′ + ( � a ) · ∆ a j + o (∆ x, ∆ a ) = 0 . x, � x, � Close ∂x i ∂a j i ′ =1 j =1 Quit

  9. Eisenhower’s Observation 8. Estimating L 0 (final) Rational Decision . . . In These Terms, What . . . • Thus, the first approximation ∆ x i to the values ∆ x opt i Options satisfies a system of linear equations: Estimating L 0 n m � � ∂ 2 u ∂ 2 u Estimating L 1 ∂x i ∂x i ′ ( � a ) · ∆ x j = − ( � a ) · ∆ a j . x, � x, � ∂x i ∂a j Conclusions i ′ =1 j =1 Acknowledgments • A solution to a system of linear equations is a linear Home Page combination of the right-hand sides. Title Page • Thus, the values ∆ x i are a linear function of ∆ a j . ◭◭ ◮◮ • Substituting these linear expressions into the formula ◭ ◮ for L 0 , we conclude that L 0 is quadratic in ∆ a j : Page 9 of 14 � m � m Go Back k jj ′ · ∆ a j · ∆ a j ′ + o ((∆ a ) 2 ) for some k jj ′ . L 0 = Full Screen j =1 j ′ =1 Close Quit

  10. Eisenhower’s Observation 9. Estimating L 1 Rational Decision . . . In These Terms, What . . . • The 1st approximation ∆ x to the difference ∆ x opt can Options be obtained by solving a system of linear equations. Estimating L 0 • How much do we lose if we use x lin = � x + ∆ x ? Estimating L 1 • Here, ∆ x opt = ∆ x + δx , where δx is of 2nd order in ∆ x Conclusions and ∆ a : δx = O ((∆ a ) 2 ). Acknowledgments • The loss L 1 of using x lin = x opt − δx instead of x opt is: Home Page Title Page L 1 = u ( x opt , � a + ∆ a ) − u ( x lin , � a + ∆ a ) = a + ∆ a ) − u ( x opt − δx, � ◭◭ ◮◮ u ( x opt , � a + ∆ a ) . ◭ ◮ • If we expand this expression in δx , we get: � n Page 10 of 14 ∂u ( x opt , � L 1 = a + ∆ a ) · δx i + ∂x i Go Back i =1 � n � n Full Screen ∂ 2 u 1 ∂x i ∂x i ′ ( x opt , � a + ∆ a ) · δx i · δx i ′ + o (( δx ) 2 ) . 2 · Close i =1 i ′ =1 Quit

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend