best first utility guided search
play

Best-first Utility-guided Search Wheeler Ruml and Minh B. Do - PowerPoint PPT Presentation

Best-first Utility-guided Search Wheeler Ruml and Minh B. Do Embedded Reasoning Area Palo Alto Research Center { ruml , minhdo } @parc.com Wheeler Ruml (PARC) Best-first Utility-guided Search 1 / 10 Overview Overview Anytime Algs


  1. Best-first Utility-guided Search Wheeler Ruml and Minh B. Do Embedded Reasoning Area Palo Alto Research Center { ruml , minhdo } @parc.com Wheeler Ruml (PARC) Best-first Utility-guided Search – 1 / 10

  2. Overview ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation A* takes too long: we must trade cost for time. ■ Gridworld ■ MSA ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 2 / 10

  3. Overview ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation A* takes too long: we must trade cost for time. ■ Gridworld ■ MSA ■ Planning ■ Summary U ( s, t ) = − w f · f ( s ) − w t · t Wheeler Ruml (PARC) Best-first Utility-guided Search – 2 / 10

  4. Overview ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation A* takes too long: we must trade cost for time. ■ Gridworld ■ MSA ■ Planning ■ Summary U ( s, t ) = − w f · f ( s ) − w t · t Anytime algorithms are annoying to use and to design. Utility-guided search is a promising alternative. Wheeler Ruml (PARC) Best-first Utility-guided Search – 2 / 10

  5. The Anytime Approach ■ Overview ■ Anytime Algs ■ Bugsy ■ Properties ■ Evaluation ■ Gridworld ■ MSA ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 3 / 10

  6. The Anytime Approach Requires a termination policy, assuming: ■ Overview ■ Anytime Algs relevant features for predicting progress are known ■ ■ Bugsy ■ Properties training data available ■ ■ Evaluation new instance is similar in relevant aspects to training ■ ■ Gridworld ■ MSA relevant aspects are known ■ ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 3 / 10

  7. The Anytime Approach Requires a termination policy, assuming: ■ Overview ■ Anytime Algs relevant features for predicting progress are known ■ ■ Bugsy ■ Properties training data available ■ ■ Evaluation new instance is similar in relevant aspects to training ■ ■ Gridworld ■ MSA relevant aspects are known ■ ■ Planning ■ Summary Impossible to design optimally: f = 5 f = 4 d = 1 d = 2 Must know the user’s utility function! Wheeler Ruml (PARC) Best-first Utility-guided Search – 3 / 10

  8. Best-first Utility-guided Search, Yes! Want best-first search according to: ■ ■ Overview ■ Anytime Algs ■ Bugsy U ( n ) = s under n ( − w f · f ( s ) − w t · t ( s )) max ■ Properties ■ Evaluation ■ Gridworld ■ MSA ■ Planning ■ Summary Wheeler Ruml (PARC) Best-first Utility-guided Search – 4 / 10

  9. Best-first Utility-guided Search, Yes! Want best-first search according to: ■ ■ Overview ■ Anytime Algs ■ Bugsy U ( n ) = s under n ( − w f · f ( s ) − w t · t ( s )) max ■ Properties ■ Evaluation ■ Gridworld ■ MSA Approximate s under n by cheapest and nearest ■ ■ Planning ■ Summary ◆ f ( cheapest ) = f ( n ) = g ( n ) + h ( n ) ◆ f ( nearest ) seems straightforward in many domains Estimate d ( n ) and convert to t ( n ) . ■ ◆ d ( n ) seems straightforward in many domains Wheeler Ruml (PARC) Best-first Utility-guided Search – 4 / 10

  10. Properties Different from anytime algorithms ■ Overview ■ Anytime Algs no need for termination policy, training data ■ ■ Bugsy ■ Properties can spend all effort pursuing one solution ■ ■ Evaluation no fixed trade-off ■ ■ Gridworld ■ MSA ■ Planning cost ■ Summary Bugsy anytime solutions utility time Wheeler Ruml (PARC) Best-first Utility-guided Search – 5 / 10

  11. Properties Different from anytime algorithms ■ Overview ■ Anytime Algs no need for termination policy, training data ■ ■ Bugsy ■ Properties can spend all effort pursuing one solution ■ ■ Evaluation no fixed trade-off ■ ■ Gridworld ■ MSA ■ Planning cost ■ Summary anytime solutions utility Bugsy time Wheeler Ruml (PARC) Best-first Utility-guided Search – 5 / 10

  12. Properties Different from anytime algorithms ■ Overview ■ Anytime Algs no need for termination policy, training data ■ ■ Bugsy ■ Properties can spend all effort pursuing one solution ■ ■ Evaluation no fixed trade-off ■ ■ Gridworld ■ MSA ■ Planning ■ Summary Similar to weighted A* iff h = d . but intuitive meaning for weight ■ otherwise, exploits additional information ■ Reasonable properties Complete if h and d are reasonable ■ Optimal if h and d are perfect ■ Wheeler Ruml (PARC) Best-first Utility-guided Search – 5 / 10

  13. Empirical Evaluation Algorithms: ■ ■ Overview ■ Anytime Algs ◆ Bugsy ■ Bugsy ■ Properties Anytime Replanning A* (ARA*), Likhachev et al. (2004) ◆ ■ Evaluation ◆ Anytime A* (AA*), Hansen et al. (1997) ■ Gridworld ■ MSA ◆ Greedy (Gr), Doran and Michie (1966) ■ Planning ◆ A*, Hart et al. (1968) ■ Summary Wide variety of utility functions. ■ Record CPU time and solution quality for every solution. ■ Assume clairvoyant termination for anytime algorithms. ■ Normalize utilities from 0–100. ■ See paper for full results. ■ (These results are conservative.) Wheeler Ruml (PARC) Best-first Utility-guided Search – 6 / 10

  14. Gridworld Pathfinding U () ARA* AA* Gr A* Bugsy ■ Overview ■ Anytime Algs time only 59 100 100 100 100 ■ Bugsy 500 microsec 99 99 99 59 100 ■ Properties ■ Evaluation 1 msec 98 98 59 99 99 ■ Gridworld 5 msec 91 93 90 59 99 ■ MSA ■ Planning 10 msec 82 86 80 59 99 ■ Summary 50 msec 97 25 54 19 65 0.1 sec 97 60 63 19 82 cost only 19 98 98 98 98 Wheeler Ruml (PARC) Best-first Utility-guided Search – 7 / 10

  15. Multiple Sequence Alignment U () ARA* AA* Gr A* Bugsy ■ Overview ■ Anytime Algs time only 54 100 100 100 100 ■ Bugsy 0.1 sec 97 98 96 54 99 ■ Properties ■ Evaluation 0.5 sec 83 88 76 52 92 ■ Gridworld 1 sec 68 79 54 51 80 ■ MSA ■ Planning 5 sec 68 71 25 73 75 ■ Summary 10 secs 78 75 74 25 78 cost only 82 82 82 24 82 Wheeler Ruml (PARC) Best-first Utility-guided Search – 8 / 10

  16. Temporal Planning U () ARA* AA* A* Bugsy ■ Overview ■ Anytime Algs zenotravel-7 ■ Bugsy 500 microsec 69 0 81 100 ■ Properties ■ Evaluation 1 msec 71 0 83 100 ■ Gridworld 5 msec 74 0 85 100 ■ MSA ■ Planning 10 msec 84 0 96 100 ■ Summary 50 msec 91 91 0 100 0.5 sec 97 97 0 100 5 sec 99 99 0 100 rovers-5 500 microsec 67 0 62 100 1 msec 72 0 66 100 5 msec 77 0 71 100 10 msec 92 0 93 100 50 msec 78 0 93 100 Wheeler Ruml (PARC) Best-first Utility-guided Search – 9 / 10

  17. Summary Anytime algorithms are annoying to use and to design. ■ ■ Overview ■ Anytime Algs ■ Bugsy Utility-based search is a promising and practical alternative. ■ ■ Properties ■ Evaluation Extendable to solving deadlines and plan execution deadlines. ■ ■ Gridworld ■ MSA ■ Planning ■ Summary There’s information beyond g ( n ) and h ( n ) , namely d ( n ) . ■ Wheeler Ruml (PARC) Best-first Utility-guided Search – 10 / 10

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend