iterative budgeted exponential search
play

Iterative Budgeted Exponential Search Malte Helmert 1 , Tor Lattimore - PowerPoint PPT Presentation

Iterative Budgeted Exponential Search Malte Helmert 1 , Tor Lattimore 2 , Levi H.S. Lelis 3 , Laurent Orseau 2 , Nathan Sturtevant 4 1 University of Basel, Switzerland 2 DeepMind, UK 3 Federal de Vi cosa, Brazil 4 University of Alberta, Canada


  1. Iterative Budgeted Exponential Search Malte Helmert 1 , Tor Lattimore 2 , Levi H.S. Lelis 3 , Laurent Orseau 2 , Nathan Sturtevant 4 1 University of Basel, Switzerland 2 DeepMind, UK 3 Federal de Vi¸ cosa, Brazil 4 University of Alberta, Canada IJCAI 2019 - Macau, China Talk jointly presented by Laurent Orseau and Nathan Sturtevant

  2. Start 5 4 3 2 1 h = 11 1 2 3 4 5 Goal

  3. Shortest Path Algorithm History 1960 1970 1980 1990 2000 2010 IDA* worst case IDA* IBEX A* A* worst case † Dijkstra’s Algorithm

  4. The N 2 problem N 2 • Algorithms perform � re-expansions of N states N 2 N = b d • IDA*: � expansions of � states 2 N • A*: � expansions of N states N 2 • B & B’: � (Martelli 1977; Mero, 1984) N 2 • Weighted A*: � (Chen et al, 2019) Iterative Budgeted Exponential Search 4

  5. IDA* Refresher • IDA* does iterative deepening search on f -costs • f(n) = g(n) + h(n) • Next iteration f -cost: • Smallest unexplored from previous iteration • If f -cost layers grow exponentially • IDA* is asymptotically optimal in node expansions Iterative Budgeted Exponential Search 5

  6. IDA* - Unit Costs

  7. f-cost 11 2 States x8 f-cost 13 16 States x5 79 States f-cost 15

  8. IDA* Worst Case • f -cost layers grow exponentially 1 + b + b 2 + b 3 + … + b d ≈ b d • � • What if f -cost layers grew linearly? 1 + 2 + 3 + 4 + … + b d ≈ ( b d ) 2 • � • Happens with non-unit edge costs: t + 2 • STP: Cost of moving tile t : � t + 1 Iterative Budgeted Exponential Search 8

  9. f-cost 11 11.25 13.5 13.7 13.45 13.83 13.87 13.62

  10. Previous Work • Iterative Deepening • IDA* (Korf, 1985) • IDA* CR (Sarkar et al, 1990) • IDA* IM (Burns & Ruml, 2013) • EDA* (Sharon et al, 2014) Iterative Budgeted Exponential Search 11

  11. A* (B/B’) • A* with a consistent heuristic is optimal* • With an inconsistent heuristic A* might do 2 N expansions of N states • B and B’ immediately propagate shorter paths • Will do at most N 2 expansions of N states Iterative Budgeted Exponential Search 13

  12. 0 N states 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 N states

  13. Previous Work • Inconsistent Heuristics • B/B’ (Martelli 1977; Mero, 1984) • Delay (Sturtevant et al, 2008) • BPMX (Felner et al, 2011) Iterative Budgeted Exponential Search 16

  14. Demos • All demos shown here runnable online: • https://www.movingai.com/SAS/BTS/ • Also running on my iPad; can give detailed demo Iterative Budgeted Exponential Search 18

  15. IDA* — Good Case

  16. IDA* — Good Case

  17. IDA* — Good Case

  18. IDA* — Good Case

  19. IDA* — Good Case

  20. IDA* — Good Case

  21. IDA* — Bad Case

  22. IDA* — Bad Case

  23. IDA* — Bad Case

  24. IDA* — Bad Case

  25. IDA* — Bad Case

  26. IDA* — Bad Case

  27. IDA* — Bad Case

  28. IDA* — Bad Case

  29. IDA* — Bad Case

  30. EDA*

  31. EDA*

  32. EDA*

  33. EDA*

  34. EDA*

  35. EDA*

  36. EDA*

  37. IBEX (main idea) For budget = 2, 4, 8, 16, ...: cost_bound = Oracle(budget) search within cost_bound and within budget Oracle(budget) returns largest cost for which budget is su ffi cient.

  38. IBEX (main idea): With an oracle budget f -cost expansions 2

  39. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2

  40. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2

  41. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4

  42. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4 13.5

  43. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4 13.5 4

  44. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4 13.5 4 8 13.9 8

  45. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4 13.5 4 8 13.9 8 16 14.4 16

  46. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4 13.5 4 8 13.9 8 16 14.4 16 32 16.8 30

  47. IBEX (main idea): With an oracle budget f -cost expansions 2 11.2 2 4 13.5 4 8 13.9 8 16 14.4 16 32 16.8 30 64 17.7 62

  48. IBEX (main idea) For budget = 2, 4, 8, 16, ...: cost_bound = Oracle(budget) search within cost_bound and within budget

  49. IBEX (main idea) For budget = 2, 4, 8, 16, ...: cost_bound = Oracle(budget) search within cost_bound and within budget Oracle(budget) returns largest cost for which budget is su ffi cient. Oracle(budget) = (budgeted) exponential search

  50. IBEX (main idea)

  51. IBEX (main idea)

  52. IBEX (main idea)

  53. IBEX (main idea)

  54. IBEX (main idea)

  55. IBEX (main idea)

  56. IBEX (main idea)

  57. IBEX (main idea)

  58. IBEX (main idea)

  59. IBEX (main idea) N ↓ For budget = 2, 4, 8, 16, 32, 64 : ≤ 4 N  With exponential search :  Refine lower and upper bound on cost  O (log C ∗ ) Proves no solution when lower = upper

  60. Application: Tree Search IBEX + Depth-First Search = Budgeted Tree Search (BTS) Alg. Worst case Θ ( N 2 ) IDA* (1985) EDA* (2014) unbounded BTS O ( N log C ∗ ) C ∗ : cost of the optimal solution N : number of nodes of cost at most C ∗ All algorithms return an optimal solution (under usual assumptions).

  61. Application: Graph Search IBEX + Uniform-Cost Search = Budgeted Graph Search (BGS) Alg. Admissible Heuristic Consistent Inconsistent Ω (2 N ) A* (1968) O ( N ) Θ ( N 2 ) B (1977) O ( N ) O ( N ) † BGS O ( N log C ∗ ) C ∗ : cost of the optimal solution N : number of nodes of cost at most C ∗ All algorithms return an optimal solution (under usual assumptions).

  62. Enhancements: Tree search Recover IDA* when exponential unit-cost domains 1) Infinite iteration at next f -cost 2) If expansions grow exponentially, go to 1) else exponential search

  63. Enhancements: Tree search Recover IDA* when exponential unit-cost domains 1) Infinite iteration at next f -cost 2) If expansions grow exponentially, go to 1) else exponential search Budget window [2 , 8] to be more aggressive

  64. Experiments Algorithm 15-Puzzle (unit) 15-Puzzle (real) Coconut Exp. × 10 6 Exp. × 10 6 Exp. × 10 4 Solved Solved Solved BTS 100 242.5 100 673.1 100 84.7 EDA* 99 5 586.0 100 2 882.3 3 ≥ 554 249 . 0 IDA* CR 100 868.4 100 700.6 3 ≥ 516 937 . 7 IDA* 100 242.5 57 62 044.3 100 5 484.2 100 100.8 100 258.1 100 2.7 N BTS (Budgeted Tree Search) = IBEX + Depth-First Search

  65. Experiments Algorithm 15-Puzzle (unit) 15-Puzzle (real) Coconut Exp. × 10 6 Exp. × 10 6 Exp. × 10 4 Solved Solved Solved BTS 100 242.5 100 673.1 100 84.7 EDA* 99 5 586.0 100 2 882.3 3 ≥ 554 249 . 0 IDA* CR 100 868.4 100 700.6 3 ≥ 516 937 . 7 IDA* 100 242.5 57 62 044.3 100 5 484.2 100 100.8 100 258.1 100 2.7 N BTS (Budgeted Tree Search) = IBEX + Depth-First Search

  66. Experiments Algorithm 15-Puzzle (unit) 15-Puzzle (real) Coconut Exp. × 10 6 Exp. × 10 6 Exp. × 10 4 Solved Solved Solved BTS 100 242.5 100 673.1 100 84.7 EDA* 99 5 586.0 100 2 882.3 3 ≥ 554 249 . 0 IDA* CR 100 868.4 100 700.6 3 ≥ 516 937 . 7 IDA* 100 242.5 57 62 044.3 100 5 484.2 100 100.8 100 258.1 100 2.7 N BTS (Budgeted Tree Search) = IBEX + Depth-First Search

  67. Conclusion IBEX: Budgeting → O ( N log C ) Tree search: drop-in replacement for IDA* DovIBEX: For applications where oracle returns only if a solution is found Interleaves budget iterations Do you have a problem where the current worst case is O ( N 2 ), due to being cautious? Maybe (Dov)IBEX can help! Come and see our poster!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend