Sokoban:( Enhancing(general(single2agent( - PowerPoint PPT Presentation

Sokoban:( Enhancing(general(single2agent( search(methods(using(domain( knowledge8 Andreas Junghanns – Jonathan Schaeffer presented by Pascal Düblin

What(is(Sokoban?8 • Computer game • Example: • Goal: o Push stones with the man (smiley) on goal squares (red shaded squares) • Rules: o No pull, only push moves o If a stone cannot be pushed and isn‘t on a goal square, the game is lost

Sokobans(search2space8 Property( Specifics( 240Puzzle( Rubik‘s8Cube( Sokoban( Branching(factor8 Average8 2.378 13.358 128 Range8 1238 122158 021368 Solution(length8 Average8 100+8 188 2608 Range8 12unkown8 12208 9726748 Search2space(size8 Upper(bound8 10 358 10 198 10 988 Calculation(of8 Full8 O(n)8 O(n)8 O(n 3 )8 Lower(bound8 Incremental8 O(1)8 O(1)8 O(n 2 )8 Underlying(graph8 Undirected8 Undirected8 Directed8

Application2dependent( techniques8 • Sokoban-solver: Rolling Stone • Node limit: 20 million nodes • Basis for Rolling-Stone: IDA* • 3 years of work • 90 Sokoban test problems

Basic(Implementation:( IDA*8 • IDA* = Iterative deepening A* • Similar approach like iterative deepening depth first search • F-value of A* is limited • This limit surges with each iteration of the depth first search

Simple(lower(Bound8 • Heuristic for IDA* • Example: • Manhattan Distance to nearest goal square • Sum of all distances • In example: 5 • Problems solved: 0

Minimum(matching( lower(bound((R0)((1)8 • Improved heuristic for • Example: IDA* A 2 1 • Each goal square can B8 only be taken from one 3 stone C • Manhattan Distance to nearest and reachable goal squares

Minimum(matching( lower(bound((R0)((2)8 • Algorithm realize that • Example: the goal square of a stone is not always the A 2 nearest 1 B8 3 28 18 A 68 98 C 68 28 B8 ∞ ∞ ∞ 18 38 C 28

Minimum(matching( lower(bound((R0)((3)8 • Algorithm realize that • Example: the goal square of a stone is not always the A 2 nearest 1 B8 3 28 18 A 68 98 C 68 28 B8 ∞ ∞ Goals:( A( B( C( ∞ 18 1( 28 6( 98 38 C 28 2( 6( 28 28 3( 28 18 2(

Minimum(matching( lower(bound((R0)((4)8 • Heuristic value: 14 • Example: • Better heuristic value, h A 2 closer at h* 1 B8 3 • The result is an C enormous reduction of the search space Goals:( A( B( C( 1( 28 6( 98 • Problems solved: 0 2( 6( 28 28 3( 28 18 2(

Transposition(table((R1)8 • All visited states stored in the transposition table • Avoid visiting duplicated states/nodes • Duplicate elimination before expanding next node • Similar to close list • Problems solved: 5

Move(ordering((R2)8 • Order how nodes will be expanded • Actions (Moves) are sorted with the most promising actions first. • Sorting criteria: 1. Move the same stone 2. Move that mimimize the lower bound (optimal move) 3. Stone first that is nearest of its goal square 4. Non optimal moves sorted by the same criteria as above • Problems solved: 4

Deadlock(table((R3)8 • 4 x 5 region • Example: • Find all arangements of stones, wall squares and the man • Store all deadlocks • During IDA* search: check state against the deadlock table • Problems solved: 5

Tunnel(macros((R4)8 • Macros: Combine a group of moves • All tunnel moves are made all at once • Problems solved: 6

Goal(macros((R5)((1)8 • In Sokoban goal • Example: squares are often grouped in a gaol area • In this case Sokoban can be split in two subproblems:

Goal(macros((R5)((1)8 • In Sokoban goal • Example: squares are often grouped in a gaol area • In this case Sokoban can be split in two subproblems: o Push a stone on an entrance square

Goal(macros((R5)((1)8 • In Sokoban goal • Example: squares are often grouped in a gaol area • In this case Sokoban can be split in two subproblems: o Push a stone on an entrance square o Push a the stone on a goal square

Goal(macros((R5)((2)8 • Precomputed for each • Example: problem and room, in which specified order the man has to push the stones on their goal squares.

Goal(macros((R5)((3)8 • If a stone is on an • Example: entrance square, the goal macro will be executed • Problems solved: 17

Goal(macros((R5)((4)8 • Goal macro moves are • Example: grouped to one • All other possible moves are ignored • Problems solved: 17

Goal(cuts((R6)((1)8 • If push „b“ starts a goal • Example: macro (stone reaches a entrance square), all childs of the parent of „b“ will be pruned b • Problems solved: 24

Goal(cuts((R6)((2)8 • If push „b“ starts a goal macro (stone reaches a entrance square), all childs of the parent of a8 „b“ will be pruned b8 c8 • In example the branch with move „c“ and „d“ is pruned after the goal Goal( (d8 macro is reached macro8 • Problems solved: 24

Goal(cuts((R6)((3)8 • If push „b“ starts a goal macro (stone reaches a entrance square), all childs of the parent of a8 „b“ will be pruned b8 c8 • In example the branch with move „c“ and „d“ is pruned after the goal Goal( (d8 macro is reached macro8 • Problems solved: 24

Pa_ern(search((R7)((1)8 • Goal: Find Deadlock or increase lower bound estimation (Slide: Overestimation) • PIDA*: IDA* version for pattern search • Example:

Pa_ern(search((R7)((2)8 • Original maze: • Test maze:

Pa_ern(search((R7)((2)8 • Original maze: • Test maze: • => Deadlock occurs

Pa_ern(search((R7)((3)8 • Final steps: • Minimum pattern: • Calculate minimum deadlock pattern • Add pattern to pattern table • During IDA* search: • Check state against pattern search table • Problems solved: 48

Relevance(cuts((R8)8 • Goal: to find independent subproblems • Move only if the last # number of moves influence it • Properties of influence/ relevance: 1. Alternatives 2. Goal-Skew 3. Connection 4. Tunnel • Problems solved: 50

Overestimation((R9)8 • A* is optimal, if h() is admissible • Admissible: � n, h(n) ≤ C(n) | C(n) : actual cost to reach goal from n • Non-admissible h => search often more accurate, but no longer optimal • Sum of max penalty per stone added to h (penalties calculated from pattern search) • => optimality no longer garanteed • Problems solved: 54

Rapid(random(restarts( (R10)8 • Restarts with more randomization in move ordering (less strictness) • A certain number of restarts with the same f-limit • If f-limit increases, the randomization of move ordering will fall back to zero • Problems solved: 57

Comparison(table8 #8 In8prop.8 In8prop.8 #8less8 In8prop.8 In8prop.8 solved( to890( to857( solved8 to890( To857( without8this8 approach( Transposition(table8 58 5.6%8 8.8%8 198 21.1%8 33.3%8 Move(ordering8 48 4.4%8 7.0%8 0218 021.1%8 021.8%8 Deadlock(table8 58 5.6%8 8.8%8 0218 021.1%8 021.8%8 Tunnel(macros8 68 6.7%8 10.5%8 0218 021.1%8 021.8%8 Goal(macros8 178 18.9%8 29.8%8 338 36.7%8 57.9%8 Goal(cuts8 248 26.7%8 42.1%8 0218 021.1%8 021.8%8 Pa_ern(search8 488 53.3%8 84.2%8 228 24.4%8 38.6%8 Relevance(cuts8 508 55.6%8 87.7%8 0218 021.1%8 021.8%8 Overestimation8 548 60%8 94.7%8 0218 021.1%8 021.8%8 RR(Restart8 578 63.3%8 100%8 0218 021.1%8 021.8%8

Comparison(table8 #8 In8prop.8 In8prop.8 #8less8 In8prop.8 In8prop.8 solved( to890( to857( solved8 to890( To857( without8this8 approach( Minimum(matching8 08 0%8 0%8 0218 021.1%8 021.8%8 Transposition(table8 58 5.6%8 8.8%8 198 21.1%8 33.3%8 Move(ordering8 48 4.4%8 7.0%8 0218 021.1%8 021.8%8 Deadlock(table8 58 5.6%8 8.8%8 0218 021.1%8 021.8%8 Tunnel(macros8 68 6.7%8 10.5%8 0218 021.1%8 021.8%8 Goal(macros8 178 18.9%8 29.8%8 338 36.7%8 57.9%8 Goal(cuts8 248 26.7%8 42.1%8 0218 021.1%8 021.8%8 Pa_ern(search8 488 53.3%8 84.2%8 228 24.4%8 38.6%8 Relevance(cuts8 508 55.6%8 87.7%8 0218 021.1%8 021.8%8 Overestimation8 548 60%8 94.7%8 0218 021.1%8 021.8%8 RR(Restart8 578 63.3%8 100%8 0218 021.1%8 021.8%8

Sokoban:( Enhancing(general(single2agent( - PowerPoint PPT Presentation

Sokoban:( Enhancing(general(single2agent( search(methods(using(domain( knowledge8 Andreas Junghanns Jonathan Schaeffer presented by Pascal Dblin What(is(Sokoban?8 Computer game Example: Goal: o Push stones with the man

A Tabled Prolog Program for Solving Sokoban Neng-Fa Zhou Agostino Dovier Department of Computer

Outline DM811 Fall 2009 Heuristics for Combinatorial Optimization 1. Complete Search Methods

Search Engines Issues Avi Rappoport Search Tools Consulting Search Issues Enterprise Search

Direct Search Methods (nongradient methods) 1. Random search methods 2. Univariate method (one

Enhancing the Search Box Greg Gershman Baltimore, MD greg@shelrick.com

Wednesday, November 30, 2016 3:41 PM General Page 1 General Page 2 General Page 3 General Page

Tabu Search Search Tabu Page 1 Part I Part I Tabu Search Principles Search Principles Tabu

Uninformed Search 2 Informed Search Rest of blind search An informed search strategyone

Informed search algorithms Outline Best-first search Greedy best-first search A *

Foundations of Artificial Intelligence 9. State-Space Search: Tree Search and Graph Search Malte

Stochastic Local Search Methods Dynamic Local Search Iterated Local Search Tabu Search Marco

Graph Search Methods Graph Search Methods A search method starts at a given vertex v and

Contents Foundations of Artificial Intelligence Best-First Search 1 4. Informed Search Methods

Elastic Search - Aditi Choksi (EW18455) Elastic Search Search engine Distributed

2 EBI Search 3 EBI Search 4 EBI

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

The Graphplan Planner Searching the Planning Graph 1 Literature Malik Ghallab, Dana Nau,

Java for Non Majors Midterm Study Guide April 3, 2017 The test consists of 1. Multiple choice

Single Student Project Spec Sandy Louchart Mike Chantler, 4/7/2011 Overview You have to

Games I An Introduction Lecture 26 COMPSCI 111/111G S2 2020 Definitions: Play } Range of

T-DUALITY FOR MASSIVE STATES IN STRING THEORY Jnan Maharana March 8, 2013 In Memory of Sumitra

Social bias and fairness in NLP GAIA Conference 2020 Olof Mogren, PhD RISE Research Institutes

Words Matter: Upgrading Your Communications With Inclusive Language Nikki Hopewell (she/her)

Voter, What Message Will Motivate You to Verify Your Vote? Maina Olembo, Karen Renaud, Steffen