An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem - PowerPoint PPT Presentation

An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem 1 Tom Everitt Marcus Hutter Australian National University September 3, 2015 Everitt, T. and Hutter, M. (2015a). Analytical Results on the BFS vs. DFS Algorithm Selection Problem. Part I: Tree Search. In 28th Australian Joint Conference on Artificial Intelligence Everitt, T. and Hutter, M. (2015b). Analytical Results on the BFS vs. DFS Algorithm Selection Problem. Part II: Graph Search. In 28th Australian Joint Conference on Artificial Intelligence 1 BFS=Breadth-first search, DFS=Depth-first search Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 1 / 21

Outline Motivation and Background 1 Simple model 2 Expected Runtimes Decision Boundary More General Models 3 Experimental Results 4 Conclusions 5 Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 2 / 21

Motivation (Graph) search is a fundamental AI problem: planning, learning, problem solving Hundreds of algorithms have been developed, including metaheuristics such as simulated annealing, genetic algorithms. These are often heuristically motivated, lacking solid theoretical footing. For theoretical approach, return to basics: BFS and DFS. So far, mainly worst-case results have been available (we focus on average/expected runtime). Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 3 / 21

Breadth-first Search (BFS) Korf et al. (2001) found a clever way to analyse IDA*, which essentially is a generalisation of BFS. Later generalised by Zahavi et al. (2010). Both are essentially worst-case results. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 4 / 21

Depth-first Search (DFS) Knuth (1975) developed a way to estimate search tree size and DFS worst-case performance. s 0 Refinements and applications Purdom (1978): Use several branches instead of one Chen (1992): Use stratified sampling Kilby et al. (2006): The estimates can be used to select Assume the same number of children best SAT algorithm in other branches. Estimate ≈ 2 · 3 · 3 · 2 = 36 leaves. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 5 / 21

Potential gains We focus on average or expected runtime of BFS and DFS rather than worst-case. Selling points: Good to have an idea how long a search might take Useful for algorithm selection (Rice, 1975) May be used for constructing meta-heuristics Precise understanding of basics often useful Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 6 / 21

BFS and DFS BFS and DFS are opposites. DFS BFS 1 1 2 9 2 3 3 10 13 4 6 7 6 5 4 5 7 8 11 12 14 15 8 9 10 11 12 13 14 15 focuses near the start node focuses far from the start node Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 7 / 21

Formal setups We analyse BFS and DFS expected runtime in a sequence of increasingly general models. 1 Tree with a single level of goals 2 Tree with multiple levels of goals 3 General graph Increasingly coarse approximations are required Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 8 / 21

Simplest model - Tree with Single Goal Level Our simplest model assumes a complete tree with: A max search depth D ∈ N , D = 3 , Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 9 / 21

Simplest model - Tree with Single Goal Level Our simplest model assumes a complete tree with: A max search depth D ∈ N , A goal level g ∈ { 0 , . . . , D } D = 3 , g = 2 , Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 9 / 21

Simplest model - Tree with Single Goal Level Our simplest model assumes a complete tree with: A max search depth D ∈ N , A goal level g ∈ { 0 , . . . , D } Nodes on level g are goals with goal probability p ∈ [0 , 1] (iid). D = 3 , g = 2 , p = 1 / 3 Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 9 / 21

BFS Runtime 1 Expected BFS search time is E [ t BFS ] = 2 g − 1 + 1 /p 2 3 The position Y of the first Proof. 4 5 6 7 goal is geometrically distributed with E [ Y ] = 1 /p . 8 9 10 11 12 13 14 15 Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 10 / 21

DFS Runtime Expected DFS search time is 1 2 D − g +1 E [ t DFS ] ≈ (1 /p − 1) 2 9 � �� size of number of subtrees subtrees 3 6 10 13 Proof. There are (1 /p − 1) red mini- trees of size ≈ 2 D − g +1 . It turns out 4 5 7 8 11 12 14 15 that the blue nodes do not substan- tially affect the count in most cases. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 11 / 21

expected search time 10 4 10 3 10 2 DFS BFS 4 6 8 10 12 14 16 g Expected BFS and DFS search time as a function of goal depth in a tree of depth D = 15 , and goal probability p = 0 . 07 . The initially high expectation of BFS is because likely no goal exists � whole tree searched (artefact of model). Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 12 / 21

BFS vs. DFS Combining the runtime estimates yields an elegant decision boundary for when BFS is better: E [ t BFS ] − E [ t DFS ] < 0 ⇐ ⇒ g < D/ 2 + γ � �� BFS Better where γ = log 2 ( 1 − p p ) / 2 is inversely related to p ( γ small when p not very close to 0 or 1). Observations: BFS is better when goal near start node (expected) DFS benefits when p is large Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 13 / 21

BFS vs. DFS 16 DFS wins BFS wins 14 BFS = DFS E [ t BFS ] = E [ t DFS ] 12 10 g 8 6 4 2 4 6 8 10 12 14 16 D Plot of BFS vs. DFS decision boundary with goal level g and goal probability p = 0 . 07 . The decision boundary gets 79% of the winners correct. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 14 / 21

BFS vs. DFS 16 DFS wins BFS wins 14 BFS = DFS E [ t BFS ] = E [ t DFS ] 12 10 g 8 6 4 2 4 6 8 10 12 14 16 D Plot of BFS vs. DFS decision boundary with goal level g and goal probability p = 0 . 07 . The decision boundary gets 79% of the winners correct. Time to generalise. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 14 / 21

Tree with Multiple Goal Levels As before, assume a complete tree with: A maximum search depth D D = 3 , Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 15 / 21

Tree with Multiple Goal Levels As before, assume a complete tree with: A maximum search depth D Instead of goal level g and goal probability p : Use a goal probability vector p = [ p 0 , . . . , p D ] . Nodes on level k are goals with iid D = 3 , p = [0 , 1 3 , 1 3 , 1 3 ] probability p k . This is arguably much more realistic :) ways to estimate the goal probabilities is an important future question. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 15 / 21

Tree with Multiple Goal Levels As before, assume a complete tree with: A maximum search depth D Instead of goal level g and goal probability p : Use a goal probability vector p = [ p 0 , . . . , p D ] . Nodes on level k are goals with iid D = 3 , p = [0 , 1 3 , 1 3 , 1 3 ] probability p k . This is arguably much more realistic :) ways to estimate the goal probabilities is an important future question. Both BFS and DFS analysis can be carried back to the single goal level case with some hacks. BFS analysis is fairly straightforward DFS requires approximation of geometric distribution with exponential distribution Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 15 / 21

Decision Boundary 14 DFS wins BFS wins MGL =˜ t BFS t DFS MGL 12 10 µ 8 6 10 − 2 10 − 1 10 0 10 1 10 2 σ 2 The goal probabilities are highest at a peak level µ , and decays around it depending on σ 2 . Some takeaways: BFS still likes goals close to the root BFS likes larger spread more than DFS does (increases probability of really easy goal) Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 16 / 21

General graphs DFS BFS 1 1 2 13 2 3 3 7 9 14 4 6 7 5 4 5 6 8 10 11 12 15 8 9 10 11 12 13 14 15 We capture the various topological properties of graphs in a collection of parameters called the descendant counter. Similarly to before, we get approximate expressions for BFS and DFS expected runtime given a goal probability vector. We analytically derive the descendant counter for two concrete grammar problems (it could potentially be inferred empirically in other cases). Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 17 / 21

One observation is that DFS can spend an even greater fraction of the initial search time far away from the root. Binary Grammar Complete Binary Tree 10 6 10 6 10 4 10 4 t BFS BG ˜ t DFS BG 10 2 t BFS 10 2 t DFS SGL BGL ˜ t DFS t DFS SGL BGU 5 10 15 20 5 10 15 20 g g So BFS will be better for a wider range of goal levels in graph search than in tree search. Tom Everitt, Marcus Hutter (ANU) BFS vs. DFS September 3, 2015 18 / 21

An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem - PowerPoint PPT Presentation

An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem 1 Tom Everitt Marcus Hutter Australian National University September 3, 2015 Everitt, T. and Hutter, M. (2015a). Analytical Results on the BFS vs. DFS Algorithm Selection

Graphs Lecture 2 Today BFS/DFS Review; proof about DFS tree Implementation Running time

Massive Data Algorithmics Lecture 11: BFS and DFS Massive Data Algorithmics Lecture 11: BFS and

BFS/DFS Applications BFS and DFS applications Tyler Moore Shortest path between two nodes in a

BFS and DFS Problem Solving Club Nov 2 2016 Breadth First Search (BFS) Review What is the

Kolganov A.S., MSU The BFS algorithm Graph500 && GGraph500 Implementation of

Scalable GPU graph traversal BFS Compressed Row Format Sequential BFS Parallel BFS

Week 5 Kullmann Analysing BFS Depth-first search Depth-first search Analysing DFS

Integrating non-DCE/DFS Desktops into an existing DCE/DFS Environment Markus Zahn Computer

Reminder: Recursive DFS Algorithm dfs ( G ) 1. Initialise Boolean array visited by setting all

Local Search Toolbox so far Uninformed search BFS, DFS,

Interoperability of retail DFS What is Interoperability? DFS interoperability models

DFS(v) Recursive version Global Initialization: // mark v " undiscovered " for

CS 401: Computer Algorithm I DFS / Topological Ordering Xiaorui Sun 1 BFS Application:

AP CAPSTONE DIPLOMA at BFS BFS students are prepared to enter top universities worldwide and

Framework - Feed the Future Presenters Emily Hogue Krista Jacobs Farzana Ramzan USAID/BFS

Operationalizing CSA: Applications and CSA Metrics Moffatt Ngugi, BFS/CSI Tatiana Pulido,

Top-down Definite Clause Proof Procedure Idea: search backward from a query to determine if it is

Probabilistic and Logistic Circuits: A New Synthesis of Logic and Machine Learning Guy Van den

Feature Selection Yingyu Liang Computer Sciences 760 Fall 2017

Top-down Ground Proof Procedure Idea: search backward from a query to determine if it is a logical

7 th Symposium on Educational Advances in Artificial Intelligence 2017 Eric Eaton Sven Koenig

Career and Financial Incentives Hyuncheol Bryant Kim (Cornell University) Seonghoon Kim

Conducting a Mitiga tion Ne e ds Asse ssme nt 2019 CDBG-DR Problem Solving Clinic Kansas City

Assessments Street Reclaim and Pave Unit Assessment 50% of Street Cost or 2020

An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem - PowerPoint PPT Presentation

An Analytical Approach to the BFS vs. DFS Algorithm Selection Problem 1 Tom Everitt Marcus Hutter Australian National University September 3, 2015 Everitt, T. and Hutter, M. (2015a). Analytical Results on the BFS vs. DFS Algorithm Selection

Graphs Lecture 2 Today BFS/DFS Review; proof about DFS tree Implementation Running time

Massive Data Algorithmics Lecture 11: BFS and DFS Massive Data Algorithmics Lecture 11: BFS and

BFS/DFS Applications BFS and DFS applications Tyler Moore Shortest path between two nodes in a

BFS and DFS Problem Solving Club Nov 2 2016 Breadth First Search (BFS) Review What is the

Kolganov A.S., MSU The BFS algorithm Graph500 &amp;&amp; GGraph500 Implementation of

Scalable GPU graph traversal BFS Compressed Row Format Sequential BFS Parallel BFS

Week 5 Kullmann Analysing BFS Depth-first search Depth-first search Analysing DFS

Integrating non-DCE/DFS Desktops into an existing DCE/DFS Environment Markus Zahn Computer

Reminder: Recursive DFS Algorithm dfs ( G ) 1. Initialise Boolean array visited by setting all

Local Search Toolbox so far Uninformed search BFS, DFS,

Interoperability of retail DFS What is Interoperability? DFS interoperability models

DFS(v) Recursive version Global Initialization: // mark v &quot; undiscovered &quot; for

CS 401: Computer Algorithm I DFS / Topological Ordering Xiaorui Sun 1 BFS Application:

AP CAPSTONE DIPLOMA at BFS BFS students are prepared to enter top universities worldwide and

Framework - Feed the Future Presenters Emily Hogue Krista Jacobs Farzana Ramzan USAID/BFS

Operationalizing CSA: Applications and CSA Metrics Moffatt Ngugi, BFS/CSI Tatiana Pulido,

Top-down Definite Clause Proof Procedure Idea: search backward from a query to determine if it is

Probabilistic and Logistic Circuits: A New Synthesis of Logic and Machine Learning Guy Van den

Feature Selection Yingyu Liang Computer Sciences 760 Fall 2017

Top-down Ground Proof Procedure Idea: search backward from a query to determine if it is a logical

7 th Symposium on Educational Advances in Artificial Intelligence 2017 Eric Eaton Sven Koenig

Career and Financial Incentives Hyuncheol Bryant Kim (Cornell University) Seonghoon Kim

Conducting a Mitiga tion Ne e ds Asse ssme nt 2019 CDBG-DR Problem Solving Clinic Kansas City

Assessments Street Reclaim and Pave Unit Assessment 50% of Street Cost or 2020

Kolganov A.S., MSU The BFS algorithm Graph500 && GGraph500 Implementation of

DFS(v) Recursive version Global Initialization: // mark v " undiscovered " for