combinatorial search algorithms as rational agents
play

Combinatorial Search Algorithms as Rational Agents Wheeler Ruml - PowerPoint PPT Presentation

Combinatorial Search Algorithms as Rational Agents Wheeler Ruml Palo Alto Research Center ruml@parc.com Wheeler Ruml (PARC) Learning to Search Trees 2 / 40 Motivation Research goal: What algorithm to run? Introduction


  1. Combinatorial Search Algorithms as Rational Agents Wheeler Ruml Palo Alto Research Center ruml@parc.com Wheeler Ruml (PARC) Learning to Search Trees – 2 / 40

  2. Motivation Research goal: “What algorithm to run?” Introduction ➢ Motivation ➢ Combinatorial fundamental properties of various algorithms ■ Optimization fundamental properties of problems ➢ Constraint ■ Satisfaction ➢ Types of Search Problems ➢ The Problem ➢ The Central Idea How to best use available information in a tree search? Previous Approaches Basic BLFS BLFS with Learning Wheeler Ruml (PARC) Learning to Search Trees – 3 / 40

  3. Combinatorial Optimization Given: set of variables Introduction ➢ Motivation possible values for each variable ➢ Combinatorial Optimization objective function over assignments ➢ Constraint Find: assignment that minimizes objective function Satisfaction ➢ Types of Search Problems ➢ The Problem One approach: search tree for best leaf ➢ The Central Idea Previous Approaches variable 1 Basic BLFS BLFS with Learning value 1 value 2 variable 2 variable 2 value 1 value 2 value 1 value 2 1.6 2.3 1.5 3.9 Wheeler Ruml (PARC) Learning to Search Trees – 4 / 40

  4. Constraint Satisfaction Given: set of variables Introduction ➢ Motivation possible values for each variable ➢ Combinatorial Optimization set of constraints between variables ➢ Constraint Find: complete and feasible assignment Satisfaction ➢ Types of Search Problems ➢ The Problem Treat as combinatorial optimization: ➢ The Central Idea Previous Approaches variable 1 Basic BLFS BLFS with Learning value 1 value 2 variable 2 variable 2 value 1 value 2 value 1 value 2 1 3 0 4 Wheeler Ruml (PARC) Learning to Search Trees – 5 / 40

  5. Types of Search Problems Shortest path: find shallowest node that is a goal Introduction eg, shortest plan ➢ Motivation ➢ Combinatorial Constraint satisfaction: find any leaf node that is a goal Optimization ➢ Constraint eg, valid configuration Satisfaction ➢ Types of Search Combinatorial optimization: find best-scoring leaf node Problems ➢ The Problem eg, balanced partitioning ➢ The Central Idea Adversarial search: find best-scoring leaf we can surely reach Previous Approaches eg, chess Basic BLFS BLFS with Learning Wheeler Ruml (PARC) Learning to Search Trees – 6 / 40

  6. Types of Search Problems Shortest path: find shallowest node that is a goal Introduction eg, shortest plan ➢ Motivation ➢ Combinatorial Optimization ➢ Constraint Satisfaction ➢ Types of Search Combinatorial optimization: find best-scoring leaf node Problems ➢ The Problem eg, balanced partitioning ➢ The Central Idea Adversarial search: find best-scoring leaf we can surely reach Previous Approaches eg, chess Basic BLFS BLFS with Learning Wheeler Ruml (PARC) Learning to Search Trees – 6 / 40

  7. Types of Search Problems Shortest path: find shallowest node that is a goal Introduction eg, shortest plan ➢ Motivation ➢ Combinatorial Optimization ➢ Constraint Satisfaction ➢ Types of Search Problems ➢ The Problem ➢ The Central Idea Adversarial search: find best-scoring leaf we can surely reach Previous Approaches eg, chess Basic BLFS BLFS with Learning Wheeler Ruml (PARC) Learning to Search Trees – 6 / 40

  8. The Problem For large problems or when optimum is recognizable, Introduction search order matters. ➢ Motivation ➢ Combinatorial Optimization ➢ Constraint Satisfaction ➢ Types of Search Where was the mistake? Problems ➢ The Problem ➢ The Central Idea Previous Approaches Basic BLFS BLFS with Learning Truncated depth-first is not necessarily optimal! Wheeler Ruml (PARC) Learning to Search Trees – 7 / 40

  9. The Central Idea Where to backtrack first? Introduction ➢ Motivation ➢ Combinatorial Optimization ➢ Constraint Satisfaction ➢ Types of Search Problems ➢ The Problem ➢ The Central Idea Previous Approaches Predetermined order = strong assumptions = ad hoc = brittle Basic BLFS BLFS with Learning Use a model of leaf costs on-line to guide search. [Ruml, 2001; Boyan, 1998; Baluja, 1996] Wheeler Ruml (PARC) Learning to Search Trees – 8 / 40

  10. Introduction Previous Approaches ➢ DFS ➢ Discrepancy Search ➢ A Best-First Approach ➢ Predicting Leaf Cost ➢ Avoid Previous Approaches Bookkeeping ➢ BLFS Basic BLFS BLFS with Learning Wheeler Ruml (PARC) Learning to Search Trees – 9 / 40

  11. Depth-First Search (DFS) Introduction Previous Approaches ➢ DFS ➢ Discrepancy Search ➢ A Best-First Approach ➢ Predicting Leaf Cost ➢ Avoid Bookkeeping ➢ BLFS Basic BLFS BLFS with Learning 1. Prune provably bad nodes (branch and bound) 2. Sort children left to right using a heuristic ordering function h Assumes penalty at top is enormous. Wheeler Ruml (PARC) Learning to Search Trees – 10 / 40

  12. Depth-First Search (DFS) Introduction Previous Approaches ➢ DFS ➢ Discrepancy Search ➢ A Best-First Approach ➢ Predicting Leaf Cost ➢ Avoid Bookkeeping ➢ BLFS Basic BLFS BLFS with Learning 1. Prune provably bad nodes (branch and bound) 2. Sort children left to right using a heuristic ordering function h Assumes penalty at top is enormous. Wheeler Ruml (PARC) Learning to Search Trees – 10 / 40

  13. Discrepancy Search Harvey and Ginsberg (1995): Limited Discrepancy Search Introduction discrepancy : a choice against the heuristic ordering Previous Approaches ➢ DFS Explore all paths with k discrepancies before any with k + 1 . ➢ Discrepancy Search ➢ A Best-First Approach ➢ Predicting Leaf Cost ➢ Avoid Bookkeeping ➢ BLFS Basic BLFS BLFS with Learning Korf (1996): ILDS Also Walsh (1997), Ginsberg and Harvey (1992), Meseguer (1997) Wheeler Ruml (PARC) Learning to Search Trees – 11 / 40

  14. A Best-First Approach Fixed order ↔ fixed predictions for leaf costs Introduction Want predicted costs to match current problem Previous Approaches ➢ DFS ➢ Discrepancy Use run-time heuristic information to help make predictions. Search ➢ A Best-First Approach ➢ Predicting Leaf Use predictions to guide search: Cost ➢ Avoid Bookkeeping Rational order: increasing predicted leaf cost = best-first ➢ BLFS Basic BLFS BLFS with Learning 1.6 2.3 2.1 3.9 1.5 2.6 3.2 4.4 [Ruml, 2002 Wheeler Ruml (PARC) Learning to Search Trees – 12 / 40

  15. Predicting Leaf Cost Want to visit leaves in increasing order of predicted cost. Introduction Previous Approaches Where are they? ➢ DFS ➢ Discrepancy Search f ( n ) = predicted cost of best leaf at or below n ■ ➢ A Best-First Approach can use any info at n or on path from root ■ ➢ Predicting Leaf Cost want f ( n ) consistent ■ ➢ Avoid Bookkeeping ➢ BLFS Basic BLFS BLFS with Learning f ( n ) = 1 . 5 f ( n ) = 1 . 7 f ( n ) = 1 . 5 f ( n ) = 2 . 2 f ( n ) = 3 . 1 f ( n ) = 1 . 5 f ( n ) = 4 . 8 1.6 2.3 2.1 3.9 1.5 2.6 3.2 4.4 Wheeler Ruml (PARC) Learning to Search Trees – 13 / 40

  16. Avoid Bookkeeping Want to visit leaves in increasing order of predicted cost. Introduction Previous Approaches How to keep track of them? ➢ DFS ➢ Discrepancy Search don’t — allow slight misordering ■ ➢ A Best-First Approach use iteratively increasing cost bound ■ ➢ Predicting Leaf Cost ➢ Avoid Bookkeeping Cost bound = 2 ➢ BLFS Basic BLFS BLFS with Learning f ( n ) = 1 . 5 f ( n ) = 1 . 7 f ( n ) = 1 . 5 f ( n ) = 2 . 2 f ( n ) = 3 . 1 f ( n ) = 1 . 5 f ( n ) = 4 . 8 1.6 2.3 2.1 3.9 1.5 2.6 3.2 4.4 Wheeler Ruml (PARC) Learning to Search Trees – 14 / 40

  17. Avoid Bookkeeping Want to visit leaves in increasing order of predicted cost. Introduction Previous Approaches How to keep track of them? ➢ DFS ➢ Discrepancy Search don’t — allow slight misordering ■ ➢ A Best-First Approach use iteratively increasing cost bound ■ ➢ Predicting Leaf Cost ➢ Avoid Bookkeeping Cost bound = 3 ➢ BLFS Basic BLFS BLFS with Learning f ( n ) = 1 . 5 f ( n ) = 1 . 7 f ( n ) = 1 . 5 f ( n ) = 2 . 2 f ( n ) = 3 . 1 f ( n ) = 1 . 5 f ( n ) = 4 . 8 1.6 2.3 2.1 3.9 1.5 2.6 3.2 4.4 Wheeler Ruml (PARC) Learning to Search Trees – 14 / 40

  18. Best-Leaf-First Search (BLFS) BLFS ( root ) Introduction Previous Approaches Visit a few leaves ➢ DFS ➢ Discrepancy Nodes-desired ← number of nodes visited so far Search Loop until time runs out: ➢ A Best-First Approach Double nodes-desired ➢ Predicting Leaf Cost Estimate cost bound that visits nodes-desired nodes ➢ Avoid Bookkeeping BLFS-expand( root , bound ) ➢ BLFS Basic BLFS BLFS with Learning BLFS-expand ( node , bound ) If leaf( node ), visit( node ) else, for each child of node : If best-completion( child ) ≤ bound BLFS-expand( child , bound ) Wheeler Ruml (PARC) Learning to Search Trees – 15 / 40

  19. Introduction Previous Approaches Basic BLFS ➢ Indecision Search ➢ Choosing the Cost Bound ➢ Best-Leaf-First Search (BLFS) ➢ Test Domains Basic BLFS ➢ Latin Squares ➢ Random Binary CSPs BLFS with Learning Wheeler Ruml (PARC) Learning to Search Trees – 16 / 40

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend