CS540 Midterm Review Yingyu Liang yliang@cs.wisc.edu Computer - PowerPoint PPT Presentation

CS540 Midterm Review Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of Wisconsin, Madison slide 1

Uninformed Search slide 2

The search problem • State space S : all valid configurations • Initial states (nodes) I ={(CSDF,)}  S ▪ Where ’ s the boat? • Goal states G ={(,CSDF)}  S C S D F • Successor function succs (s)  S : states reachable in one step (one arc) from s ▪ succs ((CSDF,)) = {(CD, SF)} ▪ succs ((CDF,S)) = {(CD,FS), (D,CFS), (C, DFS)} • Cost(s,s ’ )=1 for all arcs. (weighted later) • The search problem: find a solution path from a state in I to a state in G . ▪ Optionally minimize the cost of the solution. slide 3

General State-Space Search Algorithm function general-search(problem, QUEUEING-FUNCTION) ;; problem describes the start state, operators, goal test, and ;; operator costs ;; queueing-function is a comparator function that ranks two states ;; general-search returns either a goal node or "failure" nodes = MAKE-QUEUE(MAKE-NODE(problem.INITIAL-STATE)) loop if EMPTY(nodes) then return "failure" node = REMOVE-FRONT(nodes) if problem. GOAL-TEST (node.STATE) succeeds then return node nodes = QUEUEING-FUNCTION (nodes, EXPAND (node, problem.OPERATORS)) ;; succ(s)=EXPAND(s, OPERATORS) ;; Note: The goal test is NOT done when nodes are generated ;; Note: This algorithm does not detect loops end slide 4

Search on Trees: Breadth-first search (BFS) Expand the shallowest node first • Examine states one step away from the initial states • Examine states two steps away from the initial states • and so on … ripple g o a l slide 5

Depth-first search Expand the deepest node first 1. Select a direction, go deep to the end 2. Slightly change the end 3. Slightly change the end some more … fan g o a l slide 6

Iterative deepening 1. DFS, but stop if path length > 1. 2. If goal not found, repeat DFS, stop if path length >2. 3. And so on … g fan within ripple o a l g g o o a a l l slide 7

What you should know • Problem solving as search: state, successors, goal test • Uninformed search ▪ Breadth-first search • Uniform-cost search ▪ Depth-first search ▪ Iterative deepening ▪ Bidirectional search • Can you unify them (except bidirectional) using the same algorithm, with different priority functions? • Performance measures ▪ Completeness, optimality, time complexity, space complexity slide 9

Example slide 10

Example slide 11

Informed Search slide 12

Uninformed vs. informed search Uninformed search (BFS, uniform-cost, DFS, ID etc.) Knows the actual path cost g(s) from start to a node s in the fringe, but that ’ s it. s start goal g(s) Informed search s start goal g(s) h(s) also has a heuristic h(s) of the cost from s to goal. ( ‘ h ’ = heuristic, non-negative) Can be much faster than uninformed search. slide 13

Third attempt: A* search • use g(s) + h(s) , but the heuristic function h() has to satisfy h(s)  h*(s) , where h*(s) is the true cost from node s to the goal. • Such heuristic function h() is called admissible . • An admissible heuristic never over-estimates It is always optimistic • A search with admissible h() is called A* search . slide 14

What you should know Know why best-first greedy search is bad. Thoroughly understand A* Trace simple examples of A* execution. Understand admissible heuristics. slide 15

Example slide 16

Example slide 17

Advanced Search: Optimization slide 18

Optimization problems Previously we want a path from start to goal Uninformed search: g(s): Iterative Deepening Informed search: g(s)+h(s): A* Now a different setting : Each state s has a score f(s) that we can compute The goal is to find the state with the highest score, or a reasonably high score Do not care about the path This is an optimization problem Enumerating the states is intractable Even previous search algorithms are too expensive slide 19

Hill climbing algorithm 1. Pick initial state s 2. Pick t in neighbors( s ) with the largest f ( t ) 3. IF f ( t )  f ( s ) THEN stop, return s 4. s = t . GOTO 2. • Not the most sophisticated algorithm in the world. • Very greedy. • Easily stuck. your enemy: local optima slide 20

Repeated hill climbing with random restarts Very simple modification 1. When stuck, pick a random new start, run basic hill climbing from there. 2. Repeat this k times. 3. Return the best of the k local optima. • Can be very effective • Should be tried whenever hill climbing is used slide 21

Example slide 22

Example slide 23

Simulated Annealing 1. Pick initial state s 2. Randomly pick t in neighbors( s ) 3. IF f ( t ) better THEN accept s  t . 4. ELSE /* t is worse than s */ accept s  t with a small probability 5. 6. GOTO 2 until bored. How to choose the small probability? idea: p decreases with time, also as the ‘ badness ’ | f ( s )- f ( t )| increases Typical choice:    Boltzmann | ( ) ( ) | f s f t    exp distribution     Temp slide 24

Example slide 25

Example slide 26

Genetic algorithm Genetic algorithm : a special way to generate neighbors, using the analogy of cross-over, mutation, and natural selection. Number of non- prob. reproduction  fitness attacking pairs  Next generation slide 27

Game Playing slide 28

Two-player zero-sum discrete finite deterministic games of perfect information Definitions: Zero-sum: one player ’ s gain is the other player ’ s loss. Does not mean fair . Discrete: states and decisions have discrete values Finite: finite number of states and decisions Deterministic: no coin flips, die rolls – no chance Perfect information: each player can see the complete game state. No simultaneous decisions. slide 29

The game tree for II-Nim Two players: Max and Min (ii ii) Max (i ii) Min (- ii) Min (- ii) Max (i i) Max (- i) Max (- i) Max (- -) Max +1 (- i) Min (- -) Min (- i) Min (- -) Min (- -) Min -1 -1 -1 (- -) Max (- -) Max Max wants the largest score +1 +1 Min wants the smallest score slide 30

Game theoretic value Game theoretic value (a.k.a. minimax value) of a node = the score of the terminal node that will be reached if both players play optimally. = The numbers we filled in. Computed bottom up In Max ’ s turn, take the max of the children (Max will pick that maximizing action) In Min ’ s turn, take the min of the children (Min will pick that minimizing action) Implemented as a modified version of DFS: minimax algorithm slide 31

Minimax algorithm • function Max-Value(s) Time complexity? inputs: O( b m )  bad s: current state in game, Max about to play • Space complexity? output: best-score (for Max) available from s O( bm ) if ( s is a terminal state ) then return ( terminal value of s ) else α := –  for each s ’ in Succ(s) α := max( α , Min-value(s ’ )) return α function Min-Value(s) output: best-score (for Min) available from s if ( s is a terminal state ) then return ( terminal value of s) else β :=  for each s ’ in Succs(s) β := min( β , Max-value(s ’ )) return β slide 32

Example slide 33

Example slide 34

Alpha-Beta Motivation max S A min B 100 C D E F G 200 100 120 20 Depth-first order After returning from A, Max can get at least 100 at S After returning from F, Max can get at most 20 at B At this point, Max losts interest in B There is no need to explore G. The subtree at G is pruned. Saves time. slide 35

Alpha-beta pruning function Max-Value (s,α,β) Starting from the root: inputs: Max-Value(root, -  , +  ) s: current state in game, Max about to play α: best score (highest) for Max along path to s β: best score (lowest) for Min along path to s output: min (β , best-score (for Max) available from s ) if ( s is a terminal state ) then return ( terminal value of s ) else for each s ’ in Succ(s) α := max( α , Min-value(s ’,α,β)) if ( α ≥ β ) then return β /* alpha pruning */ return α function Min-Value (s,α,β) output: max (α , best-score (for Min) available from s ) if ( s is a terminal state ) then return ( terminal value of s) else for each s ’ in Succs(s) β := min( β , Max-value(s ’,α,β)) if (α ≥ β ) then return α /* beta pruning */ return β slide 36

Example slide 37

Example slide 38

Math Basics slide 39

Probability Axioms: P(A)  [0,1] ▪ ▪ P(true)=1, P(false)=0 P(A  B) = P(A) + P(B) – P(A  B) ▪ Properties: • P(  A) = 1 – P(A) • If A can take k different values a 1 … a k : P(A=a 1 ) + … P(A=a k ) = 1 • P(B) =  i=1 … k P(B  A=a i ), if A can take k values slide 40

Probability • Joint/marginal/conditional probability • Chain rule: • Bayes ’ rule: • Independence/conditional independence • Expectation slide 41

Example slide 42

Example slide 43

CS540 Midterm Review Yingyu Liang yliang@cs.wisc.edu Computer - PowerPoint PPT Presentation

CS540 Midterm Review Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of Wisconsin, Madison slide 1 Uninformed Search slide 2 The search problem State space S : all valid configurations Initial states (nodes)

Basic Probability and Statistics CS540 Bryan R Gibson University of Wisconsin-Madison Slides

Introduction to Artificial Intelligence CS540-1 Yingyu Liang slide 1 Logistics Course

CS540 Uninformed Search Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department

Introduction to Artificial Intelligence CS540-1 slide 1 AI Examples: Cars that drive themselves

Robust Digital Filters Part 1: Minimax FIR Filters Wu-Sheng Lu Takao Hinamoto University of

Minimax Rates for Memory-Constrained Sparse Linear Regression Jacob Steinhardt John Duchi

Inf2D 04: Adversarial Search Valerio Restocchi School of Informatics, University of Edinburgh

Foundations of Artificial Intelligence 42. Board Games: Minimax Search and Evaluation Functions

Chapter6 Adversarial Search 20070419 Chap6 1 Game Theory Studied by mathematicians,

Homework 7.1 C D Here is the payoff matrix for the most commonly used version of the

CSC304 Lecture 6 Game Theory : Minimax Theorem via Expert Learning CSC304 - Nisarg Shah 1

CMU 15-896 Noncooperative games 2: Learning and minimax Teacher: Ariel Procaccia Reminder: The

Imprecision in learning: introduction Sebastien Destercke Universit de Technologie de

Simpler Optimal Algorithm for Contextual Bandits under Realizability Yunzong Xu MIT Joint work

Decision Problems Decision Making under Uncertainty, Part III Christos Dimitrakakis Chalmers

a chaining algorithm for online non parametric regression Pierre Gaillard December 2, 2015

Outline 1. Standing on the Shoulders of Giants . . . 2. What is Information? 3. Shannon

Introduction to Machine Learning 25. Multiplicative Updates, Games and Boosting Alex Smola

Theory and Statistics Constantinos Daskalakis CSAIL and EECS, MIT Min-Max Optimization Solve:

Multi-agent learning Erik Berbee & Bas van Gijzel , Master Student AT, Utrecht University Erik

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Large-scale machine learning and convex optimization Francis Bach INRIA - Ecole Normale Sup

The Challenge Initiative: Business Unusual Approach to Scale up Kojo Lokko Bill & Melinda

On Scalable and Efficient Computation of Large Scale Optimal Transport Yujia Xie, Minshuo Chen,

CS540 Midterm Review Yingyu Liang yliang@cs.wisc.edu Computer - PowerPoint PPT Presentation

CS540 Midterm Review Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of Wisconsin, Madison slide 1 Uninformed Search slide 2 The search problem State space S : all valid configurations Initial states (nodes)

Basic Probability and Statistics CS540 Bryan R Gibson University of Wisconsin-Madison Slides

Introduction to Artificial Intelligence CS540-1 Yingyu Liang slide 1 Logistics Course

CS540 Uninformed Search Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department

Introduction to Artificial Intelligence CS540-1 slide 1 AI Examples: Cars that drive themselves

Robust Digital Filters Part 1: Minimax FIR Filters Wu-Sheng Lu Takao Hinamoto University of

Minimax Rates for Memory-Constrained Sparse Linear Regression Jacob Steinhardt John Duchi

Inf2D 04: Adversarial Search Valerio Restocchi School of Informatics, University of Edinburgh

Foundations of Artificial Intelligence 42. Board Games: Minimax Search and Evaluation Functions

Chapter6 Adversarial Search 20070419 Chap6 1 Game Theory Studied by mathematicians,

Homework 7.1 C D Here is the payoff matrix for the most commonly used version of the

CSC304 Lecture 6 Game Theory : Minimax Theorem via Expert Learning CSC304 - Nisarg Shah 1

CMU 15-896 Noncooperative games 2: Learning and minimax Teacher: Ariel Procaccia Reminder: The

Imprecision in learning: introduction Sebastien Destercke Universit de Technologie de

Simpler Optimal Algorithm for Contextual Bandits under Realizability Yunzong Xu MIT Joint work

Decision Problems Decision Making under Uncertainty, Part III Christos Dimitrakakis Chalmers

a chaining algorithm for online non parametric regression Pierre Gaillard December 2, 2015

Outline 1. Standing on the Shoulders of Giants . . . 2. What is Information? 3. Shannon

Introduction to Machine Learning 25. Multiplicative Updates, Games and Boosting Alex Smola

Theory and Statistics Constantinos Daskalakis CSAIL and EECS, MIT Min-Max Optimization Solve:

Multi-agent learning Erik Berbee &amp; Bas van Gijzel , Master Student AT, Utrecht University Erik

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Large-scale machine learning and convex optimization Francis Bach INRIA - Ecole Normale Sup

The Challenge Initiative: Business Unusual Approach to Scale up Kojo Lokko Bill &amp; Melinda

On Scalable and Efficient Computation of Large Scale Optimal Transport Yujia Xie, Minshuo Chen,

Multi-agent learning Erik Berbee & Bas van Gijzel , Master Student AT, Utrecht University Erik

The Challenge Initiative: Business Unusual Approach to Scale up Kojo Lokko Bill & Melinda