Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu - PowerPoint PPT Presentation

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1

Introduction Alpha-beta pruning is the standard searching procedure used for 2-person perfect-information zero sum games. Definitions: • A position p . • The value of a position p , f ( p ) , is a numerical value computed from evaluating p . ⊲ Value is computed from the root player’s point of view. ⊲ Positive values mean in favor of the root player. ⊲ Negative values mean in favor of the opponent. ⊲ Since it is a zero sum game, thus from the opponent’s point of view, the value can be assigned − f ( p ) . • A terminal position: a position whose value can be know. ⊲ A position where win/loss/draw can be concluded. ⊲ A position where some constraints are met. • A position p has d legal moves p 1 , p 2 , . . . , p d . TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 2

Tree node numbering 1 2 3 1.1 3.1 1.2 3.2 2.2 1.3 2.1 3.1.2 3.1.1 From the root, number a node in a search tree by a sequence of integers a.b.c.d · · · • Meaning from the root, you first take the a th branch, then the b th branch, and then the c th branch, and then the d th branch · · · • The root is specified as an empty sequence. • The depth of a node is the length of the sequence of integers specifying it. This is called “Dewey decimal system.” TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 3

Mini-max formulation 7 max 1 2 7 min 8 max 7 2 5 1 6 7 min 8 1 Mini-max formulation: • � f ( p ) if d = 0 F ′ ( p ) = max { G ′ ( p 1 ) , . . . , G ′ ( p d ) } if d > 0 • � f ( p ) if d = 0 G ′ ( p ) = min { F ′ ( p 1 ) , . . . , F ′ ( p d ) } if d > 0 • An indirect recursive formula! • Equivalent to AND-OR logic. TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 4

Algorithm: Mini-max Algorithm F ′ (position p ) // max node • determine the successor positions p 1 , . . . , p d • if d = 0 , then return f ( p ) else begin ⊲ m := −∞ ⊲ for i := 1 to d do t := G ′ ( p i ) ⊲ ⊲ if t > m then m := t // find max value • end; return m Algorithm G ′ (position p ) // min node • determine the successor positions p 1 , . . . , p d • if d = 0 , then return f ( p ) else begin ⊲ m := ∞ ⊲ for i := 1 to d do t := F ′ ( p i ) ⊲ ⊲ if t < m then m := t // find min value • end; return m A brute-force method to try all possibilities! TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 5

Mini-max: revised (1/2) Algorithm F ′ (position p ) // max node • determine the successor positions p 1 , . . . , p d • if d = 0 // a terminal node or depth reaches the cutoff threshold // from iterative deepening or time is running up // from timing control or some other constraints are met // add knowledge here then return f ( p ) // current board value else begin ⊲ m := −∞ // initial value ⊲ for i := 1 to d do // try each child ⊲ begin t := G ′ ( p i ) ⊲ ⊲ if t > m then m := t // find max value ⊲ end end • return m TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 6

Mini-max: revised (2/2) Algorithm G ′ (position p ) // min node • determine the successor positions p 1 , . . . , p d • if d = 0 // a terminal node or depth reaches the cutoff threshold // from iterative deepening or time is running up // from timing control or some other constraints are met // add knowledge here then return f ( p ) // current board value else begin ⊲ m := ∞ // initial value ⊲ for i := 1 to d do // try each child ⊲ begin t := F ′ ( p i ) ⊲ ⊲ if t < m then m := t // find min value ⊲ end end • return m TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 7

Nega-max formulation 7 max neg neg neg −1 −7 −2 min neg neg neg neg neg neg neg 8 max 7 5 1 6 7 2 neg neg min −8 −1 Nega-max formulation: Let F ( p ) be the greatest possible value achievable from position p against the optimal defensive strategy. • � h ( p ) if d = 0 F ( p ) = max {− F ( p 1 ) , . . . , − F ( p d ) } if d > 0 ⊲ � f ( p ) if depth of p is 0 or even h ( p ) = − f ( p ) if depth of p is odd TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 8

Algorithm: Nega-max Algorithm F (position p ) • determine the successor positions p 1 , . . . , p d • if d = 0 // a terminal node or depth reaches the cutoff threshold // from iterative deepening or time is running up // from timing control or some other constraints are met // add knowledge here • then return h ( p ) else • begin ⊲ m := −∞ ⊲ for i := 1 to d do ⊲ begin ⊲ t := − F ( p i ) // recursive call, the returned value is negated ⊲ if t > m then m := t // always find a max value ⊲ end • end • return m Also a brute-force method to try all possibilities, but with a simpler code. TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 9

Intuition for improvements Branch-and-bound: using information you have so far to cut or prune branches. • A branch is cut means we do not need to search it anymore. • If you know for sure the value of your result is more than x and the current search result for this branch so far can give you no more than x , ⊲ then there is no need to search this branch any further. Two types of approaches • Exact algorithms: through mathematical proof, it is guaranteed that the branches pruned won’t contain the solution. ⊲ Alpha-beta pruning: reinvented by several researchers in the 1950’s and 1960’s. ⊲ Scout. ⊲ · · · • Approximated heuristics: with a high probability that the solution won’t be contained in the branches pruned. ⊲ Obtain a good estimation on the remaining cost. ⊲ Cut a branch when it is in a very bad position and there is little hope to gain back the advantage. TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 10

Alpha cut-off V>=15 1 2 V <= 10 V=15 2.1 2.2 cut V=10 Alpha cut-off: • On a max node ⊲ Assume you have finished exploring the branch at 1 and obtained the best value from it as bound . ⊲ You now search the branch at 2 by first searching the branch at 2 . 1 . ⊲ Assume branch at 2 . 1 returns a value that is ≤ bound . ⊲ Then no need to evaluate the branch at 2 . 2 and all later branches of 2 , if any, at all. ⊲ The best possible value for the branch at 2 must be ≤ bound . ⊲ Hence we should take value returned from the branch at 1 as the best possible solution. TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 11

Beta cut-off V<=10 2 1 1.1 V >= 15 1.2 V=10 cut 1.2.1 1.2.2 V=15 Beta cut-off: • On a min node ⊲ Assume you have finished exploring the branch at 1 . 1 and obtained the best value from it as bound . ⊲ You now search the branches at 1 . 2 by first exploring the branch at 1 . 2 . 1 . ⊲ Assume the branch at 1 . 2 . 1 returns a value that is ≥ bound . ⊲ Then no need to evaluate the branch at 1 . 2 . 2 and all later branches of 1 . 2 , if any, at all. ⊲ The best possible value for the branch at 1 . 2 is ≥ bound . ⊲ Hence we should take value returned from the branch at 1 . 1 as the best possible solution. TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 12

Deep alpha cut-off For alpha cut-off: ⊲ For a min node u , the branch of its ancestor (e.g., elder brother of its parent) produces a lower bound V l . ⊲ The first branch of u produces an upper bound V u for v . ⊲ If V l ≥ V u , then there is no need to evaluate the second branch and all later branches, of u . Deep alpha cut-off: ⊲ Def: For a node u in a tree and a positive integer g , Ancestor( g , u ) is the direct ancestor of u by tracing the parent’s link g times. ⊲ When the lower bound V l is produced at and propagated from u ’s great grand parent, i.e., Ancestor(3, u ), or any Ancestor( 2 i + 1 , u ), i ≥ 1 . ⊲ When an upper bound V u is returned from the a branch of u and V l ≥ V u , then there is no need to evaluate all later branches of u . We can find similar properties for deep beta cut-off. TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 13

Illustration — Deep alpha cut-off V>=15 1 2 V=15 2.1 2.2 V>=15 2.1.1 V <= 7 cut 2.1.1.1 2.1.1.2 V=7 TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 14

Ideas for refinements During searching, maintain two values alpha and beta so that • alpha is the current lower bound of the possible returned value; • beta is the current upper bound of the possible returned value. If during searching, we know for sure alpha > beta , then there is no need to search any more in this branch. • The returned value cannot be in this branch. • Backtrack until it is the case alpha ≤ beta . The two values alpha and beta are called the ranges of the current search window. • These values are dynamic. • Initially, alpha is −∞ and beta is ∞ . TCG: α - β Pruning, 20131106, Tsan-sheng Hsu c � 15

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu - PowerPoint PPT Presentation

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Introduction Alpha-beta pruning is the standard searching procedure used for 2-person perfect-information zero sum games.

Alpha- -beta pruning beta pruning Example Alpha Example reduce the branching factor of

Natural Target Pruning Making Proper Pruning Cuts Natural Target Pruning In this lesson we

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

More on games (Ch. 5.4-5.6) Alpha-beta pruning Previously on CSci 4511... We talked about how to

Beta star measurement G. Wang and M.Bai Yellow beta star and chromatic beta beat measurement

BASICS Natural Target Pruning Terminology and Tools Reasons for Pruning Fruit Trees

Pruning for Cropload Management and Productivity 2013 Winter Pruning Workshop Dr. Mercy

ECE 4524 Artificial Intelligence and Engineering Applications Meeting 6: Alpha-Beta Pruning,

More on games (Ch. 5.4-5.6) Announcements Writing 2 posted Minimax Pruning in real life:

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis Chaoqi Wang , Roger Grosse,

CS440/ECE448 Lecture 11: Alpha-Beta Pruning; Limited Horizon Slides by Mark Hasegawa-Johnson

Extensive Form Games 2/10/17 Alpha-Beta Pruning Exercise + + + + + + +

Game Playing Part 2 Alpha-Beta Pruning Yingyu Liang yliang@cs.wisc.edu Computer Sciences

Minimax strategies, alpha beta pruning Lirong Xia Reminder Project 1 due tonight Makes

Why is Go hard for computers to play? Game tree complexity = b d Brute force search intractable:

Types Sequences: Lists Strings Exercises on the above and loops

CSE 473: Artificial Intelligence Spring 2014 Adversarial Search Hanna Hajishirzi

String Matching II Algorithm : Design & Analysis [19] In the last class Simple String

Effective R Programming Jacob Colvin February 21, 2009 Jacob Colvin () Effective R Programming

Serverless Boom or Bust? An Analysis of Economic Incentives Xiayue Charles Lin, Joseph E.

Game Theory -- Lecture 1 Patrick Loiseau EURECOM Fall 2016 1 Lecture 1 outline 1.

Case Examples bayesDP R package Analysis types Single-arm: treatment data only

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu - PowerPoint PPT Presentation

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Introduction Alpha-beta pruning is the standard searching procedure used for 2-person perfect-information zero sum games.

Alpha- -beta pruning beta pruning Example Alpha Example reduce the branching factor of

Natural Target Pruning Making Proper Pruning Cuts Natural Target Pruning In this lesson we

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

Alpha-Beta Pruning: Algorithm and Analysis Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

More on games (Ch. 5.4-5.6) Alpha-beta pruning Previously on CSci 4511... We talked about how to

Beta star measurement G. Wang and M.Bai Yellow beta star and chromatic beta beat measurement

BASICS Natural Target Pruning Terminology and Tools Reasons for Pruning Fruit Trees

Pruning for Cropload Management and Productivity 2013 Winter Pruning Workshop Dr. Mercy

ECE 4524 Artificial Intelligence and Engineering Applications Meeting 6: Alpha-Beta Pruning,

More on games (Ch. 5.4-5.6) Announcements Writing 2 posted Minimax Pruning in real life:

EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis Chaoqi Wang , Roger Grosse,

CS440/ECE448 Lecture 11: Alpha-Beta Pruning; Limited Horizon Slides by Mark Hasegawa-Johnson

Extensive Form Games 2/10/17 Alpha-Beta Pruning Exercise + + + + + + +

Game Playing Part 2 Alpha-Beta Pruning Yingyu Liang yliang@cs.wisc.edu Computer Sciences

Minimax strategies, alpha beta pruning Lirong Xia Reminder Project 1 due tonight Makes

Why is Go hard for computers to play? Game tree complexity = b d Brute force search intractable:

Types Sequences: Lists Strings Exercises on the above and loops

CSE 473: Artificial Intelligence Spring 2014 Adversarial Search Hanna Hajishirzi

String Matching II Algorithm : Design &amp; Analysis [19] In the last class Simple String

Effective R Programming Jacob Colvin February 21, 2009 Jacob Colvin () Effective R Programming

Serverless Boom or Bust? An Analysis of Economic Incentives Xiayue Charles Lin, Joseph E.

Game Theory -- Lecture 1 Patrick Loiseau EURECOM Fall 2016 1 Lecture 1 outline 1.

Case Examples bayesDP R package Analysis types Single-arm: treatment data only

String Matching II Algorithm : Design & Analysis [19] In the last class Simple String