Monte Carlo Tree Search Mark Maloof Department of Computer Science - PowerPoint PPT Presentation

Mar 22, 2023 •48 likes •178 views

Monte Carlo Tree Search Mark Maloof Department of Computer Science Georgetown University Washington, DC 20057 1 January 1970 Overview MCTS consists of four main steps (Browne et al., 2012) 1. Selection: Starting at the root, select the

Monte Carlo Tree Search Mark Maloof Department of Computer Science Georgetown University Washington, DC 20057 1 January 1970
Overview ◮ MCTS consists of four main steps (Browne et al., 2012) 1. Selection: Starting at the root, select the best action until reaching a node that has not been fully explored (i.e., a node with untried and therefore unevaluated actions). 2. Expansion: Choose an action, and expand the tree by adding a child node. 3. Simulation: From the newly added child, uniformly randomly select actions until reaching a leaf node and receiving a reward (e.g., +1 for winning, − 1 for losing). 4. Backpropagation: Starting at the new child node, propagate the reward to the root by adjusting the visit count N ( v ) and the simulation reward Q ( v ) of the nodes along the path.
Figure 2, Brown et al. (2012)
Upper-confidence Bound for Trees (UCT) 1: function uctSearch( s 0 ) create a root node v 0 with state s 0 2: while within computational budget do 3: v l ← treePolicy( v 0 ) 4: ∆ ← defaultPolicy(( s ( v l )) 5: backup( v l , ∆) 6: end while 7: return a (bestChild( v 0 , 0)) 8: 9: end function
Tree Policy 1: function treePolicy( v ) while v is non-terminal do 2: if v not fully expanded then 3: return expand( v ) 4: else 5: v ← bestChild( v , C p ) 6: end if 7: end while 8: return v 9: 10: end function
Expand 1: function expand( v ) choose a ∈ untried actions from A ( s ( v )) 2: add a new child v ′ to v with s ( v ′ ) = f ( s ( v ) , a ) and 3: a ( v ′ ) = a return v ′ 4: 5: end function
Best Child 1: function bestChild( v , c ) � Q ( v ′ ) 2 ln N ( v ) return argmax v ′ ∈ Children ( v ) N ( v ′ ) + c 2: N ( v ′ ) 3: end function
Default Policy 1: function defaultPolicy( s ) while s is non-terminal do 2: choose a ∈ A ( s ) uniformly at random 3: s ← f ( s , a ) 4: end while 5: return reward for state s 6: 7: end function
Backup 1: function backup( v , ∆) while s is not null do 2: N ( v ) ← N ( v ) + 1 3: Q ( v ) ← Q ( v ) + ∆( v , p ) ⊲ p is player 4: v ← parent of v 5: end while 6: 7: end function
Backup Negamax 1: function backupNegamax( v , ∆) while s is not null do 2: N ( v ) ← N ( v ) + 1 3: Q ( v ) ← Q ( v ) + ∆ 4: ∆ ← − ∆ 5: v ← parent of v 6: end while 7: 8: end function
Figure 3, Brown et al. (2012)
Monte Carlo Tree Search Mark Maloof Department of Computer Science Georgetown University Washington, DC 20057 1 January 1970
References I C. Browne, E. Powley, D. Whitehouse, S. Lucas, P. I. Cowling, P. Fohlfshagen, S. Tavener, D. Perez, S. Samothrakis, and S. Colton. A survey of Monte Carlo tree search methods. IEEE Transactions on Computational Intelligence and AI in Games , 4(1):1–43, 2012. doi: 10.1109/TCIAIG.2012.2186810 .

Recommend

Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P .

P . Skands QCD Lecture III Monte Carlo Generators Monte Carlo Generators Monte Carlo Generators QCD Lecture III P . Skands 1 P . Skands QCD Lecture III A Monte Carlo technique: is any technique making use of random numbers to solve a

2.37k views • 105 slides

Monte-Carlo tree search for Monte-Carlo tree search for multi-player, no-limit multi-player,

Monte-Carlo tree search for Monte-Carlo tree search for multi-player, no-limit multi-player, no-limit Texas hold'em poker Texas hold'em poker Guy Van den Broeck Should I bluff? Deceptive play Should I bluff? Is he bluffing? Opponent

1.14k views • 112 slides

Monte Carlo Tree Search 2-15-16 Reading Quiz What is the relationship between Monte Carlo tree

Monte Carlo Tree Search 2-15-16 Reading Quiz What is the relationship between Monte Carlo tree search and upper confidence bound applied to trees? a) MCTS is a type of UCB b) UCB is a type of MCTS c) both (they are the same algorithm) d)

331 views • 17 slides

Monte Carlo Methods Guojin Chen Christopher Cprek Chris Rambicure Monte Carlo Methods 1.

Monte Carlo Methods Guojin Chen Christopher Cprek Chris Rambicure Monte Carlo Methods 1. Introduction 2. History 3. Examples Introduction Monte Carlo methods are stochastic techniques. Monte Carlo method is very

1.27k views • 68 slides

Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include:

Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include: Arnaud Doucet, Axel Finke, Anthony Lee, Nick Whiteley 7th January 2014 Introduction Monte Carlo Approximationof Monte Carlo Filters Approximating

1.46k views • 31 slides

BROCHURE 2019 TETRA JUICES DEL MONTE DEL MONTE 6 x 1L GOLD PINEAPPLE 6 x 1L 6 x 1L 6 x 1L

BROCHURE 2019 TETRA JUICES DEL MONTE DEL MONTE 6 x 1L GOLD PINEAPPLE 6 x 1L 6 x 1L 6 x 1L 6 x 1L 6 x 1L 6 x 1L PINEAPPLE- GOLD DEL MONTE DEL MONTE COCCO DEL MONTE DEL MONTE DEL MONTE DEL MONTE DEL MONTE 8x1 lt PINEAPPLE

531 views • 12 slides

Modern Monte Carlo Tree Search Andrew Li, John Chen, Keiran Paster 1 Outline Motivation

Modern Monte Carlo Tree Search Andrew Li, John Chen, Keiran Paster 1 Outline Motivation Optimistic Exploration and Bandits Monte Carlo Tree Search (MCTS) Learning to Search in MCTS Thinking Fast and Slow with Deep Learning

567 views • 35 slides

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is a binary search tree if each element in the left subtree is smaller than the root, each element in the right subtree is larger than the root,

757 views • 51 slides

Chapter 5: Monte Carlo Methods Monte Carlo methods are learning methods Experience

Chapter 5: Monte Carlo Methods Monte Carlo methods are learning methods Experience values, policy Monte Carlo methods can be used in two ways: ! model-free: No model necessary and still attains optimality ! Simulated: Needs only a

2.17k views • 32 slides

Draft Introduction to (randomized) quasi-Monte Carlo Pierre LEcuyer MCQMC Conference,

1 Draft Introduction to (randomized) quasi-Monte Carlo Pierre LEcuyer MCQMC Conference, Stanford University, August 2016 2 Draft Program Monte Carlo, Quasi-Monte Carlo, Randomized quasi-Monte Carlo QMC point sets and

1.98k views • 148 slides

Monte Carlo Estimation 7 January 2019 OSU CSE 1 Monte Carlo Methods Class of computational

Monte Carlo Estimation 7 January 2019 OSU CSE 1 Monte Carlo Methods Class of computational methods that use random sampling to estimate results Named after the famous Monte Carlo Casino 7 January 2019 OSU CSE 2 Throwing Darts 3 5

392 views • 12 slides

Monte Carlo Localization Ximing Yu March 24, 2009 Ximing Yu Monte Carlo Localization 1

Outline Introduction MCL Mixture-MCL End Monte Carlo Localization Ximing Yu March 24, 2009 Ximing Yu Monte Carlo Localization 1 Outline Introduction MCL Mixture-MCL End Introduction 1 Localization Problem Bayes Filter Monte Carlo

414 views • 23 slides

Monte Carlo Control CMPUT 366: Intelligent Systems S&B 5.3-5.5, 5.7 Lecture Outline 1.

Monte Carlo Control CMPUT 366: Intelligent Systems S&B 5.3-5.5, 5.7 Lecture Outline 1. Recap 2. Estimating Action Values 3. Monte Carlo Control 4. Importance Sampling 5. Off-Policy Monte Carlo Control Recap: Monte Carlo vs.

263 views • 22 slides

4. THE MONTE CARLO METHOD 4.1 I ntroduction This chapter is aimed at describing the Monte Carlo

C hapter 4: Monte Carlo Modeling of Grain Growth and Recrystallization, A.D. Rollett & P. Manohar 4. THE MONTE CARLO METHOD 4.1 I ntroduction This chapter is aimed at describing the Monte Carlo method for the simulation of grain growth and

2.05k views • 37 slides

CS171: Artificial Intelligence Monte Carlo Tree Search and Alpha Go Jia Chen Dec 5, 2017 1

CS171: Artificial Intelligence Monte Carlo Tree Search and Alpha Go Jia Chen Dec 5, 2017 1 Schedule Introduction Monte-Carlo Tree Search Policy and Value Networks Results 2 Introduction Go originated 2,500+ years ago

882 views • 47 slides

Monte Carlo Tree Search for Algorithm Configuration: MOSAIC Herilalaina Rakotoarison and Mich`

Monte Carlo Tree Search for Algorithm Configuration: MOSAIC Herilalaina Rakotoarison and Mich` ele Sebag TAU CNRS INRIA LRI Universit e Paris-Sud NeurIPS MetaLearning Wshop Dec. 8, 2018 1 / 14 Monte Carlo Tree Search for

574 views • 28 slides

Sensitivity Estimates Using a Toy Monte Carlo Dave Waters, University College London with Sean

Sensitivity Estimates Using a Toy Monte Carlo Dave Waters, University College London with Sean Danaher, Chris Rhodes, Terry Sloan & Lee Thompson Goals of the Study. Details of the Toy Monte Carlo. Validating the Monte Carlo. Generating the

514 views • 24 slides

Contents 1 Introduction 1 1.1 When We Dont Need Simulation . . . . . . . . . . . . . . . .

Statistical Simulation An Introduction Contents 1 Introduction 1 1.1 When We Dont Need Simulation . . . . . . . . . . . . . . . . . . 1 1.2 Why We Often Need Simulation . . . . . . . . . . . . . . . . . . 2 1.3 Basic Ways We

326 views • 21 slides

Monte-Carlo Tree Search Mich` ele Sebag TAO: Theme Apprentissage & Optimization

Monte-Carlo Tree Search Mich` ele Sebag TAO: Theme Apprentissage & Optimization Acknowledgments: Olivier Teytaud , Sylvain Gelly, Philippe Rolet, Romaric Gaudel CP 2012 Foreword Disclaimer 1 There is no shortage of tree-based

1.07k views • 106 slides

Ch.8.1-8.3: Random numbers and Monte Carlo simulation Joakim Sundnes 1 , 2 Hans Petter Langtangen

Ch.8.1-8.3: Random numbers and Monte Carlo simulation Joakim Sundnes 1 , 2 Hans Petter Langtangen 1 , 2 Simula Research Laboratory 1 University of Oslo, Dept. of Informatics 2 Nov 15, 2017 Plan for this week Wednesday November 15: Exer E.21,

184 views • 16 slides

Introduction to Bayesian Computation Dr. Jarad Niemi STAT 544 - Iowa State University March 26,

Introduction to Bayesian Computation Dr. Jarad Niemi STAT 544 - Iowa State University March 26, 2019 Jarad Niemi (STAT544@ISU) Introduction to Bayesian Computation March 26, 2019 1 / 30 Bayesian computation Goals: E | y [ h ( ) |

630 views • 30 slides

QUASI-EQUILIBRIUM MONTE-CARLO: OFF-LATTICE KINETIC MONTE CARLO SIMULATION OF HETEROEPITAXY

QUASI-EQUILIBRIUM MONTE-CARLO: OFF-LATTICE KINETIC MONTE CARLO SIMULATION OF HETEROEPITAXY WITHOUT SADDLE POINTS Henry A. Boateng University of Michigan, Ann Arbor Joint work with Tim Schulze and Peter Smereka Support from NSF FRG grant

610 views • 30 slides

Example: Monte Carlo Simulation Marco Chiarandini (marco@imada.sdu.dk) Department of Mathematics

FF505/FY505 Computational Science Example: Monte Carlo Simulation Marco Chiarandini (marco@imada.sdu.dk) Department of Mathematics and Computer Science (IMADA) University of Southern Denmark Outline Exercise: MC Simul. 1. Exercise: Monte

252 views • 11 slides

Computational Statistical Modeling of Dynamic Socioeconomic, Geopolitical and Financial Systems NYU

Computational Statistical Modeling of Dynamic Socioeconomic, Geopolitical and Financial Systems NYU Courant Institute of Mathematical Sciences Applied Mathematics Advanced Topics Course Michael Kwak April 3, 2012 Empirical Study of gammaPoisson

809 views • 42 slides