Monte Carlo Tree Search for Algorithm Configuration: MOSAIC
Herilalaina Rakotoarison and Michèle Sebag
TAU, CNRS - INRIA - LRI - Université Paris-Sud
NeurIPS MetaLearning Workshop, Dec. 8, 2018
◮ Gradually grow the search tree
◮ Building blocks
    ◮ Select next action (bandit-based phase)
    ◮ Add a node (leaf of the search tree)
    ◮ Select next action again (random phase)
    ◮ Compute instant reward
    ◮ Update information in visited nodes
◮ Returned solution
    ◮ Path visited most often
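The building blocks above can be sketched in a few lines of Python. This is a minimal, generic illustration and not the MOSAIC implementation: the names `Node`, `ucb_select`, and `mcts`, the UCB1 selection rule, the exploration constant `c`, and the fixed-depth action sequence are all assumptions made for this example. The reward function is supplied by the caller; in an algorithm-configuration setting it would presumably be the validation score of the configured pipeline.

```python
import math
import random


class Node:
    """One node of the search tree (hypothetical helper for this sketch)."""

    def __init__(self, parent=None):
        self.parent = parent
        self.children = {}       # action -> Node
        self.visits = 0
        self.total_reward = 0.0


def ucb_select(node, c):
    """Bandit-based phase: pick the child action maximizing a UCB1 score."""
    def score(child):
        mean = child.total_reward / child.visits
        return mean + c * math.sqrt(math.log(node.visits) / child.visits)
    return max(node.children, key=lambda a: score(node.children[a]))


def mcts(actions, depth, reward_fn, n_iter=1000, c=1.0, seed=0):
    """Grow the tree for n_iter tree-walks; return the most visited path."""
    rng = random.Random(seed)
    root = Node()
    for _ in range(n_iter):
        node, path = root, []
        # 1. Select next action (bandit-based phase) while fully expanded.
        while len(path) < depth and len(node.children) == len(actions):
            a = ucb_select(node, c)
            node = node.children[a]
            path.append(a)
        # 2. Add a node (new leaf of the search tree).
        if len(path) < depth:
            untried = [a for a in actions if a not in node.children]
            a = rng.choice(untried)
            child = Node(parent=node)
            node.children[a] = child
            node = child
            path.append(a)
        # 3. Select next actions again (random phase) until the path is complete.
        rollout = list(path)
        while len(rollout) < depth:
            rollout.append(rng.choice(actions))
        # 4. Compute instant reward.
        reward = reward_fn(rollout)
        # 5. Update information in visited nodes.
        while node is not None:
            node.visits += 1
            node.total_reward += reward
            node = node.parent
    # Returned solution: the path visited most often.
    node, best = root, []
    while node.children:
        a = max(node.children, key=lambda a: node.children[a].visits)
        node = node.children[a]
        best.append(a)
    return best
```

As a toy check, calling `mcts([0, 1], depth=3, reward_fn=lambda p: sum(p) / len(p))` rewards paths with more ones, so the most visited path should concentrate on the all-ones sequence. Returning the most visited path rather than the path with the highest mean reward follows the slide and tends to be less sensitive to a few lucky rollouts.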