Monte Carlo Tree Search guided by Symbolic Advice for MDPs
Damien Busatto-Gaston, Debraj Chakraborty and Jean-Francois Raskin
Université Libre de Bruxelles
September 16, 2020 HIGHLIGHTS 2020
1/13
Monte Carlo Tree Search guided by Symbolic Advice for MDPs Damien - - PowerPoint PPT Presentation
Monte Carlo Tree Search guided by Symbolic Advice for MDPs Damien Busatto-Gaston, Debraj Chakraborty and Jean-Francois Raskin Universit Libre de Bruxelles September 16, 2020 HIGHLIGHTS 2020 1/13 Markov Decision Process 1 s 0 4 a 1 a 2 1
Université Libre de Bruxelles
1/13
2 3 1 3
1 2 1 2 3 4 1 4
a1
2 3
a3
1 2
2/13
2 3 1 3
1 2 1 2 3 4 1 4
a1
2 3
a3
1 2
2/13
3/13
3/13
4
1
3/13
4
1
3/13
4/13
5/13
5/13
5/13
6/13
6/13
6/13
7/13
7/13
7/13
8/13
9/13
10/13
10/13
10/13
11/13
Algorithm % of win % of loss % of no result1 % of food eaten MCTS 17 59 24 67 MCTS+Selection advice 25 54 21 71 MCTS+Simulation advice 71 29 88 MCTS+both advice 85 15 94 Human 44 56 75
1after 300 steps
12/13
13/13
13/13