Non-Stationary Reinforcement Learning
Ruihao Zhu
MIT IDSS
Joint work with Wang Chi Cheung (NUS) and David Simchi-Levi (MIT)
1 / 18
Non-Stationary Reinforcement Learning Ruihao Zhu MIT IDSS Joint - - PowerPoint PPT Presentation
Non-Stationary Reinforcement Learning Ruihao Zhu MIT IDSS Joint work with Wang Chi Cheung (NUS) and David Simchi-Levi (MIT) 1 / 18 Epidemic Control A DM iteratively: 1. Pick a measure to contain the virus. 2. See the corresponding outcome.
1 / 18
2 / 18
2 / 18
3 / 18
4 / 18
4 / 18
4 / 18
5 / 18
5 / 18
5 / 18
6 / 18
6 / 18
6 / 18
6 / 18
7 / 18
8 / 18
8 / 18
8 / 18
8 / 18
9 / 18
9 / 18
10 / 18
10 / 18
10 / 18
10 / 18
10 / 18
11 / 18
11 / 18
12 / 18
12 / 18
12 / 18
12 / 18
12 / 18
13 / 18
13 / 18
13 / 18
13 / 18
13 / 18
13 / 18
13 / 18
14 / 18
14 / 18
14 / 18
14 / 18
14 / 18
15 / 18
15 / 18
15 / 18
15 / 18
15 / 18
15 / 18
16 / 18
1 4
3 4
16 / 18
17 / 18
18 / 18