SLIDE 1
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Tang Shangqi Guo Tian Tan Xiaolin Hu Feng Chen
Subgoals in Hierarchical Reinforcement Learning Tianren Tang Tian - - PowerPoint PPT Presentation
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning Tianren Tang Tian Tan Shangqi Guo Xiaolin Hu Feng Chen Background Goal-Conditional HRL High policy suffers from non-stationary problem From MARL's
Tianren Tang Shangqi Guo Tian Tan Xiaolin Hu Feng Chen
is sub-goal for low policy usually unreachable
where 𝜐∗ = (𝑡0. . . 𝑡𝑈𝐿), 𝜍∗ = (0. . . (𝑈−1)𝐿)