SLIDE 44 Online Adaptation: Adaptive RIP (AdaRIP)
Algorithm 9: Adaptive RIP (AdaRIP) // ``train'': {θk}K
k=1 1 learn from demonstrations with MLE 2 for step in environment do
// ``init'': y sample random plan while note converged do // ``evaluate'': {q(y|x, G; θk)}K
1 3
score plan under imitative model(s) // ``aggregate'': ⊕, mink,
1 K ∑k 4
consolidate evaluations of ensemble // ``plan'': y ← y + η ∂U
∂y 5
improve plan with online SGD // ``adapt'': u(y∗) > τ
6
if epistemically uncertain then
7
query expert
8
update model to reduce uncertainty // ``act'': y∗
9
submit plan to environment
(Normalized) Uncertainty RIP AdaRIP