Calibrated Model-Based Deep Reinforcement Learning
Ali Malik*, Volodymyr Kuleshov*, Jiaming Song, Danny Nemer, Harlan Seymour, Stefano Ermon June 13, 2019
IC ML 2019
*equal contribution
Calibrated Model-Based Deep Reinforcement Learning IC ML 2019 Ali - - PowerPoint PPT Presentation
Calibrated Model-Based Deep Reinforcement Learning IC ML 2019 Ali Malik*, Volodymyr Kuleshov*, Jiaming Song, Danny Nemer, Harlan Seymour, Stefano Ermon June 13, 2019 *equal contribution Overview Importance of predictive uncertainty
*equal contribution
Kahn et al. (2018) Chua et al. (2018)
Berkenkamp et al. (2017)
Saria (2018) Heckerman et al. (1989) Safe exploration Diagnosis, risk prediction, treatment recommendation. Obstacle avoidance, reward planning
Smith & Cheeseman (1986) McAllister et al. (2017) Segmentation, object detection, depth estimation.
Auer et al. (2002) Li et al. (2010) Balancing exploration and exploitation
100
%
100
%
100
%
100
%
Predictive distributions should be focused i.e have low variance Uncertainty should be empirically accurate i.e. true value should fall in a p% confidence interval p% of the time
75
%
75
%
75
%
75
%
%
Forecaster
%
%
%
Forecaster
%
%
%
Uncalib. reward
New Forecast
Input
Forecast
what model predicts what data says
… …
X (p))
Reward: Sales revenue, minus shipment costs.