safe bayesian optimization for optimal control
play

Safe Bayesian Optimization for Optimal Control Schillinger, M., - PowerPoint PPT Presentation

Safe Bayesian Optimization for Optimal Control Schillinger, M., Hartmann, B., Skalecki, P., Meister, M., Nguyen-Tuong, D., & Nelles, O. (2017). Safe Active Learning and Safe Bayesian Optimization for Tuning a PI-Controller. IFAC-PapersOnLine ,


  1. Safe Bayesian Optimization for Optimal Control Schillinger, M., Hartmann, B., Skalecki, P., Meister, M., Nguyen-Tuong, D., & Nelles, O. (2017). Safe Active Learning and Safe Bayesian Optimization for Tuning a PI-Controller. IFAC-PapersOnLine , 50 (1), 5967-5972. Sui, Y., Gotovos, A., Burdick, J., & Krause, A. (2015, June). Safe exploration for optimization with Gaussian Processes. In International Conference on Machine Learning (pp. 997-1005). Wachi, A., Sui, Y., Yue, Y., & Ono, M. (2018). Safe Exploration and Optimization of Constrained MDPs using Gaussian Processes. In AAAI Conference on Artificial Intelligence (AAAI). Antonio Candelieri, Statistics for Big Data and Machine Learning, Cardiff, 6-8 November 2018 1

  2. Safety concept ❑ Starting from some initial safe points (e.g. machinery settings): ❑ Search for the optimum (e.g. maximum of some KPI) … ❑ … avoiding to violate a given (safety) threshold (e.g. a minimum performance level) Antonio Candelieri, Statistics for Big Data and Machine Learning, Cardiff, 6-8 November 2018 2

  3. Safety concept ❑ Use a probabilistic model of the objective function to: ❑ Expand «safe region» safely ❑ While searching for the optimum ❑ Parameters of the probabilitic model have to be set up properly, otherwise: ❑ We might violate the safety threshold ❑ We could be unable to expand the safe region Antonio Candelieri, Statistics for Big Data and Machine Learning, Cardiff, 6-8 November 2018 3

  4. Matern_3_2 Matern_5_2 Exponential Squared-exponential Antonio Candelieri, Statistics for Big Data and Machine Learning, Cardiff, 6-8 November 2018 4

  5. Lipschitz-dependent Squared Exponential ❑ Most of the real world systems (e.g. industrial systems, are characterized by Lipschitz continuous objective functions) ❑ Knowledge about the Lipschitz constant allows for a proper set up of the probabilistic model (in particular its kernel aka covariance function) ❑ Lipschitz constant could be ❑ known a priori (or at least a good estimation) ❑ Inferred during the safe optimization process Antonio Candelieri, Statistics for Big Data and Machine Learning, Cardiff, 6-8 November 2018 5

  6. Application domains • Manufacturing processes • Control of complex systems (e.g. water/energy/oil&gas supply networks) • Design of experiments • Clinical studies – therapy design Antonio Candelieri, Statistics for Big Data and Machine Learning, Cardiff, 6-8 November 2018 6

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend