Scalable Global Optimization via Local Bayesian Optimization



SLIDE 1

Scalable Global Optimization via Local Bayesian Optimization

David Eriksson (Uber AI, eriksson@uber.com), Michael Pearce, Jake Gardner, Ryan Turner, Matthias Poloczek

SLIDE 2

Global Optimization

Find 𝑦∗ ∈ Ω such that 𝑔(𝑦∗) ≤ 𝑔(𝑦), ∀𝑦 ∈ Ω

  • 𝑔 is a continuous, computationally expensive, black-box function
  • Ω ⊂ ℝᵈ is a hyper-rectangle

Example applications: design of aerodynamic structures; planning and control
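To make the problem statement concrete, here is a minimal sketch in Python: a toy quadratic stands in for the expensive black-box 𝑔, the domain is the unit hypercube, and plain random search serves as a naive global baseline. All names and values here are illustrative, not from the talk.

```python
import numpy as np

def g(y):
    """Toy stand-in for an expensive black-box objective (illustrative)."""
    return np.sum((y - 0.3) ** 2)

# Hyper-rectangle domain: Omega = [0, 1]^d
d = 5
lb, ub = np.zeros(d), np.ones(d)

# Random search: the simplest global optimizer over Omega
rng = np.random.default_rng(0)
candidates = rng.uniform(lb, ub, size=(1000, d))
values = np.array([g(y) for y in candidates])
y_best = candidates[values.argmin()]
print(values.min())
```

In practice each call to 𝑔 might take minutes or hours, which is why sample-efficient methods like BO are used instead of random search.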

slide-3
SLIDE 3

Bayesian Optimization (BO)

Common restrictions:

  • A few hundred evaluations
  • Less than 10 tunable parameters

[Figure: true function, GP posterior sample, observed points, and the next point to evaluate]
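The BO loop illustrated on this slide can be written out as follows. The 1-D test function, RBF kernel length-scale, and lower-confidence-bound acquisition rule are all illustrative choices for the sketch, not the talk's exact setup.

```python
import numpy as np

def f(x):
    """Hypothetical 1-D test function standing in for the black box."""
    return np.sin(3 * x) + 0.5 * x

def rbf(a, b, ls=0.3):
    """Squared-exponential kernel between two 1-D point sets."""
    return np.exp(-0.5 * (a[:, None] - b[None, :]) ** 2 / ls**2)

def gp_posterior(X, y, Xs, noise=1e-5):
    """GP posterior mean and standard deviation at test points Xs."""
    K = rbf(X, X) + noise * np.eye(len(X))
    Ks = rbf(X, Xs)
    mu = Ks.T @ np.linalg.solve(K, y)
    var = np.diag(rbf(Xs, Xs) - Ks.T @ np.linalg.solve(K, Ks))
    return mu, np.sqrt(np.maximum(var, 1e-12))

rng = np.random.default_rng(0)
X = rng.uniform(0, 2, 3)            # small initial design
y = f(X)
grid = np.linspace(0, 2, 201)       # candidate set over the domain
for _ in range(10):                 # BO loop: fit model, pick point, evaluate
    mu, sd = gp_posterior(X, y, grid)
    lcb = mu - 2.0 * sd             # lower confidence bound (minimization)
    x_next = grid[lcb.argmin()]
    X = np.append(X, x_next)
    y = np.append(y, f(x_next))
print(y.min())
```

The acquisition step is what balances exploring high-uncertainty regions against exploiting low posterior-mean regions.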

SLIDE 4

(Animation frame repeating Slide 3.)

SLIDE 5

(Animation frame repeating Slide 3.)

SLIDE 6

High-dimensional BO is challenging

Challenges:

  1. The search space grows exponentially with dimensionality
  2. A global GP model may not fit the data everywhere
  3. Large areas of uncertainty lead to over-exploration

Previous work makes strong assumptions:

  • Additive structure
  • Low-dimensional structure
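A quick illustration of challenge 1: even a coarse grid of 10 points per axis becomes astronomically large as the dimension grows, so covering the search space by sampling is hopeless.

```python
# Number of points in a grid with 10 points per axis, as dimension grows.
points_per_axis = 10
for d in (2, 10, 20, 200):
    print(d, points_per_axis ** d)
```

At d = 200 (the Ackley experiment later in the talk) the grid has 10^200 points, far beyond any evaluation budget.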
SLIDE 7

Trust-region methods

Main idea:

  • Optimize a (simple) model in a local region
  • Expand/shrink this region based on progress
  • Only requires a locally accurate model

Local model choices: linear (e.g. COBYLA), quadratic (e.g. BOBYQA), GP (TuRBO, this paper)
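The expand/shrink rule can be sketched as a success/failure-counter update; the specific thresholds and factor-of-2 resizing below follow a TuRBO-style scheme but should be read as illustrative defaults rather than the paper's exact implementation.

```python
def update_trust_region(length, success_counter, failure_counter,
                        succ_tol=3, fail_tol=5,
                        length_min=0.5**7, length_max=1.6):
    """Expand the region side length after repeated successes, shrink it
    after repeated failures, and signal a restart once it collapses below
    length_min (illustrative thresholds)."""
    if success_counter >= succ_tol:       # consistent progress: expand
        length = min(2.0 * length, length_max)
        success_counter = 0
    elif failure_counter >= fail_tol:     # stalled: shrink
        length = length / 2.0
        failure_counter = 0
    restart = length < length_min         # region too small: restart elsewhere
    return length, success_counter, failure_counter, restart
```

For example, starting from side length 0.8, three consecutive successes double it to 1.6, while five consecutive failures halve it to 0.4.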

SLIDE 8

Trust-region BO (TuRBO)

  1. Avoids over-exploration by using a trust-region framework
  2. Balances exploration/exploitation by using BO inside the trust region
  3. Uses Thompson sampling to scale to large batch sizes

[Figure: trust-region update, showing the GP model and the true function]
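Point 3 (Thompson sampling for batches) can be sketched as follows: draw one joint posterior sample per batch slot over a candidate set inside the trust region, and take each sample's minimizer as a batch member. The posterior below is a hypothetical stand-in for a real GP, with the variance kept tiny so the demo is predictable.

```python
import numpy as np

def thompson_batch(mu, cov, batch_size, rng):
    """Select a batch by sampling from the posterior over the candidate set
    and taking each sample's minimizer (duplicates possible; illustrative)."""
    samples = rng.multivariate_normal(mu, cov, size=batch_size)
    return samples.argmin(axis=1)   # one candidate index per posterior sample

rng = np.random.default_rng(0)
# Hypothetical GP posterior over 5 candidate points inside the trust region
mu = np.array([0.0, -1.0, 0.5, -0.5, 1.0])
cov = 1e-4 * np.eye(5)              # near-deterministic posterior for the demo
idx = thompson_batch(mu, cov, batch_size=4, rng=rng)
```

Because each batch member comes from an independent posterior sample, the batch naturally mixes exploration and exploitation without any extra acquisition machinery, which is what lets TuRBO scale to batch sizes of 50 to 100.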

SLIDE 9

Experimental results

  • Robot pushing: 10,000 evaluations, batch size 50
  • Rover trajectory planning: 20,000 evaluations, batch size 100

SLIDE 10

Experimental results

200D Ackley function: 10,000 evaluations, batch size 100

[Plot: best objective value found vs. number of evaluations]

Methods compared: TuRBO-1, Thompson, BOCK, Bohamiann, HeSBO, CMA-ES, BOBYQA, Nelder-Mead, BFGS, Random
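For reference, the Ackley benchmark used in this experiment can be written as below. This is the standard textbook definition with the usual constants; the talk's exact domain and scaling may differ.

```python
import numpy as np

def ackley(x, a=20.0, b=0.2, c=2 * np.pi):
    """Standard Ackley test function; global minimum is f(0) = 0."""
    x = np.asarray(x, dtype=float)
    d = x.size
    return (-a * np.exp(-b * np.sqrt(np.sum(x**2) / d))
            - np.exp(np.sum(np.cos(c * x)) / d) + a + np.e)

print(ackley(np.zeros(200)))   # global minimum, 0 up to floating-point error
```

Ackley is highly multimodal with a nearly flat outer region, so at d = 200 it stresses exactly the over-exploration failure mode that TuRBO is designed to avoid.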

SLIDE 11

Summary

TuRBO:

  • Achieves excellent results for high-dimensional problems
  • Combines BO with trust-regions to avoid over-exploration
  • Makes no assumptions about low-dimensional structure

Paper: https://arxiv.org/abs/1910.01739
Code: https://github.com/uber-research/TuRBO
Poster #9