Preference-Based Bayesian Optimization in High Dimensions with Human - - PowerPoint PPT Presentation

preference based bayesian optimization in high dimensions
SMART_READER_LITE
LIVE PREVIEW

Preference-Based Bayesian Optimization in High Dimensions with Human - - PowerPoint PPT Presentation

Preference-Based Bayesian Optimization in High Dimensions with Human Feedback Myra Cheng, Ellen Novoseller, Maegan Tucker, Richard Cheng, Joel Burdick, Yisong Yue California Institute of Technology At every iteration: LineCoSpar Algorithm


slide-1
SLIDE 1

Preference-Based Bayesian Optimization in High Dimensions with Human Feedback

Myra Cheng, Ellen Novoseller, Maegan Tucker, Richard Cheng, Joel Burdick, Yisong Yue

California Institute of Technology

slide-2
SLIDE 2

LineCoSpar Algorithm

  • Gaussian process-based

model of the underlying utilities

  • Iteratively update the

posterior from preference feedback

  • Learn in high dimensions

via 1-D subspaces

Bayesian Preference Model of utilities

  • ver 1-D subspace

and visited actions Human user Pairwise preferences and coactive feedback Actions selected via posterior sampling

At every iteration:

slide-3
SLIDE 3

Validated in User Studies

Cartpole Simulation (4-D) Wearable Exoskeleton (6-D)