Preference-Based Bayesian Optimization in High Dimensions with Human - - PowerPoint PPT Presentation

▶

Feb 06, 2024 40 likes •71 views

Preference-Based Bayesian Optimization in High Dimensions with Human Feedback Myra Cheng, Ellen Novoseller, Maegan Tucker, Richard Cheng, Joel Burdick, Yisong Yue California Institute of Technology At every iteration: LineCoSpar Algorithm

SLIDE 1

Preference-Based Bayesian Optimization in High Dimensions with Human Feedback

Myra Cheng, Ellen Novoseller, Maegan Tucker, Richard Cheng, Joel Burdick, Yisong Yue

California Institute of Technology

SLIDE 2

LineCoSpar Algorithm

Gaussian process-based

model of the underlying utilities

Iteratively update the

posterior from preference feedback

Learn in high dimensions

via 1-D subspaces

Bayesian Preference Model of utilities

ver 1-D subspace

and visited actions Human user Pairwise preferences and coactive feedback Actions selected via posterior sampling

At every iteration:

SLIDE 3

Preference-Based Bayesian Optimization in High Dimensions with Human - - PowerPoint PPT Presentation

Preference-Based Bayesian Optimization in High Dimensions with Human Feedback

Myra Cheng, Ellen Novoseller, Maegan Tucker, Richard Cheng, Joel Burdick, Yisong Yue

California Institute of Technology

LineCoSpar Algorithm

model of the underlying utilities

posterior from preference feedback

via 1-D subspaces

At every iteration:

Validated in User Studies

Cartpole Simulation (4-D) Wearable Exoskeleton (6-D)