event triggered interactive gradient descent for real
play

Event-Triggered Interactive Gradient Descent for Real-Time - PowerPoint PPT Presentation

Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization Pio Ong and Jorge Cort es Mechanical and Aerospace Engineering University of California, San Diego http://carmenere.ucsd.edu/jorge 56th IEEE Conference on


  1. Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization Pio Ong and Jorge Cort´ es Mechanical and Aerospace Engineering University of California, San Diego http://carmenere.ucsd.edu/jorge 56th IEEE Conference on Decision and Control: Event-Triggered and Self-Triggered Control I Melbourne, Australia December 12-15, 2017

  2. Flying to Australia Options Transit (Hours) Cost (Dollars) 1 10 1100 2 5 1500 3 2 1400 How would a robot know which option is the best? P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 2 / 17

  3. Flying to Australia Options Transit (Hours) Cost (Dollars) 1 10 1100 2 5 1500 3 2 1400 How would a robot know which option is the best? Some options are obviously worse. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 2 / 17

  4. Flying to Australia Options Transit (Hours) Cost (Dollars) 1 10 1100 3 2 1400 How would a robot know which option is the best? Some options are obviously worse. But, we are left with mathematically ambiguous options ( Pareto Solutions ) P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 2 / 17

  5. Flying to Australia Options Transit (Hours) Cost (Dollars) Happiness 1 10 1100 3 2 1400 How would a robot know which option is the best? Some options are obviously worse. But, we are left with mathematically ambiguous options ( Pareto Solutions ) Ask Human! P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 2 / 17

  6. My Talk in One Slide Motivation: Rise of robots that will eventually coexist with human 1 Robot solve a optimization problem to do something 2 Robot becomes more complex, can do more than one thing 3 Scenario: Human interacts with robot to help solve multiobjective optimization problem Robot Accommodate Human: Human cannot be asked too often 1 Human needs some time to answer 2 Approach: Use Event-Trigger Control to minimize human interaction. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 3 / 17

  7. Outline Describing Scenario Problem Statement and Assumptions Our approach: Interactive Gradient Descent Modeling Humans Human needs to rest. Designing Event Trigger Adding Human Response Time Limiting design parameter Wrapping up my Talk Simulations Conclusions P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 4 / 17

  8. Problem and Assumptions Our problem: minimize f ( x ) x ∈ R n with f ( x ) ∈ R m , m objective functions In general, infinite number of Pareto solutions P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 5 / 17

  9. Problem and Assumptions Our problem: minimize f ( x ) x ∈ R n with f ( x ) ∈ R m , m objective functions In general, infinite number of Pareto solutions The human has an implicit cost function, c : R m → R , that ranks them P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 5 / 17

  10. Problem and Assumptions Our problem: minimize f ( x ) x ∈ R n with f ( x ) ∈ R m , m objective functions In general, infinite number of Pareto solutions The human has an implicit cost function, c : R m → R , that ranks them Implicit because the human cannot express what it is 1 P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 5 / 17

  11. Problem and Assumptions Our problem: minimize f ( x ) x ∈ R n with f ( x ) ∈ R m , m objective functions In general, infinite number of Pareto solutions The human has an implicit cost function, c : R m → R , that ranks them Implicit because the human cannot express what it is 1 Human can respond to queries; we assume he can give the gradient 2 P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 5 / 17

  12. Problem and Assumptions Our problem: minimize f ( x ) x ∈ R n with f ( x ) ∈ R m , m objective functions In general, infinite number of Pareto solutions The human has an implicit cost function, c : R m → R , that ranks them Implicit because the human cannot express what it is 1 Human can respond to queries; we assume he can give the gradient 2 Assumptions: To assure there is a unique solution, Each objective function is strictly convex. 1 The implicit function is strictly convex, increasing w.r.t. each objective value. 2 The implicit function is bounded from below and is radially unbounded. 3 P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 5 / 17

  13. Restate the problem What do we mean by solving a multiobjective optimization problem? P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 6 / 17

  14. Restate the problem What do we mean by solving a multiobjective optimization problem? Answer: Find the Pareto solution that the human likes the best. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 6 / 17

  15. Restate the problem What do we mean by solving a multiobjective optimization problem? Answer: Find the Pareto solution that the human likes the best. Problem that we will solve: minimize ( c ◦ f )( x ) x ∈ R n P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 6 / 17

  16. Restate the problem What do we mean by solving a multiobjective optimization problem? Answer: Find the Pareto solution that the human likes the best. Problem that we will solve: minimize ( c ◦ f )( x ) x ∈ R n Scenario: human and robot working together to get the best Pareto solution. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 6 / 17

  17. Restate the problem What do we mean by solving a multiobjective optimization problem? Answer: Find the Pareto solution that the human likes the best. Problem that we will solve: minimize ( c ◦ f )( x ) x ∈ R n Scenario: human and robot working together to get the best Pareto solution. Single objective optimization. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 6 / 17

  18. Restate the problem What do we mean by solving a multiobjective optimization problem? Answer: Find the Pareto solution that the human likes the best. Problem that we will solve: minimize ( c ◦ f )( x ) x ∈ R n Scenario: human and robot working together to get the best Pareto solution. Single objective optimization. No objective function avaliable . Only gradient value available upon requests. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 6 / 17

  19. Interactive Gradient Descent Let’s try gradient descent! x ( t ) = − ( ∇ ( c ◦ f )( x ( t ))) T ˙ Role : What’s the human and robot role in this optimization? P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 7 / 17

  20. Interactive Gradient Descent Let’s try gradient descent! x ( t ) = − ( ∇ ( c ◦ f )( x ( t ))) T ˙ Role : What’s the human and robot role in this optimization? x ( t ) = − ( ∇ c ( f ( x ( t ))) J f ( x ( t ))) T ˙ P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 7 / 17

  21. Interactive Gradient Descent Let’s try gradient descent! x ( t ) = − ( ∇ ( c ◦ f )( x ( t ))) T ˙ Role : What’s the human and robot role in this optimization? ) T x ( t ) = − ( ∇ c ( f ( x ( t ))) ˙ J f ( x ( t )) � �� � � �� � human robot P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 7 / 17

  22. Interactive Gradient Descent Let’s try gradient descent! x ( t ) = − ( ∇ ( c ◦ f )( x ( t ))) T ˙ Role : What’s the human and robot role in this optimization? ) T x ( t ) = − ( ∇ c ( f ( x ( t ))) ˙ J f ( x ( t )) � �� � � �� � human robot Humans cannot update the value continuously! P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 7 / 17

  23. Event-Triggered Interactive Gradient Descent Preferably, only ask for human help only when it really needs to. ) T x ( t ) = − ( ∇ c ( f ( x ( t k ))) ˙ J f ( x ( t )) � �� � � �� � human robot with t k to be determined by the robot iteratively. P. Ong and J. Cort´ es (UCSD) Event-Triggered Interactive Gradient Descent for Real-Time Multiobjective Optimization December 15, 2017 8 / 17

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend