iterative linearized control stable algorithms and
play

Iterative Linearized Control: Stable Algorithms and Complexity - PowerPoint PPT Presentation

Iterative Linearized Control: Stable Algorithms and Complexity Guarantees Vincent Roulet, Dmitriy Drusvyatskiy, Siddhartha Srinivasa, Zaid Harchaoui ICML 2019 1 / 4 Problem Nonlinear control Iterative linearization (ILQR) around


  1. Iterative Linearized Control: Stable Algorithms and Complexity Guarantees Vincent Roulet, Dmitriy Drusvyatskiy, Siddhartha Srinivasa, Zaid Harchaoui ICML 2019 1 / 4

  2. Problem Nonlinear control → Iterative linearization (ILQR) − around current x t , u t T T � � � y ⊤ t H t y t + v ⊤ � � min t G t v t � min h t ( x t ) + g t ( u t ) v 0 ,..., v T − 1 u 0 ,..., u T − 1 y 0 ,... y T t = 0 x 0 ,... x T t = 0 s.t. y t + 1 = Φ t , x y t + Φ t , u v t s.t. x t + 1 = φ t ( x t , u t ) = 0 y 0 = ˆ x 0 x 0 → Next iterate u + t = u t + v ∗ t Questions 1. Does ILQR converge? Can it be accelerated? 2. How do we characterize complexities for nonlinear control? 2 / 4

  3. Contributions Regularized and Accelerated ILQR 1. ILQR is Gauss-Newton → Regularized ILQR gets convergence to a stationary point 2. Potential acceleration by extrapolation steps → Accelerated ILQR akin to Catalyst acceleration ILQR RegILQR AccILQR Cost Iteration 3 / 4

  4. Contributions Oracles complexities 1. Oracles are solved by dynamic programming → Gradient and Gauss-Newton have both cost in O ( T ) 2. Automatic-differentiation software libraries available → Use auto.-diff. as oracle for direct implementation Code summary available at https://github.com/vroulet/ilqc dynamics , cost = define_ctrl_pb () ctrl = rand(dim_ctrl) auto_diff_oracle = define_auto_diff_oracle(ctrl , dynamics) dual_sol = sovle_dual_step(ctrl , cost , auto_diff_oracle ) next_ctrl = get_primal(dual_sol , auto_diff_oracle , cost) Come see Poster #39 in Pacific Ballroom! 4 / 4

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend