direct runge kutta discretization achieves acceleration
play

Direct Runge-Kutta Discretization Achieves Acceleration Jingzhao - PowerPoint PPT Presentation

Direct Runge-Kutta Discretization Achieves Acceleration Jingzhao Zhang , Aryan Mokhtari, Suvrit Sra, Ali Jadbabaie NeurIPS 2018 This work is supported by DARPA Lagrange Program under grant No. FA 8650-18-2-7838 Acceleration in first order


  1. Direct Runge-Kutta Discretization Achieves Acceleration Jingzhao Zhang , Aryan Mokhtari, Suvrit Sra, Ali Jadbabaie NeurIPS 2018 This work is supported by DARPA Lagrange Program under grant No. FA 8650-18-2-7838

  2. Acceleration in first order convex optimization Optimize smooth convex function: 2

  3. Acceleration in first order convex optimization Optimize smooth convex function: Gradient Descent: 3

  4. Acceleration in first order convex optimization Optimize smooth convex function: Gradient Descent: 4

  5. Acceleration in first order convex optimization Optimize smooth convex function: Gradient Descent:

  6. Acceleration in first order convex optimization Optimize smooth convex function: Gradient Descent: Accelerated Gradient Descent [Nesterov 1983]:

  7. Acceleration in first order convex optimization Optimize smooth convex function: Gradient Descent: Accelerated Gradient Descent [Nesterov 1983]: [SBC 2015] [SBC 2015] Su, Weijie, Stephen Boyd, and Emmanuel Candes. "A differential equation for modeling Nesterov’s accelerated gradient method: Theory and insights." Advances in Neural Information Processing Systems . 2014. 7

  8. Acceleration in first order convex optimization Optimize smooth convex function: Gradient Descent: Accelerated Gradient Descent [Nesterov 1983]: [SBC 2015] [SBC 2015] Su, Weijie, Stephen Boyd, and Emmanuel Candes. "A differential equation for modeling Nesterov’s accelerated gradient method: Theory and insights." Advances in Neural Information Processing Systems . 2014. 8

  9. Convergence in continuous time 9

  10. Convergence in continuous time Arbitrary acceleration [WWJ 2016] by change of variable [WWJ 2016] Wibisono, A., Wilson, A. C., & Jordan, M. I. (2016). A variational perspective on accelerated methods in optimization. Proceedings of the National Academy of Sciences , 113 (47), E7351-E7358. 10

  11. Convergence in continuous time Arbitrary acceleration [WWJ 2016] by change of variable However, smooth convex optimization algorithms cannot achieve faster rate than: [WWJ 2016] Wibisono, A., Wilson, A. C., & Jordan, M. I. (2016). A variational perspective on accelerated methods in optimization. Proceedings of the National Academy of Sciences , 113 (47), E7351-E7358. 11

  12. Question: How to relate the convergence rate in continuous time ODE to the convergence rate of a discrete optimization algorithm? 12

  13. Question: How to relate the convergence rate in continuous time ODE to the convergence rate of a discrete optimization algorithm? Our approach: Discretize the ODE with known Runge-Kutta integrators (e.g. Euler, midpoint, RK44) and provide theoretical guarantees for convergence rates. 13

  14. Main theorem: For a p-flat, (s+2)-differentiable convex function, if we discretize the ODE with order-s Runge-Kutta integrator, we have

  15. Main theorem: For a p-flat, (s+2)-differentiable convex function, if we discretize the ODE with order-s Runge-Kutta integrator, we have p-flat: 15

  16. Main theorem: For a p-flat, (s+2)-differentiable convex function, if we discretize the ODE with order-s Runge-Kutta integrator, we have p-flat: Order-s: Discretization error scales as , h is the step size . 16

  17. Main theorem: For a p-flat, (s+2)-differentiable convex function, if we discretize the ODE with order-s Runge-Kutta integrator, we have p-flat: Objective Integrator Rate L-smooth (p=2) RK44 (s=4) (p=4) Midpoint(s=2) Order-s: Discretization error scales as ; h is the step size. 17

  18. Our poster session: Thu Dec 6th 05:00 -- 07:00 PM Room 210 & 230 AB Poster Number: 9 18

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend