Beyond Online Balanced Descent: An Optimal Algorithm for Smoothed - PowerPoint PPT Presentation

Jan 24, 2024

Beyond Online Balanced Descent: An Optimal Algorithm for Smoothed Online Convex Optimization. Gautam Goel. Based on joint work with Yiheng Lin, Haoyuan Sun, and Adam Wierman. 1 / 7
Portfolio Optimization. Adaptive Control. This talk: how do we design online learning algorithms that adapt to dynamic environments while accounting for switching costs? 2 / 7
Online Convex Optimization (OCO) with one-step lookahead and switching costs. An online learner plays a series of rounds against an adaptive adversary. In the $t$-th round:
1. The adversary chooses an $m$-strongly-convex cost function $f_t : \mathbb{R}^d \to \mathbb{R}_{\ge 0}$.
2. After observing $f_t$, the learner picks a point $x_t \in \mathbb{R}^d$.
3. The online learner pays the hitting cost $f_t(x_t)$ as well as a switching cost $\frac{1}{2}\|x_t - x_{t-1}\|_2^2$, which penalizes the learner for changing its decisions between rounds.
3 / 7
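$$\text{Competitive Ratio} = \sup_{f_1, \ldots, f_T} \frac{\sum_{t=1}^{T} \left( f_t(x_t) + \frac{1}{2}\|x_t - x_{t-1}\|_2^2 \right)}{\underbrace{\min_{x_1, \ldots, x_T} \sum_{t=1}^{T} \left( f_t(x_t) + \frac{1}{2}\|x_t - x_{t-1}\|_2^2 \right)}_{\text{Dynamic optimal solution}}}$$
4 / 7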
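As a rough illustration of the ratio above, the sketch below evaluates it on one fixed instance rather than taking the supremum over all cost sequences. It assumes one-dimensional quadratic hitting costs $f_t(x) = \frac{m}{2}(x - v_t)^2$, a starting point $x_0 = 0$, and a simple greedy learner. None of these choices come from the talk, and all function names are hypothetical.

```python
def total_cost(xs, vs, m, x0=0.0):
    """Sum of hitting costs (m/2)(x_t - v_t)^2 and switching costs (1/2)(x_t - x_{t-1})^2."""
    cost, prev = 0.0, x0
    for x, v in zip(xs, vs):
        cost += 0.5 * m * (x - v) ** 2 + 0.5 * (x - prev) ** 2
        prev = x
    return cost


def greedy_points(vs, m, x0=0.0):
    """Assumed online learner: each round, minimize f_t(x) + (1/2)(x - x_{t-1})^2."""
    xs, prev = [], x0
    for v in vs:
        prev = (m * v + prev) / (m + 1.0)  # closed-form minimizer for quadratic f_t
        xs.append(prev)
    return xs


def offline_optimal_points(vs, m, x0=0.0):
    """Dynamic optimal solution for a non-empty sequence: solve the tridiagonal
    first-order conditions with the Thomas algorithm."""
    n = len(vs)
    diag = [m + 2.0] * n
    diag[-1] = m + 1.0          # the last point has no successor
    rhs = [m * v for v in vs]
    rhs[0] += x0                # x_0 is fixed, so it moves to the right-hand side
    # Forward elimination; every off-diagonal entry equals -1.
    for t in range(1, n):
        w = -1.0 / diag[t - 1]
        diag[t] -= w * -1.0
        rhs[t] -= w * rhs[t - 1]
    # Back substitution.
    xs = [0.0] * n
    xs[-1] = rhs[-1] / diag[-1]
    for t in range(n - 2, -1, -1):
        xs[t] = (rhs[t] + xs[t + 1]) / diag[t]
    return xs


# One illustrative instance: minimizers that alternate between +1 and -1.
vs = [1.0, -1.0, 1.0, -1.0]
m = 1.0
alg = total_cost(greedy_points(vs, m), vs, m)
opt = total_cost(offline_optimal_points(vs, m), vs, m)
ratio = alg / opt  # cost ratio on this instance only, not the competitive ratio
```

The ratio on a single instance only lower-bounds the learner's competitive ratio, since the definition takes the worst case over every admissible sequence $f_1, \ldots, f_T$.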
Online Balanced Descent (OBD) Key idea #1: Project onto level sets (otherwise you incur extra switching cost!). Key idea #2: Pick level set so that switching cost ≈ hitting cost. 5 / 7
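The two key ideas can be sketched for a special case. The example assumes quadratic hitting costs $f_t(x) = \frac{m}{2}\|x - v_t\|_2^2$, whose level sets are balls centered at $v_t$, so projecting $x_{t-1}$ onto a level set moves it along the segment toward $v_t$. The balance parameter `beta` and the function name are hypothetical, and this is a simplified illustration under those assumptions, not necessarily the exact OBD procedure from the paper.

```python
import math


def balanced_step(x_prev, v, m, beta=1.0):
    """Project x_prev onto the level set of f(x) = (m/2)||x - v||^2 whose radius r
    makes the switching cost equal to beta times the hitting cost."""
    dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(x_prev, v)))
    if dist == 0.0:
        return list(x_prev)  # already at the minimizer; no movement needed
    # At this radius, (1/2)(dist - r)^2 == beta * (m/2) * r^2.
    r = dist / (1.0 + math.sqrt(beta * m))
    scale = r / dist
    return [b + scale * (a - b) for a, b in zip(x_prev, v)]


x_prev, v, m = [0.0, 0.0], [3.0, 4.0], 1.0
x_new = balanced_step(x_prev, v, m)
hitting = 0.5 * m * sum((a - b) ** 2 for a, b in zip(x_new, v))
switching = 0.5 * sum((a - b) ** 2 for a, b in zip(x_new, x_prev))
```

With `beta = 1.0` the two costs coincide on this instance; other values of `beta` trade hitting cost against switching cost.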
Theorem (Goel, Lin, Sun, Wierman ’19). Suppose the hitting cost functions are $m$-strongly convex with respect to the $\ell_2$ norm and the switching cost is given by $c(x_t, x_{t-1}) = \frac{1}{2}\|x_t - x_{t-1}\|_2^2$. Any online algorithm must have a competitive ratio of at least $\frac{1}{2}\left(1 + \sqrt{1 + \frac{4}{m}}\right)$. A modified version of OBD, called Regularized-OBD (R-OBD), exactly achieves the optimal competitive ratio $\frac{1}{2}\left(1 + \sqrt{1 + \frac{4}{m}}\right)$. 6 / 7
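To get a feel for how the bound in the theorem depends on $m$, the short sketch below simply evaluates the expression $\frac{1}{2}\left(1 + \sqrt{1 + \frac{4}{m}}\right)$. The function name is hypothetical and the sample values of $m$ are arbitrary.

```python
import math


def competitive_ratio_lower_bound(m):
    """Evaluate (1/2) * (1 + sqrt(1 + 4/m)) for a strong-convexity parameter m > 0."""
    if m <= 0:
        raise ValueError("m must be positive")
    return 0.5 * (1.0 + math.sqrt(1.0 + 4.0 / m))


# Arbitrary sample values of m, from weakly to strongly convex hitting costs.
bounds = {m: competitive_ratio_lower_bound(m) for m in (0.01, 0.5, 1.0, 100.0)}
```

The expression decreases toward 1 as $m$ grows and behaves like $1/\sqrt{m}$ as $m \to 0$, so weakly convex hitting costs force a larger competitive ratio.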
Thanks for listening! See poster #50 at 5pm today. Gautam Goel, Yiheng Lin, Haoyuan Sun, Adam Wierman.
Connections to statistics and control: An Online Algorithm for Smoothed Regression and LQR Control [Goel and Wierman, AISTATS ’19].
Non-convex cost functions: Online Optimization with Predictions and Non-convex Losses [Lin, Goel, and Wierman, arXiv:1911.03827].
7 / 7
