open problem parameter free and scale free online
play

Open Problem: Parameter-Free and Scale-Free Online Algorithms - PowerPoint PPT Presentation

Open Problem: Parameter-Free and Scale-Free Online Algorithms Francesco Orabona D avid P al Yahoo Research, New York June 25, 2016 COLT 2016 Online Linear Optimization Given a convex set K R N For t = 1, 2, . . . predict w t


  1. Open Problem: Parameter-Free and Scale-Free Online Algorithms Francesco Orabona D´ avid P´ al Yahoo Research, New York June 25, 2016 COLT 2016

  2. Online Linear Optimization Given a convex set K ⊆ R N For t = 1, 2, . . . • predict w t ∈ K • receive loss vector ℓ t ∈ R N • suffer loss � ℓ t , w t � T T ∑ ∑ Regret T ( u ) = � ℓ t , w t � − � ℓ t , u � t = 1 t = 1 � �������� �� �������� � � ������ �� ������ � algorithm’s loss competitor’s loss We focus on: 1 K = R N 2 K = ∆ N = { x ∈ R N : x ≥ 0, � x � 1 = 1 }

  3. Two Types of Adaptivity 1 Adaptivity to competitor u (parameter-free, quantile bounds, ...) 2 Adaptivity to scale of ℓ 1 , ℓ 2 , . . . , ℓ T (scale-free, second-order bounds, ...) Open Problem (Informal) Design efficient doubly adaptive algorithms.

  4. FTRL Bound Theorem (CBL’06, SS’11) If R : K → R is a non-negative 1 -strongly convex function w.r.t. �·� , then FTRL with regularizer R and learning rate η > 0 satisfies T Regret T ( u ) ≤ R ( u ) � ℓ t � 2 ∑ ∀ u ∈ K + η ∗ η t = 1 � R ( u ) / ∑ T t = 1 � ℓ t � 2 With learning rate η = ∗ � � T � � ℓ t � 2 ∑ Regret T ( u ) ≤ � R ( u ) ∗ t = 1 Two cheats 1 Bound holds only for fixed u t = 1 � ℓ t � 2 2 Need to know ∑ T ∗

  5. Existing Results for K = ∆ N � � 1 Regularizer R ( u ) = D ( u � π ) sup u ∈ ∆ N R ( u ) = max i ln π i 1 For any ℓ 1 , ℓ 2 , . . . , ℓ T ∈ R N [deREGK’11, OP’15] � � 1 � T � � � ℓ t � 2 ∑ Regret T ( u ) ≤ � max ln ∞ i π i t = 1 2 Assuming that � ℓ t � ∞ ≤ 1 [CFH’09, CV’10, LS’14, LS’15, KE’15, FRS’15, OP’16] � Regret T ( u ) ≤ T ( 1 + D ( u � π )) 3 For any ℓ 1 , ℓ 2 , . . . , ℓ T ∈ R N [FRS’15+OP’15] � � T � � ℓ t � 2 ∑ Regret T ( u ) ≤ � ( 1 + D ( u � π )) ∞ t = 1 O ( N max i log log 1 π i ) memory and time per round

  6. Existing Results for K = R N 2 � u � 2 Regularizer R ( u ) = 1 sup u ∈ R N R ( u ) = + ∞ 2 1 For any ℓ 1 , ℓ 2 , . . . , ℓ T ∈ R N [OP’15] � � √ T � � � 1 + � u � 2 � ℓ t � 2 ∑ Regret T ( u ) ≤ 2 + 1 ≤ t ≤ T � ℓ t � 2 � T max 2 t = 1 2 Assuming that � ℓ t � 2 ≤ 1 [SM’12, O’13, MA’13, MO’14, O’14, OP’16] � � � 1 + � u � 1 Regret T ( u ) ≤ T log ( 1 + � u � 2 ) 2

  7. Open Problems Open Problem #1 Find an algorithm for K = ∆ N with O ( N ) per-round time complexity such that for any π ∈ ∆ N and any ℓ 1 , ℓ 2 , . . . , ℓ T � � T � � ℓ t � 2 ∑ ∀ u ∈ ∆ N Regret T ( u ) ≤ � ( 1 + D ( u � π )) ∞ t = 1 Open Problem #2 — Reward $100 for positive solution Find an algorithm for K = R N with O ( N ) per-round time complexity such that for any ℓ 1 , ℓ 2 , . . . , ℓ T � � T � � ℓ t � 2 ∀ u ∈ R N ∑ Regret T ( u ) ≤ ( 1 + � u � 2 · polylog ( 1 + � u � 2 )) � 2 t = 1

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend