From Nesterovโs Estimate Sequence To Riemannian Acceleration
Kwangjun Ahn, Suvrit Sra COLT 2020 arXiv: https://arxiv.org/abs/2001.08876
From Nesterovs Estimate Sequence To Riemannian Acceleration - - PowerPoint PPT Presentation
From Nesterovs Estimate Sequence To Riemannian Acceleration Kwangjun Ahn, Suvrit Sra COLT 2020 arXiv: https://arxiv.org/abs/2001.08876 Riemannian Optimization? : (Euclidean) Optimization: ()
Kwangjun Ahn, Suvrit Sra COLT 2020 arXiv: https://arxiv.org/abs/2001.08876
๐: โ โ โ ๐: ๐ โ โ
Nesterov showed: ๐ ๐ง ๐ ๐ฆโ ๐ 1
We only need t ๐
๐ด log
Acceleration! For ๐ โผ ๐ผ๐ ๐ฆ โผ ๐ (and indeed optimal for this class!) For ๐ โผ ๐ผ๐ ๐ฆ โผ ๐ ๐ ๐ฆ ๐ ๐ฆโ ๐ 1
We need ๐
๐ด log
C.f. Gradient Descent:
Not clear whether whether these equations are even feasible or tractably solvable.
Not clear whether the discretization yields accel.
not clear if it generalizes to non-linear space like Riem. manifolds.
Hard to understand; its scope has puzzled researchers for years.
(Euclidean) Accel. Gradient Descent:
๐ฆ๐ข+1 ๐ง๐ข ๐ฝ๐ข+1 ๐จ๐ข ๐ง๐ข ๐ง๐ข+1 ๐ฆ๐ข+1 ๐ฟ๐ข+1๐ผ๐ ๐ฆ๐ข+1 ๐จ๐ข+1 ๐ฆ๐ข+1 ๐พ๐ข+1 ๐จ๐ข ๐ฆ๐ข+1 ๐๐ข+1๐ผ๐๐ฆ๐ข+1
โ1 ๐จ๐ข
โ1
โ/ 1โ
๐บ ๐ 2
๐ ๐ฃ ๐ฃ๐ฃ ๐/๐ 1 ๐ฃ ๐ ๐ฃ ๐ฃ2
๐ ๐ฃ 1 2 ๐ฃ2 ๐ ๐ฃ 1 5 ๐ฃ2 ๐ 5
๐๐ข+1๐๐ข+1โ2๐ฮ 1โ๐๐ข+1
๐บ๐+๐ ๐๐ข 2
Find ๐๐ข+1 โ 2๐ฮ, 1 such that
the magnitude of metric distortion at iteration t
where ๐บ๐+๐ ๐ผ๐๐๐, ๐๐ for some computable function ๐.
s.t. 1 ๐๐ข ๐/๐ for all ๐ข. (2) ๐๐ข quickly converges to ๐/๐. quickly acheives ๐ ๐ฏ๐ฆ๐ฆ acceleartion! strictly ๐ ๐๐ญ๐ฎ๐๐ฌ than nonaccel GD!
Remarks
โ Using strongly convex perturbation can be done โ But, extra
โ More crucially, our current proof needs to ensure allโจ