Transporting iterative algorithms from Euclidean space to manifolds - - PowerPoint PPT Presentation

transporting iterative algorithms from euclidean space to
SMART_READER_LITE
LIVE PREVIEW

Transporting iterative algorithms from Euclidean space to manifolds - - PowerPoint PPT Presentation

Transporting iterative algorithms from Euclidean space to manifolds Jochen Trumpf Jochen.Trumpf@anu.edu.au Department of Information Engineering Research School of Information Sciences and Engineering The Australian National University and


slide-1
SLIDE 1

Transporting iterative algorithms from Euclidean space to manifolds

Jochen Trumpf

Jochen.Trumpf@anu.edu.au

Department of Information Engineering Research School of Information Sciences and Engineering The Australian National University and National ICT Australia Ltd.

Transporting iterative algorithms from Euclidean space to manifolds – p. 1/18

slide-2
SLIDE 2
  • verview

the Newton iteration

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-3
SLIDE 3
  • verview

the Newton iteration parametrisations

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-4
SLIDE 4
  • verview

the Newton iteration parametrisations the new algorithm

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-5
SLIDE 5
  • verview

the Newton iteration parametrisations the new algorithm convergence properties

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-6
SLIDE 6
  • verview

the Newton iteration parametrisations the new algorithm convergence properties an example

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-7
SLIDE 7
  • verview

the Newton iteration parametrisations the new algorithm convergence properties an example general iterates

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-8
SLIDE 8
  • verview

the Newton iteration parametrisations the new algorithm convergence properties an example general iterates joint work with J. Manton

Transporting iterative algorithms from Euclidean space to manifolds – p. 2/18

slide-9
SLIDE 9

the Newton iteration

Newton’s method

xk+1 = xk − {Hess f(xk)}−1 grad f(xk), x0 ∈ Rn

is an iteration

xk+1 = N(f)(xk), x0 ∈ Rn

which is defined for any twice differentiable function f : Rn −

→ R.

Transporting iterative algorithms from Euclidean space to manifolds – p. 3/18

slide-10
SLIDE 10

the Newton iteration

The sequence

xk = {N(f)}k (x0)

it generates converges locally quadratic to non-degenerate critical points of f.

Transporting iterative algorithms from Euclidean space to manifolds – p. 4/18

slide-11
SLIDE 11

the Newton iteration

The sequence

xk = {N(f)}k (x0)

it generates converges locally quadratic to non-degenerate critical points of f. In particular, it converges locally to any (isolated) strict local maximum of f.

Transporting iterative algorithms from Euclidean space to manifolds – p. 4/18

slide-12
SLIDE 12

parametrisations

Sometimes a to be maximised function is not naturally defined on an Rn but rather on some smooth manifold (curved space), e.g. the sphere.

Transporting iterative algorithms from Euclidean space to manifolds – p. 5/18

slide-13
SLIDE 13

parametrisations

Sometimes a to be maximised function is not naturally defined on an Rn but rather on some smooth manifold (curved space), e.g. the sphere. One description of manifolds is that they look locally like an Rn. This means that the manifold can be covered by a collection of subsets for each of which there is a homeomorphism (coordinate chart) onto an open set in Rn.

Transporting iterative algorithms from Euclidean space to manifolds – p. 5/18

slide-14
SLIDE 14

parametrisations

Sometimes a to be maximised function is not naturally defined on an Rn but rather on some smooth manifold (curved space), e.g. the sphere. One description of manifolds is that they look locally like an Rn. This means that the manifold can be covered by a collection of subsets for each of which there is a homeomorphism (coordinate chart) onto an open set in Rn. The whole atlas has to fit nicely together, i.e. via diffeomorphisms in overlapping regions.

Transporting iterative algorithms from Euclidean space to manifolds – p. 5/18

slide-15
SLIDE 15

parametrisations

This implies that for each point p of the manifold

M there exists a local parametrisation, i.e. a

smooth injective map

µp : Rn − → M, µp(0) = p

Transporting iterative algorithms from Euclidean space to manifolds – p. 6/18

slide-16
SLIDE 16

parametrisations

This implies that for each point p of the manifold

M there exists a local parametrisation, i.e. a

smooth injective map

µp : Rn − → M, µp(0) = p

We consider the special case where µp varies locally smoothly with the base point, which might

  • nly be possible in a small neighborhood of a

given point p∗ (hedgehog theorem).

Transporting iterative algorithms from Euclidean space to manifolds – p. 6/18

slide-17
SLIDE 17

parametrisations

Take e.g. the sphere and the operation of the special orthogonal group on it

φ :SO(n + 1) × Sn − → Sn, (Q, p) → Qp

and consider the exponential map

exp :so(n + 1) − → SO(n + 1), Ω → exp Ω .

Transporting iterative algorithms from Euclidean space to manifolds – p. 7/18

slide-18
SLIDE 18

parametrisations

It can be shown that

φ(exp(.), p∗) : so(n + 1) − → Sn

is locally injective around 0 when restricted to the subspace

    

Z −Z⊤

  | Z ∈ Rk×(n−k)    .

This defines a local parametrisation µp∗ which can be “moved around” Sn by applying φ.

Transporting iterative algorithms from Euclidean space to manifolds – p. 8/18

slide-19
SLIDE 19

the new algorithm

Let µp and νp be two families of local parametrisations and consider the iteration

pk+1 = νpk(N(f ◦ µpk)(0)), p0 ∈ M

which is defined for every twice differentiable function f : M −

→ R.

Transporting iterative algorithms from Euclidean space to manifolds – p. 9/18

slide-20
SLIDE 20

the new algorithm

Let µp and νp be two families of local parametrisations and consider the iteration

pk+1 = νpk(N(f ◦ µpk)(0)), p0 ∈ M

which is defined for every twice differentiable function f : M −

→ R.

Note that for M = Rn and νp = µp the obvious parametrisation x → p + x this is the standard Newton method.

Transporting iterative algorithms from Euclidean space to manifolds – p. 9/18

slide-21
SLIDE 21

convergence properties

Theorem: If µp and νp are smooth around a non-degenerate critical point p∗ of f and if moreover µ′

p∗(0) = ν′ p∗(0) then our algorithm

converges locally quadratic to p∗.

Transporting iterative algorithms from Euclidean space to manifolds – p. 10/18

slide-22
SLIDE 22

convergence properties

Theorem: If µp and νp are smooth around a non-degenerate critical point p∗ of f and if moreover µ′

p∗(0) = ν′ p∗(0) then our algorithm

converges locally quadratic to p∗. In general, nothing is said (and known) about global convergence.

Transporting iterative algorithms from Euclidean space to manifolds – p. 10/18

slide-23
SLIDE 23

an example

Consider a real symmetric n × n matrix N with eigenvalues λ1 ≥ · · · ≥ λk > λk+1 ≥ · · · ≥ λn. Its

k-dimensional principal eigenspace is the

subspace spanned by the eigenvectors to

λ1, . . . , λk.

Transporting iterative algorithms from Euclidean space to manifolds – p. 11/18

slide-24
SLIDE 24

an example

Consider a real symmetric n × n matrix N with eigenvalues λ1 ≥ · · · ≥ λk > λk+1 ≥ · · · ≥ λn. Its

k-dimensional principal eigenspace is the

subspace spanned by the eigenvectors to

λ1, . . . , λk.

Consider the function (generalised Rayleigh quotient)

f : Grass(k, n) − → R, [X] → tr X⊤NX

Transporting iterative algorithms from Euclidean space to manifolds – p. 11/18

slide-25
SLIDE 25

an example

µp is given by p =

 Q  I    

µp(Z) =

 Q exp  

Z −Z⊤

   I    

where Q ∈ O(n) and Z is k × (n − k).

Transporting iterative algorithms from Euclidean space to manifolds – p. 12/18

slide-26
SLIDE 26

an example

Then

grad(f ◦ µp)(0) =

 Q⊤NQ,  I 0

0 0

    =   0

−N12 N ⊤

12  

Transporting iterative algorithms from Euclidean space to manifolds – p. 13/18

slide-27
SLIDE 27

an example

Then

grad(f ◦ µp)(0) =

 Q⊤NQ,  I 0

0 0

    =   0

−N12 N ⊤

12  

and

Hess(f ◦ µp)(0)Z =

 

ZN22 − N11Z Z⊤N11 − N12Z⊤

 

Transporting iterative algorithms from Euclidean space to manifolds – p. 13/18

slide-28
SLIDE 28

an example

So computing N(f ◦ µp)(0) amounts to solving the Sylvester equation

N11Z − ZN22 = −N12

Transporting iterative algorithms from Euclidean space to manifolds – p. 14/18

slide-29
SLIDE 29

an example

So computing N(f ◦ µp)(0) amounts to solving the Sylvester equation

N11Z − ZN22 = −N12

This Z could than be plugged into

νp(Z) =

 Q exp  

Z −Z⊤

   I    

to get a new Q.

Transporting iterative algorithms from Euclidean space to manifolds – p. 14/18

slide-30
SLIDE 30

an example

It’s much better though to use an orthogonal projection onto O(n) instead by computing a

QR-decomposition of

  I

−Z⊤

  = QZR

and to use QQZ as the new Q.

Transporting iterative algorithms from Euclidean space to manifolds – p. 15/18

slide-31
SLIDE 31

general iterates

Replacing the Newton iteration N(f) : Rn −

→ Rn

by any other iteration G(f) : Rn −

→ Rn that is

locally order q converging to non-degenerate critical points of f, we can derive sufficient conditions on a family µp of local parametrisations that guarantee local order q convergence of the “transported algorithm”

pk+1 = µpk(G(f ◦ µpk)(0)), p0 ∈ M

Transporting iterative algorithms from Euclidean space to manifolds – p. 16/18

slide-32
SLIDE 32

general iterates

Let G(f) : Rn −

→ Rn be defined by G(f)(x) := g(x, f(x), grad f(x), Hess f(x))

where

g : Rn × R × Rn × Rn×n − → Rn

is sufficiently smooth.

Transporting iterative algorithms from Euclidean space to manifolds – p. 17/18

slide-33
SLIDE 33

general iterates

Let G(f) : Rn −

→ Rn be defined by G(f)(x) := g(x, f(x), grad f(x), Hess f(x))

where

g : Rn × R × Rn × Rn×n − → Rn

is sufficiently smooth. For N(f) it would be g(x, α, y, Z) = x − Z−1y.

Transporting iterative algorithms from Euclidean space to manifolds – p. 17/18

slide-34
SLIDE 34

general iterates

Theorem: If G(.) is order q locally convergent to non-degenerate critical points and µp is a locally smooth family of local parametrisations which are local diffeomorphisms then the transported algorithm is locally order q convergent to non-degenerate critical points.

Transporting iterative algorithms from Euclidean space to manifolds – p. 18/18

slide-35
SLIDE 35

general iterates

Theorem: If G(.) is order q locally convergent to non-degenerate critical points and µp is a locally smooth family of local parametrisations which are local diffeomorphisms then the transported algorithm is locally order q convergent to non-degenerate critical points. This result can be further generalised (see forthcoming paper).

Transporting iterative algorithms from Euclidean space to manifolds – p. 18/18

slide-36
SLIDE 36

general iterates

Theorem: If G(.) is order q locally convergent to non-degenerate critical points and µp is a locally smooth family of local parametrisations which are local diffeomorphisms then the transported algorithm is locally order q convergent to non-degenerate critical points. This result can be further generalised (see forthcoming paper).

Thank you.

Transporting iterative algorithms from Euclidean space to manifolds – p. 18/18