Subriemannian minimizers H. J. Sussmann Department of Mathematics - - PowerPoint PPT Presentation

subriemannian minimizers
SMART_READER_LITE
LIVE PREVIEW

Subriemannian minimizers H. J. Sussmann Department of Mathematics - - PowerPoint PPT Presentation

Subriemannian minimizers H. J. Sussmann Department of Mathematics Rutgers University Piscataway, NJ 08854, USA sussmann@math.rutgers.edu Conference on Nonlinear Partial Differential Equations and Applications In honor of Jean-Michel


slide-1
SLIDE 1

Subriemannian minimizers

  • H. J. Sussmann

Department of Mathematics — Rutgers University Piscataway, NJ 08854, USA sussmann@math.rutgers.edu

Conference on Nonlinear Partial Differential Equations and Applications In honor of Jean-Michel Coron’s 60th birthday Paris, June 20-24, 2016

slide-2
SLIDE 2

HAPPY BIRT HDAY J EAN − MICHEL !!

2

slide-3
SLIDE 3

VECTOR DISTRIBUTIONS AND SECTIONS

A vector distribution on a smooth manifold M is a vector subbundle E of the tangent bundle TM of M. (That is, E assigns to each x ∈ M a linear subspace E(x) of the tangent space TxM, in such a way that the dimension of E(x) is the same for all x ∈ M. The fiber dimension of E is the dimension of the spaces E(x). A section of a vector distribution E on M over an open subset Ω of M is a vector field V on Ω such that V (x) ∈ E(x) for each x ∈ Ω. If κ = ∞ or κ = ω, and M is of class Cκ, we use Γκ(E, Ω) to denote the set of all sections of E over Ω that are of class Cκ. ———————

In this lecture,

  • a. “manifold” means “finite-dimensional paracompact manifold without boundary”,

b.“smooth” means “of class C∞”,

  • c. If M is a smooth manifold, then TM, T ∗M are, respectively, the tangent and cotangent

bundles of M. If x ∈ M, then TxM, T ∗

xM are, respectively, the tangent and cotangent spaces

  • f M at x.

3

slide-4
SLIDE 4

SMOOTH AND REAL ANALYTIC DISTRIBUTIONS

The vector distribution E is smooth if E is a smooth submanifold

  • f TM.

(Equivalently, E is smooth iff for every x ∈ M and every v ∈ E(x) there exists a smooth section V on some neighborhood Ω

  • f x such that V (x) = v.)

If M is real analytic, then the vector distribution E is real analytic if E is a real analytic submanifold of TM. (Equivalently, E is real analytic iff for every x ∈ M and every v ∈ E(x) there exists a real analytic section V on some neighborhood Ω of x such that V (x) = v.) A TRIVIAL WELL-KNOWN FACT: If E is a smooth (or real ana- lytic) distribution on M with fiber dimension d, then for every x ∈ M there exists a local basis of smooth (or real analytic) sections of E near x, that is, a d-tuple (X1, . . . , Xd) of smooth (or real analytic) sections of E defined on an open neighborhood Ω of x such that (X1(y), . . . , Xd(y)) is a basis of E(y) for every y ∈ Ω.

4

slide-5
SLIDE 5

GLOBAL SECTIONS

A global section of E is a section of E over M. A WELL-KNOWN FACT: If E is a smooth distribution on M and dim M = n, then E has 2n+1 smooth global sections X1, . . . , X2n+1 such that the vectors X1(x), . . . , X2n+1(x) linearly span E(x) for each x ∈ M. (Proof: Use Whitney’s mbedding theorem.) ANOTHER WELL-KNOWN FACT: If E is a real analytic distribu- tion on the real analytic manifold M and dim M = n, then E has 4n + 2 real analytic global sections X1, . . . , X4n+2 such that the vec- tors X1(x), . . . , X4n+2(x) linearly span E(x) for each x ∈ M. (Proof:

Use the Morrey-Grauert embedding theorem.)

5

slide-6
SLIDE 6

RIEMANNIAN METRICS

A smooth Riemannian metric on a smooth distribution E on a smooth manifold M is a map M ∋ x → Gx such that (i) each Gx is a strictly positive definite symmetric bilinear form on E(x), (ii) whenever V, W are smooth sections on E over an open subset Ω of M, it follows that the function Ω ∋ x → Gx(V (x), W(x)) is smooth. If M and E are real analytic then the metric G is real analytic if (ii’) whenever V, W are real analytic sections on E over an open subset Ω of M, it follows that the function Ω ∋ x → Gx(V (x), W(x)) is real analytic.

6

slide-7
SLIDE 7

A WELL-KNOWN FACT: If E is a smooth (or real analytic) dis- tribution on the smooth (or real analytic) manifold M and G is a smooth (or real analytic) Riemannian metric on E, then G is the restriction to E of a smooth (or real analytic) Riemannian metric on M (that is, a Riemannian metric on the full tangent bundle TM). A TRIVIAL WELL-KNOWN FACT: If E is a smooth (or real ana- lytic) distribution on M with fiber dimension d, and G is a smooth (or real analytic) Riemannian metric on E, then for every x ∈ M there exists an orthonormal local basis of smooth (or real analytic) sections of E near x, that is, a d-tuple (X1, . . . , Xd) of smooth (or real analytic) sections of E defined on an open neighborhood Ω of x such that, for every y ∈ Ω, (X1(y), . . . , Xd(y)) is a basis of E(y) for which Gy(Xi(y), Xj(y)) = δij .

7

slide-8
SLIDE 8

BRACKET-GENERATING DISTRIBUTIONS

If L is any linear space of smooth vector field on a manifold M, and x ∈ M, we define L(x) def = {V (x) : V ∈ L} . We say that L has full rank at x if L(x) = TxM. If S is any set of smooth vector fields on M, then L[S] will denote the Lie algebra of vector fields generated by S, that is, the smallest Lie algebra (over R) of vector fields that contains S. A smooth distribution E of fiber dimension d on a manifold M is bracket-generating if one of the following equivalent conditions hold: (1) L[Γ∞(E, M)] has full rank at every x ∈ M . (2) For every x ∈ M, if (X1, . . . , Xd) is any basis of sections of E defined on a neighborhood Ω of x, then L[X1, . . . , Xd] has full rank at x.

8

slide-9
SLIDE 9

SUBRIEMANNIAN MANIFOLDS

A smooth subriemannian manifold is a triple (M, E, G) such that (1) M is a smooth manifold, (2) E is a smooth vector distribution on M, (3) G is a smooth Riemannian metric on E. Naturally, we call (M, E, G) real analytic if M, E and G are real analytic.

9

slide-10
SLIDE 10

ADMISSIBLE ARCS

If (M, E, G) is a subriemannian manifold, an admissible arc is an absolutely continuous arc ξ : [a, b] → M, defined on some compact interval [a, b], such that ˙ ξ(t) ∈ E(ξ(t)) for almost every t ∈ [a, b] . If we define the G-length vG of a tangent vector v ∈ TxM by vG =

  • Gx(v, v)

if v ∈ E(x) , vG = +∞ if v / ∈ E(x) , and let the G-length of an arbitrary absolutely continuous arc ξ : [a, b] → M be the (finite or infinite) number ξG

def

=

b

a ˙

ξ(t)Gdt , then it is easy to see that an absolutely continuous arc ξ : [a, b] → M is admissible if and

  • nly if

ξG < ∞ .

10

slide-11
SLIDE 11

THE SUBRIEMANNIAN DISTANCE

If M = (M, E, G) is a subriemannian manifold, and ξ : [a, b] → M is an absolutely continuous arc in M, we use ∂ξ to denote the ordered pair (ξ(a), ξ(b)), and refer to ∂ξ as the endpoint value, or boundary value, of ξ. We then define the distance dM(x, y) between two points x, y of M to be the infimum of ξG, taken over all absolutely continuous arcs ξ in M such that ∂ξ = (x, y). A WELL-KNOWN FACT: If M is connected, then (M, dM) is a metric space, whose topology is the same as the manifold topology

  • f M. (So, in particular, dM is continuous on M × M.) (Reason: The

bracket-generating condition and Chow’s Theorem imply that dM(x, y) < ∞ for all x, y.)

11

slide-12
SLIDE 12

MINIMIZERS AND PAL MINIMIZERS

An absolutely continuous arc ξ : [a, b] → M is a length minimizer if ξG = dM(∂ξ). An admissible arc ξ : [a, b] → M is parametrized by arc-length, or a PAL arc, if ˙ ξ(t)G = 1 for almost all t ∈ [a, b]. TRIVIAL FACT: Every admissible arc can be reparametrized to be- come a PAL arc. (Reason: Just use arc-length as the new time parameter.)

12

slide-13
SLIDE 13

REMARK: In the real analytic case, for the purpose of the results considered here, we may always assume, without loss of generality, that the bracket-generating condition holds even if it originally does not hold. The reason is: through every point of M there passes a unique maxi- mal integral submanifold of the Lie algebra of vector fields generated by the real analytic global sections of E. And, in addition: all the results of this talk are about regularity of distance-minimizing arcs, and every such arc is entirely contained in an integral manifold, so in order to study such arcs we can restrict

  • urselves to an integral manifold.

13

slide-14
SLIDE 14

AN IMPORTANT OPEN QUESTION Are all PAL minimizers smooth? This is well known to be true in the Riemannian case (i.e. when E = TM), because in that case the PAL minimizers satisfy (in coordinates) the geodesic equation ¨ ξi(t) +

  • j,k

Γi

jk(ξ(t)) ˙

ξj(t) ˙ ξk(t) = 0 , where the Γi

jk are the Christoffel symbols, which

are smooth functions on M.

14

slide-15
SLIDE 15

THE SMOOTHNEESS PROBLEM

To study the smoothness problem, it suffices to work with sufficiently short arcs, so we are allowed to assume

(*) M is an open subset of Rn, and E has an

  • rthonormal basis (f1, f2, . . . , fd) of smooth

(or real analytic) sections.

It follows that the admissible arcs are exactly the arcs ξ that are trajectories of the control system ˙ x =

d

  • i=1

uifi(x) . (1)

15

slide-16
SLIDE 16

Precisely, an arc ξ : [a, b] → M is admissible if and only if it is absolutely continuous and satisfies ˙ ξ(t) =

d

  • i=1

ηi(t)fi(ξ(t)) for a.e. t , (2) for some integrable function η = (η1, . . . , ηd) : [a, b] → Rd. Furthermore, ξ is PAL if and only if d

i=1 ηi(t)2 = 1 for a.e. t.

And, for a PAL arc ξ : [a, b] → M, the length of ξ is exactly b − a. So the PAL minimizers are the minimum time arcs for the control system (1) with contol constraint d

i=1 ui(t)2 = 1.

For convenience, we use instead the control constraint

d

  • i=1

ui(t)2 ≤ 1 . (3) (This does not change the minimizers.)

slide-17
SLIDE 17

So, from now on we study the minimum-time trajec-

tories for the system ˙ x =

d

  • i=1

uifi(x) , (4) with control constraint

d

i=1 u2 i ≤ 1 .

Here, (i) the state x belongs to M, an open subset of Rn, (ii) f1, . . . , fd are smooth (or real-analytic) vector fields

  • n M,

(iii) if L is the Lie algebra of vector fields on M gener- ated by the fi, then L(f1, . . . , fd)(x) = TxM for every point x ∈ M.

slide-18
SLIDE 18

To prove regularity theorems for optimal trajectories, we use necessary conditions for optimality, of which the most famous one is the Pontryagin Maximum Principle (PMP). According to the PMP, if ξ : [a, b] → M is a time-optimal

trajectory for the system ˙ x =

d

  • i=1

uifi(x) , x ∈ M ,

d

  • i=1

u2

i ≤ 1 ,

(5) and η = (η1, . . . , ηd) is the corresponding control, then

ξ is the projection of a Hamiltonian-maximizing trajec- tory Ξ : [a, b] → T #M of the Hamiltonian lift of (5), i.e. the system ˙ X =

d

  • i=1

ui Fi(X) , X ∈ T #M ,

d

  • i=1

u2

i ≤ 1 .

(6)

16

slide-19
SLIDE 19

Here (i) T #M is the cotangent bundle of M with the zero section removed, that is, the set of all pairs (x, p) such that x ∈ M, p ∈ T ∗

xM, and p = 0.

(ii) For j = 1, . . . , d, Fj is the Hamiltonian lift of fj, that is, the Hamilton vector field on T #M corresponding to the function T #M ∋ (x, p) → p, fj(x)def = Fj(x, p) .

17

slide-20
SLIDE 20

Recall that, if v is any vector field on M, then the “mo- mentum function” µv is the function on T ∗M given by µv(x, p) = p, v(x) for x ∈ M , p ∈ T ∗

xM .

Also, ϕ is a smooth function on an open subset U of T ∗M, then the Hamilton vector field arising from ϕ is the vector field ϕ on U given in local coordinates by

  • ϕ =

n

  • i=1

∂ϕ ∂pi · ∂ ∂xi −

n

  • i=1

∂ϕ ∂xi · ∂ ∂pi , so that the integral curves t → (x(t), p(t)) of ϕ on U satisfy the “Hamilton equations” ˙ xi = ∂ϕ ∂pi , ˙ pi = − ∂ϕ ∂xi .

18

slide-21
SLIDE 21

So the Fj are the Hamilton vector fields arising from the momentum functions Fj = µfj. The Hamiltonian of our control system ˙ x =

d

i=1 uifi(x)

is the momentum function of the u-dependent vector field M ∋ x →

d

i=1 uifi(x) ∈ TxM.

So the Hamiltonian H is a u-dependent function on T #M, given by Hu(x, p) =

d

  • j=1

ujp, fj(x) , for u = (u1, . . . , ud), x ∈ M, p ∈ T ∗

xM, p = 0.

19

slide-22
SLIDE 22

A trajectory of the lifted system for a control η = (η1, . . . , ηd), consists of a pair (ξ, π) such that (i) ξ is a trajectory of the original system for the control η, (ii) π is a field of nonzero covectors along ξ (that is, π(t) is a nonzero convector at ξ(t) for each t), (iii) π satisfies, in coordinates, the “adjoint equatoon” ˙ π(t) = −

d

  • j=1

ηj(t)π(t) · ∂fj ∂x . (In that case, π is called an ”adjoint vector” for (ξ, η).)

20

slide-23
SLIDE 23

HAMILTONIAN MAXIMIZATION A trajectory Ξ of the lifted system is Hamiltonian-maximizing if there exists a real constant π0 such that π0 = Hη(t)(Ξ(t)) = max{Hu(Ξ(t)) : u ∈ Bd} for almost every t. (Here Bd = {u = (u1, . . . , ud) ∈ Rd :

d

j=1 u2 j ≤ 1}.)

21

slide-24
SLIDE 24

Therefore, if (ξ, η) is an optimal trajectory-control pair, there exist an adjoint vector π and a constant π0 such that π0 = Hη(t)(Ξ(t)) = max{Hu(Ξ(t)) : u ∈ Bd} for a.e. t . Since u → Hu(X) is a linear function d

j=1 ujaj(X) for each X, the

maximum of this function on the unit ball Bd is attained for uj = aj(X)

d

k=1 ak(X)2

if

d

  • k=1

ak(X)2 = 0 , and u an arbitrary member of Bd if d

k=1 ak(X)2 = 0.

Furthermore, the maximum value of Hu(X) is

d

k=1 ak(X)2.

Since aj(X) = p, fj(x) if X = (x, p), we obtain

22

slide-25
SLIDE 25

If (ξ, η) is an optimal trajectory-control pair, then there exists a solution π of the adjoint equation along (ξ, η) such that (i) the number π0 =

d

j=1π(t), fj(ξ(t))2 is constant

(i.e. does not depend on t), (ii) If π0 > 0, then the control η is given by ηj(t) = π(t), fj(ξ(t))

d

k=1π(t), fj(ξ(t))2 = 1

π0 π(t), fj(ξ(t)) .

23

slide-26
SLIDE 26

A Hamiltonian-maximizing adjoint vector π along a trajectory-control pair (ξ, η) is normal if π0 > 0, and abnormal if π0 = 0. A minimizer (ξ, η) is normal if it has a normal Hamiltonian-maximizing adjoint vector, and abnormal if it has an abnormal Hamiltonian- maximizing adjoint vector.

(Notice that a minimizer can be both normal and abnormal.)

The minimizer (ξ, η) is strictly abnormal if it is abnormal and not normal (that is, if all the Hamiltonian-maximizing adjoint vectors along (ξ, η) are abnormal). Then the following follows from the PMP:

Every normal minimizer is smooth.

24

slide-27
SLIDE 27

This is because if (ξ, η) is a normal minimizer, and π is a normal Hamiltonian-maximizing adjoint vector along (ξ, η), then the pair (ξ, π) is (in coordinates) a solution of the system of ordinary differ- ential equations ˙ ξ = 1 π0

d

  • j=1

π, fj(ξ)fj(ξ) , ˙ π = − 1 π0

d

  • j=1

π, fj(ξ)∂fj ∂x (ξ) , from it follows easily that ξ and π are smooth functions. And then the control η is also smooth, because ηj(t) = 1 π0 π(t), fj(ξ(t)) .

25

slide-28
SLIDE 28

REMARK: The previous result contains, in particular, the theorem

  • n the smoothness of Riemannian geodesics.

That’s because in the Riemannian case every minimizer is normal. Indeed, if d = n = dim M, then the sum d

j=1π(t), fj(ξ(t))2 cannot

vanish, because if it did then all the numbers π(t), fj(ξ(t)) would vanish, so π(t) would be the zero covector, because the vectors fj(ξ(t)) span the tangent space Tξ(t)M. But π(t) = 0, because (π(t), ξ(t)) ∈ T #M.

26

slide-29
SLIDE 29

Clearly, now, the problem of the smoothness of optimal trajectories reduces to that of the smoothness of strictly abnormal extremals. For this problem, there is another useful necessary condition called the Goh condition:

If (ξ, η) is a strictly abnormal minimizer, then there exists a Hamiltonian maximizing adjoint vector π along (ξ, η) such that π(t), [fj, fk](ξ(t)) = 0 for all t and all j, k.

27

slide-30
SLIDE 30

Call a smooth distribution E 2-generating if, for every x ∈ M, the vectors f(x), for f a smooth section of E near x, together with the vectors [f, g](x), for f, g smooth sections of E near x, span the full tangent space TxM. It the follows from the Goh condition that If E is 2-generating, then every PAL minimizer is smooth (and real analytic in the real analytic case).

28

slide-31
SLIDE 31

Is it possible that the Goh condition can be generalized to the fol- lowing statement? If (ξ, η) is a strictly abnormal minimizer, then there exists a Hamiltonian-maximizing adjoint vector π along (ξ, η) such that π(t), [[fj1, [fj2, . . . , [fjk−1, fjk] . . .]](ξ(t)) = 0 for all t, all k, and all (j1, . . . , jk). The answer is NO, because if such a condition was true, it would follow from the bracket-generating condition that strictly abnormal minimizers do not exist. But:

Strictly abnormal miminizers exist. (Examples due to R. Montgomery, Sussmann-Liu, Kupka.)

29

slide-32
SLIDE 32

A RECENT THEOREM If M = (M, E, G) is a real analytic sub- riemannian manifold, then every PAL minimizer is real analytic on an open dense subset of its interval of defini- tion.

The proof is by induction on the fiber dimension d. The case d = 1 is trivial. So we assume that the theorem is true for fiber dimensions ≤ d − 1, and prove it’s true for fiber dimension d.

30

slide-33
SLIDE 33

Assume that M = (M, E, G) is a real analytic subrieman- nian manifold with fiber dimension d. Also, as explained before, we assume that M is an open subset of a Euclidean space Rn, and E has an orthonor- mal basis (f1, . . . , fd) of real analytic sections. ——————– The key to the proof will the construction of a stratifi- cation of T #M with certain special properties. So we have to start by explaining what a “stratification” is.

31

slide-34
SLIDE 34

A stratification of a smooth manifold Q is a locally finite partition P of Q into smooth, connected, relatively com- pact embedded submanifolds (called the “strata” of P) such that the following conditions hold: (1) the closure Clos S of each stratum is a union of strata, (2) if S is a stratum, then the “frontier strata”

  • f S—that is, the strata that are contained in

(Clos S)\S —are of dimension strictly smaller than dim S.

32

slide-35
SLIDE 35

A stratification P of Q is compatible with a family A of subsets of Q if every A ∈ A is a union of strata of P. Now, in the special case when Q = T #M, we consider stratifications whose strata are “nice”, in the sense described below.

33

slide-36
SLIDE 36

For a submanifold S of T #M, we let US(X) be, for X ∈ S, the set of all u = (u1, . . . , ud) ∈ Rd such that the vector u. F(X) (defined by u. F(X) =

d

i=1 ui

Fi(X)) is tangent to S. It is clear that each US(X) is a linear subspace of

Rd.

We call a submanifold S of T #M “nice” if S is a real an- alytic submanifold, the subspace US(X) is of constant dimension (as X varies over S), and US(X) depends real analytically on X, in the sense that there exist real analytic maps ej : S → Rd, for j = 1, . . . , δS (where δS = dim US(X) for X ∈ S) such that (e1(X), . . . , eδS(X)) is a basis of US(X) for each X ∈ S.

34

slide-37
SLIDE 37

THE KEY TECHNICAL LEMMA

LEMMA: If M = (M, E, G) is a real analytic subrieman- nian manifold of fiber dimension d, then there exists a stratification P of T #M such that (1) P is compatible with {A}, where A is the “abnormal set”, given by A = {(x, p) ∈ T #M :

d

  • j=1

p, fj(x)2 = 0 } . (2) Every stratum of P is nice.

35

slide-38
SLIDE 38

HOW THE LEMMA IMPLIES OUR THEOREM

Let P be a stratification of T #M having the properties of the lemma. Suppose (ξ, η) is a PAL trajectory-control pair which is a minimizer, and is defined on an interval [a, b]. It suffices to prove that there exists τ such that a < τ < b and ξ is real analytic on a neighborhood ]τ − ε, τ + ε[ of τ. We may assume that (ξ, η) is strictly abnormal. Let π be a Hamiltonian-maximizing adjoint vector, and let Ξ = (ξ, π) be the corresponding Hamiltonian lift. Then Ξ is entirely contained in the abnormal set A. So each point Ξ(t) of Ξ belongs to a unique stratum S(t) of P, and S(t) ⊆ A. Let δ(t) = dim S(t).

36

slide-39
SLIDE 39

Pick ¯ t such that δ(¯ t) = max{δ(t) : a < t < b}. Let S = S(¯ t). We claim that Ξ(t) ∈ S for t in some neighborhood of ¯ t. To see this suppose otherwise. Then there exists a sequence {tk}∞

k=1,

converging to ¯ t, such that S(tk) = S for every k. Since P is locally finite, the set of all the S(tk) is finite, so we may assume, after passing to a subsequence, that all the S(tk) are one and the same stratum ˇ S. Naturally, ˇ S = S. Since the point Ξ(¯ t) is in S, so Ξ(¯ t) / ∈ ˜ S, but Ξ(¯ t) is a limit of points of ˇ S, it follows that Ξ(¯ t) ∈ (Clos ˜ S)\˜ S. So S ⊆ (Clos ˇ S)\ˇ S, and this implies that dim S < dim ˇ

  • S. (Notice the crucial role played by

the condition on the dimension of the frontier strata!)

Hence δ(¯ t) < δ(tk) contradicting our choice of ¯ t as a time when δ has its maximum value.

37

slide-40
SLIDE 40

We now know that, for some positive ε,

Ξ(t) ∈ S for t ∈ I, where I = [¯ t−ε,¯ t+ε].

Since S is a nice stratum, the spaces US(X), for all X ∈ S, have the same dimension ¯ d, and there exist real analytic maps ej : S → Rd, for j = 1, . . . , ¯ d, such that e(X)def = (e1(X), . . . , e¯

d(X))

is a basis of US(X) for each X. Furthemore, we can apply the Gram-Schmidt orthogonalization procedure and assune that e(X) is an orthonormal basis of US(X).

38

slide-41
SLIDE 41

Let ΞI be the restriction of Ξ to I. Then ΞI is a trajectory of the system ˙ X =

d

  • j=1

uj Fj(X) , but the vectors ˙ ΞI(t), for t ∈ I, are tangent to S. Therefore the control η is such that η(t) ∈ US(ΞI(t)) for each t ln I, and then η(t) is a linear combination η(t) =

¯ d

  • k=1

ck(t)ek(t) , with coefficients ck(t) such that ¯

d k=1 ck(t)2 = 1.

39

slide-42
SLIDE 42

So ΞI is a PAL trajectory of the system ˙ X =

¯ d

  • k=1

vkGk(X) , (7) with state space S and control constraint ¯

d k=1 v2 k ≤ 1.

(Here Gk = ek · F.) Furthermore, ΞI is a time-optimal trajectory of (7). (This is easy.) We now prove the

KEY OBSERVATION: ¯ d < d

40

slide-43
SLIDE 43

To prove this, assume ¯ d = d. (¯ d cannot be larger than d, because ¯ d = dim US(X), and US(X) is a subspace of Rd.) This would mean that US(X) = Rd for every X ∈ S. Then every vector field Fj would be tangent to S. Then every iterated Lie bracket [ Fi1, [ Fi2, . . . , [ Fiℓ−1, Fiℓ] . . .]] would be tangent to S as well. Since the momentum functions Fi vanish on S (because S ⊆ A), it follows that the directional deriva- tives [ Fi1, [ Fi2, . . . , [ Fiℓ−1, Fiℓ] . . .]]Fj vanish on S. That is, all the iterated Poisson brackets Ψ = {{Fi1, {Fi2, . . . , {Fiℓ−1, Fiℓ} . . .}}, Fj} vanish.

41

slide-44
SLIDE 44

But Ψ(x, p) = p, [[fi1, [fi2, . . . , [fiℓ−1, fiℓ] . . .]], fj](x) . Hence p vanishes against all the iterated brackets of the fk. By the bracket-generating assumption, it follows that p = 0, contradicting the fact that S ⊆ T #M. So ΞI is a PAL solution of a minimum-time optimal control problem exactly like our original problem, except that the fiber dimension of the new problem is smaller. Hence, by the inductive hypothesis, there is a point τ of I such that ΞI is real analytic near ¯ τ. And then of course, ξ is real analytic near ¯ τ.

Q.E.D.

42

slide-45
SLIDE 45

This was the easy part of the proof. Now we need the technical part:

How do we prove the existence of a nice strat- ification compatible with {A} ?

This is done using the machinery of subanalytic sets. The construction is long and tedious, but conceptually quite simple.

43

slide-46
SLIDE 46

SUBANALYTIC SETS

A semianalytic subset of a real analytic manifold Q is a subset S of Q such that every point x of Q has an open neighborhood U on which there exist a finite collection {f1, . . . , fm} of real analytic functions such that S ∩ U belongs to the Boolean algebra generated by the sets {y ∈ U : fj(y) > 0} and {y ∈ U : fj(y) = 0}. A subanalytic subset of Q is a subset S of QQ such that there exist a real analytic manifold R, a semianalytic sub- set T of R, and a real analytic map f : R → Q, such that f is proper on ClosR T and f(T) = S.

44

slide-47
SLIDE 47

THE BASIC STRATIFICATION THEOREM

Given a locally finite collection A of subanalytic subsets

  • f a real analytic manifold Q, there exists a stratification

P of Q by real analytic submanifolds that are subanalytic subsets of Q, such that P is compatible with A. Using this fact, we can start the construction of the strat- ification we want by observing that the abnormal set A is

  • bviously subanalytic (because it is in fact a real analytic

set) and then constructing a stratification P0 compati- ble with {A} and consisting of real analytic submanifolds that are subanalytic subsets of T #M.

45

slide-48
SLIDE 48

Next we use various techniques to construct refinements

  • f P0 until we get one that has the desired properties.

The key fact to be proved first is:

If S is a connected embedded relatively com- pact real analytic submanifold of T #M which is a subanalytic subset of T #M, then the graph G(US) of US is a subanalytic subset of T #M×Rd.

Here G(US)def = {(x, u) : x ∈ S and u · F(x) ∈ TxS} , so G(US) ⊆ T #M × Rd.

46

slide-49
SLIDE 49

Then you take each stratum S of P0, and prove that for each dimension δ between 0 and d, the set Σδ(S) = {x ∈ S : dim US(x) = δ} is subanalytic. Clearly, the Σδ(S), for δ = 0, 1, . . . , d, form a finite parti- tion of S into subanalytic sets. Also, the Σδ(S), for all S ∈ P0 and all δ ∈ {0, 1, . . . , d}, form a locally finite family of subanalytic subsets of T #M.

47

slide-50
SLIDE 50

So we can construct a stratification P1 by real analytic submanifolds that are subanalytic subsets, such that P1 is compatible with the Σδ(S). So now we have a stratification such that, if T ∈ P1, T ⊆ S, and S ∈ P0, then the function T ∋ x → dim US(x) is constant. Then, with a lot of extra work (including the inductiove construction of several refinements of P1), we end up with a stratification ˜ P such that the function S ∋ x → dim US(x) is constant for each S ∈ ˜ P.

48

slide-51
SLIDE 51

Now we know that for each stratum S of ˜ P, the dimension dS of US(x) is the same for all x ∈ S. Let ES(x) be the set of all orthonormal bases (e1, . . . , eds)

  • f US(x).

Then we prove that the graph G(ES)def = {(x, e) : x ∈ S and e ∈ ES(x)} is a subanalytic subset of T #M × (Rd)ds. (This requires some work.)

49

slide-52
SLIDE 52

We regard ES as a set-valued function from S to (Rd)ds, and make a single-valued subanalytic selection ES, so ES is an ordinary single-valued function from S to (Rd)dS. Then you need a few more refinements to end up with a stratification in which the functions ES are not just subanalytic (i.e. with a suban- alytic graph) but actually real analytic. And this gives the desired nice stratification.

50

slide-53
SLIDE 53

Can one do something similar in the smooth (i.e., C∞) case?. Yes, but only (so far) for d = 2.

ANOTHER RECENT THEOREM If M = (M, E, G) is a smooth subrie- mannian manifold, and d = 2, then every PAL minimizer is smooth on an

  • pen dense subset of its interval of def-

inition.

REMARK: This argument does not seem to work for d > 2.

51

slide-54
SLIDE 54

PROOF: Write our system (locally) as ˙ x = u1f1(x) + u2f2(x) , x ∈ U , u2

1 + u2 2 ≤ 1 ,

where U is an open subset of Rn. Let B be the set of all iterated Lie brackets B of the form B = [fim, [fim−1, [· · · , [f2, f1] · · ·]]] where each ij is 1 or 2. (The number m is the degree of B.) It is easy to see, using the Jacobi identity, that every iterated of f1 and f2 is a linear combination of the brackets in B. (For example, [[f1, f2], [f1, [f1, f2]]] = [f1, [f2, [f1, [f1, f2]]]] − [f2, [f1, [f1, f1, f2]]] , and so on.)

52

slide-55
SLIDE 55

It then follows from the bracket-generating condition that

For every x ∈ U, the linear span of the vectors B(x), B ∈ B, is the whole space.

53

slide-56
SLIDE 56

To each vector field X on Rn, associate the momentum function µX : Rn × Rn → R , given by µX(p, x) = p† · X(x) . (Here p is a covector, and Rn × Rn is the cotangent bundle of the state space.) Then there is a simple formula for differentiating a momentum func- tion µX along Hamiltonian lifts of trajectories. Roughly: ˙ µX = η1µ[f1,X] + η2µ[f2,X] .

54

slide-57
SLIDE 57

More precisely, if ξ : [a, b] → Rn is a trajectory, corresponding to a control [a, b] ∋ t → η(t) = (η1(t), η2(t)) ∈ R2 , and Ξ : [a, b] → Rn × Rn , is a Hamiltonian lift of ξ, given by Ξ(t) = (π(t), ξ(t)) , where ˙ π(t) = −η1(t)π(t)† · ∂f1 ∂x (ξ(t)) − η2(t)π(t)† · ∂f2 ∂x (ξ(t)) we have d dt

  • µX(Ξ(t))
  • = η1(t)µ[f1,X](Ξ(t)) + η2(t)µ[f2,X](Ξ(t))

for every smooth vector field X

55

slide-58
SLIDE 58

If (ξ, η) is a PAL minimizer, then it is either a normal extremal, in which case it is smooth, or a strictly abnormal one, in which case there exists a Hamiltonian lift Ξ = (π, ξ) such that µf1(Ξ(t)) ≡ µf2(Ξ(t)) ≡ µ[f1,f2](Ξ(t)) ≡ 0 . For every t ∈ [a, b], there exists a B ∈ B such that µB(Ξ(t)) = π(t)† · B(ξ(t)) = 0 . (Reason: if π(t)† · B(ξ(t)) = 0 for every B ∈ B, then π(t)†v = 0 for every v ∈ Rn, so π(t) = 0.) Given t, let m∗(t) be the smallest m such that µB(Ξ(t)) = 0 for some B ∈ B of degree m.

56

slide-59
SLIDE 59

Pick a t0 ∈ [a, b] such that m∗(t0) ≤ m∗(t) for every t ∈ [a, b] . Let ˆ m = m∗(t0). Let B be a bracket in B of degree ˆ m such that µB(t0) = 0. Write B = [fi ˆ

m, C], where C ∈ B is of degree ˆ

m − 1. Then µC(Ξ(t)) = 0 for all t ∈ [a, b]. Assume i ˆ

m = 1. (The case when i ˆ m = 2 is identical.)

57

slide-60
SLIDE 60

Then d dt

  • µC(ξ(t))
  • ≡ 0 .

Therefore η1(t)µ[f1,C](Ξ(t)) + η2(t)µ[f2,C](Ξ(t)) ≡ 0 , that is η1(t)µB(Ξ(t)) + η2(t)µ[f2,C](Ξ(t)) ≡ 0 , Let ε be a positive number such that µB(Ξ(t)) = 0 for t ∈ (t0ε, t0 + ε) . Let I = (t0ε, t0 + ε) . Since η1(t)2 + η2(t)2 = 1, the vector η(t) is determined uniquely, up to a sign, for t ∈ I.

58

slide-61
SLIDE 61

More precisely: let Ω be the set of points (p, x) ∈ Rn × Rn such that µB(p, x) = 0. Then Ξ(t) ∈ Ω for all t ∈ I. On Ω, define “feedback controls” ϕj : Ω :→ R by letting ϕ1(p, x) = − µ[f2,C](p, x)

  • µB(p, x)2 + µ[f2,C](p, x)2 ,

ϕ2(p, x) = µB(p, x)

  • µB(p, x)2 + µ[f2,C](p, x)2 .

(These are smooth functions on Ω because µB and µ[f2,C] are smooth functions and µB is = 0 on Ω.) Then define a “feedback Hamiltonian vector field” V on Ω by letting V (p, x) = ϕ1(p, x)F1(p, x) + ϕ2(p, x)F2(p, x) , where F1, F2 are the Hamiltonian lifts of f1 and f2.

59

slide-62
SLIDE 62

Then our Hamiltonian lift Ξ satisfies ˙ Ξ(t) = ±(V (Ξ(t))) for a.e. t ∈ I . So Ξ is “almost” a trajectory of a smooth vector field. So ξ is smooth, if we can make sure that the sign in the previous equation does not change. Furthermore, the sign cannot change, because if it did our trajectory would not be optimal. (Easy proof.) Hence Ξ is smooth on I, and so is ξ. So we have proved that ξ is smooth on a nonempty open subinterval

  • f [a, b].

This result can be applied to the restriction of ξ to any nontrivial subinterval of [a, b], and our conclusion follows.

60

slide-63
SLIDE 63

One can also prove that for every optimal trajectory the control is

  • btained by “shuffling smooth feedback controls”, in the following

sense:

There exists a sequence (Sj)∞

j=0 of measurable sets

and a sequence (Vj)∞

j=1 of smooth feedback vector

fields defined on open subsets Ωj of Rn × Rn, such that S0 has measure zero, and [a, b] =

  • j=0

Sj , Ξ(t) ∈ Ωj whenever j ≥ 1 and t ∈ Sj , ˙ Ξ(t) = ± Vj(Ξ(t)) whenever j ≥ 1 and t ∈ Sj .

The proof is essentially as before:

61

slide-64
SLIDE 64
  • 1. For each B ∈ B, let AB be the set of all points t ∈ [a, b] such that

µB(Ξ(t)) = 0 but µC(Ξ(t)) = 0 for all C ∈ B of smaller degree.

  • 2. For each B ∈ B, construct a smooth feedback vector field VB as

before.

  • 3. Fix B. Let B = [fi, C], where i = 1 or i = 2, and C ∈ B.
  • 4. Let SB be the set of all points of density of AB that are also

Lebesgue points of the control η. (This is a huge subset of AB. In particular, AB\SB is a null set.)

  • 5. Assume i = 1.

62

slide-65
SLIDE 65
  • 6. For a point t ∈ SB, the numbers µC(Ξ(sj)) vanish for the points

sj in a sequence (sj)∞

j=1 of points that converge to t and are

distinct from t.

  • 7. So we can compute the derivative ˙

µC(Ξ(t)) as the limit as j → ∞

  • f the quotient

µC(Ξ(sj)) − µC(Ξ(t)) sj − t .

  • 8. This derivative vanishes, so

η1(t)µB(Ξ(t)) + η2(t)µ[f2,C](Ξ(t)) = 0 .

  • 9. Hence ˙

Ξ(t) = ± VB(Ξ(t)) .

  • 10. Clearly, the set S0 = [a, b]\

B∈B SB has measure zero. So we are

done.

63

slide-66
SLIDE 66

What is to be done next? If you want to prove that all minimizers are smooth, then you must continue the analysis that was started in work by Leonardi and Monti, and more recent work by Hakavuori and Le Donne, who proved that PAL subriemannian minimizers cannot have corners. If, on the other hand, you want to prove that it is not true that all minimizers are smooth, then you have to find an example of a nonsmooth minimizer. This will be, of course, a strictly abnormal extremal. Finding nonsmooth abnormal extremals is easy. And finding lots of strictly abnormal minimizers is also easy. (See HJS “A cornucopia of abnormal minimizers”.)

64

slide-67
SLIDE 67

The hard part is to prove that a given extremal is actually a mini- mizer. For normal extremals there are methods, especially the construction

  • f fields of extremals. But these do not work for abnormal extremals.

The methods of the “cornucopia” paper do work for lots of fam- ilies of abnormal extremals, but I do not know how to construct nonsmooth abnormal extremals to which those methods apply.

65