Introduction to Christoffel-Darboux kernels for polynomial - - PowerPoint PPT Presentation

introduction to christoffel darboux kernels for
SMART_READER_LITE
LIVE PREVIEW

Introduction to Christoffel-Darboux kernels for polynomial - - PowerPoint PPT Presentation

Introduction to Christoffel-Darboux kernels for polynomial optimization Edouard Pauwels POEMA Online Workshop July 2020 1 / 48 Exponential separation of the support : Lebesgue restricted to S R p , compact, non-empty interior. d p exp(


slide-1
SLIDE 1

Introduction to Christoffel-Darboux kernels for polynomial optimization

Edouard Pauwels

POEMA Online Workshop

July 2020

1 / 48

slide-2
SLIDE 2

Exponential separation of the support

µ: Lebesgue restricted to S ⊂ Rp, compact, non-empty interior.

dp exp(αd)

29 / 48

slide-3
SLIDE 3

Exponential separation of the support

µ: Lebesgue restricted to S ⊂ Rp, compact, non-empty interior.

dp exp(αd) dp+1 dp+2 exp(α √ d)

Thresholding scheme: C > 0, q > p {x, vd(x)TM−1

µ,dvd(x) ≤ Cdq}

“ →

d→∞ ”

cl(int(S)). Extends to positive densities on S. =

30 / 48

slide-4
SLIDE 4

Outline

  • 1. CD kernel, Christoffel function, orthogonal polynomials, moments
  • 2. CD kernel captures measure theoretic properties: univariate case
  • 3. Quantitative asymptotics
  • 4. The singular case
  • 5. Using approximate moments
  • 6. An application to polynomial optimal control

31 / 48

slide-5
SLIDE 5

The singular case

µ: Borel probability measure in Rp, compact support S, absolutely continuous. Rd[X]: p-variate polynomials of degree at most d (of dimension s(d) = p+d

d

  • ).

(P, Q) → ⟪P, Q⟫µ :=

  • PQdµ,

defines a valid scalar product on Rd[X].a positive semidefinite bilinear form on Rd[X].

32 / 48

slide-6
SLIDE 6

Specificity of the singular case

µ: Borel probability measure in Rp, asbolutely continuous, compact support: S. Rd[X]: p-variate polynomials of degree at most d (of dimension s(d) = p+d

d

  • ).

Moment based computation Let {Pi}s(d)

i=1 be any basis of Rd[X],

vd : x → (P1(x), . . . , Ps(d)(x))T. Mµ,d =

  • vdvT

d dµ ∈ Rs(d)×s(d).

Then, for all x, y ∈ Rp, K µ

d (x, y) = vd(x)TM−1 µ,dvd(y) vd(x)TM−1 µ,dvd(y)

Let P(x) = s(d)

i=1 piPi(x)P ∈ Rd[X]. We have

  • P2dµ = pTMµ,dp.

If P vanishes on S, if and only if p ∈ ker(Mµ,d). Singular moment matrix, morally, CD kernel should be +∞.

33 / 48

slide-7
SLIDE 7

Christoffel function to the rescue

µ: Borel probability measure in Rp, asbolutely continuous, compact support: S. Rd[X]: p-variate polynomials of degree at most d (of dimension s(d) = p+d

d

  • ).

Variational formulation: for all z ∈ Rp 1 K µ

d (z, z) = Λµ d (z) =

min

P∈Rd [X]

  • P2dµ :

P(z) = 1

  • .

Λµ

d (z) =

min

P∈Rd [X]

  • P2dµ :

P(z) = 1

  • .

Given z ∈ Rp, such that there exists P ∈ Rd[X] such that P(z) = 0 P vanishes on S. Then Λµ

d (z) = 0.

34 / 48

slide-8
SLIDE 8

Getting the CD kernel back (and computation from moments)

µ: Borel probability measure in Rp,compact support: S. Rd[X]: p-variate polynomials of degree at most d (of dimension s(d) = p+d

d

  • ).

V denotes the Zariski closure of S (smallest algebraic set containing S). For d large enough, V = {z ∈ Rp, Λµ

d (z) > 0}.

Polynomials on V : L2

µ,d = Rd[X] / {P ∈ Rd[X], P vanishes on V }.

RKHS: (L2

µ,d, ⟪·, ·⟫µ) is a Hilbert space of functions on V . K µ d is its reproducing kernel

(defined on V ). For any x ∈ V and P ∈ L2

µ,d, P(x) =

  • P(y)K µ

d (x, y)dµ(y).

Relation with Christoffel function: Λµ

d (z)K µ d (z, z) = 1, for z ∈ V .

Pseudo inverse computation: let vd be any basis of Rd[X], Mµ,d moment matrix: ∀x, y ∈ V K µ

d (x, y) = vd(x)M† µ,dvd(y).

Average value and Hilbert function:

  • K µ

d (x, x)dµ(x) = dim(L2 µ,d) ≤ s(d).

35 / 48

slide-9
SLIDE 9

Outline

  • 1. CD kernel, Christoffel function, orthogonal polynomials, moments
  • 2. CD kernel captures measure theoretic properties: univariate case
  • 3. Quantitative asymptotics
  • 4. The singular case
  • 5. Using approximate moments
  • 6. An application to polynomial optimal control

36 / 48

slide-10
SLIDE 10

Motivation for approximate moments

“I am a Lasserre hierarchist, I work with pseudo-moments.” “I am a statistician, I work with empirical moments.” “I am a numerician, among others, I care about sensitivity to errors.”

37 / 48

slide-11
SLIDE 11

A stability result

Choose a basis vd of Rd[X]. Approximation of Christoffel function: Let Q(x, y) = vd(x)M−1vd(y) where M ∈ Rs(d)×s(d) is positive definite, then for all x ∈ Rp, |Q(x, x)Λµ

d (x) − 1| ≤ I − M

1 2

µ,dM−1M

1 2

µ,dop

If M ≃ Mµ,d, then Λµ

d (x) ≃ 1 Q(x,x).

38 / 48

slide-12
SLIDE 12

Regularization

“Using pseudo inverse is like saying 0 = +∞”. Regularization: Let µ0 be a simple absolutely continuous measure (moments are easy to compute). Replace µ by µ + βµ0, β > 0. Mµ+βµ0,d = Mµ,d + βMµ0,d ≻ 0 Λd

µ+βµ0 ≥ Λd µ + βΛd µ0

  • (Λd

µ+βµ0)−1dµ ≤

  • (Λd

µ+βµ0)−1d(µ + βµ0) = s(d) = O(dp)

The moment matrix is positive definite If Λd

µ+βµ0 is small, then Λd µ is also small.

Λd

µ+βµ0 stays reasonably big on the support of µ.

Λd

µ+βµ0 stays reasonably small outside the support of µ (if β is small).

39 / 48

slide-13
SLIDE 13

Outline

  • 1. CD kernel, Christoffel function, orthogonal polynomials, moments
  • 2. CD kernel captures measure theoretic properties: univariate case
  • 3. Quantitative asymptotics
  • 4. The singular case
  • 5. Using approximate moments
  • 6. An application to polynomial optimal control

40 / 48

slide-14
SLIDE 14

Acknowledgement

The content of this section is taken from Marx, S., Pauwels, E., Weisser, T., Henrion, D., & Lasserre, J. (2019). Tractable semi-algebraic approximation using Christoffel-Darboux kernel. arXiv preprint arXiv:1904.01833.

41 / 48

slide-15
SLIDE 15

From the tutorial of Didier

Controled ODE, ˙ x(t) = f (x(t), u(t)), x(t) ∈ X, u(t) ∈ U, t ∈ [0, 1], x(0) = 0 Occupation measure, given a classical trajectory dµ(x, u, t) = dδx(t)(x)dδu(t)(u)dt Relaxation: Replace classical trajectories satisfying an ODE by measures satisfying a linear transport PDE.

42 / 48

slide-16
SLIDE 16

A heuristic argument

Hierarchy: f polynomial, X, U basic semi-algebraic: level d provides pseudo-moments up to degree 2d in variables t, u, x. PMd Heuristic: As d grows PMd should get close to Mµ,d where µ is an occupation measure supported on optimal trajectories. Use the Christoffel Darboux kernel: “(x, u, t)TPM−1

d (x, u, t)”

The measure is singular, we only have pseudo moments . . . Morally, it is small on the support of µ and large outside the support. Morally, it is small on the optimal trajectory and large outside.

43 / 48

slide-17
SLIDE 17

A semi-algebraic estimator

Hierarchy: f polynomial, X, U basic semi-algebraic: level d provides pseudo-moments up to degree 2d in variables t, u, x. PMd Christoffel Darboux kernel: “(x, u, t)TPM−1

d (x, u, t)” = Qd(x, u, t)

Morally, it is small on the optimal trajectory and large outside. A semi-algebraic estimator: For all t ∈ [0, 1] (ˆ u(t), ˆ x(t)) ∈ argmin(x,u)Q(x, u, t). An example with x(t) = sign(t)/2 and exact moments

Legendre projection Christoffel-Darboux approximation

  • 1.0
  • 0.5

0.0 0.5 1.0

  • 1.0
  • 0.5

0.0 0.5 1.0

  • 0.8
  • 0.4

0.0 0.4 0.8

x ˆ fd(x)

10 20 30

d

44 / 48

slide-18
SLIDE 18

Convergence guaranties

A semi-algebraic estimator: Qd(x, u, t) = “(x, u, t)TPM−1

d (x, u, t)”

(ˆ u, ˆ x): t → (ˆ u(t), ˆ x(t)) ∈ argmin(x,u)Q(x, u, t). Assumption: x, u in L1, bounded, continuous almost everywhere, exact moments. Strong convergence in L1. Assumption: x, u Lipschitz, exact moments. Rate of order O(1/ √ d). Assumption: x, u have bounded total variation, exact moments. Conjecture: Rate of order O(1/d

1 4 ). 45 / 48

slide-19
SLIDE 19

Illustration on the double integrator with constraints

Minimal time to reach the origin. u ∈ [−1, 1], x1 ≥ −1. ˙ x2(t) = x1(t) ˙ x1(t) = u(t)

0.2 0.4 0.6 0.8 1 time

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 0.4 0.6 0.8 1 control 0.2 0.4 0.6 0.8 1 time

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 0.4 0.6 0.8 1 second state 0.2 0.4 0.6 0.8 1 time

  • 0.1

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 first state

With True moments:

0.2 0.4 0.6 0.8 1 time

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 0.4 0.6 0.8 1 control

46 / 48

slide-20
SLIDE 20

Illustration in Chemo-Immuno therapy modeling

Moussa, K., Fiacchini, M., & Alamir, M. (2019). Robust Optimal Control-based Design

  • f Combined Chemo-and Immunotherapy Delivery Profiles. IFAC-PapersOnLine, 52(26),

76-81.

47 / 48

slide-21
SLIDE 21

Conclusion

dp exp(αd) dp+1 dp+2 exp(α √ d)

CD kernel is computed from moments of a measure µ. It captures the support of µ. Century old mathematical history and still active. Proper set up, proof guaranties, require some subtleties. Can be combined with Lassere’s Hierarchy: example in polynomial optimal control.

48 / 48