Robust rational approximation with barycentric representations - - PowerPoint PPT Presentation

robust rational approximation with barycentric
SMART_READER_LITE
LIVE PREVIEW

Robust rational approximation with barycentric representations - - PowerPoint PPT Presentation

Robust rational approximation with barycentric representations Silviu Filip , CAIRN team, Univ Rennes, Inria, CNRS, IRISA joint work with Bernhard Beckermann, Yuji Nakatsukasa and Lloyd N. Trefethen March 13, 2018 1 / 17 Rational functions Why


slide-1
SLIDE 1

Robust rational approximation with barycentric representations

Silviu Filip, CAIRN team, Univ Rennes, Inria, CNRS, IRISA

joint work with Bernhard Beckermann, Yuji Nakatsukasa

and Lloyd N. Trefethen March 13, 2018

1 / 17

slide-2
SLIDE 2

Rational functions

Why are they important? → powerful approximations near singularities or on unbounded domains

2 / 17

slide-3
SLIDE 3

Rational functions

Why are they important? → powerful approximations near singularities or on unbounded domains Some applications: elementary + special functions recursive filter design matrix exponentials & stiff PDEs

  • ptimal control problems

...

2 / 17

slide-4
SLIDE 4

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

  • s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f)

3 / 17

slide-5
SLIDE 5

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

  • s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results:

3 / 17

slide-6
SLIDE 6

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

  • s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results: existence & unicity of r∗ [de la Vallée Poussin, Walsh]

3 / 17

slide-7
SLIDE 7

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

  • s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results: existence & unicity of r∗ [de la Vallée Poussin, Walsh] presence of the defect: → if r∗ = p∗/q∗ in irreducible form, then its defect is d = min {m − deg p∗, n − deg q∗}

3 / 17

slide-8
SLIDE 8

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

  • s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results: existence & unicity of r∗ [de la Vallée Poussin, Walsh] presence of the defect: → if r∗ = p∗/q∗ in irreducible form, then its defect is d = min {m − deg p∗, n − deg q∗} Alternation Theorem [Achieser 1930]: → f − r∗ equioscillates at least m + n + 2 − d times

3 / 17

slide-9
SLIDE 9

A classic example: f(x) = |x|, x ∈ [−1, 1]

4 / 17

slide-10
SLIDE 10

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ consider best uniform approximations: degree 8 polynomial Error curve

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 0.4 0.6 0.8 1

  • 0.04
  • 0.03
  • 0.02
  • 0.01

0.01 0.02 0.03 0.04

4 / 17

slide-11
SLIDE 11

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ consider best uniform approximations: degree 8 polynomial type (4, 4) rational function Error curve

  • 1
  • 0.8
  • 0.6
  • 0.4
  • 0.2

0.2 0.4 0.6 0.8 1

  • 0.04
  • 0.03
  • 0.02
  • 0.01

0.01 0.02 0.03 0.04

4 / 17

slide-12
SLIDE 12

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993]

5 / 17

slide-13
SLIDE 13

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993] → rational minimax approximations can be difficult to compute e.g. [Varga, Ruttan & Carpenter 1991] conjecture Stahl’s result using 200-digit arithmetic for n 80

5 / 17

slide-14
SLIDE 14

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993] → rational minimax approximations can be difficult to compute e.g. [Varga, Ruttan & Carpenter 1991] conjecture Stahl’s result using 200-digit arithmetic for n 80 → codes (the Remez algorithm): Maple: numapprox[minimax] Mathematica: MinimaxApproximation

5 / 17

slide-15
SLIDE 15

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993] → rational minimax approximations can be difficult to compute e.g. [Varga, Ruttan & Carpenter 1991] conjecture Stahl’s result using 200-digit arithmetic for n 80 → codes (the Remez algorithm): Maple: numapprox[minimax] Mathematica: MinimaxApproximation Chebfun (Matlab): minimax

5 / 17

slide-16
SLIDE 16

A classic example: f(x) = |x|, x ∈ [−1, 1]

10 20 30 40 50 60 70 80 10 -15 10 -10 10 -5 10 0 10 -12 10 -10 10 -8 10 -6 10 -4 10 -2 10 0

  • 8
  • 6
  • 4
  • 2

2 4 6 10 -12

6 / 17

slide-17
SLIDE 17

Barycentric representations

→ many different ways of representing rational functions monomial/monomial, Chebyshev/Chebyshev, partial fraction, etc.

7 / 17

slide-18
SLIDE 18

Barycentric representations

→ many different ways of representing rational functions monomial/monomial, Chebyshev/Chebyshev, partial fraction, etc. → barycentric form for type (n, n) rational functions r(z) = N(z) D(z) =

n

  • k=0

αk z − tk

  • n
  • k=0

βk z − tk Notation: {αk}, {βk} barycentric coefficients {tk} support points

7 / 17

slide-19
SLIDE 19

Barycentric representations

→ many different ways of representing rational functions monomial/monomial, Chebyshev/Chebyshev, partial fraction, etc. → barycentric form for type (n, n) rational functions r(z) = N(z) D(z) =

n

  • k=0

αk z − tk

  • n
  • k=0

βk z − tk Notation: {αk}, {βk} barycentric coefficients {tk} support points Why use adaptive barycentric formulas? → problem dependent {tk} ⇒ well conditioned representation

7 / 17

slide-20
SLIDE 20

The rational Remez algorithm

8 / 17

slide-21
SLIDE 21

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience

8 / 17

slide-22
SLIDE 22

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b

8 / 17

slide-23
SLIDE 23

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b → iterate the following steps until convergence:

8 / 17

slide-24
SLIDE 24

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b → iterate the following steps until convergence: Step 2: find r ∈ Rn,n and λ ∈ R s.t. f(xk) − r(xk) = (−1)k+1λ, k = 0, . . . , 2n + 1

8 / 17

slide-25
SLIDE 25

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b → iterate the following steps until convergence: Step 2: find r ∈ Rn,n and λ ∈ R s.t. f(xk) − r(xk) = (−1)k+1λ, k = 0, . . . , 2n + 1 Step 3: among local extrema of f − r, take 2n + 2 new points a x′

0 < · · · < x′ 2n+1 b,

f − r alternates in sign + at least one global extrema over [a, b] and |f(x′

k) − r(x′ k)| |λ| ,

k = 0, . . . , 2n + 1

8 / 17

slide-26
SLIDE 26

The rational Remez algorithm

Convergence: → usually quadratic [Curtis & Osborne 1966] → guaranteed only if starting reference set is close enough to optimal

9 / 17

slide-27
SLIDE 27

The rational Remez algorithm

Convergence: → usually quadratic [Curtis & Osborne 1966] → guaranteed only if starting reference set is close enough to optimal What can go wrong? → no pole-free solution in Step 2

9 / 17

slide-28
SLIDE 28

Step 1: initial reference set

→ need suff. good initial guess for {xk}

10 / 17

slide-29
SLIDE 29

Step 1: initial reference set

→ need suff. good initial guess for {xk} Our strategy: use Carathéodory-Fejér (CF) approximation [Trefethen & Gutknecht 1983] AAA-Lawson approx. (adaptively re-weighted least squares AAA variant) extrapolation from lower degree approx. ((2, 2), (3, 3), (4, 4), . . .)

10 / 17

slide-30
SLIDE 30

Step 2: find r

→ find r = N/D ∈ Rn,n s.t. N(xk) = D(xk)(f(xk) − (−1)k+1λ), k = 0, . . . , 2n + 1

11 / 17

slide-31
SLIDE 31

Step 2: find r

→ find r = N/D ∈ Rn,n s.t. N(xk) = D(xk)(f(xk) − (−1)k+1λ), k = 0, . . . , 2n + 1 → matrix form Cα =           f(x0) f(x1) ... f(x2n+1)      − λ      −1 1 −1 ...           Cβ, C ∈ R(2n+2)×(n+1) Cauchy matrix, Ck,j = 1/(xk − tj)

11 / 17

slide-32
SLIDE 32

Step 2: find r

→ find r = N/D ∈ Rn,n s.t. N(xk) = D(xk)(f(xk) − (−1)k+1λ), k = 0, . . . , 2n + 1 → matrix form Cα =           f(x0) f(x1) ... f(x2n+1)      − λ      −1 1 −1 ...           Cβ, C ∈ R(2n+2)×(n+1) Cauchy matrix, Ck,j = 1/(xk − tj) → generalized eigenvalue problem C −FC α β

  • = λ

−SC α β

  • F = diag(f(xk)), S = diag((−1)k+1)

11 / 17

slide-33
SLIDE 33

Step 2: find r

→ generalized eigenvalue problem C −FC α β

  • = λ

−SC α β

  • 12 / 17
slide-34
SLIDE 34

Step 2: find r

→ generalized eigenvalue problem C −FC α β

  • = λ

−SC α β

  • → can transform it into a symmetric eigenvalue problem

QT

1 (SF)Q1Rβ = λRβ,

where ωx(x) = 2n+1

k=0 (x − xk),

ωt(x) = n

j=0(x − tj),

∆ = diag ωt(x0)2 ω′

x(x0) , . . . , ωt(x2n+1)2

ω′

x(x2n+1)

  • and |∆|1/2C = Q1R

12 / 17

slide-35
SLIDE 35

Step 2: find r

→ generalized eigenvalue problem C −FC α β

  • = λ

−SC α β

  • → can transform it into a symmetric eigenvalue problem

QT

1 (SF)Q1Rβ = λRβ,

where ωx(x) = 2n+1

k=0 (x − xk),

ωt(x) = n

j=0(x − tj),

∆ = diag ωt(x0)2 ω′

x(x0) , . . . , ωt(x2n+1)2

ω′

x(x2n+1)

  • and |∆|1/2C = Q1R

→ we show that this problem is usually well conditioned if tk = x2k+1, k = 0, . . . , n

12 / 17

slide-36
SLIDE 36

Example: f(x) = |x|, x ∈ [−1, 1], type (20, 20)

monomial p/q vs barycentric N/D

  • 1
  • 0.5

0.5 1 1 2 q |D|

  • 1
  • 0.5

0.5 1 10-15 10-10 10-5 100 q |D|

→ dots represent the xk’s

13 / 17

slide-37
SLIDE 37

Step 3: next reference set

Approach:

14 / 17

slide-38
SLIDE 38

Step 3: next reference set

Approach: → decompose [a, b] into nondegenerate intervals [a, x0], [x0, x1], . . . , [x2n, x2n+1], [x2n+1, b]

14 / 17

slide-39
SLIDE 39

Step 3: next reference set

Approach: → decompose [a, b] into nondegenerate intervals [a, x0], [x0, x1], . . . , [x2n, x2n+1], [x2n+1, b] → detect singularities of f + further decomp. of [a, b] → use splitting on [Pachón, Platte & Trefethen 2010]

14 / 17

slide-40
SLIDE 40

Step 3: next reference set

Approach: → decompose [a, b] into nondegenerate intervals [a, x0], [x0, x1], . . . , [x2n, x2n+1], [x2n+1, b] → detect singularities of f + further decomp. of [a, b] → use splitting on [Pachón, Platte & Trefethen 2010] → Chebyshev interpolants of e(x) = f(x) − r(x) on each subinterval → colleague matrix root finding [Specht, Good]

14 / 17

slide-41
SLIDE 41

Examples

DEMO

15 / 17

slide-42
SLIDE 42

Conclusion

→ robust rational Remez algorithm (available now in Chebfun): adaptive barycentric representation → eigenvalue problem with good stability good choice of the initial reference points (CF, AAA-Lawson, extrapolation) colleague matrix root finding

16 / 17

slide-43
SLIDE 43

Conclusion

→ robust rational Remez algorithm (available now in Chebfun): adaptive barycentric representation → eigenvalue problem with good stability good choice of the initial reference points (CF, AAA-Lawson, extrapolation) colleague matrix root finding → what I did not talk about: what to do in degenerate d > 0 cases how do we handle m = n problem instances

16 / 17

slide-44
SLIDE 44

Conclusion

→ robust rational Remez algorithm (available now in Chebfun): adaptive barycentric representation → eigenvalue problem with good stability good choice of the initial reference points (CF, AAA-Lawson, extrapolation) colleague matrix root finding → what I did not talk about: what to do in degenerate d > 0 cases how do we handle m = n problem instances → the details:

  • B. Beckermann, S.-I. Filip, Y. Nakatsukasa, L. N. Trefethen, Rational minimax

approximation via adaptive barycentric representations, arXiv:1705.10132

16 / 17

slide-45
SLIDE 45

Conclusion

→ directions for future work: handle weighted approximation problems (e.g., relative error) extensions to:

the complex case bivariate approximation

17 / 17

slide-46
SLIDE 46

Conclusion

→ directions for future work: handle weighted approximation problems (e.g., relative error) extensions to:

the complex case bivariate approximation

Thank you!

17 / 17