Rational minimax approximation via adaptive barycentric - - PowerPoint PPT Presentation

▶

rational minimax approximation via adaptive barycentric

Rational minimax approximation via adaptive barycentric - - PowerPoint PPT Presentation

Jan 05, 2024 453 likes •973 views

Rational minimax approximation via adaptive barycentric representations Silviu Filip , CAIRN team, Univ Rennes, Inria, CNRS, IRISA joint work with Bernhard Beckermann, Yuji Nakatsukasa and Lloyd N. Trefethen January 22, 2018 1 / 18 Rational

slide-1

SLIDE 1

Rational minimax approximation via adaptive barycentric representations

Silviu Filip, CAIRN team, Univ Rennes, Inria, CNRS, IRISA

joint work with Bernhard Beckermann, Yuji Nakatsukasa

and Lloyd N. Trefethen January 22, 2018

1 / 18

slide-2

SLIDE 2

Rational functions

Why are they important? → powerful approximations near singularities or on unbounded domains

2 / 18

slide-3

SLIDE 3

Rational functions

Why are they important? → powerful approximations near singularities or on unbounded domains Some applications: elementary + special functions recursive filter design matrix exponentials & stiff PDEs

ptimal control problems

...

2 / 18

slide-4

SLIDE 4

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f)

3 / 18

slide-5

SLIDE 5

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results:

3 / 18

slide-6

SLIDE 6

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results: existence & unicity of r∗ [de la Vallée Poussin, Walsh]

3 / 18

slide-7

SLIDE 7

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results: existence & unicity of r∗ [de la Vallée Poussin, Walsh] presence of the defect: → if r∗ = p∗/q∗ in irreducible form, then its defect is d = min {m − deg p∗, n − deg q∗}

3 / 18

slide-8

SLIDE 8

Rational minimax approximation

Input: f ∈ C([a, b]), target type (m, n) ∈ N2 Output: r∗ ∈ Rm,n = p q , p ∈ Rm[x], q ∈ Rn[x]

s.t.

f − r∗∞ is minimal. → denote this minimax error with Em,n(f) → theoretical results: existence & unicity of r∗ [de la Vallée Poussin, Walsh] presence of the defect: → if r∗ = p∗/q∗ in irreducible form, then its defect is d = min {m − deg p∗, n − deg q∗} Alternation Theorem [Achieser 1930]: → f − r∗ equioscillates at least m + n + 2 − d times

3 / 18

slide-9

SLIDE 9

A classic example: f(x) = |x|, x ∈ [−1, 1]

4 / 18

slide-10

SLIDE 10

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ consider best uniform approximations: degree 8 polynomial Error curve

1
0.8
0.6
0.4
0.2

0.2 0.4 0.6 0.8 1

0.04
0.03
0.02
0.01

0.01 0.02 0.03 0.04

4 / 18

slide-11

SLIDE 11

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ consider best uniform approximations: degree 8 polynomial type (4, 4) rational function Error curve

1
0.8
0.6
0.4
0.2

0.2 0.4 0.6 0.8 1

0.04
0.03
0.02
0.01

0.01 0.02 0.03 0.04

4 / 18

slide-12

SLIDE 12

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993]

5 / 18

slide-13

SLIDE 13

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993] → rational minimax approximations can be difficult to compute e.g. [Varga, Ruttan & Carpenter 1991] conjecture Stahl’s result using 200-digit arithmetic for n 80

5 / 18

slide-14

SLIDE 14

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993] → rational minimax approximations can be difficult to compute e.g. [Varga, Ruttan & Carpenter 1991] conjecture Stahl’s result using 200-digit arithmetic for n 80 → codes (the Remez algorithm): Maple: numapprox[minimax] Mathematica: MinimaxApproximation

5 / 18

slide-15

SLIDE 15

A classic example: f(x) = |x|, x ∈ [−1, 1]

→ asymptotic behavior En,0(f) ∼ β/n, β = 0.2801... [Varga & Carpenter 1985] En,n(f) ∼ 8e−√n, [Newman 1964, Stahl 1993] → rational minimax approximations can be difficult to compute e.g. [Varga, Ruttan & Carpenter 1991] conjecture Stahl’s result using 200-digit arithmetic for n 80 → codes (the Remez algorithm): Maple: numapprox[minimax] Mathematica: MinimaxApproximation Chebfun (Matlab): minimax

5 / 18

slide-16

SLIDE 16

Barycentric representations

→ many different ways of representing rational functions

6 / 18

slide-17

SLIDE 17

Barycentric representations

→ many different ways of representing rational functions → barycentric form for type (n, n) rational functions r(z) = N(z) D(z) =

n

k=0

αk z − tk

n
k=0

βk z − tk Notation: {αk}, {βk} barycentric coefficients {tk} support points

6 / 18

slide-18

SLIDE 18

Barycentric representations

Why use adaptive barycentric formulas? → problem dependent {tk} ⇒ well conditioned representation

7 / 18

slide-19

SLIDE 19

Barycentric representations

Why use adaptive barycentric formulas? → problem dependent {tk} ⇒ well conditioned representation Example: → the adaptive Antoulas-Anderson (AAA) algorithm [Nakatsukasa, Sète & Trefethen 2018]: greedy least squares approximation

7 / 18

slide-20

SLIDE 20

Example: f(x) = |x|, x ∈ [−1, 1], type (20, 20) p/q vs N/D

1
0.5

0.5 1 1 2 q |D|

1
0.5

0.5 1 10-15 10-10 10-5 100 q |D|

8 / 18

slide-21

SLIDE 21

The rational Remez algorithm

9 / 18

slide-22

SLIDE 22

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience

9 / 18

slide-23

SLIDE 23

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b

9 / 18

slide-24

SLIDE 24

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b → iterate the following steps until convergence:

9 / 18

slide-25

SLIDE 25

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b → iterate the following steps until convergence: Step 2: find r ∈ Rn,n and λ ∈ R s.t. f(xk) − r(xk) = (−1)k+1λ, k = 0, . . . , 2n + 1

9 / 18

slide-26

SLIDE 26

The rational Remez algorithm

Some assumptions: no defect (d = 0) → required diagonal case m = n → for convenience Step 1: choose a reference set a x0 < · · · < x2n+1 b → iterate the following steps until convergence: Step 2: find r ∈ Rn,n and λ ∈ R s.t. f(xk) − r(xk) = (−1)k+1λ, k = 0, . . . , 2n + 1 Step 3: among local extrema of f − r, take 2n + 2 new points a x′

0 < · · · < x′ 2n+1 b,

f − r alternates in sign + at least one global extrema over [a, b] and |f(x′

k) − r(x′ k)| |λ| ,

k = 0, . . . , 2n + 1

9 / 18

slide-27

SLIDE 27

The rational Remez algorithm

Convergence: → usually quadratic [Curtis & Osborne 1966] → guaranteed only if starting reference set is close enough to optimal

10 / 18

slide-28

SLIDE 28

The rational Remez algorithm

Convergence: → usually quadratic [Curtis & Osborne 1966] → guaranteed only if starting reference set is close enough to optimal What can go wrong? → no pole-free solution in Step 2

10 / 18

slide-29

SLIDE 29

Step 1: initial reference set

→ need suff. good initial guess for {xk}

11 / 18

slide-30

SLIDE 30

Step 1: initial reference set

→ need suff. good initial guess for {xk} Our strategy: use Carathéodory-Fejér (CF) approximation [Trefethen & Gutknecht 1983]

11 / 18

slide-31

SLIDE 31

Step 1: initial reference set

→ need suff. good initial guess for {xk} Our strategy: use Carathéodory-Fejér (CF) approximation [Trefethen & Gutknecht 1983] AAA-Lawson approx. (adaptively re-weighted least squares AAA variant)

11 / 18

slide-32

SLIDE 32

Step 1: initial reference set

→ need suff. good initial guess for {xk} Our strategy: use Carathéodory-Fejér (CF) approximation [Trefethen & Gutknecht 1983] AAA-Lawson approx. (adaptively re-weighted least squares AAA variant) extrapolation from lower degree approx. ((2, 2), (3, 3), (4, 4), . . .)

11 / 18

slide-33

SLIDE 33

Step 2: find r

→ find r = N/D ∈ Rn,n s.t. N(xk) = D(xk)(f(xk) − (−1)k+1λ), k = 0, . . . , 2n + 1

12 / 18

slide-34

SLIDE 34

Step 2: find r

→ find r = N/D ∈ Rn,n s.t. N(xk) = D(xk)(f(xk) − (−1)k+1λ), k = 0, . . . , 2n + 1 → matrix form Cα =           f(x0) f(x1) ... f(x2n+1)      − λ      −1 1 −1 ...           Cβ, C ∈ R(2n+2)×(n+1) Cauchy matrix, Ck,j = 1/(xk − tj)

12 / 18

slide-35

SLIDE 35

Step 2: find r

→ find r = N/D ∈ Rn,n s.t. N(xk) = D(xk)(f(xk) − (−1)k+1λ), k = 0, . . . , 2n + 1 → matrix form Cα =           f(x0) f(x1) ... f(x2n+1)      − λ      −1 1 −1 ...           Cβ, C ∈ R(2n+2)×(n+1) Cauchy matrix, Ck,j = 1/(xk − tj) → generalized eigenvalue problem C −FC α β

= λ

−SC α β

F = diag(f(xk)), S = diag((−1)k+1)

12 / 18

slide-36

SLIDE 36

Step 2: find r

→ generalized eigenvalue problem

C

−FC α β

= λ
−SC

α β

13 / 18

slide-37

SLIDE 37

Step 2: find r

→ generalized eigenvalue problem

C

−FC α β

= λ
−SC

α β

→ can transform it into a symmetric eigenvalue problem

QT

1 (SF)Q1Rβ = λRβ,

where ωx(x) = 2n+1

k=0 (x − xk),

ωt(x) = n

j=0(x − tj),

∆ = diag ωt(x0)2 ω′

x(x0) , . . . , ωt(x2n+1)2

ω′

x(x2n+1)

and |∆|1/2C = Q1R

13 / 18

slide-38

SLIDE 38

Step 2: find r

→ generalized eigenvalue problem

C

−FC α β

= λ
−SC

α β

→ can transform it into a symmetric eigenvalue problem

QT

1 (SF)Q1Rβ = λRβ,

where ωx(x) = 2n+1

k=0 (x − xk),

ωt(x) = n

j=0(x − tj),

∆ = diag ωt(x0)2 ω′

x(x0) , . . . , ωt(x2n+1)2

ω′

x(x2n+1)

and |∆|1/2C = Q1R

→ well conditioned eigenvalue computation

13 / 18

slide-39

SLIDE 39

Step 2: choice of the {tk}

14 / 18

slide-40

SLIDE 40

Step 2: choice of the {tk}

→ because we perform the QR factorization of |∆|1/2C, take {tk} to minimize min

Γ κ2(|∆|1/2CΓ),

Γ diagonal scaling matrix

14 / 18

slide-41

SLIDE 41

Step 2: choice of the {tk}

→ because we perform the QR factorization of |∆|1/2C, take {tk} to minimize min

Γ κ2(|∆|1/2CΓ),

Γ diagonal scaling matrix → we show that this happens (with optimum 1) for tk = x2k+1, k = 0, . . . , n

14 / 18

slide-42

SLIDE 42

Step 3: next reference set

Approach:

15 / 18

slide-43

SLIDE 43

Step 3: next reference set

Approach: → decompose [a, b] into nondegenerate intervals [a, x0], [x0, x1], . . . , [x2n, x2n+1], [x2n+1, b]

15 / 18

slide-44

SLIDE 44

Step 3: next reference set

Approach: → decompose [a, b] into nondegenerate intervals [a, x0], [x0, x1], . . . , [x2n, x2n+1], [x2n+1, b] → detect singularities of f + further decomp. of [a, b] → use splitting on [Pachón, Platte & Trefethen 2010]

15 / 18

slide-45

SLIDE 45

Step 3: next reference set

Approach: → decompose [a, b] into nondegenerate intervals [a, x0], [x0, x1], . . . , [x2n, x2n+1], [x2n+1, b] → detect singularities of f + further decomp. of [a, b] → use splitting on [Pachón, Platte & Trefethen 2010] → Chebyshev interpolants of e(x) = f(x) − r(x) on each subinterval → colleague matrix root finding [Specht, Good]

15 / 18

slide-46

SLIDE 46

Examples

DEMO

16 / 18

slide-47

SLIDE 47

Conclusion

→ robust rational Remez algorithm (available now in Chebfun): adaptive barycentric representation → eigenvalue problem with good stability good choice of the initial reference points (CF, AAA-Lawson, extrapolation) colleague matrix root finding

17 / 18

slide-48

SLIDE 48

Conclusion

→ robust rational Remez algorithm (available now in Chebfun): adaptive barycentric representation → eigenvalue problem with good stability good choice of the initial reference points (CF, AAA-Lawson, extrapolation) colleague matrix root finding → what I did not talk about: what to do in degenerate d > 0 cases how do we handle m = n problem instances

17 / 18

slide-49

SLIDE 49

Conclusion

→ robust rational Remez algorithm (available now in Chebfun): adaptive barycentric representation → eigenvalue problem with good stability good choice of the initial reference points (CF, AAA-Lawson, extrapolation) colleague matrix root finding → what I did not talk about: what to do in degenerate d > 0 cases how do we handle m = n problem instances → the details:

B. Beckermann, S.-I. Filip, Y. Nakatsukasa, L. N. Trefethen, Rational minimax

approximation via adaptive barycentric representations, arXiv:1705.10132, under minor revision for SIAM Journal on Scientific Computing

17 / 18

slide-50

SLIDE 50

Conclusion

→ directions for future work: handle weighted approximation problems (e.g., relative error) extensions to:

the complex case bivariate approximation

18 / 18

slide-51

SLIDE 51

Conclusion

→ directions for future work: handle weighted approximation problems (e.g., relative error) extensions to:

the complex case bivariate approximation

Thank you!

18 / 18