SLIDE 1
Convergence acceleration by orthogonal polynomials Dirk Laurie University of Stellenbosch Luminy, 29 September 2009
SLIDE 2
Sequence or series

Let s_n, n = 1, 2, 3, . . ., be a sequence in a vector space S. Often S is a field, in fact usually R or C.
- s_n → s, not very fast.
- a_k = Δs_k = s_{k+1} − s_k, so that s_n = Σ_{k=0}^{n−1} a_k.
- The definition implies s_0 = 0. For some sequences, defining s_0 = 0 when it was not given is nonsense.
SLIDE 3
The convergence acceleration matrix
s_{k,n} depends on precisely s_k, s_{k+1}, s_{k+2}, . . . , s_{k+n}.
Thus s_{k,0} = s_k. Not all methods define s_{k,n} for all possible pairs (k, n).

s_{0,0}    s_{0,1}    s_{0,2}    . . .    s_{0,n−1}    s_{0,n}
s_{1,0}    s_{1,1}    s_{1,2}    . . .    s_{1,n−1}
  .          .          .      .
  .          .          .        .
s_{n−2,0}  s_{n−2,1}  s_{n−2,2}
s_{n−1,0}  s_{n−1,1}
s_{n,0}
SLIDE 4
Transformed sequences

The first (if s_0 is allowed) or second row,
s′_k = s_{0,k},   s″_k = s_{1,k−1},
is in practice a good compromise between a single value and the full acceleration matrix.
SLIDE 5
Linearity and convergence

Linear acceleration methods: t_k = αs_k + u_k ⇒ t_{k,n} = αs_{k,n} + u_{k,n}.

Convergence rate: a real number r such that lim_{k→∞} r^{−k} a_k exists and is nonzero. (A signed convergence radius.) Logarithmic convergence if r = 1. For this talk, −1 ≤ r < 1.
SLIDE 6
Recursion formula

All linear methods can be written as
s_{k,j} = s_{k+1,j−1} + (r_{k,j}/(1 − r_{k,j})) (s_{k+1,j−1} − s_{k,j−1}),
where r_{k,j} is preassigned. This is not the same as saying there is a simple formula for r_{k,j}.

s_{k,j−1}   →  s_{k,j}
s_{k+1,j−1} ↗
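The recursion above is easy to turn into code. A minimal sketch (the function and parameter names `accelerate` and `rate` are mine, not from the talk): each new column is built from the previous one using the preassigned numbers r_{k,j}.

```python
def accelerate(s, rate):
    """Build the linear acceleration triangle from partial sums s.

    rate(k, j) supplies the preassigned number r_{k,j}; column j holds
    s_{k,j} = s_{k+1,j-1} + r/(1-r) * (s_{k+1,j-1} - s_{k,j-1}).
    """
    cols = [list(s)]                      # column 0: s_{k,0} = s_k
    for j in range(1, len(s)):
        prev = cols[-1]
        nxt = []
        for k in range(len(prev) - 1):
            r = rate(k, j)
            nxt.append(prev[k + 1] + r / (1 - r) * (prev[k + 1] - prev[k]))
        cols.append(nxt)
    return cols

# Sanity check: a sequence s_k = s + c*r^k is annihilated in one column
# when r_{k,1} = r, as claimed on the next slide.
s_exact, c, r = 2.0, 1.0, 0.5
s = [s_exact + c * r**k for k in range(6)]
cols = accelerate(s, lambda k, j: r)      # cols[1] is already exact
```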
SLIDE 7
Choice of r_{k,j}

If r^{−k} a_k is constant, r_{k,1} = r ⇒ s_{k,j} = s for j ≥ 1.

Generalized Euler method: choose r_{k,j} = r_j equal to the (known, expected or guessed) convergence rate of s_{·,j−1}.
Original Euler method: if r < 0, choose r_{k,j} = −1 always.
SLIDE 8
An example where linear acceleration works spectacularly well

Let s_k be the perimeter of the regular polygon with n_k = 3·2^k sides inscribed in the unit circle. ARCHIMEDES [−240 ± ε]∗

s_k = n_k sin(π/n_k)

Since the power series for (sin πx)/x contains only even powers, take r_j = 4^{−j}.

Case j = 1: HUYGENS [1654]

∗A palimpsest containing the words “Archimedes scripsit anno CCXL ante natem Christi” is believed to be a fake, possibly because it is in Latin, not in Greek.
SLIDE 9
Acceleration by Richardson extrapolation

s_{k,j} = s_{k+1,j−1} + (4^{−j}/(1 − 4^{−j})) (s_{k+1,j−1} − s_{k,j−1})

k   s_k                 s″_k
1   3.000000000000000   3.00000000000000
2   3.105828541230249   3.14110472164033
3   3.132628613281238   3.14159245389765
4   3.139350203046867   3.14159265357789
5   3.141031950890510   3.14159265358979

s_k has approximately 0.6(k + 1) correct digits (0.6 ≈ log10 4).
s″_k has approximately 0.6k(k + 1) correct digits.
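The extrapolation above can be sketched in a few lines (my illustration, not code from the talk): generate the polygon perimeters and apply the recursion with the known rates r_j = 4^{−j}.

```python
import math

# Perimeters s_k = n_k * sin(pi/n_k) of inscribed polygons, n_k = 3 * 2^k.
s = [3 * 2**k * math.sin(math.pi / (3 * 2**k)) for k in range(1, 7)]

cols = [list(s)]
for j in range(1, len(s)):
    r = 4.0**-j                       # known convergence rate of column j-1
    prev = cols[-1]
    cols.append([prev[k + 1] + r / (1 - r) * (prev[k + 1] - prev[k])
                 for k in range(len(prev) - 1)])

best = cols[-1][0]                    # top entry of the last column
# best agrees with pi to machine precision, while the raw s[-1]
# (a 192-gon) still has only about 4 correct digits.
```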
SLIDE 10
What can we learn from this example?

For linear extrapolation methods, the following two properties tend to occur together:
- Very precise information about the error expansion.
- Very fast convergence of the transformed sequence.
SLIDE 11
Are there ‘universal’ linear methods?

- No.
- Not in the sense that they work in all cases.
- Yes, if r is known and r ≠ 1. (That’s not universal!)
- Well, r = −1 includes the slowest convergent alternating series.
- Actually, methods based on assuming r = −1 work fairly well for other r < 0.
SLIDE 12
The case r = −1 (alternating series)

Model problem:
s_k = 1 − 1/2 + 1/3 − 1/4 + · · · + a_k,   a_k = (−1)^{k+1}/k
s = log 2 ≐ 0.693147180559945
SLIDE 13
Euler’s method

For the model problem,
s_{1,15} = 0.693147281939320,   error = 1.0·10^{−7}.
But
s_{6,10} = 0.693147175361531,   error = 5.2·10^{−9}.
Coincidence, or is it always better to stop earlier?
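With r_{k,j} = −1 the recursion reduces to repeated averaging of adjacent entries, which makes Euler's method a two-line sketch (my illustration, not the speaker's code):

```python
import math

# Partial sums s_1..s_16 of 1 - 1/2 + 1/3 - ...
s, t = [], 0.0
for k in range(1, 17):
    t += (-1)**(k + 1) / k
    s.append(t)

# Euler: r_{k,j} = -1 always, so each new column is the mean of
# adjacent entries of the previous column.
cols = [s]
while len(cols[-1]) > 1:
    prev = cols[-1]
    cols.append([(prev[k] + prev[k + 1]) / 2 for k in range(len(prev) - 1)])

err_full  = abs(cols[15][0] - math.log(2))   # s_{1,15}: uses all 16 terms
err_early = abs(cols[10][5] - math.log(2))   # s_{6,10}: stops earlier
```

Running this reproduces the effect on the slide: stopping earlier (s_{6,10}) beats using the whole triangle (s_{1,15}).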
SLIDE 14
Van Wijngaarden’s method (or something equivalent to it)

Take the subsequence s_{1+k,2k}, k = 1, 2, . . ., of Euler’s method. Equivalently, if only one value is needed: given 3n + 1 terms of an alternating series with r = −1, throw away the first n of the partial sums and apply Euler’s method to the remainder.

Obviously this idea should not be repeated until only one or two terms are left! Why does it work the first time round?
SLIDE 15
Totally alternating series

Totally monotone sequence: a_k and its differences Δa_k = a_{k+1} − a_k, Δ²a_k, . . ., are all monotone.

Totally alternating series: s_k = Σ_{m=0}^{k−1} (−1)^m b_m, where b_k is totally monotone.

Hausdorff’s Theorem, generalized: A series with convergence rate r is
- totally monotone when 0 ≤ r < 1,
- totally alternating when −1 ≤ r ≤ 0,
if and only if its terms can be represented as
a_k = ∫_0^r t^k w(t) dt,
where w does not change sign.
SLIDE 16
Application to convergence acceleration

[CRVZ] H. COHEN, F. RODRIGUEZ VILLEGAS, D. ZAGIER, Experimental Mathematics 9 (2000) 3–12.

Assume that s_0 = 0 is valid. Partial sums and limit:
s_k = Σ_{j=0}^{k−1} ∫_0^r t^j w(t) dt = ∫_0^r ((1 − t^k)/(1 − t)) w(t) dt
s = ∫_0^r w(t)/(1 − t) dt
SLIDE 17
Accelerated values and error

General linear: s_{k,n} = c_{0,n} s_k + c_{1,n} s_{k+1} + · · · + c_{n,n} s_{k+n}.
Define p_n(t) = c_{0,n} + c_{1,n} t + · · · + c_{n,n} t^n
⇒ s_{k,n} = ∫_0^r (p_n(1) − t^k p_n(t)) w(t)/(1 − t) dt.

Acceleration leaves a constant series invariant ⇒ p_n(1) = 1.

s − s_{k,n} = ∫_0^r t^k p_n(t) w(t)/(1 − t) dt
SLIDE 18
Error bounds

For −1 ≤ r < 1:
s − s_{k,n} = ∫_0^r t^k p_n(t) w(t)/(1 − t) dt
|s − s_{k,n}| ≤ (1/(1 − r)) |∫_0^r p_n(t) w(t) dt| ≤ (‖p_n‖_r/(1 − r)) |∫_0^r w(t) dt|,
where ‖p_n‖_r = max_{t∈I_r} |p_n(t)| and I_r = [min(0, r), max(0, r)].
SLIDE 19
Euler’s and Van Wijngaarden’s methods revisited

Take r = −1. For Euler’s method,
p_n(t) = ((1 + t)/2)^n ⇒ ‖p_n‖_{−1} = 2^{−n}.

For Van Wijngaarden’s method, for n divisible by 3,
p_n(t) = (t(1 + t)²/4)^{n/3} ⇒ ‖p_n‖_{−1} = 3^{−n}.
SLIDE 20
How to choose p_n

p_n(1) = 1, because a constant sequence must be unchanged when “accelerated”. What polynomial is as small as possible over [0, r] but satisfies this constraint?
SLIDE 21
Shifted Chebyshev polynomials

p_n(t) = P_n(t) = k_n T_n(−1 + 2t/r),
where k_n is chosen so that p_n(1) = 1. GUSTAFSON [1978]

When r = −1, ‖P_n‖_{−1} = 5.828^{−n} (3 + 2√2 ≐ 5.828).

NB: Good convergence only along rows. Nothing special about t^k p_n(t) when k increases and n is fixed.
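A sketch of this acceleration (my own illustration): compute the coefficients of the shifted Chebyshev polynomial by its three-term recurrence, normalize so that p_n(1) = 1, and form s_{0,n} as the corresponding combination of partial sums. I use T_n(2t + 1), which differs from T_n(−1 − 2t) only by a sign that the normalization absorbs; `next_cheb` is a hypothetical helper name.

```python
import math

def next_cheb(t1, t0):
    """T_{m+1}(2t+1) = 2(2t+1) T_m(2t+1) - T_{m-1}(2t+1),
    with polynomials as ascending coefficient lists."""
    out = [0.0] * (len(t1) + 1)
    for i, c in enumerate(t1):
        out[i] += 2.0 * c          # the 2*1 part of 2(2t+1)
        out[i + 1] += 4.0 * c      # the 2*2t part
    for i, c in enumerate(t0):
        out[i] -= c
    return out

# Partial sums s_1..s_12 of the model problem.
s, t = [], 0.0
for k in range(1, 13):
    t += (-1)**(k + 1) / k
    s.append(t)

n = 11
t0, t1 = [1.0], [1.0, 2.0]         # T_0 and T_1(2t+1)
for _ in range(n - 1):
    t0, t1 = t1, next_cheb(t1, t0)
p = [c / sum(t1) for c in t1]      # normalize so that p_n(1) = 1
s_acc = sum(cj * s[j] for j, cj in enumerate(p))
err = abs(s_acc - math.log(2))     # the next slide reports 1.6e-11
```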
SLIDE 22
Shifted Chebyshev polynomials in action on the model problem:

0.666666666666667
0.705882352941177
0.693602693602694
0.693240901213172
0.693150956487263
0.693148308759757
0.693147230444045
0.693147198701394
0.693147181406835
0.693147180897730
0.693147180576273

|s − s_{0,11}| = 1.6·10^{−11}. Compare with 5.828^{−11} = 3.8·10^{−9}.
SLIDE 23
[CRVZ]: even better

If w is assumed to be smooth, integration by parts is justified. One can select p_n(t) from the matrix of Zagier polynomials [PARI/GP]
P_{m,n} = k_{m,n} ∇₂^m (n^{m+1} P_n),   where ∇₂ P_n = P_n − P_{n−2}.

Two subsequences recommended by [CRVZ]:
A: m = n − 1. We need P_{−k} = P_k. Remember P_k(t) = cos(kθ(t)).
B: m = ⌊n/2⌋.
SLIDE 24
Error bounds for the CRVZ polynomials

[CRVZ] says: If w is analytic in the small region, Case A has error 7.89^{−n} and Case B has error 9.56^{−n}. If w is analytic in the large region, Case A has error 17.93^{−n} and Case B has error 14.41^{−n}.
SLIDE 25
De Boor classification∗ of Zagier polynomials

A cautious person will only use m = 0 (Chebyshev polynomials).
A reasonable person will use m = ⌊n/2⌋ (Case B).
An adventurous person will use m = n − 1 (Case A).

∗The need to think analytically . . . has been replaced by psychoanalytic introspection into the user’s own personality. — Phil Davis
SLIDE 26
CRVZ polynomials (Case A) in action on model problem
0.666666666666667
0.686274509803922
0.693693693693694
0.693176196711771
0.693145649808004
0.693147065004871
0.693147178638311
0.693147180715762
0.693147180574906
0.693147180560469
0.693147180559937

|s − s_{0,11}| = 8.2·10^{−15}. Compare with 17.93^{−11} = 1.6·10^{−14}.
SLIDE 27
Why optimize bounds?

The error estimate is
s − s_{k,n} = ∫_0^r t^k p_n(t) w(t)/(1 − t) dt
Can’t we do better than to optimize error bounds?
SLIDE 28
Acceleration by orthogonal polynomials

It’s just a moment integral, after all. Remember, r < 0, so (1 − t)^{−1} is harmless.

s − s_{k,n} = ∫_0^r t^k p_n(t) w(t)/(1 − t) dt

Why not rather use the shifted Legendre polynomials?
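As a sketch of this idea (my illustration, not code from the talk), the shifted Legendre polynomial P_n(2t + 1), normalized so that p_n(1) = 1, can be applied to the partial sums exactly as the Chebyshev polynomial was; `next_leg` is a hypothetical helper name.

```python
import math

def next_leg(m, p1, p0):
    """P_{m+1}(2t+1) = ((2m+1)(2t+1) P_m(2t+1) - m P_{m-1}(2t+1)) / (m+1),
    with polynomials as ascending coefficient lists."""
    out = [0.0] * (len(p1) + 1)
    for i, c in enumerate(p1):
        out[i] += (2 * m + 1) * c          # (2m+1)*1 part of (2m+1)(2t+1)
        out[i + 1] += 2 * (2 * m + 1) * c  # (2m+1)*2t part
    for i, c in enumerate(p0):
        out[i] -= m * c
    return [c / (m + 1) for c in out]

# Partial sums s_1..s_11 of the model problem.
s, t = [], 0.0
for k in range(1, 12):
    t += (-1)**(k + 1) / k
    s.append(t)

n = 10
p0, p1 = [1.0], [1.0, 2.0]                 # P_0 and P_1(2t+1)
for m in range(1, n):
    p0, p1 = p1, next_leg(m, p1, p0)
coeff = [c / sum(p1) for c in p1]          # normalize so that p_n(1) = 1
s_acc = sum(cj * s[j] for j, cj in enumerate(coeff))
err = abs(s_acc - math.log(2))             # the next slide reports 5.5e-16
```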
SLIDE 29
0.666666666666667
0.692307692307692
0.693121693121693
0.693146417445483
0.693147157853040
0.693147179886528
0.693147180540013
0.693147180559356
0.693147180559928
0.693147180559945
0.693147180559945

|s − s_{0,10}| = 5.5·10^{−16}. For CRVZ Case A we have |s − s_{0,10}| = 5.2·10^{−13}.
SLIDE 30
Maybe that was just lucky

Example:
η(β, r) = 1/β − r/(1 + β) + r²/(2 + β) − r³/(3 + β) + · · ·
with r = 0.94, β = 0.125. Plot the error for Legendre polynomials, scaled to [−r, 0]. Close to, but not as good as, Chebyshev polynomials scaled to [−1, 0].
SLIDE 31
(Figure: error plot for Legendre and Chebyshev polynomials on this example.)
SLIDE 32
Why were the Legendre polynomials so very good?

The first example had
a_k = (−1)^k/(k + 1) = ∫_{−1}^0 t^k dt.
Thus w(t) = 1. No surprise that ∫_{−1}^0 p_n(t)(1 − t)^{−1} w(t) dt is very small when p_n is orthogonal with respect to w!
SLIDE 33
Perfect polynomials for the second example

Notice that
(k + β)^{−1} = ∫_0^1 t^k t^{β−1} dt.
So a good choice for the acceleration polynomials p_n may be the Jacobi polynomials P_n^{(0,β−1)}, shifted to [0, −r].

This time, we plot Chebyshev, CRVZ Case A, and Jacobi polynomials, all shifted to [0, −r].
SLIDE 34
(Figure: error plot comparing Chebyshev, CRVZ Case A, and Jacobi polynomials.)
SLIDE 35
Very good, but we can do even better

Note that the semilog graph for the Jacobi polynomials is straight. This implies that the epsilon algorithm can accelerate the transformed sequence further.
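Wynn's epsilon algorithm can be sketched in a few lines. For simplicity this illustration (mine, not from the talk) applies it directly to the raw model series rather than to the Jacobi-accelerated sequence; the mechanism is the same.

```python
import math

def wynn_epsilon(s):
    """Wynn's epsilon algorithm; for odd len(s) returns the top entry
    of the last even column, epsilon_{len(s)-1}^{(0)}."""
    e_prev = [0.0] * (len(s) + 1)      # epsilon_{-1} column (all zeros)
    e_curr = list(s)                   # epsilon_0 column: the sequence itself
    for _ in range(len(s) - 1):
        e_next = [e_prev[k + 1] + 1.0 / (e_curr[k + 1] - e_curr[k])
                  for k in range(len(e_curr) - 1)]
        e_prev, e_curr = e_curr, e_next
    return e_curr[0]

# Partial sums s_1..s_11 of the model alternating series.
s, t = [], 0.0
for k in range(1, 12):
    t += (-1)**(k + 1) / k
    s.append(t)

approx = wynn_epsilon(s)   # far closer to log 2 than the raw s[-1]
```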
SLIDE 36
SLIDE 37
SLIDE 38
Summary of numerical results

Number of terms required to reach machine accuracy:

Original series       463
Chebyshev              21
CRVZ                   17
Jacobi(0, −0.875)      11
Jacobi + epsilon        9
SLIDE 39
Recursive calculation of the acceleration matrix

Let the polynomials p_n, standardized so that p_n(1) = 1, satisfy a three-term recursion of the form
c_n p_{n+1} = (x − a_n) p_n − b_n p_{n−1}.
Then
c_n s_{k,n+1} = s_{k+1,n} − a_n s_{k,n} − b_n s_{k,n−1}.

s_{k,n−1} → s_{k,n} → s_{k,n+1}
            s_{k+1,n} ↗
SLIDE 40
Or work with monic polynomials

Put p_n(x) = p̂_n(x)/p̂_n(1), where p̂_{n+1} = (x − â_n) p̂_n − b̂_n p̂_{n−1}.
- Start from ŝ_{k,0} = s_k and compute ŝ_{k,n} by the same recursion.
- Also apply the same recursion starting from ŵ_{k,0} = 1. Only k = 0 is required, since the columns are constant.
- Then s_{k,n} = ŝ_{k,n}/ŵ_{0,n}.
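The monic scheme above can be sketched for the Legendre case (my illustration, with an assumption: the monic recursion coefficients â_m = −1/2, b̂_m = m²/(4(4m² − 1)) are the standard Legendre ones mapped affinely to [−1, 0]). Multiplication by x becomes the shift k → k + 1, and the same recursion is run on the constant sequence 1 for the normalization.

```python
import math

# Partial sums s_1..s_11 of the model alternating series.
s, t = [], 0.0
for k in range(1, 12):
    t += (-1)**(k + 1) / k
    s.append(t)

# Monic Legendre shifted to [-1, 0]:
#   phat_{m+1}(t) = (t + 1/2) phat_m(t) - b_m phat_{m-1}(t),
#   b_m = m^2 / (4 (4 m^2 - 1)).
s_old, s_cur = [0.0] * (len(s) + 1), list(s)   # shat_{k,-1}, shat_{k,0}
w_old, w_cur = 0.0, 1.0                        # what_{0,-1}, what_{0,0}
for m in range(len(s) - 1):
    b = m * m / (4.0 * (4.0 * m * m - 1.0)) if m else 0.0
    # 'x * phat_m' turns into the shift s_cur[k] -> s_cur[k+1]
    s_new = [s_cur[k + 1] + 0.5 * s_cur[k] - b * s_old[k]
             for k in range(len(s_cur) - 1)]
    w_new = 1.5 * w_cur - b * w_old            # w-columns are constant in k
    s_old, s_cur = s_cur, s_new
    w_old, w_cur = w_cur, w_new

err = abs(s_cur[0] / w_cur - math.log(2))      # s_{0,10}, as on slide 29
```

The quotient reproduces the Legendre values of slide 29 (the first columns give 2/3 and 9/13) without ever forming polynomial coefficients explicitly.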
SLIDE 41
Conclusion

- Acceleration by orthogonal polynomials can be spectacularly fast.
- A lot of information about the terms of the series is needed to achieve that speed. (A solution to the Hausdorff moment problem was available!)
- This will never be a general-purpose method. It is a small specialized tool that may sometimes be useful.
- Future work: a priori error bounds?