error estimates in linear systems with an application to
play

Error estimates in linear systems with an application to - PowerPoint PPT Presentation

Error estimates in linear systems with an application to regularization Claude BREZINSKI University of Lille - France The norm of the error Extrapolation The formula for extrapolation Estimates of the norm of the error A


  1. Error estimates in linear systems with an application to regularization Claude BREZINSKI University of Lille - France ➜ The norm of the error ➜ Extrapolation ➜ The formula for extrapolation ➜ Estimates of the norm of the error ➜ A generalization ➜ An application to regularization 1

  2. T HE NORM OF THE ERROR Ax = b y an approximation of x and e = x − y . We want to obtain an estimation of � e � . We set r = b − Ay . We have Ae = r and the bounds � r � � A � ≤ � e � ≤ � A − 1 � · � r � . These bounds require the knowledge of � A � and � A − 1 � , But � A − 1 � is difficult to compute, and the lower bound can be quite a bad estimate of � e � . Estimates for the error in the conjugate gradient were given by Golub and Meurant. T HE NORM OF THE ERROR 2

  3. E XTRAPOLATION We will now obtain estimates of � e � by an extrapolation method . We have c 0 = ( A 0 r , A 0 r ) A 0 A 0 0 + 0 = 0 c 1 = ( A 0 r , A 1 r ) A 0 A 1 0 + 1 = 1 c 2 = ( A 1 r , A 1 r ) A 1 A 1 1 + 1 = 2 c − 1 = ( A 0 r , A − 1 r ) A − 1 A 0 0 + ( − 1) = − 1 c − 2 = ( A − 1 r , A − 1 r ) A − 1 A − 1 ( − 1) + ( − 1) = − 2 ( A 0 r , A − 1 r ) = ( e , Ae ) = A-norm of the error c − 1 ( A − 1 r , A − 1 r ) = ( e , e ) = � e � 2 = norm of the error c − 2 E XTRAPOLATION 3

  4. We will interpolate the points ( 0 , c 0 ) , ( 1 , c 1 ) and ( 2 , c 2 ) by some function and then .... extrapolate at the point − 2 . E XTRAPOLATION 4

  5. W HAT IS EXTRAPOLATION ? W HAT IS EXTRAPOLATION ? 5

  6. Which curve? Answer: a curve which mimics the exact behavior of the c i If the function mimics the behaviour of the c i , then its value at − 2 will be a good approximation of � e 2 � . For choosing the interpolation function, we have to analyze the behaviour of c 0 , c 1 and c 2 . So, let us now analyze this behavior. W HAT IS EXTRAPOLATION ? 6

  7. We consider the Singular Value Decomposition ( SVD ) of the matrix A : U Σ V T A = U = [ u 1 , . . . , u p ] V = [ v 1 , . . . , v p ] where UU T = V V T = I , Σ = diag ( σ 1 , . . . , σ p ) with σ 1 ≥ σ 2 ≥ · · · ≥ σ p > 0 . W HAT IS EXTRAPOLATION ? 7

  8. Let v be any vector . It holds p � A v = σ i ( v i , v ) u i i =1 p � A T v = σ i ( u i , v ) v i i =1 p � A − 1 v σ − 1 = ( u i , v ) v i . i i =1 W HAT IS EXTRAPOLATION ? 8

  9. p � ( U T r, U T r ) = α 2 c 0 = ( r, r ) = i , α i = ( u i , r ) i =1 p � ( V T r, V T r ) = β 2 = i , β i = ( v i , r ) i =1 p � c 1 = ( r, Ar ) = σ i α i β i i =1 p � σ 2 i β 2 c 2 = ( Ar, Ar ) = i i =1 p � σ − 1 c − 1 = ( A − 1 r, r ) = ( e, Ae ) = α i β i i i =1 p � c − 2 = ( A − 1 r, A − 1 r ) σ − 2 α 2 = ( e, e ) = i . i i =1 W HAT IS EXTRAPOLATION ? 9

  10. T HE FORMULA FOR EXTRAPOLATION The function we will use for extrapolation has to mimic as closely as possible the behavior of the c i . So, we will keep only the first term in each of the preceding formulae, that is we will look for α, β and σ satisfying the interpolation conditions α 2 = β 2 c 0 = c 1 = σαβ σ 2 β 2 . c 2 = and then extrapolate for the values − 1 and − 2 of the index. Thus, c − 1 and c − 2 will be approximated by c − 2 = � e � 2 ≃ σ − 2 α 2 . c − 1 ≃ σ − 1 αβ and T HE FORMULA FOR EXTRAPOLATION 10

  11. E STIMATES OF THE NORM OF THE ERROR The preceding system has 3 unknowns and 4 equations which are not compatible. Thus, it has several solutions. For example, we get the following e 2 i which are approximations of � e � 2 e 2 c 4 1 /c 3 = 1 2 e 2 c 0 c 2 1 /c 2 = 2 2 e 2 c 2 = 0 /c 2 3 e 2 c 3 0 /c 2 = 4 1 e 2 c 4 0 c 2 /c 4 = 1 . 5 These estimates were numbered so that e 1 ≤ e 2 ≤ e 3 ≤ e 4 ≤ e 5 . E STIMATES OF THE NORM OF THE ERROR 11

  12. M ORE ESTIMATES : More estimates can be obtained by replacing c 2 in all formulae above by p � c 2 = ( A T r, A T r ) = σ 2 i α 2 i , � i =1 and approximating it by σ 2 α 2 . Similar results and properties are obtained. They will be denoted by putting a � over the letters. It holds e 2 ν ≤ � e � 2 , � ∀ ν ≤ 3 . The estimate � e 3 was given by Auchmuty in 1992. E STIMATES OF THE NORM OF THE ERROR 12

  13. N UMERICAL EXAMPLES : x = A − 1 b is the exact solution of the linear system. y is any approximate solution of it. So, our estimates apply either to a direct method or to an iterative method for the solution of a system of linear equations. They estimate both the rounding errors and the error of the method. E STIMATES OF THE NORM OF THE ERROR 13

  14. BiCGSTAB: residual, error and estimate 10 9 10 6 10 3 10 0 10 -3 10 -6 0 10 20 30 40 50 60 70 80 90 100 Figure 1: BiCGSTAB for I+50*CIRCUL(100); cond ( A ) = 101 E STIMATES OF THE NORM OF THE ERROR 14

  15. Figure 2: inv(I+50*circul(100)) κ = 101 . 0408 , � A � = 4 . 0016 · 10 − 4 , � A − 1 � = 2 . 5250 · 10 5 E STIMATES OF THE NORM OF THE ERROR 15

  16. A GENERALIZATION : From here: joint work with G. Rodriguez and S. Seatzu (University of Cagliari, Italy). The five estimates e 1 , . . . , e 5 can be gathered into only one formula i = c i − 1 1 ) 3 − i c i − 4 e 2 ( c 2 , i = 1 , . . . , 5 . 0 2 Moreover , this formula is not only valid for i = 1 , . . . , 5 , but also for any real number ν , that is ν = c ν − 1 1 ) 3 − ν c ν − 4 e 2 ( c 2 , ν ∈ R . 0 2 E STIMATES OF THE NORM OF THE ERROR 16

  17. P ROPERTIES : We have � c 6 � c 0 c 2 � ν � = ρ ν e 2 e 2 1 ν = · 0 . c 2 c 0 c 4 1 2 So, e 2 ν is an increasing function of ν in ( −∞ , + ∞ ) since ρ = ( c 0 c 2 ) / c 2 1 ≥ 1 . Therefore, it exists ν e such that e 2 ν e = � e � 2 . This ν e is given by the formula ν e = 2 ln( � e � /e 0 ) / ln ρ. E STIMATES OF THE NORM OF THE ERROR 17

  18. A N APPLICATION TO REGULARIZATION When a system is ill-conditioned, its solution cannot be computed accurately. Tikhonov’s regularization consists in computing the vector x λ which minimizes the quadratic functional J ( λ, x ) = � Ax − b � 2 + λ 2 � Hx � 2 over all vectors x , where λ is a parameter, and H a given q × p ( q ≤ p ) matrix. This vector x λ is the solution of the system ( C + λ 2 E ) x λ = A T b, where C = A T A and E = H T H . A N APPLICATION TO REGULARIZATION 18

  19. If λ is close to zero, then x λ is badly computed while, if λ is far away from zero, x λ is well computed but the error x − x λ is quite large. For decreasing values of λ , the norm of the error � x − x λ � first decreases, and then increases when λ approaches 0. Thus the error , which is the sum of the theoretical error and the error due to the computer’s arithmetic, passes through a minimum corresponding to the optimal choice of the regularization parameter λ . A N APPLICATION TO REGULARIZATION 19

  20. Several methods have been proposed to obtain an effective choice of λ . The L – curve consists in plotting in log – log scale the values of � Hx λ � versus � r λ � . The resulting curve is typically L –shaped and the selected value of λ is the one corresponding to the corner of the L . However, there are many cases where the L –curve exhibits more than one corner, or no one at all. The Generalized Cross Validation ( GCV ) searches for the minimum of a function of λ which is a statistical estimate of the norm of the residual. Occasionally, the value of the parameter furnished by this method may be inaccurate because the function is rather flat near the minimum. A N APPLICATION TO REGULARIZATION 20

  21. But each of these methods can fail. We are proposing another test based on the preceding estimates of the norm of the error . Warning : We don’t pretend that this new procedure never fails!!! A N APPLICATION TO REGULARIZATION 21

  22. There are two questions that have to be answered: • Is x λ well computed? For answering this question, we propose the following test. Remember that the vector x λ is the solution of ( C + λ 2 E ) x λ = A T b, where C = A T A and E = H T H . Set r λ = b − Ax λ . Since A T r λ = λ 2 Ex λ , it holds λ 2 � Ex λ � � A T r λ � = 1 . So, it could be checked if this ratio is close to 1 for all λ . A N APPLICATION TO REGULARIZATION 22

  23. • Is x λ a good approximation of x ? For this question, the preceding estimates could be used. c 2 = � A T r λ � 2 is badly However, due to the ill-conditioning, � computed when λ approches zero. So, again, we will replace A T r λ by λ 2 Ex λ in � A T r λ � and in ( r λ , Ar λ ) = ( A T r λ , r λ ) . A N APPLICATION TO REGULARIZATION 23

  24. In order to find the best value of the parameter λ , we will now apply our estimates of the norm of the error to Tikhonov’s regularization method. Effecting the preceding substitutions, we finally obtain the error estimates ν = � r λ � 2 ν − 2 ( r λ , Ex λ ) 6 − 2 ν � Ex λ � 2 ν − 8 λ − 4 . e 2 � Contrarily to the more general estimates which are always valid, this new formula is only valid for Tikhonov’s regularization. So, it should lead to better numerical results. Testing the equality λ 2 � Ex λ � / � A T r λ � = 1 is also only valid for Tikhonov’s regularization. Let us remark that ( r λ , Ex λ ) = ( Hr λ , Hx λ ) which avoids computing the matrix E and, in several cases, leads to a more stable procedure. A N APPLICATION TO REGULARIZATION 24

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend