Error estimates in linear systems with an application to - PowerPoint PPT Presentation

Error estimates in linear systems with an application to regularization Claude BREZINSKI University of Lille - France ➜ The norm of the error ➜ Extrapolation ➜ The formula for extrapolation ➜ Estimates of the norm of the error ➜ A generalization ➜ An application to regularization 1

T HE NORM OF THE ERROR Ax = b y an approximation of x and e = x − y . We want to obtain an estimation of � e � . We set r = b − Ay . We have Ae = r and the bounds � r � � A � ≤ � e � ≤ � A − 1 � · � r � . These bounds require the knowledge of � A � and � A − 1 � , But � A − 1 � is difficult to compute, and the lower bound can be quite a bad estimate of � e � . Estimates for the error in the conjugate gradient were given by Golub and Meurant. T HE NORM OF THE ERROR 2

E XTRAPOLATION We will now obtain estimates of � e � by an extrapolation method . We have c 0 = ( A 0 r , A 0 r ) A 0 A 0 0 + 0 = 0 c 1 = ( A 0 r , A 1 r ) A 0 A 1 0 + 1 = 1 c 2 = ( A 1 r , A 1 r ) A 1 A 1 1 + 1 = 2 c − 1 = ( A 0 r , A − 1 r ) A − 1 A 0 0 + ( − 1) = − 1 c − 2 = ( A − 1 r , A − 1 r ) A − 1 A − 1 ( − 1) + ( − 1) = − 2 ( A 0 r , A − 1 r ) = ( e , Ae ) = A-norm of the error c − 1 ( A − 1 r , A − 1 r ) = ( e , e ) = � e � 2 = norm of the error c − 2 E XTRAPOLATION 3

We will interpolate the points ( 0 , c 0 ) , ( 1 , c 1 ) and ( 2 , c 2 ) by some function and then .... extrapolate at the point − 2 . E XTRAPOLATION 4

W HAT IS EXTRAPOLATION ? W HAT IS EXTRAPOLATION ? 5

Which curve? Answer: a curve which mimics the exact behavior of the c i If the function mimics the behaviour of the c i , then its value at − 2 will be a good approximation of � e 2 � . For choosing the interpolation function, we have to analyze the behaviour of c 0 , c 1 and c 2 . So, let us now analyze this behavior. W HAT IS EXTRAPOLATION ? 6

We consider the Singular Value Decomposition ( SVD ) of the matrix A : U Σ V T A = U = [ u 1 , . . . , u p ] V = [ v 1 , . . . , v p ] where UU T = V V T = I , Σ = diag ( σ 1 , . . . , σ p ) with σ 1 ≥ σ 2 ≥ · · · ≥ σ p > 0 . W HAT IS EXTRAPOLATION ? 7

Let v be any vector . It holds p � A v = σ i ( v i , v ) u i i =1 p � A T v = σ i ( u i , v ) v i i =1 p � A − 1 v σ − 1 = ( u i , v ) v i . i i =1 W HAT IS EXTRAPOLATION ? 8

p � ( U T r, U T r ) = α 2 c 0 = ( r, r ) = i , α i = ( u i , r ) i =1 p � ( V T r, V T r ) = β 2 = i , β i = ( v i , r ) i =1 p � c 1 = ( r, Ar ) = σ i α i β i i =1 p � σ 2 i β 2 c 2 = ( Ar, Ar ) = i i =1 p � σ − 1 c − 1 = ( A − 1 r, r ) = ( e, Ae ) = α i β i i i =1 p � c − 2 = ( A − 1 r, A − 1 r ) σ − 2 α 2 = ( e, e ) = i . i i =1 W HAT IS EXTRAPOLATION ? 9

T HE FORMULA FOR EXTRAPOLATION The function we will use for extrapolation has to mimic as closely as possible the behavior of the c i . So, we will keep only the first term in each of the preceding formulae, that is we will look for α, β and σ satisfying the interpolation conditions α 2 = β 2 c 0 = c 1 = σαβ σ 2 β 2 . c 2 = and then extrapolate for the values − 1 and − 2 of the index. Thus, c − 1 and c − 2 will be approximated by c − 2 = � e � 2 ≃ σ − 2 α 2 . c − 1 ≃ σ − 1 αβ and T HE FORMULA FOR EXTRAPOLATION 10

E STIMATES OF THE NORM OF THE ERROR The preceding system has 3 unknowns and 4 equations which are not compatible. Thus, it has several solutions. For example, we get the following e 2 i which are approximations of � e � 2 e 2 c 4 1 /c 3 = 1 2 e 2 c 0 c 2 1 /c 2 = 2 2 e 2 c 2 = 0 /c 2 3 e 2 c 3 0 /c 2 = 4 1 e 2 c 4 0 c 2 /c 4 = 1 . 5 These estimates were numbered so that e 1 ≤ e 2 ≤ e 3 ≤ e 4 ≤ e 5 . E STIMATES OF THE NORM OF THE ERROR 11

M ORE ESTIMATES : More estimates can be obtained by replacing c 2 in all formulae above by p � c 2 = ( A T r, A T r ) = σ 2 i α 2 i , � i =1 and approximating it by σ 2 α 2 . Similar results and properties are obtained. They will be denoted by putting a � over the letters. It holds e 2 ν ≤ � e � 2 , � ∀ ν ≤ 3 . The estimate � e 3 was given by Auchmuty in 1992. E STIMATES OF THE NORM OF THE ERROR 12

N UMERICAL EXAMPLES : x = A − 1 b is the exact solution of the linear system. y is any approximate solution of it. So, our estimates apply either to a direct method or to an iterative method for the solution of a system of linear equations. They estimate both the rounding errors and the error of the method. E STIMATES OF THE NORM OF THE ERROR 13

BiCGSTAB: residual, error and estimate 10 9 10 6 10 3 10 0 10 -3 10 -6 0 10 20 30 40 50 60 70 80 90 100 Figure 1: BiCGSTAB for I+50*CIRCUL(100); cond ( A ) = 101 E STIMATES OF THE NORM OF THE ERROR 14

Figure 2: inv(I+50*circul(100)) κ = 101 . 0408 , � A � = 4 . 0016 · 10 − 4 , � A − 1 � = 2 . 5250 · 10 5 E STIMATES OF THE NORM OF THE ERROR 15

A GENERALIZATION : From here: joint work with G. Rodriguez and S. Seatzu (University of Cagliari, Italy). The five estimates e 1 , . . . , e 5 can be gathered into only one formula i = c i − 1 1 ) 3 − i c i − 4 e 2 ( c 2 , i = 1 , . . . , 5 . 0 2 Moreover , this formula is not only valid for i = 1 , . . . , 5 , but also for any real number ν , that is ν = c ν − 1 1 ) 3 − ν c ν − 4 e 2 ( c 2 , ν ∈ R . 0 2 E STIMATES OF THE NORM OF THE ERROR 16

P ROPERTIES : We have � c 6 � c 0 c 2 � ν � = ρ ν e 2 e 2 1 ν = · 0 . c 2 c 0 c 4 1 2 So, e 2 ν is an increasing function of ν in ( −∞ , + ∞ ) since ρ = ( c 0 c 2 ) / c 2 1 ≥ 1 . Therefore, it exists ν e such that e 2 ν e = � e � 2 . This ν e is given by the formula ν e = 2 ln( � e � /e 0 ) / ln ρ. E STIMATES OF THE NORM OF THE ERROR 17

A N APPLICATION TO REGULARIZATION When a system is ill-conditioned, its solution cannot be computed accurately. Tikhonov’s regularization consists in computing the vector x λ which minimizes the quadratic functional J ( λ, x ) = � Ax − b � 2 + λ 2 � Hx � 2 over all vectors x , where λ is a parameter, and H a given q × p ( q ≤ p ) matrix. This vector x λ is the solution of the system ( C + λ 2 E ) x λ = A T b, where C = A T A and E = H T H . A N APPLICATION TO REGULARIZATION 18

If λ is close to zero, then x λ is badly computed while, if λ is far away from zero, x λ is well computed but the error x − x λ is quite large. For decreasing values of λ , the norm of the error � x − x λ � first decreases, and then increases when λ approaches 0. Thus the error , which is the sum of the theoretical error and the error due to the computer’s arithmetic, passes through a minimum corresponding to the optimal choice of the regularization parameter λ . A N APPLICATION TO REGULARIZATION 19

Several methods have been proposed to obtain an effective choice of λ . The L – curve consists in plotting in log – log scale the values of � Hx λ � versus � r λ � . The resulting curve is typically L –shaped and the selected value of λ is the one corresponding to the corner of the L . However, there are many cases where the L –curve exhibits more than one corner, or no one at all. The Generalized Cross Validation ( GCV ) searches for the minimum of a function of λ which is a statistical estimate of the norm of the residual. Occasionally, the value of the parameter furnished by this method may be inaccurate because the function is rather flat near the minimum. A N APPLICATION TO REGULARIZATION 20

But each of these methods can fail. We are proposing another test based on the preceding estimates of the norm of the error . Warning : We don’t pretend that this new procedure never fails!!! A N APPLICATION TO REGULARIZATION 21

There are two questions that have to be answered: • Is x λ well computed? For answering this question, we propose the following test. Remember that the vector x λ is the solution of ( C + λ 2 E ) x λ = A T b, where C = A T A and E = H T H . Set r λ = b − Ax λ . Since A T r λ = λ 2 Ex λ , it holds λ 2 � Ex λ � � A T r λ � = 1 . So, it could be checked if this ratio is close to 1 for all λ . A N APPLICATION TO REGULARIZATION 22

• Is x λ a good approximation of x ? For this question, the preceding estimates could be used. c 2 = � A T r λ � 2 is badly However, due to the ill-conditioning, � computed when λ approches zero. So, again, we will replace A T r λ by λ 2 Ex λ in � A T r λ � and in ( r λ , Ar λ ) = ( A T r λ , r λ ) . A N APPLICATION TO REGULARIZATION 23

In order to find the best value of the parameter λ , we will now apply our estimates of the norm of the error to Tikhonov’s regularization method. Effecting the preceding substitutions, we finally obtain the error estimates ν = � r λ � 2 ν − 2 ( r λ , Ex λ ) 6 − 2 ν � Ex λ � 2 ν − 8 λ − 4 . e 2 � Contrarily to the more general estimates which are always valid, this new formula is only valid for Tikhonov’s regularization. So, it should lead to better numerical results. Testing the equality λ 2 � Ex λ � / � A T r λ � = 1 is also only valid for Tikhonov’s regularization. Let us remark that ( r λ , Ex λ ) = ( Hr λ , Hx λ ) which avoids computing the matrix E and, in several cases, leads to a more stable procedure. A N APPLICATION TO REGULARIZATION 24

Error estimates in linear systems with an application to - PowerPoint PPT Presentation

Error estimates in linear systems with an application to regularization Claude BREZINSKI University of Lille - France The norm of the error Extrapolation The formula for extrapolation Estimates of the norm of the error A

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Error estimates for the Galerkin finite element approximation for a linear second order hyperbolic

Error estimates of some non linear problems Vanessa Lleras Institut montpelli erain Alexander

ERROR DETECTON & CORRECTION Error Detection EDC= Error Detection and Correction bits

Human Error and Human Error Identification Techniques adapted from an IE 545 presentaton by

An Overview of Human Error Drawn f rom J . Reason, Human Error , Cambridge, 1990 Aaron Brown CS

Questions From Chapter 1 Figure 1.1: Testing life cycle Ch 12 Error vocabulary 1

Error Detection Codes Error Detection Two types Nave scheme Error Detection Codes

llvm::Error Rich Error Handling in LLVM Error Handling History LLVMs APIs historically

Error Codes Correcting Gary Lecture 11 toolkit CMU Preliminaries Setting Error of

Combining Estimates from Related Surveys via Bivariate Models (Application: using ACS estimates

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE

A posteriori error estimates Part II Complementary estimates Tom a s Vejchodsk y

Parametric Linear System Solving with Error Correction. Cleveland Waddell S YMBOLIC -N UMERIC C

Estimating the Error at Given Test Estimating the Error at Given Test Input Points for Linear

Hand Calculations Calculation of 0 , 1 , and SS E from the formulas is tedious; these

The Normal-Normal Model Alicia Johnson Associate Professor, Macalester College DataCamp

Laser Based H - Beam Diagnostics Yun Liu for Beam Instrumentation Team Research Accelerator

Hindley-Milner elaboration in applicative style Fran cois Pottier This pearl presents This

1 Benefits of exception handling Handling exceptions Generally, there are many ways to handle an

Exception Handling Genome 559 Review - classes Use your own classes to: - package together

Exceptions and Exception Handling 4003-232-07 (Winter 2007-2008) Week 3: Exceptions, Wrapper

61A Lecture 24 Friday, November 1 2 Heard on the Dread Pirate Lambda's Fibbonautical Voyage

Error estimates in linear systems with an application to - PowerPoint PPT Presentation

Error estimates in linear systems with an application to regularization Claude BREZINSKI University of Lille - France The norm of the error Extrapolation The formula for extrapolation Estimates of the norm of the error A

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Error estimates for the Galerkin finite element approximation for a linear second order hyperbolic

Error estimates of some non linear problems Vanessa Lleras Institut montpelli erain Alexander

ERROR DETECTON &amp; CORRECTION Error Detection EDC= Error Detection and Correction bits

Human Error and Human Error Identification Techniques adapted from an IE 545 presentaton by

An Overview of Human Error Drawn f rom J . Reason, Human Error , Cambridge, 1990 Aaron Brown CS

Questions From Chapter 1 Figure 1.1: Testing life cycle Ch 12 Error vocabulary 1

Error Detection Codes Error Detection Two types Nave scheme Error Detection Codes

llvm::Error Rich Error Handling in LLVM Error Handling History LLVMs APIs historically

Error Codes Correcting Gary Lecture 11 toolkit CMU Preliminaries Setting Error of

Combining Estimates from Related Surveys via Bivariate Models (Application: using ACS estimates

Graphics 2014 Linear Algebra II Linear Maps &amp; Matrices Linear Maps &amp; Matrices CORE

A posteriori error estimates Part II Complementary estimates Tom a s Vejchodsk y

Parametric Linear System Solving with Error Correction. Cleveland Waddell S YMBOLIC -N UMERIC C

Estimating the Error at Given Test Estimating the Error at Given Test Input Points for Linear

Hand Calculations Calculation of 0 , 1 , and SS E from the formulas is tedious; these

The Normal-Normal Model Alicia Johnson Associate Professor, Macalester College DataCamp

Laser Based H - Beam Diagnostics Yun Liu for Beam Instrumentation Team Research Accelerator

Hindley-Milner elaboration in applicative style Fran cois Pottier This pearl presents This

1 Benefits of exception handling Handling exceptions Generally, there are many ways to handle an

Exception Handling Genome 559 Review - classes Use your own classes to: - package together

Exceptions and Exception Handling 4003-232-07 (Winter 2007-2008) Week 3: Exceptions, Wrapper

61A Lecture 24 Friday, November 1 2 Heard on the Dread Pirate Lambda's Fibbonautical Voyage

ERROR DETECTON & CORRECTION Error Detection EDC= Error Detection and Correction bits

Graphics 2014 Linear Algebra II Linear Maps & Matrices Linear Maps & Matrices CORE