Why Curvature in L-Curve: Analysis of the . . . Additional - - PowerPoint PPT Presentation

why curvature in l curve
SMART_READER_LITE
LIVE PREVIEW

Why Curvature in L-Curve: Analysis of the . . . Additional - - PowerPoint PPT Presentation

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Why Curvature in L-Curve: Analysis of the . . . Additional Invariance . . . Combining Soft Constraints Main Result Acknowledgments Proof Anibal


slide-1
SLIDE 1

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 1 of 14 Go Back Full Screen Close Quit

Why Curvature in L-Curve: Combining Soft Constraints

Anibal Sosa, Martine Ceberio, and Vladik Kreinovich

Cyber-ShARE Center University of Texas at El Paso 500 W. University El Paso, TX 79968, USA usosaaguirre@miners.utep.edu mceberio@utep.edu, vladik@utep.edu

slide-2
SLIDE 2

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 2 of 14 Go Back Full Screen Close Quit

1. Inverse Problem: A Brief Reminder

  • In science & engineering, we are interested in the state
  • f the world, i.e., in the values of different quantities.
  • Some of these quantities we can directly measure, but

many quantities are difficult to measure directly.

  • For example, in geophysics, we are interested in the

density at different depths and different locations.

  • In principle, we can drill a borehole and directly mea-

sure these properties, but this is very expensive.

  • To find the values of such difficult-to-measure quanti-

ties q = (q1, . . . , qn), we: – measure auxiliary quantities a = (a1, . . . , am) re- lated to qi by a known dependence ai = fi(q1, . . . , qn), – and then reconstruct the values qj from these mea- surement results.

slide-3
SLIDE 3

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 3 of 14 Go Back Full Screen Close Quit

2. Enter Soft Constraints

  • Objective: describe the constraint that the values qj

are consistent with the observations ai.

  • Assumption: measurement errors ai−fi(q1, . . . , qn) are
  • indep. normal variables with 0 mean and same σ2.
  • Resulting constraint: s ≤ s0, where

s

def

=

m

  • i=1

(ai − fi(q1, . . . , qn))2.

  • Fact: for each s0, there is a certain probability that

this constraint will be violated.

  • Soft constraints: such constraints are called soft.
  • For convenience: this constraint is sometimes described

in a log scale, as x ≤ x0, where x

def

= ln(s).

slide-4
SLIDE 4

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 4 of 14 Go Back Full Screen Close Quit

3. Regularization

  • Methods for taking additional regularity constraints

into account are known as regularization methods.

  • Example: In geophysics, the density values at nearby

locations are usually close to each other: qj − qj′ ≈ 0.

  • Assumption: differences qj−qj′ are indep. and normally

distributed with 0 mean and the same σ2

d.

  • Resulting constraint: t ≤ t0, where t

def

=

(j,j′)

(qj − qj′)2.

  • In log scale: y ≤ y0, where y

def

= ln(t).

  • How to combine constraints: e.g., we can use the Max-

imum Likelihood method.

  • Result: we find the values qj that minimize the sum

s + λ · t, where λ depends on the variances σ2 and σ2

d.

slide-5
SLIDE 5

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 5 of 14 Go Back Full Screen Close Quit

4. How to Determine the Parameter λ

  • Fact: for each λ, we can find qj(λ), and, based on this

solution, compute x(λ) and y(λ).

  • Question: what value λ shall we choose?
  • Often: the curve (x(λ), y(λ)) has a turning point (is

L-shaped).

  • In this case: it is reasonable to select this turning point.
  • How to describe it: it is a point where the absolute

value |C| of the curvature C is the largest: C

def

= x′′ · y′ − y′′ · x′ ((x′)2 + (y′)2)3/2.

  • Fact: this approach often works well.
  • Natural question: explain why curvature works well.
  • What we do: we show that reasonable properties select

a class of functions that include curvature.

slide-6
SLIDE 6

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 6 of 14 Go Back Full Screen Close Quit

5. Analysis of the Problem: Scale-Invariance

  • The numerical values of each quantity depend on the

selection of a measuring unit a.

  • If we change a to a new measuring unit ca times smaller,

then ai and ai − fi(q1, . . . , qn) get multiplied by ca.

  • So, s =

n

  • i=1

(ai − fi(q1, . . . , qn))2 get multiplied by c2

a,

and x = ln(s) changes to x + ∆x, where ∆x

def

= ln(c2

a).

  • If we change a measuring unit q by a new one cq times

smaller, then qj and qj − qj′ get multiplied by c2

q.

  • Also t = (qj − qj′)2 , and y = ln(t) get multiplied by

c2

q, and y = ln(t) changes to y + ∆y

  • Under these changes x(λ) → x(λ) + ∆x and y(λ) →

y(λ) + ∆y, and the curvature does not change.

slide-7
SLIDE 7

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 7 of 14 Go Back Full Screen Close Quit

6. Additional Invariance and Our Main Idea

  • Instead of the original parameter λ, we can use a new

parameter µ for which λ = g(µ).

  • This re-scaling of a parameter does not change the

curve itself and thus, does not change its curvature.

  • Our idea: to describe all the functions which are in-

variant with respect to both types of re-scalings.

  • By a parameter selection criterion we mean a function

F(x, y, x′, y′, x′′, y′′) of six variables.

  • F is scale-invariant if for all values ∆x and ∆y,

F(x + ∆x, y + ∆y, x′, y′, x′′, y′′) = F(x, y, x′, y′, x′′, y′′);

  • F is invariant w.r.t. parameter re-scaling if for every

function g(z), for x(µ) = x(g(µ)), y(µ) = y(g(µ)), F( x, y, x′, y′, x′′, y′′) = F(x, y, x′, y′, x′′, y′′).

slide-8
SLIDE 8

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 8 of 14 Go Back Full Screen Close Quit

7. Main Result Main Result. A parameter selection criterion is scale- invariant and invariant w.r.t. parameter re-scaling if and

  • nly if it has the form

F(x, y, x′, y′, x′′, y′′) = f

  • C(x, y, x′, y′, x′′, y′′), y′

x′

  • for some function f(C, z), where

C

def

= x′′ · y′ − y′′ · x′ ((x′)2 + (y′)2)3/2.

  • Comment. Once a criterion is selected, for each problem,

we use the value λ for which the value F(x(λ), y(λ), x′(λ), y′(λ), x′′(λ), y′′(λ)) is the largest.

slide-9
SLIDE 9

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 9 of 14 Go Back Full Screen Close Quit

8. Acknowledgments This work was supported in part:

  • by the National Science Foundation grants HRD-0734825

and DUE-0926721,

  • by Grant 1 T36 GM078000-01 from the National Insti-

tutes of Health, and

  • by UTEP’s Computational Science program.
slide-10
SLIDE 10

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 10 of 14 Go Back Full Screen Close Quit

9. Proof

  • For each tuple (x, y, x′, y′, x′′, y′′), by taking ∆x = −x

and ∆y = −y, we conclude that F(x, y, x′, y′, x′′, y′′) = F(0, 0, x′, y′, x′′, y′′).

  • Thus, we conclude that

F(x, y, x′, y′, x′′, y′′) = F0(x′, y′, x′′, y′′), where we denoted F0(x′, y′, x′′, y′′)

def

= F(0, 0, x′, y′, x′′, y′′).

  • So, we conclude that the value of the parameter selec-

tion criterion does not depend on x and y at all.

  • In terms of the function F0, invariance w.r.t. parameter

re-scaling means that F0( x′, y′, x′′, y′′) = F0(x′, y′, x′′, y′′).

  • Re-scaling means that we go from the original function

x(λ) to the new function x(µ) = x(g(µ)).

slide-11
SLIDE 11

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 11 of 14 Go Back Full Screen Close Quit

10. Proof (cont-d)

  • In terms of the function F0, invariance w.r.t. parameter

re-scaling means that F0( x′, y′, x′′, y′′) = F0(x′, y′, x′′, y′′).

  • Re-scaling means that we go from the original function

x(λ) to the new function x(µ) = x(g(µ)).

  • The chain rule for differentiation leads to

x′ = x′ · g′ and thus, x′′ = x′′ · (g′)2 + x′ · g′′.

  • Similarly,

y′ = y′ · g′ and y′′ = y′′ · (g′)2 + y′ · g′′.

  • In particular, at the point where g′ = 1, we have

x′ = x,

  • x′′ = x′′ + x′ · g′′,

y′ = y′, and y′′ = y′′ + y′ · g′′.

  • Thus, invariance w.r.t.

parameter re-scaling means that F0(x′, y′, x′′ +x′ ·g′′, y′′ +y′ ·g′′) = F0(x′, y′, x′′, y′′).

  • In particular, for g′′ = −y′′

y′ , we have y′′+y′·g′′ = 0 and thus, F0(x′, y′, x′′, y′′) = F0

  • x′, y′, x′′ − x′ · y′′

y′ , 0

  • .
slide-12
SLIDE 12

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 12 of 14 Go Back Full Screen Close Quit

11. Proof (cont-d)

  • We proved: F0(x′, y′, x′′, y′′) = F0
  • x′, y′, x′′ − x′ · y′′

y′ , 0

  • .
  • One can check that x′′ − x′ · y′′

y′ = C · ((x′)2 + (y′)2)3/2 y′ .

  • Thus, F0(x′, y′, x′′, y′′) = h(C, x′, y′), where

h(C, x′, y′)

def

= F0

  • x′, y′, C · ((x′)2 + (y′)2)3/2

y′ , 0

  • .
  • The curvature C is invariant w.r.t. parameter re-scaling.
  • So, for h(C, x′, y′), invariance means that h(C,

x′, y′) = h(C, x′, y′), i.e., h(C, x′, y′) = h(C, x′ · g′, y′ · g′).

  • In particular, for g′ = 1

x′, we have x′ · g′ = 1 and thus, h(C, x′, y′) = h

  • C, 1, y′

x′

  • .
slide-13
SLIDE 13

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 13 of 14 Go Back Full Screen Close Quit

12. Proof (cont-d)

  • We have proven:

– that F(x, y, x′, y′, x′′, y′′) = F0(x′, y′, x′′, y′′), – that F0(x′, y′, x′′, y′′) = h(C, x′, y′), and – that h(C, x′, y′) = h

  • C, 1, y′

x′

  • .
  • Thus, we conclude that F(x, y, x′, y′, x′′, y′′) = h
  • C, 1, y′

x′

  • .
  • In other words, we get F(x, y, x′, y′, x′′, y′′) = f
  • C, y′

x′

  • for f(C, z)

def

= h(C, 1, z).

  • The main result is thus proven.
slide-14
SLIDE 14

Inverse Problem: A . . . Enter Soft Constraints Regularization How to Determine the . . . Analysis of the . . . Additional Invariance . . . Main Result Acknowledgments Proof Home Page Title Page ◭◭ ◮◮ ◭ ◮ Page 14 of 14 Go Back Full Screen Close Quit

References [1] Hansen, P.C. Analysis of discrete ill-posed problems by means of the L-curve, SIAM Review 34(4), 561–580 (1992) [2] Moorkamp, M., Jones, A.G., Fishwick, S.: Joint inver- sion of receiver functions, surface wave dispersion, and magnetotelluric data. Journal of Geophysical Research 115, B04318 (2010) [3] Rabinovich, S.: Measurement Errors and Uncertainties: Theory and Practice. American Institute of Physics, New York (2005) [4] Tikhonov, A.N., Arsenin, V.Y.: Solutions of Ill-Posed Problems, W.H. Whinston & Sons, Washington, D.C. (1977)