The L-Curve - Regularization Methods - Discrete Inverse Problem

Regularization Methods

4.7 The L-Curve

and assume again that bêxact satisfies the model (4.13). Then, for a fixed value of k,

min discrete Picard condition is obvious: α must be larger than 1 in order to ensure that the TSVD and Tikhonov solutions produce similar results.

4.7 The L-Curve

In the previous sections we have used the SVD extensively to deﬁne and study the behavior of two important regularization methods. We continue our use of the SVD as an analysis tool, and we take a closer look at the solution norm and the corre-sponding residual norm. These norms play a central role in the practical treatment of discrete ill-posed problems, because they can always be computed no matter which regularization method is used; they do not require the computation of the SVD or any other decomposition of the matrix.

The norm of the TSVD solution and associated residual vary monotonically with k, as can be seen from the expressions

xk²2= the column space of A. Similarly, the norm of the Tikhonov solution and its residual are given by monotonically with λ, we introduce the notation

ξ =xλ²2 and ρ =A xλ− b²2.

Downloaded 07/26/14 to 129.107.136.153. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

72 Chapter 4. Computational Aspects: Regularization Methods Then we can show that

ξ≡ d ξ d λ =−4

n i =1

(1− ϕ^[λ]i )!

ϕ^[λ]_i "2(u_i^Tb)² σ_i² ,

ρ≡ d ρ d λ = 4

n i =1

(1− ϕ^[λ]i )²ϕ^[λ]_i (u_i^Tb)²=−λ²ξ.

We see that ξ< 0 and ρ> 0 for all λ, conﬁrming the monotonicity of the norms as functions of λ. Also, from the relation ρ=−λ²ξwe obtain d ξ/d ρ =−λ⁻², showing that that the squared solution normxλ²2 is a monotonically decreasing function of the squared residual normA xλ− b²2. Since the square root function is monotonic, the same is true for the norms themselves.

There is also a simple relationship between the second derivatives of ξ and ρ.

Speciﬁcally, we have

ρ≡ d²ρ d λ² = d

d λ

!−λ²ξ"

=−2 λ ξ− λ²ξ,

in which ξ≡ d²ξ/d λ². Consider now the curve obtained by plotting ξ versus ρ, with λ as the parameter. The curvature c_λ of this curve, as a function of λ (see any book on analytical geometry for its deﬁnition), is given by

cλ= ρξ− ρξ

((ρ)²+ (ξ)²)^3/2 = 2 λ (ξ)² ((ρ)²+ (ξ)²)^3/2,

where we have inserted the above relation for ρ. Hence, c_λ > 0 for all λ, and therefore the curve (ρ, ξ) is convex.

The curve is of interest because it shows how the regularized solution changes as the regularization parameter λ changes. We can say more about this curve; for example, it is easy to see that

0≤ xλ2≤ A⁻¹b2 and 0≤ A xλ− b2≤ b2. Also, since any point (ρ, ξ) on the curve is a solution to the problem

ρ = minA x − b²2 subject to x²2≤ ξ

in (4.12), it follows that the curve defines a boundary between two regions of the first quadrant: it is impossible to choose a vector x such that the point (A x −b²2,x²2) lies below the curve; only points on or above the curve can be attained. The same is, of course, true for the curve (A x − b2,x2). But when either of these curves is plotted in linear scale, it is difficult to inspect its features because of the large range of values for the two norms.

As shown in [40], the features become more pronounced (and easier to inspect) when the curve is plotted in double-logarithmic scale. The associated curve

(¹₂log ρ , ¹₂log ξ ) = ( logA xλ− b2, logxλ2)

Downloaded 07/26/14 to 129.107.136.153. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

4.7. The L-Curve 73

10⁻¹ 10⁰ 10¹

10⁰ 10¹ 10² 10³

Residual norm || A x

λ − b ||

Solution norm || x λ || 2

λ = 1 λ = 0.1

λ = 1e−4 λ = 1e−5

Figure 4.10. The L-curve for Tikhonov regularization: A log-log plot of the solution normxλ2 versus the residual normA xλ− b2 with λ as the parameter.

Notice the vertical and horizontal parts, corresponding to solutions that are under-and over-smoothed, respectively.

is referred to as the L-curve for Tikhonov regularization, and it plays a central role in the analysis of discrete ill-posed problems. Figure 4.10 shows an example of a typical L-curve.

We have already seen that the Tikhonov solution behaves quantitatively differ-ently for small and large values of λ. The same is true for the different parts of the L-curve. When λ is large, then x_λis dominated by SVD coefficients whose main con-tribution is from the exact right-hand side bêxact—the solution is oversmoothed. A careful analysis (see [34]) shows that for large values of λ we have that

xλ2≈ x^exact2(a constant) and A xλ− b2increases with λ.

For small values of λ the Tikhonov solution is dominated by the perturbation errors coming from the inverted noise—the solution is undersmoothed, and we have that

xλ2increases with λ⁻¹ and A xλ− b2≈ e2 (a constant).

The conclusion is that the L-curve has two distinctly diﬀerent parts, namely, a part that is approximately horizontal, and a part that is approximately vertical. We return to these two parts of the L-curve in Section 5.3.

The “corner” that separates these two parts is located roughly at the point ( loge2, logx^exact2).

Toward the right, for λ → ∞, the L-curve starts to bend down as the increasing amount of regularization forces the solution norm toward zero. Likewise, toward the

Downloaded 07/26/14 to 129.107.136.153. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

74 Chapter 4. Computational Aspects: Regularization Methods top left and above the vertical part, the L-curve will eventually become less steep as λ→ 0 and xλ2→ A⁻¹b2 (this part is not visible in Figure 4.10).

This curve is found in many books and papers (perhaps starting with the pi-oneering book by Lawson and Hanson [51]), and a careful analysis of many of its properties can be found in [30]. It can also be used to determine a suitable value of the regularization parameter (this use of the L-curve was not mentioned in [51]); we return to this aspect in Chapter 5.

4.8 When the Noise Is Not White—Regularization

In document Discrete Inverse Problem - Insight and Algorithms (Page 69-72)