• No results found

Optimization Of Compound Regularization Parameters Based On Stein'S Unbiased Risk Estimate

N/A
N/A
Protected

Academic year: 2021

Share "Optimization Of Compound Regularization Parameters Based On Stein'S Unbiased Risk Estimate"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

OPTIMIZATION OF COMPOUND REGULARIZATION PARAMETERS BASED ON

STEIN'S UNBIASED RISK ES TI MATE

Feng Xuea,

Hanjie Panb,

Runhui Wua,

Xin Liu

a

and Jiaqi Liu

a

aNational Key Laboratory of Science and Technology on Test Physics and Numerical Mathematics,

Beijing, 100076, China

b

Audiovisual Communications Laboratory, School of Computer and Communication Sciences,

Ecole Polytechnique Federale de Lausanne (EPFL), Switzerland

ABSTRACT

Recently, the type of compound regularizers has become a popular choice for signal reconstruction. The estimation quality is generally sensitive to the values of multiple regu-larization parameters. In this work, based on BDF algorithm, we develop a data-driven optimization scheme based on minimization of Stein's unbiased risk estimate (SURE)-statistically equivalent to mean squared error (MSE). We propose a recursive evaluation of SURE to monitor the MSE during BDF iteration; the optimal values of the multiple pa-rameters are then identified by the minimum SURE. Monte-Carlo simulation is applied to compute SURE for large-scale data. We exemplify the proposed method with image de-convolution. Numerical experiments show that the proposed method leads to highly accurate estimates of regularization parameters and nearly optimal restoration performance.

Index Terms- Stein's unbiased risk estimate (SURE), compound regularizers, regularization parameter, BDF al go-rithm, signal deconvolution

1. INTRODUCTION

Consider the standard linear inverse problem: find a good estimate of Xo E R N from the following observation model [1,2]:

y=Axo+E

(1)

where y E RM is the observed noisy data, A E RMxN is an

ob-servation matrix, E E RM is an additive Gaussian white noise

with known variance (T2 > O.

Regularization has been a standard technique for solving the inverse problem. Recently, people considered the regular-izer as a linear combination of "simple" regularregular-izers, i.e., the objective function is [3-5]:

(2)

v L(x)

where both 3l and 32 are simple regularizers, Al and ,12 are their respective regularization parameters.

This type of hybrid regularizers stems mainly from the following observation: it may be desired to encourage the solution to exhibit characteristics that are not easily en-forced/described by a single regularizer. In this paper, we choose BDF algorithm to solve (2) [6], since it provides a basic scheme for tackling the multiple regularizers, and that is easy to extend for other types of regularizer. The 'BDF' stands for the last names 'Bioucas-Dias' and 'Figueiredo' of both authors of [6].

For a pleasant reconstruction quality, it is essential to se-lect the proper values of multiple regularization parameters, to keep a good balance between data fidelity and cOlnpound regularizers. The choice of Al and ,12 is generally a difficuIt problem. There are two well-known general approaches capa-ble of selecting the parameters in non-linear inverse procapa-blems: maximum Iikelihood and cross validation [7]. However, both methods suffer from a problem of computational complexity. In this paper, we quantify the reconstruction performance by the mean squared error (MSE) [1,8]:

MSE =

~lE{11x - xoll~}

(3)

and attempt to select the values of Al and ,12, such that the

cor-responding solution

x

achieves minimum MSE. Notice that

MSE (3) is inaccessible due to the unknown Xo. In practice, Stein's unbiased risk estimate (SURE) has been proposed as a statistical substitute for MSE (if Ais full-rank matrix) [9,10]: SURE =

MIIx1I~

-

2yTA(AT A)- Ix +2(T2Tr(A(A TA)- I

JyW)

+

Ilxoll~)

(4)

since it depends on the observed data y onlyl. Here, JyW

E RNxN is a J acobian matrix defined as [11]:

[J

W] -

Oxn

y n.m - 8Ym

Recently, SURE has become a popular criterion for opti-rnization, in the context of non-linear denoising and decon-volution [1 , 8], and tl-based sparse reconstruction [11-13].

IThe last term of (4)-llxolli - is constant irrelevant to the optimization ofx.

(2)

However, there are very few literature on the application of SURE to the compound regularizers.

This paper is to optimize the regularization parameters AI

and ,12 for (2), based on minimization of SURE (4). Our main

contribution is to develop a recursive evaluation of SURE for BDF algorithm, wh ich finally provides a reliable estimate of the MSE for the non-linear reconstruction. The optimal AI and ,12 can then be identified by exhaustive search for mini-mum SURE. In addition, the Monte-Carlo method is applied for practical computation of large-scale data.

2. AN APPLICATION OF BDF ALGORITHM TO TV+lI MINIMIZATION

2.1. Basic scheme of BDF algorithm

The problem (2) is equivalent to the following:

min

~IIAx-yll~

+,11 'J"I(ZI)+A2 'J"2(Z2)

x

subject to Ilzl - DI

xll~

= 0; IIz2 -

D2XII~

= 0 wh ich, by Lagrangian, is equivalent to:

mJn

~IIAX

-

yll~

+ ,1.131 (ztl + h.T2(Z2) +

~I

Ilzl -

Dlxll~

+

J1;

IIz2 -

D2XII~

where f11 and f12 are the augmented Lagrangian penalty pa-rameters.

To minimize this functional W.r.l. x, ZI and Z2, BDF al go-rithm is to alternatively minimize w.r.t. these variables:

wh ich can be efficiently expressed and computed by Moreau's proximal operator for a number of typical regularizers of in-terest [14, 15].

2.2. Exemplification with wavelet-ll and TV regularizers To exemplify the iterative algorithm, we consider signal deconvolution problem, with both wavelet-ll and TV regu-larizers, i.e., 31 (Dlx) = liDlxiii and 32(X) = TV(x), where D1 denotes wavelet decomposition. Thus, the problem becomes:

~n ~IIAx-YII~

+Alilztill + A2TV(Z2) +

Jl211IDIX-ZIII~

+ Jl;

Ilx-Z211~

wh ich yields the following iteration:

1

xU) = B-1 (A Ty + JlIDT z;i- l) + Jl2Z~- I))

z;i) ='T,l, /I',(D1x(i))

z~) =argminZ2 ~IIZ2 -x(i) II~ + ~ . TV(Z2)

(5)

where B = AT A + f1IDTDI + f12I, 'TTO denotes the pointwise soft-thresholding with threshold T [11]. z~) can be efficiently

solved by Chambolle's algorithm [16].

For 2-D case, we consider the TV definition as TV(x) =

,\,N I(D(I)x) 12 + I(D(2)x) 12 + a where D(I) and D(2) denote

L..n= 1 2 n 2 n , 2 2 the first-order differences along horizontal and vertical direc-tions, 0: is a very small number (e.g. 10- 12) [17]. Such an

approximation simplifies numerical computations due to the differentiability of TV, and may help to avoid the staircasing effect in some cases [17] .

Chambolle's algorithm for solving z~) of (5) can be ex-pressed in matrix language as (iterate by j):

u(i·j+ l) = v(i,j)(u(i·j) - TJl2 D2 (x(i) + ,12

Diu(i,j)))

(6)

,12 Jl2

---where T is a step-size, D2 and diagonal matrix V(i·J) are

[D(

l)]

..

[VU.j)

D = 2 E rrt.2NxN . Vi .]) =

2 (2) , 0

D2

with diagonal V(i,j) E RNXN given by:

V(i·j) =

(1

+ TJl2 n.n ,12

Finally, z~) is obtained by the convergence ofChambolle's iteration (6): z~) = z~,oo) at j = 00 when converged.

3. RECURSIVE EVALUATION OF SURE FOR BDF

ALGORITHM 3.1. Recursive evaluation of SURE

From (4), the SURE for the i-th iterate is2:

SURE =

~(llx(i)II~

- 2yT A(A TA)- lx(i) + 2a.2Tr(A(A TA)- l Jy(x(i) ))) (7) The computation of SURE requires to compute Jy(x(i)), wh ich can be evaluated in a recursive manner, as shown later.

From (5), by the basic property of Jacobian, we have:

where pU) is a diagonal matrix with diagonal element:

if I(Dlx(i) nl ~ Al/Jll

otherwise

The recursion of Jy(z~)) has to be obtained by Chambolle's algorithm. We consider 2-D case only.

2In the remainder of this paper, the last constant term-Il xoll~-is ignored for brevity.

(3)

3.2. Recursion of Jy(z~)) for Chambolle's algorithm of (6) as u(i,j+ l) =

After derivations, we obtain3:

W (i,j)

,

l

V (i,)) 0

j

J (u(i,)+ l)) = .. J (u(i,))) _ TJ1. 2 .

y 0 V(l ,}) Y ,.12

where cii,j) and c~,j) are diagonal:

( (i) )2

[cii,))]

= am · vm,m ; [c~'))] =

n,n a~l + b~l + a n,n

( (i)

)2

bm , Vm,m

with a -D(I)z(i,j) and b - D(2)z(i,j) w(i,j) and w(i,j) are

diag-- 2 2 - 2 2 ' I 2

Onal' . [W(i,j)] I n,n - [wi,j] - I n and [W(i,j)] 2 n,n - [wi,j] - 2

Note that z(i,j) = xCi) + :2 DT u(i,j) we have:

2 J12 '

(10)

3.3. Summary of BDF algorithm with SURE evaluation Finally, we sununarize the proposed algorithm as Algo-rithm 1, which enables us to solve (2) with a prescribed val-ues of AI and A2, and simultaneously evaluate the SURE dur-ing the BDF iterations.

Algorithm 1: SURE evaluation for BDF algorithm for i = 1,2, ... (BDF iteration) do

1 update x (i), zii) and z~) by (5) and (6);

2 update Jy(x(i)), Jy(z;i)) and Jy(z~)) by (8), (9) and (10);

3 compute SURE of i-th iterate by (7); end

3.4. Monte-Carlo for practical computation

For 2-D case, due to the limited computational resources (e,g, RAM), it is impractical to store and cOlnpute the huge matrices A, DI and Jacobians. Monte-Carlo (MC) simulation provides an alternative way to compute the trace by the fol-lowing fact [8]:

Tr(A(AT A)- I Jy(x(i))) = lE(nJA(AT A)- I Jy(x(i))no) (11)

3The derivation of Jy( u(i,))) based on vector calculus is very complicated, we omit it here to save the page space.

with nO ~ N(O,IN)' From (8), we have:

n~i) "1 n~:;l) n~i2-1)

~~ ~ ~

{ Jy(x(i))no =B- IATno+J1IB- loT Jy(z;i-1))no+J12B- 1 Jy(z~- I))no Jy(z;i))no = P(i)OI Jy(x(i))no

' - v - " ' - v - "

(12) The n~; can be obtained by Chambolle's algorithm from n~):

n~/+ I) n~'t n~i})

~~ ~

{ J (y u(i,j+ 1 I))n = 0 y(i,j) J (u(i,j))n -Y 1 0,12 TJ1 2 Q(i,j) I J (z(i,j))n Y 2 0 J (u(i,j+I))n - y(i,j) J (u(i,j))n _ TJ1 2 QU,j) J (z(i,j))n (13)

y 2 0 - y 2 0,12 2 Y 2 0

~ ' - v - ' ' - v - "

and

(i,j+ l )

n"2 n (i,j) " 2

Jy(z~')))no = Jy(x(i))no + ,.12 (D~I))T Jy(u;i')))no + ,.12 (D~2)) TJy(u~')))no

~ ~ J12 ~ J12 ' - - . . . - '

" (i,j)

7.2 " (i,j)

",

D(i,}) " 2

(14)

where QU,j) = (W(U)C(i,j) + y(i,j))O(l) + W(i,j)CU,j)O(2) and Q(i,j) =

1 1 1 2 1 2 2 2

W(i,j) dU)o( I) + (W(i,j) C(i,j) + y(i,j))O(2)

2 I 2 2 2 2'

Thus, instead of using (8), (9) and (10), we can success-fully compute the trace by (11)-(14), without the explicit stor-age of huge matrices, summarized as folIows. The ftowchart is shown in Fig, 1.

Algorithm 2: MC for SURE ofBDF algorithm (2-D) for i = 1,2, ... (BDF iteration) do

1 update x(i), zii) and z~) by (5) and (6);

(i) (i) (i)

2 update nx , nZ1 and nZ2 by (12), (13) and (14);

3 cOlnpute the trace by (11);

4 compute SURE of i-th iterate by (7); end

input :known y and A: : initial z (O) z (O) ,

~ ______ L_'_ 2.. _1

Fig. 1. SURE-MC evaluation for BDF algorithm (Cham-bolle's algorithm is tor z~).

To find the optimal values of AI and A2, an intuitive idea is to repeatedly implement Alg. 1 for various tentative values of AI and A2, then, the minimum SURE indicates the optimal values (see FigA for example). This global search has been frequently used in [11-13].

(4)

4. EXPERIMENTAL RESULTS AND DISCUSSIONS

In this seetion, we exemplify the proposed algorithm with image deconvolution, by considering a test image Camerman,

blurred by a Gaussian kerne!. The noise level corresponds to blur-SNR4 of 30dB. For image processing, we have to use

Me to compute SURE as Aig. 2 and Fig.l, due to the large sizes of A, Dl and Jacobians.

First, we solve (2) with fixed values of Al and ,12. Fig.2 shows the BDF convergence and the evolution of SURE. We can see that the SURE is always a reliable substitute for MSE during the iterations.

(1) fixed AI = ,12 = 0.01 3xl0' - objeclivevalue -- - MSE - . - SURE lXl0,L'=:;:=::=;==:;:===~ w ~ ~ W 100 iterationnumber (2) fixed AI = ,12 = 1.00 2. 10' - objective value 80 --- MSE 2xl0' - . - SURE 40 2•10' 280 2'10'L~":':=::=;:=:;===:-j 10 15 iteration number

Fig. 2. The convergence of BDF algorithm with fixed values of ,I I

and ,12.

We repeatedly implement Aig. 2 to perform global opti-mization of Al or ,12, with fixed another, and show the results in Fig.3. Fig.4 shows the global optimization of AI and ,12, within the interval of [10-3,10°]. The optimal values of AI

and ,12 obtained by minimizing SURE are very elose to the

oraele results of minimum MSE.

(1) with fixed ,12 = 10-2 opt. ,I = 1.08 X 10-2 I

,

2 ~-~ ~_"'!"_ - - ""-- t-;':';';'-"" .(---lE-3 0.Q1 regularization parameter Al 260 (2) with fixed ,I I = 10-2 ~ MSE I SURE opt. ,I = 4.52 X 10-2 "

,

... .I' - - - ' , : ; - - - I _______ ~J __ . . . ~ . . 4 . . . I I I 0.1 180'E+-.3~~~0~.01~~j.-,: ~0.1~~~

reg ularization parameter A2

Fig. 3. The global optimization of AI or ,12, when fixing another.

4Blur signal-to-noise ratio (BSNR) is defined as:

101 oglO ( IIAXo-mean(AXo)II~ ). dB Mcr2 I n . '" ,,' 10" 10~ AI = 2.71 X 10-3 optimal ,12 = 1.93 X 10-2

Fig. 4. The global optimization of AI and ,12.

Fig.5 shows a visual comparison between SURE and MSE minimization. We can see that the SURE minimization yields the PSNR loss within 0.2dB, compared to the oraele optimal performance.

observed image

PSNR=22.42dB SURE optimized PSNR=25.30dB PSNR=25.47dB MSE o)Jtimized

Fig. 5. A visual example of Cameraman. 5. CONCLUSIONS

In this paper, we presented a SURE-based automatie method of tuning multiple regularization parameters for (TV +t I) compound regularizers, based on BDF algorithm [6]. Future work will deal with extension of this technique to handle other hybrid regularizers and the development of fast optimization algorithm instead of the time-consuming global search (shown as Fig.4).

6. ACKNOWLEDGMENTS

This work was supported by the National Natural Science Foun-dation of China under grant number 61401013.

7. REFERENCES

[l] F. Xue, F. Luisier, and T. Blu, "Multi-Wiener SURE-LET deconvo-lution," IEEE Transactions on Image Processing, vol. 22, no. 5, pp.

1954- 1968, 2013 .

[2] F. Xue and T. Blu, "A novel SURE-based criterion for parametric PSF estimation," IEEE Transactions on Image Processing, vol. 24, no. 2,

pp. 595- 607, 2015.

[3] M. Lustig, D. Donoho, and J. Pauly, "Sparse MRI : the application of compressed sensing for rapid MR imaging," Magnetic Resonance in Medicine, vol. 58, pp. 1182- 1195, 2007.

[4] J. Starck, M. Elad, and D. Donoho, "Image decomposition via the com-bination of sparse representations and a variational approach," IEEE Trans. Image. Proc. , vol. 14, no. 10, pp. 1570- 1582, 2005.

(5)

[5] You-Wei Wen, Michael K. Ng, and Wai-Ki Ching, "Iterative s lgorithms based on the decouple of deblurring and denoising for image restora-tion," SIAM Journal on Seientifie Computing, vol. 30, no. 5, pp. 2655-2674, 2008.

[6] J.M. Bioucas-Dias and M.A.T. Figueiredo, "An iterative algorithm for linear inverse problems with compound regularizers," in IEEE Int. Cont on Image Proe., 2008, pp. 685- 688.

[7] F. Bauer and M.A. Lukas, "Comparing parameter choice methods for regularization of iU-posed problem," Mathematies and Computers in Simulation, vol. 81 , no. 9, pp. 1795- 1841 , 2011.

[8] T. Blu and F. Luisier, "The SURE-LET approach to image denoising,"

IEEE Transaetions on Image Proeessing, vol. 16, no. 11, pp. 2778-2786, 2007.

[9] Charles M Stein, "Estimation of the mean of a multivariate normal distribution," The Annals ofStatisties, pp. 1135- 1151, 1981. [10]

yc.

Eldar, "Generalized SURE for exponential families: Applications

to regulari zation ," IEEE Transaetions on Signal Proeessing, vol. 57, no. 2, pp. 471-481 , 2009.

[11] F. Xue, Anatoly G. Yagola, lQ. Liu, and G. Meng, "Recursive SURE for iterative reweighted least square algorithms," Inverse Problems in Seienee and Engineering, vol. 24, no. 4, pp. 625- 646, 2016.

[12] R. Giryes, M. Elad, and

yc.

Eldar, "The projected GSURE for auto-matie parameter tuning in iterative shrinkage methods," Applied and Computational Harmonie Analysis, vol. 30, no. 3, pp. 407-422, 2010. [13] C. Vonesch, S. Ramani, and M. Unser, " Recursive risk estimation for

non-linear image deconvolution with a wavelet-domain sparsity con-straint," in IEEE Int. Cont on Image Proe., 2008, pp. 665- 668. [14] P.L. Combettes and VR. Wajs, "Signal recovery by proximal

forward-backward splitting," Multiseale Modeling and Simulation, vol. 4, no. 4,

pp. 1168- 1200, 2005.

[15] N. Parikh and S. S. Boyd, " Proximal algorithms," Foundations and Trends in Optimization, vol. 1, no. 3, pp. 123- 231,2013.

[16] A. ChamboUe, "An algorithm for total variation minimization and ap-plieations," Journal of Mathematieallmaging and Vision , vol. 20, pp. 89- 97, 2004.

[17] S. Osher, M. Burger, D. Goldfarb, J. Xu, and w.T. Yin, "An itera-tive regularization method for total variation-based image restoration,"

References

Related documents

Chemical Composition of Microcline from the Kingman, Rare Metals and Wagon Bow Pegmatites continued ...138... Chemical Composition of Plagioclase from the Kingman,

The Strategies to Develop Bilingual Ability for Vietnamese. International Journal of Language and Linguistics. Therefore, no one can deny that it is not easy to study a language

To do so, we com- bine a present-value model with an unobserved component model to write down the observed excess-consumption asset ratio as a linear function of unobserved

“The Study and Treatment of Chinese Dry Lacquer Sculpture.” Paper, The 17th International Symposium on the Conservation and Restoration of Cultural Property, Tokyo National Research

Aim of the present study was to explore the influence of demographic and disease related factors on parental HRQoL, mediated by employment, income, leisure time, holiday and

All European medical schools should ensure that a core curriculum of radiology is delivered to their undergraduates.. A suitable framework for this core

The histological effect of radiation and biological agents was assessed using biopsy samples containing tumor and adjacent tissue that were taken from treated mice 24 days

When TCDD and quercetin given together, quercetin reduced lipid peroxidation induced by TCDD treatment via decreasing the levels of MDA in testicular tissue of TCDD-treated