A primal dual algorithm framework for convex saddle point optimization

(1)

R E S E A R C H

Open Access

A primal-dual algorithm framework for

convex saddle-point optimization

Benxin Zhang

1

_{and Zhibin Zhu}

2*

*_{Correspondence:}

[email protected] 2_{School of Mathematics and}

Computing Science, Guangxi Colleges and Universities Key Laboratory of Data Analysis and Computation, Guilin University of Electronic Technology, Jinji Road, Guilin, 541004, China

Full list of author information is available at the end of the article

Abstract

In this study, we introduce a primal-dual prediction-correction algorithm framework for convex optimization problems with known saddle-point structure. Our unified frame adds the proximal term with a positive definite weighting matrix. Moreover, different proximal parameters in the frame can derive some existing well-known algorithms and yield a class of new primal-dual schemes. We prove the convergence of the proposed frame from the perspective of proximal point algorithm-like contraction methods and variational inequalities approach. The convergence rate O(1/t) in the ergodic and nonergodic senses is also given, wheretdenotes the iteration number.

Keywords: primal-dual method; proximal point algorithm; convex optimization; variational inequalities

1 Introduction

We consider the following model that arises from various signal and image processing applications:

min

x f(Bx) +f(x), ()

whereB is a continuous linear operator, and f and f are proper convex

lower-semi-continuous functions. We can easily write problem () in its primal-dual formulation through Fenchel duality []:

min

x∈Xmaxv∈V L(x,v) :=f(x) +Bx,v–f ∗

(v), ()

whereX∈RN _and_V_∈_RM _{are two ﬁnite-dimensional vector spaces, and}_f∗

 is the convex

conjugate function offdeﬁned as

f_∗(v) = sup

ω∈RM

v,ω–f(ω).

As analyzed in [, ], the saddle-point problem () can be regarded as the primal-dual formulation, and more and more scholars have proposed some primal-dual algorithms. Zhu and Chan [] proposed the famous primal-dual hybrid gradient (PDHG) algorithm

(2)

with adaptive stepsize. Though the algorithm is quite fast, the convergence is not proved. He, You, and Yuan [] showed that PDHG with constant step sizes is indeed convergent if one of the functions of the saddle-point problem is strongly convex. Chambolle and Pock [] gave a primal-dual algorithm with convergence rateO(/k) for the complete class of these problems. They further showed accelerations of the proposed algorithm to yield improved rates on problems with some degree of smoothness. In particular, they showed that the algorithm can achieve theO(/k_{) convergence in problems where the primal}

or the dual objective is uniformly convex, and the method can show linear convergence, that is,O(ςk) (for someς∈(, )), on smooth problems. Bonettini and Ruggiero [] es-tablished the convergence of a general primal-dual method for nonsmooth convex opti-mization problems and showed that the convergence of the scheme can be considered as an-subgradient method on the primal formulation of the variational problem when the steplength parameters are a priori selected sequences. He and Yuan [] did a novel study on these primal-dual algorithms from the perspective of contraction perspective. Their method simplified the existing convergence analysis. Cai, Han, and Xu [] proposed a new correction strategy for some first-order primal-dual algorithms. Later, He, Desai, and Wang [] introduced another new primal-dual prediction-correction algorithm for solving a saddle-point optimization problem, which serves as a bridge between the algo-rithms proposed in [] and []. Recently, Zhang, Zhu, and Wang [] proposed a simple primal-dual method for total-variation image restoration problems and showed that their iterative scheme has theO(/k) convergence rate in the ergodic sense. When we had fin-ished this paper, we found the algorithm proposed in [], where convergence analysis was similar to our proposed frame. However, the algorithm proposed in [] is actually a particular case of our unified framework when the precondition matrix in our frame is fixed.

More speciﬁcally, the iterative schemes of existing primal-dual algorithms for the prob-lem () can be uniﬁed as the following procedure:

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

vk+=arg minv∈RM–L(x_k,v) + 

γv–vk  ,

yk+=vk++θ(vk+–vk),

xk+=arg minx∈RNL(x,y_k_+) + 

τx–xk  ,

()

whereγ,τ >  andθ∈R. The combination parameterθis set to zero in the original PDHG algorithm. Whenθ∈[, ], the primal-dual algorithm proposed in [] was recovered. He and Yuan [] showed that the range of the combination parameterθ can be enlarged to [–, ]. Komodakis and Pesquet [] recently wrote a wonderful overview of recent primal-dual method for solving large-scale optimization problems. So, we refer the reader to [] for more details.

(3)

The organization of this paper is as follows. In Section , we propose the primal-dual-based contraction algorithm framework in prediction-correction fashion. In Section , we present convergence analysis. The iteration complexity in the ergodic and nonergodic senses is established in Sections  and . In Section , connections with well-known meth-ods, and some new schemes are discussed. Finally, a conclusion is given.

2 Proposed frame

Problem () can be reformulated as the following monotone variational inequality (VI): Find (x∗,v∗)∈X×Vsuch that

v–v∗ x–x∗

T

∂f_∗(v∗) –Bx∗

∂f(x∗) +BTv∗

≥, ∀(x,v)∈X×V, ()

where∂denotes the subdiﬀerential operator of a convex function. By denoting

u=

v x

, F(u) =

∂f_∗(v) –Bx

∂f(x) +BTv

, =X×V,

the VI () can be written as follows (denotedVI( ,F)):

u–u∗ TFu∗ ≥, =X×V.

Note that the monotonicity of the variational inequality is guaranteed by the convexity of the function∂f_∗and∂f.

Recall that the primal-dual algorithm for () presented in [] (θ= ) is

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

vk+=arg minv∈RM–L(xk,v) + 

γv–vk  , yk+= vk+–vk,

τx–xk  .

()

We can easily verify that the iteration (vk+,xk+) generated by () can be characterized as

follows:

v–vk+ x–xk+

T

∂f_∗(vk+) –Bxk+ ∂f(xk+) +BTvk+

+



γI B

BT _τI

vk+–vk

xk+–xk

≥. ()

The convergence of iteration () was proved in [] with the condition on the stepsize

γ τBT_B_{< . Motivated by the idea in [], the scheme () can be considered as a} pre-diction step. So, in the following, we propose a primal-dual-based contraction method for problem (). To present new methods in the prediction-correction fashion, we denote the iterationu˜k= (v˜k,x˜k) generated by the following primal-dual procedure (), where the prediction step can be redescribed as

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

˜

vk=arg minv∈RM–L(xk,v) + 

γv–vk  ,

˜

yk= ˜vk–vk,

˜

xk=arg minx∈RNL(x,˜yk) + 

τx–xk 

P,

(4)

wherePis a positive deﬁnite matrix to be selected properly in diﬀerent applications. Then, the new iteration is yielded by correctingu˜kvia

uk+=uk–ρ(uk–u˜k), ()

where  <ρ< . Similarly to (), the predictor scheme () can also be written in the VI form as follows:

v–v˜k

x–x˜k

T

∂f∗(v˜k) –Bx˜k

∂f(x˜k) +BTv˜k

+



γI B

BT  τP

˜

vk–vk

˜

xk–xk

≥. ()

Setting

Q=



γI B

BT  τP

()

and using the notation in (), we have the following compact form of ():

(u–u˜k)TF(u˜k) +Q(u˜k–uk)

≥, ∀u∈ . ()

So, we can prove the convergence of the proposed algorithm in the form of proximal point algorithm [, ]. Next, we use this idea to prove that the scheme ()-() converges.

3 Convergence analysis

In this section, we show the convergence of the proposed frame. Convergence results easily follow from proximal point algorithm-like contraction methods [] and VI approach [].

Lemma  Let B be the given operator,letγ,τ > ,and let Q be deﬁned by().Then Q is positive deﬁnite if

γ τBTB<p,

where p> is the minimal eigenvalue of P.

Proof For any nonzero vectorssandt, we have

sT,tT



γI B

BT _τP s t

=s



γ +

t_P

τ + s

T_Bt

≥s

γ +

pt

τ – 

BT_B_s_t

≥

p

γ τ –

BT_B

st,

where we used the Cauchy-Schwarz inequality. The proof is completed.

(5)

Lemma  For iteration sequences{uk}and{˜uk},we have

uk–u∗ T

Q(uk–u˜k)≥(uk–u˜k)TQ(uk–u˜k), ∀u∗∈ . ()

Proof Since () holds for anyu∈ , we setu=u∗, whereu∗is an arbitrary solution, and obtain

u∗–u˜k T

F(u˜k) +Q(u˜k–uk)

≥. ()

Thus () leads to

u∗–u˜k T

Q(u˜k–uk)≥

˜

uk–u∗ T

F(u˜k). ()

Note that the mappingF(u) is monotone. We thus have

˜

uk–u∗ T

F(u˜k) –Fu∗ ≥ ()

and also

˜

uk–u∗ T

F(u˜k)≥u˜k–u∗ T

Fu∗ ≥.

Replacingu∗–u˜kby (u∗–uk) + (uk–u˜k) in () and using (), we get the assertion. Lemma  The sequence{uk}generated by the proposed scheme()-()satisﬁes

uk+–u∗_Q≤uk–u∗_Q–ρ( –ρ)uk–u˜kQ, ∀u∗∈ . ()

Proof Using () and (), by a simple manipulation we obtain

uk+–u∗_Q=uk–u∗–ρ(uk–u˜k)

=uk–u∗



Q– ρ

uk–u∗ T

Q(uk–u˜k) +ρuk–u˜kQ

≤uk–u∗



Q– ρuk–u˜k



Q+ρuk–u˜kQ =uk–u∗



Q–ρ( –ρ)uk–u˜k



Q.

The assertion is proved.

The following theorem states that the proposed iterative scheme converges to an optimal primal-dual solution.

Theorem  If Q in()is positive deﬁnite,then any sequence generated by the scheme

()-()converges to a solution of the minimax problem().

Proof From () we know that the normuk–u∗Qis nonincreasing. We also can get that

(6)

least one cluster point. We denote it byu∞. Let{ukj}be a subsequence converging tou∞.

Thus we have

lim

j→∞ukj–u˜kj= . ()

Due to the facts () and (), we have

lim j→∞

u–ukj T_F₍_u

kj)≥, ∀u∈ .

Because{ukj}converges tou∞, this inequality becomes

u–u∞ TFu∞ ≥, ∀u∈ .

Thus, the cluster pointu∞ satisﬁes the optimality condition of (). Note that inequality () is true for all solution points ofVI( ,F). Hence we have

uk+–u∞≤uk–u∞,

and thus the sequenceukconverges tou∞. This proof is completed.

4 Convergence rate in an ergodic sense

In the following, using proximal point algorithm-like contraction methods for convex op-timization [], the convergence rate in the ergodic and nonergodic senses is given. First, we prove a lemma, which is the base for the proofs of the convergence rate in the ergodic sense.

Lemma  The sequence{uk}is generated by the proposed scheme()-().Then we have

(u–u˜k)TF(u) + 

ρu–uk 

Q≥ 

ρu–uk+ 

Q, ∀u∈ . ()

Proof Using (), the right-hand side of () can be written as

ρ(u–u˜k)TF(u˜k)≥(u–u˜k)TQ(uk–uk+), ∀u∈ . ()

For the right-hand side of (), taking

a=u, b=u˜k, c=uk, d=uk+,

and applying the identity

(a–b)TQ(c–d) =  

a–d_Q–a–c_Q+ 

c–b_Q–d–b_Q,

we obtain

(u–u˜k)TQ(uk–uk+) =

 

u–uk+Q–u–ukQ

+ 

uk–u˜kQ–uk+–u˜kQ

(7)

For the last term of the right-hand side of (), we have

uk–u˜kQ–uk+–u˜kQ

=uk–u˜kQ–uk–u˜k– (uk–uk+) 

Q

=uk–u˜kQ–( –ρ)(uk–u˜k)



Q

=ρ( –ρ)uk–u˜kQ. ()

Substituting () and () into (), we get

ρ(u–u˜k)TF(u˜k)≥  

u–uk+Q–u–ukQ +

ρ( –ρ)

 uk–u˜k



Q. ()

Using the property of the mappingF, we have

(u–u˜k)TF(u)≥(u–u˜k)TF(u˜k).

Substituting it into (), the lemma is proved.

Theorem  Let{uk}be the sequence generated by the scheme()-(),and letu˜tbe deﬁned

by

˜

ut= 

t+  t

k=

˜

uk. ()

Then,for any integer t> ,we have thatu˜t∈ and

(u˜t–u)TF(u)≤ 

ρ(t+ )u–u



Q, ∀u∈ . ()

Proof By the convexity of it is clear thatu˜t∈ . Summing () overk= , , . . . ,t, we have

(t+ )u– t

k=

˜

uk

T

F(u) + 

ρu–u 

Q≥, ∀u∈ .

By the deﬁnition ofu˜t, the assertion of the theorem directly follows.

5 Convergence rate in a nonergodic sense

In this section, we show that a worst-caseO(/t) convergence rate in a nonergodic sense can also be established for the proposed algorithm frame. We ﬁrst prove the following lemma.

Lemma  Let the sequence{uk}be generated by the proposed scheme()-().Then we have

(uk–u˜k)TQ

uk–u˜k– (uk+–u˜k+)

≥ 

ρuk–u˜k– (uk+–u˜k+) 

(8)

Proof Settingu=u˜k+in (), we get

(u˜k+–u˜k)TF(u˜k)≥(u˜k+–u˜k)TQ(uk–u˜k). () Note that () is also true fork:=k+ , and we have

(u–u˜k+)TF(u˜k+)≥(u–u˜k+)TQ(uk+–u˜k+), ∀u∈ .

Settingu=u˜kin this inequality, we obtain

(u˜k–u˜k+)TF(u˜k+)≥(u˜k–u˜k+)TQ(uk+–u˜k+). ()

Adding () and () and using the monotonicity ofF, we obtain

(u˜k–u˜k+)TQ

(uk–u˜k) – (uk+–u˜k+)

≥. ()

Adding the term

(uk–u˜k) – (uk+–u˜k+)

T

Q(uk–u˜k) – (uk+–u˜k+)

to both sides of (), we have

(uk–uk+)TQ

(uk–u˜k) – (uk+–u˜k+)

≥

uk–u˜k– (uk+–u˜k+)



QT₊_Q.

Substitutinguk–uk+=ρ(uk–u˜k) into the left-hand side of the inequality, we obtain the

lemma.

Next, we are ready to prove the key inequality of this section.

Lemma  Let the sequence{uk}be generated by the proposed scheme()-().Then we have

uk+–u˜k+Q≤ uk–u˜kQ. ()

Proof Takinga=uk–u˜k,b=uk+–u˜k+in the identity

a_Q–b_Q= aTQ(a–b) –a–b_Q,

we have

uk–u˜kQ–uk+–u˜k+Q = (uk–u˜k)TQ

(uk–u˜k) – (uk+–u˜k+) –(uk–u˜k) – (uk+–u˜k+) 

Q.

Since inequality () holds, we obtain

uk–u˜kQ–uk+–u˜k+Q

≥ –ρ

ρ (uk–u˜k) – (uk+–u˜k+) 

Q≥.

(9)

Now, we establish a worst-caseO(/t) convergence rate in a nonergodic sense.

Theorem  Let{uk}be the sequence generated by the scheme()-().Then,for any integer

t> ,we have

ut–u˜tQ≤ 

ρ( –ρ)(t+ )u–u ∗

Q, ∀u∈ . ()

Proof It follows from () that

∞

k=

ρ( –ρ)uk–u˜kQ≤u–u∗ 

Q, ∀u

∗_∈ _. _()

By Lemma  the sequence{uk–u˜kQ}is nonincreasing. So, we obtain

(t+ )uk–u˜kQ≤ t

k=

uk–u˜kQ. ()

Assertion () immediately follows from () and ().

6 Connections with existing methods

In this section, we focus on a speciﬁc version of problem (),

min x f(Bx) +



Ax–b

_, _()

which arises in imaging processing, where f(x) = _Ax–b is quadratic. For discrete

total-variation regularization, Bis the gradient operator, and Ais a possibly large and ill-conditioned matrix representing a linear transform. If Ais the identity matrix, then problem () is the well-known Rudin-Osher-Fatemi denoising model []. Because total-variation regularization can preserve sharp discontinuities in an image for removing noise, the above problem has received a lot of attention by most scholars in image processing, including computerized tomography [] and parallel magnetic resonance imaging [].

(10)

6.1 Linearized primal-dual method

The linearized primal-dual method in [] can be directly induced by settingP=I–τAT_A and

Q=



γI B

BT  τI–A

T_A

. ()

We can also easily show that the positive deﬁniteness of the matrices P andQin () is guaranteed ifγ > ,  <τ < /AT_A_{,  <}_τ _{< /}_AT_A₊_γ_BT_B_{. In this situation, the} scheme () can be written as follows:

⎧ ⎨ ⎩

˜

vk= (I+γ ∂f∗)–(vk+γBxk),

˜

xk=xk–τ∇f(xk) –τBT(v˜k–vk),

()

and the scheme () can be expressed as

⎧ ⎨ ⎩

xk+=xk–ρ(xk–x˜k),

vk+=vk–ρ(vk–v˜k).

()

The idea is also similar to that of [, ], which uses the symmetric positive semi-deﬁnite matrix instead of the identity matrix in the proximal term. But their methods [, ] do not have overrelaxation or correction step. In [, ], the authors developed ﬁrst-order splitting algorithm for solving jointly the primal and dual formulations of large-scale convex minimization problems involving the sum of a smooth function with Lipschitzian gradient, a nonsmooth proximable function, and linear composite functions. Actually, the linearized primal-dual method () and () is a particular case where a nonsmooth prox-imable function is missing in [, ].

Whenρ= , we can see that there is no correction step, that is, (xk+,vk+) = (x˜k,v˜k).

In the following subsection, we focus on the scheme ()-() with diﬀerentPandQwhen

ρ= , that is,

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

vk+=arg minv∈RM–L(x_k,v) + 

γv–vk  ,

yk+= vk+–vk,

τx–xk 

P.

()

IfP=I, the the CP method is a particular case of () as discussed in []. We also ﬁnd that diﬀerentPin () can induce some existing famous algorithms.

6.2 Split inexact Uzawa method

Forf(x) = _Ax–b, the explicit SIU algorithm can be described as follows:

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+=xk–τAT(Axk–b) –τ γBT(Bxk–dk+vγk),

dk+=prox

γf(Bxk++

vk γ),

vk+=vk+γ(Bxk+–dk+),

(11)

whereγ > ,  <τ< /AT_A₊_γ_BT_B_{, and}

prox

γ f(v) =arg minf(v) +  γv–z



, z∈V.

LetP=I–τAT_A_{in (). Then}

Q=



γI B

BT _τI–ATA

,

whereγ > ,  <τ < /AT_A_{,  <}_τ_{< /}_AT_A₊_γ_BT_B_{. So, the scheme () can be} ex-pressed as

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

vk+= (I+γ ∂f∗)–(vk+γBxk),

yk+= vk+–vk,

xk+=xk–τ∇f(xk) –τBTyk+.

()

Using the relationprox_γ_f∗

 = (I+γ ∂f

∗

)–and changing the order of these equations, the

scheme () is equivalent to

⎧ ⎨ ⎩

xk+=xk–τAT(Axk–b) –τBT(vk–vk–),

vk+=proxγf_∗(vk+γBxk+).

()

By the Moreau decomposition (see equation (.) in []), for allv∈RM _and_λ_{> , we} have

v=prox_λ_f(v) +λprox

λf∗(v/λ).

Then

⎧ ⎨ ⎩

vk+=vk+γBxk+–γprox

γf(Bxk++

vk γ).

()

By introducing the variabledk+=prox

γf(Bxk++

vk

γ), the scheme () can be further

ex-pressed as

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

dk+=prox

γf(Bxk++

vk γ),

vk+=vk+γ(Bxk+–dk+).

()

Noting thatvk=vk–+γ(Bxk–dk), we have

(12)

Substituting () into the ﬁrst equation of (), the scheme () is equivalent to ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+=xk–τAT(Axk–b) –τ γBT(Bxk–dk+v_γk),

dk+=prox

γf(Bxk++

vk γ),

()

We can see that the method () is equivalent to the SIU. Obviously, the explicit SIU method is a particular case of the proposed frame withP=I–τAT_A_and_ρ_{= . If}_ρ_{= ,} then a linearized primal-dual method is presented in Section .. So, the algorithm in [] can be considered as a relaxed SIU method.

6.3 Bregman operator splitting

The BOS algorithm for solving problem () was recently introduced in [] based on the primal dual formulation of the model. It can be described as

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (τI+γB

T_B₎–₍ τxk–A

T₍_Ax

k–b) +γBT(dk–vγk)),

dk+=prox

γf(Bxk++

vk γ),

()

whereγ > ,  <τAT_A_{< .}

Similarly, letP=I–τAT_A₊_{τ γ}_BT_B_{in (). Then}

Q=



γI B

BT _τI–ATA+γBTB

,

whereγ > ,  <τ< /AT_A_{,  <}_τ_{< /}_AT_A_–_γ_BT_B_{. The scheme () can be expressed} as ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (τI+γBTB)–(( 

τI–ATA+γBTB)xk–BT(vk–vk–) +ATb), dk+=prox

γf(Bxk++

vk γ),

()

Using relation (), we arrive at

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (τI+γB

T_B₎–₍ τxk–A

T₍_Ax

k–b) +γBT(dk–vγk)),

dk+=prox

γf(Bxk++

vk γ),

()

Now, we can see that the scheme () is the method (). Clearly, the iterative scheme () is a particular case of the frame withP=I–τAT_A₊_{τ γ}_BT_B_and_ρ_{= . Also, when}_ρ_{= ,} we can get a new primal-dual method for solving () as follows:

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ ˜

vk= (I+γ ∂f∗)–(vk+γBxk),

˜

xk= (τ +γB

T_B₎–₍₍ τ –A

T_A₊_γ_BT_B₎_x

k–BT(v˜k–vk) +ATb),

(13)

In fact, the scheme () can be considered as a relaxed BOS algorithm. Iff(·) = · , then

we can deduce that (I+∂f_∗)–₍_v_{) =}_proj₍_v_{), where}_proj_{is the projection operator. If the}

image satisﬁes periodic boundary conditions and if we use total-variation regularization, then the matrixBT_B_{is block circulant; hence, it can be diagonalized by the Fourier} trans-form matrix as noted in []. So, the new algorithm () can be computed eﬃciently and does not need the inner iteration to solve the subproblem.

6.4 Split Bregman

In this subsection, we identify the split Bregman algorithm as a particular case of the pro-posed algorithm. Firstly, we reformulate model () as an equivalent constrained mini-mization problem

min x,d

f(d) +f(x) :Bx–d= 

.

The split Bregman algorithm for solving this constrained problem is as follows:

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

xk+=arg minxf(x) +vk,Bx–dk+γ_Bx–dk,

dk+=arg mindf(d) +vk,Bxk+–d+γ_Bxk+–d, vk+=vk+γ(Bxk+–dk+).

Whenf(x) =_Ax–b, it can be also described as

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (ATA+γBTB)–(ATb+BT(γdk–vk)),

dk+=prox

γf(Bxk++

vk γ),

()

where γ > . The diﬃculty of implementing the scheme () is mainly due to that the inverse of the matrix is not easy to obtain. Next, we show that our method can induce the scheme ().

Let

Q=



γI B

BT _γ_BT_B

,

whereτ= ,γ > . We see that the matrix is not positive. But the scheme () with thisQ

can induce the famous split Bregman algorithm. In this situation, the scheme () can be expressed as

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (ATA+γBTB)–(ATb–BT(vk–vk–) +γBTBxk),

dk+=prox

γf(Bxk++

vk γ),

()

Using the relation

(14)

the scheme () can be further expressed as

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (ATA+γBTB)–(ATb+BT(γdk–vk)),

dk+=prox

γf(Bxk++

vk γ),

()

The method () is the split Bregman scheme (). So, the split Bregman algorithm can be identiﬁed as a particular case of our proposed algorithm framework withP=γBT_B_and

ρ= . Ifρ= , then, forP=γBT_B_and_τ_{= , a new primal-dual scheme can be described} as follows:

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

˜

vk= (I+γ ∂f∗)–(vk+γBxk),

˜

xk= (ATA+γBTB)–(γBTBxk–BT(v˜k–vk–) +ATb), xk+=xk–ρ(xk–x˜k),

()

Because the matrixQis not positive definite, the split Bregman method may be not convergent. This case was also discussed in [] with the same result. Based on Lemma , it suffices to replace the matrixQby its perturbation in positive definite style. Similarly to [], we modified the matrixQas

Q=



γI B

BT γ θBTB+α( –θ)I

, ()

whereαis a positive number, andθis a number between  and . Using a similar derivation as before, the modiﬁed split Bregman algorithm is

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

xk+= (ATA+γ θBTB+α( –θ)I)–(( –θ)(α–γBTB)xk

+AT_b₊_BT₍_γ_d k–vk)),

dk+=prox

γf(Bxk++

vk γ),

()

Checking the convergence condition of the theorem, if α_γ >B_,α> , andθ∈[, ), we can easily get that the sequencexkgenerated from () converges to a solution of problem (). We remark that whenθ= , the scheme () reduces to the split Bregman method (). Whenθ= , the scheme () is the preconditioned alternating method of multipliers as discussed in [, ]. Also, whenQis deﬁned by () andρ= , the new primal-dual method can be expressed as

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

˜

vk= (I+γ ∂f∗)–(vk+γBxk),

˜

xk= (ATA+γ θBTB+α( –θ)I)–((γ θBTB+α( –θ)I)xk

–BT(˜vk–vk) +ATb),

(15)

The other positive deﬁnite matrixQmay be chosen as

Q=



γI B

BT  τI+γB

T_B

. ()

By a simple manipulation we obtain

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩

xk+= (ATA+γBTB+_τI)–(_τxk+ATb+BT(γdk–vk)),

dk+=prox

γf(Bxk++

vk γ),

()

According to [], the eigenvalues of the matrixBT_B_{all lie in the interval [, ). So, to} guarantee the positive deﬁnite ofQ, we should setγ τ> . In fact, the scheme () is the third case of Algorithm  in []. Finally, similarly to the previous subsection, we can also get a new relaxed splitting Bregman algorithm whenρ=  andQis given by (). Then, the new primal-dual algorithm can be reformulated as

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

˜

vk= (I+γ ∂f∗)–(vk+γBxk),

˜

xk= (ATA+γBTB+τI) –₍₍

τI+γB

T_B₎_x

k–BT(v˜k–vk) +ATb),

vk+=vk–ρ(vk–v˜k).

()

7 Conclusions

We proposed a primal-dual-based contraction framework in the prediction-correction fashion. The convergence and convergence rate of the proposed framework are also given. Some well-known algorithms, for example, the linearized primal-dual method, SIU, Breg-man operator splitting method, and split BregBreg-man method can be considered as particular cases of our algorithm framework. Some new primal-dual schemes such as (), (), (), and () are induced. Finally, how to choose the adaptive parameterρ is an interesting problem, which will be discussed in a forthcoming work.

Acknowledgements

This work is supported by the National Natural Science Foundation of China (11361018, 11461015), Guangxi Natural Science Foundation (2014GXNSFFA118001), Guangxi Key Laboratory of Cryptography and Information Security (GCIS201624), and Innovation Project of Guangxi Graduate Education.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

Both authors read and approved the ﬁnal manuscript.

Author details

1_{School of Electronic Engineering and Automation, Guangxi Key Laboratory of Automatic Detecting Technology and}

Instruments, Guilin University of Electronic Technology, Jinji Road, Guilin, China.2_{School of Mathematics and Computing} Science, Guangxi Colleges and Universities Key Laboratory of Data Analysis and Computation, Guilin University of Electronic Technology, Jinji Road, Guilin, 541004, China.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional aﬃliations.

(16)

References

1. Rockafellar, T: Convex Analysis. Princeton University Press, Princeton (1970)

2. Chambolle, A, Pock, T: A ﬁrst-order primal-dual algorithm for convex problems with applications to imaging. J. Math. Imaging Vis.40, 120-145 (2011)

3. Esser, E, Zhang, X, Chan, T: A general framework for a class of ﬁrst order primal-dual algorithms for convex optimization in imaging science. SIAM J. Imaging Sci.3, 1015-1046 (2010)

4. Zhu, M, Chan, T: An eﬃcient primal-dual hybrid gradient algorithm for total variation image restoration. CAM report, 08-34 (2008)

5. He, B, You, Y, Yuan, X: On the convergence of primal-dual hybrid gradient algorithm. SIAM J. Imaging Sci.7, 2526-2537 (2014)

6. Bonettini, S, Ruggiero, V: On the convergence of primal-dual hybrid gradient algorithms for total variation image restoration. J. Math. Imaging Vis.44, 236-253 (2012)

7. He, B, Yuan, X: Convergence analysis of primal-dual algorithms for a saddle-point problem: from contraction perspective. SIAM J. Imaging Sci.5, 119-149 (2012)

8. Cai, X, Han, D, Xu, L: An improved ﬁrst-order primal-dual algorithm with a new correction step. J. Glob. Optim.57(4), 1419-1428 (2013)

9. He, H, Desai, J, Wang, K: A primal-dual prediction-correction algorithm for saddle point optimization. J. Glob. Optim.

66(3), 573-583 (2016)

10. Zhang, B, Zhu, Z, Wang, S: A simple primal-dual method for total variation image restoration. J. Vis. Commun. Image Represent.38, 814-823 (2016)

11. Tian, WY, Yuan, XM: Linearized primal-dual methods for linear inverse problems with total variation regularization and ﬁnite element discretization. Inverse Probl.32(11), 115011 (2016)

12. Komodakis, N, Pesquet, J-C: Playing with duality: an overview of recent primal-dual approaches for solving large-scale optimization problems. IEEE Signal Process. Mag.32(6), 31-54 (2015)

13. Chen, Y, Hager, WW, Yashtini, M, Ye, X, Zhang, H: Bregman operator splitting with variable stepsize for total variation image reconstruction. Comput. Optim. Appl.54, 317-342 (2013)

14. Chen, P, Huang, J, Zhang, X: A primal-dual ﬁxed point algorithm for convex separable minimization with applications to image restoration. Inverse Probl.29(2), 025011 (2013)

15. Combettes, PL, Condat, L, Pesquet, J-C, Vu, BC: A forward-backward view of some primal-dual optimization methods in image recovery. In: 2014 IEEE International Conference on Image Processing (ICIP), pp. 4141-4145. IEEE Press, New York (2014)

16. Li, Q, Shen, L, Xu, Y: Multi-step ﬁxed-point proximity algorithms for solving a class of optimization problems arising from image processing. Adv. Comput. Math.41(2), 387-422 (2015)

17. Zhang, X, Burger, M, Osher, S: A uniﬁed primal-dual algorithm framework based on Bregman iteration. J. Sci. Comput.

46, 20-46 (2011)

18. Rockafellar, T: Monotone operators and the proximal point algorithm. SIAM J. Control Optim.14, 877-898 (1976) 19. He, B: PPA-like contraction methods for convex optimization: a framework using variational inequality approach.

J. Oper. Res. Soc. China3(4), 391-420 (2015)

20. Rudin, L, Osher, S, Fatemi, E: Nonlinear total variation based noise removal algorithms. Physica D60, 259-268 (1992) 21. Goldstein, T, Osher, S: The split Bregman method forL1-regularized problems. SIAM J. Imaging Sci.2, 323-343 (2009) 22. Sheﬁ, R, Teboulle, M: Rate of convergence analysis of decomposition methods based on the proximal method of

multipliers for convex minimization. SIAM J. Optim.24(1), 269-297 (2014)

23. Xu, M: Proximal alternating directions method for structured variational inequalities. J. Optim. Theory Appl.134, 107-117 (2007)

24. Chambolle, A, Pock, T: On the ergodic convergence rates of a ﬁrst-order primal-dual algorithm. Math. Program.

159(1), 253-287 (2016)

25. Condat, L: A primal-dual splitting method for convex optimization involving Lipschitzian, proximable and linear composite terms. J. Optim. Theory Appl.158(2), 460-479 (2013)

26. Combettes, PL, Wajs, VR: Signal recovery by proximal forward-backward splitting. SIAM J. Multiscale Model. Simul.4, 168-200 (2005)

27. Wang, Y, Yang, J, Yin, W, Zhang, Y: A new alternating minimization algorithm for total variation image reconstruction. SIAM J. Imaging Sci.1(3), 248-272 (2008)