• No results found

Almost sure convergence of the forward–backward–forward splitting algorithm

N/A
N/A
Protected

Academic year: 2021

Share "Almost sure convergence of the forward–backward–forward splitting algorithm"

Copied!
23
0
0

Loading.... (view fulltext now)

Full text

(1)

O R I G I NA L PA P E R

Almost sure convergence of the forward–

backward–forward splitting algorithm

B`˘ang Công V ˜u1,2

Received: 13 October 2014 / Accepted: 9 May 2015 / Published online: 23 May 2015 © Springer-Verlag Berlin Heidelberg 2015

Abstract In this paper, we propose a stochastic forward–backward–forward split-ting algorithm and prove its almost sure weak convergence in real separable Hilbert spaces. Applications to composite monotone inclusion and minimization problems are demonstrated.

Keywords Monotone inclusion · Monotone operator · Operator splitting · Lipschitzian operators · Forward–backward–forward algorithm · Composite operator·Duality·Primal-dual algorithm

1 Introduction

Forward–backward–forward splitting algorithm was firstly proposed in [24] for solv-ing the problem of findsolv-ing a zero point of the sum of a maximally monotone operator A:H → 2H and a monotone Lipschitzian operator C: HH, where H is a real Hilbert space. This splitting algorithm plays a role in solving a large class of composite monotone inclusions [3] and monotone inclusions involving the parallel sums [2,10,11,15] as well as applications to conposite convex optimization problem involving the infimal-convolutions [2–4,11,15]. However, these works are limitted to deterministic setting.

Very recently, we have found out in the literature that there appears the study of some splitting algorithms for solving monotone inclusions in the stochastic setting as in

B

B`˘ang Công V˜u [email protected]

1 LCSL, Istituto Italiano di Tecnologia, Genova, Italy

2 Massachusetts Institute of Technology, Bldg. 46-5155, 77 Massachusetts Avenue,

(2)

[12,19,21], and primal-dual splitting algorithm for composite monotone inclusions in [12,19]. Some iterations in [12,19,21] are designed for monotone inclusions involv-ing cocoercive operators. For solvinvolv-ing monotone inclusions involvinvolv-ing Lipschitzian monotone operators, one can often use the iterations which has the structure of the forward–backward–forward splitting methods as cited above, but the convergence of their proposed methods is no longer available, in the literature, in the stochastic setting.

The objective of this note is to study the convergence of the forward–backward– forward splitting in the stochastic setting for monotone inclusions involving Lip-schitzian monotone operators as well as for composite monotone inclusions involving parallel sums.

In Sect.2, we recall some notations, background and preliminary results. We prove the almost sure convergence of the stochastic forward–backward–forward splitting algorithm in Sect.3. In the last section, we provide applications to composite monotone inclusions involving the parallel sums as well as minimization problems involving infimal convolutions.

2 Notation–background and premilary results

Throughout,H,G, and(Gi)1≤imare real separable Hilbert spaces. Their scalar

prod-ucts and associated norms are respectively denoted by · | ·and · . We denote by B(H,G)the space of bounded linear operators from H to G. The adjoint of L ∈B(H,G)is denoted byL∗. We setB(H)=B(H,H). Id denotes the identity operator. The symbolsand→denote weak and strong convergence, respectively. We denote by 1+(N)the set of summable sequences in [0,+∞[. The class of all proper lower semicontinuous convex functions fromHto] − ∞,+∞]is denoted by

0(H). LetM1andM2be self-adjoint operators inB(H), we writeM1M2if and only if(∀xH) M1x|xM2x|x.Letα∈]0,+∞[. We set

Pα(H)=M ∈B(H)|M∗=M and M αId. (2.1)

LetA:H→2Hbe a set-valued operator. The domain ofAis domA=xH| Ax =∅, and the graph of A is graA = (x,u)H×H | uAx. The set of zeros of Ais zerA = xH| 0∈ Ax, and the range of Ais ranA =uH |

(∃xH)uAx. The inverse of AisA−1:H→2H:uxH|uAx, and the resolvent ofAis

JA=(Id+A)−1. (2.2)

Moreover,Ais monotone if

(∀(x,y)H×H)(∀(u, v)Ax×Ay) xy|uv ≥0, (2.3) and maximally monotone if it is monotone and there exists no monotone operator B: H → 2H such that graA ⊂ graB and A = B. We say that A is uniformly monotone at x ∈ domA if there exists an increasing function φA: [0,+∞[→

(3)

uAx∀(y, v)∈graA xy|uv ≥φA(xy). (2.4)

Given a probability space(,F,P), we denote byσ (x)theσ-field generated by a random vector x: H, where His endowed with the Borelσ-algebra. The expectation of a random variablexis denoted byE[x]. The conditional expectation of xgiven a sub-sigma algebraFFis denoted byE[x|F]. The conditional expectation ofxgivenyis denoted byE[x|y].

Lemma 2.1 [23, Theorem 1]Let(Fn)n∈Nbe an increasing sequence of sub-sigma

algebras ofF. For every n∈N, let zn,ξn,ζnand tnbe non-negative,Fn-measurable

random variable such that(ζn)nNand(tn)nNare summable and

(∀n∈N) E[zn+1|Fn] ≤(1+tn)zn+ζnξn Pa.s. (2.5)

Then(zn)n∈Nconverges and(ξn)n∈Nis summableP-a.s.

Lemma 2.2 [12, Proposition 2.3]LetHbe a real separable Hilbert space, let C be a non-empty closed subset ofH, letφ: [0,∞[→ [0,∞[, let(xn)n∈Nbe a sequence

of random vectors inH. Suppose that, for every xC, there exist non-negative summable sequences of random variables(ζn(x))n∈Nand(tn(x))n∈Nsuch that, for

every n∈N,ζn(x)and tn(x)areFn=σ(x0, . . . ,xn)-measurable, and

(∀n ∈N) E[φ(xn+1−x)|Fn] ≤(1+tn(x))φ(xnx)+ζn(x) P-a.s. (2.6)

Suppose thatφis strictly increasing andlimξ→∞φ(ξ) = +∞. Then the following hold.

(i) (xnx)n∈Nis bounded and convergesP-a.s.

(ii) There exists a subsetwithP()=1such that for every xC and every

ω,(xn(ω)x)n∈Nconverges.

(iii) (xn)n∈Nconverges weaklyP-a.s. to a C-valued random vector if and only if every

its weak cluster point is in CP-a.s.

Remark 2.3 A sequence(xn)n∈Nsatisfying (2.6) is called a stochasticφ-quasi-Fejér

monotone with respect to the target setC. The connections of Lemma2.2to existing work can be found in [12, Remark 2.4].

In view of the work in [13, Theorem 3.3], we also have a variable metric extension of Lemma2.2.

Proposition 2.4 LetHbe a real separable Hilbert space, let C be a non-empty closed subset ofH, letφ: [0,∞[→ [0,∞[, letα ∈]0,∞[, let W ∈ Pα(H)and(Wn)n∈N

be a sequence inPα(H)such that WnW pointwise, let(xn)n∈Nbe a sequence

of random vectors inH. Suppose that, for every xC, there exist non-negative summable sequences of random variables(ζn(x))nNand(tn(x))nNsuch that, for

every n∈N,ζn(x)and tn(x)areFn=σ(x0, . . . ,xn)-measurable, and

(∀n∈N) E[φ(xn+1−xWn+1)|Fn] ≤(1+tn(x))φ(xnxWn)+ζn(x) P-a.s.

(4)

Suppose thatφis strictly increasing andlimξ→∞φ(ξ) = +∞. Then the following hold.

(i) (xnxWn)n∈Nis bounded and convergesP-a.s.

(ii) There exists a subsetwithP()=1such that for every xC and every

ω,(xn(ω)xWn)n∈Nconverges.

(iii) (xn)n∈Nconverges weaklyP-a.s. to a C-valued random vector if and only if every

its weak cluster point is in CP-a.s.

Proof (i): Set(∀n∈N) ξn= xnzWn. It follows from (2.7) and Lemma2.1that (φ(ξn))n∈NconvergesP-a.s., sayφ(ξn)λ. In turn, since limt→+∞φ(t)= +∞,

(ξn)n∈Nis boundedP-a.s. Letωsuch that(ξn(ω))n∈Nis bounded and, to show that

it converges, it suffices to show that it cannot have two distinct cluster points. Suppose to the contrary that we can extract two subsequences(ξkn(ω))n∈Nand(ξln)n∈N(ω)such

thatξkn(ω)η(ω)andξln(ω)ζ(ω) > η(ω), and fixε∈]0, (ζη)/2[. Then, for

nsufficiently large,ξkn(ω)η(ω)+ε < ζ(ω)εξln(ω)and, sinceφis strictly

increasing,φ(ξkn(ω))φ(η(ω)+ε) < φ(ζ(ω)−ε)≤φ(ξln(ω)). Taking the limit as

n→ +∞yieldsλ(ω)φ(η(ω)+ε) < φ(ζ(ω)ε)λ(ω), which is impossible. (ii): SinceHis separable, so isC and hence there exists a countable subset X of C such that X = C. In view of (i), for eachxX, there exists a subsetx with

probability 1 such that(xn(ω)xWn)n∈Nconverges for every ωx. Define

=

xXx. Since X is countable,P()=1. Now, letx0Candω0 ∈∗. Then, there exists a sequence(ck)k∈NinXsuch thatckx0. By (i), we have

(∀k∈N)(∃τk:→ [0,+∞[)(∀ω∈ck) xn(ω)ckWnτk(ω). (2.8)

Moreover, setμ =supnNWn. Thenμ < +∞by Banach–Steinhaus Theorem.

Then, for everyn∈Nandk∈N, we have

−√μckx0 ≤ −ckx0Wnxn(ω0)x0Wnxn(ω0)ckWnckx0Wn ≤√μckx0. (2.9) Therefore, (∀k∈N)−√μckx0 ≤ lim n→∞ xn(ω0)x0Wnnlim→∞xn(ω0)ckWn = lim n→∞ xn(ω0)x0Wnτk(ω0) ≤ lim n→∞xn(ω0)x0Wnτk(ω0) ≤√μckx0. (2.10)

Now, letk→ ∞, we get limn→∞xn(ω0)x0Wn =limk→∞τk(ω0)which proves

(ii).

(iii): Necessity is clear. To show sufficiency, let be the set of allωsuch that every weak sequential cluster point of (xn(ω))n∈N is in C. Then has

(5)

cluster points of (xn(ω))n∈N, say xkn(ω) x(ω) and xln(ω) y(ω). Then it

follows from (ii) that (xn(ω)x(ω)Wn)n∈N and(xn(ω)y(ω)Wn)n∈N

con-verge. Moreover,x(ω)2Wn = Wnx(ω)|x(ω) → W x(ω)|x(ω)and, likewise,

y(ω)2WnW y(ω)|y(ω). Therefore, since (∀n ∈N) Wnxn(ω)|x(ω)y(ω)= 1 2 xn(ω)−y(ω)2Wnxn(ω)x(ω) 2 Wn + x(ω)2Wny(ω)2Wn, (2.11) the sequence(Wnxn(ω)|x(ω)y(ω))n∈Nconverges, sayWnxn(ω)|x(ω)y(ω)

λ(ω)∈R, which implies that

xn(ω)|Wn(x(ω)y(ω)) →λ(ω)∈R. (2.12)

However, sincexkn(ω) x(ω)andWkn(x(ω)y(ω))W(x(ω)y(ω)), it

fol-lows from (2.12) and [5, Lemma 2.41(iii)] thatx(ω)|W(x(ω)y(ω)) = λ(ω). Likewise, passing to the limit along the subsequence(xln(ω))n∈N in (2.12) yields y(ω)|W(x(ω)y(ω)) =λ. Thus,

0= x(ω)|W(x(ω)y(ω)) − y(ω)|W(x(ω)y(ω))

= x(ω)y(ω)|W(x(ω)y(ω))

αx(ω)y(ω)2. (2.13)

This shows thatx(ω)=y(ω). Upon invoking (ii) and [5, Lemma 2.38], we conclude thatxn(ω) x(ω)and hence we obtain the conclusion.

3 A stochastic forward–backward–forward splitting algorithm

The forward–backward–forward splitting algorithm was firstly proposed in [24] to solve inclusion involving the sum of a maximally monotone operator and a Lipschitzian monotone operator. In [3], it was revisited to include computational errors. Below, we extend it to a stochastic setting. The following theorem is a stochastic version of [25, Theorem 3.1].

Theorem 3.1 LetKbe a real separable Hilbert space with the scalar product· | · and the associated norm||| · |||. Letαandβbe in]0,+∞[, let(ηn)n∈Nbe a sequence

in1+(N), and let(Un)n∈Nbe a sequence inB(K)such that

μ=sup

n∈N

Un<+∞ and (∀xK) (1+ηn)x |Un+1x

x|Unxα|||x|||2. (3.1)

Let A: K → 2Kbe maximally monotone, let B:KKbe a monotone andβ -Lipschitzian operator onKsuch thatzer(A+B)= ∅. Let(an)n∈N,(bn)n∈N, and

(6)

square integrableK-valued random vector, letε∈]0,1/(βμ+1)[, let(γn)n∈Nbe a

sequence in[ε, (1−ε)/(βμ)], and set

(∀n ∈N) ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ yn=xnγnUn(Bxn+an) pn=JγnUnAyn+bn qn= pnγnUn(B pn+cn) xn+1=xnyn+qn. (3.2) Suppose that E[|||an|||2|Fn] n∈N, E[|||bn|||2|Fn] n∈N, and E[|||cn|||2|Fn]

n∈Nare summableP-a.s., the following hold for somezer(A+B)

-valued random vectorx.

(i) nNE[|||xnpn|||2|Fn] < +∞andnNE[|||ynqn|||2|Fn] < +∞

P-a.s.

(ii) xn xand JγnUnA(xnγnUnBxn) xP-a.s.

(iii) Suppose that one of the following is satisfied for some subsetwithP()=

1.

(a) A+Bis demiregular(see[1, Definition 2.3])atx(ω)for everyω. (b) Aor Bis uniformly monotone atx(ω)for everyω.

Thenxnxand JγnUnA(xnγnUnBxn)xP-a.s.

Proof It follows from [14, Lemma 3.7] that the sequences(xn)n∈N,(yn)n∈N,(pn)n∈N

and(qn)nNare well defined. Moreover, using [13, Lemma 2.1(i)(ii)] and (3.1), for

every sequence of random vectorsK-valued(zn)nN, we have

n∈N E[|||zn|||2|Fn]< +∞ P-a.s. ⇔ n∈N E[|||zn|||2U1 n |Fn]<+∞ P-a.s. (3.3) and n∈N E[|||zn|||2|Fn]<+∞ P-a.s. ⇔ n∈N E[|||zn|||2Un|Fn]<+∞ P-a.s. (3.4) Let us set, for everyn ∈N,

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ yn=xnγnUnBxn pn=JγnUnAyn qn=pnγnUnBpn xn+1=xnyn+qn, and ⎧ ⎪ ⎨ ⎪ ⎩ un=γn−1Un1(xnpn)+BpnBxn en=xn+1−xn+1 dn=qnqn+ynyn. (3.5) Then (3.5) yields (∀n∈N) un=γn−1Un1(ynpn)+BpnApn+Bpn, (3.6)

(7)

and (3.5), Lemma [14, Lemma 3.7(ii)], and the Lipschitzianity of BonKyield (∀n∈N) ⎧ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎩ |||ynyn|||Un1 ≤(βμ) −1|||a n|||Un |||pnpn|||U−1 n ≤ |||bn|||Un1 +(βμ) −1|||a n|||Un |||qnqn|||U−1 n ≤2 |||bn|||U−1 n +(βμ) −1|||a n|||Un +(βμ)−1|||c n|||Un. (3.7) Since E[|||an|||2|Fn] nN, E[|||bn|||2|Fn] nN, and E[|||cn|||2|Fn] nN

are summable P-a.s., using Jensen’s inequality, we derive from (3.3), (3.4), (3.5), and (3.7) that ⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ n∈NE[|||pnpn||||Fn]<+∞ and n∈NE[|||pnpn|||Un1|Fn] <+∞ P-a.s. n∈NE[|||qnqn||||Fn]<+∞ and n∈NE[|||qnqn|||Un1|Fn] <+∞ P-a.s. n∈NE[|||dn||||Fn]<+∞ and n∈NE[|||dn|||U−1 n |Fn]<+∞ P-a.s. (3.8) Noting that 2E[|||ynyn|||2U−1 n |Fn] ≤2(βμ) −2 E[|||an|||2Un|Fn], (3.9) and 2E[|||qnqn|||2U−1 n |Fn] ≤24 E[|||bn|||2U−1 n |Fn] +(βμ) −2 E[|||an|||2Un + |||cn|||2Un|Fn] (3.10) Therefore, upon settingc=max{26(βμ)−2,24}, and adding (3.9) and (3.10), we get

2E[|||ynyn|||2U−1 n |Fn] +2E[|||qnqn||| 2 U−1 n |Fn] ≤cE[|||an|||U2n|Fn] +E[|||bn||| 2 Un1|Fn] +E[|||cn||| 2 Un|Fn] . (3.11) Now, using (3.11), (3.33), (3.34), (3.3), (3.4) and (3.5), we have

n∈N E[|||dn|||2U−1 n |Fn]≤2 n∈N E[|||ynyn|||2U−1 n |Fn]+2 n∈N E[|||qnqn|||2U−1 n |Fn] ≤c n∈N E[|||an|||2Un|Fn] + n∈N E[|||bn|||2U−1 n |Fn] + n∈N E[|||cn|||2Un|Fn]

(8)

0 n∈N E[|||an|||2Un|Fn] + n∈N E[|||bn|||2U−1 n |Fn] + n∈N E[|||cn|||2Un|Fn] <+∞ P-a.s., (3.12) where we define τ0=sup n∈N E[|||an|||2Un|Fn], E[|||bn|||2U−1 n |Fn], E[|||cn|||2Un|Fn] <+∞ P-a.s. (3.13)

Now, let x ∈ zer(A+B). Then, for everyn ∈ N,(x,−γnUnBx) ∈ gra(γnUnA)

and (3.5) yields (pn,ynpn) ∈ gra(γnUnA). Hence, by monotonicity of UnA with respect to the scalar product · | ·Un−1, we have pnx |

pnynγnUnBxUn1 ≤0.Moreover, by monotonicity ofUnBwith respect to

the scalar product· | ·U−1

n , we also havepnx|γnUnBxγnUnBpnUn1 ≤

0.By adding the last two inequalities, we obtain

(∀n∈N) pnx|pnynγnUnBpnUn1 ≤0. (3.14)

In turn, we derive from (3.5) that

(∀n ∈N) 2γnpnx |UnBxnUnBpnUn1 =2pnx|pnynγnUnBpnUn1 +2pnx|γnUnBxn+ynpnU−1 n ≤2pnx |γnUnBxn+ynpnUn1 =2pnx|xnpnUn1 = |||xnx|||2U−1 n − |||pnx||| 2 U−1 n − |||xnpn||| 2 U−1 n . (3.15)

Hence, using (3.5), (3.15), theβ-Lipschitz continuity of B, and [13, Lemma 2.1(ii)], for everyn∈N, we obtain

|||xn+1−x|||2U−1 n = |||qn+xnynx||| 2 U−1 n = |||(pnx)+γnUn(BxnBpn)|||2U−1 n = |||pnx|||2U−1 n +2γnpnx |BxnBpn +γ2 n|||Un(BxnBpn)|||2U−1 n ≤ |||xnx|||2U−1 n − |||xnpn||| 2 U−1 n +γ2 nμβ2|||xnpn|||2

(9)

≤ |||xnx|||2U−1 nμ −1||| xnpn|||2 +γ2 nμβ 2||| xnpn|||2. (3.16)

Hence, it follows from (3.1) and [13, Lemma 2.1(i)] that

(∀n ∈N) |||xn+1−x|||2U−1 n+1 ≤(1+ηn)|||xnx|||2U−1 nμ −1( 1−γn2β2μ2)|||xnpn|||2. (3.17) Consequently, (∀n∈N) |||xn+1−x|||U−1 n+1 ≤(1+ηn)|||xnx|||U− 1 n . (3.18)

For everyn∈N, set

εn= μα−1 2|||bn|||U−1 n +(βμ) −1|||a n|||Un +(βμ)−1|||cn|||Un+(βμ)−1|||an|||Un . (3.19) Then (En|Fn])n∈N is summable P-a.s. by (3.3) and we derive from [13,

Lemma 2.1(ii)(iii)], and (3.8) that

(∀n∈N) |||en|||U−1 n+1 = |||xn+1−xn+1|||Un+11 ≤ α−1|||x n+1−xn+1||| ≤ μα−1|||x n+1−xn+1|||U−1 nμα−1(|||y nyn|||Un1+ |||qnqn|||Un1)εn. (3.20)

In turn, we derive from (3.18) that

(∀n∈N) |||xn+1−x|||U−1 n+1 ≤ |||xn+1−x|||U− 1 n+1+ |||xn+1−xn+1|||U− 1 n+1 ≤ |||xn+1−x|||U−1 n+1+εn(1+ηn)|||xnx|||U−1 n +εn. (3.21)

By assumption, sinceE[x02]is finite, by induction, for everyn ∈N,E[xn2]is

finite and hence E[xn]andE[xnU−1

n ]are finite too. By taking the conditional

expectation with respect toFnand note that|||xnx|||U−1

n isFn-measurable, we

obtain

(∀n ∈N) E[|||xn+1−x|||U−1

n+1|Fn] ≤(1+ηn)|||xnx|||Un1+En|Fn]. (3.22)

This shows that(xn)n∈N is| · |–quasi-Fejér monotone with respect to the target set

zer(A+B)relative to(Un1)n∈N. Moreover,(|||xnx|||U−1

(10)

sinceBand(JγnUnA)n∈Nare Lipschitzian, and(∀n∈N)x=JγnUnA(x−γnUnBx),

we deduce from (3.5) that(yn)n∈N, (pn)n∈N, and(qn)n∈Nare bounded. Therefore,

τ =sup

n∈N

{|||xnyn+qnx|||Un−1,|||xnx|||Un1}<+∞ P-a.s. (3.23)

Hence, using (3.5), Cauchy–Schwarz for the norms(||| · |||U−1

n )n∈N, and (3.16), we

get, for everyn ∈N,

|||xn+1−x|||2U−1 n = |||xnyn+qnx||| 2 U−1 n = |||qn+xnynx+dn|||2U−1 n ≤|||qn+xnynx|||2U−1 n +2|||qn+xnynx|||U− 1 n |||dn|||Un1 + |||dn|||2U−1 n ≤ |||xnx|||2U−1 nμ −1( 1−γn2β2μ2)|||xnpn|||2+ε1,n, (3.24) where(∀n∈N) ε1,n=2|||qn+xnynx|||Un1|||dn|||Un−1+ |||dn|||2U−1 n . In turn,

for everyn∈N, by (3.1) and [13, Lemma 2.1(i)],

|||xn+1−x|||2U−1 n+1 ≤( 1+ηn)|||xn+1−x|||2U−1 n(1+ηn)|||xnx|||2U−1 nμ −1( 1−γn2β2μ2)|||xnpn|||2 +(1+ηn)ε1,n. (3.25)

Since,JγnA(Id−γnB)is continuous,pnisFn-measurable. In turn, for everyn∈N,

E[|||xn+1−x|||2 Un+11|Fn]≤(1+ηn)|||xnx||| 2 Un1−μ −1( 1−γn2β2μ2)|||xnpn|||2 +E[(1+ηn)ε1,n|Fn]. (3.26)

Let us prove that

n∈N

E[ε1,n|Fn]<+∞ P-a.s. (3.27)

Indeed, since Id−γnBis continuous,ynandqnareFn-measurable. Therefore,|||xn

yn+qnx|||U−1

(11)

n∈N E[|||xnyn+qnx|||U−1 n |||dn|||Un1|Fn] = n∈N |||xnyn+qnx|||U−1 n E[|||dn|||Un1|Fn] ≤τ nN E[|||dn|||Un1|Fn] <+∞ P-a.s., (3.28)

which and (3.12) prove (3.27). It follows from Lemma2.1that

nN

|||xnpn|||2<+∞ P-a.s. (3.29)

(i): It follows from (3.29) and (3.8) that n∈N E[|||xnpn|||2|Fn] ≤2 n∈N |||xnpn|||2+2 n∈N E[|||pnpn|||2|Fn] <+∞ P-a.s. (3.30)

Furthermore, we derive from (3.8), (3.3) and (3.12) that n∈N E[|||ynqn|||2] = n∈N E[|||qnyn+dn|||2|Fn] = n∈N E[|||pnxn+γnUn(BxnBpn)+dn|||2|Fn] ≤3 n∈N |||xnpn|||2+E[|||γnUn(BxnBpn)|||2 + |||dn|||2|Fn] <+∞ P-a.s. (3.31)

(ii): Let0be the set of allωsuch that(xn(ω))n∈Nis bounded and (3.29)

is satisfied. We haveP(0)= 1. Fixω0. Let x(ω)be a weak cluster point of

(xn(ω))n∈N. Then there exists a subsequence(xkn(ω))n∈Nthat converges weakly to x(ω). Thereforepkn(ω) x(ω)by (3.29) and by the definition of0. Furthermore,

it follows from (3.5) thatukn(ω) → 0. Hence, since(∀n ∈ N) (pkn(ω),ukn(ω))

gra(A+B), we obtain,x(ω)∈zer(A+B)[5, Proposition 20.33 (ii)]. Altogether, it follows Proposition 2.4 thatxnxand hence thatpnx.

Now, let1be the set of allωsuch thatxn(ω) x(ω)andpn(ω) x(ω),

andpn(ω)xn(ω)→0. ThenP(1)=1 and henceP(1)=1.

(iii)(a): Fixω1. Thenxn(ω) x(ω)andpn(ω) x(ω). Furthermore,

it follows from (3.5) thatun(ω) → 0. Hence, since(∀n ∈ N) (pn(ω),un(ω))

gra(A+ B) and since A+ B is demiregular at x(ω) by our assumption, by [1, Definition 2.3],pn(ω)x(ω), and thereforexn(ω)x(ω).

(12)

(iii)(b): Fixω1. If Aor Bis uniformly monotone atx(ω), then A+B

is uniformly monotone at x(ω). Therefore, the result follows from [1, Proposition

2.4(i)].

Corollary 3.2 LetKbe a real separable Hilbert space with the scalar product· | · and the associated norm||| · |||. Letβbe in]0,+∞[, letA:K→2Kbe maximally monotone, let B:KKbe a monotone andβ-Lipschitzian operator onKsuch thatzer(A+B) = ∅. Let(an)nN,(bn)nN, and(cn)nNbe sequences of square

integrableK-valued random vectors. Letx0be a square integrableK-valued random vector , letε∈]0,1/(β+1)[, let(γn)n∈Nbe a sequence in[ε, (1−ε)/β], and set

(∀n∈N) ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ yn=xnγn(Bxn+an) pn= JγnAyn+bn qn= pnγn(B pn+cn) xn+1=xnyn+qn. (3.32)

Suppose that the following conditions are satisfied withFn=σ(x0, . . . ,xn),

n∈N E[|||an|||2|Fn]<∞, n∈N E[|||bn|||2|Fn]<∞ (3.33) and n∈N E[|||cn|||2|Fn]<P-a.s. (3.34)

Then the following hold for somezer(A+B)-valued random vectorx. (i) nNE[|||xnpn|||2|Fn] < +∞and

n∈NE[|||ynqn|||2|Fn] < +∞

P-a.s.

(ii) xn xand JγnA(xnγnBxn) xP-a.s.

(iii) Suppose that one of the following is satisfied for some subsetwithP ()=1.

(a) A+Bis demiregular(see[1, Definition 2.3])atx(ω)for everyω. (b) Aor Bis uniformly monotone atx(ω)for everyω.

Thenxnxand JγnA(xnγnBxn)xP-a.s.

Remark 3.3 Here are some remarks. In the case when B is a general multi-valued maximally monotone operator or a cocoercive operator, the almost sure convergence of the Douglas–Rachford or forward–backward are proved in [12] under the same type of condition on the stochastic errors. Furthermore, in the case whenBis cocoercive and uniformly monotone, the almost sure convergence of the forward–backward splitting is also proved in [21] under different conditions on stepsize and stochastic errors. One of the early work concerns with Lipschitzian monotone operator was in [18]. Example 3.4 Let f: K → [−∞,+∞] be a proper lower semicontinuous con-vex function, let α ∈ ]0,+∞[, let β ∈ ]0,+∞[, let B: KKbe a monotone and β-Lipschitzian operator. Let (an)n∈N, (bn)n∈N, and (cn)n∈N be sequences of

square integrable K-valued random vectors such that (3.33) and (3.34) are sat-isfied. Furthermore, let x0 be a square integrable K-valued random vector, let

(13)

ε ∈]0,min{1,1/(β +1)}[, let(γn)n∈N be a sequence in [ε, (1−ε)/β]. Suppose

that the variational inequality

find xK such that (∀yK) xy|Bx + f(x)f(y) (3.35) admits at least one solution and set

(∀n∈N) ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ yn=xnγn(Bxn+an) pn=arg min xK f(x)+21γn|||xyn|||2 +bn qn= pnγn(B pn+cn) xn+1=xnyn+qn. (3.36)

Then, for almost allω,(xn(ω))n∈Nconverges weakly to a solutionx(ω)to (3.35).

Proof SetA=f in Corollary3.2(ii). Remark 3.5 Since(γn)nNis bounded away from 0, we have

n∈N γn= +∞ and n∈N γ2 n = +∞. (3.37)

While, in the standard stochastic gradient method [20], we often require n∈N γn= +∞ and n∈N γ2 n <+∞. (3.38)

Under the condition (3.38), the conditions on the stochastic errors in the stochastic gradient method are weaker than (3.33)–(3.34) (see also [26, Assumption 2 and Eq (4) ] for the case of the projected stochastic gradient method).

We end this section by noting that, in the case when Un =U, we obtain a

pre-conditioned version of (3.32). Some other preconditioned algorithms can be found in [17,19].

4 Monotone inclusions involving Lipschitzian operators

The applications of the forward–backward-forward splitting algorithm considered in [3,11,24] can be extended to a stochastic setting using Theorem3.1. As an illustration, we present a stochastic version of the algorithm proposed in [11, Eq. (3.1)]. Recall that the parallel sum ofA:H→2HandB:H→2His [5]

AB=(A−1+B−1)−1. (4.1) Problem 4.1 LetHbe a real separable Hilbert space, letmbe a strictly positive integer, letzH, let A: H → 2H be maximally monotone operator, letC:HHbe monotone andν0-Lipschitzian for someν0∈]0,+∞[. For everyi ∈ {1, . . . ,m}, letGi

(14)

be a real separable Hilbert space, letriGi, letBi: Gi →2Gibe maximally monotone

operator, letDi:Gi →2Gibe monotone and such thatDi 1isνi-Lipschitzian for some

νi ∈]0,+∞[, and let Li:HGi be a nonzero bounded linear operator. Suppose

that z∈ran A+ m i=1 Li(Bi Di)(Li · −ri) +C . (4.2)

The problem is to solve the primal inclusion

zAx+ m i=1 Li(Bi Di)(Lixri) +C x, (4.3)

and the dual inclusion

(∀i ∈ {1, . . . ,m}) ri ∈ −Li(A+C)−1 zm i=1 Livi +Bi−1vi+Di−1vi. (4.4)

We denote byPandDbe the set of solutions to (4.3) and (4.4), respectively. As shown in [11], Problem4.1covers a wide class of problems in nonlinear analysis and convex optimization problems. However, the algorithm in [11, Theorem 3.1] is studied in the deterministic. The following result extends this result to a stochastic setting.

Let us defineK=HG1⊕· · ·⊕Gmthe Hilbert direct sum of the Hilbert spacesH

and(Gi)1im, the scalar product and the associated norm ofKrespectively defined

by · | ·:((x,v), (y,w))x|y + m i=1 vi |wi and||| · |||:(x,v)x2+ m i=1 vi2, (4.5)

wherev=(v1, . . . , vm)andw=(w1, . . . , wm)are generic elements inG1⊕· · ·⊕Gm.

Corollary 4.2 Let(a1,n)n∈N, (b1,n)n∈N, and(c1,n)n∈Nbe sequences of square

inte-grable H-valued random vectors, and for every i ∈ {1, . . . ,m}, let (a2,i,n)n∈N,

(b2,i,n)n∈N, and(c2,i,n)n∈Nbe sequences of square integrableGi-valued random

vec-tors. Furthermore, set

β=max{ν0, ν1, . . . , νm} + m i=1 Li2, (4.6)

(15)

let x0be a square integrableH-valued random vector, and, for every i ∈ {1, . . . ,m}, letvi,0 be a square integrableGi-valued random vector, let ε ∈]0,1/(1+β)[, let

(γn)n∈Nbe a sequence in[ε, (1−ε)/β]. Set (∀n∈N) ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ y1,n=xnγn C xn+ m i=1Livi,n+a1,n p1,n=JγnA(y1,n+γnz)+b1,n for i =1, . . . ,m ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ y2,i,n=vi,n+γn LixnDi−1vi,n+a2,i,n p2,i,n=JγnB−1 i (y2,i,nγnri)+b2,i,n q2,i,n=p2,i,n+γn Lip1,nDi−1p2,i,n+c2,i,n vi,n+1=vi,ny2,i,n+q2,i,n q1,n= p1,nγn C p1,n+ m i=1Li p2,i,n+c1,n xn+1=xny1,n+q1,n. (4.7)

Suppose that the following conditions hold forFn=σ ((xk, (vi,k)1≤im)0≤kn,

⎧ ⎪ ⎨ ⎪ ⎩ n∈N E[|||(a1,n, (a2,i,n)1≤im)|||2|Fn]<+∞ nN E[(b1,n, (b2,i,n)1≤im)|||2|Fn]<+∞ n∈N E[|||(c1,n, (c2,i,n)1≤im)|||2|Fn]<+∞. (4.8)

Then the following hold.

(i) nNE[xnp1,n2|Fn] < +∞and (∀i ∈ {1, . . . ,m})

n∈NE[vi,n

p2,i,n2|Fn]<+∞P-a.s.

(ii) There exist a P-valued random vector x and a D-valued random vector

(v1, . . . , vm)such that the following hold.

(a) xnx and JγnA(xnγn(C xn+ m i=1Livi,n)+γnz) xP-a.s. (b) (∀i∈ {1, . . . ,m}) vi,n viand JγnB−1 i (vi,nn LixnDi−1vi,n)−γnri) viP-a.s.

(c) Suppose that A or C is uniformly monotone at x(ω)for everyωwith

P()=1, then xnx and JγnA(xnγn(C xn+

m

i=1Livi,n)+γnz)x

P-a.s.

(d) Suppose that Bj1 or Dj1 is uniformly monotone at vj(ω) for every ω

with P() = 1, for some j ∈ {1, . . . ,m}, then vj,nvj and

JγnB−1 j (vj,n+γn LjxnDj1vj,n)γnrj)vjP-a.s. Proof Set ⎧ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎩ A:K→2K:(x, v1, . . . , vm)(−z+Ax)×(r1+B1−1v1) × · · · ×(rm+Bm−1vm) B:KK:(x, v1, . . . , vm)C x+mi=1Livi,D−11v1−L1x, . . . , Dm−1vmLmx . (4.9)

(16)

SinceAis maximally monotone [5, Propositions 20.22 and 20.23],Bis monotone and

β-Lipschitzian [11, Eq. (3.10)] with domB=K,A+Bis maximally monotone [5, Corollary 24.24(i)]. In addition, [5, Propositions 23.15(ii) and 23.16] yield (∀γ

]0,+∞[)(∀n ∈N)(∀(x, v1, . . . , vm)K) A(x, v1, . . . , vm)= JγA(x+γz), JγB−1 i (viγri) 1≤im . (4.10) It is shown in [11, Eq. (3.12)] and [11, Eq. (3.13)] that under the condition (4.2), zer(A+B)=∅. Moreover, [11, Eq. (3.21)] and [11, Eq. (3.22)] yield

(x, v1, . . . , vm)∈zer(A+B)xsolves (4.3) and(v1, . . . , vm)solves (4.4).

(4.11) Let us next set, for everyn∈N,

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ xn=(xn, v1,n, . . . , vm,n) yn=(y1,n,y2,1,n, . . . ,y2,m,n) pn=(p1,n,p2,1,n, . . . ,p2,m,n) qn=(q1,n,q2,1,n, . . . ,q2,m,n) and ⎧ ⎪ ⎨ ⎪ ⎩ an=(a1,n,a2,1,n, . . . ,a2,m,n) bn=(b1,n,b2,1,n, . . . ,b2,m,n) cn=(c1,n,c2,1,n, . . . ,c2,m,n). (4.12) Then our assumptions imply that

n∈N E[|||an|||2|Fn]<∞, n∈N E[|||bn|||2|Fn]<∞, and nN E[|||cn|||2|Fn]<∞. (4.13)

Furthermore, it follows from the definition of B, (4.10), and (4.12) that (4.7) can be rewritten inKas (∀n∈N) ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ yn=xnγn(Bxn+an) pn=JγnAyn+bn qn = pnγn(B pn+cn) xn+1=xnyn+qn, (4.14)

which is (3.32). Moreover, every specific conditions in Corollary3.2are satisfied. (i): By Corollary3.2(i),nNE[|||xnpn|||2|Fn]<∞.

(ii)(a)&(ii)(b): It follows from Corollary3.2(ii) that

xnx and (∀i∈ {1, . . . ,m}) vi,n vi Pa.s. (4.15)

Corollary3.2(ii) shows that(x, v1, . . . , vm)∈zer(A+B). Hence, it follows from [11,

Eq (3.19)] that(x, v1, . . . , vm)satisfies the inclusions

m

i=1LiviC x ∈ −z+Ax

(∀i ∈ {1, . . . ,m})LixDi−1viri +Bi−1vi.

(17)

For everyn∈Nand everyi∈ {1, . . . ,m}, set y1,n=xn−γn C xn+ m i=1Livi,n p1,n=JγnA(y1,nnz) and y2,i,n=vi,n+γn LixnDi−1vi,n p2,i,n=JγnB−1 i (y2,i,nγnri). (4.17) We note that (4.13) implies that

E[|||an|||2|Fn] →0, E[|||bn|||2|Fn] →0 and E[|||cn|||2|Fn] →0 P-a.s.

(4.18) Then, using [5, Corollary 23.10], we get

p1,np1,nb1,n +β−1a1,n,

(∀i ∈ {1, . . . ,m}) p2,i,np2,i,nb2,i,n +β−1a2,i,n,

(4.19)

which and (4.18) imply that ⎧ ⎪ ⎨ ⎪ ⎩ E[p1,np1,n2|Fn] ≤2E[b1,n2+β−2a1,n2|Fn] →0P-a.s. (∀i∈ {1, . . . ,m})E[p2,i,np2,i,n2|Fn] ≤2E[b2,i,n2 +β−2a2 ,i,n2|Fn] →0P-a.s. (4.20) Since(x, v1. . . , vm)JγnA(xγn C x+mi=1Livi)+γnz)is continuous from

KH,p1,nisFn-measurable. By the same way, for everyi∈ {1, . . . ,m},p2,i,nis

Fn-measurable. In turn, by (i),(ii)(a), and (ii)(b), we obtain

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ p1,nxn2=E[p1,nxn2|Fn] ≤2E[p1,nxn2+ p1,np1,n2|Fn] →0P-a.s. (∀i ∈ {1, . . . ,m})p2,i,nvi,n2=E[p2,i,nvi,n2|Fn] ≤2E[p2,i,np2,i,n2+ p2,i,n −vi,n2|Fn] →0P-a.s.

p1,nx P-a.s. and (∀i ∈ {1, . . . ,m}) p2,i,n vi P-a.s.

(4.21)

(ii)(c): We derive from (4.17) that

(∀n∈N) ⎧ ⎪ ⎨ ⎪ ⎩ γ−1 n (xnp1,n)m i=1Livi,nC xn∈ −z+Ap1,n (∀i ∈ {1, . . . ,m}) γn−1(vi,np2,i,n)+LixnDi−1vi,nri +Bi−1p2,i,n. (4.22) Let3be the set of allωsuch that(xn(ω)x(ω))n∈N, (p1,n(ω)x(ω))n∈Nand

(18)

(∀i ∈ {1, . . . ,m})p2,i,n(ω)−vi,n(ω)→0,p1,n(ω)xn(ω)→0. Set4=3.

Then4has probability 1. Now fixω4. Since Ais uniformly monotone atx(ω), using (4.16) and (4.22), there exists an increasing functionφA: [0,+∞[→[0,+∞]

vanishing only at 0 such that, for everyn ∈N,

φA(p1,n(ω)x(ω)) p1,n(ω)x(ω)|γn(xn(ω)p1,n(ω))m i=1(Livi,n(ω)Livi(ω))χn(ω) =p1,n(ω)x(ω)|γn−1(xn(ω)p1,n(ω))χn(ω)m i=1 p1,n(ω)x(ω)|Livi,n(ω)Livi(ω) ! , (4.23) where we denote∀n ∈ N χn(ω) = p1,n(ω)− ¯x(ω)|C xn(ω)Cx¯(ω) ! . Since

(Bi−1)1≤imare monotone, for everyi∈ {1, . . . ,m}, we obtain

(∀n∈N) 0 p2,i,n(ω)vi(ω)|Lixn(ω)+γn−1(vi,n(ω)p2,i,n(ω))Lix(ω)βi,n(ω) =p2,i,n(ω)vi(ω)|Li(xn(ω)x(ω))+γn−1(vi,n(ω)p2,i,n(ω))βi,n(ω), (4.24) where ∀n ∈ N βi,n(ω) = p2,i,n(ω)vi(ω)| Di−1vi,n(ω)Di 1v¯i(ω) . Now, adding (4.24) fromi =1 toi =mand (4.23), we obtain, for everyn∈N,

φA(p1,n(ω)x(ω))p1,n(ω)x(ω)|γn−1(xn(ω)p1,n(ω)) + " p1,n(ω)x(ω)| m i=1 Li(p2,i,n(ω)vi,n(ω)) # + m i=1 p2,i,n(ω)vi(ω)| Li(xn(ω)p1,n(ω)) +γ−1 n (vi,n(ω)p2,i,n(ω)) −χn(ω)m i=1 βi,n(ω). (4.25)

(19)

For everyn∈Nand everyi∈ {1, . . . ,m}, we expandχn(ω)andβi,n(ω)as ⎧ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎩ χn(ω)= xn(ω)x(ω)|C xn(ω)C x(ω) + p1,n(ω)xn(ω)|C xn(ω)C x(ω) ! , βi,n(ω)= vi,n(ω)vi(ω)| Di−1vi,n(ω)Di−1vi(ω) +p2,i,n(ω)vi,n(ω)| Di 1vi,n(ω)Di−1vi(ω) . (4.26) By monotonicity ofCand(Di 1)1≤im, (∀n∈N) xn(ω)x(ω)|C xn(ω)C x(ω) ≥0, (∀i ∈ {1, . . . ,m}) vi,n(ω)vi(ω)| Di−1vi,n(ω)Di−1vi(ω) ≥0. (4.27) Therefore, for everyn∈N, we derive from (4.26) and (4.25) that

φA(p1,n(ω)−x(ω))≤φA(p1,n(ω)x(ω))+xn(ω)−x(ω)|C xn(ω)−C x(ω) + m i=1 vi,n(ω)vi(ω)| Di−1vi,n(ω)Di−1vi(ω)p1,n(ω)x(ω)|γn−1(xn(ω)p1,n(ω)) + " p1,n(ω)x(ω)| m i=1 Li(p2,i,n(ω)vi,n(ω)) # + m i=1 p2,i,n(ω)vi(ω)|Li(xn(ω)p1,n(ω)) +γ−1 n (vi,n(ω)p2,i,n(ω))p1,n(ω)xn(ω)|C xn(ω)C x(ω) ! − m i=1 p2,i,n(ω)vi,n(ω)| Di−1vi,n(ω)Di−1vi(ω) . (4.28) We set ζ = max 1≤imsupnN xn(ω)−x(ω)||,p1,n(ω)−x(ω),vi,n(ω)−vi(ω),p2,i,n(ω) −vi(ω) . (4.29)

Then it follows from the definition of4thatζ <∞, and from our assumption that

(∀n ∈ N) γn−1 ≤ ε−1. Therefore, using the Cauchy–Schwarz inequality, and the

(20)

φA(p1,n(ω)x(ω))≤ε−1ζxn(ω)−p1,n(ω)+ζ m i=1 Li xn(ω)−p1,n(ω) +ε−1v i,np2,i,nm i=1 Lip2,i,n(ω)−vi,n(ω) +ν0p1,n(ω)xn(ω) + m i=1 νip2,i,n(ω)vi,n(ω) →0. (4.30)

We deduce from (4.30) and (4.21) thatφA(p1,n(ω)x(ω))→0, which implies

thatp1,n(ω)x(ω). In turn,xn(ω)x(ω). Likewise, ifCis uniformly monotone

atx(ω), there exists an increasing functionφC: [0,+∞[→ [0,+∞] that vanishes

only at 0 such that

φC(xn(ω)x(ω))ε−1ζxn(ω)p1,n(ω) +ζ m i=1 Li xn(ω)p1,n(ω) +ε−1vi,n(ω)−p2,i,n(ω)m i=1 Lip2,i,n(ω)vi,n(ω)+ν0p1,n(ω)−xn(ω)+ m i=1 νip2,i,n(ω)−vi,n(ω) →0, (4.31) in turn,xn(ω)x(ω).

(ii)(d): Proceeding as in the proof of (ii)(c), we obtain the conclusions. We provide an application to minimization problems in [11, Section 4] which cover a wide class of convex optimization problems in the literature. We recall that the infimal convolution of the two functions f andgfromHto] − ∞,+∞]is

f g:x→ inf

y∈H(f(y)+g(xy)). (4.32)

The proximity operator of f0(H), denoted by proxf, which maps each point

xHto the unique minimizer of the function f +12x− ·2.

Example 4.3 Letm be a strictly positive integer. LetHbe a real separable Hilbert space, letzH, let f0(H), leth:H→Rbe convex differentiable function with

ν0-Lipschitz continuous gradient, for someν0∈]0,+∞[. For everyk∈ {1, . . . ,m}, let(Gk,· | ·)be a real separable Hilbert space, letrkGk, letgk0(Gk), letk

0(Gk)be 1/νk-strongly convex, for someνk ∈]0,+∞[. For everyk ∈ {1, . . . ,m},

(21)

minimize x∈H f(x)x|z+ m k=1 k gk) Lkxrk +h(x), (4.33)

and the dual problem is to

minimize v1∈G1,...,vm∈Gm (fh) zm i=1 Livi + m i=1 gi(vi)+i(vi)+ vi |ri . (4.34) We denote byP1andD1be the set of solutions to (4.33) and (4.34), respectively. Corollary 4.4 In Example4.3, suppose that

z∈ran ∂f + m i=1 Li(∂gi ∂i)(Li · −ri) + ∇h . (4.35) Let(a1,n)n∈N, (b1,n)n∈N, and(c1,n)n∈Nbe sequences of square integrableH-valued

random vectors, and for every i ∈ {1, . . . ,m}, let (a2,i,n)nN, (b2,i,n)nN, and

(c2,i,n)nN be sequences of square integrable Gi-valued random vectors.

Further-more, set β=max{ν0, ν1, . . . , νm} + m i=1 Li2, (4.36)

let x0be a square integrableH-valued random vector, and, for every i ∈ {1, . . . ,m}, letvi,0 be a square integrableGi-valued random vector, let ε ∈]0,1/(1+β)[, let

(γn)nNbe a sequence in[ε, (1−ε)/β]. Set (∀n∈N) ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ y1,n=xnγnh(xn)+ m i=1Livi,n+a1,n p1,n=proxγnf(y1,n+γnz)+b1,n for i =1, . . . ,m ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ y2,i,n=vi,n+γn Lixn− ∇i(vi,n)+a2,i,n p2,i,n=proxγngi(y2,i,nγnri)+b2,i,n q2,i,n= p2,i,n+γn Lip1,n− ∇∗i(p2,i,n)+c2,i,n vi,n+1=vi,ny2,i,n+q2,i,n q1,n= p1,nγnh(p1,n)+mi=1Lip2,i,n+c1,n xn+1=xny1,n+q1,n. (4.37)

Suppose that the following conditions hold forFn=σ ((xk, (vi,k)1≤im)0≤kn,

⎧ ⎪ ⎨ ⎪ ⎩ nN E[|||(a1,n, (a2,i,n)1im)|||2|Fn]<+∞ n∈N E[(b1,n, (b2,i,n)1≤im)|||2|Fn]<+∞ nN E[|||(c1,n, (c2,i,n)1im)|||2|Fn]<+∞. (4.38)

(22)

(i) nNE[xnp1,n2|Fn] < +∞and (∀i ∈ {1, . . . ,m}) nNE[vi,n

p2,i,n2|Fn]<+∞.

(ii) There exist a P1-valued random vector x and a D1-valued random vector

(v1, . . . , vm)such that the following hold.

(a) xnx andproxγnf(xnγn(∇h(xn)+

m i=1Livi,n)+γnz) xP-a.s. (b) (∀i ∈ {1, . . . ,m}) vi,n vi and proxγgi(vi,n+γn Lixn − ∇∗i(vi,n)γnri) vi P-a.s.

(c) Suppose that f orh is uniformly convex at x(ω)for everyωwith

P() = 1, then xnx andproxγnf(xnγn(∇h(xn)+mi=1Livi,n)+

γnz)xP-a.s.

(d) Suppose that gj orj is uniformly convex atvj(ω)for everyωwith

P() = 1, for some j ∈ {1, . . . ,m}, thenvj,nvj and proxγngj(vj,n+

γn

Ljxn− ∇j(vj,n)γnrj)vj P-a.s.

Proof Using the same argument as in the proof [11, Theorem 4.2], the conclusions

follows from Corollary4.2.

Remark 4.5 Here are some comments.

(i) By using Remark3.1, an extension of Corollary4.2to the variable metric setting is straightforward.

(ii) Almost sure convergence for some primal-dual splitting methods solving compos-ite monotone inclusions and composcompos-ite minimization problems are also presented in [12,19].

(iii) In the deterministic setting and in the case when eachkis the indicator function

of{0}, and(∀k∈ {1, . . . ,m})rk =0, andz=0, a preconditioned algorithm for

solving (4.33) can be found in [22].

Acknowledgments I thank Professor Patrick L. Combettes for helpful discussions. I thank the referees for their suggestions and correction which helped to improve the first version of the manuscript. This work is funded by Vietnam National Foundation for Science and Technology Development (NAFOSTED) under Grant No. 102.01-2014.02.

References

1. Attouch, H., Briceño-Arias, L.M., Combettes, P.L.: A parallel splitting method for coupled monotone inclusions. SIAM J. Control Optim.48, 3246–3270 (2010)

2. Bo¸t, R.I., Hendrich, C.: An algorithm for solving monotone inclusions involving parallel sums of linearly composed maximally monotone operators (2013).arXiv:1306.3191

3. Briceño-Arias, L.M., Combettes, P.L.: A monotone+skew splitting model for composite monotone inclusions in duality. SIAM J. Optim.21, 1230–1250 (2011)

4. Briceño-Arias, L.M., Combettes, P.L.: Monotone operator methods for Nash equilibria in non-potential games. In: Bailey, D., Bauschke, H.H., Borwein, P., Garvan, F., Théra, M., Vanderwerff, J., Wolkowicz, H. (eds.) Computational and Analytical Mathematics. Springer, New York (2013)

5. Bauschke, H.H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces. Springer, New York (2011)

6. Barty, K., Roy, J.-S., Strugarek, C.: Hilbert-valued perturbed subgradient algorithms. Math. Oper. Res. 32, 551–562 (2007)

7. Bennar, A., Monnez, J.-M.: Almost sure convergence of a stochastic approximation process in a convex set. Int. J. Appl. Math.20, 713–722 (2007)

(23)

8. Combettes, P.L.: Quasi-Fejérian analysis of some optimization algorithms. In: Butnariu, D., Censor, Y., Reich, S. (eds.) Inherently Parallel Algorithms in Feasibility and Optimization and their Applications, pp. 115–152. Elsevier, New York (2001)

9. Combettes, P.L.: Solving monotone inclusions via compositions of nonexpansive averaged operators. Optimization53, 475–504 (2004)

10. Combettes, P.L.: Systems of structured monotone inclusions: duality, algorithms, and applications. SIAM J. Optim.23, 2420–2447 (2013)

11. Combettes, P.L., Pesquet, J.-C.: Primal-dual splitting algorithm for solving inclusions with mixtures of composite, Lipschitzian, and parallel-sum type monotone operators. Set-Valued Var. Anal.20, 307–330 (2012)

12. Combettes, P.L., Pesquet, J.-C.: Stochastic quasi-Fejér block-coordinate fixed point iterations with random sweeping, SIAM J. Optim (to appear)

13. Combettes, P.L., V˜u, B.C.: Variable metric quasi-Fejér monotonicity. Nonlinear Anal.78, 17–31 (2013) 14. Combettes, P.L., V˜u, B.C.: Variable metric forward-backward splitting with applications to monotone

inclusions in duality. Optimization63, 1289–1318 (2014)

15. D˜ung, D., V˜u, B.C.: A splitting algorithm for system of composite monotone inclusions, Vietnam J. Maths. (2014, to appear)

16. Ermol’ev, Y.M.: On the method of generalized stochastic gradients and quasi-Fejér sequences. Cyber-netics5, 208–220 (1969)

17. Jiang, H., Xu, H.: Stochastic approximation approaches to the stochastic variational inequality problem. IEEE Trans. Automat. Control53, 1462–1475 (2008)

18. Juditsky, A., Nemirovski, A., Tauvel, C.: Solving variational inequalities with stochastic mirror-prox algorithm. Stoch. Syst.1, 17–58 (2011)

19. Pesquet, J.-C., Repetti, A.: A class of randomized primal-dual algorithms for distributed optimization (2014).arXiv:1406.6404

20. Polyak, B.: Introduction to Optimization. Optimization Software, Inc., New York (1987)

21. Rosasco, L., Villa, S., V˜u, B.C.: A stochastic forward-backward splitting method for solving monotone inclusions in Hilbert spaces (2014, preprint).arXiv:1403.7999

22. Repetti, A., Chouzenoux, E., Pesquet, J.-C.: A penalized weighted least squares approach for restoring data corrupted with signal-dependent noise. In: Proceedings of the 20th European Signal Processing (SIPCO 2012), Aug 27–31, pp. 1553–1557. Bucharest, Romania (2012)

23. Robbins, H., Siegmund, D.: A convergence theorem for non negative almost supermartingales and some applications. In: Rustagi, J.S. (ed.) Optimizing Methods in Statistic, pp. 233–257. Academic Press, New York (1971)

24. Tseng, P.: A modified forward-backward splitting method for maximal monotone mappings. SIAM J. Control Optim.38, 431–446 (2000)

25. V˜u, B.C.: A variable metric extension of the forward-backward-forward algorithm for monotone oper-ators. Numer. Funct. Anal. Optim.34, 1050–1065 (2013)

26. Yousefian, F., Nedi´c, A., Shanbhag, U.V.: On stochastic gradient and subgradient methods with adaptive steplength sequences. Automatica48, 56–67 (2012)

References

Related documents

We find that firm-specific training leads to a significant reduction in on-the-job search activities of Swiss workers and a weaker but still significant impact of

The objectives of this dissertation are to: (1) understand international entrepreneurship activities of SMEs in Thailand in terms of their antecedents and consequences; (2)

'SL resembles the challenges of a real brand manager in many ways : Inventing a product that fits a determined target market (students) and to think. about POPs

The objective of this work is to evaluate the effect of the modernization work carried out at Rio Grande Harbor access channel on a specific sand spit called Pontal Sul as

One of the pyroclastic unilS of the Alban Hills, the «Tufo di Villa Senni» has been dated with extremely good accuracy lit 338 ± 8 ka and it is 10 be considered as the best

We introduce a new class of coupled sequential fractional differential equations involving Riemann–Liouville and Caputo derivatives, integral and nonintegral type nonlinearities

The second chapter illustrates the influence of the tax policies to the development of PE, and then summaries the inward type (including the patterns of Trust, Company

Improving global shipping corporations' accountability, by compelling them to disclose information regarding their GHG emissions reduction performance is a market based mechanism,