Convex Constraints and Related Operators - Iterative regularization methods for the solution of

SFP and Projections onto Affine Subspaces

In this chapter we develop and discuss the iteration methods for the solution of the split feasibility problem (SFP) and the computation of metric and Bregman projections onto affine subspaces. At first we examine what operators may be used in the iterative process to handle different kinds of constraints appearing in the SFP. The ones related to constraints in the range of a linear operator depend on a positive parameter which in general has to be chosen a posteriori.

In section 2.2 we show how these parameters can be chosen in case of exact as well as approximate data to ensure convergence of the methods. In case of approximate or noisy data this choice is linked to a discrepancy principle.

The iteration methods for the SFP are analyzed in section 2.3. They produce sequences which in general have weak accumulation points that are solutions of the SFP. In the following section we show that the same iterative scheme can be used to compute metric and Bregman projections onto affine subspaces that are given via the nullspace or the range of a linear operator. For this case we can even prove strong convergence. In the last two sections we are concerned with possibilities to efficiently implement the methods. We show that the choice of parameters can be replaced by line searches and propose generalized conjugate gradient and sequential subspace methods for the computation of projections onto affine subspaces in case of exact data.

2.1 Convex Constraints and Related Operators

We intend to examine a little more closely the operators we will deal with.

First we recall some facts about linear operators [25, 47]. By $\mathcal{L}(X, Y)$ we denote the Banach space of all continuous linear operators $A : X \longrightarrow Y$ endowed with the operator norm

\[
\|A\| := \sup_{\|x\| \le 1} \|Ax\| . \tag{2.1}
\]

The dual operator $A^* \in \mathcal{L}(Y^*, X^*)$ is defined by

\[
\langle A^* y^* \mid x \rangle := \langle y^* \mid Ax \rangle \quad \text{for all } x \in X,\ y^* \in Y^* \tag{2.2}
\]

and the equalities $\|A^*\| = \|A\|$ and $\mathcal{N}(A^*) = \mathcal{R}(A)^\perp$ are valid. In case $X$ is reflexive we also have $\mathcal{N}(A) = \mathcal{R}(A^*)^\perp$ and $\mathcal{N}(A)^\perp = \overline{\mathcal{R}(A^*)}$, and in case $Y$ is reflexive we also have $\mathcal{N}(A^*)^\perp = \overline{\mathcal{R}(A)}$. An operator $A \in \mathcal{L}(X, Y)$ is called compact if the image $A(B_X)$ of the unit ball $B_X$ of $X$ is a relatively compact subset of $Y$. It is a fact that $A$ is compact iff $A^*$ is compact and that a compact operator $A$ is weak-to-norm continuous, i.e. if $(x_n)_n$ is a sequence in $X$ which converges weakly to some $x \in X$, then $(Ax_n)_n$ converges strongly to $Ax$.
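The defining relation (2.2) and the equality of the norms of $A$ and $A^*$ can be checked by hand in finite dimensions. The following is a minimal sketch, assuming $X = \mathbb{R}^3$ and $Y = \mathbb{R}^2$ are both equipped with the maximum norm, so that the dual spaces carry the 1-norm and $A^*$ is the transposed matrix. The matrix, the test vectors and all function names are hypothetical choices made for illustration only.

```python
# Assumption: X = (R^3, max-norm), Y = (R^2, max-norm); the duals then carry
# the 1-norm and the dual operator A^* is represented by the transpose.

def matvec(M, v):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(m * t for m, t in zip(row, v)) for row in M]

def transpose(M):
    return [list(col) for col in zip(*M)]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def norm_inf_to_inf(M):
    # sup of ||Mx||_inf over ||x||_inf <= 1 is the largest absolute row sum
    return max(sum(abs(m) for m in row) for row in M)

def norm_one_to_one(M):
    # sup of ||My||_1 over ||y||_1 <= 1 is the largest absolute column sum
    return max(sum(abs(m) for m in col) for col in zip(*M))

A = [[1.0, -2.0, 0.5],
     [0.0, 3.0, -1.0]]
A_star = transpose(A)

x = [0.3, -1.0, 2.0]        # element of X
y_star = [1.5, -0.5]        # element of Y^*

lhs = dot(matvec(A_star, y_star), x)   # <A^* y^* | x>
rhs = dot(y_star, matvec(A, x))        # <y^* | A x>

norm_A = norm_inf_to_inf(A)            # ||A|| as operator X -> Y
norm_A_star = norm_one_to_one(A_star)  # ||A^*|| as operator Y^* -> X^*
```

Both pairings agree up to rounding, and the two operator norms coincide because the columns of the transpose are the rows of the original matrix.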

From now on we assume that $X$ is a smooth and uniformly convex Banach space with a (bijective) duality mapping $J_X$ with gauge function $t \mapsto t^{p-1}$. If $J_Y$ is a set-valued duality mapping of another Banach space $Y$ and we write "$J_Y(y)$" for some $y \in Y$, then we mean that $J_Y(y)$ is allowed to be any element in the set $J_Y(y)$. The additional assumptions in the following definition will be used for the different kinds of constraints in case of exact and approximate data.
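For orientation, the duality mapping can be written down explicitly in the simplest smooth and uniformly convex example. The sketch below assumes $X = \mathbb{R}^n$ with the $p$-norm, $1 < p < \infty$, where the duality mapping with gauge $t \mapsto t^{p-1}$ acts componentwise as $x_i \mapsto |x_i|^{p-1}\,\mathrm{sign}(x_i)$ and is inverted by the corresponding mapping of the dual space with the conjugate exponent. The names `duality_map` and `p_norm` are hypothetical helpers, not notation from the text.

```python
import math

def duality_map(x, p):
    """Duality mapping of (R^n, p-norm) with gauge t -> t**(p-1), 1 < p < inf."""
    return [math.copysign(abs(t) ** (p - 1), t) for t in x]

def p_norm(x, p):
    return sum(abs(t) ** p for t in x) ** (1.0 / p)

p = 1.5
p_dual = p / (p - 1)          # conjugate exponent: 1/p + 1/p_dual = 1
x = [2.0, -1.0, 0.0, 0.5]

jx = duality_map(x, p)                        # element of the dual space
pairing = sum(a * b for a, b in zip(jx, x))   # <J(x) | x>
x_back = duality_map(jx, p_dual)              # dual-space mapping inverts J
```

The two defining properties of a duality mapping with this gauge are that the pairing of $J(x)$ with $x$ equals $\|x\|^p$ and that the dual norm of $J(x)$ equals $\|x\|^{p-1}$. Applying the dual mapping recovers `x` up to rounding, which illustrates the bijectivity assumed above.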

Definition 2.1. We call assumption

(A, Q) Given are: a uniformly smooth and uniformly convex Banach space $Y$ with duality mapping $J_Y$ (with gauge function $t \mapsto t^{r-1}$), a compact operator $0 \neq A \in \mathcal{L}(X, Y)$, a set $Q \in \mathcal{C}(Y)$ and a constant $\gamma \in (0, 1)$. The set

\[
M_{Ax \in Q} := \{ x \in X \mid Ax \in Q \}
\]

is not empty.

(A, y) Given are: an arbitrary Banach space $Y$ with duality mapping $J_Y$ (with gauge function $t \mapsto t^{r-1}$), an operator $0 \neq A \in \mathcal{L}(X, Y)$, an element $y \in Y$ and a constant $\gamma \in (0, 1)$. The set

\[
M_{Ax = y} := \{ x \in X \mid Ax = y \}
\]

is not empty.

(A, y, +) Given are: an arbitrary Banach lattice $Y$ with positive duality mapping $J_+$, an operator $0 \neq A \in \mathcal{L}(X, Y)$, an element $y \in Y$ and a constant $\gamma \in (0, 1)$. The set

\[
M_{Ax \le y} := \{ x \in X \mid Ax \le y \}
\]

is not empty.

($C_i$) In addition to assumption (C) a constant $\beta \in (0, 1)$ and convex sets $C_i \in \mathcal{C}(X)$ are given with

\[
d_m(C, C_i) \le \epsilon_i^m
\]

and

\[
\lim_{i \to \infty} \epsilon_i^m = 0 \quad \text{for all } m \in \mathbb{N} .
\]

($A_j$, $Q_k$) In addition to assumption (A, Q) a constant $\beta \in (0, 1)$, compact operators $0 \neq A_j \in \mathcal{L}(X, Y)$ and sets $Q_k \in \mathcal{C}(Y)$ are given with

\[
\|A - A_j\| \le \eta_j \le \eta_{j-1} , \qquad d_m(Q, Q_k) \le \delta_k^m \le \delta_{k-1}^m
\]

and

\[
\lim_{j \to \infty} \eta_j = 0 , \qquad \lim_{k \to \infty} \delta_k^m = 0 \quad \text{for all } m \in \mathbb{N} .
\]

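($A_j$, $y_k$) In addition to assumption (A, y) a constant $\beta \in (0, 1)$, operators $0 \neq A_j \in \mathcal{L}(X, Y)$ and elements $y_k \in Y$ are given with

\[
\|A - A_j\| \le \eta_j \le \eta_{j-1} , \qquad \|y - y_k\| \le \delta_k \le \delta_{k-1}
\]

and

\[
\lim_{j \to \infty} \eta_j = 0 , \qquad \lim_{k \to \infty} \delta_k = 0 .
\]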

($A_j$, $y_k$, +) In addition to assumption (A, y, +) the same holds as under assumption ($A_j$, $y_k$).

Under assumption (C) we define the operator $T_C : X \longrightarrow X$ by

\[
T_C(x) := \Pi_C^p(x) . \tag{2.3}
\]

Under assumption (A, Q) we define for $\mu > 0$ the operators $T^\mu_{A,Q,\Pi}, T^\mu_{A,Q,P} : X \longrightarrow X$ by

\[
T^\mu_{A,Q,\Pi}(x) := J_X^* \Big( J_X(x) - \mu A^* \big( J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \big) \Big) \tag{2.4}
\]

and

\[
T^\mu_{A,Q,P}(x) := J_X^* \Big( J_X(x) - \mu A^* J_Y\big( Ax - P_Q(Ax) \big) \Big) , \tag{2.5}
\]

where $\Pi_Q^r$ is the Bregman projection and $P_Q$ is the metric projection onto $Q$ in $Y$. For $Q = \{y\}$ with some $y \in Y$ under assumption (A, y) we get the (possibly set-valued) operator $T^\mu_{A,y} : X \longrightarrow 2^X$ with

\[
T^\mu_{A,\{y\},P}(x) = T^\mu_{A,y}(x) := J_X^* \big( J_X(x) - \mu A^* J_Y(Ax - y) \big) . \tag{2.6}
\]

Under assumption (A, y, +) we define for $\mu > 0$ the (possibly set-valued) operator $T^\mu_{A,y,+} : X \longrightarrow 2^X$ by

\[
T^\mu_{A,y,+}(x) := J_X^* \Big( J_X(x) - \mu A^* J_+\big( (Ax - y)^+ \big) \Big) . \tag{2.7}
\]

In Hilbert spaces $T^\mu_{A,y}$ and $T^\mu_{A,Q,P}$ are just the familiar operators $T^\mu_{A,y}(x) = x - \mu A^*(Ax - y)$ and $T^\mu_{A,Q}(x) = x - \mu A^*\big(Ax - P_Q(Ax)\big)$, which appear in the ordinary Landweber methods and the CQ algorithm for the SFP. Operator $T^\mu_{A,Q,\Pi}$ may also be useful in the context of more general Bregman projections.
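The Hilbert space forms of these two operators are easy to try out numerically. The following is a minimal sketch, assuming $X = Y = \mathbb{R}^2$ with the Euclidean norm, a box $Q = [0, 1]^2$ and a fixed step $\mu$ below $2 / \|A\|^2$, which is the usual Hilbert space step-size condition for Landweber and CQ iterations and not the a posteriori parameter choice developed in this chapter. The matrix, the data and the function names are hypothetical.

```python
def matvec(M, v):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(m * t for m, t in zip(row, v)) for row in M]

def transpose(M):
    return [list(col) for col in zip(*M)]

def project_box(v, lo, hi):
    """Metric projection onto the box Q = [lo, hi]^m (componentwise clipping)."""
    return [min(max(t, lo), hi) for t in v]

def landweber_step(A, y, x, mu):
    """One application of x -> x - mu * A^T (A x - y)."""
    residual = [a - b for a, b in zip(matvec(A, x), y)]
    grad = matvec(transpose(A), residual)
    return [xi - mu * gi for xi, gi in zip(x, grad)]

def cq_step(A, x, mu, lo, hi):
    """One application of x -> x - mu * A^T (A x - P_Q(A x))."""
    Ax = matvec(A, x)
    residual = [a - q for a, q in zip(Ax, project_box(Ax, lo, hi))]
    grad = matvec(transpose(A), residual)
    return [xi - mu * gi for xi, gi in zip(x, grad)]

A = [[2.0, 0.0], [0.0, 1.0]]   # diagonal, spectral norm 2, so ||A||^2 = 4
mu = 0.4                        # assumed fixed step, below 2 / ||A||^2 = 0.5

x_lw = [0.0, 0.0]               # Landweber iterate for A x = (2, 1)
x_cq = [3.0, -4.0]              # CQ iterate for A x in [0, 1]^2
for _ in range(200):
    x_lw = landweber_step(A, [2.0, 1.0], x_lw, mu)
    x_cq = cq_step(A, x_cq, mu, 0.0, 1.0)

Ax_cq = matvec(A, x_cq)
dist_to_Q = max(abs(a - q) for a, q in zip(Ax_cq, project_box(Ax_cq, 0.0, 1.0)))
```

In this toy setting the Landweber iterates approach the solution $(1, 1)$ of $Ax = y$, and the CQ iterates approach a point whose image under $A$ lies in the box.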

For an operator $T : X \longrightarrow 2^X$ we denote by

\[
\mathrm{Fix}(T) := \{ x \in X \mid x \in T(x) \}
\]

the set of all fixed points of $T$ and by

\[
\text{S-Fix}(T) := \{ x \in X \mid x = T(x) \}
\]

the set of all strong fixed points of $T$. Obviously $\text{S-Fix}(T) \subset \mathrm{Fix}(T)$, and if $T$ is single-valued then these sets coincide.

Proposition 2.2.

(a) Under assumption (C) we have

\[
\mathrm{Fix}(T_C) = C .
\]

(b) Under assumption (A, Q) and for all $\mu > 0$ we have

\[
\mathrm{Fix}(T^\mu_{A,Q,\Pi}) = \mathrm{Fix}(T^\mu_{A,Q,P}) = M_{Ax \in Q} .
\]

(c) Under assumption (A, y) and for all $\mu > 0$ we have

\[
\mathrm{Fix}(T^\mu_{A,y}) = \text{S-Fix}(T^\mu_{A,y}) = M_{Ax = y} .
\]

(d) Under assumption (A, y, +) and for all $\mu > 0$ we have

\[
\mathrm{Fix}(T^\mu_{A,y,+}) = \text{S-Fix}(T^\mu_{A,y,+}) = M_{Ax \le y} .
\]

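Proof. (a) is just 1.26 (a). To prove (b), let $x \in M_{Ax \in Q}$. Then we have $\Pi_Q^r(Ax) = Ax = P_Q(Ax)$ and thus $T^\mu_{A,Q,\Pi}(x) = x = T^\mu_{A,Q,P}(x)$. Hence $M_{Ax \in Q} \subset \mathrm{Fix}(T^\mu_{A,Q,\Pi})$ and $M_{Ax \in Q} \subset \mathrm{Fix}(T^\mu_{A,Q,P})$. Conversely for $x \in \mathrm{Fix}(T^\mu_{A,Q,\Pi})$ we get

\[
\begin{aligned}
x = T^\mu_{A,Q,\Pi}(x)
&\Leftrightarrow J_X(x) = J_X(x) - \mu A^* \big( J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \big) \\
&\Leftrightarrow A^* \big( J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \big) = 0 .
\end{aligned}
\]

Since $M_{Ax \in Q}$ is supposed to be non-empty, we take some $z \in X$ with $Az \in Q$ and get

\[
\begin{aligned}
0 &= \big\langle A^* \big( J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \big) \,\big|\, x - z \big\rangle \\
&= \big\langle J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \,\big|\, Ax - Az \big\rangle \\
&= \big\langle J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \,\big|\, Ax - \Pi_Q^r(Ax) \big\rangle + \big\langle J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \,\big|\, \Pi_Q^r(Ax) - Az \big\rangle \\
&\ge \big\langle J_Y(Ax) - J_Y(\Pi_Q^r(Ax)) \,\big|\, Ax - \Pi_Q^r(Ax) \big\rangle ,
\end{aligned}
\]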

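because of the validity of the variational inequality (1.30) for $\Pi_Q^r(Ax)$ and $Az \in Q$. Since the right-hand side is nonnegative by the monotonicity of $J_Y$ and $Y$ is strictly convex by 1.14 (a), the above inequality gives $Ax = \Pi_Q^r(Ax) \in Q$. The inclusion $\mathrm{Fix}(T^\mu_{A,Q,P}) \subset M_{Ax \in Q}$ can be shown similarly. In (c) it suffices to show

\[
\mathrm{Fix}(T^\mu_{A,y}) \subset M_{Ax = y} \subset \text{S-Fix}(T^\mu_{A,y}) ,
\]

because $\text{S-Fix}(T^\mu_{A,y}) \subset \mathrm{Fix}(T^\mu_{A,y})$. If $x \in M_{Ax = y}$ then we have $J_Y(Ax - y) = 0$ and it follows that $M_{Ax = y} \subset \text{S-Fix}(T^\mu_{A,y})$. Conversely for $x \in \mathrm{Fix}(T^\mu_{A,y})$ we find some $u^* \in J_Y(Ax - y)$, again written as $J_Y(Ax - y)$ according to the convention above, such that

\[
x = T^\mu_{A,y}(x) \;\Leftrightarrow\; J_X(x) = J_X(x) - \mu A^* J_Y(Ax - y) \;\Leftrightarrow\; A^* J_Y(Ax - y) = 0 .
\]

Since $M_{Ax = y}$ is supposed to be non-empty, we take some $z \in X$ with $Az = y$ and get

\[
0 = \langle A^* J_Y(Ax - y) \mid x - z \rangle = \langle J_Y(Ax - y) \mid Ax - y \rangle = \|Ax - y\|^r ,
\]

which gives $Ax = y$ and thus $\mathrm{Fix}(T^\mu_{A,y}) \subset M_{Ax = y}$. In (d) it again suffices to show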

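\[
\mathrm{Fix}(T^\mu_{A,y,+}) \subset M_{Ax \le y} \subset \text{S-Fix}(T^\mu_{A,y,+}) .
\]

If $Ax \le y$ then we get $(Ax - y)^+ = 0$ and thus $J_+\big((Ax - y)^+\big) = 0$, which yields $T^\mu_{A,y,+}(x) = x$. Hence $M_{Ax \le y} \subset \text{S-Fix}(T^\mu_{A,y,+})$. Conversely for $x \in \mathrm{Fix}(T^\mu_{A,y,+})$ we find some $u^* \in J_+\big((Ax - y)^+\big)$ such that

\[
x = J_X^* \big( J_X(x) - \mu A^* u^* \big) \;\Leftrightarrow\; J_X(x) = J_X(x) - \mu A^* u^* \;\Leftrightarrow\; A^* u^* = 0 .
\]

Since $M_{Ax \le y}$ is supposed to be non-empty, we find some $z \in X$ with $y - Az \ge 0$ and by the properties of the positive duality mapping 1.21 we get

\[
\begin{aligned}
0 &= \langle A^* u^* \mid x - z \rangle \\
&= \langle u^* \mid Ax - y \rangle + \langle u^* \mid y - Az \rangle \\
&= \langle u^* \mid (Ax - y)^+ \rangle + \langle u^* \mid y - Az \rangle \\
&= \|(Ax - y)^+\|^2 + \langle u^* \mid y - Az \rangle \\
&\ge \|(Ax - y)^+\|^2 ,
\end{aligned}
\]

from which we infer that $Ax - y \le 0$ and thus $\mathrm{Fix}(T^\mu_{A,y,+}) \subset M_{Ax \le y}$. □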
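Proposition 2.2 (d) can be observed directly in a Euclidean toy example. The sketch below assumes $X = Y = \mathbb{R}^2$ with the Euclidean norm and the componentwise order, and it further assumes that the positive duality mapping then acts as the identity on positive elements, so that (2.7) reduces to $x \mapsto x - \mu A^T (Ax - y)^+$. The matrix, the data and the name `t_plus` are hypothetical.

```python
def matvec(M, v):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(m * t for m, t in zip(row, v)) for row in M]

def transpose(M):
    return [list(col) for col in zip(*M)]

def positive_part(v):
    """Componentwise positive part v^+ in the lattice R^m."""
    return [max(t, 0.0) for t in v]

def t_plus(A, y, x, mu):
    """Assumed Euclidean analogue of (2.7): x -> x - mu * A^T (A x - y)^+."""
    violation = positive_part([a - b for a, b in zip(matvec(A, x), y)])
    grad = matvec(transpose(A), violation)
    return [xi - mu * gi for xi, gi in zip(x, grad)]

A = [[1.0, 1.0], [1.0, -1.0]]
y = [1.0, 0.0]
mu = 0.1                      # any positive value works for the fixed point check

x_feasible = [0.2, 0.5]       # A x = (0.7, -0.3) <= y componentwise
x_infeasible = [1.0, 2.0]     # A x = (3, -1), the first constraint is violated

fixed = t_plus(A, y, x_feasible, mu)
moved = t_plus(A, y, x_infeasible, mu)
```

The feasible point is returned unchanged, while the infeasible point is moved, in line with the statement that the fixed points are exactly the solutions of $Ax \le y$ for every $\mu > 0$.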

The operators are also linked to subdifferentials of certain functionals.

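Proposition 2.3. We assume (A, Q), (A, y) or (A, y, +) and accordingly define the functions $f_{A,Q,P}, f_{A,y}, f_{A,y,+} : X \longrightarrow \mathbb{R}$ by

\[
f_{A,Q,P}(x) := \frac{1}{r} \|Ax - P_Q(Ax)\|^r , \qquad f_{A,y}(x) := \frac{1}{r} \|Ax - y\|^r , \qquad f_{A,y,+}(x) := \frac{1}{2} \|(Ax - y)^+\|^2 .
\]

Then we have for all $x \in X$

\[
A^* J_Y\big( Ax - P_Q(Ax) \big) \subset \partial f_{A,Q,P}(x) , \qquad A^* J_Y(Ax - y) \subset \partial f_{A,y}(x) , \qquad A^* J_+\big( (Ax - y)^+ \big) \subset \partial f_{A,y,+}(x) .
\]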

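Proof. The assertions for $f_{A,y}$ and $f_{A,y,+}$ follow immediately by 1.12 and 1.22.

We prove the assertion for $f_{A,Q,P}$. For all $x, y \in X$ we get by 1.12 and the variational inequality for the metric projection (1.16)

\[
\begin{aligned}
f_{A,Q,P}(y) - f_{A,Q,P}(x)
&= \frac{1}{r} \|Ay - P_Q(Ay)\|^r - \frac{1}{r} \|Ax - P_Q(Ax)\|^r \\
&\ge \big\langle J_Y\big( Ax - P_Q(Ax) \big) \,\big|\, \big( Ay - P_Q(Ay) \big) - \big( Ax - P_Q(Ax) \big) \big\rangle \\
&= \big\langle A^* J_Y\big( Ax - P_Q(Ax) \big) \,\big|\, y - x \big\rangle + \big\langle J_Y\big( Ax - P_Q(Ax) \big) \,\big|\, P_Q(Ax) - P_Q(Ay) \big\rangle \\
&\ge \big\langle A^* J_Y\big( Ax - P_Q(Ax) \big) \,\big|\, y - x \big\rangle .
\end{aligned}
\]

□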
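The subgradient inequality of Proposition 2.3 for $f_{A,Q,P}$ can be spot-checked numerically in a Hilbert space setting. The following is a minimal sketch, assuming $r = 2$, Euclidean spaces $X = \mathbb{R}^2$ and $Y = \mathbb{R}^3$ (so that $J_Y$ is the identity) and a box $Q = [0, 1]^3$. The matrix, the sampling range and the function names are hypothetical, and a random test of this kind is only a sanity check, not a proof.

```python
import random

def matvec(M, v):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(m * t for m, t in zip(row, v)) for row in M]

def transpose(M):
    return [list(col) for col in zip(*M)]

def project_box(v, lo, hi):
    """Metric projection onto the box Q = [lo, hi]^m."""
    return [min(max(t, lo), hi) for t in v]

def f(A, x, lo, hi):
    """f(x) = 1/2 * ||A x - P_Q(A x)||^2, the case r = 2."""
    Ax = matvec(A, x)
    return 0.5 * sum((a - q) ** 2 for a, q in zip(Ax, project_box(Ax, lo, hi)))

def subgradient(A, x, lo, hi):
    """A^T (A x - P_Q(A x)), the Euclidean form of A^* J_Y(A x - P_Q(A x))."""
    Ax = matvec(A, x)
    residual = [a - q for a, q in zip(Ax, project_box(Ax, lo, hi))]
    return matvec(transpose(A), residual)

random.seed(0)
A = [[1.0, 2.0], [-1.0, 0.5], [0.0, 1.0]]
worst_gap = float("inf")
for _ in range(1000):
    x = [random.uniform(-3.0, 3.0) for _ in range(2)]
    y = [random.uniform(-3.0, 3.0) for _ in range(2)]
    g = subgradient(A, x, 0.0, 1.0)
    # gap = f(y) - f(x) - <g | y - x>, which should never be negative
    gap = f(A, y, 0.0, 1.0) - f(A, x, 0.0, 1.0) - sum(
        gi * (yi - xi) for gi, xi, yi in zip(g, x, y))
    worst_gap = min(worst_gap, gap)
```

The smallest observed gap stays nonnegative up to rounding, as the inequality chain in the proof predicts.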

We do not know whether operator $T^\mu_{A,Q,\Pi}$ is also linked to a subdifferential of a functional $f_{A,Q,\Pi}$. The canonical candidates $f_{A,Q,\Pi}(x) = \Delta_r\big( Ax, \Pi_Q^r(Ax) \big)$ or $f_{A,Q,\Pi}(x) = \Delta_r\big( \Pi_Q^r(Ax), Ax \big)$ do not seem to work (do they?). However by 1.24 (e) for fixed $z \in X$ we have

\[
A^* \big( J_Y(Az) - J_Y(\Pi_Q^r(Az)) \big) \subset \partial f_{A,Q,\Pi_Q^r(Az)}(z)
\]

with

\[
f_{A,Q,\Pi_Q^r(Az)}(x) = \Delta_r\big( \Pi_Q^r(Az), Ax \big) , \quad x \in X .
\]

In document Iterative regularization methods for the solution of the split feasibility problem in Banach spaces (Page 51-57)