Optimization problems for set functions

(1)

OPTIMIZATION PROBLEMS FOR SET FUNCTIONS

SLAWOMIR DOROSIEWICZ

(Received 16 June 2000)

Abstract.This paper gives the formal definition of a class of optimization problems, that is, problems of finding conditional extrema of given set-measurable functions. It also formulates the generalization of Lyapunov convexity theorem which is used in the proof of first-order optimality conditions for the mentioned class of optimization problems.

2000 Mathematics Subject Classiﬁcation. 90C48.

1. Introduction. This paper concentrates on the analysis of certain class of opti-mization problems, namely the problems of finding conditional extrema of set func-tions. This type of problems has been shown to arise in interval and pointwise estima-tion of probability distribuestima-tion, testing hypothesis (in particular during the construc-tion of the strongest tests)—proof of Neyman-Pearson lemma (cf. [3, 4, 10]). These problems also appear in analysis of some transportation problems, especially com-putations of trafficassignment in transportation networks including finding optimal systems of regular lines (see [7]). In future, one can expect the widening possibilities of applications of this class of optimization problems.

The considerations are divided into four parts. The ﬁrst part consists of general re-marks on set functions and optimization problems for these functions. Sections3and

4have an auxiliary character. They include some special properties of bounded mea-sures (the generalization of Lapunov convexity theorem (Lemma 3.1andTheorem 3.2) and the definition of differentiability of set functions). The most important results are presented inSection 5. There are theorems and some corollaries which formulate the properties of solutions of optimization problems for set functions. These necessary conditions for optimality seem to be the generalization of the results formulated in [5,6,9,11,12]. The mentioned theorems are similar to the well-known Kuhner-Tucker first-order optimality conditions. The main difficulty to obtain these results is caused by the structure of the union of feasible solutions—this family does not usually have the useful structures: compactness, convexity, the structure of a Banach space.

2. Optimization problems for set functions. Let(X,M,µ)be a measurable space with bounded measureµ:M→R.

Deﬁnition2.1. Every map (for a givenk∈NMk _{denotes the cartesian product}

M×···×M

nfactors

(2)

Example2.2. We consider theµ-integrable functionsfi:X→R(i=1,...,k). An

important family consists of set functions

F:Mk _→_R_, _F(S)₌k i=1

Si

fidµ. (2.1)

The obvious generalization is the following:

F:Mk _→_R_, _F(S)₌_u

S1f1dµ,...,

Sk

fkdµ

, (2.2)

whereu:Rk_→_R_{is given.}

We formulate the general form of optimization problem for a set function. This problem relies on ﬁnding extrema (without loss of generality—minima) of a given set functionF0:Mk_→_R_:

F0(S) →min, (2.3)

under some constraints (conditions). They will have two main forms:

• constraints deﬁned by other set functionsFi:Mk→R, wherei=1,...,s

Fi(S)≤0 (i=1,...,s); (2.4)

• constraints imposed directly on the sets, or equivalently on characteristic func-tions of these sets

χS(x)∈v(x) forµ-almost all (a.a.)x∈X, (2.5)

wherev:X→2{0,1}k

is a given function taking values in the family of all subsets of{0,1}k_,_χ

Sdenotes the vector of characteristic functions of the elementS∈

Mk_{, if}_S₌_(S1,...,S

k), thenχS=(χS1,...,χSk)and

χSj(x)=  

1 if_{0 if}x_x∈_∈S_Xj₋, _S_j_. (2.6)

The notions: feasible solution and optimal one can be deﬁned in a standard way. The feasible solution is an elementS∈Mk_{satisfying conditions (}_2.4_{) and (}_2.5_{); an optimal}

solution is feasible which minimizes the value of (2.3) among other feasible solutions. To investigate the properties of optimal solutions of (2.3), (2.4), and (2.5) we need some special properties of measures and set functions.

3. Some properties of bounded measure. This part provides the proof of some properties of bounded measures. It follows that a wide class of optimization prob-lems with set functions is equivalent to the probprob-lems of convex mathematical pro-gramming in Euclidean space and can be examined using appropriate methods of convex analysis.

Let(X,M,µ)be a space with bounded measure. ForS=(S1,...,Sk)∈MkandU=

(3)

Lemma3.1. Suppose that

(1) µis bounded and nonatomic;

(2) Cis a (totally) unimodular matrix;

(3) for allx∈Xthe system of equations

Cy=χU(x) (3.1)

is consistent. The set

W=µS1,...,µSk

:S∈Mk_{, Cχ}

S=χU (3.2)

is compact and convex. Ifµis bounded (but not necessarily nonatomic), then the setW

is compact (but it may not be connected).

Proof. Suppose ﬁrst thatµis nonatomic. This part of the proof is similar to the proof of Lapunov convexity theorem and uses the ideas of Lindenstrauss (see [8]).

Without loss of generality, we assume thatµis scalar and nonnegative. Let

ᏸ= k

i=1

L∞_(X,M,µ),

K=g=g1,...,gk∈ᏸ: 0≤gi≤1, Cg=χU,

J:ᏸ →Rk_, _J(g)₌

Xg1dµ,...,

Xgkdµ

.

(3.3)

The set ᏸ with the standard norm g =ki=1gi∞, where · ∞ is the norm in L∞_(X,M,µ)_{such that}_h_∞₌_inf_A_:_µ(A)₌₀_sup

x∈X−A|h(x)|is a Banach space.

The setKis an intersection of cartesian product ofksets{h∈L∞_(X,M,µ)_{: 0}_≤_h_≤ 1}and the set of functions satisfying the equationCy=χU. From Alaoglu theorem

(cf. [1,2]) and Tichonov theorem, we obtain that

k

i=1

g∈L∞_(X,M,µ)_{: 0}_≤_g_≤₁ _(3.4)

is compact in∗-weak topology ofᏸ. Because the set of solutions of (3.1) is closed, the setKis compact.

LetJi fori=1,...,kdenotes theith components of the mapJ. The functionsJi

are linear and continuous in the topology ofL∞_(X,M,µ)_{deﬁned by the norm}_·_∞. Dunford-Schwartz theorem implies thatJiare continuous in the∗-weak topology of

L∞_{(X,M,µ). Therefore the map}_J_{is continuous in the weak}_∗_{-topology of}_ᏸ_{. The set} J(K)⊂Rk_{is the image of a compact and convex set in the linear map, therefore it is}

also compact and convex.

We show thatKis nonempty. We denote by᏾x, for any givenx∈X, the set of

non-negative solutions of the system (3.1). Under the assumptions, forµ-a.a.x∈Xwe have ᏾x≠∅. Because{(x,z):x∈X, z∈᏾x} ∈M⊗β(Rk), then there exists a measurable

selectorv0of the family{᏾x:x∈X}(i.e., the mapv0:X→Rk, such that forµ-a.a.

x∈X,v0(x)∈᏾xholds). Deﬁnition ofKimplies thatv0∈K, thusKis nonempty. Of

(4)

andW⊂J(K), then it suﬃces to show that either the setJ−1₍_{_a_}₎_∩_K_{is empty or it} includes the mapχS for anyS∈Mk.

Fixa∈J(K). The setJ−1₍_{_a_}₎_∩_K_{is nonempty, compact, and convex. Krein-Millman} theorem follows that this set has at least one extreme point. Letf=(f1,...,fm)be

such a point. We will show that each component offtakes only values 0,1 (µ-a.e. onX). LetXC denote the set{y:Cy=0}. If dimXC=0, then for everyx∈X,᏾x

con-tains one element. This unique element is obviously the extreme point of᏾x. As the

matrixC is unimodular we have that every component ofr (x)∈᏾x is either 0 or

1. The selector v0 of the family {᏾x :x ∈X} (the mapv0(x)=r (x) for x∈ X)

can be written as v0 = χS for any S ∈ Mk. This means that Lemma 3.1 holds in

this case.

Suppose that for anyi∈ {1,...,k}, fi is not a characteristic function of any

mea-surable set. In this case, dimXC>0 and there exist* >0 and the setE0∈Msuch that

µ(E0) >0 and* < fi(x) <1−*for everyx∈E0. The unimodularity ofCimplies that

every component of the extreme point of᏾xis equal to 0 or 1. Therefore, for every

x∈E0, the vectorf (x)is not an extreme point of᏾x. Hence there exist a setE0⊂E0, a vectorc∈XC,c≠0, and a numberδ >0, such that

µE 0

>0, ∀x∈E 0,

λ∈[0,δ]⇒fi(x)+λc∈ +᏾x−᏾x,ex, (3.5)

where᏾x,exdenotes the set of extreme points of᏾x.

Becauseµis nonatomic, there exists a measurable setE1⊂E

0such that 0< µ(E1) < µ(E_{0). Choose the measurable sets} _G1_⊂_E1_and_G2_⊂_E

0−E1with positive measures and the numberss1,s2∈R(s1+s2>0), such that

maxs1,s2< δ;

s1µE1−2µG1+s2µE

0−E1−2µG2=0.

(3.6)

Consider the function

h=s1χE1−2χG1

+s2χE0−E1−2χG2

. (3.7)

It is easy to check thatXhdµ=0 and for suﬃciently smalls1,s2functionsf+=f+hc,

f−₌_f₋_hc_{belong to}_J−1₍_{_a_}₎_∩_{K. Because}_f₌_1/2(f+₊_f−_{), then}_f _{is not an} ex-treme point ofJ−1₍_{_a_}₎_∩_{K. This contradicts the deﬁnition of}_f_{. For}_i₌_1,...,k_and µ-a.a.x∈X, we havefi(x)∈ {0,1}, which implies the existence of an elementS∈Mk

such thatχS=f. This ﬁnishes the proof of the ﬁrst part ofLemma 3.1.

Suppose that the measureµis only bounded (not necessarily nonatomic). It is easy to see that the following decomposition holds:

µ=µna+µa, (3.8)

where the measuresµna,µaare singular and the ﬁrst of them is nonatomic. Of course

(5)

atoms andaµ the sum of sets of this family. For S =(S1,...,Sk)∈Mk, we denote

S−aµ=(S1−aµ,...,Sk−aµ)andS∩aµ=(S1∩aµ,...,Sk∩aµ). For everyA∈Mwe

have an obvious decompositionA=(A−aµ)∪(A∩aµ), soχU=χU−aµ+χU∩aµ and

χS=χS−aµ+χS∩aµ. This means that

W=WnaWa, (3.9)

where

Wna=µnaS1,...,µnaSk:S∈Mk, CχS=χU−aµ

,

Wa=µaS1,...,µaSk:S∈Mk, CχS=χU∩aµ

. (3.10)

From the ﬁrst part ofLemma 3.1, it follows thatWnais compact. It remains to prove

thatWaalso has this property. Note thatWacan be rewritten as follows:

Wa=   

b∈Aµ

µS1∩b,...,µSk∩b:S∈Mk, CχS=χU∩aµ   

=   

b∈Aµ

µ(b)x1b,...,µ(b)xbk:xb1,...,xbk∈ {0,1}k∩᏾b, b∈Aµ   ,

(3.11)

where᏾b, forb∈Aµ, satisﬁes the condition

µx∈b:᏾x=᏾b=µ(b). (3.12)

The setWa is equalφ(U), where (denotes the cartesian product of set)

U=

b∈Aµ

{0,1}k_∩_᏾ b,

φ:[0,1]k|Aµ| _→_Rk_, _φ_x_bi

b∈Aµ,i=1,...,k

=

b∈Aµ

µ(b)xb1,...,µ(b)xbk.

(3.13)

Continuity ofφand the compactness ofUimply thatWais compact. This completes

the proof.

Note that inLemma 3.1, the systemCχS=χUwas rewritten as the conditionχS(x)∈

᏾(x)forµ-a.a.x∈X. Unimodularity of matrixCand other assumptions guarantee that the sets᏾(x)were nonempty subsets of{0,1}k_{. Taking the measurable map}

v:X→2{0,1}k

− {∅}(the symbol 2{0,1}k

denotes the family of all subsets of{0,1}k_),

measurability ofvmeans thatv−1_(B)_∈_M _{for every}_B_{⊂ {}_0,1_}k_{, and replacing (}_3.1₎

by (the mapχS satisfying such condition will be called the selection ofv)

χS(x)∈v(x) forµ-a.a.x∈X, (3.14)

(6)

Theorem3.2. Letµbe the bounded, nonatomic measure and letv:X→2{0,1}k be the measurable map, such that forµ-a.a.x∈Xv(x)≠∅. Then the set

W=µS1,...,µSk:S=S1,...,Sk∈Mk, χS is selection ofv (3.15)

is compact and convex.

Proof. The collection{v−1₍_{_B_}₎_:_B_{⊂ {}_0,₁_}k_}_{is the partition of}_{X. It follows that}

W=B⊂{0,1}kWB, where

W=µS1,...,µSk:S=S1,...,Sk∈Mk,

Sj⊂v−1(B), χS|v−1_(B)is a selection ofv|_v−1_(B)

, (3.16)

seeLemma 3.1. The setWis nonempty and compact since anyWBhas these properties

(seeLemma 3.1). This completes the proof.

4. Differentiability of set functions. The definition presented below seems to be a generalization (on the case of bounded measure with any family of atoms) of the notion of differentiability introduced in [5,9].

Before deﬁning a derivative of set functions, we note a few facts. Note that any set function can be equivalently deﬁned on the family of characteristic functions of measurable subsets of X. We denote ˆM = {χS :S ∈M}. Any element S ∈Mk

cor-responds tokcharacteristic functionsχS∈L∞(X,M,µ,Rk)(orχS∈L∞(X,M,µ,Rk),

whereL∞_(X,M,µ,Rk₎_{denotes the space of essentially bounded functions), this}

im-plies equivalence between set functions and maps deﬁned on ˆMk_{, set function}_F

cor-responds to ˆF: ˆMk_→_{R, ˆ}_F(χ

A)=F(A). Additionally, if we can identify the sets whose

symmetricdiﬀerence has measure zero, thenMk_{can be viewed as a metric space—the}

distance can be deﬁned as follows: ifA=(A1,...,Ak)∈Mk,B=(B1,...,Bk)∈Mk, then

ρ:Mk_×_Mk _→_R, _ρ(A,B)₌k i=1

µAiBi, (4.1)

wheredenotes the symmetricdiﬀerence of sets.

Of course, the derivative of the function can be easily deﬁned, if its domain has the structure of a linear and metric space. Unfortunately, Mk _{(or equivalently ˆ}_Mk₎

has no “natural” linear structure. Additionally ˆMk_{is not a convex nor a closed}

sub-set of the space of integrable or essentially bounded functions. These facts do not make impossible defining differentiability, but they require making the appropriate modifications.

Deﬁnition4.1. We say that a set functionF is diﬀerentiable at S0_∈_Mk _{if for}

any S ∈Mk _{there exist an integrable function}_f

S0_,S_∩_a_µ :X−a_µ→Rk and the map φS0:Mk→R, such that the following decomposition holds:

F(S)−FS0₌

XfS0,S∩aµ

χS−χS0dµ+φ_S0S∩a_µ+R_S0(S),

RS0(S)=oρS−aµ,S0−aµ.

(4.2)

(7)

Example4.2. The set functions (2.1) are diﬀerentiable. Moreover,

fS0_,S_∩_a_µ=f|X−aµ, φS0(S)=

Xf

χS∩aµ−χS0_∩_a_µdµ. (4.3)

The notion of diﬀerentiability has some “good” properties. For example, the unique-ness property: any set functionF:Mk_→_R_{has, at given}_S0_∈_Mk_{, no more than one}

derivative. The proof can be found in [7].

In the special case whenµis nonatomic, we haveaµ= ∅and for everyS∈MkS∩aµ = ∅def=₍_∅_,...,_∅₎_∈_Mk_{which means that}_f

S0_,S_∩_a_µ=f_S0,∅andφ_S0≡0 forS∈Mk.

5. Properties of solutions of optimization problems withset functions. In this section, we formulate the necessary conditions for optimality in the problems (2.3), (2.4), and (2.5) with diﬀerentiable set functionsFi (i=0,1,...,s). The derivatives of

these functions are denoted by(fS,i,0), respectively, the components offS,iare

de-noted byfS,ij, wherej=1,...,k.

We begin by considering the case with nonatomic measure.

Theorem5.1. LetS∗_∈_Mk_{be an optimal solution of (}_2.3_{), (}_2.4_{), and (}_2.5_).

(1)Suppose that

(a) µis bounded and nonatomic;

(b) Fi(i=0,1,...,s)are diﬀerentiable atS∗in the sense ofDeﬁnition 4.1;

(c) the mapv:X→2{0,1}k

−{∅}is measurable; then there exist the nonnegative numbersλ∗

0,λ∗1,...,λ∗s, not simultaneously equal to zero, such that for every

S∈Mk_satisfying_χ

S(x)∈v(x)forx∈X, the following inequality holds: s

i=0 λ∗

i

XfS∗,i

χS−χS∗dµ≥0. (5.1)

Moreover, fori=1,...,s,

λ∗

iFiS∗=0. (5.2)

(2)If there existsSˆ∈Mk_{, such that}_χˆ

S is measurable selection ofv satisfying, for

i=1,...,s, the inequality

FiSˆ+

XfS∗,i

χˆS−χS∗dµ <0, (5.3)

then in (5.1) we may putλ∗ 0=1.

Proof. The presented conditions are similar to the Lusternik theorem describing necessary conditions for conditional extrema of functionals in Banach spaces. The proof is divided into a sequence of steps.

Step1. We show that

V=x0,...,vs∈Rs+1:∃S∈Mk, χS selection ofv,

∀i=1,...,s, xi≥

XfS∗,i

χS−χS∗dµ+1−δ_i₀F_iS∗,

(5.4)

whereδ denotes Kronecker delta, is convex. Fix p1_,p2_∈_V, _a_∈_[0,_{1]. There exist} S1_,S2_∈_Mk_{such that, for}_m₌_1,2,_i₌_0,_1,...,s,

pm i ≥

XfS∗,i

(8)

andχSm(x)∈v(x)forµ-a.a.x∈X.Lemma 3.1applied to the measure

MA→ν(A)=

XfS∗,iχAdµ

i=0,1,...,s

∈Rs+1 _(5.6)

shows that the set

XfS∗,0χSdµ,...,

XfS∗,sχSdµ

∈Rs+1_:_S_∈_Mk_{, χ}

S(x)∈v(x)forx∈X

(5.7)

is convex. For anya∈[0,1]there exists an elementSa_∈_Mk_{such that}

XfSa,0χSadµ,...,

XfSa,sχSadµ

=a

XfS1,0χS1dµ,...,

XfS1,sχS1dµ

+(1−a)

XfS2,0χS2dµ,...,

XfS2,sχS2dµ (5.8)

and forµ-a.a.x∈X,χSa(x)∈v(x).

This implies that

ap1₊₍₁₋_a)p2_≥

XfSa,0

χSa−χ_S∗dµ,...,

XfSa,s

χSa−χ_S∗dµ

+1−δi0FiS∗,

(5.9) which means thatap1₊₍₁₋_a)p2_∈_V_{. This ﬁnishes the proof of convexity.}

Step2. We show thatVis disjoint with]−∞,0[s+1_{. If for}_µ-a.a._x_∈_X_{the set}_v(x) has only one element, then forS∈Mk_{satisfying the condition}_χ

S(x)∈v(x)we have

S=S∗_{. The set}_V_{is disjoint with}_]_−∞_,0[s+1_{and the inequality (}_5.1_{) obviously holds.} Assume that there existsS0_≠_S∗_{such that}_χ

S0(x)∈v(x)forµ-a.a.x∈X. Suppose thatV∩]−∞,0[s+1_≠_∅_{. Hence there exists}_S0_∈_Mk_{, such that for all}_i₌_0,1,...,s_the

following inequality holds:

XfS∗,i

χS0−χ_S∗dµ+1−δ_i₀F_iS∗≤0. (5.10)

Lapunov convexity theorem applied to the measure

MA→

A

_χ_S0−χ_S∗dµ,

AfS∗,0

χS0−χ_S∗dµ,...,

AfS∗,s

χS0−χ_S∗dµ

∈Rs+2_,

(5.11) where |χS0−χS∗| =k_j₌₁|χ_S0

j −χS∗j|, implies that for any α∈[0,1]there exists a

µ-measurable set

Ωα_⊂_x_∈_X_:_χ

S∗(x)≠χ_S0(x) (5.12)

such that

α

X

_χ_S0−χS∗dµ=

ΩαχS0−χS∗dµ, (5.13)

and for everyi=0,...,s,

α

XfS∗,i

χS0−χS∗dµ=

ΩαfS∗,i

(9)

Consider for ﬁxedα∈[0,1], the map

uα_:_X _→_Rk_, _uα_(x)₌_χ

S∗(x)+χ_S0(x)−χ_S∗(x)χ_Ωα(x), (5.15)

and we denote bySα _{the element}_Mk_{which corresponds to}_uα_{, that is,}_χ

Sα=uα. It

is easy to check thatχSα(x)∈v(x)forµ-a.a.x∈X. For suﬃciently smallα >0 the

elementSα_{is closed to}_S∗_{with respect to the metric}_ρ_{(see (}_4.1_)):

ρSα_,S∗₌

X

_χ_Sα−χ_S∗dµ=

ΩαχSα−χS∗dµ=α

X

_χ_S₀₋_χ_S∗dµ, (5.16)

thereforeρ(Sα_,S∗₎_→_{0 if}_α_→₀₊_. Diﬀerentiability ofFiimplies that

FiSα−FiS∗=

XfS∗,i

χSα−χ_S∗dµ+R_i,S∗Sα, (5.17)

and hence

FiSα−FiS∗=α

XfS∗,i

χS0−χS∗dµ+R_i,S∗Sα, (5.18)

whereRi,S∗(Sα)=o(ρ(Sα,S∗))=o(α)if α→0. Equation (5.10) gives that the ﬁrst component in (5.18) is negative, therefore for suﬃciently smallα >0 its absolute value is greater than the second one. This implies thatF0(Sα_{) < F0(S}∗₎_and_F

i(Sα) <

Fi(S∗)≤0 fori=1,...,s. ElementSαis the feasible solution of (2.3), (2.4), and (2.5);

the value of the objective function forSα _{is less than for}_S∗_{. This contradicts the} deﬁnition ofS∗_.

Step 3. This step ﬁnishes the proof of inequality (5.1), the separation theorem implies that there exist nonnegative numbersλ∗

0,...,λ∗s not all zero such that, for all

(t0,t1,...,ts)∈V,

s

i=0 λ∗

iti≥0. (5.19)

ForS∈Mk_satisfying_χ

S(x)∈v(x)forµ-a.a.x∈X, we have

s

i=0 λ∗

i

XfS∗,i

χS−χS∗dµ+1−δ0_,iF_iS∗

≥0. (5.20)

LettingS =S∗ _{we obtain} s

i=1λ∗iFi(S∗)≥0. Becauseλ∗i ≥0 andFi(S∗)≤0 fori=

1,...,s, thenλ∗

iFi(S∗)=0. This ﬁnishes the ﬁrst part of the proof.

Now suppose that there exists an element ˆS∈Mk_{, which satisﬁes the inequalities}

(5.3) withλ∗

0=0. The inequality (5.20) gives

s

i=1 λ∗

i

XfS∗,i

χˆS−χS∗dµ+F_iS∗

≥0. (5.21)

Each component of this sum is nonpositive, thenλ∗

i =0 fori=1,...,s. We obtain the

contradiction with the fact thatλ∗

0,λ∗1,...,λ∗s are not all zero. Henceλ∗0>0. Dividing both sides of (5.1) byλ∗

(10)

Corollary5.2. Suppose thatS∗₌_(S∗

1,...,Sk∗)∈Mkis an optimal solution of (2.3), (2.4), and (2.5) and this problem does not have the conditions (2.5). If the assumptions ofTheorem 5.1are satisﬁed, then there exist nonnegative numbersλ∗

0,λ∗1,...,λ∗s, not simultaneously equal to zero, such that forj=1,...,k,

s

i=0 λ∗

ifS∗_,i(x)

 

≤0 ifx∈S

∗

j, ≥0 ifx∈X−S∗

j.

(5.22)

Proof. According toTheorem 5.1, for everyS∈Mk_{we have} s

i=0 λ∗

i

XfS∗,i

χS−χS∗dµ≥0. (5.23)

Fixj ∈ {1,...,k}. Inequality (5.23) must hold for every measurableu=(u1,...,uk)

such thatX→ {0,1}k_{, where}_u

i=χS∗_i fori≠j. This gives (the symbolfS∗_,ij denotes thejth component offS∗_,i)

s

i=0 λ∗

i

XfS∗,ij

uj−χS_j∗

dµ≥0, (5.24)

which is equivalent to (5.22). This completes the proof.

Corollary5.3. LetS∗_{be the optimal solution of (}_2.3_{), (}_2.4_{), and (}_2.5_{) in which the}

condition (2.5) can be written asχS(x)∈Vforµ-a.a.x∈X, whereV is a given subset of{0,1}k_._{Theorem 5.1}_{shows that there exist nonnegative numbers}_λ∗

i (i=0,1,...,s), which do not vanish simultaneously, such that, forv¯∈Vandx∈χ−1

S∗(v)¯ ,

∀v∈V s

i=0 λ∗

ifS∗_,i(v−v)¯ ≥0. (5.25)

Example5.4. Let(X,M,µ)be a probabilistic space with nonatomic, bounded mea-sureµ,Fi(i=0,1,...,s), the diﬀerentiable set functions onM3. Consider the problem

F0S1,S2,S3→min, (5.26)

subject to

S1,S2,S3∈M3_{, F}

iS1,S2,S3≤0 (i=1,...,s), S1∪S2⊂S3. (5.27)

Note that the last constraint can be written, using the characteristic functions of S1,S2,S3, in the following form:

χS1∪S2=max

χS1,χS2

≤χS3. (5.28)

This inequality has ﬁve solutions:

v1=(0,0,0), v2=(0,0,1), v3=(1,0,1), v4=(0,1,1), v5=(1,1,1), (5.29)

thus we have

χS1(x),χS2(x),χS3(x)

∈V forx∈X, (5.30)

(11)

Suppose thatFiare diﬀerentiable. We denote the derivative ofFiatS=(S1,S2,S3)

byfS,i, and its components byfS,i,1,fS,i,2,fS,i,3, respectively. Assume that the

prob-lem (5.26) and (5.27) has the optimal solutionS∗₌_(S∗

1,S2∗,S3∗).Corollary 5.3implies that there exist nonnegative numbersλ∗

i (i=0,...,s)(which do not vanish

simulta-neously), such that for anyv∈V,

XᏲS∗

v−χS∗dµ≥0, (5.31)

whereᏲS∗=s_i₌₀λ∗_if_S∗_,i. The functionᏲ_S∗, likef_S∗_,i, has three components which correspond toS1,S2,S3. We denote them byᏲS₁∗,ᏲS₂∗,ᏲS₃∗, respectively.

Consider the inequality (5.31). Putting the functions

v(k)_(x)₌   χS

∗(x) forx∈S₁∗∪S₂∗∪S₃∗, vk forx∈S1∗∪S2∗∪S3∗,

(5.32)

wherek=1,...,5, we obtain the system of inequalities which holdsµfor a.a.X−S∗ 1∪ S∗

2∪S3∗(i.e., forxsuch thatχS∗(x)=v1), ᏲS∗

1(x)+ᏲS3∗(x)≥0, ᏲS2∗(x)+ᏲS∗3(x)≥0, ᏲS∗1(x)+ᏲS2∗(x)+ᏲS3∗(x)≥0.

(5.33)

Forx∈S∗ 3−

S∗ 1∪S2∗

(ifχS∗(x)=v2),

ᏲS₃∗(x)≤0, ᏲS₁∗(x)≥0, ᏲS₂∗(x)≥0. (5.34)

Ifx∈S∗

1−S2∗(χS∗(x)=v3), then

ᏲS1∗(x)≤0, ᏲS2∗(x)≥0, ᏲS3∗(x)≤0. (5.35)

Ifx∈S∗

2−S1∗(χS∗(x)=v4), then

ᏲS1∗(x)≥0, ᏲS2∗(x)≤0, ᏲS3∗(x)≤0. (5.36)

In the case whenx∈S∗

1∩S2∗∩S3∗(χS∗(x)=v5), then

ᏲS1∗(x)≤0, ᏲS2∗(x)≤0, ᏲS3∗(x)≤0. (5.37)

Inequalities (5.33), (5.34), (5.35), (5.36), and (5.37) allow us to determine the optimal solution(−s)S∗₌_(S∗

1,S2∗,S3∗)(if exist) of the problem (5.26) and (5.27).

The considerations presented so far concerned the special case when the measure µ was nonatomic. It is easy to prove some generalizations ofTheorem 5.1. We have the following corollary.

Corollary5.5. LetS∗_{be an optimal solution of (}_2.3_{), (}_2.4_{), and (}_2.5_{), with}

diﬀer-entiable set functionsFi (i=0,...,s). There exist the nonnegative numbers λ∗i (i=

0,...,s), which do not vanish simultaneously, such that any feasible solutionSsatisﬁes the following inequality:

s

i=0 λ∗

i

X−aµfS

∗_,S∗_∩_a_µ_,iχ_S−χ_S∗dµ≥0, (5.38)

(12)

Proof. The elementS∗_{is the optimal solution of (}_2.3_{), (}_2.4_{), and (}_2.5_{) if and only} ifS∗₋_a

µ=(S1∗−aµ,...,Sk∗−aµ)is the optimal solution of the problem (with decision

variablesA=(A1,...,Ak))

F0A∪S∗_∩_a

µ →min, (5.39)

subject to

Aj∈M, Aj⊂X−aµ forj=1,...,k;

FiA∪S∗∩aµ≤0 fori=1,...,s;

χA(x)∈v(x) forµ-a.a.x∈X−aµ.

(5.40)

Measure µ restricted to the family of measurable subsets of X−aµ is obviously

nonatomic. ApplyingTheorem 5.1to the problem (5.39) and (5.40) ﬁnishes the proof.

Corollary 5.7concentrates on ordinary (unconditional) extrema of set function. This kind of extreme can be viewed as the optimal solutions of (2.3), (2.4), and (2.5), in which the constraints (2.4) do not appear explicitly.

Deﬁnition5.6. We say that the set functionF:Mk_→_R_{has in}_S∗_∈_Mk_minimum

(local minimum), ifS∗_{is the optimal solution of the problem}

F(S) →min, (5.41)

subject to

S∈Mk_{there exists}_{* >}_0, _{such that}_ρ_S∗_,S_{< *}_. _(5.42)

The necessary condition for optimality gives the following corollary.

Corollary5.7. IfFhas minimum (respectively, local minimum) inS∗₌_(S∗ 1,...,Sk∗) and((fS∗_,S_∩_a_µ)_S_∈_Mk,φS∗)is the derivative ofFinS∗, then for anyS∈Mk(respectively,

there exists* >0, such thatρ(S,S∗_{) < *}_{) and}_j₌_1,...,k_,

x∈S∗

j −aµ ⇒fS∗_,S∗_∩_a_µ_,j(x)≤0;

x∈S∗

j −aµ ⇒fS∗_,S∗_∩_a_µ_,j(x)≥0, (5.43)

φS∗S∩a_µ≥0. (5.44)

Proof. Without loss of generality, it is sufficient to consider the case of local min-imum. Minimum ofF corresponds to the situation with*≥µ(X). Formulas (5.43) follows directly fromTheorem 5.1. To finish the proof, we should show the inequality (5.44). Differentiability ofF and optimality ofS∗_{imply that if}_ρ(S∗_{,S) < *, then}

0≤F(S)−F(S∗₎₌

X−aµfS

∗_,S_∗∩_a_µχ_S−χ_S∗dµ

+φS∗S∩a_µ+oρS−a_µ,S∗−a_µ.

(5.45)

(13)

References

[1] A. Alexiewicz,Functional Analysis, Monograﬁe Matematyczne, vol. 49, Polish Scientiﬁc Publishers, Warsaw, 1968 (Polish).

[2] A. V. Balakrishnan,Analiza Funkcjonalna Stosowana, PWN, Warsaw, 1992.

[3] J.-R. Barra, Notions Fondamentales de Statistique Mathématique, Dunod, Paris, 1971 (French), Maîtrise de mathématiques et applications fondamentales.MR 53#6805. Zbl 257.62004.

[4] J. Bartoszewicz,Wykłady ze Statystyki Matematycznej[Lectures on Mathematical Statis-tics], Wydawnictwo Naukowe PWN, Warsaw, 1996 (Polish).CMP 1 624 321. [5] H. W. Corley,Optimization theory forn-set functions, J. Math. Anal. Appl.127(1987),

no. 1, 193–205.MR 88j:90185. Zbl 715.90096.

[6] H. W. Corley and S. D. Roberts,A partitioning problem with applications in regional design, Operations Res.20(1972), 1010–1019.MR 48#7924. Zbl 249.90021.

[7] S. Dorosiewicz,Modelowanie sieci Transportowych. Wybrane Zagadnienia[Modelling of Transportation Networks. Selected Topics], Monograﬁe i Opracowania, vol. 437, Warsaw School of Economics, Warsaw, 1997 (Polish).

[8] J. Lindenstrauss,A short proof of Liapounoﬀ’s convexity theorem, J. Math. Mech.15(1966), 971–972.MR 34#7754. Zbl 152.24403.

[9] R. J. T. Morris,Optimal constrained selection of a measurable subset, J. Math. Anal. Appl.

70(1979), no. 2, 546–562.MR 80h:49013. Zbl 417.49032.

[10] J. Neyman and E. S. Pearson,On the problem of the most eﬃcient tests of statistical hypothe-ses, Philos. Trans. Roy. Soc. London Ser. A231(1933), 289–337.Zbl 006.26804. [11] P. K. C. Wang,On a class of optimization problems involving domain variations, New

Trends in Systems Analysis (Proc. Internat. Sympos., Versailles, 1976), vol. 2, Springer-Verlag, Berlin, 1977, Lecture Notes in Control and Information Sciences, pp. 49–60.MR 58:7308. Zbl 372.93028.

[12] G. J. Zalmai,Optimality conditions and duality for constrained measurable subset selection problems with minmax objective functions, Optimization20(1989), no. 4, 377–395. MR 91b:90199. Zbl 677.90072.