Weierstraß-Institut
für Angewandte Analysis und Stochastik
Leibniz-Institut im Forschungsverbund Berlin e. V.
Preprint
ISSN 0946 – 8633
A simple formula for the second-order subdifferential
of maximum functions
Konstantin Emich, René Henrion
submitted: July 25, 2013 Weierstrass Institute Mohrenstr. 39 10117 Berlin Germany E-Mail: [email protected] [email protected] No. 1819 Berlin 20132010 Mathematics Subject Classification. 49J52, 49J53.
Key words and phrases. Second-order subdifferential, extended partial second-order subdifferential, maximum function, calculus rules.
Edited by
Weierstraß-Institut für Angewandte Analysis und Stochastik (WIAS) Leibniz-Institut im Forschungsverbund Berlin e. V.
Mohrenstraße 39 10117 Berlin Germany
Fax: +49 30 20372-303
E-Mail:
[email protected]
Abstract
We derive a simple formula for the second-order subdifferential of the maximum of co-ordinates which allows us to construct this set immediately from its argument and the direc-tion to which it is applied. This formula can be combined with a chain rule recently proved by Mordukhovich and Rockafellar [9] in order to derive a similarly simple formula for the extended partial second-order subdifferential of finite maxima of smooth functions. Anal-ogous formulae can be derived immediately for the full and conventional partial second-order subdifferentials.
1
Introduction
In 1976, B. S. Mordukhovich introduced a nonconvex normal cone and a corresponding subdif-ferential with the intention to derive necessary optimality conditions for optimal control problems with endpoint geometric constraints [4]. These constructions, meanwhile carrying his name, may have appeared unusual in the beginning because they fell outside the duality scheme between tangents and normals or directional derivatives and subdifferentials dominating the ideas of vari-ational analysis at that time. Soon it became evident, however, that the renunciation of convexity was a key property for precise characterizations in dual terms of optimality conditions or stability of multifunctions. The significance of Mordukhovich’s normal cone and related objects such as the associated coderivative relies on the fact that they are small on the one hand but robust on the other, the latter property being the basis for a surprisingly rich calculus which made it possi-ble to benefit from these constructions in more and more complex settings. Among the abundant proofs for the striking usefulness of the mentioned concepts, we just mention Mordukhovich’s celebrated coderivative criterion for the Aubin property (and related) of general multifunctions (see, e.g., [5]). As stated in the monograph by Rockafellar/Wets [10], "the Mordukhovich cri-terion... is the key to a Lipschitzian calculus for set-valued mappings". For a comprehensive account of similar results, the reader is referred to the basic monograph [6]. In the beginning, first order variational analysis was a primary field of application. Later, the focus shifted from ’raw’ objects to derived ones, e.g. from constraint mappings to solution mappings to generalized equations. A typical need for this transition arises in the derivation of dual necessary optimality conditions for hierarchical problems such as bilevel optimization or, more generally, mathemat-ical programming with equilibrium constraints. A typmathemat-ical problem of interest in this context is given by
min{g(x, y) | y ∈ S(x)},
S(x) := {y ∈ Y |0 ∈ ∇
yf (x, y) + N
C(y)}.
This bilevel problem has numerous applications, for instance in the characterization of equilibria in electricity spot markets (see, e.g., [3]). Upon observing that the feasible set of this problem,
i.e. the graph of
S
, can be equivalently described bygr S = Φ
−1(gr N
C),
Φ(x, y) := (y, −∇
yf (x, y)),
it becomes evident that dual necessary optimality conditions require the analysis of normal cones to derived objects which are related to normal cones themselves. This means, that one actually deals with second order variational analysis. Corresponding indispensable calculus rules (first of all chain rules) for the so-called (full) second-order subdifferential can be found in [6]. The application of the full second-order subdifferential is limited in the setting above, however, to functions
f
of classC
2 and to setsC
not depending onx
. Recently, however, increasing interest has arised in more general settings, e.g., when considering moving setsC(x)
as in the control of the sweeping process [1] or functionsf
being just of classC
1,1 as inconditioning of linear-quadratic two-stage stochastic optimization problems [2]. In these cases it is rather the so-called extended partial second-order subdifferential introduced in [9] whose calculus is of interest. A fundamental chain rule for this object has been established in [9] which basically allows to reduce the computation of the extended partial second-order subdifferential of some composite function to the computation of the full second-order subdifferential of the outer function. An extensive application can be found in the recent paper [8]. An important spe-cial case of composite functions is given by maximum functions. For instance, the mapping
S
considered above could be generalized to
S(x) := {y ∈ Y |0 ∈ ∂
yf (x, y) + N
C(y)},
f (x, y) := max
i=1,...,m
{f
i(x, y)},
where the
f
i are possibly smooth. A potential application would again arise from electricity spotmarket models with convex, nonsmooth bidding functions. Then,
f
is no longer differentiable and so one deals with multi-valued base mappings. Lipschitz properties of mappingsS
with such multivalued base have been studied in [7].In the context of such maximum functions, the chain rule mentioned above requires an efficient computation of the full second-order subdifferential of the maximum of coordinates. In principle, this can be realized by using a formula provided in [9, Lemma 4.4]. A concrete application of this formula, however, requires to evaluate differences of certain critical faces to the standard simplex (see 5) which may be tedious work within a systematic study. In this paper we present a very simple formula for the second-order subdifferential of the maximum of coordinates which provides an immediate construction given the initial data, i.e., the point in the graph and the direction of interest. The obtained formula is then used to prove a similarly immediate expression for the extended partial second-order subdifferential of a finite maximum of smooth functions.
2
Basic concepts and notation
As usual, we denote by ’gr
M
’ the graph of some multifunctionM
and byZ
0the polar of some setZ
. We recall the following definitions (see [6]):Definition 1. Let
C ⊆ R
mbe a closed subset andx ∈ C
¯
. The Mordukhovich normal cone toC
atx
¯
is defined byN
C(¯
x) :=
x
∗|∃ (x
n, x
∗n) → (¯
x, x
∗) : x
n∈ C, x
∗n∈ [T
C(x
n)]
0.
Here,
[T
C(x
n)]
0refers to the Fréchet normal cone toC
atx
n, which is the polar of thecontin-gent cone
T
C(x) := {d ∈ R
m|∃t
k↓ 0, d
k→ d : x + t
kd
k∈ C, ∀k } .
(1)to
C
atx
n. For an extended-real-valued, lower semicontinuous functionf : R
m→ ¯
R
with|f (¯
x)| < ∞
, the Mordukhovich normal cone induces a subdifferential via∂f (¯
x) := {x
∗| (x
∗, −1) ∈ N
epi f(¯
x, f (¯
x))} .
Definition 2. Let
M : R
n⇒ R
m be a multifunction with closed graph. The Mordukhovich coderivativeD
∗M (¯
x, ¯
y) : R
m⇒ R
nofM
at some(¯
x, ¯
y) ∈ gr M
is defined asD
∗M (¯
x, ¯
y)(y
∗) := {x
∗∈ R
n| (x
∗, −y
∗) ∈ N
gr M(¯
x, ¯
y)}
Definition 3. For a lower semicontinuous function
f : R
n→ ¯
R
which is finite atx ∈ R
nand for an elements ∈ ∂f (x)
the second-order subdifferential off
is a multifunction∂
2f (x, s) :
R
n⇒ R
ndefined by∂
2f (x, s) (u) := (D
∗∂f ) (x, s) (u)
∀u ∈ R
n.
Definition 4. For a lower semicontinuous function
f : R
n×R
m→ ¯
R
which is finite at(x, y) ∈
R
n× R
m, the partial subdifferential is defined as∂
yf (x, y) := ∂f (x, ·) (y)
. Following [9], for(x, y) ∈ R
n× R
m and anys ∈ ∂
y
f (x, y)
the (extended) partial second-order subdifferentialof
f
is a multifunction∂
˜
y2f (x, y, s) : R
m⇒ R
n× R
mdefined by˜
∂
y2f (x, y, s) (u) := (D
∗∂
yf ) (x, y, s) (u)
∀u ∈ R
m.
3
A formula for the second-order subdifferential of the
max-imum of coordinates
Let
θ : R
m→ R
be defined asθ(x) := max
i=1,...,m
x
i.
Our aim is to calculate the second-order subdifferential of
θ
. We denote byJ (x)
the set of active indices atx
:J (x) := {i ∈ {1, . . . , m}|x
i= θ(x)}
and by
J
c(x)
its complement. Furthermore for any fixed(¯
x, ¯
s) ∈ gr ∂θ
we introduce the fol-lowing index setsK := {i ∈ J (¯
x)|¯
s
i= 0} ,
L := {i ∈ J (¯
x)|¯
s
i> 0} .
Theorem 5. For any fixed
(¯
x, ¯
s) ∈ gr ∂θ
and any fixedu ∈ R
mthe second-order subdifferen-tial∂
2θ(¯
x, ¯
s)(u)
is nonempty if and only ifu
i
= c
for alli ∈ L
, wherec
is a constant. In thiscase it holds that
∂
2θ(¯
x, ¯
s)(u) =
(
w
mX
i=1w
i= 0, w
i≥ 0 ∀i ∈ J
>, w
i= 0 ∀i ∈ J
c(¯
x) ∪ J
<)
,
(2) whereJ
>:= {i ∈ J (¯
x) |u
i> c }
andJ
<:= {i ∈ J (¯
x) |u
i< c }
.Proof. We first notice that the function
θ
can be written in the equivalent form:θ(x) = max
v∈Shv, xi,
whereS :=
(
s ∈ R
m mX
i=1s
i= 1, s
i≥ 0, i = 1, . . . , m
)
.
(3)Denoting by
e
i the canonical unit vectors ofR
m, the subdifferential ofθ
atx
¯
is well known toadmit the representation:
∂θ(¯
x) = conv {e
i|i ∈ J(x)} ⊆ S.
Accordingly, the fixed
(¯
x, ¯
s) ∈ gr ∂θ
satisfiesm
X
i=1
¯
s
i= 1, ¯
s
i> 0 ∀i ∈ L, ¯
s
i= 0 ∀i ∈ J
c(¯
x) ∪ K,
(4)In particular,
L 6= ∅
. It is easy to see that the contingent coneT
S(¯
s)
to the simplexS
at¯
s ∈ ∂θ(¯
x)
is given byT
S(¯
s) =
(
h
mX
i=1h
i= 0, h
i≥ 0 ∀i ∈ J
c(¯
x) ∪ K
)
.
By [9, Lemma 4.4], the second-order subdifferential of
θ
at(¯
x, ¯
s)
in directionu
can be char-acterized as follows:∂
2θ(¯
x, ¯
s)(u) = {w |
there are closed facesK
1⊇ K
2ofT
S(¯
s) ∩ ¯
x
⊥such thatw ∈ K
1− K
2, −u ∈ (K
1− K
2)
0}
(5)We claim that the critical cone to
S
at¯
s
is given byT
S(¯
s) ∩ ¯
x
⊥=
(
h
mX
i=1h
i= 0, h
i≥ 0 ∀i ∈ K, h
i= 0 ∀i ∈ J
c(¯
x)
)
.
(6)Indeed, if
h ∈ T
S(¯
s) ∩ ¯
x
⊥, then, byh
i≥ 0
for alli ∈ J
c(¯
x)
,0 = h¯
x, hi = θ(¯
x)
X
i∈J (¯x)h
i+
X
i∈Jc(¯x)¯
x
ih
i≤ θ(¯
x)
mX
i=1h
i= 0
(7)which implies that the inequality above actually holds as an equality and, thus,
X
i∈Jc(¯x)
h
i(¯
x
i− θ(¯
x)) = 0.
Since
x
¯
i< θ(¯
x)
fori ∈ J
c(¯
x)
, it follows thath
i= 0
for alli ∈ J
c(¯
x)
, whenceh
belongsto the right-hand side of (6). Conversely, if
h
belongs to the right-hand side of (6), then clearlyh ∈ T
S(¯
s)
. Now, (7) can be read from the right to the left in order to derive thath ∈ ¯
x
⊥. ThisThe application of (5) requires the knowledge of the closed faces of
T
S(¯
s) ∩ ¯
x
⊥. Owing to (6),these are given by:
M
I:=
(
h
mX
i=1h
i= 0, h
i≥ 0 ∀i ∈ K \ I, h
i= 0 ∀i ∈ J
c(¯
x) ∪ I
)
(I ⊆ K).
We show thatI
1⊆ I
2⇔ M
I1⊇ M
I2∀I
1, I
2⊆ K.
(8) The direction ’⇒
’ is evident. For the reverse direction letI
1, I
2⊆ K
and assume thatI
1* I
2.Then there exists some
i
0∈ I
1\I
2. As observed below (4), we may choose some indexi
∗∈ L
.Recall that
K ∩ L = ∅
. Hence,i
∗∈ K
/
andi
06= i
∗. This allows us to define
h
by settingh
i0:= 1, h
i∗:= −1
andh
i:= 0
for all remainingi
. Then,h
i≥ 0
for alli ∈ K \ I
2 due toi
∗∈ K
/
. Furthermore,i
0, i
∗∈ I
/
2 andi
0, i
∗∈ J(¯
x)
due toK, L ⊆ J (¯
x)
. This implies thath
i= 0
for alli ∈ J
c(¯
x) ∪ I
2, whenceh ∈ M
I2. On the other hand,i
0
∈ I
1, so that
h
i0= 1
leads toh /
∈ M
I1. Altogether this proves (8).Combining (8) with (5), we arrive at
∂
2θ(¯
x, ¯
s)(u) =
w
∃I
1⊆ I
2⊆ K : w ∈ M
I1− M
I2, −u ∈ (M
I1− M
I2)
0
,
(9) We show next that for
I
1⊆ I
2⊆ K
the setsM
I1− M
I2 and(M
I1− M
I2)
0 are given by:
M
I1− M
I2=
(
p
mX
i=1p
i= 0, p
i≥ 0 ∀i ∈ I
2\ I
1, p
i= 0 ∀i ∈ J
c(¯
x) ∪ I
1)
(10)(M
I1− M
I2)
0= {p
∗| ∃˜
c ∈ R : p
∗i≤ ˜
c ∀i ∈ I
2\ I
1, p
∗i= ˜
c ∀i ∈ J (¯
x) \ I
2} .
(11)The inclusion ’
⊆
’ in (10) follows readily from the definitions. For the reverse inclusion, letp
belong to the right-hand side of (10) and define
h
(1), h
(2) byh
(1)i: = p
i,
h
(2)i:= 0 ∀i ∈ J
c(¯
x) ∪ I
2h
(1)i: = [p
i]
+,
h
(2) i:= [p
i]
−∀i ∈ J(¯
x)\ (I
2∪ {i
∗})
h
(1)i∗: = −
X
i∈Jc(¯x)∪I 2p
i−
X
i∈J (¯x)\(I2∪{i∗})
[p
i]
+,
h
(2)i∗:= −
X
i∈J (¯x)\(I2∪{i∗})
[p
i]
−,
where,
[p
i]
+, [p
i]
− denote the positive and negative parts ofp
i, respectively, andi
∗ is anar-bitrarily selected index of the nonempty set
L ⊆ J (¯
x)
as it has already been used before. Then, by construction,m
P
i=1
h
(1)i= 0
. The properties ofp
and the inclusionI
1⊆ I
2 ensurethat
h
(1)i= p
i= 0
for alli ∈ J
c(¯
x) ∪ I
1. Finally, ifi ∈ K\I
1 is arbitrary theni 6= i
∗ dueto
K ∩ L = ∅
. We distinguish the casesi /
∈ I
2 andi ∈ I
2. In the first case it follows thati ∈ J (¯
x)\ (I
2∪ {i
∗})
byK ⊆ J (¯
x)
. Then,h
(1)i
:= [p
i]
+≥ 0
. Otherwise,i ∈ I
2\I
1 andh
(1)i= p
i≥ 0
. Summarizing,h
(1)i≥ 0
for alli ∈ K\I
1 and, thus,h
(1)∈ M
I1. Similarly, by construction,m
P
i=1
arbitrary then
i 6= i
∗due toK ∩ L = ∅
. Therefore,i ∈ J (¯
x)\ (I
2∪ {i
∗})
byK ⊆ J (¯
x)
. Weconclude that
h
(2)i= [p
i]
−≥ 0
, whenceh
(2)∈ M
I2. It remains to show thatp = h
(1)
− h
(2).Indeed, by construction one has the following exhaustive cases:
h
(1)i− h
(2)i= p
i∀i ∈ J
c(¯
x) ∪ I
2h
(1)i− h
(2)i= [p
i]
+− [p
i]
−= p
i∀i ∈ J(¯
x)\ (I
2∪ {i
∗})
h
(1)i∗− h
(2) i∗= −
X
i∈Jc(¯x)∪I 2p
i−
X
i∈J (¯x)\(I2∪{i∗})
p
i= p
i∗.
Here, the last equality follows from
m
P
i=1
p
i= 0
. Altogether this proves (10).By duality theory for systems of linear equalities and inequalities we have that
p
∗∈ (M
I1−
M
I2)
0 if and only if there are coefficients
c, λ
˜
i
∈ R
fori ∈ J
c(¯
x) ∪ I
1 andµ
i≤ 0
fori ∈ I
2\ I
1such thatp
∗= ˜
c1 +
X
i∈Jc(¯x)∪I 1λ
ie
i+
X
i∈I2\I1µ
ie
i,
where1 = (1, . . . , 1)
T. Forj ∈ I
2
\I
1it follows thatj /
∈ J
c(¯
x)∪I
1, whencep
∗j= ˜
c+µ
j≤ ˜
c
.For
j ∈ J (¯
x) \ I
2it follows thatj /
∈ J
c(¯
x) ∪ I
1 (due toI
1⊆ I
2) and alsoj /
∈ I
2\ I
1, whencep
∗j= ˜
c
. Summarizing,p
∗∈ (M
I1− M
I2)
0 if and only if it satisfies the conditions on the
right-hand side of (11).
(M
I1− M
I2)
0= {p
∗| J(¯
x) \ I
2⊆ {i ∈ J(¯
x)|p
∗i=
max
j∈J (¯x)\I1p
∗j}}.
By (9) it holds that∂
2θ(¯
x, ¯
s)(u) =
w
∃I
1⊆ I
2⊆ K :
mX
i=1w
i= 0,
w
i≥ 0 ∀i ∈ I
2\ I
1,
w
i= 0 ∀i ∈ J
c(¯
x) ∪ I
1,
J (¯
x) \ I
2⊆ {i ∈ J(¯
x)|u
i=
min
j∈J (¯x)\I1u
j}
.
(12) With the notationsA
I1,I2:=
(
w
mX
i=1w
i= 0, w
i≥ 0 ∀i ∈ I
2\ I
1, w
i= 0 ∀i ∈ J
c(¯
x) ∪ I
1)
,
(13)J (I) = {i ∈ J (¯
x) | u
i=
min
j∈J (¯x)\Iu
j}
(14)(12) can be written as
∂
2θ(¯
x, ¯
s)(u) =
[
I1⊆I2⊆K J (¯x)\I2⊆J(I1)A
I1,I2=
[
I1⊆I2⊆K J (¯x)\J (I1)⊆I2A
I1,I2=
[
I1⊆K J (¯x)\J (I1)⊆K
[
I1⊆I2⊆K J (¯x)\J (I1)⊆I2A
I1,I2
(15)In the iterated union on the right-hand side we consider the inner union over sets
I
2 to beconditional with respect to the set
I
1⊆ K
fixed in the outer union. Note that in case ofJ (¯
x)\J (I
1) * K
there exists noI
2 satisfying the conditions of the inner union which allows usto add the condition
J (¯
x) \ J (I
1) ⊆ K
to the outer union. Now, for any fixedI
1 satisfying theconditions of the outer union let
I
2a⊆ I
b2 be two index sets satisfying the conditions for
I
2 in theinner union. By (13), it then holds that
A
I1,I2a⊇ A
I1,I2b. Thus, one may shrink the inner union to the minimal setI
2 of the indicated family which is evidently given byI
2= I
1∪(J(¯
x) \ J (I
1))
.Consequently, for
M := {I ⊆ K|J (¯
x) \ J (I) ⊆ K}
we have that∂
2θ(¯
x, ¯
s)(u) =
[
I∈MA
I,I∪(J (¯x)\J (I))=
[
I∈M˜
A
I withA
˜
I:= A
I,I∪(J (¯x)\J (I))(I ∈ M ).
(16) From
J (¯
x) \ K = L
and (14) we derive thatM = {I ⊆ K|L ⊆ J (I)} = {I ⊆ K|u
i=
min
j∈J (¯x)\I
u
j∀i ∈ L}.
(17)Note that
J (¯
x) \ I 6= ∅
for allI ∈ M
because ofI ⊆ K
and, thus,∅ 6= L = J(¯
x) \ K ⊆
J (¯
x) \ I
. This implies thatM
is nonempty if and only ifu
i is constant for alli ∈ L
which isthe first assertion of the theorem.
To show the asserted formula (2), assume now that
M 6= ∅
, henceu
i= c
for alli ∈ L
and for some constant
c
. Consequently, forJ
< as introduced below (2), one has thatJ
<⊆
J (¯
x) \ L = K
. Moreover, by (17),I ∈ M ⇐⇒ I ⊆ K
andu
j≥ c ∀j ∈ J(¯
x) \ I ⇐⇒ J
<⊆ I ⊆ K.
(18)In particular, because of
J
<⊆ K
, we infer thatJ
<∈ M
and thatJ
< is in fact the smallestmember of
M
.In the last step of the proof we show that
A
˜
I defined in (16) is a decreasing family of sets.To this aim, we define
J
I:= J (¯
x) \ (J (I) ∪ I)
forI ∈ M
, and observe thatJ
I= [I ∪
(J (¯
x) \ J (I))] \ I
, whence by definition ofA
˜
Iand by (13)˜
A
I= {w |
mX
i=1
w
i= 0, w
i≥ 0 ∀i ∈ J
I, w
i= 0 ∀i ∈ J
c(¯
x) ∪ I}.
(19)From the definitions of
J
IandJ (I)
we conclude thatJ
I= {i ∈ J (¯
x) \ I | u
i>
min
Now let
I
a, I
b∈ M
be arbitrary withI
a⊆ I
b. We claim thatA
˜
Ib
⊆ ˜
A
Ia. For arbitrarily givenw ∈ ˜
A
Ib one has by (19) thatm
X
i=1
w
i= 0, w
i≥ 0 ∀i ∈ J
Ib, w
i= 0 ∀i ∈ J
c(¯
x) ∪ I
b.
(21)Since
I
a⊆ I
b, it is sufficient to show thatw
i
≥ 0
for alli ∈ J
Ia\ J
Ib. Let suchi
be arbitrarily given. Then, by (20),i ∈ J (¯
x) \ I
a and by (17)u
i>
min
j∈J (¯x)\Ia
u
j= c.
(22) Moreover,i /
∈ J
Ib leaves two possibilities according to (20): eitheri /
∈ J(¯
x) \ I
b which byi ∈ J (¯
x) \ I
a leads toi ∈ I
b\ I
a, whencew
i
= 0
(see (21)); or, again relying on (17),u
i≤
min
j∈J (¯x)\Ib
u
j= c,
which is a contradiction with (22). Summarizing,
w
i≥ 0
and, thus,A
˜
Ib⊆ ˜
A
Ia. The fact thatA
˜
I defined in (16) is a decreasing family of sets yields along with (18) that∂
2θ(¯
x, ¯
s)(u) =
[
I∈M˜
A
I= ˜
A
J<=
(
w |
mX
i=1w
i= 0, w
i≥ 0 ∀i ∈ J
J<, w
i= 0 ∀i ∈ J
c(¯
x) ∪ J
<)
by (19). Observing that by (20), (18) and (17)
J
J<= {i ∈ J (¯
x) \ J
<| u
i> c} = {i ∈ J (¯
x) | u
i> c} = J
> forJ
> being defined below (2), we finally derive the asserted formula (2).Corollary 6. In the setting of Theorem 5 one has that
∂
2θ(¯
x, ¯
s)(0) = {w |
m
X
i=1
w
i= 0, w
i= 0 ∀i ∈ J
c(¯
x)}.
Proof. Since
L 6= ∅
, it follows thatc = 0
for the constantc
defined in the statement of Theorem 5. Accordingly,J
<= J
>= ∅
and the assertion follows from (2).In order to illustrate how Theorem 5 reduces the effort to calculate the second order subdiffer-ential of a maximum of coordinates, we pick up an example from [9]:
Example 7 ([9], Example 3.5). Let
m = 4, ¯
x = (0, 0, 0, 0), ¯
s = (
1 2,
1
2
, 0, 0), u = (0, 0, 1, −1)
.Then,
J (¯
x) = {1, 2, 3, 4}, J
c(¯
x) = {∅}
andL = {1, 2}
. Sinceu
i
= 0 =: c
for alli ∈ L
, itfollows that
J
>= {3}, J
<= {4}
. Thus,∂
2θ(¯
x, ¯
s)(u) =
(
w
4X
i=1w
i= 0, w
3≥ 0, w
4= 0
)
.
4
A formula for the extended partial second-order
subdiffer-ential of a finite maximum of smooth functions
We are now going to apply the result of Theorem 5 to the maximum not just of coordinates but of general smooth functions:
ϕ(x, y) := max
i=1,...,m
g
i(x, y).
We partition the argument into two parts in order to discuss the extended second-order subd-ifferential of
ϕ
. A completely analogous result will hold true for the conventional second-order subdifferential. Clearly we may writeϕ = θ ◦ g
withθ(z
1, . . . , z
m) := max
i=1,...,mz
i andg(x, y) := (g
1(x, y), . . . , g
m(x, y))
T. We assume thatg : R
n× R
d→ R
m is of classC
2.The key tool for our result is the following chain rule:
Theorem 8 ([9], Theorem 3.3 & Theorem 4.3). Fix any
(¯
x, ¯
y)
ands ∈ ∂
¯
yϕ(¯
x, ¯
y)
. Assumethat the basic second-order constraint qualification
∂
2θ (g(¯
x, ¯
y)), v)) (0) ∩ ker ∇
Tyg(¯
x, ¯
y) = {0}
(23) is satisfied at somev ∈ ∂θ(g(¯
x, ¯
y))
withs = ∇
¯
Tyg(¯
x, ¯
y)v
. Then,v
is uniquely defined and for allu ∈ R
d it holds that˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u) =
" ∇
2 xyhv, gi(¯
x, ¯
y)
∇
2yyhv, gi(¯
x, ¯
y)
#
u +
" ∇
T xg(¯
x, ¯
y)
∇
T yg(¯
x, ¯
y)
#
∂
2θ(g(¯
x, ¯
y), v)(∇
yg(¯
x, ¯
y)u),
(24) The basic second-order constraint qualification (23) motivates us to introduce the following weakened form of a linear independence condition:Definition 9. A set
{z
1, . . . , z
s}
of vectors is called graphically linearly independent if the set{(z
1, 1), . . . , (z
s, 1)}
of enhanced vectors is linearly independent in the classical sense.Clearly, graphical linear independence is a strictly weaker notion than classical linear indepen-dence. Recalling that
˜
∂
yϕ(¯
x, ¯
y) = conv{∇
yg
i(¯
x, ¯
y)}
i∈I(¯x,¯y) whereI(¯
x, ¯
y) := {i|g
i(¯
x, ¯
y) = max
j
g
j(¯
x, ¯
y)},
we observe that any
s ∈ ˜
¯
∂
yϕ(¯
x, ¯
y)
can be written in the form¯
s =
X
i∈I(¯x,¯y)v
i∇
yg
i(¯
x, ¯
y),
X
i∈I(¯x,¯y)v
i= 1
or(¯
s, 1) =
X
i∈I(¯x,¯y)v
i(∇
yg
i(¯
x, ¯
y), 1).
It follows that the multipliers
v
i are uniquely defined bys
¯
. Now, we are in a position to makeTheorem 10. In the setting of Theorem 8 assume that the set
{∇
yg
i(¯
x, ¯
y)}
i∈I(¯x,¯y) isgraph-ically linearly independent. Denote by
v ∈ ∂θ(g(¯
x, ¯
y))
the unique element defined bys =
¯
∇
Ty
g(¯
x, ¯
y)v
(see discussion above) and letL := {i ∈ I(¯
x, ¯
y)|v
i> 0}
. Then, for arbitraryu ∈ R
d, one has that:˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u) 6= ∅ ⇐⇒ ∃c ∈ R : h∇
yg
i(¯
x, ¯
y), ui = c ∀i ∈ L.
(25)In this case it holds that
˜
w ∈ ˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u)
⇐⇒
∃w ∈ R
m:
˜
w =
" ∇
2 xyhv, gi(¯
x, ¯
y)
∇
2yyhv, gi(¯
x, ¯
y)
#
u +
" ∇
T xg(¯
x, ¯
y)
∇
T yg(¯
x, ¯
y)
#
w,
mX
i=1w
i= 0,
w
i≥ 0 ∀i ∈ I
>,
w
i= 0 ∀i ∈ I
c(¯
x, ¯
y) ∪ I
<,
(26) whereI
>:= {i ∈ I(¯
x, ¯
y) |h∇
yg
i(¯
x, ¯
y), ui > c }
I
<:= {i ∈ I(¯
x, ¯
y) |h∇
yg
i(¯
x, ¯
y), ui < c }
I
c(¯
x, ¯
y) := {1, . . . , m} \ I(¯
x, ¯
y).
Proof. By Corollary 6 we have that
∂
2θ (g(¯
x, ¯
y), v) (0) =
(
w
mX
i=1w
i= 0, w
i= 0 ∀i ∈ I
c(¯
x, ¯
y)
)
.
(27)Then, evidently the inclusion ’
⊇
’ in (23) is satisfied. Conversely, ifw
belongs to the left-hand side of (23), thenw
satisfies (27) andw ∈ ker ∇
Ty
g(¯
x, ¯
y)
. Hence,X
i∈I(¯x,¯y)w
i(∇
yg
i(¯
x, ¯
y), 1) =
mX
i=1w
i(∇
yg
i(¯
x, ¯
y), 1) = (0, 0).
The assumed graphical linear independence of the set
{∇
yg
i(¯
x, ¯
y)}
i∈I(¯x,¯y)provides thatw
i=
0
for alli ∈ I(¯
x, ¯
y)
. Along withw
i= 0
for alli ∈ I
c(¯
x, ¯
y)
by (27), we arrive at the desiredconclusion
w = 0
. Thus, (23) is satisfied and we may invoke Theorem 8 in order to derive (25). Our first conclusion from (24) is that˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u) 6= ∅ ⇐⇒ ∂
2θ(g(¯
x, ¯
y), v)(∇
yg(¯
x, ¯
y)u) 6= ∅.
By virtue of Theorem 5 this entails (25). Similarly, (26) follows from (24) upon using the explicit representation (2).
Remark 11. Exploiting similar chain rules to the one given in Theorem 8 but relating to the full and conventional partial second-order subdifferential rather than to its extended partial one (see [6], [9]) one may derive in the same way corresponding fully explicit formulae for those other types of second-order subdifferentials of finite maxima of smooth functions.
As an illustration of Theorem 10 we consider the following example:
Example 12. Define
g : R
2→ R
byg(x, y) := (−xy, xy
2)
and consider the point(¯
x, ¯
y) :=
(1, 0)
. Then,ϕ(¯
x, y) = max{−y, y
2}
and∂
yϕ(¯
x, ¯
y) = [−1, 0]
. Moreover,I(¯
x, ¯
y) =
{1, 2}
. Due to∇
yg
1(¯
x, ¯
y) = −1
and∇
yg
2(¯
x, ¯
y) = 0
the active gradients are graphicallylinearly independent (while not linearly independent) as required in Theorem 10. Clearly, for any
¯
s ∈ ∂
yϕ(¯
x, ¯
y)
the vectorv
has the unique representationv = (−¯
s, 1 + ¯
s)
. Now, (25) yieldsthat
∂
˜
2y
ϕ(¯
x, ¯
y, ¯
s)(u) 6= ∅
ifL = {1}
orL = {2}
or finally, ifL = {1, 2}
andu = 0
. Inthese cases, formula (26) reduces with our concrete data to:
˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u) =
−v
1u
2v
2u − w
1| w
1= −w
2, w
i≥ 0 (i ∈ I
>), w
i= 0 (i ∈ I
<)
.
Now, let
u ∈ R
be arbitrary. We consider three cases:1
s = −1
¯
. Then,v = (1, 0), L = {1}, c = −u
. Ifu > 0
, thenI
>= {2}
elseI
>= ∅
.Similarly, if
u < 0
, thenI
<= {2}
elseI
<= ∅
. Thus,˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u) = {(−u, t)|t ∈ A}
whereA =
t ≥ 0
ifu > 0
t ∈ R
ifu = 0
t = 0
ifu < 0
2
s = 0
¯
. Then,v = (0, 1), L = {2}, c = 0
. Ifu < 0
, thenI
>= {1}
elseI
>= ∅
.Similarly, if
u > 0
, thenI
<= {1}
elseI
<= ∅
. Thus,˜
∂
y2ϕ(¯
x, ¯
y, ¯
s)(u) = {(0, 2u − t)|t ∈ A}
whereA =
t = 0
ifu > 0
t ∈ R
ifu = 0
t ≥ 0
ifu < 0
3
−1 < ¯
s < 0
. Then,v = (−¯
s, 1 + ¯
s), L = {1, 2}, c = −u
. As mentioned above,˜
∂
2y
ϕ(¯
x, ¯
y, ¯
s)(u) = ∅
ifu 6= 0
. Ifu = 0
thenI
>= I
<= ∅
. Thus,∂
˜
y2ϕ(¯
x, ¯
y, ¯
s)(u) =
{0} × R
.References
[1] G. Colombo, R. Henrion, N.D. Hoang, and B.S. Mordukhovich. Optimal control of the sweeping process. Dyn. Contin. Discrete Impuls. Syst. Ser. B Appl. Algorithms, 19:117– 159, 2012.
[2] K. Emich, R. Henrion, and W. Römisch. Conditioning of linear-quadratic two-stage stochas-tic optimization problems. Mathemastochas-tical Programming, 2013.
[3] R. Henrion, J. Outrata, and T Surowiec. On regular coderivatives in parametric equilibria with non-unique multiplier. Mathematical Programming, 136:111–131, 2012.
[4] B. S Mordukhovich. Maximum principle in problems of time optimal control with nonsmooth constraints. J. Appl. Math. Mech., 40:960–969, 1976.
[5] B. S Mordukhovich. Complete characterization of openness, metric regularity, and
Lipschitzian properties of multifunctions. Trans. Amer. Math. Soc., 340:1–36, 1993. [6] B. S. Mordukhovich. Variational Analysis and Generalized Differentiation I: Basic Theory,
volume 330 of Grundlehren der mathematischen Wissenschaften. Springer, 2006. [7] B. S. Mordukhovich and N. M. Nam. Variational analysis of extended generalized equations
via coderivative calculus in asplund spaces. J. Math. Anal. Appl., 350:663–679, 2009. [8] B. S. Mordukhovich, N. M. Nam, and N.T.Y. Nhi. Partial second-order subdifferentials in
variational analysis and optimization. Numer. Func. Anal. Optim., page to appear, 2013. [9] B. S. Mordukhovich and R. T. Rockafellar. Second-order subdifferential calculus with
ap-plications to tilt stability in optimization. SIAM J. Optim., 22(3):953–986, 2012.
[10] R. T. Rockafellar and R. J.-B. Wets. Variational Analysis, volume 317 of Grundlehren der mathematischen Wissenschaften. Springer, Berlin, 1998.