Some variations - Elements of Linear and Real Analysis

In this section, V will be a finite-dimensional real vector space.

Definition 4.48 (Sublinear functions) A real-valued function p(v) on V is said to be sublinear if p(v + w) ≤ p(v) + p(w) for all v, w ∈ V and p(t v) = t p(v) for all v ∈ V and all nonnegative real numbers t. In particular, p(0) = 0.

Unlike a norm, a sublinear function p(v) is not asked to satisfy p(−v) = p(v).
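For a concrete instance, the largest-coordinate function p(v) = max(v₁, . . . , v_n) on Rⁿ is sublinear but does not satisfy p(−v) = p(v) in general. The following is a minimal sketch of a numerical sanity check on random samples, not a proof; the function names are illustrative and do not come from the text.

```python
import random

def p(v):
    """Largest coordinate of v: sublinear on R^n, but p(-v) != p(v) in general."""
    return max(v)

def is_sublinear_on_samples(f, dim=3, trials=1000, tol=1e-9, seed=0):
    """Test subadditivity and nonnegative homogeneity of f on random vectors."""
    rng = random.Random(seed)
    for _ in range(trials):
        v = [rng.uniform(-5, 5) for _ in range(dim)]
        w = [rng.uniform(-5, 5) for _ in range(dim)]
        t = rng.uniform(0, 5)
        v_plus_w = [a + b for a, b in zip(v, w)]
        t_v = [t * a for a in v]
        if f(v_plus_w) > f(v) + f(w) + tol:
            return False  # subadditivity fails on this sample
        if abs(f(t_v) - t * f(v)) > tol:
            return False  # homogeneity fails on this sample
    return True

v = [1.0, -2.0]
minus_v = [-a for a in v]
print(is_sublinear_on_samples(p))  # sampled check of both properties
print(p(v), p(minus_v))            # the values at v and -v differ
```

Passing the sampled check only shows that no counterexample was found; the inequalities themselves follow directly from the properties of the maximum.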

Theorem 4.49 Let V be a finite-dimensional real vector space, and let p be a sublinear function on V . Suppose that W is a vector subspace of V and that µ is a linear functional on W such that

µ(w) ≤ p(w) for all w ∈ W.

(4.50)

Then there is a linear functional µ̂ on V which is an extension of µ and satisfies

µ̂(v) ≤ p(v) for all v ∈ V.

(4.51)

Note that we simply have µ(w) and µ̂(v) in (4.50) and (4.51), rather than their absolute values. If p(−v) = p(v) for all v ∈ V , then this would not make a real difference.

Theorem 4.49 can be proved in a similar manner as Theorem 4.17 in Section 4.3. The first part of the proof is essentially the same except for simple changes following the differences in the statements of the theorems.

This works up to (4.21), whose counterpart in the present setting is

µ_{j+1}(u + t z) ≤ p(u + t z) for all u ∈ W_j and t ∈ R.

(4.52)

To be more precise, for the proof we want to choose the real number α (the value to be assigned to µ_{j+1}(z)) so that (4.52) holds. If we can do that, then the theorem will follow.

For the next step we need a modification, which is to say that (4.52) can be reduced to

µ_{j+1}(u + z) ≤ p(u + z) and µ_{j+1}(u − z) ≤ p(u − z) (4.53)

for all u ∈ W_j.

As before, the t = 0 case of (4.52) comes from the induction hypothesis (for µ_j and W_j), and for t ≠ 0 one can convert (4.52) into (4.53) by employing the homogeneity of the sublinear function p(v). Now we have to treat t > 0 and t < 0 separately, since we are only assuming homogeneity for positive real numbers.
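The reduction from (4.52) to (4.53) can be spelled out as follows. This is a sketch of the computation left to the reader; it uses only the linearity of µ_{j+1} and the homogeneity of p for positive scalars, and the auxiliary names u′ and s are introduced here for illustration.

```latex
\begin{align*}
\text{Case } t > 0,\ u' = u/t \in W_j:\quad
  & \mu_{j+1}(u + t z) = t\,\mu_{j+1}(u' + z), \qquad
    p(u + t z) = t\,p(u' + z), \\
  & \text{so (4.52) for } (u, t) \iff \mu_{j+1}(u' + z) \le p(u' + z). \\[4pt]
\text{Case } t < 0,\ s = -t > 0,\ u' = u/s \in W_j:\quad
  & \mu_{j+1}(u + t z) = s\,\mu_{j+1}(u' - z), \qquad
    p(u + t z) = s\,p(u' - z), \\
  & \text{so (4.52) for } (u, t) \iff \mu_{j+1}(u' - z) \le p(u' - z).
\end{align*}
```

Since u′ runs over all of W_j as u does, the two cases together are exactly the two inequalities in (4.53).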

We can rewrite (4.53) as

µ_j(u) + α ≤ p(u + z) and µ_j(u) − α ≤ p(u − z) (4.54)

for all u ∈ W_j,

using the definition of µ_{j+1}, as in the previous situation. The problem is to verify that α ∈ R can be chosen so that this condition holds.

This requirement can be rephrased as

µ_j(u) − p(u − z) ≤ α ≤ p(u + z) − µ_j(u) for all u ∈ W_j. (4.55)

The rest of the argument is nearly the same as before. To show that there is an α ∈ R for which (4.55) is satisfied, it suffices to verify that

µ_j(u₁) − p(u₁ − z) ≤ p(u₂ + z) − µ_j(u₂) for all u₁, u₂ ∈ W_j. (4.56)

This is equivalent to saying that

µ_j(u₁ + u₂) ≤ p(u₁ − z) + p(u₂ + z) for all u₁, u₂ ∈ W_j. (4.57)

The subadditivity property of p(v) shows that this condition holds if

µ_j(u₁ + u₂) ≤ p(u₁ + u₂) for all u₁, u₂ ∈ W_j,

(4.58)

which is the same as

µ_j(u) ≤ p(u) for all u ∈ W_j. (4.59)

This is part of the “induction hypothesis” for W_j and µ_j in this setting, namely the counterpart of (4.50) for W_j and µ_j. Thus (4.56) is valid, and it is possible to choose α ∈ R with the desired feature. This completes the proof of Theorem 4.49.
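The chain of reductions from (4.56) to (4.59) can be written out in one place. This is a sketch that fills in the intermediate steps under the hypotheses of the proof; the supremum and infimum in the last line are one convenient way to describe the admissible values of α and are not notation from the text.

```latex
\begin{align*}
\mu_j(u_1) - p(u_1 - z) \le p(u_2 + z) - \mu_j(u_2)
  &\iff \mu_j(u_1) + \mu_j(u_2) \le p(u_1 - z) + p(u_2 + z) \\
  &\iff \mu_j(u_1 + u_2) \le p(u_1 - z) + p(u_2 + z), \\[4pt]
\mu_j(u_1 + u_2) \le p(u_1 + u_2)
  = p\bigl((u_1 - z) + (u_2 + z)\bigr)
  &\le p(u_1 - z) + p(u_2 + z), \\[4pt]
\sup_{u_1 \in W_j} \bigl[\mu_j(u_1) - p(u_1 - z)\bigr]
  \;\le\; \alpha \;&\le\;
  \inf_{u_2 \in W_j} \bigl[p(u_2 + z) - \mu_j(u_2)\bigr].
\end{align*}
```

The first two lines are the equivalence of (4.56) and (4.57), the third line combines (4.59) with subadditivity, and the last line records that any α between the supremum and the infimum satisfies (4.55). Both are finite because of (4.56).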

Now let us turn to convex cones. A subset C of V is called a convex cone if

x + y ∈ C whenever x, y ∈ C (4.60)

and

t x ∈ C whenever x ∈ C, t > 0.

(4.61)

In the second condition, t is a real number.

Clearly convex cones are convex sets. Conversely, C is a convex cone if it is convex and satisfies (4.61).
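Both directions of this remark take one line each. The following sketch writes them out, with t denoting a real number in (0, 1) in the first line.

```latex
\begin{align*}
\text{Cone} \Rightarrow \text{convex:}\quad
  & x, y \in C,\ 0 < t < 1
    \ \Longrightarrow\ t x,\ (1 - t) y \in C \ \text{by (4.61)}
    \ \Longrightarrow\ t x + (1 - t) y \in C \ \text{by (4.60)}. \\
\text{Convex and (4.61)} \Rightarrow \text{(4.60):}\quad
  & x, y \in C
    \ \Longrightarrow\ \tfrac12 x + \tfrac12 y \in C
    \ \Longrightarrow\ x + y = 2\bigl(\tfrac12 x + \tfrac12 y\bigr) \in C.
\end{align*}
```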

A subset C of V is called an open convex cone, or a closed convex cone, if it is a convex cone and if it is open or closed, respectively, as a subset of V . To define open and closed sets in V , we use a linear isomorphism between V and Rⁿ, where n is the dimension of V . Which isomorphism one employs does not matter, because invertible linear mappings on Rⁿ are homeomorphisms.

For example, if C is the set of x = (x₁, . . . , x_n) in Rⁿ such that the coordinates x_i of x are all positive, then C is an open convex cone in Rⁿ. If C is the set of x ∈ Rⁿ such that the coordinates x_i are all nonnegative, then C is a closed convex cone.

Here is a simple recipe for producing convex cones. If C₀ is a convex set in the vector space V , then consider the set

{t x : t > 0, x ∈ C₀}.

(4.62)

One can check that this is a convex cone in V , and in fact it is an open convex cone if C₀ is an open convex subset of V . If C₀ is a compact convex subset of V which does not contain 0, then

{t x : t > 0, x ∈ C₀} ∪ {0}

(4.63)

is a closed convex cone in V .

If P is an affine plane in V that does not pass through the origin, and C₀ is contained in P , then (4.62) and (4.63) define convex cones whose intersection with P is exactly C₀.

To avoid degeneracies, one sometimes asks that a convex cone C in V not be contained in a proper vector subspace of V . One also sometimes asks that a convex cone C not contain any lines. Let us make the standing convention that

if C is a closed convex cone, then 0 ∈ C.

(4.64)

In other words, a closed convex cone should not be the empty set.

Now suppose that a norm ‖ · ‖ on V has been chosen. If C is a nonempty convex cone in V , define p(v) on V by

p(v) = inf{‖v − x‖ : x ∈ C}.

(4.65)

In other words, p(v) is the distance from v to C relative to the norm ‖ · ‖. It is not hard to verify that p(v) is a sublinear function when C is a nonempty convex cone. Note that the closure of C is a closed convex cone which leads to the same function p(v).
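As an illustration, take V = Rⁿ with the Euclidean norm and let C be the cone of points with nonnegative coordinates. In that case the distance (4.65) has a closed form, because the nearest point of C to v is obtained by replacing the negative coordinates of v with 0. The sketch below assumes this particular cone and norm, uses hypothetical function names, and checks sublinearity only on random samples.

```python
import math
import random

def dist_to_orthant(v):
    """Euclidean distance from v to the closed cone {x : x_i >= 0 for all i}.

    The nearest point of this cone to v is v with its negative coordinates
    replaced by 0, so the distance is the norm of the negative part of v.
    """
    return math.sqrt(sum(min(a, 0.0) ** 2 for a in v))

def worst_violations(p, dim=4, trials=2000, seed=1):
    """Largest sampled values of p(v+w) - p(v) - p(w) and |p(tv) - t p(v)|."""
    rng = random.Random(seed)
    worst_subadd = -math.inf
    worst_homog = 0.0
    for _ in range(trials):
        v = [rng.uniform(-3, 3) for _ in range(dim)]
        w = [rng.uniform(-3, 3) for _ in range(dim)]
        t = rng.uniform(0, 4)
        v_plus_w = [a + b for a, b in zip(v, w)]
        t_v = [t * a for a in v]
        worst_subadd = max(worst_subadd, p(v_plus_w) - p(v) - p(w))
        worst_homog = max(worst_homog, abs(p(t_v) - t * p(v)))
    return worst_subadd, worst_homog

subadd_gap, homog_gap = worst_violations(dist_to_orthant)
print(subadd_gap <= 1e-9, homog_gap <= 1e-9)
print(dist_to_orthant([1.0, 2.0]), dist_to_orthant([3.0, -4.0]))
```

Points of C have distance 0, and a point such as (3, −4) has distance 4, the size of its negative part.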

If C is a convex cone in V , then we can associate to it a subset C^∗ of the dual space V^∗ of V , namely, the set of linear functionals on V which are nonnegative on C. Actually, this makes sense for any subset C of V , but if C is not already a convex cone, then one can take the convex cone that it generates, and this convex cone will define the same set in V^∗. Similarly, the closure of C leads to the same set in the dual, and adding 0 to C if it is not already an element of C does not affect C^∗.

It is not hard to check that C^∗ is always a closed convex cone in V^∗, containing 0 in particular. This is called the dual cone to C.

If one takes C to be the degenerate cone consisting of only 0, then the dual cone is all of V^∗. Similarly, if C is all of V , then the dual cone consists of only the zero linear functional.

Suppose that V = Rⁿ, and for C let us consider the closed convex cone of points x all of whose coordinates are nonnegative. A linear functional λ on Rⁿ can be given as

λ(x) = ∑_{i=1}^{n} a_i x_i (4.66)

for some real numbers a₁, . . . , a_n, and this linear functional is nonnegative on C exactly when a_i ≥ 0 for each i.
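The verification is short. In the sketch below, e_i denotes the i-th standard basis vector of Rⁿ, a notation introduced here for illustration.

```latex
\begin{align*}
\lambda \ge 0 \text{ on } C
  &\ \Longrightarrow\ a_i = \lambda(e_i) \ge 0 \text{ for each } i,
   \text{ since } e_i \in C, \\
a_i \ge 0 \text{ for each } i
  &\ \Longrightarrow\ \lambda(x) = \sum_{i=1}^{n} a_i x_i \ge 0
   \text{ whenever } x_1, \ldots, x_n \ge 0.
\end{align*}
```

In particular, under the identification of λ with (a₁, . . . , a_n), the dual cone of this C is again the cone of points with nonnegative coordinates.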

Let C be a closed convex cone in V , and let C^∗ be the dual cone in V^∗. Associated to it is the second dual cone C^∗∗ in V^∗∗. As in Section 4.3, we can identify V^∗∗ with V in a simple way, where each element of V defines a linear functional on V^∗ by evaluation. It is easy to see that C ⊆ C^∗∗. In fact,

C = C^∗∗. (4.67)

To verify this, one wants to show that an element v of V does lie in C if every linear functional in C^∗ is nonnegative at v. In other words, if v ∈ V does not lie in C, then one would like to say that there is a linear functional on V which is nonnegative on C, but negative at v. This can be established as in Section 4.8, or using Theorem 4.49. For the latter, one starts with a linear functional on the 1-dimensional subspace spanned by v, and one uses a sublinear function p associated to C as above. There is also a multiplication by −1 involved, for switching between nonnegativity and nonpositivity on C.
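One way to fill in the details of the argument through Theorem 4.49 is sketched below. It assumes that the sublinear function is the distance function (4.65) for some fixed norm on V ; the symbols W, µ, and λ are chosen here for illustration.

```latex
Let $C$ be a closed convex cone, $v \in V \setminus C$, and
$p(x) = \inf\{\|x - y\| : y \in C\}$ as in (4.65).
\begin{itemize}
\item $p(x) = 0$ for $x \in C$, and $p(v) > 0$ because $C$ is closed.
\item On $W = \{t v : t \in \mathbf{R}\}$ put $\mu(t v) = t\,p(v)$. Then
  $\mu(t v) = p(t v)$ for $t \ge 0$ and $\mu(t v) < 0 \le p(t v)$ for $t < 0$,
  so $\mu \le p$ on $W$.
\item Theorem 4.49 gives a linear functional $\widehat{\mu}$ on $V$ with
  $\widehat{\mu} = \mu$ on $W$ and $\widehat{\mu} \le p$ on $V$.
\item For $x \in C$ we get $\widehat{\mu}(x) \le p(x) = 0$, while
  $\widehat{\mu}(v) = p(v) > 0$.
\item Hence $\lambda = -\widehat{\mu}$ satisfies $\lambda \ge 0$ on $C$ and
  $\lambda(v) < 0$, so $\lambda \in C^{*}$ and $v \notin C^{**}$.
\end{itemize}
```

The last step is the multiplication by −1 mentioned in the text: Theorem 4.49 produces a functional that is nonpositive on C, and changing its sign gives an element of the dual cone.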

See Chapter III of [SteW2] for some very interesting topics concerning convexity, duality, Fourier transforms, and holomorphic functions on certain regions in Cⁿ.
