Probabilistic constraints - Stochastic programming models with applications

mathematical program assigned to ω might be a linear program or a general nonlinear program. In the nonlinear case, the set Cωis given by a system of inequalities (gω,i ≤ 0)^m_i=1, where g_ω,i : Rⁿ→ R are given functions. The precise meaning is this:

C_ω =

i=1

{x | x ∈ Rⁿ and g_ω,i(x) ≤ 0} . (3.13) The objective function f_ω is usually given by an expression, which makes sense for all x ∈ Rⁿ. So it is natural to have the objective function fω : Rⁿ → R, instead of f_ω : C_ω → R, given by the same expression.

Our fω is extended now, but without including the information about feasibility. This information is contained in the values of the functions g_ω,i at x. Consider a suitable loss function t_ω : R^m+1 → R and define fω by composition as

f_ω(x) = t_ω

g_ω,1(x), . . . , g_ω,m(x), f_ω(x)

∀x ∈ Rⁿ.

If desired, the function t can be defined in such a way, that f_ω is indeed an extension of fω. It is particularly easy to see in case of linear programming model. The linear program reads as

minn

c^T_ωx | A_ωx = b_ω and x ≥ 0o and the functions g_ω,i are linear and can be represented as

g_ω,i(x) =

A^T_ωx − b_ω

where the index i refers to the i-th component of the right-hand side vector. If we set, for example, t_ω(x₁, . . . , x_m) = Pm+1

i=1 x_i, the composed mapping f_ω will coincide with f_ω on C_ω, because all the functions g_ω,i have zero value on C_ω.

3.8 Probabilistic constraints

The concept of probabilistic constraints naturally follows from the considerations regard-ing feasibility presented in the previous section. Recall that for each ω, we have a collection of m functions g_ω,i, each of them defined on the whole Rⁿ. The set C_ω assigned to ω is uniquely specified by (3.13).

It is important to note that every ω is assigned the same number m of constraints. The applications of stochastic programming often arise from deterministic (linear or nonlin-ear) problems by introducing the randomness to the coefficients of the constrains. In this situation, although the constraints are different for each ω, their number stays the same in all cases.

In the very general situation we obviously don’t need to limit ourselves to this setting.

Generally, if there exists an ω^∗ with maximal number of constraints, we can add the ap-propriate number of constraints to the remaining members of Ω. These added constraints

can be represented by constant functions (with constant value 0 or -1, for example) on the whole Rⁿ. This guarantees that their adding doesn’t affect the original feasible regions C_ω. Our effort up to now has been to extend the objective functions f_ω to Rⁿ, in order to be able to evaluate every possible decision, no matter if it is feasible for certain ω or not. Having done this, we can solve the unconstrained problem, using for example the expected objective approach (see section (3.5)).

Now, we present a possible way of restriction of the domain Rⁿ. For every fixed x ∈ Rⁿ and every i ∈ {1, . . . , m} (recall the fixed number of constraints) we define a function G_x,i: Ω → R by

Gx,i(ω) = gω,i(x). (3.14)

Provided all the functions G_x,i are measurable, we have obtained a collection of m random variables at every point x. We will assume the measurability of the functions G_x,i in the sequel.

Consider a particular random variable, say G_x^∗_,1 and the set S_x^∗ = {ω|G_x^∗_,1(ω) ≤ 0}.

Since Gx^∗,1 is measurable, the set Sx^∗ ⊂ Ω belongs to the σ-algebra F and has assigned a probability P (S_x^∗). If all the functions g_ω,1 are less or equal than 0 at x^∗, meaning that the first constraint is satisfied for all ω at x^∗, then G_x^∗_,1 ≤ 0 for every ω and therefore Sx= Ω and P (Sx) = 1. On the other hand, if for some ω the first constraint at x^∗ is not satisfied (g_ω,1 > 0), we can expect the the probability of S_x^∗ to be less than 1. If the first constraint is violated at x^∗ for each ω, then obviously S_x^∗ = ∅ and P (S_x^∗) = 0.

Let’s build a new function, that assigns the probability P (S_x) to every x ∈ Rⁿ: H₁ : Rⁿ→ [0, 1], H₁(x) = P

{ω|G_x,1(ω) ≤ 0}

. (3.15)

The function H₁ classifies the decisions x according to the probability of the set of events, for which the first constraint is satisfied at x. So, for example the set {x|H₁(x) > 0.95}

has a natural interpretation as the set of decisions, for which the first constraint is satis-fied with probability greater than 0.95.

We will now use a similar construction taking the m-tuple of functions (g_ω,i)^m_i=1 at once.

For a fixed x, find the set

S_x = { ω | G_x,i(ω) ≤ 0 for i ∈ {1, . . . , m}} =

= {ω | G_x,1(ω) ≤ 0} ∩ {ω | G_x,2(ω) ≤ 0} ∩ . . . ∩ {ω | G_x,m(ω) ≤ 0} .

The set S_xbelongs to F , being an intersection of sets belonging to F . We define a function H : Rⁿ→ [0, 1] by

H(x) = P (S_x) = P

{ ω | G_x,i(ω) ≤ 0 for i ∈ {1, . . . , m}}

. (3.16)

For a finite probability space, we have H(x) = 1 if and only if all the constraints are satisfied at x for all ω, that is if and only if x ∈ C_ω, meaning that x is always feasible.

We can see that the following identity holds:

H(x) = P ({ω | x ∈ C_ω}). (3.17)

3.8. Probabilistic constraints 25 Note that the expression (3.17) itself cannot be used to define the function H. For general sets C_ω we cannot decide whether {ω | x ∈ C_ω} ∈ F . We are able to show or state it with help of the functions gω,i specifying Cω, and definig then H by (3.16).

Example: The program

min f (x)

s.t. H(x) ≥ 0.9,

x ≥ 0,

minimizes the objective function on a set of nonnegative decisions, for which the proba-bility of being feasible is greater or equal 0.9. y

The construction we have described is usually presented in a less clear and more intuitive way: and for example the equation H(x) ≥ 0.9 is written as

This notation looks like a recipe saying ”just add probability”. We will summarize our results and join them with the ideas from the previous section.

Given programs (C_ω, f_ω), with C_ω ⊂ Rⁿ, we use the extensions to define an objective function f over Rⁿ. If the sets C_ω are given by m constraints g_ω,i ≤ 0 as in (3.13) and the functions G_x,i given by (3.14) are measurable, we can use the function H defined in (3.16) to additionally restrict the domain of f to a reasonable subset C of Rⁿ. The program (C, f ) then reads as

min f (x)

s.t. H(x) ≥ α,

where α ∈ [0, 1], and is called the program with joint probabilistic constraints.

If we use the functions Hi defined as in (3.15) instead of H, we obtain the program min f (x)

s.t. H₁(x) ≥ α₁, H₂(x) ≥ α₂,

...

H_m(x) ≥ α_m,

which is said to be with separate probabilistic constraints. Obviously, we can pro-duce more general settings by coupling certain subsets of the constraints together.

In document Stochastic programming models with applications (Page 33-36)