Robust optimization - UFO as a generalization

4.3 UFO as a generalization

4.3.2 Robust optimization

Consider a general linear program with m constraints and n variables x = {xj | j = 1, . . . , n}, a constraint matrix A = {aij | i = 1, . . . , m, j = 1, . . . n}, cost coefficients c ={cj | j = 1, . . . , n} and upper and lower bound vectors l and u.

In the problem, only coefficients a_ij are uncertain; the uncertainty set U is char-acterized by the sets J_i containing the indices of the uncertain coefficients for each row i = 1, . . . , m. Each coefficient satisfies a_ij∈ [a_ij− ^a_ij, a_ij+ ^a_ij].

Given a solution x, the worst coefficient realization at row i is given by

β_i(x, Γ_i) = max

where Γ_ilimits the number of simultaneously varying coefficients: bΓ_ic coefficients can take any value in [a_ij − â_ij, a_ij + â_ij], and one coefficient takes value in [a_ij− (Γ_i−bΓ_ic)â_ij, a_ij+ (Γ_i−bΓ_ic)â_ij].

With this notation, the robust optimization problem of Bertsimas and Sim (2004) is the following problem¹:

z^∗_ROB=min c^Tx (4.20)

s.t. X

j=1,...,n

a_ijx_j+ β_i(x, Γ_i)≤ b_i ∀i = 1, . . . , m, (4.21)

l ≤ x ≤ u. (4.22)

We define the complementary function of β_i(x, Γ_i) as

β_i(x, Γ_i) = min

which, given a solution x, corresponds to the value of the |Ji| − Γi coefficients that contribute least to the total deviation P

j=1,...,na^_ij|xj|.

To illustrate the two functions, consider the example with a unique constraint (m = 1), 3 variables (n = 3) and J₁ ={1, 2, 3}, i.e. all coefficients are changing. Now,

1Bertsimas and Sim (2004) use a maximization problem; we transform it to a minimization problem to match our framework and replace the yjvariables, −yi≤ xj≤ yj by|xj|, ∀j = 1, . . . , n

suppose Γ1 = 1.4, x = (2, 3, 1)^T and that ^a1j = (2, 0.5, 1.2)^T. The worst scenario for the current solution and Γ₁ is β(x, 1.4) = 4.6 (using S₁ ={1} and t1 ={2}) and the best scenario when |J1| − Γ1 = 1.6 coefficients take value a_ij+ ^a_ij is β(x, 1.4) = 2.1 (using S1 ={3} and t1 ={2}). We see that β(x, 1.4) + β(x, 1.4) = 6.7 = β(x, 3) = β(x, 0).

This example illustrates the following complementarity theorem.

Theorem (Complementarity) Let

β_i(x, Γ_i) = max

{Si∪{ti}|Si⊆J_i,|Si|=bΓic,t_i∈J_i\Si}

j∈Si

a_ij|xj| + (Γi−bΓ_ic)^a_it_i|xti|

and

β_i(x, Γ_i) = min

{Si∪{ti}|Si⊆J_i,|Si|=b|Ji|−Γic,t_i∈J_i\Si}

j∈S_i

a_ij|xj| + (|Ji| − Γi−b|Ji| − Γic)^a_it_i|xti|

Then the following relation holds:

β_i(x,|Ji|) = βi(x, Γ_i) + β_i(x, Γ_i).

Proof :

For a fixed vector x and a fixed constraint i ∈ {1, . . . , m}, let S^∗_i ∪{t^∗_i} be the optimal set maximizing β_i(x, Γ_i) and S^∗_i ∪{t^∗i} the optimal set minimizing βi(x, Γ_i).

We assume, w.l.o.g. that the |Ji| changing coefficients are ordered with respect to increasing ^aij|xj|. Then, the b|Ji| − Γic first ones are in S^∗_i and, similarly, the bΓic last ones are in S^∗_i and S^∗_i ∩ S^∗_i =∅.

When Γ_i is integer, then S^∗_i ∪ S^∗_i = J_i and the theorem is trivially proved. We therefore assume that Γi is non-integer. In this case, S^∗_i ∩ S^∗_i = ∅ implies that both indices of the coefficients considered only as fractionally varying are the same, i.e. t^∗_i = t^∗_i; the fractionally varying coefficient in both cases is the one at position b|Ji|−Γic+1.

Additionally, as Γi is non-integer, the following holds:

b|Ji| − Γic =|Ji| − bΓic − 1.

Let us sum all terms of S^∗_i and S^∗_i using the previous equality and the fact that

4.3. UFO AS A GENERALIZATION 63 In the next two paragraphs we address two different cases when using formulation (4.20)-(4.22). First, we consider the problem of finding a robust solution with given values of the parameters Γ_i and we show that a tighter formulation can be obtained using the UFO framework. Next, we determine values of the parameters Γ_i such that a robust solution exists and we show that, using the UFO framework, we are able to derive an algorithm to determine lower bounds for Γ_i, i = 1, . . . , m that guarantee the existence of a robust solution.

Equivalent UFO formulation. We suppose for now that the parameters Γ_i are given and fixed and that a robust solution satisfying (4.20)-(4.22) exists. We first focus on the feasibility of the solution for all possible scenarios in U, we apply the framework to this feasibility problem and show that an appropriate choice of UF and ρ lead to an equivalent formulation to (4.16)-(4.18).

Let (F) be the following feasibility problem:

(F) z^∗_F =min_x∈X{g(x)}

where g(x) is the value of the most violated constraint in the worst scenario when all the|Ji| coefficients of row i vary. We refer to the case when Γi=|Ji|, i = 1, . . . , m as

the unbounded worst-case. the set of feasible solutions X describes the set of solutions defined by (4.21)-(4.22). A solution with g(x) ≤ 0 is a solution that is feasible on the whole uncertainty set U.

If z^∗_F ≤ 0, i.e. at least one robust solution exists, we set the budget constraint as g(x) ≤ 0 and UFO leads to the robust solution that has lowest cost, which is what is sought. If z^∗_F < 0, it does not make sense to restrict the search space to a subset of the set of all robust solutions, i.e. by setting the budget constraint to be strictly negative, and we still adopt the budget constraint g(x) ≤ 0.

We assume that z^∗_F > 0, i.e. no robust solution exists on U and we denote the optimal solution of (F) by x^∗. We apply the UFO framework using µ(x) = −c^Tx, i.e. the original cost function with negative sign as UF. As required, µ and g are inversely correlated because of the price of robustness (Bertsimas and Sim, 2004).

Additionally, maximizing µ increases the performance of the solution: the cost is decreased. We then apply the budget constraint on each function gi(x) individually, i.e. adding constraints

g_i(x) ≤ (1 + ρ_i)z^∗_F ∀i = 1, . . . , m, where (1 + ρ_i)z^∗_F depends on the value of β_i(x^∗, Γ_i):

(1 + ρ_i)z^∗_F =

min_{k|β

k(x^∗,Γk)>0}

β_k(x, Γ_k)

if β_i(x^∗, Γ_i) > 0,

0 if β_i(x^∗, Γi) = 0. (4.23)

Note that there is at least one i such that β_i(x^∗, Γi) > 0: indeed, if β_i(x^∗, Γi) = 0

∀i, we obtain from the complementarity theorem:

g_i(x^∗) = Xn

j=1

a_ijx_j+ β_i(x,|Ji|) − bi = Xn

j=1

a_ijx_j+ β_i(x, Γ_i) − b_i.

Now, as we supposed a solution to (4.20)-(4.22) exists, there is at least one solution such that P_n

j=1a_ijx_j+ β_i(x^∗, Γ_i) − b_i ≤ 0, which contradicts the optimality of x^∗ as z^∗_F > 0.

If non-zero, (1 + ρ_i)z^∗_F corresponds to the smallest constraint violation of the unbounded worst-case with respect to the bounded one, i.e. the additional |Ji| − Γi simultaneously varying coefficients. Concretely, adding these budget constraints extends the solution space for all constraints that do not satisfy the bounded worst-case, i.e. they restrict the worst-case by limiting the number of simultaneously varying coefficients.

4.3. UFO AS A GENERALIZATION 65

We obtain the UFO formulation (F’):

z^∗_F0 =min c^Tx (4.24)

s.t. X

j=1,...,n

a_ijx_j+ β_i(x,|Ji|) − (1 + ρi)z^∗_F ≤ b_i ∀i = 1, . . . , m, (4.25)

x ∈ X. (4.26)

For all i such that β_i(x^∗, Γ_i) = 0, it is clear that β_i(x,|Ji|) ≥ βi(x, Γ_i) as Γ_i ≤|Ji|;

therefore, due to (4.23), solutions satisfying constraint (4.25) always satisfy con-straints (4.21). Furthermore, for concon-straints i such that β_i(x^∗, Γ_i) > 0, using the theorem, we get:

βi(x,|Ji|) − min

{k|βk(x^∗,Γk)>0}β_k(x, Γk)≥ βi(x,|Ji|) − βi(x, Γi) = βi(x, Γi).

(F⁰) is a tighter formulation than problem (4.20)-(4.22) since it is robust for a larger uncertainty set.

Determining parameters Γ_i. The approach of Bertsimas and Sim (2004) assumes values Γ_ias given parameters. It is however not easy to determine values that provide a sufficient protection level while guaranteeing the existence of a solution. Using UFO allows to derive an algorithm to determine, in a finite number of iterations, a lower bound on values of Γ_i such that a bounded robust solution exists or leads to the proof that the solution set X defined by constraints (4.21)-(4.22) is empty.

At each iteration k of the algorithm, we consider the problem z^(k)_F ^∗ =min g(x)

s.t.

g(x) ≤ (1 + ρ^(k−1))z^(k−1)_F ^∗ x ∈ X.

We denote the optimal solution at iteration k by x^∗_k (or (x_j)^∗_k for a single variable).

For the algorithm, we use a different definition for 1 + ρ^(k−1) than previously: we replace min by max to get the following definition:

1 + ρ^(k−1)=





maxi=1,...,m

β^(k−1)_i (x,Γ_i^(k−1))

z^(k−1)∗_F if z^(k−1)_F ^∗ > 0,

0 otherwise.

The value of (1 + ρ^(k))is then determined using the following parameters:

Γ_i⁽⁰⁾ =|Ji|;

Γ_i^(k) =sup

0≤ Γ ≤ Γ_i^(k−1)| βi(x^∗_k, Γ )≥ z^(k)_F ^∗

, k6= 0.

We then iterate over k.

Theorem (Convergence)

Consider the iterative process defined above and consider the following algorithm:

1) set Γ_i⁽⁰⁾ =|Ji|;

2) Solve the problem finding the value of z^(k)_F ^∗; 3) IF z^(k)_F ^∗ ≤ 0 STOP:

there exists a robust solution for the given sets of Γ_i^(k), i = 1, . . . , m;

4) at iteration k, let i^∗ ∈{1, . . . , m} be the index of a function gi(x^∗_k) with value z^(k)_F ^∗, then:

find Γ_i^(k)∗ =sup{0 ≤ Γ ≤ Γi^(k−1)^∗ | β(x^∗k, Γ )≥ z^(k)_F ^∗};

5) IF no such Γ exists STOP: the set X = ∅;

ELSE set k = k + 1 and go back to 2).

The proposed algorithm converges in a finite number of iterations.

Proof :

From the complementarity theorem, we get that β_i(x, Γ ) is a decreasing function for increasing Γ (using the theorem with β_i(x^∗_k, Γ ) being an increasing function for increasing Γ ). Moreover, β_i(x,|Ji|) = βi(x, 0), as by definition β_i(x, 0) = 0.

We assume that the algorithm did not converge after iteration k − 1, i.e. that

z^(k−1)_F ^∗ > 0 and i^∗ ∈ {1, . . . , m} is an index such that gi^∗(x^∗_k) = z^(k)_F ^∗ and at least

Γ_i^(k−1)∗ > 0.

If no solution exists for

β_i^∗(x^∗_k, Γ )≥ z^(k)_F ^∗, 0≤ Γ ≤ Γ_i^(k−1)∗ ,

this holds in particular for the largest possible value of the left-hand-side, obtained with Γ = 0 as β_i^∗(x^∗_k, Γ ) decreases for increasing Γ . Therefore, β_i^∗(x^∗_k, 0) < z^(k)_F ^∗, which implies that

β_i^∗(x^∗_k,|Ji^∗|) < z^(k)F ^∗ = Xn

j=1

a_i^∗_j(x_j)^∗_k+ β_i^∗(x^∗_k,|Ji^∗|) − bi^∗. Reordering the previous result, we obtain

Xn j=1

a_i^∗_j(x_j)^∗_k > b_i^∗,

4.3. UFO AS A GENERALIZATION 67

which contradicts x ∈ X, since X is defined by constraints Ax ≤ b and we proved that X = ∅.

Let us now prove that the sequence is not stationary by contradiction: We assume that Γ_i^(k) = Γ_i^(k−1) for all i: in particular, this is also true for i^∗. As Γ_i^(k)∗ is obtained by Γ_i^(k)∗ =sup{0 ≤ Γ ≤ Γi^(k−1)^∗ | β(x^∗k, Γ )≥ z^(k)_F ^∗}, then the following holds:

β_i∗(x^∗_k, Γ_i^(k)∗ ) = β_i∗(x^∗_k, Γ_i^(k−1)∗ )≥ g_i^∗(x^∗_k), and by reordering the previous inequality:

Xn j=1

a_i^∗_j(x_j)^∗_k+ β_i^∗(x^∗_k,|Ji^∗|) − βi^∗(x^∗_k, Γ_i^(k)∗ )≤ b_i^∗.

Using the complementarity problem, this leads to Xn

j=1

a_i^∗_j(x_j)^∗_k+ β_i^∗(x^∗_k, Γ_i^(k)∗ )≤ b_i^∗,

i.e. the solution is robust for Γ_i^(k) simultaneously changing coefficients and z^(k)_F ^∗ ≤ 0 and the iterative process converged.

For a non stationary solution, Γ_i^(k) < Γ_i^(k−1) for at least i = i^∗. Moreover, we know that a solution of Γ_i^(k)∗ exists, otherwise we would have proved that X = ∅.

Finally, at iteration k + 1, all functions satisfy f_i(x) ≤ β_i^∗(x^∗_k, Γ_i^(k)∗ ), for all x ∈ X, the inequality being strict at least for i = i^∗. The values of z^(k)_F ^∗ are therefore strictly decreasing as well, for increasing k.

We proved that the method eventually converges either to a solution with z^(k)_F ^∗ ≤ 0, or we obtain Γ_i^(k)= 0for all i = 1, . . . , m, meaning no solution for Ax ≤ b exists.

2 In the special case where Γ_i^(k) are restricted to integers, the method converges in at most n × m iterations.

When used to derive the approach of Bertsimas and Sim (2004), the UFO frame-work has similarities with the light robustness of Fischetti and Monaci (2008): both methods adopt a budget constraint. However, the objective of light robustness is based on an uncertainty characterization: it aims at finding the solution with lowest constraint violation in the worst-case. UFO is a generalization of the approach, as the LR and HLR violation methods proposed in the paper can be formulated as UFs.

The main difference is that Fischetti and Monaci (2008) start from the original cost minimization problem, aiming at minimizing the constraint violation and characterize the worst violation, i.e. the worst scenario, according to the optimal solution of the deterministic problem. We believe that this is not correct in general, as the charac-terization of the worst scenario depends on a solution and should be evaluated for

each of them independently, as it is the case in Bertsimas and Sim (2004): the worst scenario is characterized by the function β_i(x, Γ_i), which clearly depends on solution x; with this notation, Fischetti and Monaci (2008) characterize the worst scenario as βi(x^∗, Γi) for each solution x ∈ X, x^∗ being the optimal solution of the deterministic problem; their worst scenario is constant with respect to changing solutions x. The methodology leads to a heuristic way to compute maximal values of Γ_i such that, if it exists, a bounded worst-case robust solution exists.

4.4 Illustration on the Multi-Dimensional

In document Combining robustness and recovery for airline schedules (Page 77-84)