Geometry and Basic Feasible Solutions - Basic Feasible Solutions

Basic Feasible Solutions

4.1 Geometry and Basic Feasible Solutions

In the last few lectures, we developed geometric intuition that allowed us to find the optimal solution of linear programs with 2 and 3 variables. Then, we moved to higher dimensions by introducing definitions for extreme points. We saw that one of these extreme points will always be an optimal solution to a linear program, provided that the program is not infeasible or unbounded, and gave an algebraic characterisation of extreme points by introducing the notion of a basic feasible solution. At this point, we have reduced the problem of solving a linear program to the problem of finding the best basic feasible solution of that program. Before going further, let’s see an example bringing together some of these principles.

Let’s consider the following linear program in 2 variables:

maximise 4x₁+1 2x₂

subject to x₁ + x₂ ≤ 3 1

2x₁ + x₂ ≤ 2

−1/2x₁ + x₂ ≥ −1 x₁, x₂ ≥ 0

Notice that the last constraint that is a ≥ constraint instead of a ≤. Before sketching, we should always convert all of our constraints to ≤ constraints, so we multiply both sides by −1. We get the following equivalent linear program in standard inequality form:

maximise 4x₁+1 2x₂ subject to x1+ x2 ≤ 3

2x₁+ x₂ ≤ 2 1

2x₁− x₂ ≤ 1 x₁, x₂ ≥ 0

(4.2)

We want to see what the feasible region looks like, so we plot the boundary of each constraint. That is, we plot the lines we get when we change the ≤ signs into = signs. We get the following:

x1 + x2 =

21x₁ +x₂

2x¹− x²

x₁ x2

Which part of the picture corresponds to the feasible region? We need to figure out which side of each line the feasible region lies on. One easy way is to pick any point we want on each line and then draw the normal vector for our line.

For example, line ¹₂x₁ + x₂ = 2 is a linear equation of the form a^Tx = 2, where a^T= (¹₂, 1) and x = (x₁, x₂). So, we pick any point on this line and draw a vector starting at the point and moving 1/2 unit to right (corresponding to the 1/2 in the first coordinate of a) and 1 up (corresponding to the 1 in the second coordinate of a). Intuitively, the vector we have just drawn indicates the direction in (x₁, x₂) plane that corresponds to making the expression a^Tx = ¹₂x₁+ x₂ larger.

Our line shows us the values for which this expression is equal to 2. Because we want to also include values that are smaller than 2, our feasible solutions must all lie on the opposite side of our normal vector a.

If we repeat the same process for each different line, and consider the intersec-tion of all these regions, together with the non-negativity restricintersec-tions on x₁ and x₂, we get a picture that looks like this:

x1 + x2 =

21x₁ +x₂

2x¹− x²

x₁ x2

Normally, we would finish by sketching the direction c corresponding to the ob-jective function (here, c^T= (4, 1/2)) and use it to solve our program. Instead, let’s see how this might be done using calculation and our notion of basic feasible solutions. We saw in the last lecture that any linear program that has an optimal solution must have an optimal solution that is also a basic feasible solution. But, what do these basic feasible solutions look like? We saw that they correspond to extreme points, which are like corners of our feasible region. Let’s see in more detail exactly how this correspondence works.

In order to talk about basic feasible solutions, we need to rewrite our program into standard equation form. We have 3 inequalities in 4.2. We introduce a new slack variable for each of them to get the following program:

maximise 4x₁+ 1 2x₂

subject to x₁ + x₂+ s₁ = 3 1

2x₁ + x₂+ s₂ = 2 1

2x₁− x₂+ s₃ = 1 x₁, x₂, s₁, s₂, s₃ ≥ 0

(4.3)

We used to have n = 2 variables and m = 3 constraints. Now, we have n + m = 5 variables: 2 original variables plus one new slack variable corresponding to each constraint. We can write our system of equations more succinctly in matrix form

as:

Remember that in a basic feasible solution, we require that all the variables that take non-zero values correspond to a linearly independent set of columns in the matrix above. Since our matrix has m = 3 rows, at most m = 3 columns can be linearly independent, and we should expect that any basic feasible solution will select at most 3 variables to be set to non-zero values, or, equivalently, selecting 3 linearly independent columns of our matrix. Suppose that we select columns 1, 2, and 5 (note that these are indeed linearly independent). Then, in a corresponding basic feasible solution we are allowing x₁, x₂ and s₃ to take non-zero values, and requiring that s₁ and s₂ be set to zero. We say that x₁, x₂, and s₃ are the basic variables in this solution, and that s₁ and s₂ are the non-basic variables.

Let’s now see what the effect of this choice is. Since we have s₁ = 0, then the first equation in 4.3 reads:

x₁+ x₂ = 3

This is exactly the same as saying that the constraint corresponding to slack variable s₁ is tight in our original program (4.2). Similarly, since we have s₂ = 0, then we must have

2x₁+ x₂ = 2

and so our second constraint in (4.2) is tight. Consider the following picture, in which we have drawn the equations corresponding to each of these tight constraints:

2x¹− x²

=1 x1 +

x2 = 3

21x₁ +x₂

x₁ x2

If set s₁ and s₂ to zero and write out the system of constraints for our standard equation form program (4.3), we get:

x1+ x2 = 3 1

2x₁+ x₂ = 2 1

2x₁− x₂+ s₃ = 1

We find that this set of equations has a unique solution, namely x₁ = 2, x₂ = 1, s₃ = 1. This, then is our basic feasible solution, which was obtained by selecting a linearly independent set of columns 1,2, and 5 and then letting x₁, x₂, s₃ be our basic variables. If we plot just the x₁ and x₂ values for this solution on our picture, we get exactly the intersection point of these 2 red lines, which is also an extreme point of the feasible region. We showed in the last lecture that this will always happen—basic feasible solutions correspond exactly to extreme points of our feasible region.

So, we see that whenever a slack variable in our standard equation form pro-gram (4.3) is non-basic (that is, set to zero), it means that its corresponding constraint in our original program (4.2) is tight, and that this means that x₁ and x₂ must lie on this constraint’s boundary line. What if one of x₁ or x₂ is set to zero? Remember that we actually have 2 extra boundaries in our feasible region, corresponding to the restrictions x₁ ≥ 0 and x₂ ≥ 0 in both of our programs. If, for example x₁ = 0, then one of these is tight. Indeed, suppose that we choose

columns 2,4, and 5 in our matrix to be basic. Then, in any corresponding basic feasible solution, we must have x₁ = 0 and s₂ = 0 (so ¹₂x₁+ x₂ = 2). If we draw these tight restrictions/constraints, we see:

x1 + x2 =

12x¹− x²

1 2x₁

+x₂

x1=0

x₁ x2

Again, we see that this gives us an extreme point of our feasible region. If we set x₁ and s₁ to zero our constraint equations become:

x₂+ s₁ = 3 x₂ = 2

−x₂+ s₃ = 1

which has a unique solution x₂ = 2, s₁ = 1, s₃ = 1. Looking at the variables x₁ and x₂ from the original problem (4.2), we find x₁ = 0, x₂ = 2. So, again, our basic feasible solution lies at the intersection of 2 lines, which is an extreme point of the feasible region.

As a last example, let’s suppose we choose columns 1, 4 and 5 of our matrix.

This will give us basic variables x₁, s₂, s₃ and non-basic variables x₂, s₁. As before, we will have 1 tight constraint, giving us x₁ + x₂ = 3, and one tight restriction, giving us x₂ = 0. When we draw these get:

21x₁ +x₂

2x¹− x²

=1 x1 +

x2 = 3

x₂ = 0

x₁ x2

Now, we find that our intersection point is strictly outside the feasible region. How could this be? Let’s check our equations. If we set x2 and s1 to zero, we get:

x₁ = 3 1

2x₁+ s₂ = 2 1

2x₁+ s₃ = 1

These equations have a unique solution x₁ = 3, s₂ = ¹₂, s₃ = −¹₂. Indeed, if we check, the point x₁ = 3, x₂ = 0 corresponds to the intersection in our picture.

However, notice that we can tell right away that this solution is not feasible, since it sets s₃ = −1, but (4.3) required that all variables were non-negative. The constraint in (4.2) that corresponds to the slack variable s₃ is ¹₂x₁ − x₂ ≤ 1, and this is exactly the constraint that we are on the wrong side of!

In summary: Suppose we write a linear program with m inequality constraints and n variables in standard equation form with a matrix A that has m + n columns (n for the original variables, and m for the slack variables). Then any m linearly independent columns of A will correspond to a possible set of basic variables.

We set the other non-basic variables to zero, which will correspond to n tight constraints/restrictions in our original program. You can show that any way of doing this will always result in a single, unique solution for the values of the basic variables (see this week’s homework), and so in fact the n tight constraints

corresponding to a basic feasible solution will always be linearly independent. If this solution sets all of the basic variables to be non-negative, then it is a basic feasible solution.

In document Week 1. Introduction and Basics. 1.1 Introduction Linear Programming (Page 55-63)