Algorithm design: PICEA-w - Preference-inspired Co-evolutionary Algorithms

4.4 PICEA-w

4.4.1 Algorithm design: PICEA-w

Similar to a general decomposition based algorithm, PICEA-w decomposes a MOP into a set of single objective problems that are defined by different weighted scalarising functions. The main feature of PICEA-w is that the scalarising functions’ underlying weights are adaptively modified in a co-evolutionary manner during the search. Specifically, in PICEA-w candidate solutions are ranked by each of the weighted scalarising functions, creating a ranking matrix. The fitness of candidate solutions is then calculated based on the ranking matrix. Weights are co-evolved with the candidate solutions towards an optimal distribution, and are also responsible for striking a balance between exploration and exploitation.

We implement PICEA-w within a (µ+λ) elitist framework as shown in Figure 4.10. A population of candidate solutions and weight vectors, S and W, of fixed size, N and Nw, are co-evolved for a number of generations. At each generation t, parents S(t) are

Nw new weight vectors, W c(t), are randomly generated. S(t) and Sc(t), W(t) and

W c(t), are then pooled respectively and the combined populations are sorted according to their fitness. Truncation selection is applied to select the best N solutions as a new parent population, S(t + 1), and Nw solutions as a new preference population,

W(t+ 1). Additionally, an offline archive is employed to store all the non-dominated solutions found during the search. Evenly distributed solutions are obtained by using the clustering technique described inZitzler et al. (2002).

Figure 4.10: A (µ+λ) elitist framework of PICEA-w.

Algorithm 2:Preference-inspired co-evolutionary algorithm using weights

Input: Initial candidate solutions, S of size N, initial weight vectors,W of size Nw,

the maximum number of generations,maxGen, the number of objectives, M, archive size,ASize

Output: S, W, offline archive, BestF

1 BestF ← ∅;

2 S ←initialiseS(N);

3 F S← objectiveFunction(S);

4 BestF ← updateArchive(BestF, F S);

5 W ← weightGenerator(N_w);

6 forgen←1 tomaxGendo

7 Sc← geneticOperation(S,F S); 8 F Sc← objectiveFunction(Sc);

9 (J ointS, J ointF)← multisetUnion(S, Sc,F S, F Sc);

10 W c← weightGenerator(Nw);

11 J ointW ← multisetUnion(W, W c);

12 θ← thetaConfiguration(gen, π/2);

13 (S, F S, W)←coEvolve(J ointF, J ointS, J ointW, θ);

14 BestF ← updateArchive(BestF, F S, ASize);

15 end

The pseudo-code of PICEA-w is presented in Algorithm 2. In the following we explain the main steps of PICEA-w.

• Line 1 initialises the offline archiveBestF as∅.

• In lines 2 and 3, N candidate solutions S are initialised and their objective values F S are calculated. The offline archive, BestF is updated by function updateArchivein line 4.

• Line 5 applies functionweightGeneratorto generate Nw random weights.

• The offspring candidate solutionsScare generated by functiongeneticOperation in line 7. Their objective valuesF Scare calculated in line 8. S and Sc,F S and F Scare pooled together, respectively in line 9.

• Line 10 generates another set of weights W c. In line 11, W and W c are pooled together.

• Line 12 sets the parameterθ which would be used in functioncoEvolveto implement a local operation.

• Line 13 co-evolves the joint candidate solutions J ointS and the joint weights

J ointW, and so to obtain new parentsS and W.

• Line 14 updates the offline archive with newly obtained solutions F S.

Note that functionweightGeneratorforms a new weight setW cwithNwweight vectors

that are generated according to Equation 4.5 (Jaszkiewicz, 2002). Alternatively, W c can be formed by randomly samplingNw weights from an evenly distributed candidate

weight set, Ψ. Ψ can be created by the simplex-lattice design method (as described on p.107). Genetic operators are not applied to generate offspring weight vectors, the reason is the same as the case in PICEA-g, which is described in p. 56. In addition, functions geneticOperation and updateArchive are the same as that used in PICEA-g, see p. 56.

FunctionthetaConfigurationadjusts parameterθby Equation4.8, that is,θincreases linearly from a small value to π₂ radians.

θ= π

2 × gen

maxGen; (4.8)

The use of θ implements a local selection at the early stages of the evolution and implements a global selection at the later stages of the evolution. A local selection refers to a candidate solution that only competes with its neighbours, and a global selection refers to a candidate solution that competes with all the other candidate solutions. The benefits of this strategy will be illustrated later (in Section 4.7.2).

The core part of PICEA-w lies in the function coEvolvewhich will be elaborated next. FunctioncoEvolveevaluates the performance of candidate solutions and weight vectors, and then constructs new parent populationsSandW from the joint populationsJ ointS

andJ ointW, respectively. A candidate solution gains high fitness by performing well on a large number of weighted scalarising functions. A weight vector only gains fitness by being rewarded from the candidate solutions that are ranked as the best by this weight. In other words, a weight vector is considered to be good only if it has contribution on the survived candidate solutions. The pseudo-code of functioncoEvolveis as follows.

FunctioncoEvolve(J ointS, J ointF, W)

Input: The joint populationsJ ointS, J ointF, J ointW, the parameter θ Output: New parents,S, F S, W

1 R← rankingSW(J ointF, J ointW, θ);

2 (F S, S, ix)←selectS(J ointF, J ointS, R); 3 W ← selectW(J ointW, F S, R, ix);

Figure 4.11: Illustration of the neighbourhood of candidate solutions and weight vectors.

(i) Line 1 applies function rankingSW to rank J ointF by each weighted scalarising function. Specifically, for each w ∈ J ointW, we first identify its neighbouring candidate solutions. The neighbourhood is measured by the angle between a candidate solution, s and a weight vector, w. If the angle is smaller than θ, then s

andware defined as neighbours. For example, in Figure4.11,s1 is a neighbour of

wwhile s2 is not. Then we rank these neighbouring candidate solutions based on their performance measured by the corresponding weighted Chebyshev scalarising function. This produces a [2N×2Nw] ranking matrix, denoted asR. Rij represents

the rank of the candidate solution si on the weighted Chebyshev function using

wj, i.e., gte(si|wj). The best solution is ranked 1. Ranking values for solutions

that are not neighbours of theware set as inf. The Chebyshev scalarising function is used in PICEA-w due to its guarantee of producing a Pareto optimal solution for each weight vector, and also its robustness on problem geometries (Miettinen, 1999).

(ii) Line 2 applies function selectS to select the best N candidate solutions as new parents,S. Specifically, the following steps are executed.

• Sort each row of the ranking matrix R in an ascending order, producing another matrix Rsort. Rsort holds in the first column the best ranking (the

smallest) value achieved by each candidate solution across all the weighted scalarising functions. Simultaneously, the second column holds the second smallest ranking result for each candidate solution and so on.

• All candidate solutions are lexicographically ordered based on Rsort. This

returns a rank (denoted as r) for each candidate solution. The fitness of a candidate solution is then determined by 2N −r. Truncation selection is applied to select the bestN solutions as new parents.

(iii) Line 3 applies function selectW to select a suitable weight vector from J ointW for each of the survived candidate solutions, i.e., members in the new parent S. The basic idea of the selection is to balance exploration and exploitation. To do so, two criteria are set for the weights selection. First, for each candidate solution the selected weight must be the one that ranks the candidate solution as the best. Then, if more than one weight is found by the first criterion, the one that is the furthest from the candidate solution is chosen. The first criterion is helpful to drive the search quickly towards the Pareto front. The second criterion is helpful to guide the search to explore new areas.

More specifically, for a candidate solutionsi, first we identify all the weights that

rank si the best. If there is only one weight, then this weight is selected for si.

This guarantees that solutionsi will not lose its advantage in the next generation

as there is still one weight vector that ranks this candidate solution the best, unless a better solution along this direction is generated. If so, the convergence performance is improved. If more than one weight is found, we select the weight that has the largest angle between si and itself. In this way, the algorithm is

guided to investigate some unexplored areas, i.e., improving diversity. It is worth mentioning that the second criterion is not considered unless multiple weights are identified by the first criterion. This guarantees that when exploring for a better diversity, the exploitation for a better convergence is also maintained. The pseudo- code of the functionselectW is described as follows.

• Line 1 initialises the new weight set as empty,∅. Line 2 forms a new ranking matrixR0 by selecting theix-th row ofR, whereixis the index ofF S in the J ointF.

• Line 3 ranks the contribution of weights to each of the candidate solutions according to R0. The results are stored in matrix R00. R00_ij describes the relative order of contribution that a candidate solution si received from a

FunctionselectW(J ointW, F S, R, ix)

Input: the objective valuesF S of the survived candidate solutions, the joint weight vectors,J ointW, the ranking matrix, R, the index list ofF S inJ ointF,ix Output: weight set, W

1 W ← ∅;

2 construct the ranking matrixR0 by extracting all the ixrows of R; 3 rank weights for each si according to R0 and obtain another matrixR00;

4 foreach F si∈F S do

5 J ← {j|j= arg min_j_∈{₁_,₂_,_···_,N_}R_ij00} ; 6 if |J|>1 then

7 k←arg max_j∈Jangle(wj,F si) ;

8 else

9 k←j;

10 end

11 W ←W ∪wk;

12 set the k-th column ofR00 as inf; 13 end

• Line 5 finds all the weight vectors that give the highest contribution to solution

si, i.e., weights that haveR00_ij = 1. If there is only one weight then we choose

this weight for si. If multiple weights are found, then we choose the weight

wk that has the largest angle with F si (lines 6 to 11). To avoid multiple

selections of a weight, once the weightwk is selected, thek-th column ofR00ij

is set as inf.

• Additionally, functionanglecomputes the angle between a candidate solution and a weight vector. The dot product of two normalised vectors returns the cosine of the angle between the two vectors.

To further explain the co-evolution procedure, let us consider a bi-objective minimisation instance shown in Figure4.12with four candidate solutions and four weight vectors, i.e. N =Nw = 2. In this example, it is assumed thatw1 has two neighbourss1 and s2;w2 has only one neighbours3;w3 has two neighbourss3 and s4;w4 has no neighbour. Table4.1 shows the selection process of candidate solutions. First, candidate solutions are ranked by each of the weighted Chebyshev functions. Then the fitness of each candidate solution is calculated according to the procedure of functionselectS. Based on the fitness, s1 (F its1 = 2) and s3 (F its3 = 3) are selected as new parent candidate

solutionsS.

Next we select the best weight for each candidate solution in theS. First, we randomly select one solution from S, e.g. s1. Then we identify the weights that contribute the most to s1, that is, s1 is ranked the best on these weights. There is only one weight i.e., w1, that contributes to s1 and therefore w1 is selected for s1. After this, another candidate solution is randomly selected from the set S\s1, e.g., s3. Similarly, we find

Figure 4.12:Illustration of the co-evolution procedure of PICEA-w with a bi-objective minimisation example.

Table 4.1: Candidate solutions selection process.

(a) Ranking matrixR

w1 w2 w3 w4

s1 1 inf inf inf

s2 2 inf inf inf

s3 inf 1 1 inf

s4 inf inf 2 inf

(b) Ranking matrixR1_,_r_and_{F it}

si w1 w2 w3 w4 r F itsi

s1 1 inf inf inf 2 2

s2 2 inf inf inf 3 1

s3 1 1 inf inf 1 3

s4 2 inf inf inf 3 1

Table 4.2: Weights selection process.

(a) Ranking matrixR0

w1 w2 w3 w4

s1 1 inf inf inf

s3 inf 1 1 inf

(b) Ranking matrixR00

w1 w2 w3 w4

s1 1 2 2 2 w1

s3 2 1 1 2 w3

the weights on whichs3performs the best. Bothw2 andw3 satisfy the condition. Then we look at the second criterion. It is found that the angle between s3 and w3 is larger than that between s3 and w2 and so w3 is selected for s3. This procedure continues until each candidate solution in theS is assigned a weight vector.

Additionally, in PICEA-w the number of weight vectorsNw is not required to be equal

to the number of candidate solutions N. However, since in each generation each of the survived candidate solution is assigned a different weight vector, it is required that 2Nw ≥N.

In document Preference-inspired Co-evolutionary Algorithms (Page 136-143)