CiteSeerX — Running performance Metrics for evolutionary multi-objective optimization

(1)

Evolutionary Multi-Obje tive Optimization

Kalyanmoy Deband Sa hin Jain

Kanpur Geneti AlgorithmsLaboratory (KanGAL)

Indian Institute of Te hnology Kanpur

Kanpur, PIN 208016,India

fdeb,jsa hingiitk.a .in

KanGAL Report No. 2002004

Abstra t

It isnowwell establishedthatmorethan oneperforman emetri sare ne essaryforevalu-

atingamulti-obje tiveevolutionaryalgorithm(MOEA).Althoughthereexistanumberofper-

forman emetri sintheMOEAliterature,mostofthemareappliedtothenalnon-dominated

set obtainedby anMOEA to evaluate its performan e. In thispaper, wesuggest a oupleof

running metri s{oneformeasuringthe onvergen etoareferen esetandotherformeasuring

thediversityinpopulationmembersateverygenerationofanMOEArun. Eitherusingaknown

Pareto-optimalfrontoranagglomerationofgeneration-wisepopulations,thesuggestedmetri s

revealimportantinsightsandinterestingdynami softheworkingofanMOEAorhelpprovide

a omparativeevaluationoftwoormoreMOEAs.

1 Introdu tion

Over the past ouple of years or so, more emphasis in the area of multi-obje tive evolutionary

algorithms (MOEAs) has been laid on the development of performan e metri s. The primary

reasonforits lukewarm interestinthepastisthatunlikeinthesingle-obje tive EAstudies,where

theperforman emetri isdire tlyrelatedtotheobje tivefun tion(bothbeings alarquantities),in

multi-obje tiveoptimizationtheperforman emetri mustassessanumberofsolutionsea hhaving

a ve tor of obje tive values. This immediately demands the need of more than one performan e

metri s. A re ent study (Zitzler et al., 2002) has shown that for an M-obje tive optimization

problem,atleastM performan emetri smustbeused. Inthe ontextofsingle-obje tiveEAs,the

re ordof a runningperforman emetri with generationprovideda plethora of informationabout

theworkingoftheEAontheproblemathand. Itisnotsurprisingthatifappropriateperforman e

measures of MOEA populations are also re orded and analyzed, salient information about their

working would beimminent. Inthispaper, we dis ussand suggestpotential runningperforman e

metri sformulti-obje tiveoptimization.

Althoughanumberofdierentperforman emetri shave beensuggested,manyareonlyappli-

abletotwo-obje tiveproblemsandmostimportantlyitisnotobviouswhi hoftheseperforman e

metri sone mayuseinpra ti e. Knowlesand Corne(2002) omparedmostofthese metri sbased

ontheir ompatibilitywithoutperforman erelationsbetweentwosetsofsolutionsandtheirmono-

toni ityproperties. Themetri sare lassiedbasedonstrong, weakand ompleteoutperforman e

(2)

tentofoutperforman e,insteadofjustdeterminingwhetheroneoutperformestheotherornot. The

study also suggested the use of any of the three R-metri s introdu ed in Hansen and Jaszkiewi z

(1998). TheR-metri smeasuretheproportionofo asionsoneapproximationsetwillbepreferred

byapre-dened setof utilityfun tions. Althoughsu h ametri is apragmati one,they maynot

be omputationally eÆ ient to beused asa runningperforman emetri .

Itisintuitivethattheuseofasetofmetri slessthanthenumberofobje tiveswouldmeanalose

of a dimensionand wouldimmediately make theapproa h theoreti allyina urate. However, one

ofthewaystoover omethedimensionalityproblempra ti allyistouseafun tionally independent

setofvariables(Goldberg,1993). Byrealizingthattherearetwomainfun tionalgoalsofMOEAs,

eorts an be made in devising two metri s: (i) one for measuring the onvergen e of solutions

to the Pareto-optimal front and (ii) the other formeasuring thediversityof solutions. Su h a set

of two metri s will enable two or more solution sets to be ompared among ea h other in terms

of their fun tional a hievements. Although the rst metri may simply be devised based on the

distan efrom areferen eset, these ondmetri isnotthatstraightforwardto explain,parti ularly

inthe ase of largenumberof obje tives.

In theremainderofthispaper,webrie y reviewtheexistingperforman emetri s. Thereafter,

we suggest a metri ea h forevaluating two fun tionalities des ribed above. Finally, the need of

using running metri s in MOEAs is amply demonstrated by applying them on a number of two

and three-obje tive problems.

2 Existing Performan e Metri s

Veldhuizen(1999),inhisdissertation,reportedanumberofperforman emetri sformulti-obje tive

optimization. Later, the rst author, in his book (Deb, 2001), lassied existing performan e

metri s into three lasses: (i) metri s for onvergen e, (ii) metri s for diversity estimation, and

(iii) metri s for both onvergen e and diversity. Although advantages and disadvantages of ea h

metri werequalitativelymentioned,Knowlesand Corne(2002)analyzedmostofthose metri son

thebasisof extentof outperforman erelations betweentwosets of non-dominatedsolutions. This

studysuggested theuseofanyofthethree performan emetri sofHansenandJaszkiewi z(1998).

We des ribethem brie y here.

The essentialideaisthat foranytwo approximatesets(Aand B),these metri susea number

of utility fun tions and determine the expe ted number of o asions the solutions of one set is

better thantheother. Thesemetri s,in otherwords,de lare thatset asthewinnerwhi hwillbe

the hoi e ofmost de ision-makers. Forexample,the metri R 1isdenedas follows:

R 1(A;B;U;p)= Z

u2U

C(A;B;u)p(u)du; (1)

where U is the set of all utility fun tions onsidered, p(u) is the frequen y of o urren e of the

utilityfun tionu,and the out omefun tionC is denedasfollows:

C(A;B;u)= 8

>

<

>

:

1; ifu

(A)>u

(B),

0:5; ifu

(A)=u

(B),

0; ifu

(A)<u

(B).

(2)

(3)

Here,u (A)is themaximumvalueof theutilityfun tionuon approximationset Aandis denoted

by u

(A) = max

z2A

fu(z)g (z is an obje tive ve tor from A). It is lear from equation 1 that

the R 1-metri measures the expe ted proportion of o asions the set A is better than set B.

The developers suggested to de lare A to be the winner if R 1(A;B;U;p)> 0:5 and B to be the

winner if R 1(A;B;U;p) < 0:5. If R 1(A;B;U;p) = 0:5, superiority of either A or B annot be

on luded from the study. However, the diÆ ulties with this metri are the requirement of a

numberofutilityfun tions,theirprobabilitiesofo urren e,andanumeri alintegrationte hnique

toa tuallyevaluatethemetri . Toavoidthenon-asso iativityprobleminvolvedin omparingmore

than two approximation sets, developers have also suggested omparing ea h set A with respe t

to a referen e set R , as follows: R 1

R

(A;U;p) = R 1(A;R ;U;p). In order to redu e the bias from

the hosen referen e set R , developers suggested using more than one referen e sets to on lude

thesuperiorityof one set overthe other. To quantifytheextent of superiorityof one set overthe

other,developershave suggestedtwomore metri s:

R 2(A;B;U;p) = Z

u2U (u

(A) u

(B))p(u)du; (3)

R 3(A;B;U;p) = Z

u2U u

(A) u

(B)

u

(A)

p(u)du: (4)

TheR 2-metri measurestheexpe teddegreeofsuperiorityandthetheR 3-metri measurestheex-

pe tedproportionofsuperiority. Thesetwometri s analsobemodiedtobeusedwithareferen e

set R , but need to be used as R 2

R

(A;U;p) = R 2(R ;A;U;p) and R 3

R

(A;U;p) = R 3(R ;A;U;p),

respe tively. As pointedoutearlier, these metri valueslargely dependonthe hosen utilityfun -

tions. Fortwo-obje tiveproblemswitha onvexPareto-optimalregion,thefollowingutilityfun tion

u

k

2U (where k 2[0;1℄) an be hosen:

u

k

=kf

1

+(1 k)f

2

: (5)

In general,however, weightedT heby hemetri s an be usedasthe familyofutilityfun tions.

For measuring diversity and onvergen e of obtained solutions, Zitzler's hyper-volume metri

(also known as the S-metri ) an also be used. The metri omputes the hyper-volume of the

obje tive spa e dominated by an approximation set. Although a set with a good diversity of

solutionswouldmeanalargerhyper-volumemetri value(hen ebetter),theS-metri valuedepends

onthe hosenreferen epoint usedforthehyper-volume al ulationand demandsnormalizationof

theobje tivesbefore omputingthemetri .

Alltheseauthorsseemtohavemadeonepoint lear: the omparisonoftwonon-dominatedsetof

solutionsisnotastraightforwardmatter,be auseofthedimensionalityinvolvedinthesets. Itisalso

generallyagreedthatoneperforman emetri isnotenoughtojudgetheperforman eofanMOEA.

Re ently, Zitzleretal. (2002) have given a formal proof stating thatforan M-obje tive problem,

at least M performan e metri s are needed to ompare two or more set of solutions. Intuitively

thismakes sense, as otherwise thiswould suggest an ina uratejudgment made witha redu tion

in dimensionality. However, we argue that although mathemati allyin orre t, it may be possible

to ompare two or more approximation sets fun tionally, as is often followed in understanding

behaviorsof omplexsystems. Inmulti-obje tiveoptimization,therearetwoprimaryfun tionalities

that an MOEA must a hieve: (i) approa hing the Pareto-optimal front and (ii)maintenan e of a

(4)

fun tionalitiesofmulti-obje tiveoptimization,itmaybepossibletodenetwoperforman emetri s,

one formeasuringea h fun tionalityex lusively, even forM >2 obje tives.

2.1 Need for Running Metri s

InmostMOEA studies,in ludingthose involving performan emetri s,two ormore MOEAshave

beenusually ompared purelyon thebasis of what havebeenobtained at theendof a simulation

run. Either a arefully updated ar hive or an EA population at the end of a simulation run

is evaluated for this purpose. However, interesting information about how an MOEA arrives at

the nal population has pra ti ally never been dis ussed in an MOEA study. On the ontrary,

mostsingleobje tiveEAstudiesanalyzesu hageneration-wiseperforman emeasureshowinghow

the average or best tness or some other performan e metri is varying with generation. Su h

an information provides a lot of insights to the working of an algorithm and helps de ide whi h

problemsarediÆ ultoreasy foranEA.Themainreasonfortheirabsen eintheMOEAliterature

is the ardinality of the ne essary performan e metri s to properly evaluate an MOEA and the

omplexitiesinvolvedin omputingthem.

However, some re ent studies (Abbass, 2002; Farhang-Mehr and Azarm, 2002; Lu and Yen,

2002)havejustbegunto showgeneration-wise dynami sofperforman emetri s. Inthispaper,we

demonstrate theuse of two su h runningperforman emetri s forMOEAsbyapplyingthem on a

numberoftest problemsand dis usssome interestingout omes.

3 Suggested Running Metri s

Parti ularlyfordesigningrunningmetri s,thefollowing propertiesare worth onsidering:

1. The metri shouldtake a value betweenzero and one inan absolute sense. Sin e themetri

isto be ompared generation-wise,anabsolute s aling ofarunningmetri betweenzero and

one willallowto assess the hangeof themetri valuefrom one generationto another.

2. The target (or desired) metri value ( al ulated foran ideally onverged and diversied set

of points)must be known.

3. Themetri shouldprovideamonotoni in reaseorde reaseinitsvalue,asthepopulationgets

improved ordeteriorated slightly. Thiswill also help in evaluating theextent of superiority

of oneapproximation setwith another.

4. Themetri shouldbes alabletoanynumberofobje tives. Althoughthisisnotanabsolutely

ne essary property, but if followed, it will ertainly be onvenient forevaluating s alability

issues ofMOEAsinterms of numberof obje tives.

5. The metri may be omputationally inexpensive, although this is not a stringent ondition

to befollowed.

Sin etwoindependentmetri saretobe hosen,ea hformeasuringoneofthetwofun tionalities

of multi-obje tive optimization, ea h metri may preferably evaluate the orresponding fun tion-

ality only, ignoringthe other fun tionality. Sin e two fun tionalities an not be measuredby one

(5)

suggested metri s,su h astheD1

R

metri (average distan e of referen e pointsfrom theapproxi-

mationset)suggestedbyCzyzakandJaszkiewi z(1998)for onvergen emeasureortheS-measure

(usedinZitzler(1999))fordiversitymeasure annotbeadequatelyused. Be auseof omputational

expensesinvolved in omputingtheR -metri s, they maynotbe suitable andidatesto be usedas

runningmetri s. In thefollowing two subse tions,we proposetwo dierent metri s forevaluating

MOEAsbykeepinginmindtheabove properties.

3.1 Metri for Convergen e

We usea simple metri forevaluating onvergen e towards a referen e set. A target set of points

P

(orreferen e set) an beeithera setof Pareto-optimal points(ifknown) orthenon-dominated

set of points ina ombined poolof all generation-wise populations obtained from an MOEA run.

Thereafter, for a population P (t)

at ea h generation, we ompute the onvergen e metri in the

followingmanner:

Step 1 Identifythenon-dominated setF (t)

of P (t)

.

Step 2 From ea h point i inF (t)

, al ulate the smallestnormalized Eu lidean distan e to P

as

follows:

d

i

= jP

j

min

j=1 v

u

t M

X

k=1 f

k (i) f

k (j)

f max

k f

min

k

!

2

: (6)

Here, f max

k

and f min

k

are the maximum and the minimum fun tion values of k-th obje tive

fun tioninP

.

Step 3 Cal ulate the onvergen e metri by averaging the normalized distan e for all points in

F (t)

:

C(P (t)

)= P

jF (t)

j

i=1 d

i

jF (t)

j

: (7)

In order to keep the onvergen e metri within[0;1℄, on e the above metri valuesare al ulated

for all generations, we normalize the C(P (t)

) values by its maximum value (usually C(P (0)

)):

C(P (t)

)=C(P (t)

)=C(P (0)

).

3.2 Metri for Diversity

In termsof measuring thediversityof solutions,a re ent study(Farhang-Mehr and Azarm, 2002)

suggested an entropy-based te hnique. Ea h obtained solution is proje ted on a suitable hyper-

plane 1

. Thereafter,a (M 1)-dimensional normallydistributedentropyfun tion isassigned with

its mean being on ea h proje ted point and with a user-dened standard deviation. All su h

entropy fun tionsforallproje ted pointsareaddedtogether anda normalized entropyfun tionis

al ulated. Iftheproje ted pointsarewelldistributedon thehyper-planeand asuitablestandard

deviation of the entropy fun tion is hosen, the resulting normalized entropy fun tion will be a

atfun tion, thereby ausinga largevalue of theShannon'sentropy measure al ulatedusingthe

1

Aplanewithadire tionve torequaltotheunitve torofthelinejoiningtheidealpointandthepointwiththe

worstindividualobje tivevaluesofthereferen esetissuggestedintheoriginalstudy.

(6)

that points are rowded insome partsof theproje ted plane, the entropy measure willbe small,

thereby meaning a poor diversity among solutions. The idea of measuring diversity of solutions

usingasimulatedentropyfun tionismeaningful,buttheapproa hhasthefollowingshort omings:

1. The entropy measure largely depends on the hosen standard deviation, as the resulting

distributionbeingpeakyor atdependslargelyonthevarian eofthenormalentropyfun tion

used.

2. Sin ea ontinuousnormalentropyfun tionisused,ifaprobleminvolvesdis onne tedPareto-

optimalfronts, thismethodmaybeerroneous.

3. ForproblemswherethePareto-optimal front isadegenerated lesser-dimensional urve(su h

as DTLZ5 (Deb et al., 2001)), a sub-optimal front may produ e a higher entropy measure

than a set of Pareto-optimal points, thereby allo ating a large entropy value to an inferior

set.

4. A performan e metri measuring the diversity of solutions annot adequately measure the

onvergingabilityof an MOEA,and vi eversa.

The diversity metri suggested next is similar in on ept to the above metri , ex ept that it

attempts to alleviate most of the above diÆ ulties. Moreover, this and the onvergen e metri

suggestedearliertogether anbeusedto systemati allyevaluateMOEAsforboth onvergen eand

diversityproperties. Thesuggested metri is also omputationally fast.

The essential idea is that the obtained non-dominated points at ea h generation is proje ted

on a suitable hyper-plane, thereby losing a dimension of the points. The plane is divided into a

numberof smallgrids (or(M 1) dimensional boxes). Dependingon whether ea h grid ontains

an obtained non-dominated pointor not, a diversitymetri isdened. If allgrids arerepresented

with at least one point, the best possible (with respe t to the hosen numberof grids) diversity

measure is a hieved. If some gridsare notrepresented by a non-dominated point, thediversityis

poor. The parameters required from the user are the dire tion osine of the referen e plane, the

numberofgrids(G

i

)in ea h of (M 1) dimension,andthe target (orreferen e) set ofpointsP

.

Here isthepro edure:

Step 1 From P (t)

,determinetheset F (t)

whi h arenon-dominated to P

.

Step 2 Forea h grid indexedby(i;j;:::), al ulate followingtwo arrays:

H(i;j;:::) = (

1; ifthegrid hasa representative point inP

;

0; otherwise.

(8)

h(i;j;:::) = (

1; ifH(i;j;:::)=1and thegrid hasa representative point inF (t)

;

0; otherwise.

(9)

Step 3 Assignavaluem(h(i;j;:::))toea hgriddependingonitsanditsneighbor'sh(). Similarly,

al ulatem(H(i;j;:::))usingH()forreferen e points.

(7)

to thatforH():

D(P (t)

)= P

i;j;:::

H(i;j;:::)6=0

m(h(i;j;:::))

P

i;j;:::

H(i;j;:::)6=0

m(H(i;j;:::))

: (10)

In the simple ase, the value fun tion m() for a grid an be omputed by using its h() and two

neighboringh()dimension-wise. Withasetofthree onse utivebinaryh()values,thereareatotal

of 8possibilities. Anyvalue fun tionmaybe assignedbykeepinginmindthefollowing:

A 111isthebest distributionand a000 istheworst.

A 010or a 101meansa periodi pattern with agood spreadand may bevalued more than

a 110ora 011. For example,the above valuation willmake an approximation setwith 50%

overage of grids but having a wider spread (su h as 1010101010) better than another set

havingthesame overage butwitha smallerspread(su has1111100000).

A 110ora011 maybe valued more thana 001ora 100,be ause of more overed grids.

Basedonaboveobservations, thefollowingm()valuesforh()aresuggestedandusedinthisstudy:

h(:::j 1:::) h(:::j::: ) h(:::j+1:::) m(h(:::j::: ))

0 0 0 0.00

0 0 1 0.50

1 0 0 0.50

0 1 1 0.67

1 1 0 0.67

0 1 0 0.75

1 0 1 0.75

1 1 1 1.00

Identi al value are usedfor H(). In the urrent study,two ormore dimensional hyper-planesare

handledby al ulating the above metri dimension-wise,whereas a higher-dimensional version of

theabovevaluefun tion analso be arefullydesignedby onsideringamovingsetofhyper-boxes.

One su h onsideration foratwo-dimensionalset of9 boxes is shownbelow:

is better than

Obviously,withmorenumberofobje tives,thevaluefun tionwillbediÆ ulttodene. Itremains

as an interesting future study to ndif su h higher-dimensional hyper-boxes are really ne essary

ompared to thedimension-wise al ulationof theproposedone-dimensionalmetri .

As an illustration to the above al ulation pro edure, Figure 1 shows a set of target points

(marked aslled ir les)and a setof population points(marked as shaded and open boxes) fora

two-obje tive minimizationproblem. Thepointsmarkedwithshadedboxesarethenon-dominated

pointswithrespe tto thetarget pointsandareusedforthediversity al ulation (thisisStep1 of

thepro edure). Thef

2

=0 plane isusedas thereferen eplane here andthe ompleterangeof f

1

valuesaredividedinto G

1

=10 grids. In thenext step, forea h grid bothh() and H()valuesare

al ulated. Fortheboundarygrids(extremegrids andgrids (:::;j;:::) withH(:::;j 1;:::)=0

(8)

f

2

1

1 1 0 0 1 0 1 1 0 0

h() 1 1

0.67 0.67 0.50 0.75 0.75

0.50 0.67 0.50

1.00 0.50

m() = 6.50

1 1 1 1 1 1 1 1 1 1

H()

1.00 1.00 1.00 1.00 1.00

1.00 1.00 1.00

1.00 m() 1.00

1 1

= 10.00

Diversity Measure = 6.50/10.00 = 0.65

Figure 1: Thediversitymetri omputationis illustrated.

or H(:::;j+1;:::)=0), an imaginary neighboringgrid with a h() orH() value of one is always

assumed. Inthe gure,these gridsare shown indashed boxes. The h()valuesare one inthe rst

two grids, fth, seventh and eighth grids. Noti ethat althoughmore thanone point mayexist in

a grid,theH() orh() value is stillone forthat grid. Based on a movingwindow ontaining three

onse utivegrids, them() valuesare omputedinthegureand thediversitymetri is al ulated.

To avoid the boundary ee ts (the ee t of using the imaginary grids), we normalize the above

metri asfollows:

D(P (t)

)= P

i;j;:::

H(i;j;:::)6=0

m(h(i;j;:::)) P

i;j;:::

H(i;j;:::)6=0 m(0 )

P

i;j;:::

H(i;j;:::)6=0

m(H(i;j;:::)) P

i;j;:::

H(i;j;:::)6=0 m(0 )

;

where0 isa zero-valuedarray. A arefulthought willrevealthattheH(i;j;:::)6=0 onsideration

in omputingtheD(P (t)

)termandtheboundarygridadjustmentsuggested above allowageneri

wayto handleproblemswithdis onne tedPareto-optimal fronts. The metri doesnotin ludethe

value fun tion for a grid on whi h there exists no referen e solution. We have in luded one su h

test problem(ZDT3)inoursimulationstudies.

IfthePareto-optimalfrontisnotknown(parti ularlytoreal-worldproblems),thetargetsetmay

bedetermined inthefollowingway. First,an MOEAis runforT generations and thegeneration-

wisepopulations(P (t)

,t=0;1;:::;T) arestored. Thereafter,thenon-dominatedmembersF (t)

of

ea h population are ombined together and the target set is dened as the non-dominated set of

the ombinedpopulation: P

=Non-dominated([

T

t=0 F

(t)

).

(9)

Convergence

Diversity

0 0.2 0.4 0.6 0.8 1

0 10 20 30 40 50 60 70 80 90 100

Metrics

Generation Number ZDT1

Figure 2: Two runningmetri sforZDT1 usinga setof Pareto-optimal points.

4 Simulation Results

Inthisse tion,we applytheabove metri sonsolutionsobtainedusingNSGA-II(Debetal.,2000)

onanumberoftestproblemssuggestedintheliteratureandonareal-worldtwo-obje tive gearbox

designproblem. In ea h ase, the metri sare appliedon a typi al runof NSGA-II. However, it is

advisedto applythemetri sonmultiplerunsofanMOEAandto reportaverage metri values. In

all ases with NSGA-II, we have used theSBX re ombination operator(Deb, 2001) withp

=1:0

and

=10,and thepolynomialmutationoperatorwith p

m

=1=nand

m

=20.

First four test problems are two-obje tive test problems hosen from Zitzler, Deb and Thiele

(2000). Sin e they are test problems and the exa t lo ationof the Pareto-optimal front is known

in ea h ase, we apply the suggested metri s using a set of 200 uniformly set of points on the

Pareto-optimal front (intheobje tive spa e).

Figure 2 shows the two metri s on simulation results obtained usingNSGA-II using N = 100

populationsizerunningtill100generations. Inthisandotherproblemsofthisstudy,wesetnumber

of gridsinea h dimension asG

i

=N

1=(M 1)

. Thus,fortwo-obje tive problems, we have usedthe

number of grids equal to the population size, as if ea h population member is expe ted on ea h

grid. Wealsousef

2

=0planetoproje tthepoints,unlessotherwisestated. Thegureshowsthat

the onvergen e metri qui kly moves to zero, thereby implying that NSGA-II solutions starting

from a random set of solutions qui kly approa h the Pareto-optimal front. A value of zero of

the onvergen e metri impliesthat allnon-dominated solutions mat h the hosen Pareto-optimal

points. Afterabout30generations,theNSGA-IIpopulation omesvery losetothePareto-optimal

front. Similarly,thediversitymetri shows thattillabouttherst26generations,nosolutionnon-

dominatedtothePareto-optimalset wasfound. Butfrom thisgenerationonwards,NSGA-IInds

more andmore pointsnon-dominatedwiththe hosen Pareto-optimal points. The hoi e ofpoints

aresu hthatthediversitymetri in reasesexponentiallytillabout70generations, afterwhi hthe

diversityremains more or less the same. Although the obtained pointsare very lose the hosen

Pareto-optimalpoints, diversitymetri os illatesnear astablevalue,a matterwhi his inherentto

(10)

Convergence

Diversity

0 0.2 0.4 0.6 0.8 1

0 10 20 30 40 50 60 70 80 90 100

Metrics

Generation Number ZDT2

Figure 3: Two runningmetri sforZDT2 usinga setof Pareto-optimal points.

NSGA-II and other MOEAs and wasalso dis ussed for NSGA-II and SPEA2 in an earlier study

(Laumanns etal.,2001).

It isinterestingto notethatthediversitymeasuresuggested hereusesonlysolutionswhi hare

non-dominated with thereferen e set. This maymean to suggest that onvergen e properties are

somehow in-built into thisdiversitymeasure, but thisis not quite true. Noti e how the diversity

measure is onsistentlyequal to zero tillabout 26 generations, untilwhi h time no solutionsnon-

dominatedto the referen e set is found. On the other hand, the onvergen e metri duringthese

early generations shows how the population has been onsistently approa hing the referen e set.

Thus,a ombination of thesetwo metri s providea lear insight into theworkingof an MOEA.

Next, we hoosethetest problemZDT2. Thisproblemhasanon- onvex Pareto-optimal front.

Figure3 showsthetwo runningmetri swiththesameGA parametersasbefore. The onvergen e

metri showsinabout30generations,NSGA-IIpopulationmovesvery losetothePareto-optimal

front. Itisalso interestingtonotethatNSGA-IItakesabout10moregenerationstondasolution

non-dominated with thereferen e set in ZDT2 than thatrequired inZDT1. On e some solutions

very losetothePareto-optimalfrontarefound,moreandmorepointsnearthefrontaredis overed

and the diversityamong solutionsin reases rapidly,indi atingthat NSGA-II nds solutions lose

to thePareto-optimal front witha good diversityamong them.

The test problemZDT3 hasa disjointedset of Pareto-optimal fronts. NSGA-II with identi al

parameter setting are applied to thisproblemfor 100 generations and the orresponding running

metri s are shown in Figure 4. The gure shows a similar behavior as that in ZDT1. In this

problem, we have also omputed thediversitymetri byproje tingthe pointson an in linedline,

as suggested in the footnote in se tion 3.2. The orresponding diversity metri is plotted with a

solidline. Thegureshowsthatalthoughthereissomedieren einmagnitudeofthetwodiversity

metri omputations(proje tedonf

2

=0planeandproje tedonthein linedline),theirbehaviors

arevery similar.

Next, we hoose the test problemZDT6, whi h has a non-uniform density of solutions along

thePareto-optimal front. This time, theNSGA-II isrunfor300 generations. Figure5 shows that

(11)

Diversity (projected) Diversity

(on f_1)

Convergence

0 0.2 0.4 0.6 0.8 1

0 10 20 30 40 50 60 70 80 90 100

Metrics

Generation Number ZDT3

Figure 4: Two runningmetri s forZDT3 usinga set of Pareto-optimal points al ulated usingf

1

axisand proje tedon an in linedplane.

Convergence

Diversity 0

0.2 0.4 0.6 0.8 1

0 50 100 150 200 250 300

Metrics

Generation Number ZDT6

Figure 5: Tworunningmetri s forZDT6 using

a setof Pareto-optimal points.

Convergence

Diversity

0 0.2 0.4 0.6 0.8 1

0 50 100 150 200 250 300

Metrics

Generation Number ZDT6

Figure 6: Two running metri s for ZDT6 us-

ing an agglomeration of generation-wise popu-

lations.

thepopulationsteadilyapproa hesthePareto-optimalfront,butalwaysmaintainsanitedistan e

from the true Pareto-optimal front. Sin e almost all obtained solutions are not non-dominated

withthe hosen Pareto-optimal points,thediversitymetri is al ulatedwithonlyafewsolutions,

thereby having a very smallvalue. For problemswhi hare diÆ ultto solve to Pareto-optimality,

wemayuseanagglomerationofpopulations(dis ussedinthelastparagraphofse tion3.2),instead

of Pareto-optimal points,to ompute theabove metri s. Figure6 showsthat NSGA-IIpopulation

rea hes its nal non-dominated front after around 165 generations. After this generation, the

populationqui kly onverges to thenalnon-dominatedfront andthepopulationdiversityresorts

into a noisy steady-state value. This study shows that when it is diÆ ult to obtainsolutions on

(12)

Diversity

Convergence 0

0.2 0.4 0.6 0.8 1

0 50 100 150 200 250 300 350 400 450 500

Metrics

Generation Number Gearbox Design Problem

Figure 7: Two runningmetri sshowing the onvergen eand diversityinsolutionsof NSGA-II.

the true Pareto-optimal front, it is better to use the population-agglomeration te hnique. This

te hnique an alsobe usedfor omparingtwo ormore MOEAs.

Next, we use the population-agglomeration te hnique to evaluate the performan e of NSGA-

II to a two-obje tive gearbox design problem dis ussed elsewhere (Deb and Jain, 2002). In this

problem,thePareto-optimalfrontisnotknown. With100gridpointsonf

1

,themetri sareplotted

inFigure7forNSGA-IIrunusing100populationmembers. Thegureshowsthatthe onvergen e

near the obtained non-dominated front is fast and the rst point near this front was found after

about65generations. Itisinterestingtonotethatatthenalgenerationthediversitymetri value

is less thanone. This meansthat NSGA-II does not ontain all 100 solutions used as a referen e

set at theendofthesimulationrun. Some better distributedsolutionsarelostduringtherun.

The naltwotest problemsarethree-obje tivetest problemsborrowed fromDebetal. (2001).

TheproblemDTLZ2hasaspheri alPareto-optimalsurfa e. Here,weuse100populationmembers

and NSGA-II and SPEA2 (Zitzler, Laumanns,and Thiele, 2001) arerun for300 generations. For

omputingthemetri s,wehave hosen10 10or100gridsintheinterval[0;1℄off

1 andf

2

axes. The

f

3

=0planeis hosentoproje tthepoints. Figure8showsthe orrespondingmetri s,astheyvary

withthe generationnumber. The onvergen e metri valuesforbothMOEAssuggestthat notall

non-dominatedpointseven atthenalgenerationhave rea hed losetothe hosenPareto-optimal

points. The diversity metri values show that both NSGA-II and SPEA2 have found a solution

non-dominated to the hosen Pareto-optimal pointsat around 20th generation. Thereafter, more

and more pointswith in reasing diversityamong them have beenfound. Thereafter, like in ZDT

problems, the populationmaintains a parti ular levelof diversitywith some variations. However,

thediversityofsolutionsobtainedbySPEA2ismu hbetter thanthatofNSGA-IIinthisproblem.

Inorder to investigate thediversityobtainedinea h ase, we haveplotted thenalobtainednon-

dominated solutions in Figures 9 and 10 for NSGA-II and SPEA2, respe tively. It is lear from

these two plots that SPEA2has obtaineda better spread thanNSGA-II. The lustering operator

of SPEA2 is more omputationally expensive than the rowding operator used in NSGA-II, but

SPEA2isbetterabletodistributepoints. Figure8revealsthelatterfa tquantitativelybyshowing

(13)

Diversity (NSGA−II)

Convergence (SPEA2)

Diversity (SPEA2)

Convergence (NSGA−II)

0 0.2 0.4 0.6 0.8 1

0 50 100 150 200 250 300

Metrics

Generation Number DTLZ2 (Three Objectives)

Figure 8: Two runningmetri s for DTLZ2using a set of Pareto-optimal points for NSGA-II and

SPEA2.

0 0.25 0.5 0.75 1 1.25

f1 0

0.25 0.5

0.75 1

1.25 f2

0 0.25 0.5 0.75 1

f3

Figure9: Non-dominatedpointsobtainedusing

NSGA-II.

0 0.25 0.5 0.75 1 1.25

f1 0

0.25 0.5

0.75 1

1.25 f2

0 0.25 0.5 0.75 1

f3

Figure 10: Non-dominatedpoints obtained us-

ingSPEA2.

SPEA2having abetter diversitymetri value thanNSGA-II notonlyat thenal generation, but

in the entire run of the two algorithms; however the onvergen e levels of both algorithms are

more or less thesame. The u tuations inthe diversity metri bybothmethods arisedue to the

dynami loss and gain of riti al points on the non-dominated solutions, on e they ome loseto

the referen e set. In any ase, thisstudyshows how thetwo suggested metri s an bring outthe

dieren es intheworking of twoMOEAsquantitativelyover theentire runofthealgorithms.

Our next problemisDTLZ5, whi hhas aPareto-optimal urve,instead ofathree-dimensional

Pareto-optimal surfa e. Here, we run NSGA-II for 200 generations, whereas all other NSGA-II

parametersand parametersfor omputingthemetri sarethesameasinDTLZ2. Figure11shows

the onvergen eanddiversitymetri sofasinglerunofNSGA-II.The onvergen emetri depi tsa

good onvergen eofNSGA-IIonthePareto-optimal front. Sin ethenumberofgridsinvolvingthe

(14)

Diversity (10X10)

Diversity (50X50)

Convergence 0

0.2 0.4 0.6 0.8 1

0 20 40 60 80 100 120 140 160 180 200

Metrics

Generation Number DLTZ5

Figure 11: Two runningmetri sforDTLZ5usinga setof Pareto-optimal points.

Pareto-optimalfrontisfewinthisproblem omparedto thetotal numberofgridsbeing onsidered

inthe diversity omputation(due to the redu eddimensionalityof thePareto-optimal front), the

diversitymetri hangesinnitesteps. Theattainmentofadiversitymetri valueofunitysuggests

theperfe tdiversitybeinga hievedwith10gridsinea hobje tive. When50gridsonea hobje tive

(f

1 and f

2

) axis are hosen, thePareto-optimal front is represented by more grids. The diversity

metri forthis aseisalsoshowninthegure. Ahyper-planewithmoregridsrequiremoreobtained

solutions to attain a good diversity measure. It is not surprisingthat the diversity metri values

degrade inthis ase from before. Thisillustrates thesensitivityof the suggested diversitymetri

on the hosennumberofgrids.

5 Dis ussions and Future Studies

Theneedofdevelopinggoodrunningmetri sandtheiruseinMOEAstudiesisnow learfromthe

above omputersimulations. Althoughtwometri sareusedinthisstudy,moremetri smeasuring

furtherdetailedpropertiesofanapproximationset analsobeusedand areamatterofimmediate

resear hinterests.

Although the suggested template-based diversity metri is a way to measure the extent of

diversity in an approximation set, there are at least a ouple of diÆ ulties with this metri : (i)

the metri value depends on the hosen template (the value fun tionm()) and (ii) as mentioned

earlier it is diÆ ult to assign a value fun tion for higher dimensions, although a dimension-wise

appli ationhasreasonablyworkedwellinthispaper. However, theoverallstrategyofproje tingan

approximationsetto asuitablegrid-edhyper-planeandthenapplyingadiversitymetri remainsa

goodapproa hforndingthediversityofanapproximationset. Insteadofthesuggestedtemplate-

based diversitymetri , thefollowingmetri s an also beused:

1. A grid- ount diversitymetri ounting the numberof o upied gridsin the proje ted plane

an simply be used. To normalizethe metri ,the ount an be dividedbythe total number

(15)

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20 30 40 50 60 70 80 90 100

Different Diversity Metrics

Generation Number D2 D3

D1 ZDT1

Figure 12: Three diversity running metri s on

ZDT1 are shown. D1 is the metri dened in

se tion3.2, D2isthegrid- ount metri andD3

isthe varian e-metri V.

f_1

Generation 49 (7 grids) Generation 50 (11 grids) Generation 51 (10 grids)

0 0.2 0.4 0.6 0.8 1

Figure 13: The grids o upied by NSGA-II at

generations49,50,and51areshownforZDT1.

of o upied grids obtained using the referen e set. Although this metri does not tell the

sparseness of o upied grids, the sheer number of them may be an important fa tor for

omparing two or more MOEAs. Figure 12 plots this normalized grid- ount metri (shown

as D2) on thepopulations obtainedusing NSGA-II inproblem ZDT1. The plot shows that

NSGA-II solutionsshare about70%gridswiththe referen eset at theendof therun.

2. A varian e in inter-member distan es an be another measure of diversityof solutions on a

proje ted plane. The al ulation pro edure is as follows. For a population P (t)

, rst the

non-dominated solutions F (t)

(with respe t to P

) are found and are proje ted to a hosen

hyper-plane. Ea h o upied grid on the hyper-plane is represented by its enter point ~x (i)

.

First, the entroid~g of all o upied gridsis al ulated. Thereafter, the ve tor ~v (i)

from the

grid~x (i)

to the entroid~g is omputed. Then,a diversitymetri V is al ulated asfollows:

V(P (t)

)= M 1

X

j=1 v

u

t jF

(t)

j

X

i=1

v (i)

j

2

: (11)

For ea h dimension j, the term inside the square-root is the sum of the square of the pro-

je tions of all solutions from the entroid along the j-th dimension. Thus, if the solutions

are sparsely pla ed, the above metri willprodu e a large value. Sin e the proje tions are

squared andadded, thepositive and negative proje tionsdo not an elea h other.

In order to normalize themetri , the above value an be dividedbythe metri value of the

referen e set: V =V(P (t)

)=V(P

). Figure 12 also shows aplot of thismetri (shown asD3)

on ZDT1. The omparison of these three plots shows the similarityin their variations with

(16)

intunewiththeundulationsinthenumberofo upiedgrids(D2). An interestingdieren e

amongthesethree metri sis learduringthetransitionofthepopulationfromgeneration49

through51:

Generation Numberof Diversity Metri s

number o upiedgrids D1 D2 D3

49 7 8.754/100 7/100 0.192/2.887

50 11 14.899/100 11/100 0.405/2.887

51 10 13.468/100 10/100 0.891/2.887

Figure 13 also shows the o upied grids at generations 49, 50, and 51. A total of 100 grids

areassumed forf

1

2[0;1℄. The grid- ount (D2) metri forthese generationsdoesnotreveal

the fa t that a better diversity of solutions is obtained in generation51, although thegrid-

ount is less in generation 51 ompared to that in generation 50. Be ause of the use of

the three-grid template inD1, thedieren e in diversity obtained from generation 50 to 51

is not apparent by this metri either. But the D3 metri shows a big jump in this metri

value during this transition, meaning that a set with a mu h better diversity among them

is obtained. However, the real dieren e among these metri s may be apparent in solving

higher-dimensional or more omplex problems and the hoi e of one metri over the other

mayalso dependon the omputational omplexityasso iatedwithea h metri .

3. Theentropy-basedmetri suggested byFarhang-Mehrand Azarm(2002) an be modiedto

handledis onne tedPareto-optimal sets. The holes ordis ontinuitiesintheproje ted plane

anbeeliminatedandallindividualregionsofPareto-optimalsets anbepla ednextto ea h

other,sothatinthemodiedplane,there isnoholeordis ontinuity. Althoughthe hoi e of

thestandarddeviation ofthe normaldistributionstillremainsasan importantfa tor, other

distributionsmayalso betried.

Adetailstudyapplyingthesemetri stothe hosentestproblemsand omparingthemwithexisting

metri ssu hasthe R -metri sortheS-metri also remainsanimportant futuretask.

6 Con lusions

In order to investigate the generation-wise dynami s of an MOEA, two metri s are suggested in

this paper. The rst metri is a distan e measure of a population from a referen e set and is

used to measure the onvergen e ability of an MOEA. The se ond metri uses a lo al template

based evaluation te hnique to estimate the diversityof one set omparedto a referen e set. Both

metri s are normalized so as to have their values bounded between zero and one. On six test

problems and one real-world engineering design problem, the appli ation of these two metri s

have showninterestingpropertiesof NSGA-II. The non-dominatedsolutions ontinually approa h

the referen e set in all ases, meaning thereby that the domination-based sele tion operator of

NSGA-II is apableof bringingthepopulationnear thereferen e setwithoutgetting stu kin any

intermediatelevel. On esomepointsnearthereferen esetaredis overed,thesear hoperatorsand

thediversity-preservingme hanismofNSGA-IIareabletondmoreandmoresolutionswhi hare

(17)

test problem, NSGA-IIand SPEA2 populations are ompared generation-wise,revealing superior

performan eofSPEA2 inmaintaininga betterdiversityamong non-dominatedsolutions.

Forproblemswith unknownPareto-optimal front, thisstudyhas alsosuggested away to on-

stru t a referen e set based on an agglomeration of generation-wise non-dominated populations.

Su ha te hnique an alsobeusedto omputeother performan emetri srequiringareferen eset.

Itisalsoimportanttohighlightthatthe omputational omplexityofMOEAsisanotherimpor-

tantmatterandmustbetakeninto onsiderationwhile omparingtwoormoreMOEAs. Neverthe-

less,theresultsofthisstudyhasshowntheimportan eofusingrunningmetri sformulti-obje tive

evolutionary omputation andre ommend more su h studiesinthe nearfuture.

A knowledgement

Authorsthank Mar o Laumannsforsharing theSPEA2resultson test problemDTLZ2.

Referen es

Abbass, H. A. (2002). The self-adaptive Pareto dierential evolution algorithm. Pro eedings of

the World Congress on Computational Intelligen e,pp. 831{836.

Czyzak, P.andJaszkiewi z, A.(1998). Paretosimulatedannealing{a metaheuristi formultiob-

je tive ombinatorialoptimization. Multi-Criteria De ision Analysis,7, 34{47.

Deb, K., Thiele, L., Laumanns, M., and Zitzler, E. (2001). S alable test problems for evolu-

tionary multi-obje tive optimization. TIK ReportNo. 112. Zuri h, Switzerland: Computer

Engineeringand NetworksLab(TIK),SwissFederalInstituteof Te hnology (ETH).

Deb, K. (2001). Multi-obje tive optimization using evolutionary algorithms. Chi hester, UK:

Wiley.

Deb, K.,Agarwal, S.,Pratap,A., and Meyarivan,T.(2000). A fastelitistnon-dominated sorting

geneti algorithm for multi-obje tive optimization.: NSGA-II. Pro eedings of the Parallel

Problem Solving from Nature, pp. 849{858. Springer.

Deb, K.and Jain,S.(2002). Multi-speedgearboxdesignusingmulti-obje tiveevolutionaryalgo-

rithms. KanGAL ReportNo. 2002001. Kanpur,India: Department ofMe hani al Engineer-

ing, IITKanpur.

Farhang-Mehr, A. and Azarm, S. (2002). Diversity Assessment of Pareto-optimal solution sets:

Anentropyapproa h. Pro eedingsof the World Congress on Computational Intelligen e,pp.

723{728.

Goldberg,D.E.(1993). Makinggeneti algorithms y: A lessonfromWrightbrothers. Advan ed

Te hnology for Developers, 2,1{8.

(18)

dominated set. Te hni al Report IMM-REP-1998-7. Institute of Mathemati al Modeling,

Te hni al Universityof Denmark.

Knowles,J.and Corne, D.(2002). Onmetri s for omputingnon-dominated sets. Pro eedings of

the World Congress on Computational Intelligen e,pp. 711{716.

Laumanns, M., Thiele, L., Deb, K., and Zitzler, E. (2001). On the onvergen e and diversity

preservation properties of multi-obje tive evolutionary algorithms. Te hni al Report 108.

Zuri h,Switzerland: ComputerEngineeringandNetworksLab(TIK),SwissFederalInstitute

of Te hnology(ETH).

Lu, H,and Yen,G. (2002). Rank-densitybased multiobje tivegeneti sear h. Pro eedings of the

World Congress on Computational Intelligen e, pp. 944{949.

Van Veldhuizen, D. A. (1999). Multiobje tive evolutionary algorithms: Classi ations, analyses,

andnewinnovations. Do toraldissertation. Dayton,Ohio: AirFor eInstituteofTe hnology.

Zitzler, E. (1999). Evolutionary algorithms for multiobje tive optimization: Methods and appli-

ations. Do toral dissertation. Zuri h, Switzerland: Swiss Federal Institute of Te hnology

(ETH)(DissertationETH No. 13398).

Zitzler,E.,Deb,K.,andThiele,L.(2000). Comparisonofmultiobje tiveevolutionaryalgorithms:

Empiri alresults. Evolutionary Computation Journal, 8(2), 125{148.

Zitzler, E., Laumanns, M., Thiele, L., Fonse a, C., and Fonse a, V. G. (2002). Why quality

assessmentof multiobje tiveoptimizers isdiÆ ult? Unpublisheddo ument.

Zitzler,E.,Laumanns,M.,Thiele,L.(2001)SPEA2: ImprovingtheStrengthParetoEvolutionary

Algorithm. TIKReportNo. 103. Zuri h,Switzerland: ComputerEngineeringand Networks

Lab(TIK),SwissFederalInstituteof Te hnology (ETH).