/$ IEEE

(1)

A Pheromone-Rate-Based Analysis on the

Convergence Time of ACO Algorithm

Han Huang, Chun-Guo Wu, and Zhi-Feng Hao

Abstract—Ant colony optimization (ACO) has widely been ap-plied to solve combinatorial optimization problems in recent years. There are few studies, however, on its convergence time, which reflects how many iteration times ACO algorithms spend in con-verging to the optimal solution. Based on the absorbing Markov chain model, we analyze the ACO convergence time in this paper. First, we present a general result for the estimation of convergence time to reveal the relationship between convergence time and pheromone rate. This general result is then extended to a two-step analysis of the convergence time, which includes the following: 1) the iteration time that the pheromone rate spends on reaching the objective value and 2) the convergence time that is calculated with the objective pheromone rate in expectation. Furthermore, four brief ACO algorithms are investigated by using the proposed theoretical results as case studies. Finally, the conclusions of the case studies that the pheromone rate and its deviation determine the expected convergence time are numerically verified with the experiment results of four one-ant ACO algorithms and four ten-ant ACO algorithms.

Index Terms—Ant colony optimization (ACO), convergence time, nature-inspired cybernetic mechanisms, pheromone rate, runtime analysis.

I. INTRODUCTION

T

HE ANT colony optimization (ACO) algorithm is a nature-inspired cybernetic method in artificial intelli-gence. ACO [1] can be considered to be a cognitive informatics (CI) model of social animals like ants rather than the CI model of humans [2]. The idea of ACO comes from the ants’ behavior, which is different from traditional mathematics-based cybernetic techniques. The ACO algorithm does a surprisingly

Manuscript received November 30, 2007; revised April 27, 2008, December 14, 2008, and December 24, 2008. First published April 17, 2009; current version published July 17, 2009. This work was supported by the Na-tional Natural Science Foundation of China under Grants 60803052, 60873078, and 60673062.

H. Huang is with the School of Software Engineering, South China Uni-versity of Technology, Guangzhou 510006, China (e-mail: [email protected]; [email protected]).

C.-G. Wu is with the College of Computer Science and Technology, Jilin University, the Key Laboratory of Symbol Computation and Knowledge Engi-neering, Ministry of Education, Changchun 130012, China, and also with the National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China (e-mail: [email protected]).

Z.-F. Hao is with the School of Applied Mathematics, Guangdong University of Technology, Guangzhou 510006, China, and also with the College of Computer Science and Engineering, South China University of Technology, Guangzhou 510640, China (e-mail: [email protected]).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TSMCB.2009.2012867

successful performance in the solution of NP-hard problems, which draws more and more attention on ACO research, partic-ularly to the study of its theoretical foundation.

The ACO algorithm was first proposed by Dorigo and his colleagues to solve combinatorial optimization problems like the traveling salesman problem (TSP) [2], [3]. This algorithm is presented under the inspiration that an ant colony could build the shortest path from a food source to their nest by using some chemical substance called pheromone. Ants lay down pheromone trails when passing paths. The more ants choose a path, the more pheromone is laid down on it. The latter ants tend to choose the path with a higher pheromone intensity. However, it is very interesting that ants do not always choose the path with the highest pheromone intensity. Otherwise, the shortest path will hardly be built up.

Dorigo abstracted the ACO model from the biologic phe-nomenon and carried out the simulation of ant behavior to solve the TSP problems. Due to the advantage of this ant-based optimizing principle combined with pheromone evaporation to avoid early convergence to local optima, ACO has widely been used to tackle NP-hard combinatorial optimization problems rising from industry and engineering [4]–[7].

The tremendous computation power of ACO attracts in-creasingly more and more researchers to study its theoreti-cal foundations, including mathematitheoreti-cal model construction, convergence, and runtime analysis. Gutjahr proposed the convergence judgment for a special ACO algorithm named graph-based ant system [8]–[10]. Stützle and Dorigo presented a short convergence proof for a class of ACO algorithms [11], which is quite similar to the analysis of nonheuristic algorithms. Moreover, the convergence of ACO is classified into two types [5], [12]: 1) the convergence in value proposed by Dorigo, which states that at least one ant could reach the optimal solution when the iteration time tends to be infinite, and 2) the convergence in solution proposed by Dorigo, which states that the ACO algorithm can find the optimal solution with a probability of 1 when the iteration time tends to be infinite.

The analysis of convergence speed is considered [12] as the first open problem in ACO fields. For the convergence analysis, some researchers worked based on probability models, and others worked based on stochastic process models. Badr and Fahmy [13] used the random branching model to study the convergence of a specific ACO algorithm.

Most of the current convergence studies have discussed whether an ACO algorithm could reach the optimal solution when its iteration time tends to infinity but did not reveal how fast it converges. A brief review of the literature shows that

(2)

there are only a few investigations discussing the computational time complexity of ACO algorithms. These investigations are summarized as follows.

Neumann and Witt [14] presented a runtime analysis of a simple ACO algorithm and investigated the effect of the evap-oration factor. Hao and Huang [15] studied the complexity of a single-ant algorithm, which could converge in polynomial time. Recently, based on the earlier work of Hao and Huang [15], the authors of [16] proposed a general method for estimat-ing the ACO convergence time and, by usestimat-ing the proposed method, verified that the algorithm presented in [5] cannot converge in polynomial time (iteration) without the help of local searching.

However, the conclusions in [15] and [16] are hard to extend to other ACO algorithms because the algorithms concerned are simplified and too specific. The results in [16] need to be stud-ied in detail since they did not reveal the factors determining the time complexity of ACO algorithms.

Due to the aforementioned deficiencies of existing works, in this paper, two theoretical results followed by some case studies are presented with regard to analyzing how fast ACO algorithms converge based on an absorbing Markov chain. The rest are organized as follows. A brief review of runtime analysis in the research of nature-inspired cybernetic mecha-nisms and evolutionary computation is introduced in Section II. The framework of the ACO algorithm and its modeling are presented in Section III. A general result of estimating the convergence time is given in Section IV. The specific result is presented in Section V. In Section VI, four brief ACO algo-rithms are discussed as case studies. The numerical application of these case studies is given in Section VII. In the last section, the conclusions and remarks are presented.

II. BRIEFREVIEW OFRELATEDRESEARCH ON

RUNTIMEANALYSIS

Runtime analysis is a challenging topic in cybernetics. In this paper, the convergence time is the average iteration time that an ACO algorithm spends in converging to the optimal solution. It can reveal the runtime or time complexity of an ACO algorithm to explain how powerful the algorithm is. Compared with ACO, an evolutionary algorithm (EA) is the first runtime-analysis object of the nature-inspired algorithms in cybernetics. This section presents a brief review of the runtime analysis of EAs as the related research result.

The earliest runtime-analysis result may refer to Rudolph’s work on the time complexity of (1+1)EA for the One-Max problem [17]. Garnier et al. [18] proved that the runtime of (1 +1)EA with multi-bit-flip mutation is more than the one with single-bit-flip mutation. Drosteet al.[19], [20] improved these results by proving that the time complexity of (1+1)EA with c-bit-flip mutation is O(n log n). Afterward, EAs for simple linear functions and long path problems were verified to attain the optimal solution in polynomial runtime [21], [22]. However, the conclusion is only applicable to a (1 +1)EA, which is a single-individual version.

He and Yao first studied the average time complexity of multi-individual EAs like (N+N)EA [23]. Moreover,

He et al.[24] proposed a computational time analysis for de-ceptive problems. Recently, He and Yao have designed a math-ematical tool named drift analysis [23], [26] for estimating the average time complexity of EAs. They successfully used drift analysis to study the runtime of binary EAs for linear func-tion, the One-Max problem, and the node covering problem [25], [26]. Furthermore, their theoretical result is also extended to the problem difficulty measure on black-box optimization [27]. Zhou and He [28] applied the same theory to analyze the runtime of EAs for the constrained optimization problem. Although it is necessary to design a distance function before drift analysis is used, He and Yao did not propose the approach to make the function, which is the bottleneck of this runtime-analysis tool.

There are two other technical methods for the runtime analysis of EAs. Witt [29]–[31] studied the runtime of binary (N+ 1)EA with the family tree model. Neumann [32]–[34] also applied this model to analyze EAs for the Eulerian Cycle and Minimum Spanning Tree problems. Yu and Zhou [35] proposed an approach in estimating the first hitting time of EAs based on an absorbing Markov chain and used it to study the runtime of EAs for Subset Sum problems.

In brief, existing runtime-analysis studies mainly focus on binary EAs for simple optimization problems. There are few theoretical results of EAs for the TSP and other complex NP-hard problems. It is also the case in the research of the ACO algorithm, which is a nature-inspired method for solving the TSP and other combinatorial optimization problems. There-fore, the runtime analysis of ACO for the TSP problem [36] is a challenging and open problem [37], which this paper addresses.

III. FRAMEWORK OFACO ALGORITHM ANDITS

ABSORBINGMARKOVCHAINMODEL

This section will introduce the basic framework and mathe-matical model of ACO algorithms, which is referred to Dorigo’s definitions [5].

Definition 1: (S,Ω, f)is a combinatorial optimization prob-lem, whereS is the discrete solution space,Ωis the constraint set, f :S→R+ is the objective function, and R+ is the positive real domain.

Given an example of minimization optimization (S,Ω, f), the function of the ACO algorithm can be considered to find the optimal solution s∗ such that f(s∗)≤f(s) for ∀s∈S.

s={s₁, . . . , s_n},∀s_i∈C,i= 1, . . . , n, whereCis a discrete set, and n is the size of the tackled problem. The searching of the ACO algorithm is the process where artificial ants randomly walk on the graph G= (V, E, W_L, W_T), where V

is the set of vertexes, E is the set of edges,W_L is the length (traveling cost) of the edges, and W_T is the weight of the edges.Tis the pheromone matrix, of which the elementτ(i, j)

is the pheromone trail value of edge (i, j). Fig. 1 shows the framework of a basic ACO algorithm, and its modeling is also given in this section.

Step 1 initializes the pheromone matrix τ(i, j,0) =τ₀≥ τ_min>0, wherei, j= 1, . . . , n, andτ_min>0 is the minimal pheromone value. In step 4, each artificial ant first starts at a

(3)

Fig. 1. Framework of a basic ACO algorithm.

randomly selected vertex and then chooses the next vertex until

s=x_naccording to P(s_i+1=c_j|T, x_i) = ⎧ ⎨ ⎩ τα_(i,j,t)ηβ_(i,j) cy∈C∧(i,y)∈J(xi) τα_(i,j,t)ηβ_(i,y), if(i, j)∈J(xi) 0, otherwise (1)

where x_i={s₁, s₂, . . . , s_i}, x_i+1={s₁, s₂, . . . , s_i, y} for

∀s_i∈C, andi= 1, . . . , n.J(x_i)is the set of feasible neighbor vertexes fors_i.η(i, j)is the heuristic value of edge(i, j), andα,

β >0are the weight parameters. Step 7 updates the pheromone matrix according to the following:

∀(i, j)∈s_bs(t) :

τ(i, j, t+ 1)←(1−ρ₁)τ(i, j, t) +z₁(s_bs(t)) (2)

∀(i, j)∈sands∈S_iter(t) :

τ(i, j, t+ 1)←(1−ρ₂)τ(i, j, t) +z₂(s) (3) where (2) and (3) are named as global and local updating, respectively. After global and local updating, a pheromone renewal is again implemented according to

∀(i, j) :τ(i, j, t+ 1)←max{τ_min, τ(i, j, t+ 1)}. (4) In this paper,θandρare volatilization parameters meeting 0< θ, ρ <1. The increments z₁(·), z₂(·) :S→R+ satisfy

z₁(a)≥z₁(b)andz₂(a)≥z₂(b), respectively, if solutionais better thanb. Most of the ACO algorithms [1], [2], [4], [5] fall into the framework previously described.

The absorbing Markov chain model of the ACO algorithm is presented with the following definitions.

Definition 2: {ξ(t)}+_t=0∞ is the stochastic process for ACO algorithms, whereξ(t) = (X(t), T(t)), andX(t) ={s_bs(t)} ∪ S_iter(t).

The state of the stochastic process for the ACO algorithm per iterationξ(t)indicates the value of the best-so-far solution, the ant-attaining solutions, and the pheromone matrix at thetth iteration.

Definition 3: Y is the state set ofξ(t)fort= 0,1,2, . . .. Each element ofY is a feasible status forξ(t),t= 0,1,2, . . ..

Definition 4: Y∗ is the optimal state set if there is at least a solutions∗∈X∗ for∀ξ∗={X∗, T∗} ∈Y∗such thatf(s∗)≤ f(s)for∀s∈S, whereY∗⊂Y.

Each element ofY∗contains at least an optimal solution for the problem tackled.

Definition 5: A stochastic process{ξ(t)}+_t=0∞is an absorbing Markov chain if P{ξ(t+ 1)∈Y∗|ξ(t)∈Y∗}= 0, whereY∗

is the optimal state set.

The ACO algorithm will keep the optimal solutions∗forever when attainingY∗if{ξ(t)}+_t=0∞is an absorbing Markov chain. The definition of an absorbing Markov chain has been used to estimate the runtime of EA [35], [37]. This section also builds a similar model for the ACO algorithm as Lemma 1 shows.

Lemma 1 [16]: A stochastic process for the ACO algorithm

{ξ(t)}+_t=0∞is an absorbing Markov chain.

Proof: See Section A of the Appendix.

According to Lemma 1, an ACO algorithm can be modeled as an absorbing Markov chain.

IV. CONVERGENCETIME ANDITSESTIMATION A. Convergence Time

The convergence time is used as a measurement to analyze the runtime of ACO algorithms in this paper. First, some definitions of convergence time are introduced.

Definition 6: λ(t) =P{ξ(t)∈Y∗}is the probability that an ACO algorithm attains the optimal state set at thetth iteration.

Definition 7: γ is the convergence time, and Eγ is its expectation, such that P{ξ(t)∈Y∗}= 1 if t≥γ and

P{ξ(t)∈Y∗}<1if0≤t < γ.

Eγ indicates the mean time that ACO algorithms spend in reaching the optimal solution. The smaller expectationEγis, the smaller the time complexity of an ACO algorithm can be.

Definition 8: μ is the first hitting time, and Eμ is the expected first hitting time, such thatξ(t)∈Y∗andξ(t)∈Y∗if 0≤t < μ.

If an ACO algorithm is modeled as an absorbing Markov chain,Eγcan be calculated with the expected first hitting time

Eμ (Definition 8), which is used to study the computational complexity of EAs [25], [35].

Lemma 2: γ=μ is held for an absorbing Markov chain

{ξ(t)}+_t=0∞.

Proof: See Section B of the Appendix.

Lemma 2 indicates that the convergence time can be studied withEμ.

B. Estimation of Convergence Time

The following theorems can be used to estimateEγ, which is considered as a measurement of the time complexity of ACO algorithms in this paper.

(4)

Theorem 1: Eγ=+_i=0∞(1−λ(i))is held iflimt→+∞λ(t)=

1, λ(t) =P{ξ(t)∈Y∗}, and {ξ(t)}+_t=0∞ is an absorbing Markov chain.

Proof: See Section C of the Appendix.

Theorem 1 indicates that a direct approach in estimating the runtime of an ACO is to analyze the probabilityλ(t)(t=

1,2, . . .). λ(t) is hard to calculate in practice; therefore, the estimation method should be improved as Corollary 1 shows.

Corollary 1: b−1[1−λ(0)]≤Eγ≤a−1[1−λ(0)] is held if a≤p_t=P{ξ(t)∈Y∗|ξ(t−1)} ∈Y∗} ≤b (a,b >0 and

t= 1,2, . . .) and lim_t_→₊_∞λ(t) = 1 are true, where λ(t) = P{ξ(t)∈Y∗}, and{ξ(t)}+_t=0∞is an absorbing Markov chain.

Proof: See Section D of the Appendix.

The probabilityp_t=P{ξ(t)∈Y∗|ξ(t−1)}is easier to ob-tain thanλ(t)fort= 1,2, . . .. Therefore, Corollary 1 is more useful in analyzing theEγ of an ACO than Theorem 1. The theoretical results of the following sections are all based on the corollary.

V. ANALYZING THECONVERGENCETIME OFACO WITHPHEROMONERATE

A. Crucial Probability

According to Corollary 1, the probability p_t=P{ξ(t)∈ Y∗|ξ(t−1)∈Y∗}(t= 1,2, . . .) is decisive to estimate Eγ. However, p_t cannot directly be calculated, and the result is supposed to be further analyzed in detail. Fig. 1 shows that

p_t is determined at step 4. For a TSP problem of n cities, each artificial ant attains its solution by making n decisions according to (1).

Definition 9: q(i, t, s∗) =P(s_i=c∗_i|T, x_i₋₁)is the crucial probability that an artificial ant selectsc∗_i as the ith compo-nent, wheres_i₋₁, c∗_i ∈s∗, ands∗ is the optimal solution(i=

1, . . . , n).

P(s_i=c∗_i|T, x_i₋₁)is defined in (1). Therefore,q(1, t, s∗) = P(s1=c∗₁|T, x0) = 1is true, which means that the first chosen component belongs tos∗.

The probability that an ant finds the optimal solutions∗ is

P(t, s∗) =n_i=1q(i, t, s∗). According to the property of the absorbing Markov chain, fort= 1,2, . . . , P{ξ(t)∈Y∗|ξ(t−

1)∈Y∗}= 1is held. GivenKants in an ACO algorithm, the probability thats∗ is attained at thetth iteration time can be calculated according to the following:

p_t=P{ξ(t)∈Y∗|ξ(t−1)∈Y∗} = 1−(1−P(t, s∗))K = 1− 1−n i=1 q(i, t, s∗) K . (5)

Therefore,q(i, t, s∗) is the crucial factor in the time com-plexity of an ACO algorithm, which can be indicated in Corollaries 2 and 3.

Corollary 2: Eγ≤(1−λ(0))[1−(1−pn_low)K]−1 is held if q(i, t, s∗)≥p_low>0 for t= 1,2, . . . and i= 1, . . . , n, wheres∗is the only optimal solution.

Proof: See Section E of the Appendix.

Corollary 2 indicates that an upper bound of the expected convergence time can be confirmed if the crucial probability has a unified lower bound. Corollary 3 proposes the opposite result.

Corollary 3: Eγ≤P(n) is held if q(i, t, s∗)≥[1−(1− P(n)−1)1/K]1/n>0fort= 1,2, . . .andi= 1, . . . , n, where

nis the size of the tackled problem, andP(n)is a polynomial ofnorder.

Proof: See Section F of the Appendix.

Corollaries 2 and 3 show the importance ofq(i, t, s∗). The time complexity of an ACO algorithm can be as small as a polynomial ofnorder ifq(i, t, s∗)is big enough fort= 1,2, . . .

andi= 1, . . . , n. The following section will explain the result in more detail.

B. Pheromone Rate

The pheromone value, which is used to simulate the biologic behavior of ants, is the essential factor of ACO algorithms. In this section, we propose a method in estimating the ACO time complexity based on pheromone rate (see Definition 10 below), which is a factor of the pheromone value.

Definition 10:c(i, t)=(τα(a∗_i₋₁, a∗_i, t))/(_a∗

i−1,x∈J(ai−1)×

τα(a∗_i₋₁, x, t)) is the pheromone rate of the path

a∗_i₋₁, a∗_i ∈s∗.

The results of Corollary 2 can be extended with the pheromone rate as follows.

Corollary 4: Eγ≤(1−λ(0))·[1−(1−c_lown (ηβ(a∗_i₋₁, a∗_i)/ ηβ(a∗_i₋₁,max)))K]−1 is held if c(i, t)≥c_low>0 for t=

1,2, . . .andi= 1, . . . , n, wheres∗is the only optimal solution,

ηβ(a∗_i₋₁,max) = max_(a∗

i−1,x)∈J(a∗i−1){η

β₍_a∗

i−1, x)}, anda∗j∈

s∗(j= 0,1, . . . , n).

Proof: See Section G of the Appendix.

Corollary 4 reveals the relationship between the convergence time and the pheromone rate of an ACO algorithm. The unified low bound of pheromone rate c(i, t)for t= 1,2, . . .andi=

1, . . . , ndetermines the expected convergence timeEγ.

Corollary 5: Eγ≤P(n) is held if c(i, t)≥(ηβ(a∗_i₋₁,

max)/ηβ(a∗_i₋₁, a∗_i))[1−(1−P(n)−1)1/K]1/n for t= 1,2, . . . and i= 1, . . . , n, where s∗ is the only optimal solution, ηβ(a∗_i₋₁,max) = max_(a∗

i−1,x)∈J(a∗i−1){η

β₍_a∗

i−1, x)},

a∗_i ∈s∗(a∗₀=Null), andP(n)is a polynomial ofnorder.

Proof: It can directly be proved based on Corollaries 3

and 4.

Therefore, two methods for the estimation of the ACO run-time can be summarized as follows. One is as follows.

1) Calculate a unified lower or upper bound of crucial prob-abilityq(i, t, s∗)fort= 1,2, . . .andi= 1, . . . , n. 2) Estimate the ACO convergence time by using

Corollaries 2 and 3. Another approach is as follows.

1) Calculate a unified lower or upper bound of pheromone ratec(i, t)fort= 1,2, . . .andi= 1, . . . , n.

2) Estimate the ACO convergence time by using Corollaries 4 and 5.

(5)

Fig. 2. Framework of a brief ant system algorithm.

C. Convergence Time Estimation With Pheromone Rate

The unified low bounds of c(i, t) and q(i, t, s∗) for t=

1,2, . . ., however, cannot easily be calculated every time. The method mentioned in Section V-B should be improved to be more practical for runtime analysis. The improvement can be made in a less rigorous condition that the unified low bounds

c_lowfort=t₀+ 1, t₀+ 2, . . .can be determined, where some

t₀exists such thatc(i, t)≥c_low(t₀)>0ift > t₀andc(i, t)< c_low(t₀)ift≤t₀. The complexity can be estimated by the sum-marization oft₀andEγ[signed asω(c_low, t₀)of Definition 11] based onc_low.

Definition 11: ω(c_low, t₀)is the upper bound ofEγ calcu-lated by Corollary 4 with c_low, such that ω(c_low, t₀) = (1− λ(0)) · [1−(1 −c_lown (ηβ(a∗_i₋₁, a∗_i)/ ηβ(a∗_i₋₁,max)))K]−1, wherec(i, t)≥clow>0ift > t0, andc(i, t)< clowift≤t0.

Theorem 2: Given s∗ is the only optimal solution, Eγ≤ E[t0|ξ(0)] +ω(clow, t0)is held, wherec(i, t)≥clow>0when

t > t₀; else,c(i, t)< c_lowwhent≤t₀.

Proof: See Section H of the Appendix.

Theorem 2 indicates an improved method for the estimation of the ACO convergence time. This more practical method could be summarized as follows.

1) Calculate a low boundc_lowofc(i, t)fort=t₀+ 1, t₀+

2, . . ..

2) EstimateE[t0|ξ(0)]. 3) Calculateω(c_low, t₀).

4) AddE[t₀|ξ(0)]andω(c_low, t₀)to obtain an upper bound ofEγ.

VI. CASESTUDIES

In this section, we present some case studies on the estima-tion of time complexity for a brief ACO algorithm (Fig. 2) as examples on how to use the proposed methods and theoretical results.

The ant system algorithm is a brief version of the ACO algorithm. In step 2, (1) is changed into (6). Equation (8) is a brief version of (1) without considering the heuristic value and weight parameters so that the point of the case study lies in the

pheromone rate, which is the key factor of the aforementioned proposed theoretical result:

P(s_i+1=c_j|T, x_i) = ⎧ ⎨ ⎩ τ(xi,xj,t) y∈C∧xi,y∈J(xi) τ(i,y,t), ifxi, xj ∈J(xi) 0, otherwise. (6)

Moreover, step 3 uses another updating rule:

∀x, y ∈sands∈S_iter(t) :

τ(x, y, t+ 1)←τ(x, y, t) + Δτ(x, y, t). (7) Equation (7) runsK·ntimes per iteration if there areK arti-ficial ants. The following sections indicate the time complexity analysis of four ACO algorithms of the framework in Fig. 2: Ant System (AS)-1, AS-2, AS-3, and AS-4.

A. AS-1 Algorithm

AS-1 is an AS algorithm with the setting K= 1 and Δτ(x, y) =Q, where Q is a constant, ∀x, y ∈s and s∈ S_iter(t)(t= 1,2, . . .).

Theorem 3: Given{ξas-1(t)}+_t=0∞is the stochastic process for the AS-1 algorithm, then, E{c(i, t+ 1)|ξas-1(t)}=c(i, t) is held fori= 1, . . . , nandt= 0,1,2, . . ..

Proof: See Section I of the Appendix.

According to Theorem 3, we have

E{c(i, t+ 1)−c(i, t)|ξas-1(t)}

=E{c(i, t+ 1)|ξas-1(t)} −c(i, t) = 0.

The pheromone rate c(i, t) for the components of optimal solution s∗ remains constant during the iterations, where i=

1, . . . , n, and t= 0,1,2, . . .. According to step 1 of Fig. 1,

τ(i, j,0) =τ₀andc(i, j,0) =n−1are true fori, j= 1, . . . , n. Thus, we have E[t₀|ξas-1(0)] = +∞ if c_low> c(i, j,0) = n−1, wherec(i, t)≥clow>0 whent > t0, and c(i, t)< clow

whent≤t0.

Given c(i, t)≈E{c(i, t)|ξas-1(0), ξas-1(1), . . . , ξas-1(t−

1)}, c(i, t)≈E{c(i, t)|ξas-1(t−1)} because {ξas-1(t)}+_t=0∞

is an absorbing Markov chain for Definition 4. According to K= 1 and (8), we have the relationship on the crucial probability (see Definition 9)

q(i, t, s∗)≈n−1, i= 1, . . . , n.

Then, it follows that

p_t=Pξas-1(t)∈Y∗|ξas-1(t−1)∈Y∗ = 1− 1− n i=1 q(i, t, s∗) K ≈n−n.

Thus, Eγas-1≈(1−λ(0))·nn is held, according to Corollary 1. Because λ(0) = 0, Eγas-1≈nn is true, which indicates that the convergence time of AS-1 is exponential.

(6)

B. AS-2 Algorithm

AS-2 is an AS algorithm with the settingK=k₀>1 and Δτ(x, y) =Q, where Q is a constant, ∀x, y ∈s and s∈ S_iter(t) (t= 1,2, . . .). Because k₀ ants independently modify the pheromone matrixT(t)during the iterations, we have

Ec(i, t+ 1)−c(i, t)|ξas-2(t)

=k0·Ec(i, t+ 1)−c(i, t)|ξas-2(t)= 0

where{ξas-2(t)}+_t=0∞is the stochastic process for AS-2. The convergence time of AS-2 is also approximate to nn. The increment of the number of ants cannot improve the per-formance of the AS-1 algorithm. The theoretical result indicates that the performances of AS-1 and AS-2 are very bad because of the large amount of iteration time they cost.

C. AS-3 Algorithm

AS-3 is an AS algorithm with the setting K= 1 and whereΔτ(x, y, t)is alterable,∀x, y ∈sands∈Siter(t) (t=

1,2, . . .).

Theorem 4: Given {ξas-3(t)}+_t=0∞ is the stochastic process for the AS-3 algorithm, E{c(i, t+ 1)|ξas-3(t)}= [1 + θ(i, t)]c(i, t) is held (i=1, . . . , n and t=1,2, . . .), where

θ(i, t)=h/g, h=_a_i₋₁_,y_∈_J(a_i₋₁₎_−{_a_i₋₁_,a_i_}((τ(a_i₋₁, y, t)· [Δτ(a_i₋₁, a_i, t)−Δτ(a_i₋₁, y, t)])/l), g= Δτ(a_i₋₁, a_i, t) + ai−1,x∈J(ai−1)τ(ai−1, x, t), and l= Δτ(ai−1, y, t) + ai−1,x∈J(ai−1)τ(ai−1, x, t).

Proof: See Section J of the Appendix.

Thus, E{c(i, t)|ξas-3(t−1)} rises as Δτ(a_i₋₁, a_i, t−1)

increases for i= 1, . . . , n and t= 1,2, . . .. We have

E{c(i, t)|ξas-3(t−1)} ≥(1 +θ)·c(i, t−1)andE{c(i, t)− c(i, t−1)|ξas-3(t−1)} ≥θ·c(i, t−1), when θ(i, t)≥ θ >0.

Given c(i, t)≈E{c(i, t)|ξas-3(0), ξas-3(1), . . . , ξas-3(t−

1)}, the relationship c(i, t)≈E{c(i, t)|ξas-3(t−1)} is true, because{ξas-3(t)}+_t=0∞is an absorbing Markov chain according to Definition 4. Thus, we have

E{c(i, t)|ξas-3(0), ξas-3(1), . . . , ξas-3(t−1)}

≥(1 +θ)tc(i,0)≥(1 +θ)

t

n . (8)

Then, t≤log_(1+θ)(n·E{c(i, t)|ξas-3(0), ξas-3(1), . . . , ξas-3(t−1)}) is true. Given c_low> c(i, j,0) =n−1>0, such that c(i, t)≥clow when t > t0 and c(i, t)< clow when 0< t≤t0,E[t0|ξas-3(0)]≤log_(1+θ)(n·clow).

Given s∗ is the only optimal solution, Eγas-3≤ E[t₀|ξas-3(0)] +ω(c_low, t₀)is held, according to Theorem 2, whereω(cn_low) = (1−λ(0))[cn_low]−1=c−_lown. Thus, we have

Eγas-3≤log_(1+θ)(n·c_low) +c−_lown. (9) The expected convergence time can be less if the unified low bounds of pheromone ratec_lowand pheromone rate difference

θbecome smaller. Moreover, the problem tackled of bigger size

nrequires more runtime of the ACO.

D. AS-4 Algorithm

AS-4 is an AS algorithm with the setting K=k₀>1and whereΔτ(x, y, t)is alterable,∀x, y ∈sands∈S_iter(t) (t=

1,2, . . .).

Becausek₀ants independently modify the pheromone matrix

T(t)during the iterations, we have

Ec(i, t+ 1)−c(i, t)|ξas-4(t)

=k₀·Ec(i, t+ 1)−c(i, t)|ξas-4(t)

≥k0·θ·c(i, t) (10)

where θ(i, t)≥θ >0, and {ξas-4(t)}+_t=0∞ is the stochastic process for AS-4.

Thus, it follows that

Ec(i,1)|ξas-4(0)≥k₀·θ·c(i,0) +c(i,0) ≥1 +k0·θ n Ec(i,2)|ξas-4(1)≥k₀·θ·c(i,1) +c(i,1) ≥(1 +k₀·θ)·c(i,1) .. . Ec(i, t)|ξas-4(t−1)≥k0·θ·c(i, t−1) +c(i, t−1) ≥(1 +k0·θ)·c(i, t−1). Given c(i, t)≈E{c(i, t)|ξas-4(0), ξas-4(1), . . . , ξas-4(t−

1)}, we have the relationshipc(i, t)≈E{c(i, t)|ξas-4(t−1)}, because{ξas-4(t)}+_t=0∞is an absorbing Markov chain according to Definition 4. Then, it follows that

Ec(i, t)|ξas-4(0), ξas-4(1), . . . , ξas-4(t−1) =Ec(i, t)|ξas-4(t−1) ≥(1 +k0·θ)c(i, t−1) ≥ (1 +k0θ)t n . (11) Thus, we have t≤log_(1+k₀_·_θ)n·Ec(i, t)|ξas-4(0) ξas-4(1), . . . , ξas-4(t−1). Given c_low> c(i, j,0) =n−1>0, E[t₀|ξas-4(0)]≤

log_(1+θ)(n·c_low)is held, under the following two cases. 1) c(i, t)≥clowwhent > t0.

2) c(i, t)< c_lowwhent≤t₀.

Given s∗ is the only optimal solution, Eγas-3≤ E[t₀|ξas-4(0)] +ω(c_low, t₀) is held according to Theorem 2, where ω(cn_low) = (1−λ(0))[cn_low]−1=c−_lown. Therefore, we have

Eγas-4≤log_1+k₀_·_θ[n·c_low] +1−(1−cn_low)k0−1_. ₍₁₂₎ Equation (12) shows that the convergence time of AS-4 will decrease if the number of antsk₀becomes larger. Thus, AS-4 is an improvement of AS-3.

(7)

Given c_low=[1−(1−P(n)−1)1/k0]1/n _and log

(1+k0·θ) (n·c_low)≤P(n) (i= 1, . . . , n and t= 1,2, . . .), we have

Eγas-4≤P(n) +P(n), where bothP(n)andP(n)are poly-nomials ofnorders. Hence, the AS-4 algorithm converges in polynomials ofnorders in this case.

E. Conclusion of Case Studies

This section presents the convergence time analysis of a brief ACO algorithm (see Fig. 2) by using the theoretical results proposed in Sections IV and V. Sections VI-A–D indicate four cases of the brief ACO algorithm (AS-1, AS-2, AS-3, and AS-4), which have different pheromone updating rules and the same setting in other parts. The conclusions of analyzing the convergence time of the four cases can be drawn as follows.

1) AS-1 and AS-2, of which the pheromone increment per updating is a constant, perform worse than AS-3 and AS-4 for a no-increment pheromone rate per iteration on average.

2) The convergence times of AS-3 and AS-4 can be reduced when c_low becomes large according to (9) and (12), respectively.

3) The convergence times of AS-3 and AS-4 can be re-duced whenθ becomes large according to (9) and (12), respectively.

4) AS-4 performs better than AS-3 for a larger number of ants based on the (12).

The next section presents eight ACO algorithms that belong to AS-1, AS-2, AS-3, and AS-4, respectively. The algorithms show different performances in solving TSP problems. The simulated items for describing c_low and θ are designed and investigated during the solution of the algorithms. The conclu-sion of the case studies in this section is applied to analyze the convergence time of the tested algorithms and explain the reasons for their different performances.

VII. NUMERICALEXPERIMENT

This section presents a numerical experiment as an applica-tion of the proposed theoretical result. Eight ACO algorithms following the framework of Fig. 2 are considered, and their performances of solving TSP problems are analyzed with the conclusion of case studies in Section VI.

The only differences of the tested algorithm are the num-ber of ants k₀ and the pheromone increment per updating Δτ(x, y, t), which are indicated in Table I.

In Table I,d(x, y)is the length of path(x, y), andL(x, y, t)

is the length of the tour where path (x, y) belongs to. AS-1, AS-2, AS-3, and AS-4 are the algorithm cases in Section VI.

The eight algorithms are tested in solving 20 benchmark TSP problems [36]: berlin52, eil101, lin105, and so on. The maxi-mum number of iterations is 2000 for all of the algorithms. Each algorithm for a problem runs 30 times. During the solution, several items (shown in Table II) are investigated to explain the different performances of the algorithms, which is shown in Tables III–V. For the limit of the space, only the experimental results of three problems are indicated in detail.

TABLE I

EIGHTTESTEDACO ALGORITHMS

TABLE II

INVESTIGATEDITEMS OF THETESTEDALGORITHMS

TABLE III

COMPARISON OFKEYITEMSAMONGEIGHTACO ALGORITHMS IN THEberlin52 PROBLEM

In Table II,˜c_lowis investigated to describe the unified low bound of the pheromone rate signed as clow in Sections V and VI. θ˜min, θ˜max, θ˜ave, and N_θ+_˜ are used to simulate the unified low bounds of the difference of pheromone rate signed asθin Section VI. The second and third conclusions of the case studies in Section VI indicate that the biggerc_lowandθare, the better the respective ACO algorithm performs.

(8)

TABLE IV

COMPARISON OFKEYITEMSAMONGEIGHTANTSYSTEM

ALGORITHMS IN THEeil101 PROBLEM

TABLE V

COMPARISON OFKEYITEMSBETWEENEIGHTANTSYSTEM

ALGORITHMS IN THElin105 PROBLEM

In Tables III–V, Len_ave is the average length of the best solution in 20 runs. The comparison results include the following.

1) The ACO algorithms (AS-D-1, AS-D-k, AS-T-1, AS-T-k, AS-TD-1, and AS-TD-k) that belong to the AS-3 and AS-4 cases obtained better solutions than the algorithms (AS-Q-1 and AS-Q-k) of the AS-1 and AS-2 cases. It ver-ifies the first conclusion of the case studies in Section VI. 2) The algorithms (AS-TD-1 and AS-TD-k) with larger values ofc˜low attain better performances, which proves the second conclusion of case studies in Section VI. 3) The algorithms (AS-D-1 and AS-TD-k) with largerθ˜_min,

˜

θ_max,θ˜_ave, andN+_˜

θ values perform better, which testifies

to the third conclusion of case studies in Section VI. 4) The ten-ant ACO algorithms (AS-D-k, AS-T-k, and

AS-TD-k) perform better than the one-ant ACO algo-rithms (AS-D-1, AS-T-1, and AS-TD-1), which proves that AS-4 performs better than AS-3 as the last conclusion of the case studies in Section VI suggests.

The following figures will indicate the results in detail.

Fig. 3. Length comparison among the one-ant ACO algorithms. (a) Results of the berlin52 problem. (b) Results of eil101 problem. (c) Results of lin105 problem.

Figs. 3 and 4 show the length of the best-so-far solu-tion per iterasolu-tion on average of 30 runs for the tested ACO algorithms. The results indicate that AS-TD-1 performs the best among all four one-ant ACO algorithms. Figs. 5–8 show that AS-TD-k performs the best among all four ten-ant ACO algorithms.

Figs. 5–8 show the pheromone rate andN+_˜

θ(t)per iteration

(9)

Fig. 4. Length comparison among the ten-ant ACO algorithms. (a) Results of the berlin52 problem. (b) Results of the eil101 problem. (c) Results of the lin105 problem.

N_θ+_˜(t)is the average positive-seta number ofndimensions at thetth iteration. It is obvious that the order (from the largest to the smallest) of the pheromone rate and N+_˜

θ(t) is nearly

the same as the order (from the shortest to the longest) of the solution length to each TSP problem. The results again prove the correctness of the conclusions of the case studies in Section VI.

Fig. 5. Pheromone rate comparison among the one-ant ACO algorithms. (a) Results of the berlin52 problem. (b) Results of the eil101 problem. (c) Results of the lin105 problem.

VIII. CONCLUSION ANDDISCUSSION

This paper proposes a theoretical result of an ACO runtime analysis based on an absorbing Markov chain model, which reveals the relationship between the pheromone rate and the runtime of the ACO algorithm.

The convergence time is the measurement of the runtime of an ACO in our study. First, the ACO algorithm is modeled as an absorbing Markov chain, and some general theoretical

(10)

Fig. 6. Pheromone rate comparison among the ten-ant ACO algorithms. (a) Results of the berlin52 problem. (b) Results of the eil101 problem. (c) Results of the lin105 problem.

results about the convergence time are introduced. Then, the probability that an artificial ant selects a component of the optimal solution is studied as a crucial factor having an im-pact on the ACO runtime. Moreover, the result is specified through analyzing the pheromone rate, with which two methods for estimating the convergence time are proposed. The first approach is suitable for the case where a unified low bound of pheromone rate is definite. The second approach improves the first one in three aspects: 1) estimating the iteration time

Fig. 7. N_˜+

θ (t)comparison among the one-ant ACO algorithms. (a) Results of the berlin52 problem. (b) Results of the eil101 problem. (c) Results of the lin105 problem.

that a pheromone rate spends in reaching the objective value, which is named as first reaching time (FRT); 2) calculating the expected convergence time based on the objective pheromone rate; and 3) estimating the ACO runtime by using both FRT and convergence time. Moreover, based on the analysis of this paper, the condition where ACO algorithms converge inn-order polynomial time is also given.

(11)

Fig. 8. N+_˜

θ(t)comparison among the ten-ant ACO algorithms. (a) Results of the berlin52 problem. (b) Results of the eil101 problem. (c) Results of the lin105 problem.

The proposed method is used to analyze a class of brief ACO algorithms as case studies. The convergence time of four ACO algorithms is estimated. In the analysis, we find that the incre-ment of the number of ants cannot improve the performance of an ACO algorithm when the increment of pheromone rate is constant. However, the convergence time could be reduced in some special cases when the number of ants is larger. In

conclusion of the case studies, it can be drawn that a more pheromone rate and pheromone rate difference per iteration can reduce the runtime of the brief ACO algorithms.

As an application of the proposed theory, a result of the numerical experiment is proposed to explain the different per-formances of the ACO algorithms with different pheromone updating rules. In the experiment, four one-ant ACO algorithms and four ten-ant ACO algorithms are tested to solve benchmark TSP problems. During the solution, several items are investi-gated for the simulation of pheromone rate. As a result, the investigation verifies the correctness of the conclusions of the case studies.

The proposed results and methods are useful when the pheromone rates for the components of the optimal solution are able to increase in expectation during the searching. Therefore, future study is suggested to extend the results into other kinds of ACO algorithms and design rules to improve the ACO algorithms in practice.

APPENDIX A. Proof of Lemma 1

Proof: {ξ(t)}+_t=0∞ is the stochastic process of discrete state for ξ(t) = (X(t), T(t)) is denoted as discrete codes in a computer. Because (X(t), T(t)) is only determined by (X(t−1), T(t−1)) and {X(0), T(0)} can be arbi-trary, P{ξ(t)∈Y|ξ(0), . . . , ξ(t−1)}=P{ξ(t)∈Y|ξ(t−

1)} ∀Y⊆Y. Thus,{ξ(t)}+_t=0∞is a Markov chain.

ξ(t)∈Y∗ ifs_bs(t) =s∗. According to the fifth step of the ACO algorithm (Fig. 1),s_bs(t+ 1) =s_bs(t), so thatξ(t+ 1)∈ Y∗andP{ξ(t+ 1)∈Y∗|ξ(t)∈Y∗}= 0.

Therefore,{ξ(t)}+_t=0∞is an absorbing Markov chain based on

Definition 4.

B. Proof of Lemma 2

Proof: It is obvious thatP{ξ(μ+ 1)∈Y∗|ξ(μ)∈Y∗}=

1 is held, becauseξ(t)∈Y∗ whent=μand{ξ(t)}+_t=0∞ is an absorbing Markov chainP{ξ(μ+ 1)∈Y∗|ξ(μ)∈Y∗}= 0.

Similarly, P{ξ(t)∈Y∗}= 1 when t≥μ. According to Definition 8,P{ξ(t)∈Y∗}<1whent < μ. Therefore,γ=μ

is held owing to Definition 7.

C. Proof of Theorem 1

Proof: P{μ=t}=λ(t)−λ(t−1)is held, sinceλ(t) = P{ξ(t)∈Y∗}=P{μ≤t} is true. It follows that λ(t)− λ(t−1) =P{μ≤t} −P{μ≤t−1}=P{μ=t}. Therefore, we have Eμ= +∞ t=1 t·(λ(t)−λ(t−1)) = +∞ t=1 t·(λ(t)−λ(t−1)) = (λ(1)−λ(0)) + 2 (λ(2)−λ(1)) +· · · +N(λ(N)−λ(N−1)) +· · ·

(12)

=+∞ i=1 (λ(i)−λ(i−1)) + +∞ i=2 (λ(i)−λ(i−1)) +· · · ++∞ i=N (λ(i)−λ(i−1)) +· · · =+∞ t=1 +∞ i=t (λ(t)−λ(t−1)) = +∞ t=1 lim N→+∞λ(N)−λ(t−1) =+∞ t=1 [1−λ(t−1)] = +∞ i=0 (1−λ(i)).

According to Lemma 2,γ=μis true so thatEγ=+_i=0∞(1−

λ(i))is held.

D. Proof of Corollary 1

Proof: P{ξ(t)∈Y∗|ξ(t−1)∈Y∗}= 1 is held, since

{ξ(t)}+_t=0∞is an absorbing Markov chain. Thus, it follows that

λ(t) = [1−λ(t−1)]P{ξ(t)∈Y∗|ξ(t−1)∈Y∗} +λ(t−1)P{ξ(t)∈Y∗|ξ(t−1)∈Y∗} = [1−λ(t−1)]p_t+λ(t−1) ∀t= 0,1,2, . . . . Then, we have 1−λ(t)≤[1−p_t] [1−λ(t−1)] = [1−λ(0)] t i=1 (1−p_i). According to Theorem 1 Eγ= +∞ t=0 (1−λ(t)) = +∞ t=0 [1−λ(0)] t i=1 (1−p_i) ≤a[1−λ(0)] +∞ t=0 (1−a)t ≤[1−λ(0)]1 a =a−1[1−λ(0)]. Similarly,Eγ≥b−1[1−λ(0)].

Hence,b−1[1−λ(0)]≤Eγ≤a−1[1−λ(0)]is held.

E. Proof of Corollary 2

Proof: Because q(i, t, s∗)≥p_low>0 (t= 1,2, . . . and

i= 1, . . . , n) is true, we have

p_t=P{ξ(t)∈Y∗|ξ(t−1)∈Y∗} ≥1−(1−pn_low)K.

According to Corollary 1, it is obvious that Eγ≤(1− λ(0))[1−(1−pn_low)K]−1is held.

F. Proof of Corollary 3

Proof: Because q(i, t, s∗)≥p_low= [1−(1− P(n)−1)1/K]1/n>0 (t= 1,2, . . . and i= 1, . . . , n) is true, according to Corollary 1, we have

Eγ≤(1−λ(0))

1−(1−pn_low)K

₋₁

= (1−λ(0))P(n).

Fig. 1 shows that X(0) ={s_bs(0)} ∪S_iter(0) =∅; there-fore,λ(0) = 0. Hence, Eγ≤P(n)is held, which means that the time complexity of an ACO algorithm can be a polynomial

ofnorder. G. Proof of Corollary 4 Because q(i, t, s∗) =P(si=a∗_i|T, xi−1) = τα a∗_i₋₁, a∗_i, tηβa∗_i₋₁, a_i a∗ i−1,x∈J(a∗i−1) ταa∗_i₋₁, x, tηβa∗_i₋₁, x ≥ ηβ a∗_i₋₁, a∗_iταa∗_i₋₁, a∗_i, t ηβa∗_i₋₁,max a∗ i−1,x∈J(a∗i−1) ταa∗_i₋₁, x, t ≥c(i, t) η β_a∗ i−1, a∗i ηβa∗_i₋₁,max ≥c_low η β_a∗ i−1, a∗i ηβa∗_i₋₁,max >0

is true, it is obvious that

Eγ≤(1−λ(0))· ⎡ ⎣1− 1−cn_low η β_a∗ i−1, a∗i ηβa∗_i₋₁,max K⎤ ⎦ −1 according to Theorem 1. H. Proof of Theorem 2

Proof: Let t=t−t₀ and c(i, t)≥c_low(t₀)>0

for t= 1,2, . . . and i= 1, . . . , n. According to Corollary 2,ω(c_low, t₀)=(1−λ(0))[1−(1−cn_lowηβ(a∗_i₋₁, a∗_i)/ ηβ(a∗_i₋₁,max))K]−1. Becausec(i, t)≥c_low>0whent > t₀, we have

Eγ≤t₀+ω(c_low, t₀). (13) Based on Corollary 1,Eγ=+_t=0∞[(1−λ(0))t_i=1(1−p_i)]

is held. According to Definition 2, we have ξ(0) =

(X(0), T(0)) and X(0) ={s_bs(0)} ∪S_iter(0). Because

X(0) =∅ as shown in steps 2–3 of Fig. 1, λ(0) = 0 and

Eγ=+_t=0∞[t_i=1(1−pi)]are true.

Because λ(0)=0⇔P{ξ(0)∈Y∗}=0, p₁=P{ξ(1)∈Y∗| ξ(0)∈Y∗}=P{ξ(1)∈Y∗}; therefore, Eγ=+_t=0∞[t_i=1(1− p_i)]has no relation withξ(0).

(13)

Ec(i, t+ 1)|ξas-1(t)= ai−1,y∈J(ai−1)−{ai−1,ai} ⎡ ⎢ ⎣ τ(ai−1, y, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)· τ(a_i₋₁, a_i, t) Q+ ai−1,x∈J(ai−1) τ(a_i₋₁, x, t) ⎤ ⎥ ⎦ + τ(ai−1, ai, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)· τ(a_i₋₁, a_i, t) +Q Q+ ai−1,x∈J(ai−1) τ(a_i₋₁, x, t) = τ(ai−1, ai, t) Q+ ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)+ τ(a_i₋₁, a_i, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)· Q Q+ ai−1,x∈J(ai−1) τ(a_i₋₁, x, t) = τ(ai−1, ai, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t) =c(i, t) Ec(i, t+ 1)|ξas-3(t)= ai−1,y∈J(ai−1)−{ai−1,ai} ⎡ ⎢ ⎣ τ(ai−1, y, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)· τ(a_i₋₁, a_i, t) l ⎤ ⎥ ⎦ + τ(ai−1, ai, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)· τ(a_i₋₁, a_i, t) + Δτ(a_i₋₁, a_i, t) g = ai−1,y∈J(ai−1)−{ai−1,ai} ⎡ ⎢ ⎣ τ(ai−1, ai, t) ai−1,x∈J(ai−1) τ(ai−1, x, t)· τ(a_i₋₁, y, t) l ⎤ ⎥ ⎦ + τ(ai−1, ai, t) ai−1,x∈J(ai−1) τ(a_i₋₁, x, t)· τ(a_i₋₁, a_i, t) + Δτ(a_i₋₁, a_i, t) g = ⎡ ⎣τ(ai−1, ai, t) + Δτ(ai−1, ai, t) g + ai−1,y∈J(ai−1)−{ai−1,ai} τ(a_i₋₁, y, t) l ⎤ ⎦_·c(i, t) = ⎡ ⎣τ(ai−1, ai, t) + Δτ(ai−1, ai, t) g + h g + ai−1,y∈J(ai−1)−{ai−1,ai} τ(a_i₋₁, y, t) g ⎤ ⎦_·c(i, t) = g g + h g ·c(i, t) = [1 +θ(i, t)]c(i, t)

Because ω(c_low, t₀)=(1−λ(0))[1−(1−cn_low(ηβ(a∗_i₋₁, a∗_i)/ ηβ(a∗_i₋₁,max)))K]−1, which has no relation with ξ(0),

E[ω(c_low, t₀)|ξ(0)] =ω(c_low, t₀).

Hence, according to (13), we have Eγ≤E[t₀|ξ(0)] +

ω(clow, t0).

I. Proof of Theorem 3

Proof: Because K= 1 and Δτ(x, y) =Q (Qis a con-stant), we obtain the first set of equations shown at the top of

the page.

J. Proof of Theorem 4

Proof: According to (9) and Δτ(x, y, t), we have the second set of equations shown at the top of the page.

ACKNOWLEDGMENT

The authors would like to thank Prof. X. Yao for his general introduction and comment to the primary result of the pre-sented paper in the international conference SEAL 2006 and Dr. X. W. Yang for his suggestion on the revision of this paper.

REFERENCES

[1] C. Q. Zhang and Y. C. Lu, “The improved ant colony algorithm based on immunity system genetic algorithm and application,” inProc. 5th IEEE Int. Conf. Cognitive Inform., 2006, vol. 2, pp. 726–731.

[2] Y. X. Wang and Y. Wang, “Cognitive informatics models of the brain,”

IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 36, no. 2, pp. 203– 207, Mar. 2006.

[3] M. Dorigo, V. Maniezzo, and A. Colorni, “Ant system: Optimization by a colony of cooperating agents,” IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 26, no. 1, pp. 29–41, Feb. 1996.

(14)

[4] M. Dorigo, G. D. Caro, and L. M. Gambardella, “Ant algorithms for discrete optimization,” Artif. Life, vol. 5, no. 2, pp. 137–172, Apr. 1999.

[5] M. Dorigo and T. Stützle,Ant Colony Optimization. Cambridge, MA: MIT Press, 2004.

[6] M. Dorigo and L. M. Gambardella, “Ant colony system: A cooperative learning approach to the traveling salesman problem,”IEEE Trans. Evol. Comput., vol. 1, no. 1, pp. 53–66, Apr. 1997.

[7] T. Stützle and H. H. Hoos, “MAX–MIN ant system,”Future Gener. Comput. Syst., vol. 16, no. 8, pp. 889–914, 2000.

[8] W. J. Gutjahr, “A generalized convergence result for the graph-based ant system metaheuristic,” Dept. Stat. Decis. Support Syst., Univ. Vienna, Vienna, Austria, 1999. Tech. Rep., pp. 99–109.

[9] W. J. Gutjahr, “A graph-based ant system and its convergence,”Future Gener. Comput. Syst., vol. 16, no. 9, pp. 873–888, Jun. 2000.

[10] W. J. Gutjahr, “ACO algorithms with guaranteed convergence to the optimal solution,”Inform. Process. Lett., vol. 82, no. 3, pp. 145–153, May 2002.

[11] T. Stützle and M. Dorigo, “A short convergence proof for a class of ant colony optimization algorithms,”IEEE Trans. Evol. Comput., vol. 6, no. 4, pp. 358–365, Aug. 2002.

[12] M. Dorigo and C. Blum, “Ant colony optimization theory: A survey,”

Theor. Comput. Sci., vol. 344, no. 2/3, pp. 243–278, Nov. 2005. [13] A. Badr and A. Fahmy, “A proof of convergence for ant algorithms,”

Inform. Sci., vol. 160, pp. 267–279, 2004.

[14] F. Neumann and C. Witt,Runtime Analysis of a Simple Ant Colony Opti-mization Algorithm, vol. 4288. Berlin, Germany: Springer-Verlag, 2006, pp. 618–627.

[15] Z. F. Hao and H. Huang, A Time Complexity Analysis of ACO for Linear Functions, vol. 4247. Berlin, Germany: Springer-Verlag, 2006, pp. 513–520.

[16] H. Huang, Z. F. Hao, C. G. Wu, and Y. Qin, “The convergence speed of Ant Colony Optimization,”Chin. J. Comput., vol. 30, no. 8, pp. 1343– 1353, 2007 (in Chinese).

[17] G. Rudolph, “Convergence properties of canonical genetic algorithms,”

IEEE Trans. Neural Netw., vol. 5, no. 1, pp. 96–101, Jan. 1994. [18] J. Garnier, L. Kallel, and M. Schoenauer, “Rigorous hitting times for

binary mutations,”Evol. Comput., vol. 7, no. 2, pp. 167–203, 1999. [19] S. Droste, T. Jansen, and I. Wegener, “A rigorous complexity analysis

of the (1+1) evolutionary algorithm for linear functions with Boolean inputs,”Evol. Comput., vol. 6, no. 2, pp. 185–196, 1998.

[20] S. Droste, T. Jansen, and I. Wegener, “On the analysis of the (1+1) evolutionary algorithms,”Theor. Comput. Sci., vol. 276, no. 1/2, pp. 51– 81, Apr. 2002.

[21] J. Garnier and L. Kallel, “Statistical distribution of the convergence time of evolutionary algorithms for long path problems,”IEEE Trans. Evol. Comput., vol. 4, no. 1, pp. 16–30, Apr. 2000.

[22] G. Rudolph, “How mutation and selection solve long-path problems in polynomial expected time,”Evol. Comput., vol. 4, no. 2, pp. 194–205, 1996.

[23] J. He and X. Yao, “From an individual to a population: An analysis of the first hitting time of population-based evolutionary algorithms,”IEEE Trans. Evol. Comput., vol. 6, no. 5, pp. 495–511, Oct. 2002.

[24] J. He, H. Huang, and L. Kang, “The computational time of genetic al-gorithms for fully deceptive problem,”Chin. J. Comput., vol. 21, no. 9, pp. 999–1003, 1999.

[25] J. He and X. Yao, “A study of drift analysis for estimating computation time of evolutionary algorithms,”Nat. Comput., vol. 3, no. 1, pp. 21–35, Mar. 2004.

[26] J. He, X. Yao, and J. Lin, “A comparative study of three evolutionary algorithms incorporating different amounts of domain knowledge for node covering problem,”IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 35, no. 2, pp. 266–271, May 2005.

[27] J. He, C. Reeves, C. Witt, and X. Yao, “A note on problem difficulty measures in black-box optimization: Classification, realizations and pre-dictability,”Evol. Comput., vol. 15, no. 4, pp. 435–443, 2007.

[28] Y. R. Zhou and J. He, “A runtime analysis of evolutionary algorithms for constrained optimization problems,”IEEE Trans. Evol. Comput., vol. 11, no. 5, pp. 608–619, Oct. 2007.

[29] C. Witt, “Runtime analysis of the (u +1)EA on the simple pseudo-Boolean functions,” inProc. 8th Annu. Conf. Genetic Evol. Comput., Seattle, WA, 2006, pp. 651–658.

[30] C. Witt, “Population size vs. runtime of a simple EA,” inProc. Congr. Evol. Comput., Canberra, Australia, 2003, pp. 1996–2003.

[31] C. Witt, “Worse-case and average-case approximations by simple random-ized search heuristics,” inProc. 22nd Annu. Symp. Theor. Aspects Com-put. Sci.. Stuttgart, Germany: Springer-Verlag, 2005, vol. 3404, pp. 44–56.

[32] F. Neumann, “Expected run-times of evolutionary algorithms for the Eulerian cycle problem,” in Proc. Congr. Evol. Comput., 2004, pp. 904–910.

[33] F. Neumann and I. Wegener, “Minimum spanning trees made easier via multi-objective models,”Nat. Comput., vol. 5, no. 3, pp. 305–319, Sep. 2006.

[34] F. Neumann and I. Wegener, “Randomized local search, evolutionary algorithms and the minimum spanning tree problem,” inProc. Genetic Evol. Comput. Conf.. Seattle, WA: Springer-Verlag, 2004, vol. 3102, pp. 713–724.

[35] Y. Yu and Z. H. Zhou, “A new approach to estimating the expected first hitting time of evolutionary algorithms,” inProc. 21st Nat. Conf. Artif. Intell. AAAI, Boston, MA, 2006, pp. 555–556.

[36] G. Reinelt, “TSPLIB—A traveling salesman problem library,” ORSA J. Comput., vol. 3, no. 4, pp. 376–384, 1991.

[37] H. Huang, “A relation-and-ordering-based analysis for convergence and convergence time of evolutionary algorithm,” Ph.D. dissertation, South China Univ. Technol., Guangzhou, China, 2008. Unpublished (in Chinese).

Han Huang received the B.S. degree in applied mathematics and the Ph.D. degree in computer ap-plication techniques from South China University of Technology, Guangzhou, China, in 2003 and 2008, respectively.

Since 2008, he has been with the faculty of the School of Software Engineering, South China Uni-versity of Technology, where he is currently a Lec-turer and the Coach of the university’s team for the Association for Computing Machinery International Collegiate Programming Contest. He has published a number of academic papers on evolutionary computation and its theoretical foundation. His current research interests include ant colony optimization, ge-netic algorithms, evolutionary programming, particle swarm optimization, and their theoretical foundation containing mathematical modeling, convergence proofs, and runtime analysis.

Chun-Guo Wureceived the B.Sc., M.Sc., and Ph.D. degrees from Jilin University, Changchun, China, in 1999, 2002, and 2006, respectively.

From July 2003 to January 2004, he joined an overseas student program with the Institute of High Performance Computing, Singapore. He is currently a Lecturer with the Key Laboratory of Symbol Com-putation and Knowledge Engineering, Ministry of Education, Changchun. He is also with the National Laboratory of Pattern Recognition, Institute of Au-tomation, Chinese Academy of Sciences, Beijing, China. His research interests include theoretical and applied studies on evolu-tionary computation, support vector machines, quantum computing, and neural networks.

Zhi-Feng Haoreceived the B.S. degree in math-ematics from Zhongshan University, Guangzhou, China, in 1990 and the Ph.D. degree in mathematics from Nanjing University, Nanjing, China, in 1995.

In 2001, he was a Visiting Researcher with Rutgers, The State University of New Jersey, Piscataway. Since 1995, he has been with the faculty of the School of Mathematical Sciences, South China University of Technology, Guangzhou, where he was promoted to Associate Professor in 1997 and Full Professor in 2000. He is currently an authorized Ph.D. Supervisor with the College of Computer Science and Engineering, South China University of Technology. He is also with the School of Applied Mathematics, Guangdong University of Technology, Guangzhou. He has pub-lished more than 60 academic papers on applied mathematics, evolutionary computation, support vector machines, and bioinformatics. His current research interests include support vector machines, neural networks, evolutionary com-putation, and bioinformatics.