• No results found

New complexity analysis for primal-dual interior-point methods for self-scaled optimization problems

N/A
N/A
Protected

Academic year: 2020

Share "New complexity analysis for primal-dual interior-point methods for self-scaled optimization problems"

Copied!
22
0
0

Loading.... (view fulltext now)

Full text

(1)

R E S E A R C H

Open Access

New complexity analysis for primal-dual

interior-point methods for self-scaled

optimization problems

Bo Kyung Choi and Gue Myung Lee

*

*Correspondence:

[email protected] Department of Applied Mathematics, Pukyong National University, Busan, 608-737, Korea

Abstract

A linear optimization problem over a symmetric cone, defined in a Euclidean Jordan algebra and called a self-scaled optimization problem (SOP), is considered. We formulate an algorithm for a large-update primal-dual interior-point method (IPM) for the SOP by using a proximity function defined by a new kernel function, and we obtain the best known complexity results of the large-update IPM for the SOP by using the Euclidean Jordan algebra techniques.

MSC: 90C51; 90C25; 65K05

Keywords: Euclidean Jordan algebra; self-scaled optimization problem; primal-dual interior-point methods; kernel function; proximity function; complexity analysis; worst-case iteration bound

1 Introduction and preliminaries

Primal and dual interior-point methods (IPMs) have been well known as the most effec-tive methods for solving wide classes of optimization problems, for example, the linear optimization (LO) problem, the quadratic optimization problem (QOP), the semidefinite optimization (SDO) problem, the second-order cone optimization (SOCO) problem, and the convex optimization problem (CP).

The so-called barrier update parameter θ in algorithms for IPMs plays an important role in both theory and practice of IPMs. Usually, ifθis a constant independent of the di-mension of the problem, then the algorithm is called alarge-updatemethod. If it depends on the dimension, then the algorithm is said to be asmall-updatemethod. Large-update methods are much more efficient than small-update methods in practice [], but have a worst-case iteration bound. Such a gap between theory and practice has been referred to as irony of IPMs []. Recently, many authors have tried to reduce the gap of the worst-case iteration bound between the large-update IPM and the small-update IPM.

Using self-regular proximity functions instead of a classical logarithmic barrier func-tion, Penget al.[–] improved the complexity of large-update IPMs for the LO problem, the SDO problem, and the SOCO problem. Baiet al.[] introduced a new class of eligible kernel functions. The class was defined by some simple conditions on the kernel func-tion and its derivatives. The best iterafunc-tion bound for the LO problem, which was given by Baiet al.[], isO(√nlognlogn). Recently, Wanget al.[] obtained the complexity result

O(nlogn/) for the SDO problem based on a simple kernel function. Bai and Wang []

(2)

obtained the best known complexity result for the SOCO problem based on a parametric kernel function including the classical logarithmic function, the prototype regular kernel function, and the non-self-regular kernel function. Very recently, using the kernel func-tionφ(t) = (t– )/ + (etq–– )/q, Choi and Lee [, ] have obtained the complexity

results of large-update primal-dual IPMs for SDO and SOCO,O(√n(logn)(q+)/qlogn/) andO(√N(logN)(q+)/qlogN/), respectively.

In this paper, we consider a linear optimization problem over a symmetric cone which is defined in a Euclidean Jordan algebra. Nesterov and Todd [] proposed first this kind of an optimization problem under the name of convex programming for self-scaled cones and established the polynomial complexity of the primal-dual interior point method using the so-called NT (Nesterov-Todd) direction []. We call the linear optimization problem over the symmetric cone the self-scaled optimization problem (SOP).

Faybusovich first studied the SOP in view of a Euclidean Jordan algebra and gave a the-oretical background for nondegeneracy assumptions and the uniqueness of solutions for Newton systems in IPMs for the SOP [], presented a short-step path-following algo-rithm for a quadratic programming problem defined on the intersection of a symmet-ric cone with an affine subspace [] and obtained complexity estimates for a long-step primal-dual interior-point algorithm for the optimization problem of the minimization of a linear function on a feasible set obtained as the intersection of an affine subspace and a symmetric cone []. SOPs include linear optimization problems, semidefinite optimiza-tion problems, second-order optimizaoptimiza-tion problems, and various combinaoptimiza-tions of these types of problems as special cases. Schmieta and Alizadeh [] extended primal-dual in-terior point algorithms for LOs, SDOs, and SOCOs to SOPs by using logarithmic barrier functions.

Baes raised an open question in his monograph [] as follows: The theory of self-regular functions has been created for linear programming by Jiming Peng, Cornelius Roos, and Tamás Terlaky []. They subsequently extended it to second-order programming and semidefinite programming separately using implicitly the aforementioned construction. However, the unified treatment of this theory using the Jordan algebraic framework is not accomplished yet.

Choi and Lee [] gave primal-dual interior point algorithms by using a very simple self-regular functionψ(t) =(t–t),t>  for the SOP and gave partial answers for the ques-tion of Baes. Very recently, Vieira [, ] gave complete answers for the open quesques-tion of Baes by proving thee-convexity property of eligible kernel functions and, in particular, he presented the iteration complexity results for ten eligible kernel functions. Among ten kernel functions in [], the best iteration complexity for a large-update method was ob-tained forψ(t) =t–+t–q–q–withq=logr, and its iteration complexity isO(√rlogrlogr), which is the best known one.

In this paper, we define a new eligible kernel functionψ(t) =t–+ep(tq–)

–

pq ,p and

(3)

But we use Proposition . in [] obtained from the technique of Sun and Sun [] instead of using Proposition . in [].

This paper is organized as follows. In Section , we introduce our kernel functions, for-mulate the Newton system for the SOP, and present a useful inequality for our proximity function. In Section , we give an algorithm for the SOP and calculate an upper bound for the proximity function afterμ-update. We calculate an upper bound for difference between proximity functions after one step in inner iterations and then determine our default step size for search directions. We present a worst-case iteration bound for our large-update primal-dual interior point method for the SOP.

Now, we give definitions and preliminary properties for a Euclidean Jordan algebra which are found in [] and will be used in the next sections.

Definition . ([]) A finite-dimensional real vector spaceV is called an algebra if a bilinear mapping (x,y)→xyfromV×VtoV is defined.

An algebraVis called a Jordan algebra if the following hold:

(i) commutativity: for allx,yV,xy=yx;

(ii) Jordan’s axiom: for allx,yV,x(xy) =x(xy), wherex=xx.

A Jordan algebraVis said to be Euclidean if

(iii) x+y= x=y= , equivalently, there exists an inner product(·|·)onV such that(xy|z) = (y|xz).

A Jordan algebraV is simple if it does not contain any non-trivial ideal. The Jordan algebra may not be associative, but it is power-associative,i.e.,xpxq=xp+q. We assume a Jordan algebraV has an identity element,i.e., there existsesuch thatxe=ex=x. SinceV is finite-dimensional, givenxV, there exists a minimal positive integerksuch that the vectorse,x, . . . ,xkare linearly dependent. Denote this integerm(x). We define the

rank ofVas

rank(V) =r=maxm(x)|xV.

An element xV is said to be invertible if there exists an elementy∈R[x] such that xy=e, whereR[x] is the algebra overRof polynomials in one variable with coefficients inR. It is defined byx–. An elementvV is called idempotent ifv=v. For an element xV, letL(x) be a linear map ofVdefined asL(x)y=xy. The cone of squares

:=x|xV

is a symmetric cone; the following conditions hold:

(i) for every pair ofx,y∈int, there is an invertible linear transformationL:VV

such thatL() =andL(x) =y;

(ii) *=, where*:={yV|(x,y),for anyx}.

Let=int. Then={x|xVis invertible}={xV|L(x) is positive definite}.

Definition .([]) Letc, . . . ,ckV. Then{c, . . . ,ck}is said to be a Jordan frame ifci,

(4)

following properties hold:

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

ci =ci,

cicj=  ifi=j,

k i=ci=e.

Theorem . (Theorem III.. in []) For every xV, there exist a Jordan frame

{c(x), . . . ,cr(x)}and real numbersλ(x), . . . ,λr(x)such that

x=λ(x)c(x) +· · ·+λr(x)cr(x). ()

The numbersλi(x), for alli= , . . . ,r, are said to be the eigenvalues ofx, and () is called

the eigenvalue (or spectral) decomposition ofx. Now, it is possible to extend the defini-tion of any real-valued funcdefini-tionψ(·) to elements of the Euclidean Jordan algebra via their eigenvalues:

ψ(x) :=ψ λ(x)

c(x) +· · ·+ψ λr(x)

cr(x). ()

Particularly, we have some examples as follows:

(i) Square root:x/=λ/

 (x)c(x) +· · ·+λ/r (x)cr(x)if allλi(x). (ii) Inverse:x–=λ–

 (x)c(x) +· · ·+λ–r (x)cr(x)if allλi(x)= . (iii) Square:x=λ

(x)c(x) +· · ·+λr(x)cr(x).

From the above examples, we know that forx,λi(x/) =λi/(x) and forx,λi(x–) = λ–i (x). Let us denote byψ(x) the derivative ofψ(x) with respect toλi(x):

ψ(x) :=ψ λ(x)

c(x) +· · ·+ψ λr(x)

cr(x). ()

In the Jordan algebra, we define the determinant ofxand the trace ofxas follows:

det(x) =

r

i=

λi(x), tr(x) = r

i= λi(x).

SinceVis a Euclidean Jordan algebra, x,y:=tr(xy) is a scalar product onV(see Propo-sition III.. in []). The following lemma is called the second Pierce decompoPropo-sition the-orem which will be used in Section .

Lemma .(Theorem IV.. in [], Theorem .. (Second Pierce decomposition

theo-rem) in []) Let{c, . . . ,cr}be a Jordan frame of V.If

Vij:=

⎧ ⎨ ⎩

{vij|civij=vij} if i=j, {vij|civij=vij} ∩ {vij|cjvij=vij} if i=j,

we have

(i) V=ijrVij;

(ii) VijVkl={},if{i,j} ∩ {k,l}=∅; (iii) VijVjkVik,ifi=k;

(5)

Consider the following self-scaled optimization problem (SOP):

(P) Minimize c,x

subject to ai,x=bi, i= , . . . ,m,

x,

and its dual problem:

(D) Maximize

m

i=

biyi

subject to

m

i=

yiai+s=c,

s, y∈Rm,

wherec,a, . . . ,amV andb∈Rm are given. We callxprimal feasible if ai,x=bi

fori= , . . . ,m. Similarly, (y,s)∈Rm× is called dual feasible ifm

i=yiai+s=c. Let

Ax= (a,x, . . . , am,x)T for any xV. Then A:V →Rm is a linear transformation.

Throughout this paper, we assume thatAis surjective. Then its adjointAT is injective andATy=m

i=yiai, wherey= (y, . . . ,ym)T∈Rm. So, we can reformulate (P) and (D) as

follows:

(P) Minimize c,x

subject to Ax=b, x,

and its dual problem:

(D) Maximize bTy

subject to ATy+s=c,

s, y∈Rm.

We can check that weak duality between (P) and (D) holds, that is,inf(P)sup(D). From now on, we assume that both (P) and (D) satisfy the interior-point condition (IPC), that is, there exists (x,y,s) such thatAx=b,x,ATy+s=c,s. Then there exists

a pair of optimal solutions (x,y,s) of (P) and (D), andinf(P) =sup(D) [, ]. The following lemma is well known [, , , ].

Lemma . For x,sV,the following statements are equivalent:

(i) x,sand x,s= ; (ii) x,sandxs= .

Using Lemma ., we can check (see Proposition . in []) that finding a pair of optimal solutions (x,y,s) of (P) and (D) is equivalent to solving the following Newton system:

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

Ax=b,

ATy+s=c,

xs= ,

x,s, y∈Rm.

(6)

The basic idea of primal-dual IPMs is to replace the third equation in (), the so-called complementarity conditionfor the SOP, by the parameterized system with a positive pa-rameterμ:

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

Ax=b,

ATy+s=c,

xs=μe,

x,s, y∈Rm.

()

For eachxV, we define the quadratic representation as follows:

Qx:= L(x) –L x

.

Lemma .([]) Let x,sand p be invertible.Then xs=μe if and only if QpxQp–s=

μe.

Proposition .(Proposition  in []) If x,s,then Qxs.

Letx,s. Then there uniquely existspsuch thatQpx=s[, ]. So, we can choosepsuch thatQpx=Qp–s. Such a choice exists and is unique, and leads to the Nesterov-Todd (NT) method.

From Lemma ., the system () becomes

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

Ax=b,

ATy+s=c,

QpxQp–s=μe,

x,s, y∈Rm.

()

Then, for eachμ> , the parameterized system () has a unique solution (x(μ),y(μ),s(μ)) [, ], which is called a μ-center of (P) and (D). The set ofμ-centers, that is, C=

{(x(μ),y(μ),s(μ))|μ> }, is said to be thecentral pathof (P) and (D). Therefore, asμtends to zero, (x(μ),y(μ),s(μ)) converges to a pair of optimal solutions of (P) and (D) [, ].

In general, IPMs for the SOP consist of two strategies. The first one, which is called the inner iteration scheme, is to keep the iterative sequence in a certain neighborhood of the central path or to keep the iterative sequence in a certain neighborhood of theμ-center. And the second one, called the outer iteration scheme, is to decrease the parameterμto

μ+:= ( –θ)μfor someθ∈(, ).

2 Proximity functions and search directions

Newton’s method is a well-known procedure to solve a system of nonlinear equations. Most IPMs for solving the SOP employ different search directions together with suitable strategies for following the central path appropriately.

Assume that a starting point (x,s) in a certain neighborhood of the central path

(7)

and linearize the Newton system for () by replacingx,y,swithx+:=x+x,y+:=y+y,

s+:=s+s, respectively. Then we get the following system in []:

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

Ax= ,

ATy+s= ,

QpxQp–s+QpxQp–s=μ+eQpxQp–s.

()

To describe our new search direction, we need more notations:

¯

A:=

μAQp–, v:=

μQpx=

μQp–s,

dx:=

μQpx, ds:=

μQp–s.

()

In this case,

p=Qx/(Qx/s)–/

–/

=Qs–/(Qs/x)/

–/

. ()

From Proposition .,v. Hence,L(v) is positive definite. Thus, the system () is equiv-alent to the following system:

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

¯

A dx= ,

¯

ATy+ds= ,

dx+ds=v–v.

()

We say that the above (dx,y,ds) is called the NT search direction for the SOP. Further-more, dx,ds= , which is coming from the first and second equations of () or from the orthogonality ofxands.

For our IPM, we use the following new eligible kernel function:

ψ(t) =t

– 

 +

ep(tq–)– 

pq , p andq fort> . ()

Please see the definition of an eligible function in []. The new kernel function () sat-isfies

ψ(t) > , ψ(t) <  and lim

t→+ψ(t) =tlim→∞ψ(t) =∞.

Note thatψ() =ψ() = . Thenψ(t) is determined:

ψ(t) =

t

ξ

ψ(ζ)dζdξ. ()

The proximity function (measure) for (P) and (D) is

(x,s;μ) :=(v) :=tr ψ(v)=

r

i=

ψ λi(v)

(8)

whereψ(v) is defined by (). Note that(v) = , ifv=e(i.e.,xs=μe) and(v) > , otherwise. Replacing the right-hand side of the last equation in () by –ψ(v), we have the following system from ():

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

¯

A dx= ,

¯

ATy+ds= ,

dx+ds= –ψ(v).

()

LetX={xV| ¯Ax= }. ThenX⊥={ ¯ATy|yRm}. Hence, the system () has a unique

solution.We introduce the norm-based proximity measure as follows:

σ:=dx+ds=ψ(v)=dx+ds. ()

The following lemma gives a lower bound ofσin terms of(v).

Lemma . For any v,

σ(v).

Proof Since () satisfies ψ(t)(ψ(t))andσ=r

i=(ψ(λi(v))),

(v)σ.

This completes the proof.

Also, our new kernel function () satisfies the following exponential convexity property.

Lemma . Let t> and t> .Then

ψ(√tt)

ψ(t) +ψ(t)

.

The following proposition can be found in [], but for the completeness, we give its proof.

Proposition .(Theorem . in []) Let be the proximity function defined in(), then for any x,s,

(Qx/s)/

(x) +(s)

.

Proof SinceQx/s,

λi (Qx/s)/

=λ/i (Qx/s) and (Qx/s)/

=

r

i=

ψ λ/i (Qx/s)

.

By Theorem . in [],

k

i=

λi(Qx/s)

k

i=

(9)

and

r

i=

λi(Qx/s) =

r

i=

λi(x)λi(s).

Thus,

k

i=

λ/i (Qx/s)

k

i=

λ/i (x)λi/(s), fork= , . . . ,r– ,

and

r

i=

λ/i (Qx/s) =

r

i=

λ/i (x)λ/i (s).

Letαi=λ/i (Qx/s) andβi=λ/i (x)λ/i (s). Thenαi>  andβi> . Moreover, since these

conditions satisfy the assumptions of Corollary .. in [] and (iii) in Corollary .. in [] with our kernel function (),

r

i=

ψ λ/i (Qx/s)

r

i=

ψ λ/i (x)λ/i (s).

By Lemma ., we obtain the following result:

r

i=

ψ λ/i (x)λ/i (s)  

r

i=

ψ λi(x)

+

r

i=

ψ λi(s)

=

(x) +(s)

.

3 Algorithm and its complexity analysis

Now, we explain our algorithm for the large-update primal-dual IPM for the SOP. Assum-ing that a startAssum-ing point in a certain neighborhood of the central path is available, we can set out from this point. Then, we will go to the outer ‘while loop’. Ifμsatisfies, then it is reduced by the factor  –θ, whereθ∈(, ). Then, we make use of the inner ‘while loop’, and we repeat the procedure until we find iterates that are ‘close’ to (x(μ),y(μ),s(μ)), that is, the proximity(x,s;μ) <τ. Here, we apply Newton’s method targeting at the new

μ-centers to decide a search direction (x,y,s). We return to the outer ‘while loop’. The whole process is repeated untilμis small enough, say until<.

The choice of the step sizeαis another crucial issue in the analysis of the algorithm. It has to be taken so that the closeness of the iterates to the currentμ-center can improve by a sufficient amount. In the algorithm, the inner ‘while loop’ is called theinner iteration and the outer ‘while loop’ is called theouter iteration. Each outer iteration consists of an update of the parameterμand a sequence of (one or more) inner iterations. The total number of inner iterations is the worst-case iteration bound for our algorithm.

The algorithm for our large-update primal-dual IPM for the SOP is given as follows: Primal-dual algorithm for the SOP

Inputs

(10)

a variable damping factorα;

a fixed barrier update parameterθ∈(, ); (x,s) andμ=  such that(x,s;μ)τ.

begin

x:=x;s:=s;μ:=μ;

whiledo

begin

μ:= ( –θ)μ;

while(x,s;μ)τ do begin

Solve the system () forx,y,s; Determine a step sizeα;

x:=x+αx; y:=y+αy; s:=s+αs; end

end end

3.1 Bound of the proximity function after

μ

-update

We have(v)τ before the update ofμwith the factor  –θ at the start of each outer iteration. After updatingμin an outer iteration, the vectorvis divided by the factor√ –θ, which in general leads to an increase in the value of(v). Then during the inner iteration, the value of(v) decreases until it passes the thresholdτ.

As we mentioned, our kernel function () is eligible. To obtain an upper bound for a

μ-updated proximity function in each outer iteration in the algorithm, we use the well-known Lemma ., which can be induced from the decreasing part of the kernel function, instead of using theorems which can be obtained from some properties for eligible func-tions (for example, Theorem . in [] and Theorem . in []). Both of the following lemmas make our analysis in the outer while loop easy. And we will show a theorem that an upper bound for(√

–θv) is expressed with(v) by using the following two lemmas.

Lemma . Letβ.Then

ψ(βt)ψ(t) +(β

– )

t

.

Proof Defineψb(t) := e

p(tq–)–

pq . Thenψb(t) is monotonically decreasing int. So, we can

easily obtain

ψ(βt) = β

t– 

 +ψb(βt) = t– 

 +ψb(t) +

βt–t

 +ψb(βt) –ψb(t)

ψ(t) +(β

– )

t

.

Lemma . For any v,then

(11)

Proof Since ep(tq–)

pq is positive andpq, the kernel function () has a lower bound as

follows:

ψ(t)t

– 

 –

pq

t

 –  – .

This implies ri=λi(v)(v) + r.

Theorem . Letθbe such that <θ< .Then,for any v,

 –θv

 –θ (v) +r

.

Proof From Lemma . withβ=√

–θ and Lemma .,

 –θv

=

r

i= ψ

 –θλi(v)

(v) + 

  –θ – 

v

(v) + θ

 –θ (v) + r

 –θ (v) +r

,

the last inequality comes fromθ∈(, ).

By the assumption(v)τjust before the update ofμ,

 –θv

 –θ(τ+r).

We define

L(r,θ,τ) = 

 –θ(τ+r).

Sinceτ=O(r) andθ=(),

L=O(r).

3.2 Determining a default step size

In this section, we compute the feasible step sizeα such that the proximity function is decreasing and is bound for the decrease during inner iterations; then we give our default step sizeα¯;α¯ = (( + σ( +pq+q)( +p–logσ)(q+)/q))–. We will show that the step

size not only keeps the iterates feasible but also gives rise to a sufficiently large decrease in the barrier function(v) in each inner iteration. Let us denote the difference between the proximity before and after one step by a function of the step size, that is,

f(α) :=(v+) –(v). ()

The main task in the rest of this section is to study the decreasing behavior off(α). Now, in equation (),v+andp+are determined byx,sin () and () replaced byx+:=

x+αx,s+:=s+αs, respectively, which is as follows:

v+:=

μQp+(x+αx) = 

μQp–

(12)

Lemma .(Proposition II.. in []) Let x and s be elements in V.Then

(i) (Qxs)–=Qx–s–ifxandsare invertible.

(ii) QQsx=QsQxQs.

Lemma .([]) Let x,s,p.Then

(i) Qx/sandQs/xhave the same eigenvalues.

(ii) Qx/sandQ(Qpx)/(Qp–s)have the same eigenvalues.

The following proposition was given by Vieira in [] (see Proposition . in []), but we provide its proof using Lemma . and Lemma ..

Proposition . Letbe the proximity function defined in().Then we have

(v+) =

Q(v+αdx)/(v+αds)

/

.

Proof FromQp–s=Qpxand (i) in Lemma ., we know thatQpxandQx/p have the same eigenvalues. By the definition ofpand (ii) in Lemma .,

Qx/p=Qx/ Q/x (Qx/s)–/

–

=Qx/Qx–/(Qx/s)/= (Qx/s)/.

Then we can find Qp–

+ s+ and (Qx/+ s+)

/ have the same eigenvalues. Here,μv +=

Qp–

+s+. We know thatx+=

μ

Qp–(v+αdx) ands+=√μQp(v+αds), by the definition () and by (ii) in Lemma ., then (Qx/

+ s+)

/=μ(Q(Q

p–(v+αdx))/(Qp(v+αds)))/and

μ(Q(v+αdx)/(v+αds))/have the same eigenvalues. Therefore, the proximity function

satisfies the equality.

Then Proposition . and Proposition . imply the following inequality:

(v+)

(v+αdx) + 

(v+αds).

So, we can definef(α):

f(α)f(α) :=

(v+αdx) +(v+αds)

(v).

To facilitate the forthcoming analysis, we also define, for anyxV,

λmin(x) :=min

λi(x)|i= , . . . ,r

.

The following lemma is obtained from Lemma  in [] so that we can get the common lower bound of eigenvalues ofv+αdxandv+αds, whereα satisfiesv+αdxand v+αds.

Lemma . For anyα∈(,λmin(v)

σ ),

λmin(v+αdx)λmin(v) –ασ and λmin(v+αds)λmin(v) –ασ,

(13)

Proof Letαbe a fixed number in (,λmin(v)

σ ). From Lemma  in [],

λmin(v+αdx)λmin(v) –αdx.

Sinceσdx, we have

λmin(v+αdx)λmin(v) –ασ.

Similarly, we obtain

λmin(v+αds)λmin(v) –ασ.

The proof of the following proposition can be found in [], but for the completeness, we give its detailed proof.

Proposition .([]) Suppose that the functionsψ(x)and(x)are defined by()and (),respectively.Then,for anyα∈(,λmin(v)

σ ),

d dαf(α) =

 tr ψ

(v+αdx)dx+ tr ψ

(v+αds)ds,

d

f(α)

max

ψ λi(v+αdx),λj(v+αdx)

|i,j= , . . . ,rdx

+

max

ψ λi(v+αds),λj(v+αds)

|i,j= , . . . ,rds,

where

ψ λi(·),λj(·)

=

⎧ ⎨ ⎩

ψ(λi(·)) ifλi(·) =λj(·),

ψ(λi(·))–ψ(λj(·))

λi(·)–λj(·) ifλi(·)=λj(·).

Proof Using Lemma . in [], we have

d

dαψ(v+αdx)

=

r

i=

ψ λi(v+αdx),λi(v+αdx)

ci(v+αdx),dx

ci(v+αdx)

+

j<lr

ψ λj(v+αdx),λl(v+αdx)

×cj(v+αdx)◦ cl(v+αdx)◦dx

. ()

Then we have

d

tr ψ(v+αdx)

= d

ψ(v+αdx),e

=

d

dαψ(v+αdx),e

=tr

d

dαψ(v+αdx)

(14)

=tr

r

i=

ψ λi(v+αdx)

ci(v+αdx),dx

ci(v+αdx)

(by associativity of trace)

=

r

i=

ψ λi(v+αdx)

ci(v+αdx),dx

trci(v+αdx)

=

r

i=

ψ λi(v+αdx)

ci(v+αdx),dx

tr ci(v+αdx)

.

Then from Baes [, ] we know thattr(ci(v+αdx)) = , and hence, from the definition

(), we get

d

tr ψ(v+αdx)

=tr ψ(v+αdx)◦dx.

Thus, we have

d dαf(α) =

 tr ψ

(v+αdx)dx+ tr ψ

(v+αds)ds.

So, the first equality holds.

For the second inequality, we will use () by replacingψ byψ.

d

tr ψ(v+αdx)

= d

tr ψ

(v+αdx)dx

=tr

d dαψ

(v+αdx)dx

=tr

r

i=

ψ λi(v+αdx),λi(v+αdx)

ci(v+αdx),dx

ci(v+αdx)

+

j<lr

ψ λj(v+αdx),λl(v+αdx)

×cj(v+αdx)◦ cl(v+αdx)◦dx

dx

.

Here, letdx=rj=λj(dx)cj(dx). Then we have

r

i=

tr ci(v+αdx)◦dx

=

r

i=

tr

r

j=

λj(dx)ci(v+αdx)◦cj(dx)

=

r

i=

r

j=

λj(dx)tr ci(v+αdx)◦cj(dx)

(15)

Sinceci(v+αdx) andcj(dx) are inwhich is a self-dual cone, then

tr ci(v+αdx)◦cj(dx)

, fori,j= , . . . ,r.

Furthermore,rj=tr(ci(v+αdx)◦cj(dx)) =tr(ci(v+αdx)) = . Then we have

r

i=

tr ci(v+αdx)◦dx

 = r i= r j=

tr ci(v+αdx)◦cj(dx)

λj(dx)

r i= r j=

λj(dx)tr ci(v+αdx)◦cj(dx)

= r j=

λj(dx)

r

i=

tr ci(v+αdx)◦cj(dx)

.

Sinceri=tr(ci(v+αdx)◦cj(dx)) =tr(cj(dx)) = , we have

r

i=

tr ci(v+αdx)◦dx

dx. ()

Now, we decomposedx along Lemma . such asdx=ikrdxik for the system of

idempotent{c(v+αdx), . . . ,cr(v+αdx)}. Then, forj<l,

tr cj(v+αdx)◦dx

cl(v+αdx)◦dx

=tr

cj(v+αdx)◦

ikr

dxik

cl(v+αdx)◦

ikr

dxik

=tr

cj(v+αdx)◦ r i= dxji

cl(v+αdx)◦ r

i=

dxli

by (ii) in Lemma .

=tr

dxjldxlj

by (iii) and (iv) in Lemma .

= tr dx

jl

.

This means, for eachj<l,

tr cj(v+αdx)◦dx

cl(v+αdx)◦dx

.

Moreover, we have,

j<lr

tr cj(v+αdx)◦dx

cl(v+αdx)◦dx

j<lr

tr cj(v+αdx)◦dx

cl(v+αdx)◦dx

+  r i=

tr ci(v+αdx)◦dx

(16)

= tr

r

i=

ci(v+αdx)◦dx

= tr dx

= dx

. ()

Since for eachi, (tr(ci(v+αdx)◦dx))are nonnegative and for eachj,lwithj<l,tr((cj(v+ αdx)◦dx)◦(cl(v+αdx)◦dx)) are nonnegative, we get from () and ()

d

tr ψ(v+αdx)

maxψ λi(v+αdx),λj(v+αdx)

|i,j= , . . . ,rdx.

Similarly,

d

tr ψ(v+αds)

maxψ λi(v+αds),λj(v+αds)

|i,j= , . . . ,rds.

From the definition off(α),

d

f(α) =

d

tr ψ(v+αdx)

+

tr ψ(v+αds)

.

Thus, we have the conclusion.

The next result presents an upper bound for the second derivative off(α) which is usable

for establishing the polynomial complexity of the algorithm.

Proposition . For anyα∈(,λmin(v)

σ ),

f(α) ψ

λ

min(v) –ασ

σ.

Proof Sinceψ(t) is a decreasing function ont∈(,∞), using Lemma . and the mean value theorem, we have

ψ λmin(v) –ασ

maxψ λi(v+αdx),λj(v+αdx)

|i,j= , . . . ,r

and

ψ λmin(v) –ασ

maxψ λi(v+αds),λj(v+αds)

|i,j= , . . . ,r.

Thus, by Proposition .,

d

f(α)

 ψ

λ

min(v) –ασ

dx+ ψ

λ

min(v) –ασ

ds

= 

ψ

λ

min(v) –ασ

(17)

We can easily check that f() =  andf() = –

σ

 . By Proposition ., we obtain an

upper boundf(α) forf(α) as follows:

f(α) =f() +f()α+

α

ξ

f(ζ)dζdξ

f(α) :=f() +f()α+

 σ

α

ξ

ψ λmin(v) –ασ

dζdξ.

Note thatf() = . Furthermore, sincef(α) = –σ

 + σ

 (ψ(λmin(v)) –ψ(λmin(v) –ασ)), we

havef() = –σ

 which is the same value off(), andf(α) =σ

ψ(λmin(v) –ασ) which

is increasing onα∈[,λmin(v)

σ ). Usingf() =f() andf(α)f(α), we can easily check

that

f(α) =f() +

α

f(ξ)dξf(α).

This relation gives that

f(α), iff(α).

To compute the feasible step sizeαsuch that the proximity measure is decreasing when we take a new iterate for fixedμ, we want to calculate the step sizeαwhich satisfies that f(α) holds withαas large as possible. Sincef(α) > , that is,f(α) is monotonically increasing atα, the largest possible value atαsatisfyingf(α) occurs whenf(α) = , that is,

ψ λmin(v) –ασ

+ψ λmin(v)

=σ

. ()

Sinceψ(t) is monotonically decreasing, the derivative of the left-hand side in () with respect toλmin(v) is

ψ λmin(v) –ασ

+ψ λmin(v)

< .

So, the left-hand side in () is decreasing atλmin(v). This implies that ifλmin(v) becomes

smaller, thenαgets smaller with fixedσ. Note that

σ=

n

i=

ψ λi(v)

ψ λmin(v)–ψ λmin(v)

and the equality is true if and only ifλmin(v) is the only coordinate in (λ(v), . . . ,λr(v)) which

is different from  andλmin(v) < , that is,ψ(λmin(v)) < . Hence, the worse situation for

the largest step size occurs whenλmin(v) satisfies

ψ λmin(v)

=σ. ()

(18)

From now on, we denote thatρ: [,∞)→(, ] is the inverse function of the restriction of –ψ(t) in the interval (, ]. Then () implies

λmin(v) =ρ(σ). ()

By using () and (), we immediately obtain

ψ λmin(v) –ασ

=

σ.

By the definition ofρand (), the largest step sizeαof the worse case is given as follows:

α*=ρ(σ) –ρ(

 σ)

σ . ()

For the purpose of finding an upper bound off(α), we need a default step sizeα¯that is the lower bound of theα*and consists ofσ.

Lemma . Letσ.Then,for <(σ),

ψ(t) + σ( +pq+q)

 + plogσ

q+

q .

Proof Fromψ(t) =ttq–·ep(tq–), let –ψ

b(t) =tq–·ep(t

q–)

and letρ: [,∞)→(, ] denote the inverse function of the restriction of –ψb(t) to the interval (, ]. Letρ(

σ) =˜t.

Then  <t˜ and 

σ= –ψ(t˜) = –t˜–ψbt). So, –ψbt) =t˜+σ + σσ. Sinceρis

a decreasing function, (ρ(σ) =˜t=)ρ(–ψbt))ρ(σ). Letρ(σ) =ˆt. Then

σ= –ψbt) = ρ(σ)–q–·ep((ρ(σ))–q–) ()

implies

ep((ρ(σ))–q–)= σ ρ(σ)q+σp ρ(σ)–q– logσ

ρ(σ)

 + plogσ

–q

, ()

ψt) =  + (q+ )ˆtq+pqˆtq–·eptq–)· ˆtq–=  + (q+ )ˆtq+pqψbt)· ˆtq–

=  + (q+ )ˆtq+pq·σ· ρ(σ)–q– + σ( +pq+q)

 + plogσ

q+

q ,

the last inequality comes fromˆt∈(, ] and ().

Now, we present a lower bound of the value ofα*.

Theorem . Letα*be as defined in().Then

α* 

(19)

Proof Since –ψ(ρ(σ)) =σ, taking the derivative ofσat both sides, we get

ρ(σ) = – 

ψ(ρ(σ)).

Moreover, we have

α*= 

σ

σ

 σ

ρ(ξ)= 

σ

 σ

σ

ψ(ρ(ξ))

σ ξ

ψ(ρ(σ))

! σ

σ

= 

ψ(ρ(σ)),

where the inequality follows fromσξσandρandψare monotonically decreasing.

Also, by Lemma ., we can complete the proof.

For usingα¯ as the default step size in the algorithm for the SOP, define theα¯as follows:

¯

α= 

( + σ( +pq+q)( +plogσ)q+q )

. ()

We will useα¯as the default step size in our algorithm.

3.3 Decrease of the proximity function during an inner iteration

Now, we show that our proximity functionwith our default step sizeα¯is decreasing. It can be easily established by using the following result.

Lemma .([]) Let h(t)be a twice differentiable convex function with h() = ,h() <  and let h(t)attain its(global)minimum at t*> .If h(t)is increasing for t[,t*],then

h(t)th ()

 , tt

*.

Sincef(α) satisfies assumptions of the above lemma,

f(α)f(α)f(α)

f()

α for all αα

*.

Sincef() = –σ, we can obtain the upper bound for the decreasing value of the proximity in the inner iteration by Lemma ..

Theorem . Letα¯ be the default step size as defined in().Then we have

f(α¯)– ·

 + √( +pq+q)( +plog√)q+q .

Proof Sincef() = –σ

 andα¯∈[,α*], we have

f(α¯)  α¯f

() =

 ·

( + σ( +pq+q)( +plogσ)q +

q )

·

σ

= –  ·

σ

 + σ( +pq+q)( +plogσ) q+

(20)

This expresses the decrease in one inner iteration in terms ofσ. Since the decrease de-pends monotonically onσ, we can express the decrease in terms of=(v) by Lemma . as follows:

f(α¯)– ·

 + √( +pq+q)( +plog√) q+

q

– ·

·√

+ √( +pq+q)( + plog

)

q+

q

= – ·

 + √( +pq+q)( +plog√)

q+

q ,

where the inequality follows fromτ. The theorem is satisfied.

3.4 Iteration bound

We need to count how many inner iterations are required to return to the situation where

(v)τ after aμ-update. We denote the value of(v) afterμ-update as; the

subse-quent values in the same outer iteration are denoted ask,k= , . . . . IfKdenotes the total

number of inner iterations in the outer iteration, then we have

L(r,θ,τ) =O(r), K–>τ, 

and according to Theorem .,

k+k

 + √( +pq+q)( +plog√)

q+

q

 

k.

At this stage, we invoke Lemma  in [].

Lemma .([]) Let t,t, . . . ,tKbe a sequence of positive numbers such that

tk+tkβt–

γ

k , k= , , . . . ,K– ,

whereβ> and <γ .Then

K t

γ

βγ.

Lettingtk=k, β= 

+√(+pq+q)(+plog√)

q+

q

andγ =, we can get the following

lemma from Lemma ..

Lemma . Let K be the total number of inner iterations in the outer iteration.Then we have

K

 + √( +pq+q)

 + plog

q+

q

/,

(21)

Now, we estimate the total number of iterations of our algorithm.

Theorem . Ifτand <θ< ,the total number of iterations is not more than

"

 + √( +pq+q)

 + plog

q+

q

/

#"

θlog

r

#

.

Proof In the algorithm,,μk:= ( –θ)andμ= . By simple computation, we

have

k

θ log

r

.

Therefore, the number of outer iterations is bounded above by

θ log

r

.

Multiplication of this result by the number in the above lemma satisfies the theorem.

Since/=O(√r), if we takep=O(logr) andq= , then we can get the best known upper bound for the total number of inner iterations in the outer iteration is

O(√rlogr).

Also, we take forθ a constant (not depending on r), namely θ =(). Withτ =O(r), the best complexity of the primal-dual interior-point method for a linear optimization problem based on our new proximity function withp=lograndq=  is given by

O

rlogrlogr

.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The authors, together discussed and solved the problems in the manuscript. All authors read and approved the final manuscript.

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST) (No. 2012-0006236).

Received: 30 June 2012 Accepted: 7 November 2012 Published: 26 November 2012 References

1. Andersen, ED, Gondzio, J, Mészáros, C, Xu, X: Implementation of interior point methods for large scale linear programming. In: Terlaky, T (eds.) Interior Point Methods of Mathematical Programming, pp. 189-252. Kluwer Academic, Dordrecht (1996)

2. Renegar, J: A Mathematical View of Interior-Point Methods in Convex Optimization. MPS/SIAM Ser. Optim. SIAM, Philadelphia (2001)

3. Peng, J, Roos, C, Terlaky, T: Primal-dual interior-point methods for second-order conic optimization based on self-regular proximities. SIAM J. Optim.13, 179-203 (2002)

(22)

5. Peng, J, Roos, C, Terlaky, T: Self-Regularity: A New Paradigm for Primal-Dual Interior-Point Algorithms. Princeton University Press, Princeton (2002)

6. Bai, YQ, Ghami, ME, Roos, C: A comparative study of kernel functions for primal-dual interior-point algorithms in linear optimization. SIAM J. Optim.15, 101-128 (2004)

7. Wang, GQ, Bai, YQ, Roos, C: Primal-dual interior-point algorithms for semidefinite optimization based on a simple kernel function. J. Math. Model. Algorithms4, 409-433 (2005)

8. Bai, YQ, Wang, GQ: Primal-dual interior-point algorithms for second-order cone optimization based on a new parametric kernel function. Acta Math. Sin. Engl. Ser.23, 2027-2042 (2007)

9. Choi, BK, Lee, GM: On complexity analysis of the primal-dual interior-point methods for semidefinite optimization problem based on a new proximity function. Nonlinear Anal., Theory Methods Appl.71, e2628-e2640 (2009) 10. Choi, BK, Lee, GM: On complexity analysis of the primal-dual interior-point method for second-order cone

optimization problem. J. Korean Soc. Ind. Appl. Math.14, 93-111 (2010)

11. Nesterov, YE, Tood, M: Primal-dual interior-point methods for self-scaled cones. SIAM J. Optim.8, 324-364 (1998) 12. Muramatsu, M: On a commutative class of search directions for linear programming over symmetric cones. J. Optim.

Theory Appl.112, 595-625 (2002)

13. Faybusovich, L: Linear systems in Jordan algebras and primal-dual interior-point algorithms. J. Comput. Appl. Math.

86, 149-175 (1997)

14. Faybusovich, L: Euclidean Jordan and interior-point algorithms. Positivity1, 331-357 (1997)

15. Faybusovich, L, Arana, R: A long-step primal-dual algorithm for symmetric programming problems. Syst. Control Lett.

43, 3-7 (2001)

16. Schmieta, S, Alizadeh, F: Extensions of primal-dual interior-point algorithms to symmetric cones. Math. Program.96, 409-438 (2003)

17. Baes, M: Optimization Methods for Convex Symmetric Problems. Monograph, April (2007)

18. Choi, BK, Lee, GM: Complexity analysis for primal-dual interior-point methods for self-scaled optimization problems (submitted)

19. Vieira, MVC: Jordan algebraic approach to symmetric optimization. Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, The Netherlands (2007)

20. Vieira, MVC: Interior-point methods based on kernel functions for symmetric optimization. Optim. Methods Softw.

27, 513-537 (2011)

21. Sun, D, Sun, J: Lowner’s operator and spectral functions in Euclidean Jordan algebras. Math. Oper. Res.33, 421-445 (2008)

22. Faraut, J, Korányi, A: Analysis on Symmetric Cones. Oxford University Press, London (1994)

23. Nesterov, YE, Nemirovskii, A: Interior Point Polynomial Algorithms in Convex Programming. SIAM Stud. Appl. Math., vol. 13. SIAM, Philadelphia (1994)

24. Gowda, MS, Szajder, R, Tao, J: SomeP-properties for linear transformations on Euclidean Jordan algebras. Linear Algebra Appl.393, 203-232 (2004)

25. Faybusovich, L: A Jordan-algebraic approach to potential-reduction algorithms. Math. Z.239, 117-129 (2002) 26. Lim, Y: Applications of geometric means on symmetric cones. Math. Ann.319, 457-468 (2001)

27. Tunçel, L: Potential reduction and primal-dual methods. In: Wolkowicz, H, Saigal, R, Vandenberghe, L (eds.) Handbook of Semidefinite Programming Theory, Algorithms and Applications, pp. 235-265. Kluwer Academic, Boston (2000) 28. Alizadeh, F, Schmieta, S: Symmetric cones, potential reduction methods and word-by-word extensions. In:

Wolkowicz, H, Saigal, R, Vandenberghe, L (eds.) Handbook of Semidefinite Programming, Theory, Algorithms and Applications, pp. 195-233. Kluwer Academic, Boston (2000)

29. Horn, RA, Johnson, CR: Topics in Matrix Analysis. Cambridge University Press, Cambridge (1991)

30. Baes, M: Convexity and differentiability properties of spectral functions and spectral mappings on Euclidean Jordan algebras. Linear Algebra Appl.422, 664-700 (2007)

doi:10.1186/1687-1812-2012-213

References

Related documents

The following are either trademarks or registered trademarks of MicroStrategy Incorporated in the United States and certain other countries: MicroStrategy, MicroStrategy

Derivatives in foreign exchange and interest rate derivatives were supported by central banks because many thought their evolution would enhance monetary policy instruments, as

In this paper we propose the integration of cross-cuttings standardized control access policies (based on RBAC model and Oasis XACML) into legacy business processes, using a

The works in this document illustrate the difficulties in implementing measures based on the Anglo American model to improve corporate governance in four Asian countries;

The search of the Pareto set (or its connected parts) is performed by a complex, multi-deme HMS with a particular type of rank selection, supplemented with the rank modification

The subject of environmental protection was restricted to the areas of renewable energies/efficient use of energy, drinking water supply and wastewater disposal.. Procedure

Edison Medical use an advanced and thorough implant cleaning process for all dental implants to ensure surface structure and surface chemical composition and its purity grade..

We chose tests whose targeted cognitive functions (in parentheses) are hypothetically related to violent behavior: (i) Continuous Performance Test-HQ (CPT-HQ) ( Nuechterlein and