On the convergence of high order Ehrlich type iterative methods for approximating all zeros of a polynomial simultaneously

(1)

R E S E A R C H

Open Access

On the convergence of high-order

Ehrlich-type iterative methods for

approximating all zeros of a polynomial

simultaneously

Petko D Proinov

*

_{and Maria T Vasileva}

*_{Correspondence:}

[email protected] Faculty of Mathematics and Informatics, University of Plovdiv, Plovdiv, 4000, Bulgaria

Abstract

We study a family of high-order Ehrlich-type methods for approximating all zeros of a polynomial simultaneously. Let us denote byT(1)_{the famous Ehrlich method (1967).}

Starting fromT(1)_{, Kjurkchiev and Andreev (1987) have introduced recursively a}

sequence (T(N)₎∞

N=1of iterative methods for simultaneous ﬁnding polynomial zeros. For

givenN≥1, the Ehrlich-type methodT(N)_{has the order of convergence 2N}_{+ 1. In this}

paper, we establish two new local convergence theorems as well as a semilocal convergence theorem (under computationally verifiable initial conditions and with ana posteriorierror estimate) for the Ehrlich-type methodsT(N). Our first local convergence theorem generalizes a result of Proinov (2015) and improves the result of Kjurkchiev and Andreev (1987). The second local convergence theorem generalizes another recent result of Proinov (2015), but only in the case of the maximum norm. Our semilocal convergence theorem is the first result in this direction.

MSC: Primary 65H04; secondary 12Y05

Keywords: simultaneous methods; Ehrlich method; polynomial zeros; accelerated convergence; local convergence; semilocal convergence; error estimates

1 Introduction

Throughout this paper (K,| · |) denotes an algebraically closed ﬁeld andK[z] denotes the ring of polynomials (in one variable) overK. For a given vectorxinKn,xialways denotes theith coordinate ofx. In particular, ifF is a map with values inKn_{, then}_F

i(x) denotes theith coordinate of the vectorF(x). We endow the vector spaceKnwith the normxp deﬁned as usual:

xp= _n

i= |xi|p

/p

for ≤p<∞; x∞=max|x|, . . . ,|xn|

. (.)

We endowRn_{with the coordinate-wise ordering}_{deﬁned by}

xy if and only if xi≤yi for eachi= , . . . ,n. (.)

(2)

Then (Rn_,_·

p) is a solid vector space. We endowKnwith the cone norm · :Kn→Rn deﬁned by

x=|x|, . . . ,|xn|

.

Then (Kn_,_·_,_{) is a cone normed space over}_Rn_(see,_e.g._{, Proinov []).}

Letf ∈K[z] be a polynomial of degreen≥. A vectorξ∈Knis said to be aroot vector off if

f(z) =a

n

i=

(z–ξi) for allz∈K, (.)

wherea∈K. Obviously,fhas a root vector inKnif and only if it splits inK. We denote by

sep(f) theseparation numberoff which is deﬁned to be the minimum distance between two distinct zeros off, that is,

sep(f) =min|ξ–η|:f(ξ) =f(η),ξ=η. (.) 1.1 The Weierstrass method and Weierstrass correction

In the literature, there are a lot of iterative methods for ﬁnding all zeros off simultaneously (see,e.g., the monographs of Sendovet al.[], Kyurkchiev [], McNamee [] and Petković [] and references given therein). In , Weierstrass [] published his famous iterative method for simultaneous computation of all zeros off. TheWeierstrass methodis deﬁned by the following iteration:

x(k+)₌_x(k)_–_W

f

x(k)_, _k_{= , , , . . . ,} _(.)

where the operatorWf:D⊂Kn→Knis deﬁned by

Wf(x) =

W(x), . . . ,Wn(x)

with Wi(x) =

f(xi)

a_j₌_i(xi–xj)

(i= , . . . ,n), (.)

wherea∈Kis the leading coeﬃcient off and the domainDofW is the set of all vec-tors inKn_{with distinct components. The Weierstrass method (.) has second order of} convergence provided that all zeros off are simple. The operatorWfis calledWeierstrass

correction. We should note thatWfplays an important role in many semilocal convergence theorems for simultaneous methods.

1.2 The Ehrlich method

Another famous iterative method for finding simultaneously all zeros of a polynomialf was introduced by Ehrlich [] in . The Ehrlich methodis defined by the following fixed point iteration:

x(k+)=Tx(k), k= , , , . . . , (.)

where the operatorT:D⊂Kn_→_Kn_{is deﬁned by}_T₍_x_{) = (}_T

(x), . . . ,Tn(x)) with

Ti(x) =xi–

f(xi)

f(xi) –f(xi)

j=ixi–xj

(3)

and the domain ofTis the set

D=

x∈D:f(xi) –f(xi)

j=i  xi–xj

=  fori∈In

. (.)

Here and throughout the paper, we denote by In the set of indices , . . . ,n, that is,

In={, . . . ,n}. The Ehrlich method has third order of convergence if all zeros off are sim-ple. The Ehrlich method was rediscovered by Abert [] in . In , Börsch-Supan [] introduced another third-order method for numerical computation of all zeros of a poly-nomial simultaneously. In , Werner [] has proved that the both methods are iden-tical. The Ehrlich method (.) is known also as ‘Ehrlich-Abert method’, ‘Börsch-Supan method’, and ‘Abert method’.

Recently, Proinov [] obtained two local convergence theorems for Ehrlich method un-der diﬀerent types of initial conditions. The ﬁrst one generalizes and improves the results of Kyurkchiev and Tashev [, ] and Wang and Zhao [], Theorem .. The second one generalizes and improves the results of Wang and Zhao [], Theorem . and Tilli [], Theorem ..

Before we state the two results of [], we need some notations which will be used throughout the paper. For given vectorsx∈Kn_and_y_∈_Rn_{, we deﬁne in}_Rn_{the vector}

x y=

_|

x| y

, . . . ,|xn| yn

,

provided thatyhas no zero components. Givenpsuch that ≤p≤ ∞, we always denote byqthe conjugate exponent ofp,i.e. qis deﬁned by means of

≤q≤ ∞ and /p+ /q= .

In the sequel, we use the functiond:Kn_→_Rn_{deﬁned by}_d₍_x_{) = (}_d

(x), . . . ,dn(x)) with

di(x) =min

j=i |xi–xj| (i= , . . . ,n).

Leta>  andb≥. We deﬁne the real functionφby

φ(t) = at



( –t)( –bt) –at (.)

and the real numberRas follows:

R= 

b+  +(b– )_{+ }_a. (.)

Theorem .(Proinov []) Let f ∈K[z]be a polynomial of degree n≥which has only simple zeros,ξ∈Kn _{be a root vector of f and}__≤_p_{≤ ∞}_._{Suppose x}()_∈_Kn_{is an initial}

guess satisfying

Ex()=x ()_–_ξ

d(ξ) p

<R= 

(4)

where a= (n– )/q _{and b}_{= }/q_._{Then the Ehrlich iteration}_(.)_{is well deﬁned and}

con-verges cubically toξwith error estimates

x(k+)–ξλkx(k)–ξ and x(k)–ξλ(k–)/x()–ξ

for all k≥,whereλ=φ(E(x()₎₎_{and the function}_φ_{is deﬁned by}_{(.).}

Theorem .(Proinov []) Let f∈K[z]be a polynomial of degree n≥,ξ∈Knbe a root vector of f and≤p≤ ∞.Suppose x()_∈_Kn_{is a vector with distinct components satisfying}

Ex()=x ()_–_ξ

d(x()₎

p

≤R= 

b+  +(b– )_{+ }_a, (.)

where a= (n– )/q _{and b}_{= }/q_._{Then f has only simple zeros in}_K_{and Ehrlich iteration} (.)is well deﬁned and converges toξwith error estimates

_x(k+)_–_ξ_{θ λ}k_x(k)_–_ξ _and _x(k)_–_ξ_θk_λ(k–)/_x()_–_ξ

for all k≥,whereλ=φ(E(x()_)),_θ₌_ψ₍_E₍_x()₎₎_{and the function}_φ_{is deﬁned by}_(.)_and the functionψ by

ψ(t) =( –t)( –bt) –at 

 –t–at . (.)

Moreover,the method converges cubically toξ provided that E(x()) <R.

1.3 A family of high-order Ehrlich-type methods In the following deﬁnition, we deﬁne a sequence (T(N)₎∞

N=of iteration functions in the vector spaceKn. In what follows, we deﬁne the binary relation#onKnby

x#y ⇔ xi=yj for alli,j∈Inwithi=j. (.)

Deﬁnition . Let f ∈K[z] be a polynomial of degree n≥. Deﬁne the sequence (T(N)₎∞

N=of functionsT(N):DN⊂Kn→Knrecursively by settingT()(x) =xand

T_i(N+)(x) =xi–

f(xi)

f(xi) –f(xi)

j=i_x_i–_T(N)

j (x)

(i= , . . . ,n), (.)

where the sequence of domains (DN)∞N=is also deﬁned recursively by settingD=Knand

DN+=

x∈DN:x#T(N)(x),f(xi) –f(xi)

j=i  xi–Tj(N)(x)

=  fori∈In

. (.)

Given N∈N, theNth method of Kjurkchiev-Andreev’s family can be deﬁned by the following ﬁxed point iteration:

(5)

It is easy to see that in the caseN=  the Ehrlich-type method (.) coincides with the classical Ehrlich method (.). The order of convergence of the Ehrlich-type method (.) is N+ .

Kjurkchiev and Andreev [] established the following convergence result for the Ehrlich-type methods (.). This result and its proof can also be found in the monographs of Sendov, Andreev and Kjurkchiev [], Section  and Kyurkchiev [], Chapter .).

Theorem .(Kjurkchiev and Andreev []) Let f ∈C[z]be a polynomial of degree n≥ which has only simple zeros,ξ∈Cn_{be a root vector of f and N}_≥_._Let_{ <}_h_{< }_{and c}_{> }

be such that

δ> c + (n– )h and nc 

(δ–c)(δ– c– ch) – (n– )c_h ≤, (.)

whereδ=sep(f)deﬁned by(.).Suppose x()∈Cnis an initial guess satisfying the condi-tion

x()–ξ_∞≤ch. (.)

Then the Ehrlich-type method(.)converges toξ with error estimate

x(k)–ξ_∞≤ch(N+)k for all k≥. (.)

1.4 The purpose of the paper

In this paper, we present two new local convergence theorems as well as a semilocal con-vergence theorem (under computationally veriﬁable initial conditions and with ana pos-teriori error estimate) for Ehrlich-type methods (.). Our ﬁrst local convergence re-sult (Theorem .) generalizes Theorem . (Proinov []) and improves Theorem . (Kjurkchiev and Andreev []). Our second local convergence result (Theorem .) gen-eralizes Theorem . (Proinov []), but only in the casep=∞. Furthermore, several nu-merical examples are provided to show some practical applications of our semilocal con-vergence result.

2 A general convergence theorem

Recently, Proinov [–] has developed a general convergence theory for iterative pro-cesses of the type

xk+=Txk, k= , , , . . . , (.)

whereT:D⊂X→Xis an iteration function in a cone metric spaceX. In order to make this paper self-contained, we brieﬂy review some basic deﬁnitions and results from this theory.

Throughout this paperJdenotes an interval onR+containing . For an integerk≥, we denote bySk(t) the following polynomial:

Sk(t) =  +t+· · ·+tk–.

(6)

Deﬁnition .([]) A functionϕ:J→R+is calledquasi-homogeneousof degreer≥ onJif it satisﬁes the following condition:

ϕ(λt)≤λrϕ(t) for allλ∈[, ] andt∈J. (.)

If mfunctionsϕ, . . . ,ϕm are quasi-homogeneous onJ of degreer, . . . ,rm, then their productϕ· · ·ϕmis a quasi-homogeneous function of degreer+· · ·+rmonJ. Note also that a functionϕis quasi-homogeneous of degree  onJif and only it is nondecreasing onJ.

Deﬁnition .([]) A functionϕ:J→Jis said to be agauge function of order r≥ onJ if it satisﬁes the following conditions:

(i) ϕis quasi-homogeneous of degreeronJ; (ii) ϕ(t)≤tfor allt∈J.

A gauge functionϕof orderronJis said to be astrict gauge functionif the inequality in (ii) holds strictly whenevert∈J\{}.

The following is a suﬃcient condition for a gauge function of orderr.

Lemma . ([]) If ϕ:J→R+ is a quasi-homogeneous function of degree r≥on an interval J and R> is a ﬁxed point ofϕin J,thenϕis a gauge function of order r on[,R]. Moreover,if r> ,then functionϕis a strict gauge of order r on J= [,R).

Deﬁnition . ([]) LetT:D⊂X→X be a map on an arbitrary set X. A function E:D→R+ is said to be afunction of the initial conditionsofT (with a gauge function ϕonJ) if there exists a functionϕ:J→Jsuch that

E(Tx)≤ϕE(x) for allx∈DwithTx∈DandE(x)∈J. (.)

Deﬁnition .([]) LetT: D⊂X→Xbe a map on an arbitrary setX, andE: D→R+ be a function of the initial conditions ofTwith a gauge function onJ. Then a pointx∈D is said to be aninitial pointofT (with respect toE) ifE(x)∈Jand all of the iteratesTkx (k= , , , . . .) are well deﬁned and belong toD.

The following is a simple suﬃcient condition for initial points.

Theorem .([]) Let T:D⊂X→X be a map on an arbitrary set X and E: D→R+ be a function of the initial conditions of T with a gauge functionϕon J.Suppose that x∈D with E(x)∈J implies Tx∈D.Then every point x∈D such that E(x)∈J is an initial points of T.

Deﬁnition .([]) LetT:D⊂X→Xbe an operator in a cone normed space (X, · ) over a solid vector space (Y,), and letE:D→R+be a function of the initial conditions ofT with a gauge function on an intervalJ. Then the operatorT is said to bean iterated contraction at a pointξ∈D(with respect toE) ifE(ξ)∈Jand

Tx–ξ βE(x)x–ξ for allx∈DwithE(x)∈J, (.)

(7)

The following ﬁxed point theorem plays an important role in our paper.

Theorem .(Proinov []) Let T: D⊂X→X be an operator of a cone normed space (X, · )over a solid vector space(Y,),and let E:D→R+ be a function of the initial conditions of T with a gauge functionϕ of order r≥on an interval J.Suppose T is an iterated contraction at a pointξ with respect to E with control functionβsuch that

tβ(t)is a strict gauge function of order r on J (.)

and there exists a functionψ:J→R+such that

β(t) =φ(t)ψ(t) for all t∈J, (.)

whereφ:J→R+is a nondecreasing function satisfying

ϕ(t) =tφ(t) for all t∈J. (.)

Then the following statements hold true.

(i) The pointξis a unique ﬁxed point ofTin the setU={x∈D:E(x)∈J}.

(ii) Starting from each initial pointx()ofT,Picard iteration(.)remains in the setU

and converges toξwith error estimates

x(k+)–ξθ λrkx(k)–ξ and x(k)–ξθkλSk(r)_x()_–_ξ _(.)

for allk≥,whereλ=φ(E(x()₎₎_and_θ₌_ψ₍_E₍_x()₎₎_.

In the caseβ≡φ, Theorem . reduces to the following result.

Corollary .([]) Let T: D⊂X→X be an operator in a cone normed space(X, · ) over a solid vector space(Y,),and let E:D→R+be a function of the initial conditions of T with a strict gauge functionϕof order r≥on an interval J.Suppose that T is an iterated contraction at a pointξwith respect to E and with control functionφsatisfying(.).Then the following statements hold true.

(i) The pointξis a unique ﬁxed point ofTin the setU={x∈D:E(x)∈J}.

(ii) Starting from each initial pointx()_of_T_,_{Picard iteration}_(.)_{remains in}_U_and

converges toξwith orderrand error estimates

_x(k+)_–_ξ_λrk_x(k)_–_ξ _and _x(k)_–_ξ_λSk(r)_x()_–_ξ _(.)

for allk≥,whereλ=φ(E(x()₎₎_.

3 Some inequalities inKn

In this section, we present some useful inequalities inKn_{which play an important role in} the paper.

Lemma .([]) Let u,v∈Kn,v be a vector with distinct components and≤p≤ ∞. Then for all i,j∈In,

|ui–vj| ≥

 –u–v d(v) p

(8)

|ui–uj| ≥

 – /qu–v d(v)

p

|vi–vj|. (.)

Lemma .([]) Let u,v∈Kn_and__≤_p_{≤ ∞}_._{If the vector v has distinct components and}

u_d–₍_v₎v p

<  

then the vector u also has distinct components.

Lemma .([]) Let u,v,ξ∈Kn_,_ξ_{be a vector with distinct components}_{, }_≤_p_{≤ ∞}_and

v–ξ u–ξ. (.)

Then for all i,j∈In,

|ui–vj| ≥

 – /qu–ξ d(ξ) p

|ξi–ξj|. (.)

Lemma . Let u,v,ξ∈Kn_,_α_≥_,_and__≤_p_{≤ ∞}_._{If v is a vector with distinct components}

such that

u–ξ αv–ξ, (.)

then for all i,j∈In,

|uj–vi| ≥

 – ( +α)v–ξ d(v) p

|vi–vj|. (.)

Proof By the triangle inequality of cone norm inKn_{and (.), we obtain}

u–v u–ξ+v–ξ ( +α)v–ξ,

which yields

u_d–₍_v₎v( +α)v–ξ d(v) .

Taking thep-norm, we get

u_d–₍_v₎v p

≤( +α)v–ξ d(v) p

. (.)

From (.) and (.), we obtain (.), which completes the proof.

Lemma . Let u,v,ξ∈Kn,α≥,and≤p≤ ∞.If v is a vector with distinct components such that(.)holds,then for all i,j∈In,

|ui–uj| ≥

 – /q( +α)v–ξ d(v) p

|vi–vj|. (.)

(9)

4 Local convergence theorem of the ﬁrst type

Letf ∈K[z] be a polynomial of degreen≥ which has only simple zeros inK, and let

ξ∈Knbe a root vector off. In this section we study the convergence of the Ehrlich-type methods (.) with respect to the function of the initial conditionsE:Kn_→_R

+deﬁned as follows:

E(x) =x–ξ d(ξ) p

(≤p≤ ∞). (.)

Leta>  andb≥. Throughout this section, we deﬁne the functionφand the real num-berRby (.) and (.), respectively. It is easy to show thatRis the smallest positive solution of the equationφ(t) = . Note thatφis an increasing function which maps [,R] onto [, ]. Besides,φis quasi-homogeneous of degree  on [,R]. In the next deﬁnition, we introduce a sequence of such functions.

Deﬁnition . We deﬁne the sequence (φN)∞N= of nondecreasing functions φN: [,R]→[, ] recursively by settingφ(t) =  and

φN+(t) =

at_φ

N(t) ( –t)( –bt) –at_φ

N(t)

, (.)

wherea>  andb≥ are constants.

Proof of the correctness of Definition. We prove the correctness of the definition by induction. ForN=  it is obvious. Assume that for someN≥ the functionφN is well defined and nondecreasing on [,R] andφN(R) = . We shall prove the same forφN+. It follows from the induction hypothesis that

( –t)( –bt) –atφN(t)≥( –t)( –bt) –at>  for allt∈[,R] (.)

which means that the functionφN+is well deﬁned on [,R]. From (.) and the induction hypothesis, we deduce thatφN+is nondecreasing on [,R]. From (.) andφN(R) = , we obtain

φN+(R) =

aRφN(R) ( –R)( –bR) –aR_φ

N(R)

= aR



( –R)( –bR) –aR =φ(R) = .

This completes the induction and the proof of the correctness of Deﬁnition ..

Deﬁnition . For any integerN≥, we deﬁne the functionϕN: [,R]→[,R] as fol-lows:

ϕN(t) =tφN(t), (.)

where the functionφN is deﬁned by Deﬁnition ..

In the next lemma, we present some properties of the functionsφN andϕN.

(10)

(i) φN is a quasi-homogeneous function of degreeNon[,R];

(ii) φN+(t)≤φ(t)φN(t)for everyt∈[,R];

(iii) φN+(t)≤φN(t)for everyt∈[,R];

(iv) φN(t)≤φ(t)N for everyt∈[,R];

(v) ϕN is a gauge function of orderN+ on[,R].

Proof Claim (i) can easily be proved by induction. From (.) and (.), we get

φN+(t) =

at_φ

N(t) ( –t)( –bt) –at_φ

N(t)≤

at_φ

N(t)

( –t)( –bt) –at =φ(t)φN(t),

which proves (ii). Claim (iii) is a trivial consequence from (ii). Claim (iv) follows from (ii) by induction. Claim (v) follows from (i) and the deﬁnition ofϕN.

Lemma . Let f ∈K[z]be a polynomial of degree n≥,ξ∈Kn_{be a root vector of f and}

N≥.Suppose x∈DN is a vector such that f(xi)= for some i∈In.

(i) Ifx#T(N)₍_x₎_,_then f(xi)

f(xi)

–

=  –σi xi–ξi

, (.)

whereσi∈Kis deﬁned by

σi= (xi–ξi)

j=i

T_j(N)(x) –ξj

(xi–ξj)(xi–Tj(N)(x))

. (.)

(ii) Ifx∈DN+,then

T_i(N+)(x) –ξi= –

σi  –σi

(xi–ξi). (.)

Proof (i) Taking into account thatξ is a root vector off, we get

f(xi)

–

j=i 

xi–Tj(N)(x) =

n

j=  xi–ξj

–

j=i 

xi–Tj(N)(x)

= 

xi–ξi

+

j=i

 xi–ξj

– 

xi–Tj(N)(x)

= 

xi–ξi

–

j=i

T_j(N)(x) –ξj

(xi–ξj)(xi–Tj(N)(x))

=  –σi xi–ξi ,

which proves (.).

(ii) It follows fromx∈DN+that

f(xi) –f(xi)

j=i 

xi–Tj(N)(x)

(11)

Then from (.) and (.), we obtain

T_i(N+)(x) –ξi=xi–ξi–

f(xi)

–

j=i 

xi–Tj(N)(x) –

=xi–ξi–

xi–ξi  –σi

= – σi  –σi

(xi–ξi),

which completes the proof.

Lemma . Let f ∈K[z]be a polynomial of degree n≥which has only simple zeros in

K,ξ∈Knbe a root vector of f,N≥,and≤p≤ ∞.Suppose x∈Knis a vector satisfying the following condition:

E(x) =x–ξ d(ξ) p

<R= 

b+  +(b– )_{+ }_a, (.)

where the function E:Kn_→_R

+is deﬁned by(.),a= (n– )/q,and b= /q.Then

x∈DN, T(N)(x) –ξφN

E(x)x–ξ and ET(N)(x)≤ϕN

E(x). (.)

Proof We shall prove statements by induction onN. IfN= , then (.) holds trivially. Assume that (.) holds for someN≥.

First, we show thatx∈DN+,i.e. x#T(N)(x) and (.) holds for everyi∈In. It follows from the ﬁrst inequality in (.) that the inequality (.) is satisﬁed with u=x and v=T(N)₍_x_{). Then by Lemma . and (.), we obtain}

xi–Tj(N)(x)≥

 –bx–ξ d(ξ) p

|ξi–ξj| ≥

 –bE(x)dj(ξ) >  (.)

for everyj=i. Consequently,x#T(N)(x). It remains to prove (.) for everyi∈In. Leti∈In be ﬁxed. We shall consider only the non-trivial casef(xi)= . In this case (.) is equivalent to

f(xi)

–

= . (.)

We deﬁneσiby (.). It follows from Lemma .(i) that (.) is equivalent toσi= . By Lemma . withu=xandv=ξand (.), we get

|xi–ξj| ≥

 –x–ξ d(ξ) p

|ξi–ξj| ≥

 –E(x)di(ξ) >  (.)

for everyj=i. From the triangle inequality inK, (.), (.), the induction hypothesis and Hölder’s inequality, we get

|σi| ≤ |xi–ξi|

j=i

|T_j(N)(x) –ξj|

|xi–ξj||xi–Tj(N)(x)|

≤ 

( –E(x))( –bE(x))

|xi–ξi|

di(ξ)

j=i

|T_j(N)(x) –ξj|

(12)

≤ aE(x)ϕN(E(x)) ( –E(x))( –bE(x))

= aE(x) _φ

N(E(x))

( –E(x))( –bE(x)). (.)

From this,φN(E(x))≤ and (.), we obtain

|σi| ≤

aE(x)

( –E(x))( –bE(x))< 

which yieldsσi= , and so (.) holds. Hence,x∈DN+.

Second, we show that the inequalities in (.) hold forN+ . The ﬁrst inequality for N+  is equivalent to

T_i(N+)(x) –ξi≤φN+

E(x)|xi–ξi| for alli∈In. (.)

Leti∈Inbe ﬁxed. Ifxi=ξi, thenTi(N+)(x) =ξiand so (.) becomes an equality. Suppose

xi=ξi. By Lemma .(ii), the triangle inequality inK, and the estimate (.), we get

T_i(N+)(x) –ξi= |

σi|

| –σi||

xi–ξi| ≤ |

σi|  –|σi||

xi–ξi|

≤ aE(x)φN(E(x)) ( –E(x))( –bE(x)) –aE(x)_φ

N(E(x))|

xi–ξi|

=φN+

E(x)|xi–ξi|,

which proves (.). Dividing both sides of the inequality (.) bydi(ξ) and taking the

p-norm, we obtain

ET(N+)(x)≤ϕN+

E(x)

which proves that the second inequality in (.) holds forN+ . This completes the

in-duction and the proof of the lemma.

Now we are ready to state the main result of this section. In the caseN=  this result coincides with Theorem ..

Theorem . Let f ∈K[z]be a polynomial of degree n≥which has only simple zeros in

K,ξ∈Kn_{be a root vector of f}_,_N_≥_,_and__≤_p_{≤ ∞}_._{Suppose x}()_∈_Kn_{is an initial guess}

satisfying

Ex()=x ()_–_ξ

d(ξ) p

<R= 

b+  +(b– )_{+ }_a, (.)

where the function E:Kn→R+ is deﬁned by(.), a= (n– )/q,and b= /q. Then the Ehrlich-type iteration(.)is well deﬁned and converges toξ with error estimates

x(k+)–ξλ(N+)kx(k)–ξ and x(k)–ξλ((N+)k–)/(N)x()–ξ (.)

(13)

Proof We apply Corollary . to the iteration functionT(N)_:_D

N⊂Kn→Kndeﬁned by Deﬁnition . and to the functionE:Kn_→_R

+ deﬁned by (.). LetJ= [,R). It follows from Lemma ., Lemma .(v), and Lemma . thatEis a function of the initial con-ditions of T(N) _{with a strict gauge function} _ϕ

N of order r= N+  onJ. Since ξ is a root vector of f, then E(ξ) = ∈J. It follows from Lemma . thatT(N) _{is an iterated} contraction at a pointξ with respect toE and with control functionφN. The fact that

x() _{is an initial point of}_T(N)_{follows from Lemma . and Theorem .. Hence, all the} assumptions of Corollary . are satisﬁed, and the statement of Theorem . follows

from it.

Corollary . Let f ∈K[z]be a polynomial of degree n≥which has only simple zeros in

K,ξ∈Kn_{be a root vector of f}_,_N_≥_,_and__≤_p_{≤ ∞}_._{Suppose x}()_∈_Kn_{is an initial guess}

satisfying(.).Then the Ehrlich-type iteration(.)is well deﬁned and converges toξ with error estimates

x(k+)–ξλN(N+)kx(k)–ξ and

x(k)–ξλ((N+)k–)/x()–ξ

(.)

for all k≥,whereλ=φ(E(x()₎₎_and_φ_{is a real function deﬁned by}_{(.).}

Proof It follows from Theorem . and Lemma .(iv).

Let  <h<  be a given number. Solving the equationφ(t) =hin the interval (,R), we can reformulate Corollary . in the following equivalent form.

Corollary . Let f ∈K[z]be a polynomial of degree n≥which has n simple zeros inK,

ξ∈Kn_{be a root vector of f}_,_N_≥_{, }_≤_p_{≤ ∞}_,_and_{ <}_h_{< .}_{Suppose x}()_∈_Kn_{is an initial}

guess which satisﬁes

Ex()=x ()_–_ξ

d(ξ) p

<Rh=



b+  +(b– )_{+ }_a_{( + /}_h₎, (.)

where a= (n– )/q _{and b}_{= }/q_._{Then the Ehrlich-type method}_(.)_{is well deﬁned and}

converges toξ with error estimates

x(k+)–ξhN(N+)kx(k)–ξ and

x(k)–ξh(N+)k–x()–ξ

(.)

for all k≥.

Remark . Corollary . is an improvement of the result of Kjurkchiev and Andreev []

(see Theorem . above). Suppose that a vectorx()∈Knsatisﬁes (.). It is easy to show that condition (.) is equivalent to the following one:

 <c<min

δ

( + (n– )h),

δ

 + h+(n– )h_{+ }_h_{+ }_n_{+ }

(14)

From this, the initial condition (.) and  <h< , we obtain

x()–ξ

d(ξ)

_∞≤x()–ξ∞

δ ≤

ch δ

≤ h

 + h+(n– )h_{+ }_h_{+ }_n_{+ }

≤ 

 +(n– ) + (n+ )/h

≤ 

 +n–  + (n– )/h.

Therefore, x() _{satisﬁes (.) with} _p₌_∞_{. Then it follows from Corollary . that the} Ehrlich-type method (.) is well deﬁned and converges toξwith error estimates (.). From the second estimate in (.) and (.), we get the estimate (.), which completes the proof.

5 Local convergence theorem of the second type

Letf ∈K[z] be a polynomial of degreen≥. We study the convergence of the Ehrlich-type method (.) with respect to the function of the initial conditionsE:D→R+deﬁned by

E(x) =x–ξ d(x) p

(≤p≤ ∞). (.)

In the previous section, we introduce the functionsφN,ϕN, and the real numberRwith two parametersa>  andb≥. In this section, we consider a special case ofφN,ϕN, and

Rwhenb= . In other words, now we deﬁneRby

R= 

 +√ + a. (.)

Furthermore, we deﬁne the functionsφN andϕN by Deﬁnitions . and ., respectively, but with

φN+(t) =

atφN(t) ( –t)( – t) –at_φ

N(t)

(.)

instead of (.), wherea>  is a constant.

Deﬁnition . For a given integer N ≥ , we deﬁne the increasing function

βN: [,R]→[, ) by

βN(t) =

at_φ

N–(t)  –t–at_φ

N–(t)

(.)

and we deﬁne the decreasing functionψN: [,R]→(, ] as follows:

ψN(t) =  – t

 +βN(t)

=( –t)( – t) –at _φ

N–(t)  –t–at_φ

N–(t)

(15)

Proof of the correctness of Deﬁnition . The functionsβN andψN are well deﬁned on [,R] since

 –t–atφN–(t)≥ –t–at>  for allt∈[,R]. (.)

The monotonicity of βN and ψN is obvious. It remains to prove that βN(R) <  and

ψN(R) > . SinceφN(R) = , we obtain

βN(R) =

aR

 –R–aR <  and ψN(R) =

( –R)( – R) –aR  –R–aR > ,

which completes the proof of the correctness of Deﬁnition ..

Lemma . Let N≥.Then:

(i) βN is a quasi-homogeneous of degreeNon[,R];

(ii) βN(t) =φN(t)ψN(t)for everyt∈[,R];

(iii) βN+(t)≤βN(t)for everyt∈[,R];

(iv) ψN+(t)≥ψN(t)for everyt∈[,R].

Proof The functionβN can be presented in the formβN(t) =tφN–(t)(t), where(t) = a/( –t–atφN–(t)). Therefore,βN is quasi-homogeneous of degree Non [,R] since it is a product of three quasi-homogeneous functions on [,R] of degree , N– , and . From the deﬁnitions of the functionsφN,ψN, andβN, we get

φN(t)ψN(t) =

at_φ

N–(t) ( –t)( – t) –at_φ

N–(t)

( –t)( – t) –at_φ

N–(t)  –t–at_φ

N–(t)

=βN(t).

Claim (iii) follows from Lemma .(iii) and (.). Claim (iv) follows from (iii) and (.).

Lemma . Let f ∈K[z]be a polynomial of degree n≥which splits overK,ξ∈Kna root vector of f,N≥,and≤p≤ ∞.Suppose x∈Kn_{is a vector with distinct components such}

that

E(x) =x–ξ d(x) p

≤R= 

 +√ + a, (.)

where the function E:D→R+is deﬁned by(.)and a= (n– )/q.Then f has only simple zeros inK,

x∈DN, T(N)(x) –ξβN

E(x)x–ξ and ET(N)(x)≤ϕN

E(x). (.)

Besides,the vector T(N)₍_x₎_{has pairwise distinct components}_.

Proof It follows from (.) andR< / thatE(x) < /. Then it follows from Lemma . that the vectorξhas distinct components, which means thatf has only simple zeros inK. We divide the proof into two steps.

(16)

First we show thatx∈DN+,i.e. x#T(N)(x) and (.) holds for everyi∈In. It follows from the ﬁrst inequality in (.) that (.) holds withv=x,u=T(N)₍_x_{), and}_α_{= . Therefore by} Lemma ., (.) andR< /, we obtain

xi–Tj(N)(x)>

 – x–ξ d(x) p

|xi–xj| ≥

 – E(x)dj(x) >  (.)

for everyj=i. Consequently,x#T(N)(x). It remains to prove (.) for everyi∈In. Leti∈In be ﬁxed. We shall consider only the non-trivial casef(xi)= . In this case (.) is equivalent to (.). On the other hand, it follows from Lemma .(i) that (.) is equivalent toσi= , whereσiis deﬁned by (.). By Lemma . withu=ξ andv=xand (.), we get

|xi–ξj| ≥

 –x–ξ d(x) p

|xi–xj|=

 –E(x)|xi–xj| ≥

 –E(x)di(x) >  (.)

for everyj=i. Hence, we obtainx#ξ. From the induction hypothesis, we get _T(N)

i (x) –ξi≤βN

E(x)|xi–ξi|. (.)

Combining the triangle inequality inK, (.), (.) and (.), we obtain

|σi| ≤ |xi–ξi|

j=i

|T_j(N)(x) –ξj|

|xi–ξj||xi–Tj(N)(x)|

≤ 

( –E(x))( – E(x))

|xi–ξi|

di(x)

j=i

|T_j(N)(x) –ξj|

dj(x)

≤ βN(E(x)) ( –E(x))( – E(x))

|xi–ξi|

di(x)

j=i

|xj–ξj|

dj(x)

which, using Hölder’s inequality, yields

|σi| ≤

aE(x)_β

N(E(x))

( –E(x))( – E(x)). (.)

From this and (.), we deduce

|σi| ≤

aE(x)

( –E(x))( – E(x))< ,

which yieldsσi= , and so (.) holds. Thus we prove thatx∈DN+.

Now we have to prove that the ﬁrst inequality in (.) is satisﬁed forN+ , which is equivalent to

T_i(N+)(x) –ξi≤βN+

E(x)|xi–ξi| for alli∈In. (.)

(17)

estimate (.) that

T_i(N+)(x) –ξi= |

σi|

| –σi|

|xi–ξi| ≤ |

σi|  –|σi|

|xi–ξi|

≤ aE(x)βN(E(x)) ( –E(x))( – E(x)) –aE(x)_β

N(E(x))|

xi–ξi|.

From this inequality, Lemma .(ii),ψN(t)≤, (.) and Lemma .(iv), we obtain

T_i(N+)(x) –ξi≤

aE(x)_φ

N(E(x))ψN(E(x)) ( –E(x))( – E(x)) –aE(x)_φ

N(E(x))|

xi–ξi|

≤φN+

E(x)ψN

E(x)|xi–ξi|

≤φN+

E(x)ψN+

E(x)|xi–ξi|=βN+

E(x)|xi–ξi|

which proves (.). This completes the induction.

Step. In this step we prove the second inequality in (.) and thatT(N)(x) has distinct components. First inequality in (.) allow us to apply Lemma . withu=T(N)₍_x_),_v₌_x_, andα=βN(E(x)). By Lemma . and (.), we deduce

T_i(N)(x) –T_j(N)(x)≥ – /qE(x) +βN

E(x)|xi–xj| ≥ψN

E(x)|xi–xj|.

By taking the minimum over allj∈Insuch thatj=i, we obtain

di

T(N)(x)≥ψN

E(x)di(x) >  (.)

which implies that T(N)₍_x_{) has distinct components. It follows from (.), (.), and} Lemma .(ii) that

|T_i(N)(x) –ξi|

di(T(N)(x)) ≤

βN(E(x))

ψN(E(x))

|xi–ξi|

di(x) =φN

E(x)|xi–ξi| di(x)

.

By taking thep-norm, we obtain

ET(N)(x)≤φN

E(x)E(x) =ϕN

E(x),

which proves the second inequality in (.). This completes the proof.

Now we are able to state the main result of this section. In the case whenN=  and p=∞this result reduces to Theorem ..

Theorem . Let f∈K[z]be a polynomial of degree n≥which splits overK,ξ∈Knbe a root vector of f,N≥,and≤p≤ ∞.Suppose x()_∈_Kn_{is an initial guess with distinct}

components such that

Ex()=x ()_–_ξ

d(x()₎

p

≤R= 

(18)

where the function E is deﬁned by(.)and a= (n– )/q_._{Then f has only simple zeros in}_K_,

and the Ehrlich-type iteration(.)is well deﬁned and converges toξ with error estimates

x(k+)_–_ξ_{θ λ}(N+)k

x(k)_–_ξ _and

x(k)–ξθkλ((N+)k–)/Nx()–ξ

(.)

for all k≥,whereλ=φN(E(x())),θ=ψN(E(x())).Moreover, the method is convergent

with orderN+ provided that E(x()) <R.

Proof We apply Theorem . to the iteration functionT(N)_:_D

N⊂Kn→Kntogether with the functionE: DN→R+deﬁned by (.).

It follows from Lemma . and Lemma .(v) thatEis a function of the initial conditions ofT(N)_{with gauge function}_ϕ

N of orderr= N+  on the intervalJ= [,R].

From Lemma ., we see thatT(N)_{is an iterated contraction at}_ξ_{with respect to}_E_and with control functionβN. Also, it is easy to see that the functionsβN,φN,ψN, andϕNhave the properties (.), (.) and (.).

It follows from Lemma . thatx()_∈_D

N. According to Theorem . to prove thatx() is an initial point ofT(N)_{it is suﬃcient to prove that}

x∈DN and E(x)∈J ⇒ T(N)(x)∈DN. (.)

From x∈DN, we have T(N)(x)∈Kn. By Lemma ., T(N)(x) has distinct components andE(T(N)₍_x₎₎_≤_ϕ

N(E(x)). The last inequality yields E(T(N)(x))∈JsinceϕN:J→J and

E(x)∈J. Thus we have bothT(N)₍_x₎_∈_D_and_E₍_T(N)₍_x₎₎_∈_J_{. Applying Lemma . to the} vectorT(N)₍_x_{), we get}_T(N)₍_x₎_∈_D

N, which proves (.). Therefore,x()is an initial point ofT(N)_.

Now the statement of Theorem . follows from Theorem ..

6 Semilocal convergence theorem

In this section we establish semilocal convergence theorems for Ehrlich-type methods (.) for ﬁnding all zeros of a polynomial simultaneously. We study the convergence of these methods with respect to the function of the initial conditionsEf:D→R+deﬁned by

Ef(x) = Wf(x)

d(x) p

(≤p≤ ∞). (.)

Recently Proinov [] has shown that there is a relationship between local and semilocal theorems for simultaneous root-ﬁnding methods. It turns out that from any local conver-gence theorem for a simultaneous method one can obtain as a consequence a semilocal theorem for the same method. In particular, from Theorem . we can obtain a semilo-cal convergence theorem for Ehrlich-type methods (.) under computationally veriﬁable initial conditions. For this purpose we need the following result.

Theorem .(Proinov []) Let f ∈K[z]be a polynomial of degree n≥.Suppose x∈Kn

is an initial guess with distinct components such that

Wf(x)

d(x) p

≤ R( –R)

(19)

for some≤p≤ ∞and <R≤/( +√a),where a= (n– )/q_._{In the case n}_{= }_{and p}₌_∞

we assume that inequality in(.)is strict.Then f has only simple zeros inK,and there exists a root vectorξ∈Knof f such that

x–ξ αEf(x)Wf(x) and x_d–₍_xξ₎

p

≤R, (.)

where the real functionαis deﬁned by

α(t) = 

 – (a– )t+( – (a– )t)_{– }_t. (.)

If the inequality(.)is strict,then the second inequality in(.)is strict too.

Now, we are ready to state and prove the main result of this paper.

Theorem . Let f∈K[z]be a polynomial of degree n≥,N≥, ≤p≤ ∞. Suppose x()_∈_Kn_{is an initial guess with distinct components such that}

Ef

x()=Wf(x ()₎

d(x()₎

p

< 

( +√ + a), (.)

where the function Ef is deﬁned by(.)and a= (n– )/q.Then f has only simple zeros in K,and the Ehrlich-type iteration(.)is well deﬁned and converges to a root vectorξ of f with order of convergenceN+ and with an a posteriori error estimate

x(k)_–_ξ_α_E

f

x(k)_W

f

x(k) _(.)

for all k≥such that Ef(x(k)) < /( +

√

 + a)_,_{where the function}_α_{is deﬁned by}_(.).

Proof Let us deﬁneRby (.). It is easy to calculate thatR< /( +√a) and

R( –R)  + (a– )R=

( +√ + a)

( +√ + a)( + a+√ + a)=

 ( +√ + a).

Therefore, (.) can be written in the form

Wf(x())

d(x()₎

p

< R( –R)  + (a– )R.

Then it follows from Theorem . thatf has only simple zeros inKand there exists a root vectorξ∈Knoff such that

x()–ξ

d(x()₎

p

<R.

(20)

k≥,

Wf(x(k))

d(x(k)₎

p

< R( –R)

 + (a– )R. (.)

Then it follows from Theorem . that there exists a root vectorη∈Knoff such that

x(k)–ηαEf

x(k)Wf

x(k) and x (k)_–_η

d(x(k)₎

p

<R. (.)

From the second inequality in (.) and Theorem ., we conclude that the Ehrlich-type iteration (.) converges toη. By the uniqueness of the limit, we getη=ξ. Therefore, the error estimate (.) follows from the ﬁrst inequality in (.). This completes the proof. Settingp=∞in Theorem ., we obtain the following result.

Corollary . Let f∈K[z]be a polynomial of degree n≥and N≥.Suppose x()_∈_Kn

is an initial guess with distinct components such that

Wf(x())

d(x()₎

_∞< 

( +√n– ). (.)

Then f has only simple zeros inKand the Ehrlich-type iteration(.)is well deﬁned and converges to a root vectorξ of f with order of convergenceN+ and with error estimate (.)for p=∞.

Settingp=  in Theorem . we obtain the following result.

Corollary . Let f ∈K[z]be a polynomial of degree n≥and N≥.Suppose x()∈Kn is an initial guess with distinct components such that

Wf(x())

d(x()₎

 <

. (.)

Then f has only simple zeros inKand the Ehrlich-type iteration(.)is well deﬁned and converges with orderN+ to a root vectorξof f with error estimate(.)for p= .

7 Numerical examples

In this section, we present several numerical examples to show some applications of The-orem .. Letf∈C[z] be a polynomial of degreen≥ and letx()_∈_Cn_{be an initial guess.} We show that Theorem . can be used:

• to prove numerically thatf has only simple zeros;

• to prove numerically thatNth Ehrlich-type iteration (.) starting fromx()_{is well} deﬁned and converges with orderN+ to a root vector off;

• to guarantee the desired accuracy when calculating the roots off via theNth Ehrlich-type method.

In the examples below, we use the function of the initial conditionsEf:D→R+deﬁned by

Ef(x) = Wf(x)

d(x)

(21)

whereWfis the Weierstrass correction deﬁned by (.). We consider only the casep=∞, since the other cases are similar.

Also, we use the real functionαdeﬁned by

α(t) = 

 – (n– )t+( – (n– )t)_{– }_t. (.)

It follows from Theorem . that if there exists an integerm≥ such that

Ef

x(m)≤R= 

( +√n– ), (.)

thenf has only simple zeros and the Ehrlich-type iteration (.) is well deﬁned and con-verges to a root vectorξoff with order of convergence N+ . Besides, for allk≥msuch that

Ef

x(k)<R= 

( +√n– ) (.)

the followinga posteriorierror estimate holds: x(k)–ξ_∞<εk, whereεk=α

Ef

x(k)Wf

x(k)_∞. (.)

In the examples, we apply the Ehrlich-type methods (.) for someN≥ using the following stopping criterion:

εk< – and Ef

x(k)<R (k≥m). (.)

For givenNwe calculate the smallestm≥ which satisﬁes the convergence condition (.), the smallestk≥mfor which the stopping criterion (.) is satisﬁed, as well as the value ofεkfor the lastk.

In Table  the values of iterations are given to  decimal places. The values of other quantities (R,Ef(x(m)),etc.) are given to six decimal places.

Numerical calculations are made using the software package Mathematica [].

Example . We consider the polynomial

f(z) =z– 

and the initial guess

x()= (. + .i, –. + .i, –. + .i, . – .i)

(22)

[image:22.595.142.452.96.225.2]

Table 1 Values ofm,k, andεkfor Example 7.1 (R= 0.125)

N m Ef(x(m)) εm k εk εk+1

[image:22.595.142.451.258.356.2]

1 2 0.010032 1.457548×10–2 ₄ _4.385760_×₁₀–21 _8.919073_×₁₀–63 2 1 0.067725 1.242914×10–1 ₃ _1.347060_×₁₀–38 _7.284576_×₁₀–193 3 1 0.015716 2.300541×10–2 3 1.825502×10–106 5.054741×10–744 4 1 0.002730 3.887455×10–3 2 1.330837×10–25 3.543773×10–230 5 1 0.001215 1.722883×10–3 2 4.720064×10–37 2.999643×10–407 6 1 0.000206 2.927439×10–4 2 1.060096×10–50 5.523501×10–657 7 1 0.000081 1.155284_×10–4 ₂ _6.261239_×₁₀–67 _3.252761_×₁₀–1,002 8 1 0.000014 1.986052_×10–5 ₂ _6.080606_×₁₀–85 _3.570038_×₁₀–1,439 9 1 0.000005 7.910775×10–6 ₂ _1.309022_×₁₀–105 _1.170454_×₁₀–2,002 10 1 0.000000 1.366899×10–6 ₂ _4.301615_×₁₀–128 _8.477451_×₁₀–2,683 100 1 0.000000 1.820743×10–57 ₁ _1.820743_×₁₀–57 _3.460397_×₁₀–11,451

Table 2 Numerical results for Example 7.1 in the caseN= 10

k x₁(k) x₂(k)

0 0.5 + 0.5i –1.36 + 0.42i

1 1.000000380419496 + 0.000000816235730i –1.000000220051461 – 0.000000495915480i

2 1.000000000000000 + 0.000000000000000i –1.000000000000000 + 0.000000000000000i

k x₁(k) x₂(k)

0 –0.25 + 1.28i 0.46 – 1.37i

1 0.000000277962637 + 0.999999578393062i –0.000000314533436 – 0.999998669784542i

2 0.000000000000000 + 1.000000000000000i 0.000000000000000 – 1.000000000000000i

Table 3 Values ofm,k, andεkfor Example 7.2 (R= 0.043061)

N m Ef(x(m)) εm k εk εk+1

1 6 0.036897 3.187918×10–2 9 3.967908×10–36 5.304009×10–106 2 5 0.000003 1.182714×10–6 6 6.112531×10–28 2.230412×10–134 3 4 0.000064 2.475020_×10–5 ₅ _2.446120_×₁₀–29 _2.722168_×₁₀–197 4 4 0.000000 1.550670×10–11 ₅ _3.838741_×₁₀–93 _1.589981_×₁₀–827 5 3 0.005793 2.415745×10–3 ₄ _9.532339_×₁₀–24 _8.487351_×₁₀–248 6 3 0.000293 1.127450×10–4 ₄ _9.565008_×₁₀–45 _1.725858_×₁₀–565 7 3 0.000005 2.173198×10–6 ₄ _4.018844_×₁₀–77 _6.737932_×₁₀–1,138 8 3 0.000000 1.562375×10–8 ₄ _1.162424_×₁₀–123 _1.291370_×₁₀–2,080 9 3 0.000000 4.092421×10–11 ₄ _4.245137_×₁₀–187 _1.373908_×₁₀–3,530 10 3 0.000000 3.904607×10–14 ₄ _4.643262_×₁₀–270 _2.543247_×₁₀–5,644 30 2 0.000055 2.129417×10–5 ₃ _5.721566_×₁₀–249 _2.377023_×₁₀–15,106

In Table , we present numerical results for Example . in the caseN= .

Example . We consider the polynomial

f(z) =z+z+ 

and Aberth’s initial approximationx()_∈_Cn_{given by (see Aberth [] and Petković}_{et al.} []):

x()ν = – a

n +rexp(iθν), θν= π

n

ν–



,ν= , . . . ,n, (.)

[image:22.595.146.449.394.522.2]