Preconditioned Lanczos method for generalized Toeplitz eigenvalue problems

(1)

Contents lists available atScienceDirect

Journal of Computational and Applied

Mathematics

journal homepage:www.elsevier.com/locate/cam

Preconditioned Lanczos method for generalized Toeplitz

eigenvalue problems

I

Yuan-Yuan Wang

a

, Lin-Zhang Lu

b,a,∗

a_{School of Mathematical Science, Xiamen University, China}

b_{School of Mathematics and Computer Science, Guizhou Normal University, China}

a r t i c l e i n f o

Article history: Received 28 March 2007

Received in revised form 23 September 2007 Keywords: Toeplitz matrix Sine transform Lanczos method Preconditioning a b s t r a c t

We employ the sine transform-based preconditioner to precondition the shifted Toeplitz matrixAn−ρBninvolved in the Lanczos method to compute the minimum eigenvalue

of the generalized symmetric Toeplitz eigenvalue problemAnx = λBnx, whereAnand

Bn are given matrices of suitable sizes. The sine transform-based preconditioner can

improve the spectral distribution of the shifted Toeplitz matrix and, hence, can speed up the convergence rate of the preconditioned Lanczos method. The sine transform-based preconditioner can be implemented efficiently by the fast transform algorithm. A convergence analysis shows that the preconditioned Lanczos method converges sufficiently fast, and numerical results show that this method is highly effective for a large matrix.

1. Introduction

Computing the extreme eigenvalues of a generalized eigenvalue problem is one of the most important topics in numerical linear algebra, which often arises in many scientific and engineering problems, such as signal and image processing as well as control theory. Iteration methods, like Lanczos method, are widely used for solving these kinds of problems (see [2,5,7, 10,18–20]), and the preconditioning strategy is often adopted to accelerate their converge rates (see [14,15]).

Some of these problems may have special structures such as Toeplitz, Hankle, or banded [4,16]. Using specific preconditioners which utilize the advantages of these structured matrix pencils can considerably save computing workloads and accelerate convergence rate.

In this paper, we present a preconditioned Lanczos method for computing the extreme eigenvalues of the generalized Toeplitz eigenvalue problem

Anx

=

λ

Bnx

,

whereAn

,

Bnaren

×

nsymmetric Toeplitz matrices withBnpositive definite. This problem is also briefly called the pencil problem for

(

An

,

Bn

)

. This kind of problem arises from the estimation of sinusoidal signals in noise [6] etc.

The basic algorithm is an iteration on an approximate eigenpair. At each step we use the Rayleigh–Ritz projection on a certain Krylov subspace generated by a preconditioned shifted Toeplitz matrix so that a new approximate eigenpair can be produced. We use the optimal sine transform-based preconditioner to improve the spectral distribution of the shifted Toeplitz matrix. Theoretical analysis and numerical experiments on this new method are presented and bounds on asymptotic convergence rate are derived.

I_{This work was supported by the National Natural Science Foundation of China (No. 10531080).}

∗_{Corresponding author at: School of Mathematics and Computer Science, Guizhou Normal University, China.}

(2)

The motivation of choosing sine transform-based preconditioner for the preconditioned Lanczos method is that the shifted Toeplitz matrix is still a Toeplitz matrix, and the method involves only Toeplitz and sine transform matrix–vector multiplications which can be computed efficiently by the fast transform algorithm. Moreover, the sine transform-based preconditioner is determined by its first column which can save some cost.

Ng [17] proposed applying the preconditioned Lanczos method, using the sine transform-based preconditioner, to compute the minimum eigenvalue of a symmetric positive definite Toeplitz matrix. In this paper, we extend his result from the standard eigenvalue problem to general eigenvalue problems.

This paper is organized as follows. In Section2, we briefly introduce the inverse-free Krylov subspace method for generalized symmetric eigenvalue problems. In Section3, we recall some results of approximating a given matrix by matrices that can be diagonalized by the discrete sine transform matrix. In Section4, we use the optimal sine transform approximation to construct a preconditioner for the shifted Toeplitz matrix involved in the inverse-free Lanczos method and present the algorithm. In Section5, we show that if the symmetric Toeplitz pencil

(

An

,

Bn

)

is generated by 2

π

-periodic even continuous functionsfandgwithgpositive, then the spectra of the preconditioned shifted matrices will be clustered at 1, and it follows that the preconditioned Lanczos method will converge quadratically to the minimum eigenpair of the preconditioned pencil. Finally, we use numerical results ofAlgorithm 2to show the advantage of the preconditioned Lanczos method.

Throughout the paper, we useB

0 to denote thatBis a symmetric positive definite matrix, diag

(

An

)

to denote the diagonal matrix whose diagonal elements are equal to those of the matrixAn. AlsoMatlab[13] notations are used wherever necessary.

2. The basic inverse-free Krylov method

In this section, we present the basic algorithm for finding the smallest eigenvalue

λ

and a corresponding eigenvectorxof a pencil

(

A

,

B

)

, whereAandBare symmetric withB

0. As a result, we can easily find the largest eigenvalue by technically modifying the method, or just considering the pencil

(

−

A

,

B

)

.

Given an initial approximation

(ρ

0

,

x0

)

, we then improve it by minimizing the Rayleigh quotient

ρ(

x

)

=

x

T_Ax xT_Bx

on a certain subspace. Noting that the gradient of

ρ(

x

)

atx0is r0

= ∇

ρ(

x0

)

=

(

A

−

ρ(

x0

)

B

)

x0

/

xT0Bx0

,

we can use the steepest descent method to choose a new approximate eigenvectorx1

∈

span

{

x0

,

r0

}

by minimizing

ρ(

x0

)

on the space span

{

x0

,

r0

}

. This can be considered as the Rayleigh–Ritz projection method on the subspaceK1

≡

span

{

x0

, (

A

−

ρ

0B

)

x0

}

. On the other hand, the inverse iteration constructs a new approximation byx1

=

(

A

−

ρ

0B

)

−1Bx0. If the inversion is computed by an inexact inverse iteration [9], thenx1is indeed chosen from a two-dimensional Krylov subspace generated byA

−

ρ

₀B. In this case,x1is extracted from the Krylov subspace by solving a linear system.

In general, we consider a natural extension of these two approaches, which finds a new approximate eigenvectorx1from the Krylov subspace

Km

=

Km

(

A

−

ρ

0B

,

x0

)

≡

span

{

x0

, (

A

−

ρ

0B

)

x0

, . . . , (

A

−

ρ

0B

)

mx0

}

(for some fixed integer m) by using the Rayleigh–Ritz projection method. The projection can be carried out by first constructing a basis forKmand then forming and solving the projection problem with respect to the pencil

(

A

,

B

)

. Repeating this process, we obtain the following iteration method, called the inverse-free Krylov subspace method for

(

A

,

B

)

.

Algorithm 1 (Golub, Ye [8]). The Inverse-Free Krylov Subspace Method for

(

A

,

B

)

. Input an integerm

≥

1 and an initial approximationx0with

k

x0

k =

1;

ρ

0

=

ρ(

x0

)

;

Fork

=

0

,

1

,

2

, . . .

until convergence,

Construct a basisZm

= [

z0

,

z1

, . . . ,

zm

]

forKm

=

span

{

xk

, (

A

−

ρ

kB

)

xk

, . . . , (

A

−

ρ

kB

)

mxk

};

FormAm

=

ZmT

(

A

−

ρ

kB

)

ZmandBm

=

ZmTBZm;

Find the smallest eigenpair

(µ

1

, v

1

)

for

(

Am

,

Bm

)

;

ρ

k+1

:=

ρ

k

+

µ

1andxk+1

:=

Zm

v

1. End

In the above algorithm, we apply the projection to the shifted pencil

((

A

−

ρ

_kB

),

B

)

and compute

ρ

k+1by updating

ρ

kaccordingly. Theoretically, the process is equivalent to using the projection on

(

A

,

B

)

directly. The updating formula, however, saves matrix–vector multiplications when we formAmby utilizing

(

A

−

ρ

kB

)

Zm, which needs to be computed in the construction ofZm.

Here, we notice that there are many possible ways for constructing a basisZmforKm, e.g., the orthonormal basis by the Lanczos algorithm or the B-orthonormal basis by the Arnoldi algorithm. But it is evident that whatever method we choose

(3)

to get the basis, the new approximate eigenpair

(ρ

k+1

,

xk+1

)

will be theoretically the same and is defined independently of the basis by

ρ

k+1

=

xT k+1Axk+1 xT k+1Bxk+1

=

min 06=u∈Km uTAu uT_Bu

.

(1)

InAlgorithm 1, we have assumed for convenience that dim

(

Km

)

=

m

+

1 so that a full basis

{

z0

,

z1

, . . . ,

zm

}

can be constructed. If dim

(

Km

)

=

p

+

1

<

m

+

1, we can only generatez0

,

z1

, . . . ,

zp. Then the Rayleigh–Ritz projection is carried out by simply replacingZmbyZp

= [

z0

,

z1

, . . . ,

zp

]

, and(1)is still valid. Numerically, however, early termination at step pof the inner iteration is not likely, and a full basis is usually constructed even whenp

<

min theory, but this causes no problem as the larger space spanned by more vectors would yield a better approximation.

We notice that the convergence rate of this method is determined by the spectral distribution of the shifted matrix, as opposed to that ofB−1_A_{in the Lanczos method as implied by the Kaniel-Paige–Saad theory; see [8] or Theorem 5.3} in Section5. Therefore, in order to accelerate the convergence speed ofAlgorithm 1, we can use the sine transform to precondition the shifted Toeplitz matrixA

−

ρ

B.

3. Sine transform approximation

Ann-by-nmatrixTnis said to be a Toeplitz matrix if it is of the form

Tn

=







t0 t−1

· · ·

t2−n t1−n t1 t0 t−1 t2−n

...

t1 t0

...

tn−2

... ...

t−1 tn−1 tn−2

· · ·

t1 t0







.

(2)

We assume that the entriestjin(2)are given by tj

=

1

π

Z

π 0 f

(θ)

cos

(

j

θ)

d

θ,

j

=

0

,

±

1

,

±

2

, . . . ,

wheref

(θ)

is the generating function of the Toeplitz matrixTn. Then the eigenvalues

λ

j

(

Tn

)

,j

=

0

,

1

, . . . ,

n

−

1, of the Toeplitz matrixTnare equally distributed asf

(

2

π

j

/

n

)

; see [11].

The equal distribution of the eigenvalues of the Toeplitz matrices indicates that the eigenvalues are not clustered in general. Hence the Lanczos method converges slowly when it is applied to compute the minimum eigenpair of these Toeplitz matrices.

In order to speed up the convergence of the Lanczos method, we construct a sine transform-based preconditioner for this class of Toeplitz matrices.

Then-by-ndiscrete sine transform matrixΨn

=

(

[

Ψn

]

_ij

)

is defined by

[

Ψn

]

_ij

=

r

2 n

+

1sin

π

jk n

+

1

,

i

≤

j

,

k

≤

n

.

We note thatΨnis symmetric and orthogonal, i.e.,Ψn

=

Ψ_nTandΨnΨ_nT

=

Ψ_nTΨn

=

In, hereIndenotes then-by-nidentity matrix.

For anyn-dimensional vector

v

, the matrix–vector productΨn

v

can be done inO

(

nlogn

)

real operations by the fast sine transform (FST) [21,22].

Let_Sbe a linear space over the field of real numbers containing alln-by-nmatrices that can be diagonalized by the discrete sine transform matrixΨn, i.e.,

S

= {

ΨnΛnΨn

|

Λnis diagonal

}

.

Given ann-by-nmatrixAn, we are interested in finding a matrixSn

∈

Swhich minimizes

{k

S

−

An

k

F

|

S

∈

S

}

, where

k • k

F is the Frobenius norm. The minimizer denoted bySnis called the optimal sine transform approximation toAn. The following lemma gives some basic property ofSn.

Lemma 3.1 (Chan, Ng, and Wong [3]). Let Anbe an n-by-n symmetric matrix and Snbe the minimizer of

{k

S

−

An

k

F

|

S

∈

S

}

.

Then Snis uniquely determined by Anand is given by

Sn

=

Ψn∆nΨn

,

(3)

where∆ndenotes the diagonal matrix whose diagonal entries are equal to those of the matrixΨnAnΨn, i.e.,

diag

(

∆n

)

=

diag

(

ΨnAnΨn

).

(4)

λ

min

(

An

)

≤

λ

min

(

∆n

)

≤

λ

max

(

∆n

)

≤

λ

max

(

An

).

(5) In particular, if Anis positive definite, the Snis positive definite, too.

We note that formingSnby computing all the diagonal entries ofΨnAnΨnas in(4)requiresO

(

n2logn

)

operations. [3] gave another approach for constructingSn, which reduces the cost toO(n2

)

operations. Before describing how the matrix Snis formed, we letUi

,

i

=

1

, . . . ,

n, be then-by-nmatrices with the

(

j

,

k

)

th entry being given by

[

Ui

]

jk

=











1

,

if

|

j

−

k

| =

i

−

1

,

−

1

,

ifj

+

k

=

i

−

2

,

−

1

,

ifj

+

k

=

2n

−

i

+

3

,

0

,

otherwise

.

It is evident thatUiis a sparse matrix with at most 2nnonzero entries. We further define

r

=

(

1T_n

(

U1

◦

An

)

1n

,

1Tn

(

U2

◦

An

)

1n

, . . . ,

1Tn

(

Un

◦

An

)

1n

)

T

,

(6)

where1n

= [

1

,

1

, . . . ,

1

]

Tis ann-dimensional vector and

◦

is the Hadamard product symbol. Now, we can give an explicit formula for computing the entries of the minimizerSn.

Lemma 3.2 (Chan, Ng, and Wong [3]).Let An

= [

ajk

]

be an n-by-n symmetric matrix and Snbe the minimizer of

{k

Sn

−

An

k

F

|

S

∈

S

}

. Denote the first column of Snby s. If roand reare, respectively, the sums of the odd and the even index entries of r where the vector r is given in(6), then we have

[

s

]

₁

=

1 2

(

n

+

1

)

(

2

[

r

]

1

− [

r

]

3

),

[

s

]

i

=

1 2

(

n

+

1

)

(

[

r

]

i

− [

r

]

i+2

),

i

=

2

, . . . ,

n

−

2

.

In particular

[

s

]

₁

=

1 2

(

n

+

1

)

(

ro

+ [

r

]

n−1

),

[

s

]

i

=

1 2

(

n

+

1

)

(

2re

+ [

r

]

n

)

if n is even, and

[

s

]

₁

=

1 2

(

n

+

1

)

(

re

+ [

r

]

n−1

),

[

s

]

i

=

1 2

(

n

+

1

)

(

2ro

+ [

r

]

n

)

if n is odd.

IfAnhas no special structure, then the vectorrcan be computed inO

(

n2

)

operations asUi is sparse with onlyO

(

n

)

nonzero entries. Therefore,Sncan be computed inO

(

n2

)

operations. However, ifAnis a Toeplitz matrix, then the cost can be reduced toO

(

n

)

operations. In particular, if

[

t0

,

t1

, . . . ,

tn−1

]

Tis the first column of ann-by-nsymmetric Toeplitz matrix, then the first column of the corresponding optimal sine transform-based preconditionerSnis given by

sk

=











t0

−

n

−

2 n

+

1

t2

,

fork

=

1

,

t1

−

n

−

3 n

+

1

t3

,

fork

=

2

,

n

−

k

+

3 n

+

1

tk−1

−

n

−

k

−

1 n

+

1

tk+1

,

fork

=

3

,

4

, . . . ,

n

−

2

,

4 n

+

1

tn−2

,

fork

=

n

−

1

,

3 n

+

1

tn−1

,

fork

=

n

.

LetDdenote the diagonal matrix whose diagonal entries are equal to the first column ofΨn(i.e.,Ψne1

=

D1n). Then it is easy to proveD−1_Ψn_S

ne1

=

∆n1n; see [3]. Hence,Snis determined by its first column and∆ncan be constructed in

O

(

nlogn

)

flops.

4. The optimal sine transform-based Lanczos algorithm

(5)

Algorithm 2. The Preconditioned Lanczos Method for (An

,

Bn)

1 Input an integerm

≥

1 and an initial approximationx₀with

k

x₀

k =

1

ρ

n(0)

=

xT0Ax0

/

xT0Bx0;

Fork

=

0

,

1

,

2

, . . .

until convergence. ConstructSn(k)

=

Ψn

(

∆An

−

ρ

(k) n ∆Bn

)

Ψn

=

Q (k) n Qn(k)T. 5 ComputeW_n(k)

=

Q_n(k)−1

(

A_n

−

ρ

_n(k)B_n

)

Q_n(k)−TandB

ˆ

_n

=

Q_n(k)−1B_nQ_n(k)−T. Construct the Krylov subspace basisZm

= [

z0

,

z1

, . . . ,

zm

]

forKm

(

Wn(k)

,

Qn(k)Txk

)

. FormWm

=

ZmTW

(k)

n Zm,Bm

=

ZmTB

ˆ

nZm.

Find the smallest eigenpair

(µ

1

, v

1

)

for

(

Wm

,

Bm

)

Let

(µ

1

,

yk

)

be the smallest Ritz pair for

(

Wn(k)

,

B

ˆ

n

)

and setyk

=

Zm

v

1. 10

ρ

n(k+1)

=

(

xTk+1Anxk+1

)/(

xTk+1Bnxk+1

)

=

ρ

n(k)

+

µ

1,xk+1

=

Q(k) −T n yk

=

Q(k) −T n Zm

v

1 End.

InAlgorithm 2, we may consider a basisZmthat is orthogonal under a certain inner product. Such a basis of the Krylov subspace forWn(k)is typically constructed through an iterative method, which is called the inner iteration. The original iteration ofAlgorithm 2is then called the outer iteration.

The outer loop of the method updates a certain preconditioned shifted matrix and the inner loop applied the Lanczos method to the preconditioned shifted matrixWn(k). We compute the smallest Ritz value and the corresponding Ritz vector of

(

Wn(k)

,

B

ˆ

m

)

and then transform it back to getxk+1, an approximate eigenvector of

(

An

,

Bn

)

. The Rayleigh quotient

ρ

k+1of xk+1is an approximate eigenvalue of

(

An

,

Bn

)

.

In order to use the FST algorithm,Sn(k)does not need to be computed explicitly. The total cost of the process isO

(

nmlog

(

n

))

operations.

As we know, the number of inner iterationsmis also important to the whole algorithm. Theoretically, asmincreases, the number of the outer iterations decreases rapidly. But, in inexact calculation, if the preconditioner is good enough, the dimension of the Krylov subspace is not very large, and a smallmcould also lead to a quick convergence, which saves cost in the inner iterations.

In the following, we present a special case in whichm

=

1.

Algorithm 3. The Preconditioned Lanczos Method for (An

,

Bn) whenm

=

1 1 Input an initial approximationx0with

k

x0

k =

1

;

ρ

n(0)

=

xT0Ax0

/

xT0Bx0

;

Fork

=

0

,

1

,

2

, . . .

until convergence. ConstructSn(k)

=

Ψn

(

∆An

−

ρ

(k) n ∆Bn

)

Ψn

=

Q (k) n Qn(k)T. 5 SetWn(k)

=

Q(k) −1 n

(

An

−

ρ

n(k)Bn

)

Q(k) −T n andB

ˆ

n

=

Q(k) −1 n BnQ(k) −T n . Set

w

0

=

Qn(k)Txk,z0

=

w

0

/

k

w

0

k

.

w

1

=

Wn(k)z0,

α

1

=

(w

1

,

z0

)

w

1

=

w

1

−

α

1z0,

β

2

= k

w

1

k

z1

=

w

1

/β

2 10

w

2

=

Wn(k)z1

−

β

2z0,

α

2

=

(w

2

,

z1

)

SetW1

=

(

α_β1 β2 2 α2

)

andB1

=

Z T 1B

ˆ

nZ1.

The smallest eigenpair

(µ

1

, v

1

)

for

(

W1

,

B1

)

can be easily got by solving a monadic quadratic equation. Let

(µ

1

,

yk

)

be the smallest Ritz pair for

(

Wn(k)

,

B

ˆ

n

),

yk

=

Z1

v

1.

15

ρ

_n(k+1)

=

(

xT_k₊₁A_nx_k+1

)/(

xTk+1Bnxk+1

)

=

ρ

n(k)

+

µ

1,xk+1

=

Q(k) −T n Z1

v

1 End.

Algorithm 3is a transmutation ofAlgorithm 2under the condition thatm

=

1. Under this circumstance, we can easily give the presentation of the Krylov subspace basis.

In lines 6–9, we construct the orthonormal basisZ1

=

(

z0

,

z1

)

for the Krylov subspaceK1

=

span

{

w

0

,

Wn(k)

w

0

}

. In line 12, we give the form ofW1andB1. It is easy to get the minimum eigenvalue of

(

W1

,

B1

)

by solving a monadic quadratic equation.

We can see from numerical results thatAlgorithm 3converges sufficiently fast and costs less computing time.

5. Convergence analysis

We first prove the following lemma, which is essential for proving the convergence of the preconditioned Lanczos method.

(6)

Lemma 5.1. Let

{

An

}

be a sequence of n-by-n symmetric matrices and let∆An

=

diag

(

ΨnAnΨn

)

,

{

Bn

}

be a sequence of n-by-n symmetric positive definite matrices, and∆B_n

=

ΨnBnΨn. If

λ

min

(

An

,

Bn

)

≤

ρ

n(0)

< α/β,

where

α

= [

∆A_n

]

_kkand

β

= [

∆B_n

]

_kksatisfy

α

−

ρ

_n(0)

β

≤ [

∆A_n

]

_jj

−

ρ

(0)_n

[

∆B_n

]

_jj

,

j

=

1

, . . . ,

n, and the preconditioners S(k)n are constructed by

S_n(k)

=

Ψn∆A_nΨn

−

ρ

_n(k)Ψn∆B_nΨn

,

k

≥

0

,

with

ρ

n(0)being the starting value and Sn(k)being generated byAlgorithm2, then the matrix Sn(k)is symmetric positive definite for all k

≥

0. Moreover, if infn

|

α

−

ρ

n(0)

β

| ≥

δ >

0, with

δ

a constant independent of n, then

k

S(k)

−1

n

k

2is uniformly bounded for all n and k.

Proof. Sn(k)can be written asΨn

(

∆An

−

ρ

(k)

n ∆Bn

)

Ψn, where∆An

−

ρ

(k)

n ∆Bnis a diagonal matrix. Obviously,S

(k)T n

=

Sn(k). Fork

=

0, by using(4)and

ρ

n(0)

<

min(₍∆_∆An)jj

Bn)jj, where

(

∆Bn

)

jj

=

e

T

jΨnBnΨnej

>

0, we getSn(0)

>

0. Fork

>

0, note fromAlgorithm 2that

ρ

(k+1) n

=

xT_k₊₁Anxk+1 xT k+1Bnxk+1

=

x T k+1

(

An

−

ρ

n(k)Bn

)

xk+1 xT k+1Bnxk+1

+

ρ

_n(k)

=

y T kW (k) n yk yT_kB

ˆ

nyk

+

ρ

_n(k)

=

µ

₁

+

ρ

_n(k)

,

where

(µ

1

,

yk

)

is the smallest Ritz pair for

(

Wn(k)

,

B

ˆ

n

)

. SinceBn

0,B

ˆ

n

=

Q(k)

−1 n BnQ(k)

−T

n is also a symmetric positive definite matrix. So, by lettingB

ˆ

n

=

LLT, we see that

µ

1is the smallest eigenvalue ofL−1_W(k)

n L−T.

Let

σ

1be the smallest eigenvalue ofWn(k). Then fromAlgorithm 2we can getWm

=

ZmTW (k) n Zm. So

[

W_n(k)

]

₁₁

=

z₁TWkz1

=

1

k

Qn(k)Txk

k

2

(

Q_n(k)Txk

)

TWn(k)

(

Qn(k)Txk

)

=

1

k

Qn(k)Txk

k

2 xT_k

(

An

−

ρ

n(k)Bn

)

xk

=

0

.

We can get

σ

1

≤

0 by using the Cauchy interlace theorem. By using the congruence property of the transformation the above result leads to

µ

1

≤

0. Hence

{

ρ

n(k)

}

is a nonincreasing sequence. This shows thatSn(k)

0.

Next, we estimate the bound for

k

Sn(k)−1

k

2.

k

S_n(k)−1

k

₂

= k

(

Ψn

(

∆A_n

−

ρ

_n(k)∆B_n

)

Ψn

)

−1

k

₂

= k

Ψn

((

∆A_n

−

ρ

_n(k)∆B_n

))

−1Ψn

k

₂

=

1 inf n

|

α

−

ρ

(k) n

β

|

≤

1 inf n

|

α

−

ρ

_n(0)

β

|

.

If infn

|

α

−

ρ

(0)n

β

| ≥

δ >

0 and

δ

is independent ofn, then

k

S(k) −1

n

k

2

≤

1_δ. Since

λ

min

(

An

,

Bn

)

=

minx∈Rn

xTAx

xT_Bx, we easily see that

λ

min

(

An

,

Bn

)

≤

α/β

. If

λ

min

(

An

,

Bn

)

=

α/β

, then it is already done. Otherwise,

λ

min

(

An

,

Bn

) < α/β

and there must exist somex0that can satisfy the assumption inLemma 5.1.

We note fromAlgorithm 2that the Cholesky factorization for the preconditionerSn(k)is needed, i.e.,Sn(k)

=

Qn(k)Qn(k)T. In our case, sinceSn(k)is proved to be symmetric positive definite and can be diagonalized by the discrete sine transform matrixΨn, the matrixQn(k)is given by

Q_n(k)

=

Ψn

(

∆A_n

−

ρ

_n(k)∆B_n

)

1/2

.

Note that there are no extra computations for obtaining the Cholesky factor.

We can further show that the spectra of the preconditioned Toeplitz matrixWn(k)

=

Q(k) −1

n

(

An

−

ρ

n(k)Bn

)

Q(k) −T n are clustered at 1 at the end of the preconditioned Lanczos iteration.

Theorem 5.2. Let f be a continuous function defined on

[

0

, π

]

and

{

An

}

be a sequence of Toeplitz matrices generated by f ; let g be a positive continuous function defined on

[

0

, π

]

and

{

Bn

}

be a sequence of Toeplitz matrices generated by g. If infn

|

α

−

ρ

n(0)

β

| ≥

δ >

0, where

δ

is a constant independent of n, then for any given

>

0, there exist positive integers N1and N2such that for all n

≥

N1and k

≥

0, at most N2eigenvalues of the matrix W(

k) n

=

Q( k)−1 n

(

An

−

ρ

( k) n Bn

)

Q( k)−T n lie outside the interval

(

1

−

,

1

+

)

.

(7)

Proof. By using the matrix decompositionSn(k)

=

Q( k) n Q(

k)T

n , we note that the matrix W_n(k)

=

Q_n(k)−1

(

An

−

ρ

n(k)Bn

)

Qn(k)−T

is similar to the matrix

Q_n(k)−TW_n(k)Q_n(k)T

=

S(_nk)−1

(

An

−

ρ

n(k)Bn

),

which is equal toIn

+

S(k)

−1

n

[

(

An

−

ρ

n(k)Bn

)

−

S(k)n

]

.

SinceAnandBnare Toeplitz matrices, it is easy to see thatCn

=

(

An

−

ρ

n(k)Bn

)

is also a Toeplitz matrix. ∆An

−

ρ

( k) n ∆Bn

=

diag

(

ΨnAnΨn

)

−

ρ

( k) n diag

(

ΨnBnΨn

)

=

diag

(

ΨnAnΨn

−

ρ

_n(k)ΨnBnΨn

)

=

diag

[

Ψn

(

An

−

ρ

n(k)Bn

)

Ψn

]

.

From Lemma 6 in Chan [3] we know that for all

>

0 there exist two positive integersN1,N2such that for

∀

n

>

N1 andk

≥

0, at mostN2eigenvalues of

[

An

−

ρ

(k)n Bn

−

Sn(k)

]

have absolute values large than

. In addition, fromLemma 5.1,

k

S(k)n −1

k

2is uniformly bounded, which leads to the result that the spectra ofS(k) −1

n

(

An

−

ρ

n(k)Bn

)

are clustered at 1. We remark that the spectra ofSn(k)−1

(

An

−

ρ

n(k)Bn

)

andWn(k)are equal and it is clear that the spectra ofWn(k)are clustered at 1.

The following theorem (see [8], Theorem 3.4) gives the convergence rate for the smallest eigenvalue in the Lanczos method for

(

A

,

B

)

in the exact arithmetic.

Theorem 5.3 (Golub, Ye [8]).Let

λ

1

< λ

2

≤ · · · ≤

λ

nbe the eigenvalues of

(

A

,

B

)

and

(ρ

k+1

,

xk+1

)

be the approximate eigenpair obtained by the inverse-free Krylov subspace method from

(ρ

k

,

xk

)

. Let

σ

1

< σ

2

≤ · · · ≤

σ

nbe the eigenvalues of A

−

ρ

kB and u1be a unit eigenvector corresponding to

σ

1. Assume

λ

1

< ρ

k

< λ

2. Then

ρ

k+1

−

λ

1

≤

(ρ

k

−

λ

1

)

m2

+

2

(ρ

k

−

λ

1

)

3/2

m

k

B

k

σ

2

1/2

+

δ

_k

,

where 0

≤

δ

_k

≡

ρ

_k

−

λ

₁

+

σ

1 uT₁Bu1

=

O

((ρ

k

−

λ

1

)

2

)

and

m

=

min p∈Pm,p(σ1)=1 max i6=1

|

p

(σ

i

)

|

with Pmthe set of all polynomials of degree not greater than m.

According toTheorem 5.3, the convergence rate for the smallest eigenvalue depends on

m, which can be bounded in terms of

σ

ias

m

≤

2

1

−

√

ψ

1

+

√

ψ

m with

ψ

=

σ

2

−

σ

1

σ

n

−

σ

1

.

Then the convergence speed depends on the eigenvalue distribution ofAn

−

ρ

n(k)Bn, not those of

(

An

,

Bn

)

as in the Lanczos method. This difference gives us a good opportunity to accelerate the convergence by equivalent transformations that change the spectra ofAn

−

ρ

n(k)Bnwithout changing the spectra of

(

An

,

Bn

)

.

By usingTheorem 5.3, we can select a polynomial that annihilates the

(

N2

−

1

)

extreme eigenvalues of the preconditioned Toeplitz matrix and is large at

σ

1in comparison with its value at the remaining clustered eigenvalues between 1

−

and 1

+

. So

ψ

=

σ2−σ1

σn−σ1

≈

1, and the preconditioned method converges to the minimum eigenvalue of

(

An

,

Bn

)

quadratically. This is precisely summarized in the following corollary.

Corollary 5.4. Let f be a continuous function defined on

[

0

, π

]

and

{

An

}

be a sequence of Toeplitz matrices generated by f ; let g be a positive continuous function defined on

[

0

, π

]

and

{

Bn

}

be a sequence of Toeplitz matrices generated by g. If infn

|

α

−

ρ

n(0)

β

| ≥

δ >

0, where

δ

is a constant independent of n, then for any given

>

0, there exist positive integers N1and N2such that for all n

≥

N1and k

≥

0we have

ρ

k+1

−

λ

1

≤

O((ρk

−

λ

1

)

2

)

+

4

(ρ

k

−

λ

1

)

1

−

√

ψ

1

+

√

ψ

2m

+

4

(ρ

k

−

λ

1

)

3/2

1

−

√

ψ

1

+

√

ψ

m

k

B

k

1

+

1/2

,

(7) where

ψ

=

1−σ1− 1−σ1+.

(8)

Fig. 1. The convergence behavior of IPL (solid), PIK (dashed) and standard Lanczos method (dotted) forExample 6.1.

According toCorollary 5.4,

ψ

≈

1 where

|

σ

₁

|

is much larger than

. Therefore, the last two terms on the right-hand side of the inequality(7)approximate to 0 and, hence,

ρ

k+1

−

λ

1.O

(ρ

k

−

λ

1

)

2

.

The cost at each step of the preconditioned Lanczos iteration is about 8n operations plus computing Qn(k)−1

(

An

−

ρ

(k)n Bn

)

Q(k) −T

n

v

for a vector

v

. The matrix–vector multiplicationsQ(k) −1

n

v

andQ(k) −T

n

v

can be done inO

(

nlogn

)

operations by using FST [21,22].

6. Numerical results

We perform the new inverse-free preconditioned Lanczos method (IPL) for generalized Toeplitz eigenvalue problems by using optimal sine transform-based [1] preconditioner for computing the minimum eigenvalue of a symmetric positive definite Toeplitz matrix pencil, and also compare the numerical behavior of the IPL with the standard Lanczos method for Toeplitz matrix pencil and the preconditioned inverse-free Krylov subspace method (PIK) derived by Golub and Ye [8].

(9)

Fig. 2. The convergence behavior of IPL (solid), PIK (dashed) and standard Lanczos method (dotted) forExample 6.2.

We ran numerical tests on anIntel Pentium IV2.4 GHz with memory 512MB and the machine precisioneps

=

2

.

2204

×

10−16_{by using}_Matlab_{7.0.4 on a Window XP-based system. In all examples, we have used the same randomly} generated starting vector.

Example 6.1. The matrix pencil

(

An

,

Bn

)

is defined as follows.Anis derived by the even function

θ

2defined on

[

0

, π

]

, and the symmetric positive definite Toeplitz matrixBn

=

(

[

Bn

]

jk

)

is given by

[

Bn

]

jk

=

bj−k

=











1

+

π

4 5

,

forj

=

k

,

(

−

1

)

|j−k|

4

π

2

|

j

−

k

|

2

−

24

|

j

−

k

|

4

,

forj

6=

k

,

which is derived by the even function

θ

4

+

1 defined on

[−

π, π

]

. In the tests, the required residual tolerance is taken to be 1

×

10−11_.

We should mention that there exists an initialx0such that

ρ

n(0)

≤

α_β, which satisfies the condition inLemma 5.1.

Example 6.2. The matrix pencil

(

An

,

Bn

)

is defined as follows.Anis derived by the even function

θ

2

−

1 defined on

[−

π, π

]

, andBn

=

(

[

Bn

]

jk

)

is the Kac–Murdock–Szegö (KMS) matrix given by

[

Bn

]

jk

=

bj−k

=

η

|j−k|

,

0

< η <

1

which is derived by the positive continuous function f

(θ)

=

1

−

η

2 1

−

2

η

cos

θ

+

η

2

defined on

[

0

, π

]

, see [11,12]. In the test, the required residual tolerance is taken to be 1

×

10−11.

Remark 6.3. Here we should notice the selection ofmis important to both IPL and PIK. Asmincreases, the number of the outer iterations decreases rapidly, but, the calculation cost increases. So we are interested in the balance point between convergence rate and calculation cost. In both algorithms, ifmis not given, we first selectm

=

2, and let it change during the iteration based on the current convergence rate. The upper bound ofmis min

(

n

−

1

,

128

)

.

(10)

Fig. 3. The convergence behavior of IPL (m=1) (solid) and PIK (dotted) forExample 6.4.

Figs. 1and2andTable 1show the convergence behavior of the IPL, the PIK and the standard Lanczos method. We see from these figures that the standard Lanczos method converges very slowly, and it even cannot reach the given precision for many cases, seeTable 1. The numbers of the outer iterations required for the IPL and the PIK are significantly less than those required for the standard Lanczos method. Moreover, the relative error of the IPL method is much less than that of the standard Lanczos method. Therefore, the optimal sine transform-based preconditioner can remarkably speed up the convergence rate of the Lanczos method.

Comparing the IPL method with the PIK method fromFigs. 1and2we can see that the former converges fast than the latter for large matrices. It then follows that the IPL method using FST costs much less matrix–vector multiplications than the PIK method.

Example 6.4. We consider the matrix pencil problem in Example 6.1with two conditions, m

=

1 and mselected

automatically by the condition of the problem in the algorithm. The required residual tolerance is taken to be 1

×

10−11, and the initial vectors are the same as inExample 6.1.

(11)

Table 1

Contrast between IPL, PIK and IPL(m=1)for the Problem inExample 6.1

n IPL PIK IPL(m=1)

Iter Cpu Iter Cpu Iter Cpu

127 51 2.09 91 2.2 43 1.84

255 106 6.56 132 13.31 106 6.43

511 78 33.13 153 99.44 79 32.96

1023 103 385.2 110 310.4

FromFig. 3, we can see that the convergence behavior of IPL (m

=

1) is nearly as good as PIK. Themin PIK is chosen as we said inRemark 6.3which ensures thatmis not very big but has a good convergence rate. This shows that a smallmcan give a very good convergence rate and inexpensive calculation cost.

All the examples show that the sine transform-based preconditioner is a good preconditioner for the generalized symmetric Toeplitz eigenvalue problem.

References

[1] E. Anderson, Z. Bai, C. Bischof, L.S. Blackford, J. Demmel, J.J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, D. Sorensen, LAPACK Users’ Guide, 3rd, SIAM, 1999.

[2] Z. Bai, J. Demmel, J. Dongarra, A. Ruhe, H. Vorst (Eds.), Templates for the Solution of Algebraic Eigenvalue Problems: A Practical Guide, SIAM, Philadelphia, 2000.

[3] R. Chan, M. Ng, C. Wong, Sine transform based preconditioners for symmetric Toeplitz systems, Linear Algebra Appl. 232 (1996) 237–259. [4] G. Cybenko, C. Van Loan, Computing the minimum eigenvalue of a symmetric positive definite Toeplitz matrix, SIAM J. Sci. Stat. Comput. 7 (1986)

123–131.

[5] J. Demmel, Applied Numerical Linear Algebra, SIAM, Philadelphia, 1997.

[6] M.P. Fargust, A.A. Beex, Fast order-recursive generalized Hermitian Toeplitz eigenspace decomposition, Math. Control Signals Systems 4 (1991) 99–117.

[7] G. Golub, C. Van Loan, Matrix Computations, 2nd ed., The Johns Hopkins University Press, Baltimore, MD, 1989.

[8] G. Golub, Q. Ye, An inverse free preconditioned krylov subspace method for symmetric generalized eigenvalue problems, SIAM J. Sci. Comput. 24 (2002) 312–334.

[9] G. Golub, Q. Ye, Inexact inverse iterations for the generalized eigenvalue problems, BIT-Numerical Mathematics 40 (2000) 672–684.

[10] G. Golub, Z. Zhang, H. Zha, Large sparse symmetric eigenvalue problems with homogeneous linear constraints: The Lanczos process with inner–outer iterations, Linear Algebra Appl. 309 (2000) 289–306.

[11] U. Grenander, G. Szegö, Toeplitz Forms and Their Applications, 2nd ed., Chelsea Publishing, New York, 1984.

[12] M. Kac, W. Murdock, G. Szegö, On the eigenvalues of certain Hermitian forms, J. Rational Mech. Anal. 2 (1953) 1264–1271. [13] The MathWorks, Inc. MATLAB 7, September 2004.

[14] R. Morgan, D. Scott, Preconditioning the Lanczos algorithms for sparse symmetric eigenvalue problems, SIAM J. Sci. Comput. 14 (1993) 585–593. [15] K. Neymeyr, A geometric theory for preconditioned inverse iteration, I: Extrema of the Rayleigh quotient, Linear Algebra Appl. 322 (2001) 61–85. [16] M. Ng, Fast iterative methods for solving Toeplitz-plus-Hankel least squares problems, Electron Trans. Numer. Anal. 2 (1994) 23–39.

[17] M. Ng, Preconditioned Lanczos methods for the minimum eigenvalue of a symmetric positive definite Toeplitz matrix, SIAM J. Sci. Comput. 21 (2000) 1973–1986.

[18] Y. Saad, Numerical Methods for Large Eigenvalue Problems, Manchester University Press, Manchester, UK, 1992. [19] Y. Saad, Iterative Methods for Sparse Linear Systems, PWS, Boston, 1996.

[20] D. Sorensen, Implicit application of polynomial filters in ak-step Arnoldi method, SIAM J. Matrix Anal. Appl. 13 (1992) 357–385.

[21] P. Yip, K. Rao, Fast decimation-in-time algorithms for a family of discrete sine and cosine transforms, Circuits Systems Signal Process. 3 (1984) 387–408. [22] P. Yip, K. Rao, A fast computational algorithm for the discrete sine transform, IEEE Commun. Trans. 28 (1980) 304–307.

ScienceDirect

doi:10.1016/j.cam.2008.05.023