Local two-sided bounds for eigenvalues of self-adjoint operators

(1)

Strathprints Institutional Repository

Barrenechea, Gabriel and Boulton, Lyonell and Boussaid, Nabile (2016)

Local two-sided bounds for eigenvalues of self-adjoint operators.

Numerische Mathematik. ISSN 0029-599X ,

http://dx.doi.org/10.1007/s00211-016-0822-1

This version is available at http://strathprints.strath.ac.uk/56730/

Strathprints is designed to allow users to access the research output of the University of Strathclyde. Unless otherwise explicitly stated on the manuscript, Copyright © and Moral Rights for the papers on this site are retained by the individual authors and/or other copyright owners. Please check the manuscript for details of any other licences that may have been applied. You may not engage in further distribution of the material for any profitmaking activities or any commercial gain. You may freely distribute both the url (http://strathprints.strath.ac.uk/) and the content of this paper for research or private study, educational, or not-for-profit purposes without prior permission or charge.

Any correspondence concerning this service should be sent to Strathprints administrator:

(2)

Numer. Math.

DOI 10.1007/s00211-016-0822-1

Mathematik

Local two-sided bounds for eigenvalues of self-adjoint

operators

G. R. Barrenechea1 · L. Boulton2 · N. Boussaïd3

Received: 20 January 2015 / Revised: 27 April 2016

Abstract We examine the equivalence between an extension of the Lehmann– Maehly–Goerisch method developed a few years ago by Zimmermann and Mertins, and a geometrically motivated method developed more recently by Davies and Plum. We establish a general framework which allows sharpening various previously known results in these two settings and determine explicit convergence estimates for both methods. We demonstrate the applicability of the method of Zimmermann and Mertins by means of numerical tests on the resonant cavity problem.

Mathematics Subject Classification 65M60_·65L60_·65L15_·65N12

1 Introduction

In this work we study in close detail the equivalence between two pollution-free techniques for numerical computation of eigenvalue bounds for general self-adjoint

B

G. R. Barrenechea [email protected] L. Boulton [email protected] N. Boussaïd [email protected]

1 _{Department of Mathematics and Statistics, University of Strathclyde, 26 Richmond Street,} Glasgow G1 1XH, Scotland

2 _{Department of Mathematics and Maxwell Institute for Mathematical Sciences,} Heriot-Watt University, Edinburgh EH14 4AS, UK

3 _{Laboratoirde de Mathématiques, UFR Sciences et Techniques, Université de Franche-Comté,} 16, route de Gray, Besançon 25030, France

(3)

operators: a method considered a few years ago by Zimmermann and Mertins [35], and a method developed more recently by Davies and Plum [23]. These two methods are pollution-free by construction and have been proven to provide reliable numerical approximations.

The approach of Zimmermann and Mertins is built on an extension of the Lehmann– Maehly–Goerisch method [4,26,33] and it has proven to be highly successful in various concrete applications. These include the computation of bounds for eigenvalues of the radially reduced magnetohydrodynamics operator [15,35], the study of complemen-tary eigenvalue bounds for the Helmholtz equation [6] and the calculation of sloshing frequencies [4,5].

The method of Davies and Plum on the other hand, is based on a notion of approxi-mated spectral distance and is highly geometrical in character. Its original formulation dates back to [21–23] but it is yet to be tested properly on models of dimension larger than one.

In this work we follow the analysis conducted in [23, Section 6] where the equiv-alence of both these techniques was formulated in a precise manner. Our main goal is two-fold. On the one hand we examine more closely the nature of this equivalence by considering multiple eigenvalues. On the other hand we determine sharp estimates for both methods. These results include convergence and error estimates for both the eigenvalues and associated eigenfunctions. We finally illustrate the applicability of the method of Zimmermann and Mertins using the Maxwell eigenvalue problem as benchmark.

1.1 Context, scope and contribution of the present work

The computational approach considered in this work has a “local” character, in the sense that a shift parameter should be set before hand. The methods derived from this approach only provide information about the spectrum in a vicinity of this parameter, in similar fashion as the Galerkin method gives information only about the eigenvalues below the bottom of the essential spectrum. They give upper bounds for the eigenvalues to the right of the parameter and lower bounds for the eigenvalues to the left of it.

The method of Davies and Plum primarily relies on the geometrical properties of a notion of approximated spectral distance. We introduce this notion in Sect.3. Our Proposition2was first formulated in [21, theorems 3 and 4]. These statements played a fundamental role in the proof of [23, Theorem 11] which provided crucial connections with the method of Zimmermann and Mertins. In Proposition5and Corollary6we establish an extension of [21, theorems 3 and 4] allowing multiple eigenvalues. These rely on convexity results due to Danskin (see Lemma4and [8, Theorem D1]) and they are of fundamental importance in various parts of our analysis.

Our Lemma9follows the original [23, Theorem 11] and its proof involves very similar arguments. In conjunction with Corollary6, it leads to an alternative proof of [35, Theorem 1.1] which includes multiplicity counting. The latter is the central statement of what we call the method of Zimmermann and Mertins. This alternative derivation of the method is formulated in our main Theorem10and Corollary11.

(4)

Theorems13and14, and Corollary15, are precise formulations of convergence in the setting of the method of Davies and Plum. The two theorems differ from one another in that a higher order of approximation occurs when the shift parameter is away from the spectrum. In Theorem16we show that, remarkably, the method of Zimmermann and Mertins always renders the higher order of approximation as a consequence of Corollary15. This is, for instance, in great agreement with the results presented in [34], which compare the errors in Lehmann–Goerisch and Rayleigh–Ritz bounds (see also [28], where convergence of iterative solvers is studied).

In Proposition7we establish upper bounds for error estimates for eigenfunctions in terms of spectral gaps. This statement is related to similar results of Weinberger [32] and Trefftz [30]. See also [33, Chapter 5]. The precise connection between Proposi-tion7and all these results is unclear at present and will be examined elsewhere.

The model of the isotropic resonant cavity that we consider in Sect.6has been well-documented to render spectral pollution when the classical Galerkin method and finite elements of nodal type are employed for numerical approximation. We show by means of numerical tests that, remarkably, the method of Zimmermann and Mertins provides robust and accurate approximations of the eigenvalues of the Maxwell operator even when implemented on standard Lagrange elements. By construction, this method is free from spectral pollution. A more systematic investigation in this respect with many more numerical tests (including anisotropic media), a convergent algorithm and a reference to a fully reproducible computer code can be found in [3].

Preliminary information on the number of eigenvalues in a given interval, which might or might not be available in practice, allows the determination of enclosures from the one-sided bounds produced by the approaches discussed in this work. Convergence also yields enclosures in suitable asymptotic regimes. The algorithm described in [3] is an example of a concrete realisation of this assertion.

1.2 Outline of the analysis

Section2 includes the notational conventions and assumptions which will be used throughout this work. Section3sets the general framework of approximated spec-tral distances and their geometrical properties. There we also discuss approximation of eigenspaces with explicit estimates. The method of Zimmermann and Mertins is derived in Sect.4and its convergence is established in Sect.5. These two sections comprise the main contribution of this work. The final Sect.6is devoted to illustrating a concrete computational application of the method of Zimmermann and Mertins to the resonant cavity problem.

2 Preliminary notation, conventions, and assumptions

LetA_:D(A)_−→Hbe a self-adjoint operator acting on a Hilbert spaceH. Decom-pose the spectrum of A in the usual fashion, as the disjoint union of discrete and essential spectra, σ (A) ₌ σdisc(A)∪σess(A). Let J be any Borel subset of R.

Below the spectral projector associated to Ais denoted by1J(A)=

JdEλ, so that

(5)

E_λ(A)₌E_{_λ_}(A). GenerallyE_J(A)_⊆1J(A)H, however there is no reason for these

two subspaces to be equal except when the spectrum withinJ is only pure point. Everywhere belowt_∈R_{will denote a scalar parameter. This is the shift parameter} which is intrinsic to the methods.

Letlt :D(A)×D(A)−→Cbe the (not necessarily closed) bi-linear form

associ-ated to(A₋t),

lt(u, w)= (A−t)u, w ∀u, w∈D(A).

Letqt :D(A)×D(A)−→Cbe the closed bi-linear form

qt(u, w)= (A−t)u, (A−t)w ∀u, w∈D(A). (1)

For anyu _∈ D(A)we will constantly refer to the followingt-dependant semi-norm, which is a norm iftis not an eigenvalue,

|u_|t =qt(u,u)1/2= (A−t)u. (2)

By virtue of the min–max principle,qt characterises the part of the spectrum of the

positive operator(A₋t)2which lies near the origin. As we shall see next, this gives rise to a notion of local counting function attfor the spectrum of A.

Let d_j(t)₌ inf dimV₌j V⊂D(A) sup u∈V |u_|t u (3)

so that 0≤d_j(t)_≤ d_k(t)for j <k. Thend₁(t)is the Hausdorff distance fromtto

σ (A),

d₁(t)₌min{|λ₋t_{| :}λ_∈σ (A)_{} =} inf

u_∈D(A)

|u_|t

u. (4)

Similarlyd_j(t)are the distances from t to the jth nearest point inσ (A)counting multiplicity but in a generalised sense. That is, the sequence (d_j(t))j∈N becomes

stationary when it attains the distance fromtto the essential spectrum. Moreover

d_j(t)₌d_j₋₁(t) _⇐⇒ either dimE_[_t₋d_j₋₁(t),t₊d_j₋₁(t)_](A) > j−1 or t_±d_j₋₁(t)_∈σess(A). Set δj(t)=dist t, σ (A)_{\ {}t_±d_k(t)_}_kj₌₁. Let n− j(t)=sup{s<t :Tr1(s,t_](A)≥ j} and n+ j(t)=inf{s>t :Tr1[t,s)(A)≥ j},

(6)

conveying thatn−

j(t)= −∞whenever Tr1(−∞,t](A) < jandn+j(t)= +∞

when-ever Tr1[t,_+∞)(A) < j. Thenn∓_j(t)is the jth point inσ (A)to the left(−)/right(+)

oftcounting multiplicities. Heret _∈σ (A)is allowed and neithertnorn∓

j(t)have to

be isolated from the rest ofσ (A). Without further mention, all the statements below regarding bounds onn∓

j(t)will be immediate and useless in either of these two cases

and so will not be considered in the proofs. Set

ν−_j (t)₌sup{s<t_:Tr1(s,t)(A)≥ j} and

ν+_j (t)₌inf{s>t_:Tr1(t,s)(A)≥ j}.

These are the spectral points ofAwhich are strictly to the left and strictly to the right oft respectively. The inequalityν±_j (t)₌n±

j(t)only occurs whentis an eigenvalue.

Everywhere belowL_⊂_D(A)will be a (trial) subspace of dimensionn ₌dimL_. Unless explicitly stated, we will assume the following.

Assumption 1 The combination of parametert and subspaceL_{are such that}

L_∩E_t(A)_{= {}0}. (5) The integer numberm_≤nwill always be chosen such that the following assumption holds true.

Assumption 2

[t₋d_m(t),t₊d_m(t)_{] ∩}σ (A)_⊆σdisc(A). (6)

By virtue of (6),δj(t) >dj(t)for all j ≤m.

3 Approximated local counting functions

In this section we show how to extract certified information aboutσ (A)in the vicinity oft from the action ofAontoL, see [21, Section 3]. For j _≤n, let

Fj(t)= min dimV=j V⊂L max u∈V |u_|t u. (7)

Then 0_≤ F1(t)≤ · · · ≤ Fn(t)andFj(t)≥dj(t)for all j =1, . . . ,n.

As a consequence of the triangle inequality,Fj is a Lipschitz continuous function

such that

|Fj(t)−Fj(s)| ≤ |t−s| ∀s,t ∈R and j =1, . . . ,n. (8)

Since[t ₋d_j(t),t₊d_j(t)_{] ⊆ [}t₋Fj(t),t+Fj(t)], there are at least j spectral

points ofAin the segment[t₋Fj(t),t+Fj(t)]. As we shall see next, this possibly

(7)

Lemma 1 For any j₌1, . . . ,n,

Tr1[t₋Fj(t),t+Fj(t)](A)≥ j. (9)

Proof LetBbe a non-negative self-adjoint operator such thatL_⊂D(B)_⊂D(B1/2₎_.

Letb(u)₌B1/2_u_,_B1/2_u

for allu _∈D(B1/2₎_{be the closure of the quadratic form}

associated toB. Let ˜ λj(L)= min dimV=j V⊂L max u∈V b(u) u2 and λj = inf dimV=j V_⊂D(B1/2₎ sup u∈V b(u) u2.

We claim that, ifλ˜j(L)=λj, thenλj must be an eigenvalue of B. In other words,

E_λ

j(B)= {0}. Let us firstly verify the validity of this claim.

Suppose that j ₌1. Then

λ1= inf u∈D(B1/2₎

b(u)

u2

is attained by a non-zero vectorv _∈ L. Using the Rayleigh–Ritz principle (see [20, §4.5]), we deduce thatv _∈ D(B)and in factvis an eigenvector associated withλ1.

This implies the above claim for j₌1.

Now suppose that j _≥ 2. We have two possibilities. Eitherλ˜j(L)=λj is in the

discrete spectrum of B and the claim follows, or it is in the essential spectrum. In the latter case, without loss of generality we can assume thatλ˜j(L) /∈ σdisc(B)and

λj−1 ∈σdisc(A). That is,λk ∈σdisc(B)for anyk ∈ {1, . . . ,j−1}andλk =λj for

anyk_{∈ {}j, . . . ,n_}. Let L′₌L₊ ⎡ ⎣ j₋1 k=1 E_λ k(B) ⎤ ⎦.

Thenλ˜k(L′)=λkfor anyk∈ {1, . . . ,j−1}and λj ≤ ˜λj(L′)≤ ˜λj(L).

But, sinceλ˜j(L)=λj, then alsoλj = ˜λj(L′). Now, in the orthogonal decomposition

L′_{= ˆ}L_⊕ ⎡ ⎣ j−1 k=1 E_λ_k(B) ⎤ ⎦,

(8)

ˆ

L is the subspace of L′ orthogonal to _kj₌−₁1E_λ_k(B)and it is different fromL in general. For allu_{∈ ˆ}L,

b(u)_≥λju2 andλ˜1(Lˆ)=λj. Hence, min u∈ ˆL b(u) u2 =λj =_u min ∈D(B1/2₁ J(B)) b(u) u2.

Thus, from the case j₌1 already proven, we deduce thatλj is indeed an eigenvalue

ofB. This is the above claim for j_≥2.

We now complete the proof of the lemma. Recall (3) and (7). We have two possi-bilities, eitherFj(t)=dj(t)orFj(t) >dj(t).

Suppose thatFj(t)=dj(t). From the previous claim forB=(A−t)2we deduce

that

E_d

j(t)2((A−t) 2₎

= {0}.

Hence, according to the Spectral Mapping Theorem, the segment_[t₋d_j₍t),t₊d_j₍t)_]

contains jeigenvalues and so

Tr1[t−Fj(t),t+Fj(t)](A)=Tr1[t−dj(t),t+dj(t)](A)≥ j

as needed.

Now suppose thatFj(t) >dj(t). Thent∓dj(t)∈ [t−Fj(t),t+Fj(t)]. Moreover,

eithert₋d_j(t)ort₊d_j(t)lies in the essential spectrum and is either isolated from

σ (A)or is an accumulation point of eigenvalues ofAor is an endpoint of a segment inσ (A). Thus,

Tr1[t−Fj(t),t+Fj(t)](A)≥Tr1[t−Fj(t),t−dj(t)](A)+Tr1[t+dj(t),t+Fj(t)](A) = ∞ ≥ j,

and hence once again the conclusion of the lemma is guaranteed. _⊓_⊔ By virtue of this lemma,Fj(t)can be regarded as an approximated local counting

function forσ (A). Moreover,Fj(t)is thejth smallest eigenvalueμof the non-negative

weak problem:

find(μ,u)_{∈ [}0,_∞)_×L_\{0_} such that qt(u, v)=μ2u, v ∀v∈L. (10)

Hence, we also have the following characterisation,

Fj(t)= max dimV=j−1 V⊂L min u_∈L_⊖V |u_|t u =dimmaxV=j−1 V⊂H min u_∈L_⊖V |u_|t u. (11)

(9)

3.1 Optimal setting for local detection of the spectrum

As we show next, it is possible to detect the spectrum of Ato the left/right oft by means of Fj in an optimal setting. This is a crucial ingredient in the formulation of

the strategy proposed in [21–23].

The following statement was first formulated in [21, theorems 3 and 4] and will be sharpened in Corollary6.

Proposition 2 Let t−<t <t+. Then

Fj(t−)≤t−t−⇒t−−Fj(t−)≤n−_j(t)

Fj(t+)≤t+−t ⇒t++Fj(t+)≥n+j(t). (12)

Moreover, let t₁−<t₂−<t <t₂+<t₁+. Then

Fj(t_i−)≤t−t_i− for i =1,2⇒t₁−−Fj(t₁−)≤t₂−−Fj(t₂−)≤n−_j(t)

Fj(ti+)≤ti+−t for i =1,2⇒t1++Fj(t1+)≥t2++Fj(t2+)≥n+j(t). (13)

Proof We begin by showing (12). Suppose thatt _≥Fj(t−)+t−. Then

Tr1[t−₋Fj(t−),t](A)≥ j.

Sincen−

j(t)≤ · · · ≤n−1(t)are the only spectral points in the segment[n−j(t),t], then

necessarily

n−

j(t)∈ [t−−Fj(t−),t].

The second statement in (12) is shown in a similar fashion and the assertion (13) follows by observing that the mapst _→t_±Fj(t)are monotonically increasing as a

consequence of (8). ⊓⊔

The structure of the trial subspaceLdetermines the existence oft±satisfying the hypothesis in (12). If we expect to detectσ (A)at both sides oft, from Poincaré’s Eigenvalue Separation Theorem [9, Theorem III.1.1], a necessary requirement onL should certainly be the condition

min u∈L Au,u u,u <t <maxu∈L Au,u u,u . (14)

By virtue of Lemmas8and9below, for j ₌1, the left hand side inequality of (14) implies the existence oft−and the right hand side inequality implies the existence of

t+, respectively.

Remark 1 From Proposition 2 it follows that optimal lower bounds for n−

j(t) are

(10)

by virtue of (13), t− ₋Fj(t−) ≤ ˆt−j −Fj(tˆ−j ) ≤ n−j(t) for any other t− as in

(12). Similarly, optimal upper bounds forn+

j(t)are found by analogous means. This

observation will play a crucial role in Sect.4.

Proposition2is central to the hierarchical method for finding eigenvalue inclusions examined a few years ago in [21,22]. For fixedLthis method leads to bounds for eigenvalues which are far sharper than those obtained from the obvious idea of esti-mating local minima ofF1(t). From an abstract perspective, Proposition2provides an

intuitive insight on the mechanism for determining complementary bounds for eigen-values. The method proposed in [21–23] is yet to be explored more systematically in a practical setting. However in most circumstances, the technique described in [35], considered in detail in Sect.4, is easier to implement.

3.2 Geometrical properties of the first approximated counting function

We now determine various geometrical properties ofF1and examine its connection

to the spectral distance.

Let λ _∈ σ (A) be isolated from the rest of the spectrum. If there exists a non-vanishingu_∈L_∩E_λ(A)(recall Assumption1), then

|u_|s u = |λ−s| =d1(s) ∀s∈ λ₋|λ−ν − 1(λ)| 2 , λ+ |λ₋ν₁+(t)_| 2 .

According to the convergence analysis carried out in Sect.5, the closerLis to the spec-tral subspaceE_λ(A), the closerF1(t)is tod1(t)fort∈(λ−|λ−ν

−

1(λ)|

2 , λ+

|λ−ν+₁(λ)|

2 ).

The special case ofLandE_λ(A)having a non-trivial intersection is considered in the following lemma.

Lemma 3 Forλ_∈σ (A)isolated from the rest of the spectrum,the following state-ments are equivalent.

(a) There exists a minimiser u _∈ Lof the right side of (7)for j ₌ 1,such that

|u_|t =d1(t)for a single t∈(λ−| λ₋ν−₁(λ)_| 2 , λ+ |λ₋ν₁+(λ)_| 2 ), (b) F1(t)=d1(t)for a single t∈(λ−| λ−ν₁−(λ)| 2 , λ+ |λ−ν+₁(λ)| 2 ), (c) F1(s)=d1(s)for all s∈ [λ−|λ−ν − 1(λ)| 2 , λ+ |λ−ν+₁(λ)| 2 ], (d) L_∩E_λ(A)_{= {}0}.

Proof SinceL_{is finite-dimensional, (a) and (b) are equivalent by the definitions of}

d₁(t),F1(t)andqt. From the paragraph above the statement of the lemma it is clear that

(d)_⇒(c)_⇒(b). Since_|u_|t/uis the square root of the Rayleigh quotient associated

to the operator(A₋t)2, the fact thatλis isolated combined with the Rayleigh–Ritz principle, gives the implication (a)_⇒(d). _⊓_⊔ As there can be a mixing of eigenspaces, it is not possible to replace (b) in this lemma by an analogous statement includingt ₌λ_±|λ−ν±1(λ)|

(11)

eigenvalue, for example, thenF1(λ+λ

′

2 )=d1(

λ₊λ′

2 )ensures thatLcontains elements

ofE_λ(A)_⊕E_λ′(A). However it is not guaranteed to contain elements of any of these

two subspaces.

3.3 Geometrical properties of the subsequent approximated counting functions

Various extensions of Lemma3to the case j >1 are possible, however it is difficult to write these results in a neat fashion. Proposition5below is one such an extension. We start presenting a preliminary result needed for its proof. LetJ _⊂R_{be an open} segment. Denote by

∂_t±f(t)₌ lim

τ_→0+±

f(t_±τ )₋ f(t)

τ ,

the one-side derivatives of a function f _: J _−→R_{, if they exist. Let}Vbe a compact topological space. For givenJ _: J_×V_−→R_{we write}

˜ J(t)₌max v_∈V J(t, v) and V˜(t)₌ ˜ v_∈V_{: ˜}J(t)₌J(t,v)_˜ .

Below we consider an upper semi-continuous functionJ. Together with the fact that Vis compact, this ensures the existence ofJ˜(t). Using the notation just introduced, we state the following generalization of Danskin’s Theorem, which is a direct consequence of [8, Theorem D1].

Lemma 4 If the mapJ is upper semi-continuous and∂_t±J(t, v)exist for all(t, v)_∈

J_×V, then also∂t±J˜(t)exist for all t∈ J and

∂_t±J˜(t)₌ max

˜ v∈ ˜V(t)

∂_t±J˜(t,v)._˜ (15) In the statement of this lemma, note that the left and right derivatives of bothJ andJ˜can be different.

Proposition 5 Let j ₌1, . . . ,n and t_∈R_{be fixed. The next assertions are}

equiva-lent.

(a) _|Fj(t)−Fj(s)| = |t−s|for some s =t .

(b) There exists an open segment J _⊂Rcontaining t in its closure,such that

|Fj(t)−Fj(s)| = |t−s| ∀s∈ J.

(c) There exists an open segment J _⊂Rcontaining t in its closure,such that

∀s_∈ J, either L_∩E_s₊_F

(12)

Proof (a)_⇒(b). Assume (a). Sincer_→r_±Fj(r)are continuous and monotonically

increasing, then they have to be constant in the closure of

J_{= {}τt₊(1₋τ )s_:0< τ <1_}.

This is precisely (b).

(b)_⇒(c). Assume (b). Thens _→ Fj(s)is differentiable in J and its one-sided

derivatives are equal to 1 or−1 in the whole of this interval. For this part of the proof, we aim at applying (15), in order to get another expression for these derivatives.

LetF_j _{be the family of}(j ₋1)-dimensional linear subspaces ofL_{. Identify an} orthonormal basis ofL_{with the canonical basis of}Cn_{. Then any other orthonormal}

basis ofLis represented by a matrix in O(n), the orthonormal group. By picking the first(j₋1)columns of these matrices, we cover all possible subspacesV _∈F_j. Indeed we just have to identify(v_{1| · · · |}v_j₋₁)for[v_kl_]n_kl₌₁_∈O(n)withV ₌Span{v_k_}_kj−₌1₁.

Let

K_j ₌(v₁, . . . , v_j₋₁)_{: [}v_kl_]n_kl₌₁_∈O(n)_⊂Cn_{× · · · ×}Cn

j−1

.

ThenK_j_{is a compact subset in the product topology of the right hand side. According} to (11), Fj(s)= max (v₁,...,v_j₋₁)_∈K_jg(s;v1, . . . , vj−1) where g(s_;v₁, . . . , v_j₋₁)₌ min (a1,...,aj−1)∈Cj−1 |ak|2=1 akv˜k s.

Here we have used the correspondence betweenv_k _∈Cn_and_v_˜

k ∈Lin the orthonormal

basis set above. We write

g(r,V)₌g(r_;v₁, . . . , v_j₋₁) for V ₌Span{˜v_k_}_kj₌−₁1_∈F_j.

The mapg _: J _×K_j _−→ R+ _{is the minimum of a differentiable function, so the} hypotheses of Lemma4are satisfied byJ _{= −}g. By virtue of (15),

∂_s±g(s,V)₌ min u∈L_⊖V,u=1 |u_|s=g(s,V) Rels(u,u) |u_|s .

As minima of continuous functions,g(s,V)and∂_s±g(s,V)are upper semi-continuous. Therefore, a further application of Lemma4yields

(13)

∂_s±Fj(s)= max (v₁,...,v_j₋₁)_∈K_j g(s_;v1,...,vj−1)=Fj(s) ∂_s±g(s, v₁, . . . , v_j₋₁) = max V_∈F_j g(s,V)=Fj(s) min u_∈L_⊖V,u₌1 |u|s=g(s,V) Rels(u,u) |u_|s .

Now, this shows that

max V∈F_j g(s,V)=Fj(s) min u∈L_⊖V,u=1 |u_|s=g(s,V) Rels(u,u) |u_|s =1.

AsL_{is finite dimensional, there exists a vector}u _∈ L_satisfying_|u_|s = Fj(s)such

that

|Rels(u,u)| |u_|s =

1.

Thus_|Re(A₋s)u,u_{| =}(A₋s)u, (A₋s)u₌ Fj(s). Hence, according to the

“equality” case in the Cauchy–Schwarz inequality,u must be an eigenvector of A

associated with eithers₊Fj(s)ors−Fj(s). This is precisely (c).

(c)_⇒(a). Under the condition (c), there exists an open segmentJ˜_⊆J, possibly smaller, such thatt_{∈ ˜}JandFj(s)=dj(s)for alls∈ ˜J. Since|dj(s)−dj(r)| = |s−r|,

then either (a) is immediate, or it follows by takingr_→t. ⊓⊔

Proposition5leads to the following version of Proposition2fortan eigenvalue.

Corollary 6 Recall Assumption1.Let t _∈ σ (A)be an eigenvalue of multiplicity k.

Let t−<t <t+. IfE_t(A)_∩L_{= {}0},then

Fj(t−)≤t−t−⇒t−−Fj(t−)≤n−_j₊_k(t)

Fj(t+)≤t+−t ⇒t++Fj(t+)≥n+j₊k(t). (16)

Proof According to (9),

Tr1_[t−₋Fj(t−),t−+Fj(t−)](A)≥ j.

Thus, ift >Fj(t−)+t−, there is nothing to prove.

Consider now the caset ₌ Fj(t−)+t−. If there existsτ < t− such thatt =

Fj(τ ) +τ, then (Proposition 5) there exists an open segment J ⊂ R containing (τ,t−)such that

∀s_∈ J, either L_∩E_s₊_F

(14)

From the assumption, it follows that only the second alternative takes place, and necessarilys₋Fj(s)is an eigenvalue of Afor alls ∈(τ,t−). Hence, ass−Fj(s)

is continuous and H_{is separable, this function should be constant in the segment}

(τ,t−). Moreover, due to monotonicity for anys _∈(τ,t−),s₊Fj(s)=t−. Hence

ifs _∈ (τ,t−) _→ s₋Fj(s)is constant (equal to some value, sayv), thens is the

midpoint betweent andvfor anys_∈(τ,t−). This contradicts the fact thatτ ₌t−. Hence

t>Fj(τ )+τ, ∀τ <t−

and so

τ ₋Fj(τ )≤n−j₊k(t),

for allτ <t−. By continuity, it then follows that also

t−₋Fj(t−)≤n−_j₊_k(t).

The second statement (16) is shown in a similar fashion. _⊓_⊔

3.4 Approximated eigenspaces

We conclude this section by showing how to obtain certified information about spectral subspaces.

Our model is the implication (b) ⇒(d) in Lemma 3. In a suitable asymptotic regime forL, the distance between these eigenfunctions and the spectral subspaces of|A₋t_|in the vicinity of the origin is controlled by a term which is as small as O(Fj(t)−dj(t))forFj(t)−dj(t)→0.

The following statement is independent, but it is clearly connected with classi-cal results of Weinberger [32] and Trefftz [30]. Note that a shift parameter can be introduced in Weinberger’s formulation following [4].

Proposition 7 Let m be as in Assumption2.Let t_∈Rand j _{∈ {}1, . . . ,m_}be fixed. Let

{ut_j_}n_j₌₁_⊂Lbe an orthonormal family of eigenfunctions associated to the eigenvalues

μ₌Fj(t)of the weak problem(10). Suppose that Fj(t)−dj(t)is small enough so

that0< εj <1holds true in the following inductive construction,

ε1= F1(t)2−d1(t)2 δ1(t)2−d1(t)2 εj = Fj(t) 2₋_d j(t)2 δj(t)2−dj(t)2 + j−1 k₌1 ε_k2 1−ε_k2 1₊dj(t) 2₋_d k(t)2 δj(t)2−dj(t)2 .

(15)

Then,there exists an orthonormal basis _{φt_j_}m_j₌₁ ofE_[_t₋d_m(t),t+d_m(t)](A)such that φt_j _∈E_{_t₋d_j(t),t₊d_j(t)_}(A), ut_j₋ut_j, φt_jφt_j_≤εj and (17) |ut_j₋ut_j, φt_jφt_j_|t ≤ Fj(t)2−dj(t)2+dj(t)2ε2_j. (18)

Proof As it is clear from the context, in this proof we suppress the indext on top of

any vector. We writeSto denote the orthogonal projection onto the subspaceSwith

respect to the inner product_·,_·.

Let us first consider the case j ₌1. LetS₁₌E_{_t₋d₁(t),t+d₁(t)}(A)and decompose u1=S₁u1+u1⊥whereu⊥1 ⊥S1. SinceAis self-adjoint,

F1(t)2= (A−t)u12=d1(t)2S₁u12+ (A−t)u⊥₁2. (19) Hence F1(t)2≥d1(t)2(1− u⊥1 2₎ +δ1(t)2u⊥1 2_.

Since δ1(t) > d1(t), clearing from this identity u⊥12 yieldsu⊥1 ≤ ε1. Hence S₁u12≥1−ε₁2>0. Let

φ1=

1

S₁u1

so thatS₁u1 = |u1, φ1|. Then (17) holds immediately and (18) is achieved by

clearing(A₋t)u₁⊥2from (19). This is the case j ₌1.

Let us now look at the case j >1. We define the needed basis, and show (17) and (18), for jup tominductively as follows. Set

φj =

1

S_juj S_juj

where S_j ₌ E_{_t₋d_j(t),t+d_j(t)}(A)⊖Span{φl}1j−1 andSjuj = 0, all this for 1 ≤

j _≤ k₋1. Assume that (17) and (18) hold true for j up tok₋1. Define S_k ₌ E_{_t₋d_k(t),t+d_k(t)}(A)⊖Span{φl}1k−1. We first show that Skuk = 0, and so we can

define

φk =

1

S_kuk

S_kuk (20)

ensuring φk ⊥ Span{φl}_lk₌−₁1. After that, we verify the validity of (17) and (18) for

j₌k. Decompose uk=S_kuk+ 1 l₌k₋1 uk, φlφl+u⊥k

(16)

whereu⊥_k is orthogonal to Span{φl}lk₌−11⊕Sk. Then Fk(t)2=dk(t)2S_kuk2+ 1 l=k−1 d_l(t)2_|uk, φl|2+ (A−t)u⊥k 2 ≥d_k(t)2S_kuk2+ 1 l₌k₋1 d_l(t)2_|uk, φl|2+δk(t)2u⊥k 2 =d_k(t)2(1₋u⊥_k2)₊ 1 l₌k₋1 (d_l(t)2₋d_k(t)2)_|uk, φl|2+δk(t)2u⊥k2.

The conclusion (17) up tok₋1, implies|ul, φl|2≥ 1−ε_l2forl =1, . . . ,k−1.

Sinceuk,ul =0 forl=k,

|ul, φl||uk, φl| = |uk,ul− ul, φlφl|.

Then, the Cauchy–Schwarz inequality alongside with (17) yield

|uk, φl|2≤ ε_l2 1₋ε_l2. (21) Hence, sinced_l(t)_≤d_k(t), Fk(t)2≥dk(t)2+ 1 l=k−1 (d_l(t)2₋d_k(t)2) ε 2 l 1₋ε_l2 +(δk(t) 2 −d_k(t)2)u⊥_k2.

Clearingu⊥_k2from this inequality and combining with the validity of (21) and (17) up tok₋1, yieldsS_kuk=0.

Letφkbe as in (20). Then (17) is guaranteed for j=k. On the other hand, (17) up

to j ₌k, (21) and the identity

Fk(t)2=dk(t)2|uk, φk|2+ (A−t)(uk− uk, φkφk)2,

yield (18) up to j ₌k. ⊓⊔

Remark 2 If t ₌ n

−

j(t)+n+j(t)

2 for a given j, the vectors φ t

j introduced in

Proposi-tion7(and invoked subsequently) might not be eigenvectors ofAdespite the fact that

(17)

4 Local bounds for eigenvalues

Our next purpose is to characterise the optimal parameters t± in Proposition 2

(Remark1) by means of the following weak eigenvalue problem,

findu_∈L_\{0_} and τ _∈R _{such that}

τqt(u, v)=lt(u, v) ∀v∈L. (22)

This problem is central to the method for calculating eigenvalue bounds considered by Zimmermann and Mertins in [35]. Note that Assumption1 ensures that (22) is well-posed.

Let

τ₁−(t)_{≤ · · · ≤}τ_n−₋(t) <0 and 0< τ_n+₊(t)_{≤ · · · ≤}τ₁+(t),

be the negative and positive eigenvalues of (22), respectively. Here and belown∓(t)

are the number of these negative and positive eigenvalues, respectively. Both these quantities are piecewise constant int. Below we will denote eigenfunctions associated withτ∓_j (t)byu∓_j(t).

Below we write most statements only for the case of “lower bounds for the eigen-values of A which are to the left oft”. As the position oft relative to the essential spectrum is irrelevant here, evidently this does not restrict generality. The correspond-ing results regardcorrespond-ing “upper bounds for the eigenvalues ofAwhich are to the right of

t” can be recovered by replacingAby₋A.

The left side of (14) ensures the existence ofτ₁−(t).

Lemma 8 The following conditions are equivalent, (a−) F1(s) >t−s for all s<t

(b−) Au_u_,_u,u >t for all u_∈L

(c−) all the eigenvalues of (22)are positive.

Remark 3 LetL₌_Span_{bj}n_j₌₁. The matrix[qt(bj,bk)]n_{j k}₌₁is singular if and only

if E_t(A)_∩L _{= {}₀_}_{. On the other hand, the kernel of (}₂₂_{) might be non-empty.} If n0(t) is the dimension of this kernel andn_∞(t) = dim(Et(A)∩L), then n =

n_∞(t)₊n0(t)+n−(t)+n+(t).

Note thatn_∞(t)_≥1 if and only ifFj(t)=0 forj =1, . . . ,n_∞(t). In this case the

conclusions of Lemma9and Theorem10below do not have any meaning. In order to write our statements in a more transparent fashion we use Assumption1.

By virtue of the next three statements, finding the negative eigenvalues of (22) is equivalent to findings_{= ˆ}t−_j _∈R_{such that}

t₋s₌Fj(s), (23)

and in this casetˆ−_j ₌ t₊ 1

2τ−_j (t). It then follows from Remark1 that (22) encodes

information about the optimal bounds for the spectrum aroundt, achievable by (13) in Proposition2.

(18)

4.1 The eigenvalue to the immediate left oft

We begin with the case j ₌1, see [23, Theorem 11].

Lemma 9 Let t_∈R_andLsatisfy Assumption1.The smallest eigenvalueτ ₌τ₁−(t)

of (22)is negative if and only if there exists s <t such that(23)holds true. In this

case s₌t₊ 1 2τ₁−(t) and F1(s)= − 1 2τ₁−(t) = |u−₁(t)_|s u−₁(t)

for u₌u−₁(t)_∈Lthe corresponding eigenvector.

Proof For allu_∈L_ands_∈R_,

qs(u,u)−F1(s)2u,u =qt(u,u)+2(t−s)lt(u,u)+

(t₋s)2₋F1(s)2 u,u.

Suppose thatF1(s)=t−s. Then

qs(u,u)−F1(s)2u,u =qt(u,u)+2F1(s)lt(u,u).

As the left side of this expression is non-negative,

lt(u,u)

qt(u,u)≥ −

1 2F1(s)

for allu _∈L_\{₀_}_{and the equality holds for some}u_∈L_{. Hence}₋ 1

2F1(s)is the smallest

eigenvalue of (22), and thus necessarily equal toτ₁−(t). In this cases₋ F1(s) =

t₋2F1(s)=t +_τ−1

1(t)

. Here the vectoru for which equality is achieved is exactly

u ₌u−₁(t).

Conversely, letτ₁−(t)andu−₁(t)be as stated. Then

τ₁−(t)_≤ lt(u,u)

qt(u,u)

for allu_∈Lwith equality foru ₌u−₁(t). Re-arranging this expression yields

qt(u,u)−

1

τ₁−(t)lt(u,u)≥0

for allu_∈L_{with equality for}u₌u₁−(t). The substitutiont ₌s₋ 1

2τ₁−(t)then yields

qt(u,u)−

1

(19)

for all u _∈ L_{. The equality holds for} u ₌ u−₁(t). This expression can be further re-arranged as |u_|2_s u2 ≥ 1 (2τ₁−(t))2. HenceF1(s)2= ₍₂_τ−1 1(t))2 , as needed. ⊓⊔ 4.2 Subsequent eigenvalues

An extension of Lemma9to the case j ₌1 is now found by induction.

Theorem 10 Let1≤ j _≤ n be fixed. The number of negative eigenvalues n−(t)of

(22)is greater than or equal to j if and only if

Au,u

u,u <t for some u∈L⊖Span{u

−

1(t), . . . ,u−j−1(t)}.

Assuming this holds true,thenτ ₌τ−_j (t)and u₌u−_j(t)are solutions of (22)if and

only if Fj ! t₊ 1 2τ−_j (t) " = − 1 2τ−_j (t) = u−j(t) t+ 1 2τ−_j (t) u−_j(t) .

Proof Recall thatt _∈ R_andLsatisfy Assumption1. For j ₌1 the statements are

Lemma9taking into consideration (14). For j > 1, due to the self-adjointness of the eigenproblem (22), it is enough to apply again Lemma9 by fixing L˜ ₌ L_⊖ Span{u−₁(t), . . . ,u−_j₋₁(t)_}as trial spaces. Note that the negative eigenvalues of (22) for the trial spaceL˜_{are those of (}₂₂_{) for}L_{except for}τ₁−(t), . . . , τ−_j₋₁(t). ⊓⊔

A neat procedure for finding spectral bounds forA, as described in [35], can now be deduced from Theorem10. By virtue of Proposition2and Remark1, this procedure is optimal in the context of the approximated counting functions discussed in Sect.3, see [23, Section 6]. We summarise the core statement as follows.

Corollary 11 For all t _∈Rand j_{∈ {}1, . . . ,n±(t)_},

t₊ 1 τ−_j (t) ≤ n− j(t) and n+j(t)≤t+ 1 τ+_j (t). (24)

This corollary is an extension of the case j ₌1 established in [23, Theorem 11]. In recent years, numerical techniques based on this statement (for j ₌1) have been developed to successfully compute eigenvalues for the radially reduced magnetohydro-dynamics operator [15,35], the Helmholtz equation [6] and the calculation of sloshing

(20)

frequencies [5]. We show an implementation to the case of the Maxwell operator with

j_≥1 in Sect.6. See also [3].

5 Convergence and error estimates

Our first goal in this section will be to show that, ifL_{captures an eigenspace of} A

within a certain order of precisionO(ε)as specified below, then the residuals

|t∓_∓Fj(t∓)−n∓_j(t)|

(see the right side of (12)) are (a) O(ε)for anyt _∈R_, (b) O(ε2)fort_∈/σ (A).

This will be the content of Theorems13and14, and Corollary15. We will then show that, in turns, (24) has always residual of orderO(ε2)for anyt _∈R_{. See Theorem}₁₆_. In the spectral approximation literature this property is known as optimal order of convergence/exactness, see [18, Chapter 6] or [33].

Recall Remark2, and the Assumptions1and2. Below{φt_j_}m_j₌₁denotes an ortho-normal set of eigenvectors ofE_[_t₋d_m(t),t₊d_m(t)_](A)which is ordered so that

|A₋t_|φt_j ₌d_j(t)φt_j for j ₌1, . . . ,m.

Whenever 0< εj <1 is small, as specified below, the trial subspaceLwill be close

to Span{φt_j_}m_j₌₁in the sense that there existwt_j _∈L_{such that}

wt_j ₋φt_j_≤εj and (A0) |wt_j₋φt_j_|t ≤εj. (A1)

We have split this condition into two terms, in order to highlight the fact that some times only (A1) is required. Unless otherwise specified, the index j runs from 1 to

m. From Assumption2 it follows that the family{φs_j_}m_j₌₁ _⊂ E_[_t₋d_m(t),t+d_m(t)](A)

and the family{ws_j_}m_j₌₁ _⊂L_{above can always be chosen piecewise constant for}s

in a neighbourhood oft. Moreover, they can be chosen so that jumps only occur at

s_∈σ (A).

A set_{wt_j_}m_j₌₁subject to (A0)–(A1) is not generally orthonormal. However,

accord-ing to the next lemma, it can always be substituted by an orthonormal set, provided

εjis small enough.

Lemma 12 There exists C >0independent ofLensuring the following. If_{wt_j_}m_j₌₁_⊂

Lis such that(A0)-(A1)hold true for allεj such that

ε₌ m j=1 ε2_j < _√1 m,

(21)

then there is a set_{vt_j_}m_j₌₁_⊂Lorthonormal in the inner product_·,_·such that

|vt_j₋φt_j_|t+ vtj−φ t

j<Cε.

Moreover,all these vectors are locally constant in t with jumps only at the spectrum

of A.

Proof Recall Assumption2. As it is clear from the context, in this proof we suppress

the indexton top of any vector. The desired conclusion is achieved by applying the Gram–Schmidt procedure. Let G _{= [}wk, wl]_klm₌₁ ∈ Cm×m be the Gram matrix

associated to_{wj}. Set vj = m k=1 (G−1/2)k j wk. Then G₋I_≤ m k,l=1 |wk, wl − φk, φl|2 ≤ 2 m k,l=1 wk−φk2(wl + φl)2 ≤√2(2+ε)ε. Since vj−wj2= # # # # # m k₌1 (G−1/2₋I)k jwk # # # # # 2 = m k,l₌1 (G−1/2₋I)k j(G−1/2−I)l jwk, wl = m k=1 (G−1/2₋I)k j ! _m l=1 Gkl(G−1/2−I)l j " = m k=1 (G−1/2₋I)k j(G1/2−G)j k =(I₋G1/2)2 j j then vj−wj ≤ I−G1/2.

(22)

AsG1/2is a positive-definite matrix, for everyv_∈Cm _{we have}

(G1/2₊I)v2₌G1/2v2₊2G1/2v, v₊v2_≥v2.

Then det(I₊G1/2)₌0 and(I ₊G1/2)−1_≤1. Hence

vj−wj ≤ (I−G)(I+G1/2)−1 ≤ I−G (I+G1/2)−1 ≤(2+ε)ε. (25)

Now, identifyv₌(v1, . . . , vm)∈Cm withv=mk=1vkφk. As G1/2v₌ # # # # # # m j₌1 v, φjwj # # # # # # ≥ v₋ # # # # # # m j₌1 v, φj(wj−φj) # # # # # # ≥(1−ε)v then G−1/2_≤ 1 1₋ε. Hence |vj −wj|t ≤ m k₌1 |(G−1/2₋I)j k||wk|t ≤ m k₌1 |(G−1/2₋I)j k|(εk+dk(t)) ≤ m k,l₌1 |(G−1/2)kl||(G1/2−I)l j|(εk+dk(t)) ≤ √_m (ε₊d_m₍t))(2₊ε) 1₋ε ε. (26)

The conclusion follows from (25) and (26). ⊓⊔ 5.1 Convergence of the approximated local counting function

The next theorem addresses the claim (a) made at the beginning of this section. Accord-ing to Lemma12, in order to examine the asymptotic behaviour ofFj(t)asεj → 0

under the constraints (A0)–(A1), without loss of generality the trial vectorswt_j can be

assumed to form an orthonormal set in the inner product_·,_·.

Theorem 13 Let_{wt_j_}m_j₌₁ _⊂L be a family of vectors which is orthonormal in the

inner product_·,_·and satisfies(A1).Then

Fj(t)−dj(t)≤ ⎛ ⎝ j k=1 ε2_k ⎞ ⎠ 1/2 ∀j ₌1, . . . ,m.

(23)

Proof Recall Assumption2. From the Rayleigh–Ritz principle we obtain Fj(t)≤max |ck|2=1 j k=1 ckwk t ≤ max |ck|2=1 j k₌1 ck(wk−φk) t + max |ck|2=1 j k₌1 ckφk t =max |ck|2=1 j k=1 ck(wk−φk) t +d_j(t). This gives Fj(t)−dj(t)≤max |ck|2=1 j k₌1 |ck||wk−φk|t ≤ max |ck|2=1 ⎛ ⎝ j k₌1 |ck|2 ⎞ ⎠ 1/2⎛ ⎝ j k₌1 |wk−φk|2t ⎞ ⎠ 1/2 ≤ ⎛ ⎝ j k₌1 ε_k2 ⎞ ⎠ 1/2 as needed. _⊓_⊔

In terms of order of approximation, Theorem13will be superseded by Theorem14

fort_∈/σ (A). However, ift _∈σ (A), the trial spaceLcan be chosen so thatF1(t)−d1(t)

is only linear inε1. Indeed, fixing any non-zerou ∈D(A)andL=Span{u}, yields

F1(t)−d1(t)= F1(t)=ε1. Therefore Theorem13is optimal, on the presumption

thattis arbitrary.

The next theorem addresses the claim (b) made at the beginning of this section. Its proof is reminiscent of that of [29, Theorem 6.1].

Theorem 14 Let t _∈/σ (A). Suppose that theεjin(A1)are such that m j=1 ε2_j <d1(t) 2 6 . (27) Then, Fj(t)−dj(t)≤3 d_j(t) d₁(t)2 j k₌1 ε_k2 _∀j ₌1, . . . ,m. (28)

Proof Recall Assumption2. Sincet _∈/σ (A), then(D(A),qt(·,·))is a Hilbert space.

Let PL:D(A)−→Lbe the orthogonal projection ontoLwith respect to the inner

productqt(·,·), so that

(24)

Then|u_|2_t _{= |}PLu|2_t + |u−PLu|2_t for allu ∈D(A)and|u−PLu|t ≤ |u−v|tfor all v_∈L_{. Hence}

|φk−PLφk|t ≤εk ∀k=1, . . . ,m. (29)

LetE_j ₌Span_{φk}_kj₌₁. Define

F_j _{= {}φ_∈E_j _:φ₌1} and μ_Lj(t)₌ max φ∈F_j 2 Reφ, φ−PLφ − φ−PLφ2 .

Hereμ_Lj depends ont, asPLdoes. We first show that, under hypothesis (27),μ_Lj(t) < 1

2. Indeed, givenφ∈Fj we decompose it asφ= j k=1ckφk. Then |φ, φ₋PLφ| = j k=1 ckφk, φ−PLφ = j k=1 ck d_k(t)2qt(φk, φ−PLφ) = qt ⎛ ⎝ j k=1 ck d_k(t)2φk, φ−PLφ ⎞ ⎠ = qt ⎛ ⎝ j k=1 ck d_k(t)2(φk−PLφk), φ−PLφ ⎞ ⎠ ≤ j k₌1 ck d_k(t)2(φk−PLφk) t j k₌1 ck(φk−PLφk) t . (30)

For each multiplying term in the latter expression, the triangle and Cauchy–Schwarz’s inequalities yield (takeαk =ck orαk = _dck

k(t)2) j k=1 αk(φk−PLφk) t ≤ j k=1 |αk| |φk−PLφk|t ≤ ⎛ ⎝ j k₌1 |αk|2 ⎞ ⎠ 1/2⎛ ⎝ j k₌1 |φk−PLφk|2t ⎞ ⎠ 1/2 . (31) Then |2 Reφ, φ₋PLφ| ≤2 ⎛ ⎝ j k=1 |ck|2 d_k(t)4 ⎞ ⎠ 1/2⎛ ⎝ j k=1 |ck|2 ⎞ ⎠ 1/2 j k=1 ε2_k ≤ 2 d₁(t)2 j k=1 ε_k2 (32) for allφ_∈F_j_.

(25)

The other term in the expression forμ_Lj(t)has an upper bound found as follows. According to the Rayleigh–Ritz principle

φ₋PLφ2≤

1

d₁(t)2qt(φ−PLφ, φ−PLφ). (33)

Therefore, by repeating analogous steps as in (30) and (31), we get

φ₋PLφ2≤ 1 d₁(t)2 j k₌1 ckqt(φk−PLφk, φ−PLφ) =qt ⎛ ⎝ j k₌1 ck d₁(t)2(φk−PLφk), φ−PLφ ⎞ ⎠ =qt ⎛ ⎝ j k₌1 ck d₁(t)2(φk−PLφk), j l₌1 cl(φl−PLφl) ⎞ ⎠ ≤ _d 1 1(t)2 j k₌1 ε2_k. (34)

Hence, from (32) and (34),

μ_Lj(t)_≤ 3 d₁(t)2 j k₌1 ε2_k < 1 2 (35) as a consequence of (27).

Next, observe that dim(PLEj)= j. IndeedPLψ=0 forψ =1 would imply

μ_Lj(t)_≥ 2 Reψ, ψ−PLψ − ψ−PLψ2 = ψ2=1,

which would contradict the fact thatμ_Lj(t) <1. Then,

Fj(t)2≤ max u∈PLEj |u_|2_t u2 =max_φ_∈_E j |PLφ|2_t PLφ2 = max φ∈F_j |PLφ|2_t PLφ2 . As PLφ2= φ2−2 Reφ, φ−PLφ + φ−PLφ2≥1−μLj(t), we get Fj(t)2≤ max φ_∈F_j |φ_|2_t 1−μ_Lj(t) =max|ck|2=1 j k=1|ck|2dk(t)2 1−μ_Lj(t) = d_j(t)2 1−μ_Lj(t) . (36)

(26)

Finally, (36) and (35) yield Fj(t)2−dj(t)2≤ μ_Lj(t) 1₋μLj(t) d_j(t)2 ≤2μ_Lj(t)d_j(t)2 ≤2 3 d₁(t)2dj(t) 2 j k=1 ε_k2. (37)

The proof is completed by observing thatFj(t)+dj(t)≥2dj(t). ⊓⊔

As the next corollary shows, a quadratic order of decrease for Fj(t)−dj(t)is

prevented for t _∈ σ (A)(in the context of Theorems13and14), only for j up to dimE_t(A).

Corollary 15 Let t _∈σdisc(A),ℓ=1+dimEt(A)and k∈ {ℓ, . . . ,m}. Let αk(t)=

1

4min{|dl(t)−dl−1(t)| :dl(t)=dl−1(t),l=ℓ, . . . ,k}>0.

There exists ε > 0 independent of k ensuring the following. If(A1)holds true for

m j₌1ε2j < ε,then Fk(t)−dk(t)≤3 d_k(t) αk(t)2 k j=1 ε2_j.

Proof Without loss of generality we assume thatt₊d_k(t)_∈ σ (A). Otherwiset₋

d_k(t)_∈σ (A)and the proof is analogous to the one presented below.

Lett˜ ₌ t ₊αk(t). Then t˜ ∈/ σ (A)andt +dk(t) = ˜t +dk(t˜). Since the map

s _→ s₊Fj(s)is non-decreasing as a consequence of Proposition2, Theorem14

applied att˜yields Fk(t)−dk(t)=t+Fk(t)−(t+dk(t))≤ ˜t+Fk(t˜)−(t˜+dk(t˜)) =Fk(t˜)−dk(t˜)≤3 d_k(t˜) d₁(t˜)2 k j=1 ε2_k _≤3 dk(t) αk(t)2 k j=1 ε2_j as needed. ⊓⊔

5.2 Convergence of local bounds for eigenvalues

Our next task in this section is to formulate precise statements on the convergence of the method of Zimmermann and Mertins (Sect.4). Theorem16below improves upon two crucial aspects of a similar result established in [15, Lemma 2]. It allows j >1