Key words. a posteriori error estimators, quasi-orthogonality, adaptive mesh refinement, error and oscillation reduction estimates, optimal meshes.

(1)

FOR GENERAL SECOND ORDER LINEAR ELLIPTIC PDE

KHAMRON MEKCHAY

^∗

AND RICARDO H. NOCHETTO

^†

Abstract. We prove convergence of adaptive finite element methods (AFEM) for general (non- symmetric) second order linear elliptic PDE, thereby extending the result of Morin et al [6, 7]. The proof relies on quasi-orthogonality, which accounts for the bilinear form not being a scalar product, together with novel error and oscillation reduction estimates, which now do not decouple. We show that AFEM is a contraction for the sum of energy error plus oscillation. Numerical experiments, including oscillatory coefficients and convection-diffusion PDE, illustrate the theory and yield optimal meshes.

Key words. a posteriori error estimators, quasi-orthogonality, adaptive mesh refinement, error and oscillation reduction estimates, optimal meshes.

AMS subject classifications. 65N12, 65N15, 65N30, 65N50, 65Y20

1. Introduction and Main Result. Let Ω be a polyhedral bounded domain in R ^d , (d = 2, 3). We consider a general second order elliptic Dirichlet boundary value problem

Lu = −∇·(A∇u) + b · ∇u + c u = f in Ω, (1.1)

u = 0 on ∂Ω, (1.2)

with the following assumptions:

• A : Ω 7→ R ^d×d is Lipschitz and symmetric positive definite with smallest eigenvalue a − and largest eigenvalue a + , i.e.,

a − (x) |ξ| ² ≤ A(x)ξ · ξ ≤ a + (x) |ξ| ² , ∀ξ ∈ R ^d , x ∈ Ω; (1.3)

• b ∈ [L ^∞ (Ω)] ^d is divergence free (∇·b = 0 in Ω);

• c ∈ L ^∞ (Ω) is nonnegative (c ≥ 0 in Ω);

• f ∈ L ² (Ω).

The purpose of this paper is to prove the following convergence results for adap- tive finite element methods (AFEM) for (1.1-1.2), and document their performance computationally.

Theorem 1 (Convergence of AFEM). Let {u k } _k∈N

₀

be a sequence of finite ele- ment solutions corresponding to a sequence of nested finite element spaces ©

V ^k ª

k∈N

₀

produced by the AFEM of §3.5, which involves loops of the form SOLVE → ESTIMATE → MARK → REFINE.

There exist constants σ, γ > 0, and 0 < ξ < 1, depending solely on the shape regularity of meshes, the data, the parameters used by AFEM, and a number 0 < s ≤ 1 dictated by the angles of ∂Ω, such that if the initial meshsize h 0 satisfies h ^s ₀ kbk _L

∞

< σ, then for any two consecutive iterates k and k + 1 we have

|||u − u k+1 ||| ² + γ osc k+1 (Ω) ² ≤ ξ ²

³

|||u − u k ||| ² + γ osc k (Ω) ²

´

. (1.4)

∗

Department of Mathematics, University of Maryland, College Park, MD 20742, USA ([email protected]). Partially supported by NSF Grant DMS-0204670.

†

Department of Mathematics and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, USA ([email protected]). Partially supported by NSF Grant DMS-0204670.

1

(2)

Therefore, AFEM converges with a linear rate ξ, namely

|||u − u k ||| ² + γ osc k (Ω) ² ≤ C 0 ξ ^2k , where C 0 := |||u − u 0 ||| ² + γ osc 0 (Ω) ² .

Hereafter, |||·||| denotes the energy norm induced by the operator L and osc(Ω), the oscillation term, stands for information missed by the averaging process associated to FEM. This convergence result extends those of Morin et al. [6, 7] in several ways:

• We deal with a full second order linear elliptic PDE with variable coefficients A, b and c, whereas in [6, 7] A is assumed to be piecewise constant and b and c to vanish.

• The underlying bilinear form B is non-symmetric due to the first order term b · ∇u.

Since B is no longer a scalar product as in [6, 7], the Pythagoras equality relating u, u k and u k+1 fails; we prove a quasi-orthogonality property instead.

• The oscillation terms depend on discrete solutions in addition to data. Therefore, oscillation and error cannot be reduced separately as in [6, 7].

• The oscillation terms do not involve the oscillation of the jump residuals. This is achieved by exploiting positivity and continuity of A.

• Since error and oscillation are now coupled, in order to prove convergence we need to handle them together. This leads to a novel argument and result, the contraction property (1.4), according to which both error and oscillation decrease together.

This paper is organized as follows. In section 2 we introduce the bilinear form, the energy norm, recall existence and uniqueness of solutions, and state the quasi- orthogonality property. In section 3 we describe the procedures used in AFEM, namely, SOLVE, ESTIMATE, MARK, and REFINE, state new error and oscillation re- duction estimates, present the adaptive algorithm AFEM and prove its convergence.

In section 4 we prove the quasi-orthogonality property of section 2 and the error and oscillation reduction estimates of section 3. In section 5 we present three numerical experiments to illustrate properties of AFEM. We conclude in section 6 with exten- sions to A piecewise Lipschitz with discontinuities aligned with the initial mesh and non-coercive bilinear form B due to ∇·b 6= 0.

2. Discrete Solution and Quasi-Orthogonality. For an open set G ∈ R ^d we denote by H ¹ (G) the usual Sobolev space of functions in L ² (G) whose first derivatives are also in L ² (G), endowed with the norm

kuk _H

1

(G) := ³

kuk _L

2

(G) + k∇uk _L

2

(G)

´ _1/2

;

we use the symbols k·k _H

1

and k·k _L

2

when G = Ω. Moreover, we denote by H ₀ ¹ (G) the space of functions in H ¹ (G) that vanish on the boundary in the trace sense.

A weak solution of (1.1) and (1.2) is a function u satisfying

u ∈ H ₀ ¹ (Ω) : B[u, v] = hf, vi ∀v ∈ H ₀ ¹ (Ω), (2.1) where hu, vi := R

Ω uv for any u, v ∈ L ² (Ω), and the bilinear form is defined on H ₀ ¹ (Ω)×H ₀ ¹ (Ω) as

B[u, v] := hA∇u, ∇vi + hb · ∇u + c u, vi . (2.2) By Cauchy-Schwarz inequality one can easily show the continuity of the bilinear form

|B[u, v]| ≤ C B kuk _H

1

kvk _H

1

,

(3)

where C B depends only on the data. Combining Poincar´e inequality with the diver- gence free condition ∇·b = 0, one has coercivity

B[v, v] ≥ Z

Ω

a − |∇v| ² + cv ² ≥ c B kvk _H ²

1

,

where c B depends only on the data. Existence and uniqueness of (2.1) thus follows from Lax-Milgram theorem [5].

We define the energy norm on H ₀ ¹ (Ω) by |||v||| ² := B[v, v], which is equivalent to H ₀ ¹ (Ω)-norm k·k _H

1

. In fact we have

c B kvk _H ²

1

≤ |||v||| ² ≤ C B kvk _H ²

1

∀ v ∈ H ₀ ¹ (Ω). (2.3)

2.1. Discrete Solutions on Nested Meshes. Let {T H } be a shape regular family of nested conforming meshes over Ω: that is there exists a constant γ ^∗ such that

H T

ρ T ≤ γ ^∗ ∀ T ∈ [

H

T H , (2.4)

where, for each T ∈ T H , H T is the diameter of T , and ρ T is the diameter of the biggest ball contained in T ; the global meshsize is h H := max T ∈T

H

H T .

Let {V H } be a corresponding family of nested finite element spaces consisting of continuous piecewise polynomials over T H of fixed degree n ≥ 1, that vanish on the boundary. Let u H be a discrete solution of (2.1) satisfying

u H ∈ V H : B[u H , v H ] = hf, v H i ∀ v H ∈ V H . (2.5) Existence and uniqueness of this problem follows from Lax-Milgram theorem, since V H ⊂ H ₀ ¹ (Ω).

2.2. Quasi-Orthogonality. Consider two consecutive nested meshes T H ⊂ T h , i.e. T h is a refinement of T H . For the corresponding spaces V H ⊂ V h ⊂ H ₀ ¹ (Ω), let u h ∈ V h and u H ∈ V H be the discrete solutions. Since the bilinear form is non- symmetric, it is not a scalar product and the orthogonality relation between u − u H

and u h − u H , the so-called Pythagoras equality, fails to hold. We have instead a perturbation result referred to as quasi-orthogonality provided that the initial mesh is fine enough. This result is stated below and the proof is given in section 4.

Lemma 2.1 (Quasi-orthogonality). Let f ∈ L ² (Ω). There exist a constant C ^∗ > 0, solely depending on the shape regularity constant γ ^∗ and coercivity constant c B , and a number 0 < s ≤ 1 dictated only by the angles of ∂Ω, such that if the meshsize h 0 of the initial mesh satisfies C ^∗ h ^s ₀ kbk _L

∞

< 1, then

|||u − u h ||| ² ≤ Λ 0 |||u − u H ||| ² − |||u h − u H ||| ² , (2.6) where Λ 0 := (1 − C ^∗ h ^s ₀ kbk _L

∞

) ⁻¹ . The equality holds provided b = 0 in Ω.

3. Adaptive Algorithm. The Adaptive procedure consists of loops of the form SOLVE → ESTIMATE → MARK → REFINE.

The procedure SOLVE solves (2.5) for the discrete solution u H . The procedure ES-

TIMATE determines the element indicators η H (T ) and oscillation osc H (T ) for all

(4)

elements T ∈ T H . Depending on their relative size, these quantities are later used by the procedure MARK to mark elements T , and thereby create a subset b T H of T H of elements to be refined. Finally, procedure REFINE partitions those elements in b T H

and a few more to maintain mesh conformity. These procedures are discussed more in detail below.

3.1. Procedure SOLVE : Linear Solver. We employ linear solvers, either direct or iterative methods, such as preconditioned GMRES, CG, and BICG, to solve linear system (2.5). In other words, given a mesh T k , an initial guest u k−1 for the solution, and the data A, b, c, f , SOLVE computes the discrete solution

u k := SOLVE(T k , u k−1 , A, b, c, f )

3.2. Procedure ESTIMATE : A Posteriori Error Estimate. Subtracting (2.5) from (2.1), we have the Galerkin orthogonality

B[u − u H , v H ] = 0 ∀ v H ∈ V H . (3.1) In addition to T H , let S H denote the set of interior faces (edges or sides) of the mesh (triangulation) T H . We consider the residual R(u H ) ∈ H ⁻¹ (Ω) defined by

R(u H ) := f + ∇·(A∇u H ) − b · ∇u H − c u H ,

and its relation to the error L(u − u H ) = R(u H ). It is then clear that to estimate

|||u − u H ||| we can equivalently deal with kR(u H )k _H

−1

(Ω) . To this end, we integrate by parts elementwise the bilinear form B[u − u _H , v] to obtain the error representation formula

B[u − u H , v] = X

T ∈T

H

Z

T

R T (u H )v + X

S∈S

H

Z

S

J S (u H )v ∀ v ∈ H ₀ ¹ (Ω), (3.2) where the element residual R _T (u _H ) and the jump residual J _S (u _H ) are defined as

R T (u H ) := f + ∇·(A∇u H ) − b · ∇u H − c u H in T ∈ T H , (3.3) J _S (u _H ) := −A∇u ⁺ _H · ν ⁺ − A∇u ⁻ _H · ν ⁻ := [[A∇u _H ]] _S · ν _S on S ∈ S _H , (3.4) where S is the common side of elements T ⁺ and T ⁻ with unit outward normals ν ⁺ and ν ⁻ , respectively, and ν S = ν ⁻ . Whenever convenient, we will use the abbreviations R T = R T (u H ) and J S = J S (u H ).

3.2.1. Upper Bound. For T ∈ T H , we define the local error indicator η H (T ) by

η H (T ) ² := H _T ² kR T (u H )k _L ²

2

(T ) + X

S⊂∂T

H S kJ S (u H )k ² _L

2

(S) . (3.5) Given a subset ω ⊂ Ω, we define the error estimator η H (ω) by

η H (ω) ² := X

T ∈T

H

, T ⊂ω

η H (T ) ² .

Hence, η H (Ω) is the error estimator of Ω with respect to the mesh T H . Using (3.1),(3.2) and properties of Cl´ement interpolation, as shown in [1, 3, 12], we obtain the upper bound of the error in terms of the estimator,

|||u − u H ||| ² ≤ C 1 η H (Ω) ² , (3.6)

where the constant C 1 > 0 depends only on the shape regularity γ ^∗ , coercivity con-

stant c B and continuity constant C B of the bilinear form.

(5)

3.2.2. Lower Bound. Using the explicit construction of Verf¨ urth [1, 12] via bubble functions and positivity and continuity of A, we can get a local lower bound of the error in terms of local indicators and oscillation. That is, there exist constants C 2 , C 3 > 0, depending only on the shape regularity γ ^∗ , C B , and c B , such that

C 2 η H (T ) ² − C 3

X

T ⊂ω

T

H _T ² °

°R _T −R T

° ° ²

L

²

(T ) ≤ ku − u H k ² _H

1

(ω

T

) , (3.7) where the domain ω T consists of all elements sharing at least a side with T , and R T

is a polynomial approximation of R _T on T . We define the oscillation on the elements T ∈ T H by

osc H (T ) ² := H _T ² °

°R T − R T

° ° ²

L

²

(T ) , (3.8)

and for a subset ω ⊂ Ω, we define

osc H (ω) ² := X

T ∈T

H

, T ⊂ω

osc H (T ) ² .

Remark 3.2.1. We see from (3.7) that if the oscillation osc H (ω T ) is small compared to the indicator η H (T ), then the size of the indicator η H (T ) will give reliable information about the size of the error ku − u H k _H

1

(ω

T

) . This explains why refining elements with large indicators usually tends to equi-distribute the errors, which is an ultimate goal of adaptivity. This idea is employed by the procedure MARK of §3.3.

Remark 3.2.2. The oscillation osc H (T ) does not involve oscillation of the jump residual J S (u H ) as is customary [1, 12]. This result follows from the positivity and continuity of A, and is explained in §4.2.

Remark 3.2.3. The oscillation osc H (T ) depends on R T = R T (u H ) which in turn depends on the discrete solution u H . This is a fundamental difference with Morin et al. [6, 7], where the oscillation is purely data oscillation. It is not clear now that the oscillation will decrease when the mesh T H will be refined because u H will also change. Controlling the decay of osc H (T ) is thus a major challenge addressed in this work; see §3.3 and §3.4. It is not possible to show that oscillation will always decrease as the mesh gets refined as in [6, 7].

For a given mesh T H and discrete solution u H , along with input data A, b, c and f , the procedure ESTIMATE computes indicators η H (T ) and oscillations osc H (T ) for all elements T ∈ T H according to (3.5) and (3.8):

{η H (T ), osc H (T )} _{T ∈T}

_H

= ESTIMATE(T H , u H , A, b, c, f )

3.3. Procedure MARK . Our goal is to devise a marking procedure, namely to identify a subset b T H of the mesh T H such that after refining, both error and oscillation will be reduced. We use two strategies for this: Marking Strategy E deals with the error estimator, and Marking Strategy O does so with the oscillation.

3.3.1. Marking Strategy E : Error Reduction. This strategy was intro- duced by D¨orfler [4] to enforce error reduction:

Marking Strategy E : Given a parameter 0 < θ < 1, construct a subset b T H

of T H such that

X

T ∈ b T

H

η H (T ) ² ≥ θ ² η H (Ω) ² , (3.9)

and mark all elements in b T H for refinement.

(6)

We will see later that Marking Strategy E guarantees error reduction in the absence of oscillation terms. Since the latter account for information missed by the averaging process associated with the finite element method, we need a separate procedure to guarantee oscillation reduction.

3.3.2. Marking Strategy O : Oscillation Reduction. This procedure was introduced by Morin et al. [6, 7] as a separate means for reducing oscillation:

Marking Strategy O : Given a parameter 0 < ˆ θ < 1 and the subset b T H ⊂ T H

produced by Marking Strategy E, enlarge b T H such that X

T ∈ b T

H

osc H (T ) ² ≥ ˆ θ ² osc H (Ω) ² , (3.10)

and marked all elements in b T H for refinement.

Given a mesh T H and all information about the local error indicators η H (T ), and oscillation osc H (T ), together with user parameters θ and ˆ θ, MARK generates a subset T b H of T H

T b H = MARK(θ, ˆ θ ; T H , {η H (T ), osc H (T )} _{T ∈T}

_H

)

3.4. Procedure REFINE. The following Interior Node Property, due to Morin et al [6, 7], is known to be necessary for error and oscillation reduction:

Interior Node Property : Refine each marked element T ∈ b T H to obtain a new mesh T h compatible with T H such that

T and the d + 1 adjacent elements T ⁰ ∈ T H of T , as well as their common sides, contain a node of the finer mesh T h in their interior.

In addition to the Interior Node Property, we assume that the refinement is done in such a way that the new mesh T h is conforming, which guarantees that both T H and T h are nested. With this property, we have a reduction factor γ 0 < 1 of element size, i.e. if T ∈ T h is obtained by refining T ⁰ ∈ b T H , then h T ≤ γ 0 H T

⁰

. For example, when d = 2 with triangular elements, to have Interior Node Property we can use 3 newest bisections for each single refinement step, whence γ 0 ≤ 1/2.

Given a mesh T H and a marked set b T H , REFINE constructs the refinement T h

satisfying the Interior Node Property:

T h = REFINE(T H , b T H )

Combining the marking strategies of §3.3 with the Interior Node Property, we obtain the following two crucial results whose proofs are given in §4.

Lemma 3.1 (Error Reduction). There exist constants C 4 and C 5 , only depending on the shape regularity constant γ ^∗ and θ, such that

η H (T ) ² ≤ C 4 ku h − u H k ² _H

1

(ω

T

) + C 5 osc H (ω T ) ² ∀ T ∈ b T H . (3.11)

We realize that the local energy error between consecutive discrete solutions is

bounded below by the local indicators for elements in the marked set b T H , provided

the oscillation term is relatively small.

(7)

Lemma 3.2 (Oscillation Reduction). There exist constants 0 < ρ 1 < 1 and 0 < ρ 2 , only depending on γ ^∗ and ˆ θ, such that

osc h (Ω) ² ≤ ρ 1 osc H (Ω) ² + ρ 2 |||u h − u H ||| ² . (3.12) We have that oscillation reduces with a factor ρ 1 < 1 provided the energy error between consecutive discrete solutions is relatively small.

Remark 3.4.1 (Coupling of error and oscillation). Lemmas 3.1 and 3.2 seem to lead to conflicting demands on the relative sizes of error and oscillation. These two concepts are indeed coupled, which contrasts with [6, 7] where oscillation just depends on data and reduces separately from the error. This suggests that we must handle them together, this being the main contribution of this paper. We make this assertion explicit in Theorem 1 below.

3.5. Adaptive Algorithm AFEM. The adaptive algorithm consists of the loops of procedures SOLVE, ESTIMATE, MARK, and REFINE, consecutively, given that the parameters θ and ˆ θ are chosen according to Marking Strategies E and O.

AFEM Choose parameters 0 < θ, ˆ θ < 1.

1. Pick an initial mesh T 0 , initial guest u −1 = 0, and set k = 0.

2. u k = SOLVE(T k , u k−1 , A, b, c, f ).

3. {η k (T ), osc k (T )} _{T ∈T}

k

= ESTIMATE(T k , u k , A, b, c, f )).

4. c T k = MARK(θ, ˆ θ ; T k , {η k (T ), osc k (T )} _{T ∈T}

_k

).

5. T k+1 = REFINE(T k , c T k ).

6. Set k = k + 1 and go to Step 2.

Theorem 1 (Convergence of AFEM). Let {u k } _k∈N

₀

be a sequence of finite ele- ment solutions corresponding to a sequence of nested finite element spaces ©

V ^k ª

k∈N

0

produced by AFEM. There exist constants σ, γ > 0, and 0 < ξ < 1, depending solely on the mesh regularity constant γ ^∗ , data, parameters θ and ˆ θ, and a number 0 < s ≤ 1 dictated by angles of ∂Ω, such that if the initial meshsize h 0 satisfies h ^s ₀ kbk _L

∞

< σ, then for any two consecutive iterates k and k + 1, we have

|||u − u k+1 ||| ² + γ osc k+1 (Ω) ² ≤ ξ ²

³

|||u − u k ||| ² + γ osc k (Ω) ²

´

. (3.13) Therefore AFEM converges with a linear rate ξ, namely,

|||u − u k ||| ² + γ osc k (Ω) ² ≤ C 0 ξ ^2k , where C 0 := |||u − u 0 ||| ² + γ osc 0 (Ω) ² .

Proof. We just prove the contraction property (3.13), which obviously implies the decay estimate. For convenience, we introduce the notation

e k := |||u − u k ||| , ε k := |||u k+1 − u k ||| , osc k := osc k (Ω) .

The idea is to use the quasi-orthogonality (2.6) and replace the term |||u k+1 − u k ||| ²

using new results of error and oscillation reduction estimates (3.11) and (3.12). We

proceed in three steps as follows.

(8)

1. We first get a lower bound for ε k in terms of e k . To this end, we use Marking Strategy E and the upper bound (3.6) to write

θ ² e ² _k ≤ C 1 θ ² η k (Ω) ² ≤ C 1

X

T ∈c T

k

η k (T ) ² .

Adding (3.11) of Lemma 3.1 over all marked elements T ∈ c T k , and observing that each element can be counted at most D := d + 2 times due to overlap of the sets ω T , together with kvk _H ²

1

≤ c ⁻¹ _B |||v||| ² for all v ∈ H ₀ ¹ (Ω), we arrive at

θ ² e ² _k ≤ DC 1 C 4

c B ε ² _k + DC 1 C 5 osc ² _k . If Λ 1 := _DC ^θ

²

^c

^B

1

C

4

, Λ 2 := ^C _C

⁵

^c

^B

4

, then this implies the lower bound for ε ² _k ,

ε ² _k ≥ Λ 1 e ² _k − Λ 2 osc ² _k . (3.14) 2. If h 0 is sufficiently small so that the quasi-orthogonality (2.6) of Lemma 2.1 holds, then

e ² _k+1 ≤ Λ 0 e ² _k − ε ² _k ,

where Λ 0 = (1−C ^∗ h ^s ₀ kbk _L

∞

) ⁻¹ . Replacing the fraction βε ² _k of ε ² _k via (3.14) we obtain e ² _k+1 ≤ (Λ ₀ − βΛ ₁ )e ² _k + βΛ ₂ osc ² _k − (1 − β)ε ² _k ,

where 0 < β < 1 is a constant to be chosen suitably. We now assert that it is possible to chose h 0 compatible with Lemma 2.1 and also that

0 < α := Λ 0 − βΛ 1 < 1.

A simple calculation shows that this is the case provided h ^s ₀ kbk _L

∞

< βΛ 1

C ^∗ (1 + βΛ 1 ) < 1 C ^∗ , i.e., h ^s ₀ kbk _L

∞

< σ with σ := _C

∗

(1+βΛ ^βΛ

¹ 1

) . Consequently

e ² _k+1 ≤ αe ² _k + βΛ 2 osc ² _k − (1 − β)ε ² _k . (3.15) 3. To remove the last term of (3.15) we resort to the oscillation reduction estimate of Lemma 3.2

osc ² _k+1 ≤ ρ 1 osc ² _k + ρ 2 ε ² _k . We multiply it by (1 − β)/ρ 2 and add it to (3.15) to deduce

e ² _k+1 + 1 − β

ρ 2 osc ² _k+1 ≤ α e ² _k + µ

βΛ 2 + ρ 1

ρ 2 (1 − β)

¶ osc ² _k .

If γ := ^1−β _ρ

2

, then we would like to choose β < 1 in such a way that

βΛ 2 + ρ 1 γ = µγ

(9)

for some µ < 1. A simple calculation yields

β =

µ−ρ

1

ρ

2

Λ 2 + ^µ−ρ _ρ

¹

2

,

and shows that ρ 1 < µ < 1 guarantees that 0 < β < 1. Therefore, e ² _k+1 + γ osc ² _k+1 ≤ α e ² _k + µγ osc ² _k ,

and the asserted estimate (3.13) follows upon taking ξ = max(α, µ) < 1. ¤ Remark 3.5.1 (Comparison with [6, 7]). In [6, 7] the oscillation is independent of discrete solutions, i.e. ρ 2 = 0, and is reduced by the factor ρ 1 < 1 in (3.12).

Consequently, Step 3 above is avoided by setting β = 1 and the decay of e k and osc k is monitored separately. Since this is no longer possible, e k and osc k are now combined and decreased together.

Remark 3.5.2 (Splitting of ε k ). The idea of splitting ε k is already used by Chen and Jia [2] in examining one time step for the heat equation. This is because a mass (zero order) term naturally occurs, which did not take place in [6, 7]. The elliptic operator is just the Laplacian in [2].

Remark 3.5.3 (Effect of Convection). Assuming that h ^s ₀ kbk _L

∞

< σ implies that the local Peclet number is sufficiently small for the Galerkin method not to exhibit oscillations. This appears to be essential for u 0 to contain relevant information and guide correctly the adaptive process. This restriction is difficult to verify in practice because it involves unknown constants. However, starting from a coarse mesh does not seem to be a problem in practice (see numerical experiments in §5).

Remark 3.5.4 (Vanishing Convection). If b = 0, then Theorem 1 has no restriction on the initial mesh. This thus extends the convergent result of Morin et al. [6, 7] to variable diffusion coefficient and zero order terms.

Remark 3.5.5 (Optimal β). The choice of β can be optimized. In fact, we can easily see that

α = Λ 0 − βΛ 1 , µ = ρ 1 + β 1 − β ρ 2 Λ 2

yields a unique value 0 < β ∗ < 1 for which α = µ and the contraction constant ξ of Theorem 1 is minimal. This β ∗ depends on geometric constant Λ 0 , Λ 1 , Λ 2 as well on θ, ˆ θ and h 0 , but it is not computable.

4. Proofs of Lemmas. Let b T H ⊂ T H be a set of marked elements obtained from procedure MARK. Let T h be a refined mesh obtained from procedure REFINE, and let V _H ⊂ V _h be nested spaces corresponding to compatible meshes T _H and T _h , respectively. For convenience, set

e h := u − u h , e H := u − u H , ε H := u h − u H .

4.1. Proof of Lemma 2.1: Quasi-Orthogonality. In view of Galerkin orthogonality (3.1), namely B[e h , v h ] = 0, v h ∈ V h , we have

|||e H ||| ² = |||e h ||| ² + |||ε H ||| ² + B[ε H , e h ].

(10)

If b = 0, then B is symmetric and B[ε H , e h ] = B[e h , ε H ] = 0. For b 6= 0, instead, B[ε H , e h ] 6= 0, and we must account for this term. It is easy to see that ∇·b = 0 and integration by parts yield

B[ε H , e h ] = B[e h , ε H ] + hb · ∇ε H , e h i − hb · ∇e h , ε H i = 2 hb · ∇ε H , e h i . Hence

|||e h ||| ² = |||e H ||| ² − |||ε H ||| ² − 2 hb · ∇ε H , e h i .

Using Cauchy-Schwarz inequality and replacing the H ¹ (Ω)-norm by the energy norm we have, for any δ > 0 to be chosen later,

−2 hb · ∇ε H , e h i ≤ δ ke h k _L ²

2

+ kbk _L ²

∞

δc B |||ε H ||| ² .

We then realize the need to relate L ² (Ω) and energy norms to replace ke h k _L

2

by |||e h |||.

This requires a standard duality argument whose proof is reported in Ciarlet [3].

Lemma 4.1 (Duality). Let f ∈ L ² (Ω) and u ∈ H ^1+s (Ω) for some 0 < s ≤ 1 be the solution, where s depends on the angles of ∂Ω (s = 1 if Ω is convex). Then, there exists a constant C 6 , depending only on the shape regularity constant γ ^∗ and the coercivity constant c B , such that

ke h k _L

2

≤ C 6 h ^s |||e h ||| . (4.1) Inserting this estimate in the preceding two bounds, and using h ≤ h 0 , the meshsize of the initial mesh, we deduce

¡ 1 − δC ₆ ² h ^2s ₀ ¢

|||e h ||| ² ≤ |||e H ||| ² − Ã

1 − kbk _L ²

∞

δc B

!

|||ε H ||| ² .

We now choose δ = _C ^kbk

^L∞

6

√

c

B

h

^s₀

to equate both parenthesis, as well as the assumption that h 0 is sufficiently small for δC ₆ ² h ^2s ₀ = C ^∗ h ^s ₀ kbk _L

∞

< 1 with C ^∗ := C 6 / √

c B . We end up with

|||e h ||| ² ≤ 1

1 − C ^∗ kbk _L

∞

h ^s ₀ |||e H ||| ² − |||ε H ||| ² .

This implies (2.6) and concludes the proof. ¤

4.2. Proof of Lemma 3.1 : Error Reduction. Upon restricting the test function v in (3.2) to V h ⊃ V H , we obtain the error representation

B[ε H , v h ] = X

T ∈T

H

Z

T

R T v h + Z

T

(R T − R T )v h + X

S∈S

H

Z

S

J S v h ∀ v h ∈ V h , (4.2)

where we use the abbreviations R T = R T (u H ), J S = J S (u H ), and R T = Π ⁿ⁻¹ _T R T

denotes the L 2 -projection of R T onto the space of polynomials P n−1 (T ) over the

element T ∈ T H . Except for avoiding the oscillation terms of the jump residual J S ,

the proof goes back to [4, 6, 7]. We proceed in three steps.

(11)

1. Interior Residual. Let T ∈ T H , and let x T be an interior node of T generated by the procedure REFINE. Let ψ T ∈ V h be a bubble function which satisfies ψ T (x T ) = 1, vanishes on ∂T , and 0 ≤ ψ T ≤ 1; hence supp (ψ T ) ⊂ T . Since R T ∈ P n−1 (T ) and ψ T > 0 in a polyhedron of measure comparable with that of T , we have

C °

°R T

° ° ²

L

²

(T ) ≤ Z

T

ψ T R T 2

= Z

T

R T (ψ T R T ).

Since ψ T R T is a piecewise polynomial of degree ≤ n over T h , it is thus an admissible test function in (4.2) which vanishes outside T (and in particular on all S ∈ S H ).

Therefore C °

°R _T °

° ²

L

²

(T ) ≤ B[ε H , ψ T R T ] + Z

T

(R T − R T )ψ T R T

≤ C

³

H _T ⁻¹ kε H k _H

1

(T ) + °

°R _T − R T

° °

L

²

(T )

´ ° °R _T °

° L

²

(T ) ,

because of an inverse inequality for ψ T R T . This, together with the triangle inequality, yields the desired estimate for H _T ² kR T k _L ²

2

(T ) :

H _T ² kR T k _L ²

2

(T ) ≤ C ³

kε H k _H ²

1

(T ) + H _T ² °

°R T − R T

° ° ²

L

²

(T )

´

. (4.3)

2. Jump Residual. Let S ∈ S H be an interior side of T 1 ∈ b T H , and let T 2 ∈ T H be the other element sharing S. Let x S be an interior node of S created by the procedure REFINE. Let ψ S ∈ V h be a bubble function in ω S := T 1 ∪ T 2 such that ψ S (x S ) = 1, ψ S vanishes on ∂ω S , and 0 ≤ ψ S ≤ 1; hence supp (ψ S ) ⊂ ω S .

Since u H is continuous, then [[∇u H ]] _S is parallel to ν S , i.e. [[∇u H ]] _S = j S ν S . Moreover, the coefficient matrix A(x) being continuous implies

J S = A(x) [[∇u H ]] _S · ν S = j S A(x)ν S · ν S = a(x) j S ,

where a(x) := A(x)ν S · ν S satisfies 0 < a _S ≤ a(x) ≤ a S with a _S , a S the smallest and largest eigenvalues of A(x) on S. Consequently,

kJ S k _L ²

2

(S) ≤ a ² _S Z

S

j _S ² ≤ Ca ² _S Z

S

j _S ² ψ S ≤ C a ² _S a _S

Z

S

(j S ψ S )J S , (4.4)

where the second inequality follows from j S being a polynomial and ψ S > 0 in a polygon of measure comparable with that of S.

We now extend j S to ω S by first mapping to the reference element, next extending constantly along the normal to ˆ S and finally mapping back to ω S . The resulting extension E _h (j _S ) is a piecewise polynomial of degree ≤ n−1 in ω _S so that ψ _S E _h (j _S ) ∈ V h , and satisfies kψ S E h (j S )k _L

2

(ω

S

) ≤ CH _S ^1/2 kj S k _L

2

(S) . Since v h = ψ S E h (j S ) is an admissible test function in (4.2) which vanishes on all sides of S H but S, we arrive at

Z

S

J _S (j _S ψ _S ) = B[ε _H , v _h ] − Z

T

1

R _T

₁

v _h − Z

T

2

R _T

₂

v _h

≤ C Ã

H _S ^−1/2 kε _H k _H

1

S

(ω

S

) + H _S ^1/2 X 2 i=1

kR _T

_i

k _L

2

(T

i

)

!

kj _S k _L

2

(S) .

(4.5)

(12)

Therefore

H S kJ S k _L ²

2

(S) ≤ C Ã

kε H k ² _H

1

(ω

S

) + X 2 i=1

H _T ²

_i

kR T

i

k ² _L

2

(T

i

)

!

. (4.6)

3. Final Estimate. To remove the interior residual from the right hand side of (4.6) we resort to (4.3) since T 1 and T 2 contain an interior node according to procedure REFINE. Hence

H S kJ S k _L ²

2

(S) ≤ C Ã

kε H k ² _H

1

(ω

S

) + X 2 i=1

H _T ²

_i

°

°R _T

_i

− R T

i

° ° ²

L

²

(T

i

)

!

. (4.7)

The asserted estimate for η H (T ) ² is thus obtained by adding this bound to (4.3).

The constant C depends on the shape regularity constant γ ^∗ and the ratio a ² _S /a _S of

eigenvalues of A(x) on S. ¤

Remark 4.2.1 (Positivity). The use of A(x) being positive definite in (4.4) avoids having oscillation terms on S. This comes at the expense of a constant de- pending on a ² _S /a _S . If we were to proceed in the usual manner, as in [1, 8, 12], we would end up with oscillation of the form

H _S ^1/2 k(A(x) − A(x S )) [[∇u H ]] _S · ν S k _L

2

(S) = H _S ^1/2 k(a(x) − a(x S ))j S k _L

2

(S)

≤ CH _S ^3/2 kAk _W

1

∞

(S) kj S k _L

2

(S)

≤ CH S

° °

°H _S ^1/2 J S

° °

° L

²

(S) ,

where C > 0 also depends on the ratio a S /a _S dictated by the variation of a(x) on S. This oscillation can be absorbed into the term H _S ^1/2 kJ S k _L

2

(S) provided that the meshsize H S is sufficiently small; see [8]. We do not need this assumption in our present discussion.

Remark 4.2.2 (Continuity of A). The continuity of A is instrumental in avoid- ing jump oscillation which in turn makes computations simpler. However, jump oscil- lation cannot be avoid when A exhibits discontinuities across inter-element boundaries of the initial mesh. We get instead of (4.7)

CH S kJ S k _L ²

2

(S) ≤ kε H k ² _H

1

(ω

_S

) + X 2 i=1

H _T ²

_i

°

°R _T

_i

−R T

i

° ° ²

L

²

(T

i

) +H S

° °J _S −J S

° ° ²

L

²

(S) , (4.8) where J S is L 2 -projection of J S onto P n−1 (S). To obtain estimate (4.8) we proceed as follows. Starting from a polynomial J S , we get an estimate similar to that of (4.4)

C °

°J _S °

° ²

L

²

(S) ≤ Z

S

ψ S J S 2 =

Z

S

J S (ψ S J S ) + Z

S

(J S − J S )(ψ S J S ). (4.9) In contrast to (4.4), we see that the oscillation term (J S − J S ) cannot be avoided when A has a discontinuity across S. We estimate the first term on the right hand side of (4.9) exactly as we have argued with (4.5) and thereby arrive at

Z

S

J S (J S ψ S ) ≤ C Ã

H _S ^−1/2 kε H k _H

1

S

(ω

S

) + H _S ^1/2 X 2 i=1

kR T

i

k _L

2

(T

i

)

! °

°J _S °

° L

²

(S) .

(13)

This and a further estimate of the second term on the right hand side of (4.9), yield

H S

° °J S

° ° ²

L

²

(S) ≤ C Ã

kε H k ² _H

1

(ω

S

) + X 2

i=1

H _T ²

_i

kR T

i

k ² _L

2

(T

i

) + H S

° °J S − J S

° ° ²

L

²

(S)

! ,

whence the assertion (4.8) follows using triangle inequality for kJ S k _L

2

(S) . Combining with (4.3), we deduce an estimate for η H (T ) similar to (3.11), namely,

η H (T ) ² ≤ C

³

kε H k ² _H

1

(ω

T

) + osc H (ω T ) ²

´ , with the new oscillation term involving jumps on interior sides

osc H (T ) ² := H _T ² °

°R T − R T

° ° ²

L

²

(T ) + X

S⊂∂T

H S

° °J S − J S

° ° ²

L

²

(S) . (4.10) In §6.1 we discuss the case of a discontinuous A. We show an oscillation reduction property of osc H (T ), defined by (4.10), similar to Lemma 3.2.

4.3. Proof of Lemma 3.2 : Oscillation Reduction. The proof hinges on the Marking Strategy O and the Interior Node Property. We point out that if T ∈ T h is contained in T ⁰ ∈ b T H , then REFINE gives a reduction factor γ 0 < 1 of element size:

h T ≤ γ 0 H T

⁰

. (4.11)

The proof proceeds in three steps as follows.

1. Relation between Oscillations. We would like to relate osc h (T ⁰ ) and osc H (T ⁰ ) for any T ⁰ ∈ T H . To this end, we note that for all T ∈ T h contained in T ⁰ , we can write

R T (u h ) = R T (u H ) − L T (ε H ) in T, where ε H = u h − u H as before and

L _T (ε _H ) := −∇·(A∇ε _H ) + b · ∇ε _H + c ε _H in T.

By Young’s inequality, we have for all δ > 0 osc h (T ) ² = h ² _T

° °

°R T (u h ) − R T (u h )

° °

° ²

L

²

(T )

≤ (1+δ)h ² _T

° °

°R T (u H )−R T (u H )

° °

° ²

L

²

(T ) +(1+δ ⁻¹ )h ² _T

° °

°L T (ε H )−L T (ε H )

° °

° ²

L

²

(T ) , where R T (u h ), R T (u H ), and L T (ε H ) are L ² -projections of R T (u h ), R T (u H ), and L T (ε H ) onto polynomials of degree ≤ n − 1 on T . We next observe that

° °

°L T (ε H ) − L T (ε H )

° °

° L

²

(T ) ≤ kL T (ε H )k _L

2

(T )

and that, according to (4.11),

h T ≤ γ T

⁰

H T

⁰

(14)

provided γ T

⁰

= γ 0 if T ⁰ ∈ b T H and γ T

⁰

= 1 otherwise. Therefore, if T h (T ⁰ ) denotes all T ∈ T h contained in T ⁰ ,

osc h (T ⁰ ) ² = X

T ∈T

h

(T

⁰

)

osc h (T ) ²

≤ (1 + δ)γ ² _T

0

osc H (T ⁰ ) ² + (1 + δ ⁻¹ ) X

T ∈T

h

(T

⁰

)

h ² _T kL T (ε H )k _L ²

2

(T ) , (4.12)

because R T (u H ) = R T

⁰

(u H ) and R T (u H ) is the best approximation of R T

⁰

(u H ) in T . 2. Estimate of L T (ε H ). In order to estimate kL T (ε H )k _L

2

(T ) in terms of kε H k _H

1

(T ) , we first split it as follows

kL T (ε H )k _L

2

(T ) ≤ k∇·(A∇ε H )k _L

2

(T ) + kb · ∇ε H k _L

2

(T ) + kc ε H k _L

2

(T )

and denote these terms N A , N B , and N C , respectively. Since N A ≤ k(∇·A) · ∇ε H k _L

2

(T ) + kA : H(ε H )k _L

2

(T )

where H(ε H ) is the Hessian of ε H in T , invoking the Lipschitz continuity of A together with an inverse estimate in T , we infer that

N A ≤ C A

³

k∇ε H k _L

2

(T ) + h ⁻¹ _T k∇ε H k _L

2

(T )

´ ,

where C A depends on A and the shape regularity constant γ ^∗ . Besides, we readily have

N B ≤ C B k∇ε H k _L

2

(T ) , N C ≤ C C kε H k _L

2

(T ) , where C B , C C depend on b, c. Combining these estimates, we arrive at

h ² _T kL T (ε H )k _L ²

2

(T ) ≤ C ∗ kε H k _H ²

1

(T ) . (4.13) 3. Choice of δ. We insert (4.13) into (4.12) and add over T ⁰ ∈ T H . Recalling the definition of γ T

⁰

and utilizing (3.10), we deduce

X

T

⁰

∈T

H

γ _T ²

0

osc H (T ⁰ ) ² = γ ₀ ² X

T

⁰

∈ b T

H

osc H (T ⁰ ) ² + X

T

⁰

∈T

H

\ b T

H

osc H (T ⁰ ) ²

= osc H (Ω) ² − (1 − γ ₀ ² ) X

T

⁰

∈ b T

H

osc H (T ⁰ ) ²

≤ (1 − (1 − γ ₀ ² )ˆ θ ² ))osc H (Ω) ² ,

where ˆ θ is the user’s parameter in (3.10). Moreover, since C ∗ kε H k ² _H

1

≤ C o |||ε H ||| ² with C o = C ∗ c ⁻¹ _B in light of (2.3), we end up with

osc h (Ω) ² ≤ (1 + δ)(1 − (1 − γ ₀ ² )ˆ θ ² )osc H (Ω) ² + (1 + δ ⁻¹ )C o |||ε H ||| ² . To complete the proof, we finally choose δ sufficiently small so that

ρ 1 = (1 + δ)(1 − (1 − γ ₀ ² )ˆ θ ² ) < 1, ρ 2 = (1 + δ ⁻¹ )C o . ¤

(15)

5. Numerical Experiments. We test performance of the adaptive algorithm AFEM with several examples. We are thus able to study how meshes adapt to various effects from lack of regularity of solutions and convexity of domains to data smooth- ness, boundary layers, changing boundary conditions, etc. For simplicity, we strict our experiments to the case of piecewise linear finite element solutions with polygo- nal domains in 2 dimensions. The implementation is done within the finite element toolbox ALBERT of Schmidt and Siebert [10, 11] which provides tools for adaptivity.

5.1. Implementation. We employ the four main procedures as given by Morin et al. [6, 7]: SOLVE, ESTIMATE, MARK, and REFINE. We slightly modified the built- in adaptive solver for elliptic problems of ALBERT toolbox [10] to make it work for the general PDE (1.1) and mixed boundary conditions, as follows:

• SOLVE. We used built-in solvers provided by ALBERT toolbox, such as GMRES, BICG, or CG.

• ESTIMATE. We modified ALBERT for computing the estimator so that it works for (1.1), and added procedures for computing oscillations which are not provided.

• MARK. We utilized the same algorithm introduced by Morin et al [6, 7] for finding a marked set b T H .

• REFINE. We employed 3 newest bisections for each refinement step to enforce the Interior Node Property.

For simplicity and convenience of presentation, we introduce the following nota- tion:

• DOF := number of elements in a given mesh, which is comparable with number of degree of freedoms;

• EOC := log(e(k−1)/e(k))

log(DOF(k)/DOF(k−1)) , experimental order of convergence, e(k) := ku − u k k _H

1

;

• EOC(η) := log(DOF(k)/DOF(k−1)) ^log(η

^k−1

^/η

^k

⁾ , experimental order of convergence of estimator, η k := η k (Ω);

• Z e := e(k)/e(k − 1) and Z o := osc k /osc k−1 , reduction factors of error and oscillation;

• Eff := η k /e k , effectivity index, i.e. the ratio between the estimator and error.

5.2. Experiment 1 : Oscillatory Coefficients and Nonconvex Domain.

We consider the PDE (1.1) with Dirichlet boundary condition u = g on the nonconvex L-shape domain Ω := [−1, 1] ² \[0, 1]×[−1, 0]. We also take the exact solution

u(r) = r

²³

sin( 2 3 θ),

where r ² := x ² + y ² and θ := tan ⁻¹ (y/x) ∈ [0, 2π). We deal with variable coefficients A(x, y) = a(x, y)I, b(x, y), c(x, y) defined by

a(x, y) = 1

4 + P (sin( ^2πx _² ) + sin( ^2πy _² )) , (5.1)

b(x, y) = (−y, x). (5.2)

c(x, y) = A c (cos ² (k 1 x) + cos ² (k 2 x)), (5.3) where P, ², A c , k 1 and k 2 are parameters. The functions f in (1.1) and g are defined accordingly. To see how AFEM performs comparing to standard uniform refinement, results of AFEM and standard uniform refinement are provided in Tables 5.1 and 5.2.

Some examples of adapted refined meshes from AFEM are also displayed in Figure

5.1. Observations and conclusions about AFEM performance are as follows:

(16)

DOF ku − u k k _H

1

EOC Z e Z o Eff 153 1.9800e-01 0.7766 0.6963 0.6537 0.5829 323 1.5470e-01 0.3303 0.7813 0.4862 0.7699 478 1.0859e-01 0.9028 0.7020 0.5570 0.7945 869 7.0835e-02 0.7148 0.6523 0.3853 0.8197 1404 5.6474e-02 0.4723 0.7973 0.4895 0.9184 2266 4.1940e-02 0.6216 0.7426 0.4038 0.9625 3689 3.1117e-02 0.6125 0.7419 0.6052 1.0033 7103 2.1326e-02 0.5767 0.6854 0.4379 1.0024 13729 1.4849e-02 0.5494 0.6963 0.5205 0.9849

Table 5.1

Experiment 1 (Oscillatory coefficients and nonconvex domain): The parameters of AFEM are θ = ˆ θ = 0.6, and those controlling the oscillatory coefficients are P = 1.8, ² = 0.4, A

c

= 1.0, k

1

= k

2

= 4.0, as described in (5.1)-(5.3). The experimental order of convergence EOC is close to the optimal value 0.5, which indicates quasi-optimal meshes. The oscillation reduction factor Z

o

is small than the error reduction factor, which confirms that oscillation decreases faster than error.

DOF ku − u k k _H

1

EOC Z e Z o

384 1.5833e-01 0.4224 0.5568 0.3431 1536 8.3427e-02 0.4622 0.5269 0.1772 6144 5.0608e-02 0.3606 0.6066 0.1851 24576 3.1809e-02 0.3350 0.6285 0.2458

Table 5.2

Experiment 1 (Oscillatory coefficients and nonconvex domain): Standard uniform refinement is performed using the same values for parameters P, ², A

c

, k

1

, and k

2

as that of AFEM given by Table 5.1 above. EOC is now suboptimal and close to the expected value 1/3.

Fig. 5.1. Experiment 1 (Oscillatory coefficients and nonconvex domain): Parameters of AFEM are θ = ˆ θ = 0.6, those of oscillatory coefficient A are P = 1.8, ² = 0.4, and both b and c vanish.

Key words. a posteriori error estimators, quasi-orthogonality, adaptive mesh refinement, error and oscillation reduction estimates, optimal meshes.

FOR GENERAL SECOND ORDER LINEAR ELLIPTIC PDE

KHAMRON MEKCHAY

AND RICARDO H. NOCHETTO

Key words. a posteriori error estimators, quasi-orthogonality, adaptive mesh refinement, error and oscillation reduction estimates, optimal meshes.

AMS subject classifications. 65N12, 65N15, 65N30, 65N50, 65Y20

1. Introduction and Main Result. Let Ω be a polyhedral bounded domain in R d , (d = 2, 3). We consider a general second order elliptic Dirichlet boundary value problem

Lu = −∇·(A∇u) + b · ∇u + c u = f in Ω, (1.1)

u = 0 on ∂Ω, (1.2)

with the following assumptions:

• A : Ω 7→ R d×d is Lipschitz and symmetric positive definite with smallest eigenvalue a − and largest eigenvalue a + , i.e.,

a − (x) |ξ| 2 ≤ A(x)ξ · ξ ≤ a + (x) |ξ| 2 , ∀ξ ∈ R d , x ∈ Ω; (1.3)

• b ∈ [L ∞ (Ω)] d is divergence free (∇·b = 0 in Ω);

• c ∈ L ∞ (Ω) is nonnegative (c ≥ 0 in Ω);

• f ∈ L 2 (Ω).

The purpose of this paper is to prove the following convergence results for adap- tive finite element methods (AFEM) for (1.1-1.2), and document their performance computationally.

Theorem 1 (Convergence of AFEM). Let {u k } k∈N

be a sequence of finite ele- ment solutions corresponding to a sequence of nested finite element spaces ©

V k ª

k∈N

produced by the AFEM of §3.5, which involves loops of the form SOLVE → ESTIMATE → MARK → REFINE.

There exist constants σ, γ > 0, and 0 < ξ < 1, depending solely on the shape regularity of meshes, the data, the parameters used by AFEM, and a number 0 < s ≤ 1 dictated by the angles of ∂Ω, such that if the initial meshsize h 0 satisfies h s 0 kbk L

< σ, then for any two consecutive iterates k and k + 1 we have

|||u − u k+1 ||| 2 + γ osc k+1 (Ω) 2 ≤ ξ 2

³

|||u − u k ||| 2 + γ osc k (Ω) 2

´

. (1.4)

Department of Mathematics, University of Maryland, College Park, MD 20742, USA ([email protected]). Partially supported by NSF Grant DMS-0204670.

Department of Mathematics and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, USA ([email protected]). Partially supported by NSF Grant DMS-0204670.

1

Therefore, AFEM converges with a linear rate ξ, namely

|||u − u k ||| 2 + γ osc k (Ω) 2 ≤ C 0 ξ 2k , where C 0 := |||u − u 0 ||| 2 + γ osc 0 (Ω) 2 .

Hereafter, |||·||| denotes the energy norm induced by the operator L and osc(Ω), the oscillation term, stands for information missed by the averaging process associated to FEM. This convergence result extends those of Morin et al. [6, 7] in several ways:

• We deal with a full second order linear elliptic PDE with variable coefficients A, b and c, whereas in [6, 7] A is assumed to be piecewise constant and b and c to vanish.

• The underlying bilinear form B is non-symmetric due to the first order term b · ∇u.

Since B is no longer a scalar product as in [6, 7], the Pythagoras equality relating u, u k and u k+1 fails; we prove a quasi-orthogonality property instead.

• The oscillation terms depend on discrete solutions in addition to data. Therefore, oscillation and error cannot be reduced separately as in [6, 7].

• The oscillation terms do not involve the oscillation of the jump residuals. This is achieved by exploiting positivity and continuity of A.

• Since error and oscillation are now coupled, in order to prove convergence we need to handle them together. This leads to a novel argument and result, the contraction property (1.4), according to which both error and oscillation decrease together.

2. Discrete Solution and Quasi-Orthogonality. For an open set G ∈ R d we denote by H 1 (G) the usual Sobolev space of functions in L 2 (G) whose first derivatives are also in L 2 (G), endowed with the norm

kuk H

(G) := ³

kuk L

(G) + k∇uk L

(G)

´ 1/2

;

we use the symbols k·k H

and k·k L

when G = Ω. Moreover, we denote by H 0 1 (G) the space of functions in H 1 (G) that vanish on the boundary in the trace sense.

A weak solution of (1.1) and (1.2) is a function u satisfying

u ∈ H 0 1 (Ω) : B[u, v] = hf, vi ∀v ∈ H 0 1 (Ω), (2.1) where hu, vi := R

Ω uv for any u, v ∈ L 2 (Ω), and the bilinear form is defined on H 0 1 (Ω)×H 0 1 (Ω) as

B[u, v] := hA∇u, ∇vi + hb · ∇u + c u, vi . (2.2) By Cauchy-Schwarz inequality one can easily show the continuity of the bilinear form

|B[u, v]| ≤ C B kuk H

kvk H

,

where C B depends only on the data. Combining Poincar´e inequality with the diver- gence free condition ∇·b = 0, one has coercivity

B[v, v] ≥ Z

Ω

a − |∇v| 2 + cv 2 ≥ c B kvk H 2

,

where c B depends only on the data. Existence and uniqueness of (2.1) thus follows from Lax-Milgram theorem [5].

We define the energy norm on H 0 1 (Ω) by |||v||| 2 := B[v, v], which is equivalent to H 0 1 (Ω)-norm k·k H

. In fact we have

c B kvk H 2

≤ |||v||| 2 ≤ C B kvk H 2

∀ v ∈ H 0 1 (Ω). (2.3)

2.1. Discrete Solutions on Nested Meshes. Let {T H } be a shape regular family of nested conforming meshes over Ω: that is there exists a constant γ ∗ such that

H T

ρ T ≤ γ ∗ ∀ T ∈ [

H

T H , (2.4)

where, for each T ∈ T H , H T is the diameter of T , and ρ T is the diameter of the biggest ball contained in T ; the global meshsize is h H := max T ∈T

H T .

Let {V H } be a corresponding family of nested finite element spaces consisting of continuous piecewise polynomials over T H of fixed degree n ≥ 1, that vanish on the boundary. Let u H be a discrete solution of (2.1) satisfying

u H ∈ V H : B[u H , v H ] = hf, v H i ∀ v H ∈ V H . (2.5) Existence and uniqueness of this problem follows from Lax-Milgram theorem, since V H ⊂ H 0 1 (Ω).

and u h − u H , the so-called Pythagoras equality, fails to hold. We have instead a perturbation result referred to as quasi-orthogonality provided that the initial mesh is fine enough. This result is stated below and the proof is given in section 4.

< 1, then

1. Introduction and Main Result. Let Ω be a polyhedral bounded domain in R ^d , (d = 2, 3). We consider a general second order elliptic Dirichlet boundary value problem

• A : Ω 7→ R ^d×d is Lipschitz and symmetric positive definite with smallest eigenvalue a − and largest eigenvalue a + , i.e.,

a − (x) |ξ| ² ≤ A(x)ξ · ξ ≤ a + (x) |ξ| ² , ∀ξ ∈ R ^d , x ∈ Ω; (1.3)

• b ∈ [L ^∞ (Ω)] ^d is divergence free (∇·b = 0 in Ω);

• c ∈ L ^∞ (Ω) is nonnegative (c ≥ 0 in Ω);

• f ∈ L ² (Ω).

Theorem 1 (Convergence of AFEM). Let {u k } _k∈N

V ^k ª

There exist constants σ, γ > 0, and 0 < ξ < 1, depending solely on the shape regularity of meshes, the data, the parameters used by AFEM, and a number 0 < s ≤ 1 dictated by the angles of ∂Ω, such that if the initial meshsize h 0 satisfies h ^s ₀ kbk _L

|||u − u k+1 ||| ² + γ osc k+1 (Ω) ² ≤ ξ ²

|||u − u k ||| ² + γ osc k (Ω) ²

|||u − u k ||| ² + γ osc k (Ω) ² ≤ C 0 ξ ^2k , where C 0 := |||u − u 0 ||| ² + γ osc 0 (Ω) ² .

2. Discrete Solution and Quasi-Orthogonality. For an open set G ∈ R ^d we denote by H ¹ (G) the usual Sobolev space of functions in L ² (G) whose first derivatives are also in L ² (G), endowed with the norm

kuk _H

kuk _L

(G) + k∇uk _L

´ _1/2

we use the symbols k·k _H

and k·k _L

when G = Ω. Moreover, we denote by H ₀ ¹ (G) the space of functions in H ¹ (G) that vanish on the boundary in the trace sense.

u ∈ H ₀ ¹ (Ω) : B[u, v] = hf, vi ∀v ∈ H ₀ ¹ (Ω), (2.1) where hu, vi := R

Ω uv for any u, v ∈ L ² (Ω), and the bilinear form is defined on H ₀ ¹ (Ω)×H ₀ ¹ (Ω) as

|B[u, v]| ≤ C B kuk _H

kvk _H

a − |∇v| ² + cv ² ≥ c B kvk _H ²

We define the energy norm on H ₀ ¹ (Ω) by |||v||| ² := B[v, v], which is equivalent to H ₀ ¹ (Ω)-norm k·k _H

c B kvk _H ²

≤ |||v||| ² ≤ C B kvk _H ²

∀ v ∈ H ₀ ¹ (Ω). (2.3)

2.1. Discrete Solutions on Nested Meshes. Let {T H } be a shape regular family of nested conforming meshes over Ω: that is there exists a constant γ ^∗ such that

ρ T ≤ γ ^∗ ∀ T ∈ [

u H ∈ V H : B[u H , v H ] = hf, v H i ∀ v H ∈ V H . (2.5) Existence and uniqueness of this problem follows from Lax-Milgram theorem, since V H ⊂ H ₀ ¹ (Ω).

|||u − u h ||| ² ≤ Λ 0 |||u − u H ||| ² − |||u h − u H ||| ² , (2.6) where Λ 0 := (1 − C ^∗ h ^s ₀ kbk _L

) ⁻¹ . The equality holds provided b = 0 in Ω.

B[u − u H , v H ] = 0 ∀ v H ∈ V H . (3.1) In addition to T H , let S H denote the set of interior faces (edges or sides) of the mesh (triangulation) T H . We consider the residual R(u H ) ∈ H ⁻¹ (Ω) defined by

|||u − u H ||| we can equivalently deal with kR(u H )k _H

(Ω) . To this end, we integrate by parts elementwise the bilinear form B[u − u _H , v] to obtain the error representation formula

J S (u H )v ∀ v ∈ H ₀ ¹ (Ω), (3.2) where the element residual R _T (u _H ) and the jump residual J _S (u _H ) are defined as

η H (T ) ² := H _T ² kR T (u H )k _L ²

H S kJ S (u H )k ² _L

η H (ω) ² := X

η H (T ) ² .

|||u − u H ||| ² ≤ C 1 η H (Ω) ² , (3.6)

where the constant C 1 > 0 depends only on the shape regularity γ ^∗ , coercivity con-

C 2 η H (T ) ² − C 3

H _T ² °

°R _T −R T

° ° ²

(T ) ≤ ku − u H k ² _H

is a polynomial approximation of R _T on T . We define the oscillation on the elements T ∈ T H by

osc H (T ) ² := H _T ² °

° ° ²

osc H (ω) ² := X

osc H (T ) ² .

Remark 3.2.1. We see from (3.7) that if the oscillation osc H (ω T ) is small compared to the indicator η H (T ), then the size of the indicator η H (T ) will give reliable information about the size of the error ku − u H k _H