ANZIAM J. 59 (EMAC2017) pp.C97–C111, 2018 C97
A mixed finite element method for elliptic optimal control problems using a three-field
formulation
BP Lamichhane 1 A Kumar 2 B Kalyanaraman 3
Received 16 November 2017; revised 18 March 2018
Abstract
In this paper, we consider an optimal control problem governed by elliptic differential equations posed in a three-field formulation. Using the gradient as a new unknown we write a weak equation for the gradient using a Lagrange multiplier. We use a biorthogonal system to discretise the gradient, which leads to a very efficient numerical scheme.
A numerical example is presented to demonstrate the convergence of the finite element approach.
Subject class: 49J20, 65L20
Keywords: Optimal control; biorthogonal system; a priori error esti- mates
doi:10.21914/anziamj.v59i0.12643 , c Austral. Mathematical Soc. 2018. Published
July 3, 2018, as part of the Proceedings of the 13th Biennial Engineering Mathematics and
Applications Conference. issn 1445-8810. (Print two pages per sheet of paper.) Copies of
this article must not be made otherwise available on the internet; instead link directly to
the doi for this article.
Contents C98
Contents
1 Introduction C98
2 Finite element method C101
3 Variational discretisation: error estimates C104
4 Numerical results C108
References C109
1 Introduction
The cost functional of an optimal control problem governed by a partial differential equation often involves the solution as well as the gradient of the solution. In that situation, a better approximation of the gradient is obtained by using a mixed finite element method [1]. Recently, mixed finite element approaches have become quite popular to discretise optimal control problems involving elliptic partial differential equations [3–5].
In this paper, we apply a mixed finite element method to approximate the solution of an optimal control problem governed by a Poisson problem.
In contrast to previous approaches [3–5], which are based on a two-field
formulation of the Poisson problem, we use a three-field formulation of
the Poisson problem which allows us to use a biorthogonal system in the
discretisation. The use of a biorthogonal system allows us to statically
condense out all the extra degrees of freedom we have in the mixed formulation
leading to a very efficient finite element approach. The formulation is obtained
by introducing the gradient of the solution of Poisson equation as a new
unknown and writing an additional variational equation in terms of a Lagrange
multiplier. An efficient numerical scheme is obtained by using a biorthogonal
1 Introduction C99
system to discretise the space of the gradient of the solution and the Lagrange multiplier space in the discrete setting.
Let Ω be a bounded and convex domain in R
2and Γ the boundary of Ω. In the following we use the usual notations for the Sobolev space H
k(Ω) for positive integer k, and the associated norm on H
k(Ω) with L
2(Ω) = H
0(Ω) denoting the set of all real-valued square integrable functions defined on Ω [2].
Let α ∈ R be a parameter, y
d∈ L
2(Ω) the desired state and σ
d∈ [L
2(Ω)]
2the desired gradient. Consider the following optimal control problem
u∈U
min
adJ(y, u) := 1
2 ky − y
dk
20,Ω+ 1
2 kσ − σ
dk
20,Ω+ α
2 kuk
20,Ω, σ = ∇y, (1) subject to the partial differential equation
− ∇ · K∇y = Bu + f in Ω with y = 0 on Γ , (2) where K ∈ L
∞(Ω , R
2×2) is a real-valued, symmetric and positive definite matrix, and B is the bounded linear operator defined on the set of admissible set of controls U
adU
ad=
u ∈ L
2(Ω) : kuk
0,Ω6 M , so that for some positive constant b
kBuk
0,Ω6 bkuk
0,Ω, u ∈ U
ad.
The existence and uniqueness of the optimal control follows from the strict convexity [13]. Using B
∗as the adjoint operator of B the first order optimality condition can be formulated as
(αu + B
∗p , v − u) > 0, v ∈ U
ad, (3) where p is the adjoint state associated with u and it solves the following adjoint equation of finding p ∈ H
10(Ω) such that
Z
Ω
[K∇p · ∇q + (∇y − σ
d) · ∇q + (y − y
d)q] dx = 0, q ∈ H
10(Ω). (4)
1 Introduction C100
The variational inequality in equation (3) can be written as u(x) = P
[−M,M]− B
∗p α
, where [12, 13]
P
[a,b](f(x)) = max(a, min(b, f(x))).
We start with the following variational form of (2) to recast our problem as a three-field formulation:
min
y∈H10(Ω)
1 2 Z
Ω
K∇y · ∇y dx − Z
Ω
(f + Bu) y dx, (5)
and introduce the gradient of the state of the system as a new unknown σ = ∇y, and write its weak equation as
Z
Ω
(σ − 5y) · ψ dx = 0, ψ ∈ [L
2(Ω)]
2,
where ψ acts as a Lagrange multiplier. Using V = H
10(Ω), R = [L
2(Ω)]
2, and
`(y) = R
Ω
(f + Bu) y dx, we get a constrained minimisation problem, which leads to the following saddle point problem of finding (y, σ, φ) ∈ V × R × R such that
a((y, σ), (z, τ))+ b((z, τ), φ) = `(z), ˜ (z, τ) ∈ V × R,
b((y, σ), ψ) = 0, ψ ∈ R, (6)
where
a((y, σ), (z, τ)) = ˜ Z
Ω
Kσ · τ dx, b((y, σ), ψ) = Z
Ω
(σ − ∇y) · ψ dx.
Thus our optimal control problem is to find (y, σ, φ, u) ∈ V × R × R × U
adsuch that (6) is satisfied as well as the inequality (3), where p = ˜ y in (3) is
2 Finite element method C101
one field in the problem of finding three fields (˜ y, ˜ σ, ˜ φ) ∈ V × R × R such that Z
Ω
φ ˜ · ∇z dx − Z
Ω
(y − y
d)z dx = 0, z ∈ V, (7) Z
Ω
(K˜ σ + ˜ φ) · τ dx + Z
Ω
(σ − σ
d) · τ dx = 0, τ ∈ R, (8) Z
Ω
(˜ σ − ∇˜ y) · ψ dx = 0, ψ ∈ R. (9) Here the adjoint equations are derived from the Euler-Lagrange equations of the following minimisation problem
arg min
( ˜y, ˜σ)∈[V×R]
1 2 Z
Ω
K˜ σ · ˜ σ dx − ˜`(˜ y, ˜ σ)
, (10)
under the constraint Z
Ω
(˜ σ − ∇˜ y) · ψ dx = 0, ψ ∈ R, where
˜`(˜y, ˜σ) = Z
Ω
(y
d− y) ˜ y dx − Z
Ω
(σ − σ
d) · ˜ σ dx.
To get a stable discrete formulation we replace the bilinear form ˜ a(·, ·) by the bilinear form a(·, ·) [6] defined as
a((y, σ), (z, τ)) = 1 2
Z
Ω
(Kσ · τ + K∇y · ∇z) dx.
2 Finite element method
Let T
hbe a quasi-uniform partition of the domain Ω in triangles. We use the standard linear finite element space on the mesh T
hdefined as
S
h= {z
h∈ H
1(Ω) : z
h|
T∈ P
1(T ), T ∈ T
h}, V
h= S
h∩ H
10(Ω),
2 Finite element method C102
where P
1(T ) is the space of linear polynomials in T [2].
Let {ϕ
1, ϕ
2, · · · , ϕ
N} be the finite element basis for S
h. We now construct a set of basis functions {µ
1, µ
2, · · · , µ
N} of another finite element space W
hso that the basis functions of S
hand W
hsatisfy a condition of biorthogonality
relation Z
Ω
ϕ
iµ
jdx = c
jδ
ij, c
j6= 0, 1 6 i, j, 6 N, (11) where δ
ijis the Kronecker symbol, and c
ja scaling factor [10]. Hence the sets of basis functions of S
hand W
hform a biorthogonal system. The finite element space for the gradient of the solution is
L
h= [S
h]
2, and for the Lagrange multiplier is
M
h= [W
h]
2.
Our discrete problem is to find (y
h, σ
h, ϕ
h) ∈ V
h× L
h× M
hsuch that
a ((y
h, σ
h) , (z
h, τ
h)) + b ((z
h, τ
h) , ϕ
h) = ` (z
h) , (z
h, τ
h) ∈ V
h× L
h,
b ((y
h, σ
h) , ψ
h) = 0, ψ
h∈ M
h,
(12) We now introduce a projection operator Q
h: L
2(Ω) → S
hdefined as
Z
Ω
Q
hv µ
hdx = Z
Ω
vµ
hdx, µ
h∈ W
h, v ∈ L
2(Ω).
Due to the biorthogonality relation (11), Q
his well-defined, and Q
hv for v ∈ L
2(Ω) is given by
Q
hv = X
ni=1
R
Ω
µ
iv dx c
iϕ
i.
Using the notation that Q
his applied component-wise when applied to a
vector function the second equation of (12) leads to σ
h= Q
h(∇y
h). Hence
2 Finite element method C103
static condensation of the gradient of the solution and the Lagrange multiplier leads to the following problem of finding y
h∈ V
hsuch that
A
h(y
h, z
h) = `(z
h), z
h∈ V
h, where
A
h(y
h, z
h) = 1 2
Z
Ω
KQ
h∇y
h· Q
h∇z
hdx + Z
Ω
K∇y
h· ∇z
hdx
. Using the same approach to discretise the adjoint equations (7) – (9) we have the problem of finding (˜ y
h, ˜ σ
h, ˜ φ
h) ∈ V
h× L
h× M
hsuch that
Z
Ω
φ ˜
h· ∇z
hdx − Z
Ω
(y
h− y
d)z
hdx = 0, z
h∈ V
h, (13) Z
Ω
(K˜ σ
h+ ˜ φ
h) · τ
hdx + Z
Ω
(σ
h− σ
d) · τ
hdx = 0, τ
h∈ L
h, (14) Z
Ω
(˜ σ
h− ∇˜ y
h) · ψ
hdx = 0, ψ ∈ M
h. (15) We note that (15) leads to ˜ σ
h= Q
h∇˜ y
h. Using this and setting p
h= ˜ y
h, we arrive at the problem of finding p
h∈ V
hsuch that for all q
h∈ V
hwe have
Z
Ω
(KQ
h∇p
h) · (Q
h∇q
h) dx = Z
Ω
(y
d− y
h)q
hdx
− Z
Ω
(Q
h∇y
h− σ
d) · (Q
h∇q
h) dx.
Using the same stabilisation approach as for the state variable, we now obtain the equation of finding p
h∈ V
hsuch that
A
h(p
h, q
h) = Z
Ω
(y
d−y
h)q
hdx−
Z
Ω
(Q
h∇y
h−σ
d) ·(Q
h∇q
h) dx, q
h∈ V
h.
3 Variational discretisation: error estimates C104
3 Variational discretisation: error estimates
Now we consider the discrete formulation of the state and adjoint state equa- tions where the control u is not discretized initially but given by a projection formula (18) [12, 13]. The discrete problem is then to find (y
h, p
h, u
h) ∈ V
h× V
h× U
adsuch that
A
h(y
h, z
h) = (Bu
h+ f, z
h), z
h∈ V
h, (16) A
h(p
h, q
h) = (y
d− y
h, q
h) − (Q
h∇y
h− σ
d, Q
h∇q
h), q
h∈ V
h,(17)
u
h= −P
[a,b]B
∗p
hα
. (18)
where the bilinear form A
h(·, ·) satisfies the following continuity and coercivity properties with respect to the H
1-norm for two positive constants α and β independent of h:
(i) A
h(φ
h, ψ
h) 6 αkφ
hk
1,Ωkψ
hk
1,Ω, φ
h, ψ
h∈ V
h. (ii) A
h(φ
h, φ
h) > βkφ
hk
21,Ω, φ
h∈ V
h.
Since Q
his stable in H
1and L
2-norms [10], and B is bounded, the right hand sides of both variational equations (16) and (17) are continuous. Thus both variational equations (16) and (17) have unique solutions by Lax-Milgram lemma. We note that the variational inequality
(αu
h+ B
∗p
h, v − u
h) > 0, v ∈ U
ad, for the control u
his replaced by the projection formula (18).
For a fixed u, let y
h(u) be the solution of
A
h(y
h(u), z
h) = (Bu + f, z
h), z
h∈ V
h, (19) and for a fixed y, let p
h(y) be the solution of
A
h(p
h(y), q
h) = (y
d− y, q
h) − (Q
h∇y − σ
d, Q
h∇q
h), q
h∈ V
h.(20)
3 Variational discretisation: error estimates C105
Then under the assumption that the domain Ω is convex, we have the following approximation results for the solutions of the discrete equations (19) and (20).
Theorem 1. Let y
h(u) and p
h(y) be the solutions of (19) and (20) respec- tively. Then there exists a constant C, independent of h, such that
ky − y
h(u)k
0,Ω+ hky − y
h(u)k
1,Ω6 Ch
2kyk
2,Ω(21) kp − p
h(y)k
0,Ω+ hkp − p
h(y)k
1,Ω6 Ch
2kpk
2,Ω. (22) Setting y
h= y
h(u
h) and p
h= p
h(y
h), we have the following result.
Lemma 2. Let y
h(u) and p
h(y) be the solutions of (19) and (20) respectively.
Then there exists a constant C, independent of h, such that ky
h(u) − y
hk
1,Ω6 Cku − u
hk
0,Ωkp
h(y) − p
hk
1,Ω6 Cky − y
hk
0,Ω.
The proof of this lemma follows from the coercivity of A
h(·, ·) [11, Lemma 3.1].
Theorem 3. Let (y, p, u) ∈ (H
2(Ω)∩H
10(Ω))×(H
2(Ω)∩H
10(Ω))×U
adbe the solutions of (1) and (2) with u ∈ H
1(Ω), and (y
h, p
h, u
h) ∈ V
h× V
h× U
adbe the solutions of (16)–(18). Then for sufficiently small h there exists a mesh-independent constant C such that
ku − u
hk
0,Ω6 Ch, ky − y
hk
0,Ω6 Ch, kp − p
hk
0,Ω6 Ch. (23)
Proof: From the optimality condition, we get αku − u
hk
20,Ω6 (p − p
h, B(u
h− u))
= (p − p
h(y), B(u
h− u)) + (p
h(y) − p
h, B(u
h− u)).
From (19), we have
A
h(y
h− y
h(u), z
h) = (B(u
h− u), z
h).
3 Variational discretisation: error estimates C106
Setting z
h= p
h(y) − p
hwe get from the above equation
A
h(y
h− y
h(u), p
h(y) − p
h) = (B(u
h− u), p
h(y) − p
h).
Thus
αku − u
hk
20,Ω6(p − p
h(y), B(u
h− u)) (24) + A
h(p
h(y) − p
h, y
h− y
h(u)). (25) From (20), we get
A
h(p
h(y) − p
h, q
h) = (y
h− y, q
h) − (Q
h(∇y − ∇y
h), Q
h∇q
h).
We now use q
h= y
h(u) − y
hin the above equation to write A
h(p
h(y) − p
h, y
h(u) − y
h)
= (y
h− y, y
h(u) − y
h) − (Q
h∇(y − y
h), Q
h∇(y
h(u) − y
h))
= (y
h(u) − y, y
h(u) − y
h) − (y
h(u) − y
h, y
h(u) − y
h)
−(Q
h∇(y − y
h(u)), Q
h∇(y
h(u) − y
h)) +(Q
h∇(y
h− y
h(u)), Q
h∇(y
h(u) − y
h))
= (y
h(u) − y, y
h− y
h(u)) − ky
h(u) − y
hk
20,Ω+(Q
h∇(y − y
h(u)), Q
h∇(y
h− y
h(u))) − kQ
h∇(y
h− y
h(u)k
20,Ω6 Cky − y
h(u)k
1,Ωky
h− y
h(u)k
1,Ω6 Chkyk
2,Ωku − u
hk
0,Ω,
where we use the L
2stability of Q
h, Theorem 1 and Lemma 2. Now we use the above estimate for the second term on the right of (24) and use the following estimate for the first term on the right of (24)
|(p − p
h(y), B(u
h− u)) | 6 Ch
2ku − u
hk
0,Ωkpk
2,Ω, to write (24) as
αku − u
hk
20,Ω6 Ch
2kpk
2,Ωku − u
hk
0,Ω+ Chkyk
2,Ωku − u
hk
0,Ω,
3 Variational discretisation: error estimates C107
which yields
ku − u
hk
0,Ω6 C(h
2kpk
2,Ω+ hkyk
2,Ω).
In order to estimate ky − y
hk
0,Ω, we start with
ky − y
hk
0,Ω6 ky − y
h(u)k
0,Ω+ ky
h(u) − y
hk
0,Ω.
For the first term on the right of the above estimate we have ky−y
h(u)k
0,Ω6 Ch
2kyk
2,Ω, and for the second term on the right we use Lemma 2 to write
ky
h(u) − y
hk
0,Ω6 Cky
h(u) − y
hk
1,Ω6 Cku − u
hk
0,Ω.
Now the result follows using the estimate for ku − u
hk
0,Ω. Similarly, to estimate kp − p
hk
0,Ω, we use the triangle inequality and write
kp − p
hk
0,Ω6 kp − p
h(y)k
0,Ω+ kp
h(y) − p
hk
0,Ω.
Since kp − p
h(y)k
0,Ω6 Ch
2kpk
2,Ω, we estimate the second term on the right side of the above estimate by using Lemma 2 to get
kp
h(y) − p
hk
0,Ω6 Ckp
h(y) − p
hk
1,Ω6 Cky − y
hk
0,Ω.
The final result follows on using the estimate for ky − y
hk
0,Ω. ♠ Remark 4. Using the standard saddle point theory [1] we have the following error estimate for the error in the gradient σ = ∇y:
kσ − σ
hk
0,Ω6 Ch,
where σ
h= Q
h∇y
hwith y and y
has defined in the above theorem.
Remark 5. The approach based on a biorthogonal system can be easily
extended to a three-dimensional problem and other elliptic partial differential
equations [7–9].
4 Numerical results C108
Figure 1: L
2errors for the state y and adjoint p (left), H
1errors for the state y and the adjoint p (middle) and L
2errors for the control u and the gradient σ (right) versus the number of elements
4 Numerical results
In this section, we present a numerical example to support the error estimates proved in the last section. We have used the primal dual active set strategy to compute the solution of the discrete formulation [13]. Let Ω = [0, 1]
2with the boundary Γ . We consider the following problem
u∈U