On optimality conditions for optimal control problem in coefficients for Δp-Laplacian

(1)

R E S E A R C H

Open Access

On optimality conditions for optimal control

problem in coefﬁcients for

_p

-Laplacian

Olha P Kupenko

1,2

_{and Rosanna Manzo}

3*

*_{Correspondence: [email protected]} 3_{Dipartimento di Ingegneria}

dell’Informazione, Ingegneria Elettrica e Matematica Applicata, Università degli Studi di Salerno, Via Giovanni Paolo II, 132, Fisciano, 84084, Italy

Full list of author information is available at the end of the article

Abstract

In this paper we study an optimal control problem for a nonlinear monotone Dirichlet problem where the control is taken asL∞(

)-coeﬃcient of

p-Laplacian. Given a cost function, the objective is to derive first-order optimality conditions and provide their substantiation. We propose some ideas and new results concerning the differentiability properties of the Lagrange functional associated with the considered control problem. The obtained adjoint boundary value problem is not coercive and, hence, it may admit infinitely many solutions. That is why we concentrate not only on deriving the adjoint system, but also, following the well-known Hardy-Poincaré Inequality, on a formulation of sufficient conditions which would guarantee the uniqueness of the adjoint state to the optimal pair.

MSC: 35J70; 49J20; 49J45; 93C73

Keywords: nonlinear Dirichlet problem; control in coeﬃcients; optimality conditions

1 Introduction

The aim of this paper is to derive a ﬁrst-order optimality system for a nonlinear Dirichlet optimal control problem where the control is taken as anL∞-coeﬃcient in a nonlinear state equation. The optimal control problem we consider in this paper is to minimize the discrepancyy–ydp

W_,p(), wherep≥,is an open bounded Lipschitz domain inR N

withN≥,yd∈W,p() is a given distribution, andyis the solution of a nonlinear

Dirich-let problem by choosing an appropriate coeﬃcientu∈L∞() ofp-Laplacian. Namely,

we consider the following minimization problem:

Minimize

I(u,y) =

_∇y(x) –∇yd(x) p

dx

(.)

subject to the constraints

u∈Aad⊂L∞()∩BV(), y∈W,p(), (.)

–divu|∇y|p–∇y=f in, (.)

y=  on∂, (.)

whereAadis a class of admissible controls andf∈W–,q(),q=p/(p– ).

Optimal control in coeﬃcients for partial diﬀerential equations is a classical subject ini-tiated by Lurie [, ], Lions [], Zolezzi []. Tartar and Murat [–] showed examples of

(2)

the non-existence for such problems (seee.g.[] also for the historical development). Since the range of OCPs in coefficients is very wide, including as well optimal shape design prob-lems, optimization of certain evolution systems, some problems originating in mechanics and others, this topic has been widely studied by many authors. In particular, it leads to the possibility to optimize material properties what is extremely important for material sci-ences. The crucial point is to give the right interpretation of the optimal coefficients in the context of applications (see, for instance, [, ]). Usually this aspect is closely related with the structural assumptions that have to be considered during the optimization process in terms of constraints. One way of doing so is via proper parametrization of the material, respectively, the coefficients, using mixtures, represented by characteristic functions. This has been pursued by Allaire [] and many other authors in recent years. Another restric-tion can be realized via regularity of the coefficients and hard constraints. This procedure has been followed first by Casas [] for a scalar problem, as one of the first papers in that direction, and later by Haslingeret al.[] in the context of what has come to be known asFree Material Optimization(FMO). However, most of the results and methods rely on linear PDEs, while only very few articles deal with nonlinear problems, see Kogut [] and Kogut and Leugering []. Another point of interest is degeneration in the coefficients which is typically avoided by assuming lower bounds on the coefficients. However, degen-eration occurs genuinely in topology optimization, damage and crack problems. In Kogut and Leugering [–] and in Kupenko and Manzo [] this problem has been considered in the context of linear problems (see also []). The nonlinear case was considered in [– ]. In this article, we extend our results to scalar nonlinear problems, where degeneration occurs already with respect to the states.

Another important point, arising after the solvability of the optimization problem had been proved, is the question as regards optimality conditions. The classical approach to deriving such conditions is based on the Lagrange principle. However, in the case when the control is considered in the coefficients of the main part of the state equation, the classical adjoint system often cannot be directly constructed due to the lack of differential proper-ties of the solution to the boundary value problem with respect to control variables. It was the main reason why Serovajskiy has proposed the concept of the so-called quasi-adjoint system [] and showed that optimality conditions for the linear elliptic control problem in coefficients can be derived, provided the mappingu →ψε(u) possesses the weakened

continuity property. However, the verification of this property is not easy matter even for linear systems. In the case of quasi-linear or nonlinear state equations, we are faced with another problem - the Lagrange functional to the indicated problem is not Gâteaux dif-ferentiable at the origin. To overcome this difficulty, Casas and Fernández introduced the special family of perturbed optimal control problems and derived the optimality condi-tions passing to the limit in optimality condicondi-tions for approximating control problems. In order to apply this approach to optimal control problem (.)-(.) it would suffice to as-sume the following extra conditions:p>N/ andAad⊂C() that look rather restrictive

(3)

formulation of suﬃcient conditions which would guarantee the uniqueness of the adjoint state to the optimal pair.

The paper is organized as follows. In Section  we give some preliminaries and prescribe the class of admissible controls to problem (.)-(.). In Section  we analyze the solvabil-ity properties of optimal control problem (.)-(.), using the monotonicsolvabil-ity of generalized

p-Laplacian (see for comparison [, ]). The aim of Section  is to give a collection of

preliminary results concerning the diﬀerentiability properties of the Lagrange functional associated with problem (.)-(.)

(u,y,λ) =I(u,y) +au(y,λ) –f,λ_W,p

 (),

where

au(y,λ) =

u(x)|∇y|p–(∇y,∇λ)_RNdx

and show that it admits the Gâteaux derivative with respect to the so-called non-degenerate directionsh∈W_,p() at the pointy.

In Section  we discuss the formal approach in deriving first-order optimality conditions for optimal control problem (.)-(.). In order to derive an optimality system, we apply the Lagrange principle. It is well known that the proof of this principle is different for different classes of optimal control problem (see, for instance, [, , , –]). The complexity of this procedure significantly depends on the form of the extremal problem under consideration. The procedure is rather simple if the controllable system is described by a linear well-posed controllable boundary value problem, but it becomes much more complicated if the controllable system is either ill-posed or nonlinear and singular.

With that in mind, we introduce the notion of a quasi-adjoint stateψεto an optimal

solutiony∈W,p() as a solution of the following Dirichlet boundary value problem for

degenerate linear elliptic equation:

–div

uθ|∇yθ|p– I+ (p– ) ∇

yθ

|∇yθ|

⊗ ∇yθ

|∇yθ|

∇ψθ

=pdiv|∇yθ–∇yd|p–(∇yθ–∇yd)

in,ψθ∈W,p(), (.)

where the degeneration occurs in a natural way with respect to the states.

This concept was proposed for linear problems by Serovajskiy [], where it was shown that an optimality system for the optimal control problems in coeﬃcients can be recov-ered in an explicit form if the mappingAadu →ψε(u) possesses the so-called weakened

continuity property. However, it should be stressed that the fulﬁlment of this property is not proved for the case ofp-Laplacian withp>  and, thus, should be considered as

(4)

In particular, following the well-known Hardy-Poincaré Inequality, we show that the sequence of quasi-adjoint states{ψεθ,θ}θ→toy∈W

,p

 () can be deﬁned in a unique way

(as unique solutions for (.)) and is bounded inW_,p() provided, for given distributions f ∈W–,q₍_{) and}_y

d∈W,p() withq= p

p– andp≥, an optimal solutiony=y(u) to

the nonlinear Dirichlet boundary value problem (.)-(.) satisﬁes the properties

∇ln|∇y| ∈L

;RN, ∇|∇y|

∈Lp;RN and |∇yd|

|∇y|∈

L∞(),

–C()≤V(x)≤λ L

L

i=



|x–xi|

a.e. in,

where

V(x) = ( –p)divA(u,y)∇ln|∇y|

–(p– )



∇ln|∇y|,A(u,y)∇ln|∇y|

RN,

A(u,y) =u I+ (p– ) ∇

y |∇y|⊗

∇y |∇y|

,

for some positive constantsC() > ,λ<λ∗:= (N– )_{/, and some collection of points} {x,x, . . . ,xL} ⊂.

Note that the fulﬁlment of this assumption is feasible if the matrix|∇y|p–_A₍_u_,_y_{) has}

a non-degenerate spectrum for eachU ∈Uad(for the details, we refer to []). The main

argument given in Sections - is to look for the solutionsψθ∈W_,p() of the

quasi-adjoint problem (.) in the form ψθ=|∇yθ|(–p)/zθ, wherezθ∈H(). As a result, we

show that each of the variational problems for the corresponding quasi-adjoint states has a unique solution, and these solutions form a weakly convergent sequence {ψθ}θ→ in

W_,p(). This property suﬃces in order to establish that the optimality system for problem (.)-(.) remains valid even if the matrix|∇y|p–A(u,y) has a degenerate spectrum.

2 Notation and preliminaries

Throughout the paper is a bounded open subset ofRN,N≥. The spaceD() of distributions in is the dual of the spaceC_∞(). For real numbers ≤p< +∞, and  <q< +∞such that /p+ /q= , the spaceW_,p() is the closure ofC_∞() in the Sobolev spaceW,p(), whileW–,q() is the space of distributions of the formf=f+

jDjfj, with

f,f, . . . ,fn∈Lq() (i.e. W–,q() is the dual space ofW,p()). As a norm in the space

W_,p() we can take the following one:

y_W,p

 ()=

|∇y|p_RN

/p

.

LetχE be the characteristic function of a setE⊂RN and let|E|be itsN-dimensional

Lebesgue measure.

For any vector ﬁeldv∈Lq(;RN), the divergence is an element of the spaceW–,q() deﬁned by the formula

divv,ϕ

W–,q₍_);W,p

 ()= –

(5)

where·,·_W–,q₍_);W,p

 ()denotes the duality pairing betweenW

–,q₍_{) and}_W,p

 (), and

(·,·)_RN denotes the scalar product of two vectors inRN.

Functions with bounded variations. Letf :→Rbe a function ofL(). Deﬁne

TV(f) :=

|Df|=sup

f(∇,ϕ)_RNdx:ϕ∈C_

;RN_,_ϕ₍_x₎_≤_{ for}_x_∈

,

where (∇,ϕ)_RN =N_i=∂ϕ_∂ i

xi.

According to the Radon-Nikodym Theorem, if TV(f) < +∞then the distributionDf is a measure and there exist a vector-valued function∇f ∈L₍_;_RN_{) and a measure}_D

sf,

singular with respect to theN-dimensional Lebesgue measureLN_{restricted to}_{, such}

thatDf =∇fLN+Dsf.

Deﬁnition . A functionf ∈L₍_{) is said to have a bounded variation in}_if_TV₍_f_{) <}

+∞. ByBV() we denote the space of all functions inL₍_{) with bounded variation,}_i.e.

BV() ={f ∈L₍_{) :}_TV₍_f_{) < +}_∞}_.

Under the normfBV()=fL₍₎+TV(f),BV() is a Banach space. For our further

analysis, we need the following properties ofBV-functions (see []).

Proposition .

(i) Let{fk}∞k=be a sequence inBV()strongly converging to somef inL()and satisfying conditionsup_k_∈_NTV(fk) < +∞.Then

f ∈BV() and TV(f)≤lim inf

k→∞ TV(fk);

(ii) for everyf∈BV()∩Lr₍₎_,_r_∈_{[, +}_∞₎_,_{there exists a sequence}_{_f

k}∞_k=⊂C∞() such that

lim

k→∞

|f –fk|rdx=  and lim

k→∞TV(fk) =TV(f);

(iii) for every bounded sequence{fk}∞k=⊂BV()there exist a subsequence,still denoted byfk,and a functionf ∈BV()such thatfk→f inL().

Admissible Controls and Generalized p-Laplacian.Letα,β,γ, andmbe given positive constants such that  <α≤β< +∞andα|| ≤m≤β||. We deﬁne the class of admis-sible controlsAadas follows:

Aad=

u∈BV()∩L∞()|TV(u)≤γ,u_L₍₎=m,α≤u(x)≤βa.e. in

. (.)

It is clear thatAadis a nonempty convex subset ofL() with empty topological interior.

We say that a nonlinear operator p:Aad×W,p()→W–,q() is the generalized

p-Laplacian if it has a representation

p(u,y) = –div

u(x)|∇y|p–∇y, where|∇y|p–:=|∇y|p–_RN =

_N

i= _∂∂_xy_j

p– 

(6)

or via the pairing

p(u,y),v

W–,q₍_);W,p

 ()=

u(x)|∇y|p–(∇y,∇v)_RNdx, ∀v∈W_,p().

It is easy to see that for every admissible controlu∈Aad, the operatorp(u,·) turns out

to be coercive,i.e.

p(u,y),y

W–,q₍_);W,p

 ()≥αy p W_,p(),

and demi-continuous, where by the demi-continuity property we mean the fulﬁlment of the implication:yk→ystrongly inW,p() implies thatp(u,yk) p(u,y) weakly in

W–,q₍_{) (see [, ]). Moreover,}_p_-Laplacian

p(u,y) is a strictly monotone operator for

eachu∈Aad. Indeed, having applied the trick

|∇y|p–|∇v|u(x)dx≤

|∇y|p–up–p p p–_dx

(p–)/p

|∇v|upp_dx

/p

=

|∇y|pu dx

(p–)/p

|∇v|pu dx

/p

=:yp–

W_,p(;u dx)vW,p(;u dx),

it is easy to check the validity of the following estimate:

p(u,y) –p(u,v),y–v

W–,q₍_);W,p

 ()

=

u(x)|∇y|p–∇y–|∇v|p–∇v,∇y–∇v_RNdx (by the Cauchy Inequality)

≥

|∇y|p–|∇y|–|∇v||∇y|u(x)dx

–

|∇v|p–|∇v||∇y|–|∇v|u(x)dx (by the Hölder Inequality)

≥ yp

W,p(;u dx)

+vp

W,p(;u dx)

–yp–

W_,p(;u dx)vW,p(;u dx)–v p–

W_,p(;u dx)yW,p(;u dx)

=yp–

W_,p(;u dx)–v p– W_,p(;u dx)

y_W,p

 (;u dx)–vW ,p

 (;u dx)

≥–py_W,p

 (;u dx)–vW ,p

 (;u dx) p

, ∀y,v∈W_,p().

As a result, sinceβ–yp

W_,p(;u dx)≤ y p

W_,p()≤α –_yp

W_,p(;u dx), it follows that

p(u,y) –p(u,v),y–v

W–,q₍_);W,p

 ()>  for ally,v∈W ,p

 (),y=v.

(7)

ev-eryu∈Aadandf ∈W–,q(), the nonlinear Dirichlet boundary value problem

p(u,y) =f in,y∈W,p(), (.)

admits a unique weak solution inW_,p(). Let us recall that a functionyis the weak solu-tion of (.) if

y∈W_,p(), (.)

u(x)|∇y|p–(∇y,∇v)_RNdx=f,v

W–,q₍_);W,p

 (), ∀v∈W ,p

 (). (.)

3 Setting of the optimal control problem

We consider the following optimal control problem:

Minimize

I(u,y) =

_∇y(x) –∇yd(x) p

dx

, (.)

subject to the constraints

u(x)|∇y|p–(∇y,∇v)_RNdx=f,v

W–,q₍_);W,p

 (), ∀v∈W ,p

 (), (.)

u∈Aad⊂L∞(), y∈W,p(), (.)

wheref ∈W–,q₍_{) and}_y

d∈W,p() are given distributions.

Hereinafter,⊂L∞()×W,p() denotes the set of all admissible pairs (u,y) to optimal

control problem (.)-(.).

Remark . As was mentioned in the previous section, the characteristic feature of

op-timal control problem (.)-(.) is the fact that the set of admissible controls Aad is a

convex set with an empty topological interior. As we will see later on, this circumstance entails some technical diﬃculties in the substantiation of optimality conditions for the given problem.

Letτ be the topology on the setL₍₎_×_W,p

 () which we deﬁne as a product of the

strong topology ofL₍_{) and the weak topology of}_W,p

 (). Further we make use of the

following results, which play a key role for the solvability of optimal control problem (.)-(.) (see [, ] and [, ] for comparison).

Proposition . For any u∈Aad and f ∈W–,q(),a weak solution y∈W,p()to the

variational problem(.)-(.)satisﬁes the estimate

y_W,p

 ()≤α

–/(p–)_f/(p–)

W–,q₍₎. (.)

(8)

and the Cauchy-Bunjakowsky Inequality on the right-hand side. Indeed, we get

αyp

W_,p()≤

u(x)|∇y|p–(∇y,∇y)_RNdx

=f,y_W–,q₍_);W,p

 ()≤ fW

–,q₍₎y

W_,p().

Now to get the desired estimate we divide each part of the obtain relation intoy_W,p

 ()

and raise each side to the power /(p– ).

Proposition . If{uk}k∈N ⊂Aad and uk→u in L(), then uk→u in Lr()for any

r∈[, +∞)and uk

∗

→u in L∞().

Proof Sinceuk→uinL() and

ukdx=m, TV(uk)≤γ, andα≤uk≤βa.e. in,∀k∈N,

by Proposition .(i) it follows that

TV(u)≤γ,

u dx=m, andα≤u≤βa.e. in.

Hence,u∈Aad. Moreover, for anyr∈[, +∞), the estimate

uk–urLr₍₎≤vrai sup

x∈

uk(x) –u(x) r–

uk–uL₍₎≤(β–α)r–u_k–u_L₍₎

implies thatuk→uinLr().

To end the proof, it is enough to note that strong convergenceuk→uinL() implies,

up to a subsequence, convergenceuk(x)→u(x) almost everywhere in. Hence, by the

Lebesgue Theorem, we have

(uk–u)ϕdx→, ∀ϕ∈L(),

that isuk

∗

→uinL∞(). Since this conclusion is true for any weakly-∗convergent sub-sequence of{uk}k∈N, it follows thatuis the weak-∗limit for the whole sequence{uk}k∈N.

Proposition . Aadis a sequentially compact subset of Lr()for any r∈[, +∞),and it

is a sequentially weakly-∗compact subset of L∞().

Proof Let{uk}k∈Nbe any sequence ofAad. Then{uk}k∈N is bounded inBV()∩L∞(). As a result, the statement immediately follows from Propositions . and .(iii).

Proposition . For every f ∈W–,q₍₎_{the set} _{is sequentially compact}_,_{i.e. for each}

sequence {(uk,yk)∈}k∈N it can be found a subsequence{(ukn,ykn)∈}n∈N such that

ukn→uin L(),ykn→yin W,p(),where(u,y)∈,that is,yis a weak solution to

(9)

Proof Let{(uk,yk)}k∈N⊂be an arbitrary sequence of admissible pairs. Then

Proposi-tion . and a priori estimate (.) lead to the existence of a subsequence, still denoted by

{(uk,yk)}k∈Nand a pair (u,y)∈Aad×W,p() such that

yk→yweakly inW,p(), (.)

uk→ustrongly inLr(),∀r∈[, +∞) and weakly-∗ inL∞(). (.)

Taking into account the inequality (see [])

|ξ|p–ξ–|η|p–η,ξ–η_RN ≥–p|ξ–η|p ∀ξ,η∈RN,

and deﬁnition of the class of admissible controlsAad, we conclude: there exists a constant

C>  independent ofk∈Nsuch that

C

|∇yk–∇y|pdx≤

uk

|∇yk|p–∇yk–|∇y|p–∇y,∇yk–∇y

RNdx

=f,yk–y_W–,q₍_);W,p

 ()

+

(u–uk)

|∇y|p–∇y,∇yk–∇y

RNdx

–

u|∇y|p–∇y,∇yk–∇y

RNdx=I+I–I. (.)

Sinceu≤β,u|∇y|p–_∇_y_∈_Lq₍_;_RN_{), and}_∇_y

k∇yinLp(;RN), it follows that

I:=f,yk–y_W–,q₍_);W,p

 () by (.)

−→  ask→ ∞,

I:=

u|∇y|p–∇y,∇yk–∇y

RNdx

by (.)

−→  ask→ ∞,

(u–uk)|∇y|p–∇y by (.)

−→  strongly inLq;RN, and, therefore,

I:=

(u–uk)

|∇y|p–∇y,∇yk–∇y

RNdx

by (.)

−→  ask→ ∞.

Hence, passing to the limit in (.) ask→ ∞, we arrive at a conclusion (see []):

yk→ystrongly inW,p(), and|∇yk|p–∇yk|∇y|p–∇yinLq

;RN.

As a result, we ﬁnally have

f,v_W–,q₍_);W,p

 ()=klim→∞

uk(x)|∇yk|p–(∇yk,∇v)RNdx

=

u(x)|∇y|p–(∇y,∇v)_RNdx, ∀v∈W_,p(),

that is, the limit pair (u,y) is an admissible to optimal control problem (.)-(.). The

(10)

Taking into account Propositions .-., in a similar manner to [, ], it is easy to conclude the following existence result.

Theorem . The optimal control problem(.)-(.)admits at least one solution

uopt,yopt∈⊂BV()∩L∞()×W_,p(),

Iuopt,yopt= inf

(u,y)∈I(u,y).

4 Auxiliary results

The main goal of this paper is to derive the optimality conditions for optimal control prob-lem (.)-(.). However, we deal with the case when we cannot apply the well-known classical approach (see, for instance, [, ]), since for a given distributionf ∈W–,q() the mappingu →y(u) is not Fréchet diﬀerentiable on the class of admissible controls, in general, and the classAadhas an empty topological interior. With that in mind, we apply

the so-called diﬀerentiation concept on convex sets and introduce the notion of a quasi-adjoint stateψεto an optimal solutiony∈W,p() that was proposed for linear problems

by Serovajskiy [].

To begin with, we discuss the diﬀerentiable properties of the Lagrange functional as-sociated with problem (.)-(.). Since (.) can be seen as a constraint, we deﬁne the Lagrangian as follows:

(u,y,μ) =I(u,y) +au(y,μ) –f,μ_W–,q₍_);W,p

 (), (.)

whereμ∈W_,p() is a Lagrange multiplier and

au(y,μ) =

–p(u,y),μ

W–,q();W_,p()=

u(x)|∇y|p–∇y,∇μ_RNdx.

For givenh∈W_,p() andθ∈[, ], let us consider the following sets:

=

x∈:|∇y|> , ,θ=

x∈:|∇y+θ∇h|> .

Clearly, we cannot claim thatχ,θ→χinLr() for some ≤r<∞, because

conver-gence of the sequence{∇y+θ∇h}θ→+to∇ydoes not imply, in general, theχ-convergence

of subsets{,θ}θ→+toasθ →+ [, p.]. Indeed, leth∈W,p() be such that |∇h(x)|>  almost everywhere in and∇y= . Then=∅whereas,θ=for all

positiveθsmall enough. Hence, in this case the convergenceχ,θ →χfails. In view of

this, we make use of the following notion (see []).

Deﬁnition . We say that an elementy∈W_,p() is a regular point for the Lagrangian (.) if for eachv∈W_,p() the directionh=v–yis non-degenerate in the following sense:

χ,θ→χ inL

₍_). _(.)

(11)

Proposition . Assume that y is an element of W_,p()such that the set

S=

x∈:∇y(x)=  (.)

has zero Lebesgue measure.Then y∈W_,p()is a regular point of the Lagrangian(.)in the sense of Deﬁnition..

Proof Lethbe a given element ofW_,p(). Ifx∈, then, by deﬁnition,|∇y(x)|> . Thus,

there is a valueθ∈(, ] such that|∇y(x) +θ∇h(x)|>  for allθ∈[,θ]. It is worth to note

that this pointwise inequality makes a sense if onlyx∈is a Lebesgue point of both∇y and∇h. However, the Lebesgue Diﬀerentiation Theorem states that, given anyg∈L₍_),

almost everyx∈is a Lebesgue point. Hence, almost all Lebesgue points of∇yare the Lebesgue points of∇y+θ∇hforθsmall enough, andχ,θ(x) =χ(x) =  for allθ∈[,θ].

Since the setShas zero Lebesgue measure, it follows that|\|=  and|\,θ|= 

forθ ∈[, ] small enough. Therefore,χ,θ →χ almost everywhere inand hence, χ,θ→χstrongly inL

₍_).

Remark . It is worth noting that due to the results of Manfredi (see []), the

assump-tions of Proposition . appears natural and it is not a restrictive supposition in practice. Indeed, following [], we can ensure that the setS:={x∈:∇y= }for non-constant

solutions of thep-Laplace equation (ap-harmonic function) has zero Lebesgue measure. Moreover, it is also easy to observe that ifyandvinW_,p() are two regular points of the functional(u,y,λ), then there exists a positive numberα∈R(α= ) such that each point of the segment [y,αv] ={y+t(αv–y) :∀t∈[, ]} ⊂W_,p() is also regular for(u,y,λ).

We are now ready to study the diﬀerentiability properties of the Lagrangian(u,y,λ). We begin with the following result.

Lemma . Let u∈Aadbe a given element,and let y∈W,p()be a regular point of the

Lagrangian(.).Then the mapping

W_,p()y →p(u,y) = –div

u(x)|∇y|p–∇y∈W–,q(), p≥

is Gâteaux diﬀerentiable at y and its Gâteaux derivative

–p(u,y)

G∈L

W_,p(),W–,q()

exists and takes the form

–p(u,y)

G[h] = –div

u(x)|∇y|p–∇h

– (p– )divu(x)|∇y|p–(∇y,∇h)_RN∇y

. (.)

(12)

deduce the following equality:

lim

λ→+

p(u,y+λh) –p(u,y)

λ – (p– )div

u(x)|∇y|p–(∇y,∇h)_RN∇y

–divu|∇y|p–∇h

Lq₍_;RN₎ = .

With that in mind, let us consider the vector-valued function

g(λ) :=|∇y+λ∇h|p–(∇y+λ∇h)

for which the Taylor expansion with the remainder term in the Lagrange form leads to the relation

g(λ) –g()≤λg(θ), θ∈(,λ),

whereg() =|∇y|p–∇yand

g(θ) =|∇y+θ∇h|p–∇h

+ (p– )|∇y+θ∇h|p–θ|∇h|+ (∇y,∇h)_RN

(∇y+θ∇h) 

|∇y+θ∇h|

=|∇y+θ∇h|p–∇h+ (p– )|∇y+θ∇h|p–(∇y+θ∇h,∇h)RN

|∇y+θ∇h|

∇y+θ∇h

|∇y+θ∇h|.

Letδ>  be an arbitrary value. Let us consider the following decomposition:

=S∪δ∪δ,

where the setSis deﬁned by (.), andδandδare measurable subsets ofsuch that

_δ=x∈:∇y(x)≥δ, _δ=x∈:  <∇y(x)<δ.

Following [, p.] (see also []), we have: for anyε>  there exists a positive value

δ>  such that

g(θ)

λ – (p– )|∇y|

p–₍_∇_y_,_∇_h₎

RN∇y–|∇y|p–∇h

Lq₍ δ;RN)

<ε

, (.)

g(θ)

λ – (p– )|∇y|

p–₍_∇_y_,_∇_h₎

RN∇y–|∇y|p–∇h

Lq₍ δ;RN)

<ε

 (.)

for allδ∈(,δ),θ∈(,λ), andλ>  small enough. Since|S|=  by the initial

assump-tions, it follows from (.)-(.) that the vector-valued function|∇y|p–_∇_y_{is Gâteaux}

dif-ferentiable. Hence, the operatorp(u,y) = –div(u(x)|∇y|p–∇y) is Gâteaux diﬀerentiable

for any regular pointy∈W_,p() and for any admissible controlu∈Aad, and its Gâteaux

(13)

Since Gâteaux diﬀerentiability of the operator y →p(u,y) implies the existence of

Gâteaux derivative for the functional

ϕ(y) =–p(u,y),μ

W–,q₍_);W,p

 ()=

u(x)|∇y|p–∇y,∇μ_RNdx,

ϕ:W_,p()→R

such that

ϕ_G(y),h_W–,q₍_);W,p

 ()=

–p(u,y)

G[h],μ

W–,q₍_);W,p

 (), ∀μ∈W ,p  (),

we arrive at the following obvious consequence of Lemma ..

Corollary . Let u∈Aad andλ∈W,p()be given elements,and let y∈W ,p

 ()be a

regular point of the Lagrangian(.).Then the mapping

W_,p()y →(u,y,μ) =I(u,y) +au(y,μ) –f,μ_W–,q₍_);W,p

 ()∈R

is Gâteaux diﬀerentiable at y and its Gâteaux derivative,

_G(u,y,μ)∈W–,q(),

exists and takes the form

_G(u,y,μ),h_W–,q₍_);W,p

 ()

=p

|∇y–∇yd|p–(∇y–∇yd,∇h)RNdx+

u(x)|∇y|p–∇μ,∇h_RNdx

+ (p– )

u(x)|∇y|p–(∇y,∇μ)_RN(∇y,∇h)_RNdx. (.)

Remark . In view of the equality

(∇y,∇μ)_RN∇y= [∇y⊗ ∇y]∇μ,

the last term in (.) can be rewritten as follows:

(p– )

u(x)|∇y|p–[∇y⊗ ∇y]∇μ,∇h_RNdx.

Before deriving the optimality conditions, we need the following auxiliary result.

Lemma . Let u∈Aad,y∈W,p(), and v∈W ,p

 ()be given distributions.Assume

that each point of the segment[y,v] ={y+α(v–y) :∀α∈[, ]} ⊂W,p()is regular for the

mapping v→(u,v,μ).Then there exists a positive valueε∈(, )such that

(u,v,μ) –(u,y,μ)

=_G(u,y+εh,μ),h_W–,q₍_);W,p

(14)

=p

|∇y+ε∇h–∇yd|p–(∇y+ε∇h–∇yd,∇h)RNdx

+

u(x)|∇y+ε∇h|p–(∇μ,∇h)_RNdx

+ (p– )

u(x)|∇y+ε∇h|p–

×(∇y+ε∇h)⊗(∇y+ε∇h)∇μ,∇h_RNdx (.)

with h=v–y.

Proof For givenu,μ,yd,y, andv, let us consider the scalar functionϕ(t) =(u,y+t(v–

y),μ). Since by Corollary ., the functional(u,·,μ) is Gâteaux diﬀerentiable at each point of the segment [y,v], it follows that the functionϕ=ϕ(t) is diﬀerentiable on (, ) and

ϕ(t) =_Gu,y+t(v–y),λ,v–y_W–,q₍_);W,p

 (), ∀t∈(, ).

To conclude the proof, it remains to take into account (.) and apply the Mean Value Theorem:

ϕ() –ϕ() =ϕ(ε) for someε∈(, ).

5 Formalism of the quasi-adjoint technique

We begin with the following assumption:

(H) The distributionf ∈W–,q₍₎_{is such that, for each admissible control}_u_∈_A ad, the corresponding weak solutiony=y(u)of the nonlinear Dirichlet boundary value problem(.)is a regular point of the mappingy →(u,y,λ).

Let (u,y)∈be an optimal pair for problem (.)-(.). Then

=(u,y,λ) –(u,y,λ)≥, ∀(u,y)∈,∀λ∈W,p(). (.)

Hence, splitting it in two terms, we obtain

(u,y,λ) –(u,y,λ) =(u,y,λ) –(u,y,λ) +(u,y,λ) –(u,y,λ)

=y(u,y,λ) +(u–u,y,λ)≥, (.)

for allλ∈W_,p() andu∈Aadsuch that (u–u)∈Aad.

Since the set of admissible controlsAad⊂L() has an empty topological interior, we

justify the choice of perturbation for an optimal control as follows:

uθ:=u+θ(u–u),

where (u,y)∈is an arbitrary admissible pair, andθ∈[, ]. Then due to Hypothesis (H) and Remark ., we can suppose that each point of the segment [y,yθ]⊂W,p() is

regu-lar for the mappingv→(u,v,λ), whereyθ:=y(uθ) =y(u+θ(u–u)) is the

(15)

valueεθ∈(, ) such that condition (.) can be represented as follows:

=(uθ,yθ,λ) –(u,y,λ)

=_Guθ,y+εθ(yθ–y),λ

,yθ–y

W–,q₍_);W,p

 ()+

(uθ–u,y,λ)

=_Guθ,y+εθ(yθ–y),λ

,yθ–y

W–,q₍_);W,p

 ()

+θ(u–u),y,λ

≥. (.)

Using (.), we obtain

=p

|∇yεθ,θ–∇yd|

p–₍_∇_y

εθ,θ–∇yd,∇yθ–∇y)RNdx +

uθ|∇yεθ,θ|

p–₍_∇_λ_,_∇_y

θ–∇y)RNdx + (p– )

uθ|∇yεθ,θ|

p–_[_∇_y

εθ,θ⊗ ∇yεθ,θ]∇λ,∇yθ–∇y

RNdx

+θ

(u–u)

|∇y|p–∇y,∇λ

RNdx≥, ∀u∈Aad, (.)

whereyεθ,θ=y+εθ(yθ–y).

Now we introduce the concept of quasi-adjoint states that was ﬁrst considered for linear problems by Serovajskiy [].

Deﬁnition . We say that, for givenθ ∈[, ] andu∈Aad, a distribution ψεθ,θ is the quasi-adjoint state toy∈W,p() ifψεθ,θsatisﬁes the following integral identity:

uθ|∇yεθ,θ|

p– _I_{+ (}_p_{– )} ∇yεθ,θ

|∇yεθ,θ|

⊗ ∇yεθ,θ

|∇yεθ,θ|

∇ψεθ,θ,∇ϕ

RN dx

+p

p–₍_∇_y

εθ,θ–∇yd,∇ϕ)RNdx= , ∀ϕ∈W

,p

 () (.)

or in the operator form

–div

uθ|∇yεθ,θ|

p– _I_{+ (}_p_{– )} ∇yεθ,θ

|∇yεθ,θ|

⊗ ∇yεθ,θ

|∇yεθ,θ|

∇ψεθ,θ

=pdiv|∇yεθ,θ–∇yd|

p–₍_∇_y

εθ,θ–∇yd)

inD(). (.)

Here,I∈L(RN;RN) is the identity matrix,yθ:=y(uθ) =y(u+θ(u–u)) is the solution

of problem (.)-(.),yεθ,θ=y+εθ(yθ–y), andεθ=ε(uθ)∈(, ) is a constant coming from equality (.).

Let us assume, for a moment, that quasi-adjoint stateψεθ,θtoy∈W

,p

 () is a

distribu-tion ofW_,p(). Then we can deﬁne the elementλin (.) as the quasi-adjoint state, that is, we can setλ=ψεθ,θ. As a result, the increment of Lagrangian (.) can be simpliﬁed to the form

(u–u)

|∇y|p–∇y,∇ψεθ,θ

(16)

Thus, in order to derive the necessary optimality conditions and provide their substan-tial analysis, it remains to pass to the limit in (.)-(.) asθ→+, and to show that the sequence of quasi-adjoint states{ψε,θ}θ→is deﬁned in a unique way through relation (.)

and it is compact with respect to the weak topology ofW_,p().

6 The Hardy Inequality and asymptotic behavior of quasi-adjoint states

The main goal of this section is to study the well-posedness of variational problem (.) and describe the asymptotic behavior of its solutions as parameterθtends to zero.

We begin with the following evident consequence of Proposition ..

Lemma . Assume that uk,u∈Aad and uk→u strongly in L∞().Then,for the

cor-responding solutions of boundary value problem (.)-(.),we have strong convergence yk=y(uk)→y=y(u)in W,p().

Since, by the initial suppositions,uθ→uinL∞() asθ→, we immediately arrive at

the following consequence of Lemma ..

Corollary .

(i) yθ→yinW,p()asθ→; (ii) yεθ,θ→yinW

,p

 ()asθ→.

Further, we note that

(A) ifr∈(,p], wherep≥, andf,g∈Lp(), then

f(x)r–g(x)r≤rf(x)+g(x)r–f(x) –g(x), a.e. in, (.)

and, hence, by the Hölder Inequality,

fr_Lr₍₎–gr_Lr₍₎=

|f|rdx–

|g|rdx≤

_|f|r–|g|rdx

≤ f –gLp₍₎

|f|+|g|(r–)p/(p–)dx

(p–)/p

≤ f –gLp₍₎|f|+|g|r–

Lp()||

(p–r)/p_. _(.)

(A) Ifr∈[, ]andf,g∈Lp()then

f(x)r–g(x)r≤f(x) –g(x)r, a.e. in, (.)

and, therefore,

fr_Lr₍₎–gr_Lr₍₎≤

|f –g|rdx≤ f–gr_Lp₍₎||(p–r)/p. (.)

Then, in view of Corollary . and estimates (.) and (.), we can give the following obvious conclusion.

(17)

Our next intention is to study the variational problem (.). With that in mind, we rewrite it in the form

–div(ρθAθ∇ψεθ,θ) =fθ in, (.)

where

ρθ:=|∇yεθ,θ|

p–_, _(.)

Aθ:=uθ[I+Cθ] =uθ I+ (p– )

∇yεθ,θ

|∇yεθ,θ|

⊗ ∇yεθ,θ

|∇yεθ,θ|

, (.)

fθ:=pdiv

p–₍_∇_y

εθ,θ–∇yd)

. (.)

To begin with, we make use of the following observation.

Proposition . Let(u,y)∈be an optimal pair for problem(.)-(.).Then,for any

θ∈[, ]andu∈Aad,we have:

() Aθ∈L∞(;SN_sym),whereSN_symdenotes the set of allN×Nsymmetric matrices,which

are obviously determined byN(N+ )/scalars.

() α|ξ|≤(ξ,Aθξ)_RN ≤β[ + (p– )N–]|ξ|for allξ∈RN.

Proof The ﬁrst property is obvious. To prove the second one, it enough to take into ac-count the deﬁnition of the class of admissible controlsAadand the following chain of

es-timates:

α|ξ|≤(ξ,uθIξ)RN ≤(ξ,uθIξ)RN + (ξ,uθCθξ)RN = (ξ,uθIξ)_RN + (p– )uθ

_∇

yεθ,θ

|∇yεθ,θ| ,ξ



RN

≤β|ξ|+ (p– )β(ξ+· · ·+ξN)≤β

 + (p– )N–|ξ|.

Proposition . (a) ρθ:=|∇yεθ,θ|

p–_{→ |∇}_y

|p–=:ρinL()asθ→; (b) Aθ→AinLr(;SNsym)for anyr∈[, +∞)andAθ

∗

→AinL∞(;SNsym)asθ→; (c) ρθAθ→ρA=u[I+ (p– )_|∇∇y_y_|⊗_|∇∇y_y_|]inL(;RN)asθ→.

Proof Validity of assertions (a)-(b) immediately follows from Corollary . and Proposi-tions . and .. The strong convergence property in (c) is a direct consequence of the

Lebesgue Convergence Theorem.

The following results are crucial for our further analysis.

Lemma . Assume that,for givenθ∈[, ]andu∈Aad,∇ln|∇yεθ,θ| ∈L

₍_;_RN_)._Then

each elementψof W_,p()can be represented in a unique way as follows:

ψ=|∇yεθ,θ|

(–p)/_z

(18)

Proof Let us ﬁx an elementψ∈W_,p(). Then forzθ:=|∇yεθ,θ|(p–)/ψ, by the Hölder Inequality withp=p/ andq=p/(p– ), we have

zθ_L₍₎=

|∇yεθ,θ|

p–_ψ_dx_{≤ ∇}_y

εθ,θ

p–

Lp₍_;RN₎ψLp() ≤Cyεθ,θ

p–

W,p() ψ

W_,p()<∞,

where the constantCcomes from the Poincaré Inequality. Using the evident equality

∇zθ=

 

√

ρψ∇lnρ+√ρ∇ψ

ρ=|∇y_εθ,θ|p–

,

and applying the Hölder Inequality with exponentsp= (p– )/(p– ) andq= (p– )/p for the ﬁrst term and withp=p/ andq=p/(p– ) for the second one, we get

∇zθL₍_;RN₎≤ p– 



|∇yεθ,θ|

(p–)/_|_ψ_|_∇_ln_|∇_y

εθ,θ|dx +||/

|∇yεθ,θ|

p–_|∇_ψ_|_dx /

≤p– 

 ψLp()

|∇yεθ,θ|

(p–)p

(p–)∇_ln|∇_y

εθ,θ| p p–_dx

(p–)/p

+||/∇yεθ,θ

(p–)/

Lp₍_;RN₎∇ψLp₍_;RN₎

≤p– 

 ψLp()∇yεθ,θ

(p–)/

Lp₍_;RN₎∇ln|∇yεθ,θ|L₍_;RN₎ +||/yεθ,θ

(p–)/

W_,p()ψW,p() ≤

p– 

 C∇ln|∇yεθ,θ|L₍_;RN₎+||/

yεθ,θ

(p–)/

W_,p()ψW,p()

<∞.

Thus,zθ∈W,()∩L(). Since the elementzθ:=|∇yεθ,θ|

(p–)/_ψ_{inherits the trace}

prop-erties along∂from its parent elementψ, we ﬁnally obtainzθ∈W_,()∩L(). The

proof is complete.

As an obvious consequence of this result and continuity of the embedding of Sobolev spacesH

()→W ,

 (), we can give the following conclusion.

Corollary . If,for givenθ∈[, ]andu∈Aad,∇ln|∇yεθ,θ| ∈L

₍_;_RN_),_{then there exists}

a dense subset D(yεθ,θ)of H



()such that

|∇yεθ,θ|

(–p)/_z_∈_W,p

 (), ∀z∈D(yεθ,θ). (.)

Taking this fact into account, it is plausible to introduce the following linear mapping:

F:D(yεθ,θ)⊂H



()→W ,p

 (), whereFz=|∇yεθ,θ|

(19)

Since domainD(yεθ,θ) ofFis dense in Banach spaceH(), it follows that forF, as for a

densely deﬁned operator, there exists an adjoint operator

F∗:DF∗⊂W–,q()→H–()

such that

F∗v,z_H–₍_),H

()=v,FzW–,q(),W ,p

 (), ∀z∈D(yεθ,θ) and∀v∈D

F∗,

where

DF∗=v∈W–,q₍₎_|_{there exists}_C_{>  such that for all}_z_∈_D₍_y

εθ,θ)

v,Fz_W–,q₍_),W,p

 ()

_≤Cz_H ()

.

Notice that, in general, the adjoint operatorF∗is not densely deﬁned. Let us consider the following linear operator:

Aθψ:= –div(ρθAθ∇ψ), ∀ψ∈W,p(),

whereρθandAθare deﬁned by (.)-(.). By Proposition ., we have

ρθAθ∇ψq_Lq₍_;RN₎≤ Aθq_L∞₍_;SN

sym)

|∇yεθ,θ|

(p–)p/(p–)_|∇_ψ_|p/(p–)_dx

p=p–_p– q=p– 

≤ Aθq

L∞(;SN_sym)yε_θ,θp(p–)/(p–)

W,p()

ψp/(p–) W,p()

.

Hence,Aθψ:W_,p()→W–,q() and this operator is obviously monotone,

Aθ(ψ–φ),ψ–φ

W–,q₍_);W,p

 ()

=

|∇yεθ,θ|

p–_A

θ(∇ψ–∇φ),∇ψ–∇φ

RN ≥,

and demi-continuous. However, because of the multiplierρθ=|∇yεθ,θ|

p–_{, this operator}

can lose the coercivity property.

Letλbe a positive constant such thatλ<λ∗:= (N– )_{/. Let}_{_x

,x, . . . ,xL} ⊂be a

given collection of points. Letu∈Aadbe a given control. We deﬁne a nonempty subset

M()u⊂W,p() as follows:y∈Mu() if and only if

–C()≤Vu,y(x)≤

λ

L

i=



|x–xi|

a.e. in, (.)

for some positive constantC() > , where

Vu,y(x) = ( –p)div

A(u,y)∇ln|∇y|–(p– )



∇ln|∇y|,A(u,y)∇ln|∇y|_RN, (.)

A(u,y) =u I+ (p– ) ∇y

|∇y|⊗

∇y

|∇y|

(20)

We are now in a position to give an important property of the operatorAθψ:W_,p()→

W–,q₍_).

Lemma . Assume that, for givenu∈Aad, θ ∈[, ],andεθ ∈(, ),the distribution

yεθ,θ∈W

,p

 ()is such that

∇ln|∇yεθ,θ| ∈L

_;_RN _and _y

εθ,θ∈Muθ()⊂W

,p  ().

Then

Aθ(Fz),Fv

W–,q₍_);W,p

 ()=

Bθ(z),v

H–₍_),H

(), (.)

where

Bθ(z) = –div(Aθ∇z) –



Vθ(x)z, (.)

Vθ(x) = ( –p)div

Aθ∇ln|∇yεθ,θ|

–(p– )



∇ln|∇yεθ,θ|,Aθ∇ln|∇yεθ,θ|

RN, (.)

and the linear operator Bθdeﬁnes an isomorphism from H()into its dual H–().

Proof Letvandzbe arbitrary elements ofD(yεθ,θ)⊂H



(). Then by Corollary ., we

haveFz,Fv∈W_,p(). Further, following the deﬁnition of the operatorF, we can provide the following chain of transformations:

Aθ(Fz) = –div

ρθAθ∇(Fz)

= –div

ρθAθ∇

z

√

ρθ

= –div

– ρ

–/

θ zAθ∇ρθ+ρθ/Aθ∇z

= – ρ

–/

θ z(∇ρθ,Aθ∇ρθ)_RN+  ρ

–/₍_∇_z_,_A

θ∇ρθ)_RN +

ρ

–/

θ zdiv(Aθ∇ρθ) –

 ρ

–/

θ (∇ρθ,Aθ∇z)RN –ρ/_θ div(Aθ∇z)

= –ρ_θ/div(Aθ∇z) –

 ρ / θ   _∇ ρθ ρθ

,Aθ∇

ρθ ρθ RN –  ρθ

div(Aθ∇ρθ)

z

=ρ_θ/

–div(Aθ∇z) –

 



(∇lnρθ,Aθ∇lnρθ)RN– 

ρθ

div(ρθAθ∇lnρθ)

z

=ρθ/

–div(Aθ∇z) –

 Vθ(x)z

.

Hence,

Aθ(Fz),Fv

W–,q₍_);W,p

 ()

=

–div

ρθAθ∇

z

√

ρθ

,√v

ρθ

W–,q₍_);W,p