An optimal control problem for linear SDE of mean field type with terminal constraint and partial information

(1)

R E S E A R C H

Open Access

An optimal control problem for linear SDE of

mean-ﬁeld type with terminal constraint and

partial information

Haiyan Zhang

1*

*_{Correspondence:}

[email protected];

[email protected]

1_{College of Sciences, Shandong} Jiaotong University, Jinan, China

Abstract

This paper is concerned with an optimal control problem for a linear stochastic differential equation (SDE) of mean-field type, where the drift coefficient of observation equation is linear with respect to the state, the control and their expectations, and the state is subject to a terminal constraint. The control problem cannot be solved by transforming it into a standard optimal control problem for an SDE without mean-field term. By virtue of a backward separation method with a decomposition technique, one optimality condition and one forward–backward filter are derived. Two linear-quadratic (LQ) optimal control problems and one cash management problem with terminal constraint and partial information are studied, and optimal feedback controls are explicitly obtained.

Keywords: Feedback control; Filter; LQ optimal control; Necessary condition; SDE of mean-ﬁeld type

1 Introduction

One begins with a complete ﬁltered probability space (Ω,F, (Ft)0≤t≤T,P), on which are given anFt-adapted standard Brownian motion (ωt,ω˜t) with value inR2and a Gaussian random variableξ with meanμ0and covarianceσ0. (ω,ω˜) is independent ofξ. LetT> 0 be a ﬁxed time horizon, letRm _{be the}_m_{-dimensional Euclidean space, and let} _f

xbe the partial derivative off with respect tox. Ifx: [0,T]→Ris uniformly bounded, one writesx∈L∞(0,T;R). Ifx: [0,T]→Ris square-integrable, one writesx∈L2(0,T;R). If

x: [0,T]×Ω→Ris anFt-adapted, square-integrable process, one writesx∈L2

F(0,T;R).

One also adopts similar notations for other ﬁltrations and Euclidean spaces. Consider the linear SDE

⎧ ⎨ ⎩

dxv

t = (atxvt+a¯tExvt +btvt+b¯tEvt)dt+ctdωt+˜ctdω˜t,

xv 0=ξ,

whereEis expectation, andvis a control process. SinceExv

t andEvtappear in the equa-tion, one calls it an SDE of mean-ﬁeld type. Assume that the solutionxv_{to the equation is}

(2)

observed through ⎧

⎨ ⎩

dyv

t = (ftxvt+f¯tExvt+gtvt+g¯tEvt)dt+htdω˜t,

y0= 0.

The cost functional is

J[v] =E T

0

lt,xv_t,Exv_t,vt,Evt

dt+φxv_T,Exv_T .

Herevtis required to beσ{yvs; 0≤s≤t}-adapted and to satisfy

E sup

0≤t≤T|

vt|2< +∞

and

Eϕxv T,ExvT

= 0;

the functionsa,a¯,b,b¯,c,˜c,f,f¯,g,g¯,h,l,φ andϕwill be specified in Sect.2. This is a partially observable optimal control problem with terminal constraint. This problem can reduce to an optimal control problem with certain additional control domain constraint, but it cannot be studied by classical control theory for SDE without mean-field term. From this viewpoint, this problem extends some standard optimal control problems and covers a few financial models.

Classical variation provides an effective tool for studying optimal control problems. However, it is not always valid for partially observed optimal control problems. A main reason is there is a circular dependence between the control vand the observationyv_. In 2008, Wang and Wu [1] originally proposed a backward separation method. In 2018, Wang coauthored their monograph [2], where the backward separation method was sys-tematically introduced and was regarded as one of most important tools for studying par-tially observed optimal control problems. Combining the backward separation method with Girsanov’s measure transformation, the circular dependence betweenvandyv_was decoupled in Wang et al. [3], and then a necessary condition for optimality was derived. Along this line, Zhang [4], Ma and Liu [5] extended [3] to the case of correlated state and observation noises, and the case of risk-sensitive control, respectively. Buckdahn et al. [6] studied an optimal control problem for SDE of conditional mean-field type. One emphasizes that [3–5] and [6] are based on the assumption that the drift coefficients of observation equations are uniformly bounded with respect to their components, which is restricted in some applications. Using the backward separation method with an approxi-mation technique, Wang et al. [7] generalized [3–6] in the sense that the drift coefficient of observation equation linearly grows with respect to the state, and for any> 0, the con-trolvsatisfiesEsup₀_≤_t_≤_T|vt|< +∞. Note that [7] did not study the case of mean-field and terminal constraint.

(3)

terminal constraint was considered. Combining the decomposition technique with the backward separation method, one solves the control problem. The contributions of this paper are as follows.

– One new necessary condition for optimality is derived. The condition together with forward–backward ﬁlter provides an eﬀective method for studying stochastic optimal control with terminal constraint and incomplete information.

– Three LQ examples with terminal constraint and partial information are solved, and optimal feedback controls are obtained by accident.

– An SDE of mean-ﬁeld type naturally arises from the study of standard LQ optimal control driven by SDE without mean-ﬁeld term. This interesting contribution can be found in Example4.2below.

The control problem is also related to those of Meyer-Brandis et al. [10], Elliott et al. [11], Yong [12], Hafayed and Abbas [13], Ni et al. [14] and Hafayed et al. [15]. Specifically, [10,15], respectively, studied a mean-field type control problem with partial information, where neither noisy observation nor filter is studied. The other work investigated mean-field type controls with complete information.

The rest of this paper is organized as follows. In Sect.2, one reformulates the con-trol problem and provides preliminary results. Section3derives one optimality condition and one forward–backward ﬁltering equation of mean-ﬁeld type. In Sect.4, one explicitly solves three LQ optimal control problems with terminal constraint and partial informa-tion. Finally, in Sect.5, one gives some concluding remarks.

2 Problem formulation and preliminary Deﬁnex0andy0by two SDEs

⎧ ⎨ ⎩

dx0

t = (atx0t +a¯tEx0t)dt+ctdωt+˜ctdω˜t,

x0₀=ξ, (1)

and ⎧ ⎨ ⎩

dy0_t = (ftx0t +f¯tEx0t)dt+htdω˜t,

y0= 0,

(2)

where a,a¯,c,c˜,f,f¯,h∈L∞(0,T;R). Assume that the controlv∈L2

F(0,T;R). Deﬁnexv,1

andyv,1_by ⎧ ⎨ ⎩

˙

xv,1t =atxv,1t +a¯tExv,1t +btvt+b¯tEvt,

xv,1₀ = 0, (3)

and ⎧ ⎨ ⎩

˙

yv,1_t =ftxv,1t +¯ftExv,1t +gtvt+g¯tEvt,

(4)

whereb,b¯,g,g¯∈L∞(0,T;R). It is clear that Eqs. (1), (2), (3) and (4) have unique solutions, respectively. Set

xv

t =x0t+xv,1t and yvt =y0t +yv,1t . (5)

It follows from Itô’s formula thatxv_and_yv_satisfy ⎧

⎨ ⎩

dxv

t = (atxvt+a¯tExvt+btvt+b¯tEvt)dt+ctdωt+˜ctdω˜t,

xv 0=ξ,

(6)

and ⎧ ⎨ ⎩

dyv

y0= 0,

(7)

respectively. For anyv∈L2

F(0,T;R), one introduces a constraint condition regarding the

terminal state and its distribution

Eϕxv_T,Exv_T= 0. (8)

Let

Fy0 t =σ

y0_s; 0≤s≤t, Ftyv=σ

yv_s; 0≤s≤t,

and letUbe a nonempty convex subset ofR. Deﬁne three admissible control sets

U0 ad=

vvtis anFy 0

t -adapted process with value inUand satisﬁes

E sup

0≤t≤T|

vt|2< +∞

,

Uad=

v|v∈U_ad0 is anF_tyv-adapted process,

and

Uc ad=

v|v∈Uadsatisﬁes terminal constraint (8)

.

It is easy to see that the inclusion relationship among them is

U0

ad⊇Uad⊇Uadc.

With (5) and the deﬁnition ofUad, one proves the equality

Fyv t =F

y0

t , v∈Uad.

In fact, sincevt isFy 0

t -adapted, then it follows from (3) and (4) thatyv,1t isF y0

t -adapted, and thusyv

tis alsoF y0

t -adapted by using the second equality in (5). This implies thatF yv t ⊆ Fy0

t ,v∈Uad. In the same way, one getsF yv t ⊇F

y0

(5)

The cost functional is in the form of

J[v] =E T

0

lt,xv_t,Exv_t,vt,Evt

dt+φxv_T,Exv_T , (9)

wherel: [0,T]×R2_×_U_×_R_→_R_and_φ_,_ϕ_:_R2_→_R_{are continuously diﬀerentiable with} respect to (x,x¯,v,v¯) and (x,x¯), respectively, and there is a constantC> 0 such that

ψ(x,x¯)≤C1 +|x|2+|¯x|2, ψχ(x,x¯)≤C

1 +|x|+|¯x|,

l(t,x,x¯,v,v¯)≤C1 +|x|2+|¯x|2+|v|2+|¯v|2, _l_χ₍_t_,_x_,_x_¯_,_v_,_v_¯₎_≤_C

1 +|x|+|¯x|+|v|+|¯v|,

withψ=φ,ϕandχ=x,x¯,v,v¯.

Then the optimal control problem with terminal constraint is restated as follows.

Problem (TC) Find au∈U_adc such that

J[u] = inf

v∈U_adc J[v]

subject to (6), (7), (8) and (9). Anyusatisfying the equality is called an optimal control of Problem (TC), andxu_{is called the optimal state corresponding to}_u_.

One also introduces an auxiliary problem without terminal constraint.

Problem (A) Find au∈Uadsuch that

Jκ[u] = inf

v∈Uad

Jκ[v]

subject to (6), (7) and

Jκ[v] =E

T

0

lt,xv_t,Exv_t,vt,Evt

dt+φxv_T,Exv_T+κϕxv_T,Exv_T , κ∈⊆R. (10)

In what follows, one provides two preliminary results, whose proofs can be found in the

Appendix.

Proposition 2.1 For anyκ∈,one has

inf

v∈Uad

Jκ

v= inf

v∈U_ad0

Jκ[v].

Proposition 2.2 Suppose that, for anyκ∈,uκ is an optimal control of Problem(A).

Moreover,suppose that there existsκ0∈such that

ϕxuκ0 T ,Ex

uκ0 T

= 0.

(6)

Proposition2.1reveals the equivalence between Problem (A) and the problem of min-imizingJκ[v] overU_ad0. Proposition2.2together with Proposition2.1shows that one can

obtain an optimal control of Problem (TC) by the following procedures: (1) to derive all optimal controlsuκ of Problem (A); (2) to ﬁnduκ0 satisfyingEϕ(x

uκ0 T ,Ex

uκ0

T ) = 0. Then suchuκ0 is exactly an optimal control of Problem (TC). Clearly, it is a more convenient approach in at least some detailed cases. See, e.g., Sect.4for more details.

This remark shows that the second procedure above can easily be ﬁnished in general, and thus it is enough to study Problem (A).

3 Optimality condition of Problem (A)

For anyv,vj∈Uad, letxvandxvjbe the solutions to (6) corresponding tovandvj,j= 1, 2, . . . . For simplicity, we set

Υ_tθ=t,x_tv+θxvjt –xvt

,Exv_t+θxvjt –xvt

,vt+θ(vj,t–vt),E

vt+θ(vj,t–vt)

,

Θ_tλ=t,xλ

t,Exλt,λt,Eλt

, Ξ_tλ=xλ

t,Exλt

,

whereλ=v,uκ,vj,j= 1, 2, . . . .

Theorem 3.1 If uκ is an optimal control of Problem(A),then the backward stochastic

diﬀerential equation of mean-ﬁeld type

⎧ ⎨ ⎩

–dpκ,t= [atpκ,t+lx(Θtuκ) +E(a¯tpκ,t+lx(¯ Θtuκ))]dt–qκ,tdωt–q˜κ,tdω˜t,

pκ,T=φx(ΞTuκ) +κϕx(ΞTuκ) +E(φx¯(ΞTuκ) +κϕx¯(ΞTuκ)),

(11)

has a unique solution(pκ,qκ,q˜κ)∈L2_F(0,T;R3)such that for anyν∈U

EHv

Θuκ

t ;pκ,t,qκ,t,q˜κ,t

+EHv¯

Θuκ

t ;pκ,t,qκ,t,q˜κ,t

(ν–uκ,t)|Fy uκ

t

≥0, κ∈, (12)

where the Hamiltonian function H is deﬁned by

H(t,x,x¯,v,¯v;p,q,q˜) = (atx+a¯tx¯+btv+b¯tv¯)p+ctq+c˜tq˜+l(t,x,x¯,v,v¯).

Proof Ifuκis an optimal control of Problem (A), Proposition2.1implies that

Jκ[uκ] = inf

v∈U0 ad

Jκ[v].

For anyv∈U0

ad, letxuκ+

εv_{be the solution to (}₆_{) corresponding to}_u

κ+εv, where 0≤ε≤1.

Introduce the variational equation

⎧ ⎨ ⎩

˙

x1,t=atx1,t+a¯tEx1,t+btvt+b¯tEvt,

(7)

which admits a unique solutionx1∈L2_F(0,T;R). It follows from Hölder’s inequality that

Combining the limit with the optimality ofu, one derives the ﬁrst-order variational in-equality

F(0,T;R3). Using Itô’s formula tox1pκand inserting it into the variational inequality,

one gets

t , then one draws the desired conclusion.

According to (12), one needs to compute the optimal ﬁlters of (11) and (6) depending

the ﬁlters ofΦtandΨt, respectively. Moreover, one denotes by

(8)

and

⎧ ⎨ ⎩

–dpˆκ,t={atpˆκ,t+E[lx(Θtuκ)|F yuκ

t ] +E(a¯tpκ,t+lx(¯ Θtuκ))}dt–Qtdωˆt,

ˆ

pκ,T=E[φx(ΞTuκ) +κϕx(ΞTuκ)|F yu

T ] +E(φx¯(ΞTuκ) +κϕx¯(ΞTuκ)),

(14)

respectively,whereΣis the unique solution to

⎧ ⎨ ⎩

˙

Σt– 2atΣt+ (c˜t+Σtfth–1t )2– (ct+c˜t)2= 0,

Σ0=σ0,

(15)

ˆ ωt=

t

0

h_s–1dy0_s–fsxˆ0s+f¯sEx0s

ds, (16)

is a standard Brownian motion with value inR,and

Qt=zˆ˜tuκ +

xuκ

t pκ,t–xˆutκpˆκ,t

fth–1t .

One emphasizes that (13) with (14) is a forward–backward stochastic differential filter-ing equation of mean-field type, which has a unique solution (xûκ_,_pˆ

κ,Q)∈L2_Fyuκ(0,T;R3)

for givenuκ. It shows that Theorem3.2is diﬀerent from the usual ﬁltering theories. See,

e.g., Xiong [16].

4 Three LQ cases of Problem (TC)

In this section, one aims at illustrating Theorems3.1and3.2by three examples. For conve-nience, one still adopts the state equation, the observation equation, and the correspond-ing assumptions introduced in Sects.2and3unless noted otherwise.

Example4.1 Find an admissible control to minimize

J[v] =1 2E

T

0

At

xv_t2+A¯t

Exv_t2+Btvt2+B¯t(Evt)2

dt

+Dxv_T2+D¯Exv_T2

(17)

overUc

adwithU=Rand the terminal constraint

Exv_T=γ, γ ∈R, (18)

subject to ⎧ ⎨ ⎩

dxv_t = (atxvt+a¯tExvt+btvt+b¯tEvt)dt+ctdωt+˜ctdω˜t,

xv 0=ξ,

(19)

and ⎧ ⎨ ⎩

dyv

y0= 0,

(9)

whereA,A¯,B,B¯ ∈L∞(0,T;R),At> 0,At+A¯t≥0,Bt> 0,Bt+B¯t> 0,D≥0,D+D¯ ≥0,

b= 0 andb+b¯= 0.

Deﬁne an auxiliary cost functional without terminal constraint

Jκ[v] =Jκ[v] –κγ

with

Jκ[v] =

1 2E

T

0

At

xv_t2+A¯t

Exv t 2

+Btv2t +B¯t(Evt)2

dt

+Dxv_T2+D¯Exv_T2+ 2κxv_T

, κ∈. (21)

Since bothκandγ are constants, it is enough to minimize (21) overUadsubject to (19) and (20). One will use three steps to explicitly solve the example.

Step 1Candidate optimal control of the auxiliary problem without terminal constraint. With the data, the Hamiltonian function is

H(t,x,x¯,v;pκ,qκ,q˜κ) = (atx+a¯tx¯+btv+b¯tv¯)pκ+ctqκ+c˜tq˜κ

+1 2

Atx2+A¯t(x¯)2+Btv2+B¯t(v¯)2

,

where (pκ,qκ,q˜κ) is determined by the Hamiltonian system

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

dxuκ

t = (atxutκ +a¯tExutκ+btuκ,t+b¯tEuκ,t)dt+ctdωt+c˜tdω˜t, –dpκ,t= [atpκ,t+Atxut +E(a¯tpκ,t+A¯txut)] –qκ,tdωt–q˜κ,tdω˜t,

xuκ

0 =ξ, pκ,T=DxuTκ +DEx¯ uκ

T +κ.

(22)

Ifuκis an optimal control of the auxiliary problem, then it follows from Theorem3.1that

Btuκ,t+btpˆκ,t+B¯tEuκ,t+b¯tEpκ,t= 0.

Solving it, we get

uκ,t= –B–1t

btpˆκ,t+ _¯

bt–B¯t(Bt+B¯t)–1(bt+b¯t)

Epκ,t

. (23)

Step 2Feedback form of (23).

Inserting (23) into (22) and taking expectations, one gets an ordinary diﬀerential equa-tion

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ d dtEx

uκ

t = (at+a¯t)Exutκ– (Bt+B¯t)–1(bt+b¯t)2Epκ,t,

d

dtEpκ,t= –(At+A¯t)Ex uκ

t – (at+a¯t)Epκ,t,

Exuκ

0 =μ0, Epκ,T= (D+D¯)ExuTκ+κ.

(24)

Note that the ﬁrst equation and the second equation in (24) are coupled. Since

(10)

the assumption condition of Theorem 2.6 in Peng and Wu [17] is satisﬁed, and hence (24) has a unique solution (Exuκ_,E_p

κ). Noticing the terminal condition of (24), one sets

Epκ,t=αtExtuκ+βκ,t, (25)

where α andβκ are deterministic and diﬀerential functions such thatαT=D+D¯ and

βκ,T=κ. Using the chain rule for computing the derivative of (25), one has

d

dtEpκ,t=α˙tEx

uκ

t +αt

d dtEx

uκ

t +β˙κ,t

=α˙t+ (at+a¯t)αt– (Bt+B¯t)–1(bt+b¯t)2α2t

Exuκ

t +β˙κ,t–αt(Bt+B¯t)–1(bt+b¯t)2βκ,t.

Comparing the equality with the second equation in (24), one deduces

⎧ ⎨ ⎩

˙

αt+ 2(at+a¯t)αt– (Bt+B¯t)–1(bt+b¯t)2α2t +At+A¯t= 0,

αT=D+D¯,

(26)

and ⎧ ⎨ ⎩

˙

βκ,t+ [at+a¯t– (Bt+B¯t)–1(bt+b¯t)2αt]βκ,t= 0,

βκ,T=κ.

(27)

It is easy to see (26) and (27) admit unique solutions, respectively. Inserting (25) into the ﬁrst equation of (24), one derives

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ d dtEx

uκ

t = [at+a¯t– (Bt+B¯t)–1(bt+b¯t)2αt]Exutκ – (Bt+B¯t)–1(bt+b¯t)2βκ,t,

Exu 0=μ0.

(28)

Using Theorem3.2to (22) with (23), one gets a forward–backward stochastic differential filtering equation of mean-field type,

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

dxˆuκ

t = [atxˆutκ–b2tBt–1pˆκ,t+a¯tExutκ – (Bt+B¯t)–1(b¯2t + 2btb¯t–B–1t B¯tb2t)Epκ,t]dt + (c˜t+Σtfth–1t )dωˆt,

–dpˆκ,t= [Atxˆtuκ+atpˆκ,t+E(a¯tpκ,t+A¯txtuκ)]dt–Qtdωˆt,

ˆ

xuκ

0 =μ0, pˆκ,T=DxˆuTκ +D¯Ex uκ

T +κ,

(29)

whereΣandωˆ satisfy (15) with (16), andExuκ _andEp

κ solve (28) and (25), respectively.

Sinceb= 0, (29) admits a unique solution (xˆuκ_,_pˆ

κ,Q)∈L2_Fyuκ(0,T;R3) by Theorem 2.6 in

[17] again. Similarly, let

ˆ

(11)

whereΓ,Γ¯ andΛκare three deterministic and diﬀerential functions satisfyingΓT=D,

Comparing it with the second equation in (29), one deduces ⎧

which admit a unique solution, respectively. Plugging (30) into (23), one gets

uκ,t= –B–1t

Step 3Optimal control of Example4.1. Solving (27) and (28), one gets

βκ,t=κe T

(12)

Exˆuκ

t =μ0e t 0ρsds_–_κ

0

(Bs+B¯s)–1(bs+b¯s)2e T s ρrdr+

t sρrdr_ds_,

with

ρt=at+a¯t– (Bt+B¯t)–1(bt+b¯t)2αt.

Recalling the terminal constraint (18), it yields

κ0=

μ0e T

0 ρtdt_–_γ T

0 (Bt+B¯t)–1(bt+b¯t)2e

2_tTρsds_dt. (35)

Then Proposition2.2implies the desired conclusion. The above deduction is summarized as follows.

Proposition 4.1 The optimal feedback control of Example4.1is given by(34)withκbeing

replaced by(35).

Example4.2 In particular, if one lets the coeﬃcients of (17), (19) and (20)A¯ =B¯=D¯ =a¯=

¯

b=c=f =f¯=g=g¯= 0 on [0,T], then Example4.1is reduced to an LQ optimal control with terminal constraint and complete information. Further, letv∈L2(0,T;R). Sincevis deterministic, Proposition4.1implies that the optimal feedback control is

uκ0,t= –B –1 t bt

αtEx uκ0 t +βκ0,t

,

whereα,βκ0andx

uκ0 satisfy ⎧

⎨ ⎩

˙

αt+ 2atαt–Bt–1b2tα2t +At= 0,

αT=D,

βκ0,t=κ0e T

t (as–B–1s b2sαs)ds_,

and ⎧ ⎨ ⎩

dxuκ0 t = (atx

uκ0

t –B–1t b2tαtEx uκ0

t –B–1t b2tβκ0,t)dt+˜ctdω˜t,

xuκ0 0 =ξ,

(36)

with

κ0=

μ0e T

0(at–B–1t b2tαt)dt T

0 B–1t b2te2 T

t (as–B–1s b2sαs)ds_dt ,

respectively.

(13)

Example4.3 One denotes byvthe rate of capital withdrawal or injection of a ﬁrm, and by

xv_{the cash-balance process on [0,}_T_{]. Assume that the liability of the ﬁrm is governed by}

–dL¯v_t =btvtdt+ctdωt+c˜tdω˜t,

wherectdωt and˜ctdω˜t describe the liability risk. Assume that the ﬁrm owns an initial investmentξ, and only invests in a money account with compounded interest ratea. Then the cash-balance is denoted by

xv_t =e0tasds

ξ– t

0

e–0sardr_d_L¯v s

,

whose diﬀerential form is the same as (6) witha¯=b¯= 0. Due to the discreteness of the account information, the ﬁrm partially observes the cash-balance by the corresponding stock price

⎧ ⎨ ⎩

dSv

t =Svt[(ftxvt +gt+1₂h2t)dt+htdω˜t],

Sv0= 1.

Set

yv_t=logSv_t.

It follows from Itô’s formula thatyv_{is governed by (}₇_{) with}¯_f

t=g¯t= 0. The ﬁrm hopes to ﬁnd a suitablevsuch that

J[v] = 1 2E

T

0

v2_tdt+xv_T–Exv_T2

is minimized overU_adc with terminal constraint (18). The model implies that the ﬁrm wants to minimize the risk ofxv_Tandvunder a ﬁxed terminal cash-balance level. Since

Exv_T–Exv_T2=Exv_T2–Exv_T2,

Example4.3is also a special case of Example4.1. The following result is an immediate one of Proposition4.1.

Corollary 4.1 The optimal rate of capital withdrawal or injection of the ﬁrm is

uκ0,t= –bt

Γtxˆ uκ0 t +Γ¯tExˆ

uκ0 t +Λκ0,t

,

whereΓ,Γ¯,Λandxˆuκ₀ _{are the solutions to}

⎧ ⎨ ⎩

˙

Γt+ 2atΓt–b2tΓt2= 0,

ΓT= 1, ⎧ ⎨ ⎩

˙¯

Γt+ 2(at–b2tΓt)Γ¯t–b2tΓ¯t2= 0,

(14)

⎨

One remarks that the model in Example4.3is inspired by Huang et al. [18], where the variance ofvdoes not enter the performance functional.

5 Concluding remarks

This paper has studied an optimal control problem driven by SDE of mean-field tpye, where the drift coefficient of observation equation is linear with respect to the state, the control and their expectations, and the observation equation is explicitly dependent on the control. This framework covers many more financial models including [18], but it causes trouble in addressing the control problem. This trouble has been solved by the backward separation method with a decomposition technique. The results obtained here partially improve those of [3–6].

The traditional separation principle method is also applicable to study Problem (TC). However, the traditional method cannot oﬀer a valid way to derive an optimality condition of Problem (TC). The reason is listed below. Let(t,x) be the conditional density ofxv

t, given the observable ﬁltrationFtyv. Similar to Theorem 2.3 in [2], one gets

⎧

is anFtyv-adapted andR-valued standard Brownian motion, and hence, the cost functional (9) is rewritten as

(15)

Appendix

One presents three lemmas ﬁrst, and then one gives a proof of Proposition2.1.

Lemma A1 For any vj∈L2_F(0,T;R),j= 1, 2,there is a constant C> 0such that

Proof The estimate is obtained by Itô’s formula and Hölder’s inequality. The details of the

proof are omitted to save space.

Lemma A2 For any v,vj∈Uad,j= 1, 2, . . . ,

lim

j→+∞Jκ[vj] =Jκ[v].

Proof Using Taylor’s expansion, Hölder’s inequality and LemmaA1, one gets

ad, one deﬁnes a family of controls by

(16)

whereν∈U,i,jare natural numbers, 1≤i≤j– 1, andδj=T/j. Similar to [8], one proves that (i) vj∈Uad for anyj, and (ii)vj→vasj→+∞inL2

Fy0(0,T;U). Thus the proof is

complete.

Proof of Proposition2.1 From the deﬁnition of decision set, it is easy to see

inf

v∈Uad

Jκ

v≥ inf

v∈U_ad0

Jκ[v].

Then one only needs to prove the reverse inequality. Sincevjintroduced in LemmaA3is an element ofUad,

Jκ[vj]≥ inf

v∈Uad

Jκ

v,

and, consequently, it follows from LemmaA2that

Jκ[v] = lim

j→+∞Jκ[vj]≥vinf∈Uad

Jκ

v.

Then the arbitrariness ofvimplies the desired result.

Proof of Proposition2.2 According to the deﬁnition of optimal control and value function, the desired conclusion is drawn by using the idea introduced in Chap. 11 of Øksendal [19]. One drops the details of the proof for saving space.

Funding

This work is supported in part by the NSF of China under Grant No. 61603219, and by the Shandong Jiaotong University Climbing Research Innovation Team Program.

Abbreviations

SDE, stochastic diﬀerential equation; LQ, linear-quadratic; TC, terminal constraint.

Availability of data and materials

Not applicable.

Competing interests

The author declares that there is no conﬂict of interests regarding the publication of this paper.

Author’s contributions

The author read and approved the ﬁnal manuscript.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional aﬃliations.

Received: 26 September 2018 Accepted: 18 February 2019

References

1. Wang, G.C., Wu, Z.: Kalman–Bucy ﬁltering equations of forward and backward stochastic systems and applications to recursive optimal control problems. J. Math. Anal. Appl.342, 1280–1296 (2008)

2. Wang, G.C., Wu, Z., Xiong, J.: An Introduction to Optimal Control of FBSDE with Incomplete Information. Springer, Cham (2018)

3. Wang, G.C., Zhang, C.H., Zhang, W.H.: Stochastic maximum principle for mean-ﬁeld type optimal control with partial information. IEEE Trans. Autom. Control59, 522–528 (2014)

4. Zhang, H.Y.: A necessary condition for mean-ﬁeld type stochastic diﬀerential equations with correlated state and observation noises. J. Ind. Manag. Optim.12, 1287–1301 (2016)

5. Ma, H.P., Liu, B.: Optimal control problem for risk-sensitive mean-ﬁeld stochastic delay diﬀerential equation with partial information. Asian J. Control19, 2097–2115 (2017)

(17)

7. Wang, G.C., Wu, Z., Xiong, J.: Maximum principles for forward–backward stochastic control systems with correlated state and observation noises. SIAM J. Control Optim.51, 491–524 (2013)

8. Wang, G.C., Wu, Z., Xiong, J.: A linear-quadratic optimal control problem of forward–backward stochastic diﬀerential equations with partial information. IEEE Trans. Autom. Control60, 2904–2916 (2015)

9. Wang, G.C., Xiao, H., Xing, G.J.: An optimal control problem for mean-ﬁeld forward–backward stochastic diﬀerential equation with noisy observation. Automatica86, 104–109 (2017)

10. Meyer-Brandis, T., Øksendal, B., Zhou, X.Y.: A mean-ﬁeld stochastic maximum principle via Malliavin calculus. Stochastics84, 643–666 (2012)

11. Elliott, R., Li, X., Ni, Y.H.: Discrete time mean-ﬁeld stochastic linear quadratic optimal control problems. Automatica49, 3222–3233 (2013)

12. Yong, J.M.: Linear-quadratic optimal control problems for mean-ﬁeld stochastic diﬀerential equations. SIAM J. Control Optim.51, 2809–2838 (2013)

13. Hafayed, H., Abbas, S.: On near-optimal mean-ﬁeld stochastic singular controls: necessary and suﬃcient conditions for near-optimality. J. Optim. Theory Appl.160, 778–808 (2014)

14. Ni, Y.H., Zhang, J.F., Li, X.: Indeﬁnite mean-ﬁeld stochastic linear-quadratic optimal control. IEEE Trans. Autom. Control 60, 1786–1800 (2015)

15. Hafayed, M., Abba, A., Abbas, S.: On partial-information optimal singular control problem for mean-ﬁeld stochastic diﬀerential equations driven by teugels martingales measures. Int. J. Control89, 397–410 (2016)

16. Xiong, J.: An Introduction to Stochastic Filtering Theory. Oxford University Press, London (2008)

17. Peng, S.G., Wu, Z.: Fully coupled forward–backward stochastic diﬀerential equations and applications to optimal control. SIAM J. Control Optim.37, 825–843 (1999)

18. Huang, J.H., Wang, G.C., Wu, Z.: Optimal premium policy of an insurance ﬁrm: full and partial information. Insur. Math. Econ.47, 208–215 (2010)