• No results found

An optimal control problem for linear SDE of mean field type with terminal constraint and partial information

N/A
N/A
Protected

Academic year: 2020

Share "An optimal control problem for linear SDE of mean field type with terminal constraint and partial information"

Copied!
17
0
0

Loading.... (view fulltext now)

Full text

(1)

R E S E A R C H

Open Access

An optimal control problem for linear SDE of

mean-field type with terminal constraint and

partial information

Haiyan Zhang

1*

*Correspondence:

[email protected];

[email protected]

1College of Sciences, Shandong Jiaotong University, Jinan, China

Abstract

This paper is concerned with an optimal control problem for a linear stochastic differential equation (SDE) of mean-field type, where the drift coefficient of observation equation is linear with respect to the state, the control and their expectations, and the state is subject to a terminal constraint. The control problem cannot be solved by transforming it into a standard optimal control problem for an SDE without mean-field term. By virtue of a backward separation method with a decomposition technique, one optimality condition and one forward–backward filter are derived. Two linear-quadratic (LQ) optimal control problems and one cash management problem with terminal constraint and partial information are studied, and optimal feedback controls are explicitly obtained.

Keywords: Feedback control; Filter; LQ optimal control; Necessary condition; SDE of mean-field type

1 Introduction

One begins with a complete filtered probability space (Ω,F, (Ft)0≤tT,P), on which are given anFt-adapted standard Brownian motion (ωt,ω˜t) with value inR2and a Gaussian random variableξ with meanμ0and covarianceσ0. (ω,ω˜) is independent ofξ. LetT> 0 be a fixed time horizon, letRm be them-dimensional Euclidean space, and let f

xbe the partial derivative off with respect tox. Ifx: [0,T]→Ris uniformly bounded, one writesxL∞(0,T;R). Ifx: [0,T]→Ris square-integrable, one writesxL2(0,T;R). If

x: [0,TΩ→Ris anFt-adapted, square-integrable process, one writesxL2

F(0,T;R).

One also adopts similar notations for other filtrations and Euclidean spaces. Consider the linear SDE

⎧ ⎨ ⎩

dxv

t = (atxvt+a¯tExvt +btvt+b¯tEvt)dt+ctdωtctdω˜t,

xv 0=ξ,

whereEis expectation, andvis a control process. SinceExv

t andEvtappear in the equa-tion, one calls it an SDE of mean-field type. Assume that the solutionxvto the equation is

(2)

observed through ⎧

⎨ ⎩

dyv

t = (ftxvt+f¯tExvt+gtvt+g¯tEvt)dt+htdω˜t,

y0= 0.

The cost functional is

J[v] =E T

0

lt,xvt,Exvt,vt,Evt

dt+φxvT,ExvT .

Herevtis required to beσ{yvs; 0≤st}-adapted and to satisfy

E sup

0≤tT|

vt|2< +∞

and

Eϕxv T,ExvT

= 0;

the functionsa,a¯,b,b¯,cc,f,f¯,g,g¯,h,l,φ andϕwill be specified in Sect.2. This is a partially observable optimal control problem with terminal constraint. This problem can reduce to an optimal control problem with certain additional control domain constraint, but it cannot be studied by classical control theory for SDE without mean-field term. From this viewpoint, this problem extends some standard optimal control problems and covers a few financial models.

Classical variation provides an effective tool for studying optimal control problems. However, it is not always valid for partially observed optimal control problems. A main reason is there is a circular dependence between the control vand the observationyv. In 2008, Wang and Wu [1] originally proposed a backward separation method. In 2018, Wang coauthored their monograph [2], where the backward separation method was sys-tematically introduced and was regarded as one of most important tools for studying par-tially observed optimal control problems. Combining the backward separation method with Girsanov’s measure transformation, the circular dependence betweenvandyvwas decoupled in Wang et al. [3], and then a necessary condition for optimality was derived. Along this line, Zhang [4], Ma and Liu [5] extended [3] to the case of correlated state and observation noises, and the case of risk-sensitive control, respectively. Buckdahn et al. [6] studied an optimal control problem for SDE of conditional mean-field type. One emphasizes that [3–5] and [6] are based on the assumption that the drift coefficients of observation equations are uniformly bounded with respect to their components, which is restricted in some applications. Using the backward separation method with an approxi-mation technique, Wang et al. [7] generalized [3–6] in the sense that the drift coefficient of observation equation linearly grows with respect to the state, and for any> 0, the con-trolvsatisfiesEsup0tT|vt|< +∞. Note that [7] did not study the case of mean-field and terminal constraint.

(3)

terminal constraint was considered. Combining the decomposition technique with the backward separation method, one solves the control problem. The contributions of this paper are as follows.

– One new necessary condition for optimality is derived. The condition together with forward–backward filter provides an effective method for studying stochastic optimal control with terminal constraint and incomplete information.

– Three LQ examples with terminal constraint and partial information are solved, and optimal feedback controls are obtained by accident.

– An SDE of mean-field type naturally arises from the study of standard LQ optimal control driven by SDE without mean-field term. This interesting contribution can be found in Example4.2below.

The control problem is also related to those of Meyer-Brandis et al. [10], Elliott et al. [11], Yong [12], Hafayed and Abbas [13], Ni et al. [14] and Hafayed et al. [15]. Specifically, [10,15], respectively, studied a mean-field type control problem with partial information, where neither noisy observation nor filter is studied. The other work investigated mean-field type controls with complete information.

The rest of this paper is organized as follows. In Sect.2, one reformulates the con-trol problem and provides preliminary results. Section3derives one optimality condition and one forward–backward filtering equation of mean-field type. In Sect.4, one explicitly solves three LQ optimal control problems with terminal constraint and partial informa-tion. Finally, in Sect.5, one gives some concluding remarks.

2 Problem formulation and preliminary Definex0andy0by two SDEs

⎧ ⎨ ⎩

dx0

t = (atx0t +a¯tEx0t)dt+ctdωtctdω˜t,

x00=ξ, (1)

and ⎧ ⎨ ⎩

dy0t = (ftx0t +f¯tEx0t)dt+htdω˜t,

y0= 0,

(2)

where a,a¯,c,c˜,f,f¯,hL∞(0,T;R). Assume that the controlvL2

F(0,T;R). Definexv,1

andyv,1by ⎧ ⎨ ⎩

˙

xv,1t =atxv,1t +a¯tExv,1t +btvt+b¯tEvt,

xv,10 = 0, (3)

and ⎧ ⎨ ⎩

˙

yv,1t =ftxv,1tftExv,1t +gtvt+g¯tEvt,

(4)

whereb,b¯,g,g¯∈L∞(0,T;R). It is clear that Eqs. (1), (2), (3) and (4) have unique solutions, respectively. Set

xv

t =x0t+xv,1t and yvt =y0t +yv,1t . (5)

It follows from Itô’s formula thatxvandyvsatisfy

⎨ ⎩

dxv

t = (atxvt+a¯tExvt+btvt+b¯tEvt)dt+ctdωtctdω˜t,

xv 0=ξ,

(6)

and ⎧ ⎨ ⎩

dyv

t = (ftxvt+f¯tExvt+gtvt+g¯tEvt)dt+htdω˜t,

y0= 0,

(7)

respectively. For anyvL2

F(0,T;R), one introduces a constraint condition regarding the

terminal state and its distribution

EϕxvT,ExvT= 0. (8)

Let

Fy0 t =σ

y0s; 0≤st, Ftyv=σ

yvs; 0≤st,

and letUbe a nonempty convex subset ofR. Define three admissible control sets

U0 ad=

vvtis anFy 0

t -adapted process with value inUand satisfies

E sup

0≤tT|

vt|2< +∞

,

Uad=

v|vUad0 is anFtyv-adapted process,

and

Uc ad=

v|vUadsatisfies terminal constraint (8)

.

It is easy to see that the inclusion relationship among them is

U0

ad⊇Uad⊇Uadc.

With (5) and the definition ofUad, one proves the equality

Fyv t =F

y0

t , vUad.

In fact, sincevt isFy 0

t -adapted, then it follows from (3) and (4) thatyv,1t isF y0

t -adapted, and thusyv

tis alsoF y0

t -adapted by using the second equality in (5). This implies thatF yv tFy0

t ,vUad. In the same way, one getsF yv tF

y0

(5)

The cost functional is in the form of

J[v] =E T

0

lt,xvt,Exvt,vt,Evt

dt+φxvT,ExvT , (9)

wherel: [0,T]×R2×U×RRandφ,ϕ:R2Rare continuously differentiable with respect to (x,x¯,v,v¯) and (x,x¯), respectively, and there is a constantC> 0 such that

ψ(x,x¯)≤C1 +|x|2+|¯x|2, ψχ(x,x¯)≤C

1 +|x|+|¯x|,

l(t,x,x¯,v,v¯)≤C1 +|x|2+|¯x|2+|v|2+|¯v|2, lχ(t,x,x¯,v,v¯)C

1 +|x|+|¯x|+|v|+|¯v|,

withψ=φ,ϕandχ=x,x¯,v,v¯.

Then the optimal control problem with terminal constraint is restated as follows.

Problem (TC) Find auUadc such that

J[u] = inf

vUadc J[v]

subject to (6), (7), (8) and (9). Anyusatisfying the equality is called an optimal control of Problem (TC), andxuis called the optimal state corresponding tou.

One also introduces an auxiliary problem without terminal constraint.

Problem (A) Find auUadsuch that

[u] = inf

vUad

[v]

subject to (6), (7) and

[v] =E

T

0

lt,xvt,Exvt,vt,Evt

dt+φxvT,ExvT+κϕxvT,ExvT , κ⊆R. (10)

In what follows, one provides two preliminary results, whose proofs can be found in the

Appendix.

Proposition 2.1 For anyκ,one has

inf

vUad

v= inf

vUad0

[v].

Proposition 2.2 Suppose that, for anyκ, is an optimal control of Problem(A).

Moreover,suppose that there existsκ0∈such that

ϕxuκ0 T ,Ex

0 T

= 0.

(6)

Proposition2.1reveals the equivalence between Problem (A) and the problem of min-imizing[v] overUad0. Proposition2.2together with Proposition2.1shows that one can

obtain an optimal control of Problem (TC) by the following procedures: (1) to derive all optimal controls of Problem (A); (2) to find0 satisfyingEϕ(x

0 T ,Ex

0

T ) = 0. Then such0 is exactly an optimal control of Problem (TC). Clearly, it is a more convenient approach in at least some detailed cases. See, e.g., Sect.4for more details.

This remark shows that the second procedure above can easily be finished in general, and thus it is enough to study Problem (A).

3 Optimality condition of Problem (A)

For anyv,vjUad, letxvandxvjbe the solutions to (6) corresponding tovandvj,j= 1, 2, . . . . For simplicity, we set

Υtθ=t,xtv+θxvjtxvt

,Exvt+θxvjtxvt

,vt+θ(vj,tvt),E

vt+θ(vj,tvt)

,

Θtλ=t,

t,Exλt,λt,Eλt

, Ξtλ=

t,Exλt

,

whereλ=v,,vj,j= 1, 2, . . . .

Theorem 3.1 If uκ is an optimal control of Problem(A),then the backward stochastic

differential equation of mean-field type

⎧ ⎨ ⎩

dpκ,t= [atpκ,t+lx(Θtuκ) +E(a¯tpκ,t+lx(¯ Θtuκ))]dt,tdωtq˜κ,t˜t,

,T=φx(ΞTuκ) +κϕx(ΞTuκ) +E(φx¯(ΞTuκ) +κϕx¯(ΞTuκ)),

(11)

has a unique solution(,,q˜κ)∈L2F(0,T;R3)such that for anyνU

EHv

Θuκ

t ;,t,,t,q˜κ,t

+EHv¯

Θuκ

t ;,t,,t,q˜κ,t

(ν,t)|Fy

t

≥0, κ, (12)

where the Hamiltonian function H is defined by

H(t,x,x¯,vv;p,q,q˜) = (atx+a¯tx¯+btv+b¯tv¯)p+ctq+c˜tq˜+l(t,x,x¯,v,v¯).

Proof Ifis an optimal control of Problem (A), Proposition2.1implies that

[] = inf

vU0 ad

[v].

For anyvU0

ad, letxuκ+

εvbe the solution to (6) corresponding tou

κ+εv, where 0≤ε≤1.

Introduce the variational equation

⎧ ⎨ ⎩

˙

x1,t=atx1,t+a¯tEx1,t+btvt+b¯tEvt,

(7)

which admits a unique solutionx1∈L2F(0,T;R). It follows from Hölder’s inequality that

Combining the limit with the optimality ofu, one derives the first-order variational in-equality

F(0,T;R3). Using Itô’s formula tox1and inserting it into the variational inequality,

one gets

t , then one draws the desired conclusion.

According to (12), one needs to compute the optimal filters of (11) and (6) depending

the filters ofΦtandΨt, respectively. Moreover, one denotes by

(8)

and

⎧ ⎨ ⎩

dpˆκ,t={atpˆκ,t+E[lx(Θtuκ)|F yuκ

t ] +E(a¯tpκ,t+lx(¯ Θtuκ))}dtQtdωˆt,

ˆ

,T=E[φx(ΞTuκ) +κϕx(ΞTuκ)|F yu

T ] +E(φx¯(ΞTuκ) +κϕx¯(ΞTuκ)),

(14)

respectively,whereΣis the unique solution to

⎧ ⎨ ⎩

˙

Σt– 2atΣt+ (c˜t+Σtfth–1t )2– (ct+c˜t)2= 0,

Σ0=σ0,

(15)

ˆ ωt=

t

0

hs–1dy0sfsxˆ0s+f¯sEx0s

ds, (16)

is a standard Brownian motion with value inR,and

Qt=zˆ˜tuκ +

xuκ

t ,t–xˆutκpˆκ,t

fth–1t .

One emphasizes that (13) with (14) is a forward–backward stochastic differential filter-ing equation of mean-field type, which has a unique solution (xˆ,pˆ

κ,Q)∈L2Fyuκ(0,T;R3)

for given. It shows that Theorem3.2is different from the usual filtering theories. See,

e.g., Xiong [16].

4 Three LQ cases of Problem (TC)

In this section, one aims at illustrating Theorems3.1and3.2by three examples. For conve-nience, one still adopts the state equation, the observation equation, and the correspond-ing assumptions introduced in Sects.2and3unless noted otherwise.

Example4.1 Find an admissible control to minimize

J[v] =1 2E

T

0

At

xvt2+A¯t

Exvt2+Btvt2+B¯t(Evt)2

dt

+DxvT2+D¯ExvT2

(17)

overUc

adwithU=Rand the terminal constraint

ExvT=γ, γ ∈R, (18)

subject to ⎧ ⎨ ⎩

dxvt = (atxvt+a¯tExvt+btvt+b¯tEvt)dt+ctdωtctdω˜t,

xv 0=ξ,

(19)

and ⎧ ⎨ ⎩

dyv

t = (ftxvt+f¯tExvt+gtvt+g¯tEvt)dt+htdω˜t,

y0= 0,

(9)

whereA,A¯,B,B¯ ∈L∞(0,T;R),At> 0,At+A¯t≥0,Bt> 0,Bt+B¯t> 0,D≥0,D+D¯ ≥0,

b= 0 andb+b¯= 0.

Define an auxiliary cost functional without terminal constraint

[v] =[v] –κγ

with

[v] =

1 2E

T

0

At

xvt2+A¯t

Exv t 2

+Btv2t +B¯t(Evt)2

dt

+DxvT2+D¯ExvT2+ 2κxvT

, κ. (21)

Since bothκandγ are constants, it is enough to minimize (21) overUadsubject to (19) and (20). One will use three steps to explicitly solve the example.

Step 1Candidate optimal control of the auxiliary problem without terminal constraint. With the data, the Hamiltonian function is

H(t,x,x¯,v;,,q˜κ) = (atx+a¯tx¯+btv+b¯tv¯)+ctqκ+c˜tq˜κ

+1 2

Atx2+A¯t(x¯)2+Btv2+B¯t(v¯)2

,

where (,,q˜κ) is determined by the Hamiltonian system

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩

dxuκ

t = (atxutκ +a¯tExutκ+btuκ,t+b¯tEuκ,t)dt+ctdωt+c˜tdω˜t,dpκ,t= [atpκ,t+Atxut +E(a¯tpκ,t+A¯txut)] –,tdωtq˜κ,t˜t,

xuκ

0 =ξ, ,T=DxuTκ +DEx¯

T +κ.

(22)

Ifis an optimal control of the auxiliary problem, then it follows from Theorem3.1that

Btuκ,t+btpˆκ,t+B¯tEuκ,t+b¯tEpκ,t= 0.

Solving it, we get

,t= –B–1t

btpˆκ,t+ ¯

btB¯t(Bt+B¯t)–1(bt+b¯t)

E,t

. (23)

Step 2Feedback form of (23).

Inserting (23) into (22) and taking expectations, one gets an ordinary differential equa-tion

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ d dtEx

t = (at+a¯t)Exutκ– (Bt+B¯t)–1(bt+b¯t)2Epκ,t,

d

dtE,t= –(At+A¯t)Ex

t – (at+a¯t)E,t,

Ex

0 =μ0, Epκ,T= (D+D¯)ExuTκ+κ.

(24)

Note that the first equation and the second equation in (24) are coupled. Since

(10)

the assumption condition of Theorem 2.6 in Peng and Wu [17] is satisfied, and hence (24) has a unique solution (Exuκ,Ep

κ). Noticing the terminal condition of (24), one sets

E,t=αtExtuκ+βκ,t, (25)

where α andβκ are deterministic and differential functions such thatαT=D+D¯ and

βκ,T=κ. Using the chain rule for computing the derivative of (25), one has

d

dtE,t=α˙tEx

t +αt

d dtEx

t +β˙κ,t

=α˙t+ (at+a¯t)αt– (Bt+B¯t)–1(bt+b¯t)2α2t

Exuκ

t +β˙κ,t–αt(Bt+B¯t)–1(bt+b¯t)2βκ,t.

Comparing the equality with the second equation in (24), one deduces

⎧ ⎨ ⎩

˙

αt+ 2(at+a¯t)αt– (Bt+B¯t)–1(bt+b¯t)2α2t +At+A¯t= 0,

αT=D+D¯,

(26)

and ⎧ ⎨ ⎩

˙

βκ,t+ [at+a¯t– (Bt+B¯t)–1(bt+b¯t)2αt]βκ,t= 0,

βκ,T=κ.

(27)

It is easy to see (26) and (27) admit unique solutions, respectively. Inserting (25) into the first equation of (24), one derives

⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ d dtEx

t = [at+a¯t– (Bt+B¯t)–1(bt+b¯t)2αt]Exutκ – (Bt+B¯t)–1(bt+b¯t)2βκ,t,

Exu 0=μ0.

(28)

Using Theorem3.2to (22) with (23), one gets a forward–backward stochastic differential filtering equation of mean-field type,

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

dxˆ

t = [atxˆutκb2tBt–1pˆκ,t+a¯tExutκ – (Bt+B¯t)–1(b¯2t + 2btb¯tB–1t B¯tb2t)Epκ,t]dt + (c˜t+Σtfth–1t )ˆt,

dpˆκ,t= [Atxˆtuκ+atpˆκ,t+E(a¯tpκ,t+A¯txtuκ)]dtQtdωˆt,

ˆ

xuκ

0 =μ0, pˆκ,T=DxˆuTκ +D¯Ex

T +κ,

(29)

whereΣandωˆ satisfy (15) with (16), andEx andEp

κ solve (28) and (25), respectively.

Sinceb= 0, (29) admits a unique solution (xˆ,pˆ

κ,Q)∈L2Fyuκ(0,T;R3) by Theorem 2.6 in

[17] again. Similarly, let

ˆ

(11)

whereΓ,Γ¯ andΛκare three deterministic and differential functions satisfyingΓT=D,

Comparing it with the second equation in (29), one deduces ⎧

which admit a unique solution, respectively. Plugging (30) into (23), one gets

,t= –B–1t

Step 3Optimal control of Example4.1. Solving (27) and (28), one gets

βκ,t=κe T

(12)

Exˆ

t =μ0e t 0ρsdsκ

0

(Bs+B¯s)–1(bs+b¯s)2e T s ρrdr+

t sρrdrds,

with

ρt=at+a¯t– (Bt+B¯t)–1(bt+b¯t)2αt.

Recalling the terminal constraint (18), it yields

κ0=

μ0e T

0 ρtdtγ T

0 (Bt+B¯t)–1(bt+b¯t)2e

2tTρsdsdt. (35)

Then Proposition2.2implies the desired conclusion. The above deduction is summarized as follows.

Proposition 4.1 The optimal feedback control of Example4.1is given by(34)withκbeing

replaced by(35).

Example4.2 In particular, if one lets the coefficients of (17), (19) and (20)A¯ =B¯=D¯ =a¯=

¯

b=c=f =f¯=g=g¯= 0 on [0,T], then Example4.1is reduced to an LQ optimal control with terminal constraint and complete information. Further, letvL2(0,T;R). Sincevis deterministic, Proposition4.1implies that the optimal feedback control is

0,t= –B –1 t bt

αtEx 0 t +βκ0,t

,

whereα,βκ0andx

0 satisfy ⎧

⎨ ⎩

˙

αt+ 2atαtBt–1b22t +At= 0,

αT=D,

βκ0,t=κ0e T

t (as–B–1s b2sαs)ds,

and ⎧ ⎨ ⎩

dxuκ0 t = (atx

0

tB–1t b2tαtEx 0

tB–1t b2tβκ0,t)dtctdω˜t,

xuκ0 0 =ξ,

(36)

with

κ0=

μ0e T

0(at–B–1t b2tαt)dt T

0 B–1t b2te2 T

t (as–B–1s b2sαs)dsdt ,

respectively.

(13)

Example4.3 One denotes byvthe rate of capital withdrawal or injection of a firm, and by

xvthe cash-balance process on [0,T]. Assume that the liability of the firm is governed by

dL¯vt =btvtdt+ctdωt+c˜tdω˜t,

wherectdωt and˜ctdω˜t describe the liability risk. Assume that the firm owns an initial investmentξ, and only invests in a money account with compounded interest ratea. Then the cash-balance is denoted by

xvt =e0tasds

ξt

0

e–0sardrdL¯v s

,

whose differential form is the same as (6) witha¯=b¯= 0. Due to the discreteness of the account information, the firm partially observes the cash-balance by the corresponding stock price

⎧ ⎨ ⎩

dSv

t =Svt[(ftxvt +gt+12h2t)dt+htdω˜t],

Sv0= 1.

Set

yvt=logSvt.

It follows from Itô’s formula thatyvis governed by (7) with¯f

t=g¯t= 0. The firm hopes to find a suitablevsuch that

J[v] = 1 2E

T

0

v2tdt+xvT–ExvT2

is minimized overUadc with terminal constraint (18). The model implies that the firm wants to minimize the risk ofxvTandvunder a fixed terminal cash-balance level. Since

ExvT–ExvT2=ExvT2–ExvT2,

Example4.3is also a special case of Example4.1. The following result is an immediate one of Proposition4.1.

Corollary 4.1 The optimal rate of capital withdrawal or injection of the firm is

0,t= –bt

Γtxˆ 0 t +Γ¯tExˆ

0 t +Λκ0,t

,

whereΓ,Γ¯,Λandxˆ0 are the solutions to

⎧ ⎨ ⎩

˙

Γt+ 2atΓtb2tΓt2= 0,

ΓT= 1, ⎧ ⎨ ⎩

˙¯

Γt+ 2(atb2tΓt)Γ¯tb2¯t2= 0,

(14)

One remarks that the model in Example4.3is inspired by Huang et al. [18], where the variance ofvdoes not enter the performance functional.

5 Concluding remarks

This paper has studied an optimal control problem driven by SDE of mean-field tpye, where the drift coefficient of observation equation is linear with respect to the state, the control and their expectations, and the observation equation is explicitly dependent on the control. This framework covers many more financial models including [18], but it causes trouble in addressing the control problem. This trouble has been solved by the backward separation method with a decomposition technique. The results obtained here partially improve those of [3–6].

The traditional separation principle method is also applicable to study Problem (TC). However, the traditional method cannot offer a valid way to derive an optimality condition of Problem (TC). The reason is listed below. Let(t,x) be the conditional density ofxv

t, given the observable filtrationFtyv. Similar to Theorem 2.3 in [2], one gets

is anFtyv-adapted andR-valued standard Brownian motion, and hence, the cost functional (9) is rewritten as

(15)

Appendix

One presents three lemmas first, and then one gives a proof of Proposition2.1.

Lemma A1 For any vjL2F(0,T;R),j= 1, 2,there is a constant C> 0such that

Proof The estimate is obtained by Itô’s formula and Hölder’s inequality. The details of the

proof are omitted to save space.

Lemma A2 For any v,vjUad,j= 1, 2, . . . ,

lim

j→+∞[vj] =Jκ[v].

Proof Using Taylor’s expansion, Hölder’s inequality and LemmaA1, one gets

ad, one defines a family of controls by

(16)

whereνU,i,jare natural numbers, 1≤ij– 1, andδj=T/j. Similar to [8], one proves that (i) vjUad for anyj, and (ii)vjvasj→+∞inL2

Fy0(0,T;U). Thus the proof is

complete.

Proof of Proposition2.1 From the definition of decision set, it is easy to see

inf

vUad

v≥ inf

vUad0

[v].

Then one only needs to prove the reverse inequality. Sincevjintroduced in LemmaA3is an element ofUad,

[vj]≥ inf

vUad

v,

and, consequently, it follows from LemmaA2that

[v] = lim

j→+∞[vj]vinf∈Uad

v.

Then the arbitrariness ofvimplies the desired result.

Proof of Proposition2.2 According to the definition of optimal control and value function, the desired conclusion is drawn by using the idea introduced in Chap. 11 of Øksendal [19]. One drops the details of the proof for saving space.

Funding

This work is supported in part by the NSF of China under Grant No. 61603219, and by the Shandong Jiaotong University Climbing Research Innovation Team Program.

Abbreviations

SDE, stochastic differential equation; LQ, linear-quadratic; TC, terminal constraint.

Availability of data and materials

Not applicable.

Competing interests

The author declares that there is no conflict of interests regarding the publication of this paper.

Author’s contributions

The author read and approved the final manuscript.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Received: 26 September 2018 Accepted: 18 February 2019

References

1. Wang, G.C., Wu, Z.: Kalman–Bucy filtering equations of forward and backward stochastic systems and applications to recursive optimal control problems. J. Math. Anal. Appl.342, 1280–1296 (2008)

2. Wang, G.C., Wu, Z., Xiong, J.: An Introduction to Optimal Control of FBSDE with Incomplete Information. Springer, Cham (2018)

3. Wang, G.C., Zhang, C.H., Zhang, W.H.: Stochastic maximum principle for mean-field type optimal control with partial information. IEEE Trans. Autom. Control59, 522–528 (2014)

4. Zhang, H.Y.: A necessary condition for mean-field type stochastic differential equations with correlated state and observation noises. J. Ind. Manag. Optim.12, 1287–1301 (2016)

5. Ma, H.P., Liu, B.: Optimal control problem for risk-sensitive mean-field stochastic delay differential equation with partial information. Asian J. Control19, 2097–2115 (2017)

(17)

7. Wang, G.C., Wu, Z., Xiong, J.: Maximum principles for forward–backward stochastic control systems with correlated state and observation noises. SIAM J. Control Optim.51, 491–524 (2013)

8. Wang, G.C., Wu, Z., Xiong, J.: A linear-quadratic optimal control problem of forward–backward stochastic differential equations with partial information. IEEE Trans. Autom. Control60, 2904–2916 (2015)

9. Wang, G.C., Xiao, H., Xing, G.J.: An optimal control problem for mean-field forward–backward stochastic differential equation with noisy observation. Automatica86, 104–109 (2017)

10. Meyer-Brandis, T., Øksendal, B., Zhou, X.Y.: A mean-field stochastic maximum principle via Malliavin calculus. Stochastics84, 643–666 (2012)

11. Elliott, R., Li, X., Ni, Y.H.: Discrete time mean-field stochastic linear quadratic optimal control problems. Automatica49, 3222–3233 (2013)

12. Yong, J.M.: Linear-quadratic optimal control problems for mean-field stochastic differential equations. SIAM J. Control Optim.51, 2809–2838 (2013)

13. Hafayed, H., Abbas, S.: On near-optimal mean-field stochastic singular controls: necessary and sufficient conditions for near-optimality. J. Optim. Theory Appl.160, 778–808 (2014)

14. Ni, Y.H., Zhang, J.F., Li, X.: Indefinite mean-field stochastic linear-quadratic optimal control. IEEE Trans. Autom. Control 60, 1786–1800 (2015)

15. Hafayed, M., Abba, A., Abbas, S.: On partial-information optimal singular control problem for mean-field stochastic differential equations driven by teugels martingales measures. Int. J. Control89, 397–410 (2016)

16. Xiong, J.: An Introduction to Stochastic Filtering Theory. Oxford University Press, London (2008)

17. Peng, S.G., Wu, Z.: Fully coupled forward–backward stochastic differential equations and applications to optimal control. SIAM J. Control Optim.37, 825–843 (1999)

18. Huang, J.H., Wang, G.C., Wu, Z.: Optimal premium policy of an insurance firm: full and partial information. Insur. Math. Econ.47, 208–215 (2010)

References

Related documents

The expectation from mechanical engineering Lecturers in Bahrain is that they should be capable to adopt teaching methods which combine traditional teaching with simulation in

and allocate the outcome of that risk (good or bad), ex post , to the responsible agent (Fried 2003). Thus we can ask, in respect of some impending legal transition, did

However, there is evidence that MNEs can influence the local training and vocational systems (TVET) in the host economies. In emerging economies, TVET systems may play a

Synopsis: Radical, violent Islam or „Militant Jihad‟ derives its authority from a particular set of interpretations of Islamic texts. An understanding of the essence of

I am also able to examine the transition of workers within the private covered sector from full-time to part-time employment (working less than 40 hours a week).

The stability (in terms of molar mass) of chitosan potentially plays an important role in its behaviour and functional properties in a wide range of applications and therefore

Importantly, we show that this policy parameter is highly dependent on the reporting en- vironment, partly because the existence of thresholds generates strong bunching responses

The input parameters for a typical FEA simulation are; geometry of the workpiece, properties of the workpiece material, grinding forces, process parameters, heat