R E S E A R C H
Open Access
Non-smooth analysis method in optimal
investment-BSDE approach
Helin Wu
1, Yong Ren
2and Feng Hu
3**Correspondence: [email protected]
3School of Statistics, Qufu Normal University, Qufu, China Full list of author information is available at the end of the article
Abstract
In this paper, the investment process is modeled by backward stochastic differential equation. We investigate a necessary condition for optimal investment problem by the method of non-smooth analysis. Furthermore, some applications of our result are given.
MSC: 60H10; 93E20; 49J52
Keywords: Backward stochastic differential equation (BSDE); Non-smooth analysis; Optimal investment
1 Introduction
Single-period mean-variance portfolio selection was first introduced and studied by Markowitz [14]. Markowitz’s pioneer work [14] has laid down the foundation for modern financial portfolio theory. In 2000, Li and Ng [11] extended the Markowitz model to the dynamic setting. From then on, various continuous time Markowitz models were studied in much of the literature (see, e.g., [1,12,13,21]).
Generally, there are two approaches, i.e. the forward (primal) approach and the back-ward (dual) approach, employed to solve the above problem in the continuous time case. The first approach is inspired by indefinite stochastic linear quadratic control (see, e.g., [3,20]) and builds the relationship between the mean-variance problem and a family of indefinite stochastic linear quadratic optimal control problems (see, e.g., [12,13,21]). The second approach is first studied by Bielecki et al. in [1], which is the generalization of the well-known risk neutral computational approach in the discrete time case (see, e.g., [9,17, 18]). The backward approach includes two steps: the first step is to compute the optimal terminal wealth; the second step is to compute the replicating portfolio strategy corre-sponding to the obtained optimal terminal wealth. As shown in [1], the optimal terminal wealth is first obtained using the Lagrange multiplier method and then the optimal repli-cating portfolio strategy is obtained by solving a backward stochastic differential equation (BSDE for short). Along this line, in 2008, Ji and Peng [10] used Ekeland’s variational prin-ciple to obtain a necessary condition for portfolio optimization problem by the backward approach. In this paper, we study optimal investment problem by the method of non-smooth analysis, which makes more general portfolio optimization problem be inside the scopes of our consideration.
Now the framework of optimal investment problem that we study are introduced. Let (Ω,F,P) be a probability space and (Wt)t≥0be ad-dimensional standard Brownian motion
with respect to filtration (Ft)t≥0generated by Brownian motion and allP-null subsets, i.e.,
Ft=σ{Ws;s≤t} ∨N,
whereN is the set of allP-null subsets. Fix a real numberT> 0. Denote F := (Ft)t∈[0,T]
Here, letL2(F
T) denote the space ofFT-measurable random variablesξ:Ω→R
satis-fyingE[ξ2] <∞;H2
T(Rn) denote the space of F-predictable processesϕ:Ω×[0,T]→Rn
satisfyingE[0T|ϕt|2dt] <∞andS2T(R) denote the space of F-progressively measurable
RCLL processesϕ:Ω×[0,T]→RsatisfyingE[sup0≤t≤T|ϕt|2] <∞.
Given an initial capitalx, the investment process can be modeled by the following one-dimensional BSDE on [0,T]:
yt=ξ+
T
t
g(s,ys,zs) ds–
T
t
zs·dWs (1)
with y0≤x, whereξ ∈L2(FT) andg(ω,t,y,z) :Ω×[0,T]×R×Rd→Ris a function
uniformly satisfying the Lipschitz condition, i.e., there exists a positive constantMsuch that for all (y1,z1), (y2,z2)∈R×Rd
g(ω,t,y1,z1) –g(ω,t,y2,z2)≤M
|y1–y2|+|z1–z2|
(A.1)
and
g(·, 0, 0)∈HT2(R). (A.2)
By Pardoux and Peng [15], we know that: Suppose thatgsatisfies (A.1) and (A.2). Then, for anyξ∈L2(F
T), BSDE (1) has a unique pair of adapted processes (yt,zt)∈ST2(R)×HT2(Rd).
Usually, we call{yt}a wealth process and{zt}a portfolio process.
Assume that the investor has initial wealthx, he invests it in the financial market ac-cording to BSDE (1). DenoteE0,gT(ξ) :=y0. Then his investment strategy is determined by
all available terminal values for him, i.e.,
A(x) :=ξ∈L2(FT)|E0,gT(ξ)≤x
.
In this paper, we study the following problem: if there existsξ∗∈A(x), which is the optimal objective of problem
min
ξ∈A(x)ρ(ξ), (2)
what is the necessary condition forξ∗? Here,ρis a function defined onL2(F
T) and
rep-resents a risk measure. We also assume thatρsatisfies the Lipschitz condition.
2 Some results about non-smooth analysis
In this section, we give some results about non-smooth analysis that are used in this paper. The following definitions, lemmas and propositions can be found in [4,5].
Suppose thatXis a Banach space, andX∗is its dual space. Obviously,L2(F
T) is a Banach
space with the normξ:= (E[ξ2])12 for anyξ∈L2(FT), andL2(FT) is also its dual space.
Definition 1 Suppose that f is a function defined on X, Given an x ∈ X, if
lim supy∈X,y→x,t↓0f(y+tv)–f(y)
t ∈Rfor anyv∈X, denote
fo(x;v) := lim sup y∈X,y→x,t↓0
f(y+tv) –f(y)
t . (3)
We callfo(x;v) the generalized directional derivative off atx.
Remark1 Suppose that a functionf :X→Rsatisfies Lipschitz condition, i.e., for anyx1,
x2∈X,
f(x1) –f(x2)≤Mx1–x2
holds for someM> 0, depending onf. By Definition1, we have:
(i) Obviously,fo(x;v) is sub-linear onX, then, by the Hahn–Banach theorem, the
gener-alized directional derivative set off atx
∂of(x) :=ζ∈X∗|ζ(v)≤fo(x;v),∀v∈X (4)
is nonempty and weak star compact inX∗.
(ii) Iff(x) attains minimum value at some pointx0∈X, i.e.,
f(x)≥f(x0), ∀x∈X,
the Fermat condition
0∈∂of(x0)
holds.
Given a setC⊂X, the distance functiondC:X→Ris defined as
dC(x) :=inf y∈Cy–x,
for a givenx∈X.
Lemma 1(Exact penalization) Suppose that f is a Lipschitz function with coefficient M defined on X,x∗∈C⊂X and f takes its minimum value at x∗on C.Then,for anyMˆ ≥M,
g(x) :=f(x) +MdCˆ (x)attains minimum value at x0on X.On the contrary,ifMˆ >M and C
Remark2 From Lemma1, we know that, supposing thatf is a Lipschitz function with coefficientMdefined onXandCis a closed subset ofXand lettingg(y) :=f(y) +MdCˆ (y), for anyy∈X, whereMˆ >M, then, if there existsx∗∈Csuch that
fx∗=min
x∈Cf(x), (5)
then we have
gx∗=min x∈Xg(x) =f
x∗. (6)
Definition 2 Assumex∈C. Givenv∈X, ifdoC(x;v) = 0, thenvis said to be a contingent derivative atx. Denote
TC(x) :=
v∈X|doC(x;v) = 0.
TC(x) is the set of contingent derivatives atx. By polarity, the normal derivative set is defined as
NC(x) :=ζ∈X∗|ζ(v)≤0,∀v∈TC(x).
Proposition 1 Suppose that x∈C,then we have
NC(x) =cl
λ≥0
λ∂odC(x)
,
where cl means the weak star closure.
Proposition 2 Given an x∈X,suppose that h is Lipschitz in a neighborhood of x and
0 /∈∂oh(x).Let
C:=y∈X|h(y)≤h(x). (7)
Then we have
NC(x)⊂
λ≥0
λ∂oh(x). (8)
Remark3 Given anx∈X, suppose thathis Lipschitz in a neighborhood of xand 0 /∈
∂oh(x). LetC:={y∈X|h(y)≤h(x)}. We also assume thatf is a Lipschitz function with coefficient Mdefined onX, andf takes its minimum value atxonC. By Definition1, Remark2, Definition2and Proposition2, then the Fermat condition
0∈∂of(x) +NC(x) (9)
holds, i.e., there exist a non-negative constantλ,ζ ∈∂of(x) andη∈∂oh(x) such that
ζ+λη= 0
Definition 3 A functionf defined onXis called strictly differentiable atx∈X, if there existsx∗∈X∗such that
lim y∈X,y→x,t→0+
f(y+tv) –f(y)
t =
x∗,v, (10)
for anyv∈X.
Lemma 2 A function f defined on X is strictly differentiable at x∈X,then∂of(x) ={x∗}, where x∗can be obtained by(10).
Lemma 3 Suppose that{hi,i= 1, 2, . . . ,n}is a set of Lipschitz functions on X.Define
h(x) :=maxhi(x)|i= 1, 2, . . . ,n, ∀x∈X.
For each x∈X,let I(x)be the subset of{1, 2, . . . ,n}satisfying that hi(x) =h(x),i∈I(x).Then we have
∂oh(x)⊂co∂ohi(x)|i∈I(x)
, (11)
where coA is the convex hull of A.
3 Maximum principle for optimal investment problem
In this section, our aim is to derive a necessary condition for optimal problem (2). Just as usual, in order to use the method of non-smooth analysis, we need the following assump-tion:
the risk measureρis Lipschitz. (A.3)
The following proposition shows thatE0,gT is Lipschitz inL2(FT). For more details, see
[2,22,23].
Proposition 3 Suppose that g satisfies(A.1)and(A.2),ξi∈L2(FT),i= 1, 2,and(yit,zit)are the solutions of BSDE(1)with terminal valuesξi,respectively,then there exists a constant C> 0such that
E
sup t∈[0,T]
y2t –y1t2
+E
T
0
z2t –z1t2dt
≤CE|ξ2–ξ1|2
. (12)
Remark4 From Proposition3, we know that, supposinggsatisfies (A.1) and (A.2), then, for anyξ1,ξ2∈L2(FT),
Eg
0,T(ξ1) –E0,gT(ξ2)≤C
1
2ξ1–ξ2
holds.
Theorem 1 Under the conditions(A.1)–(A.3),ifξ∗is an optimal objective of problem(2),
then there exist a non-negative constantλ,ζ∈∂oρ(ξ∗)andη∈∂oEg
0,T(ξ∗)such that
ζ+λη= 0 (13)
holds.
In order to prove Theorem1, we need the following lemma.
Lemma 4 Suppose that g satisfies(A.1)and(A.2).Then,for anyξ∈L2(F
T),we have0 /∈ ∂oE0,gT(ξ).
Proof For notational simplicity, we only consider the cased= 1. We assume that there existsξ∗∈L2(F
T) such that 0∈∂oE0,gT(ξ∗). For anyξ∈L2(FT), leth(ξ) :=E0,gT(ξ), then hoξ∗;η≥0
holds, for any η∈L2(FT). For eachξ ∈L2(FT), suppose that (yt,zt) and (yrt,zrt) are the
solutions of the following BSDEs on [0,T]:
Proof of Theorem1 By Lemma4, we have 0 /∈∂oEg
0,T(ξ), for anyξ ∈L2(FT). Hence by
Definition1, Remark2, Definition2and Proposition1, we can see that, ifξ∗is an optimal objective of problem (2), then the Fermat condition
0∈∂oρξ∗+NA(x)
ξ∗ (14)
holds.
Case1:E0,gT(ξ∗) =x. In this case,A(x) ={ξ∈L2(FT)|E0,gT(ξ)≤E g
0,T(ξ∗)}. Thus, by
Propo-sition2, there exist a non-negative constantλ,ζ∈∂oρ(ξ∗) andη∈∂oE0,gT(ξ∗) such that
ζ+λη= 0 (15)
holds.
Case2:E0,gT(ξ∗) =x0<x. In this case, denote
C:=ξ∈L2(FT)|E0,gT(ξ)≤x0
.
Sinceξ∗is an optimal objective of problem (2),ξ∗is also an optimal objective of problem
min ξ∈C
ρ(ξ). (16)
Thus, by Definition1, Remark2, Definition2and Proposition2again, Equation (15) also holds. The proof of Theorem1is complete.
From Remark4, we know thatE0,gT is Lipschitz inL2(FT). By Definition1, we can see
that∂oE0,gT(ξ) is not empty for anyξ∈L2(FT). Indeed, we have following result. Lemma 5 Suppose thatξ∈L2(F
T)and g satisfies(A.1)and(A.2).Let(yt,zt)be the solu-tion of BSDE(1)with terminal valueξ,then for anyη∈∂oEg
0,T(ξ)andζ∈L2(FT),there exists(ϕt,ψt)∈∂og(t,yt,zt)such that
η,ζ=y˜0
holds,where(˜yt,zt˜)is the solution of the following BSDE on[0,T]:
˜
yt=ζ+
T
t
(ϕs˜ys+ψs˜zs) ds–
T
t
˜
zs·dWs. (17)
Proof For any givenω∈Ωandt∈[0,T], letgˆ(t,yˆ,zˆ) =go(t,yt,zt;yˆ,zˆ),∀(ˆy,zˆ)∈R×Rd. By
Definition1, we can see thatgˆ(t,yˆ,zˆ) is Lipschitz, positively homogeneous and convex in (ˆy,zˆ), and hence
ˆ
g(t,yˆ,ˆz) = max
(ϕt,ψt)∈∂og(t,yt,zt)
(ϕt,ψt), (ˆy,zˆ)
. (18)
For anyζ ∈L2(F
T), we consider the following BSDE on [0,T]:
ˆ
yt=ζ+
T
t
ˆ
g(s,ˆys,zsˆ) ds–
T
t
ˆ
Letfˆ(ζ) :=yˆ0. Denote
D:=η∈L2(FT)|η,ζ=y˜0,∀(ϕt,ψt)∈∂og(t,yt,zt)
, (20)
where (˜yt,zt˜) is the solution of the following BSDE on [0,T]:
˜
yt=ζ+
T
t
(ϕs˜ys+ψs˜zs) ds–
T
t
˜
zs·dWs.
By the comparison theorem of BSDE (for example, see El Karoui et al. [7]), we have ˆ
f(ζ)≥ η,ζ, for any η∈D. Sincegˆ is positively homogeneous and convex in (ˆy,zˆ), by Propositions 3.3 and 3.4 in Peng [16], we know thatfˆis positively homogeneous and con-vex inL2(FT), and hence
ˆ
f(ζ) =max
η∈Dη,ζ. (21)
By Definition1, we can obtain that
Eg
0,T
o
(ξ;ζ) =fˆ(ζ) =max η∈Dη,ζ.
This means that∂oE0,gT(ξ) =D. The proof of Lemma5is complete. In Ji and Peng [10], the authors assume thatgis continuously differentiable in (y,z). In this special case, we can get an explicit form of∂oE0,gT.
Lemma 6 Suppose thatξ∈L2(FT)and g satisfies(A.1)and(A.2).Let(yt,zt)be the solu-tion of BSDE(1)with terminal valueξ.If g is continuously differentiable in(y,z)∈R×Rd, then we have
∂oE0,gT(ξ) ={qT}
and for anyη∈L2(F
T),
qT,η=˜y0, (22)
where(˜yt,˜zt)is the solution of the following BSDE on[0,T]:
˜
yt=η+
T
t
gy(s,ys,zs)˜ys+gz(s,ys,zs)˜zsds–
T
t
˜
zs·dWs.
Proof For any givenω∈Ω andt∈[0,T], letg˜(t,y˜,z˜) =go(t,yt,zt;˜y,z˜),∀(˜y,z˜)∈R×Rd.
Sincegsatisfies (A.1), (A.2), and is continuously differentiable in (y,z)∈R×Rd, we know thatgis strictly differentiable in (y,z)∈R×Rdand
˜
holds. For anyη∈L2(F
T), let (˜yt,zt˜) be the solution of the following BSDE on [0,T]:
˜
yt=η+
T
t
˜
g(s,ys˜,˜zs) ds–
T
t
˜
zs·dWs.
DenoteE0,g˜T(η) :=y˜0. By Definition1, we obtain
Eg
0,T
o
(ξ;η) =E0,g˜T(η). (23) Sinceg˜ is linear in (˜y,z˜), by Propositions 3.3 and 3.4 in Peng [16], we know thatE0,g˜T is linear inL2(F
T). By Proposition3, we can see thatE˜ g
0,Tis continuous inL2(FT). Thus, by
the Riesz representation theorem, there existsqT∈L2(FT) such that
Eg˜
0,T(η) =qT,η (24)
holds for any η∈L2(F
T). From (23) and (24), we have ∂oE0,gT(ξ) ={qT}. The proof of
Lemma6is complete.
By Theorem1and Lemma6, we immediately obtain the following theorem.
Theorem 2 Suppose thatρsatisfies(A.3),and g satisfies(A.1), (A.2),and is continuously differentiable in(y,z).Assume thatξ∗is an optimal objective of problem(2),let(y∗t,z∗t)be the solution of BSDE(1)with terminal valueξ∗.Then there exist a non-negative constant λandζ∈∂oρ(ξ∗)such that
ζ+λq∗T= 0
holds,where q∗Tcan be obtained by(22).
4 Some examples
In this section, we give three examples for our obtained result. In the sequel, suppose that
gsatisfies (A.1), (A.2), and is continuously differentiable in (y,z).
ProblemA. Minimize
Eϕ(ξ)–c
subject to
⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩
E[u(ξ)] =c,
Eg
0,T(ξ) =x,
Eg
0,T(ξ)≥0, ξ∈L2(FT),
whereϕ:R→Ris Lipschitz and continuously differentiable, andu:R→Rhas bounded continuous derivativeu.
ProblemB. Minimize
Eϕ(ξ)–c
subject to
⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩
E[u(ξ)]≤c,
Eg
0,T(ξ)≤x,
Eg
0,T(ξ)≥0, ξ∈L2(F
T),
whereϕ:R→Ris Lipschitz and continuously differentiable, andu:R→Rhas bounded continuous derivativeu.
In order to study the optimization problems A and B, we need the following lemma.
Lemma 7 If u:R→R has bounded continuous derivative u,then the function ρ(ξ) =
E[u(ξ)]for anyξ∈L2(F
T)is strictly differentiable,and∂oρ(ξ) ={u(ξ)}.
By the Fubini theorem and the dominated convergence theorem, we can easily prove Lemma7. So we omit it.
Theorem 3 If the optimal objectiveξ∗of ProblemAexists,there exist two constantsλ1
andλ2such that
ϕξ∗+λ1u
ξ∗+λ2q∗T= 0
holds,where∂oE0,gT(ξ∗) ={q∗T}.
Proof Denoteρ(ξ) :=E[ϕ(ξ)] –c, for anyξ∈L2(F
T) and
U:=ξ∈L2(FT)|h(ξ)≤0
,
whereh(ξ) :=max{E[u(ξ)] –c,c–E[u(ξ)],E0,gT(ξ) –x,x–E0,gT(ξ), –E0,gT(ξ)}. If the optimal
objectiveξ∗of Problem A exists, then, by Lemmas6and7, we have∂oρ(ξ∗) ={ϕ(ξ∗)},
∂oE[u(ξ∗)] ={u(ξ∗)}and∂oE0,gT(ξ∗) ={q∗T}.
By Theorem2and Lemma3, there exist two constantsλ1andλ2such that
ϕξ∗+λ1u
ξ∗+λ2q∗T= 0
Theorem 4 If the optimal objectiveξ∗of ProblemBexists,then there exist three constants
In the classic investment problem, one often takes the variance as a risk measure and the mean-variance method is used in much of the literature. But by Delbaen [6], and Föllmer and Schied [8], such a kind of risk measure is not perfect. We often takeρ(·) as a coherent or convex risk measure in (2). In Rosazza Gianin [19], the author proved that ifgis a sub-additive and positively homogeneous function satisfying (A.1) and (A.2), defineρ(ξ) :=
Eg
ProblemC. Supposef is sub-additive and homogeneous function satisfying (A.1) and
Theorem 5 If the optimal objectiveξ∗of ProblemCexists,then there exist three constants
λ≥0and a,β∈[0, 1]satisfyingα+β≤1,such that
The proof of Theorem5is very similar to that of Theorem4. So we omit the proof.
Acknowledgements
The authors would like to thank the anonymous referees for their constructive suggestions and valuable comments.
Funding
The work of Helin Wu is supported by the Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJ1400922). The work of Yong Ren is supported by the National Natural Science Foundation of China (11871076). The work of Feng Hu is supported by the National Natural Science Foundation of China (11801307) and the Natural Science Foundation of Shandong Province (ZR2016JL002 and ZR2017MA012).
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
The authors contributed equally and significantly in writing this article. All authors read and approved the final manuscript.
Author details
1School of Mathematics and Statistics, Chongqing University of Technology, Chongqing, China.2Department of
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Received: 5 June 2018 Accepted: 4 December 2018
References
1. Bielecki, T.R., Jin, H.Q., Pliska, S.R., Zhou, X.: Continuous time mean-variance portfolio selection with bankruptcy prohibition. Math. Finance15, 213–244 (2005)
2. Briand, P., Coquet, F., Hu, Y., Mémin, J., Peng, S.: A converse comparison theorem for BSDEs and related properties of g-expectation. Electron. Commun. Probab.5, 101–117 (2000)
3. Chen, S., Li, X., Zhou, X.: Stochastic linear quadratic regulators with indefinite control weight costs. SIAM J. Control Optim.36, 1685–1702 (1998)
4. Clarke, F.H.: Optimization and Nonsmooth Analysis, 2nd edn. Classics in Applied Mathematics, vol. 5. SIAM, Philadelphia (1990)
5. Clarke, F.H., Ledyaev, Y.S., Stern, R.J., Wolenski, P.R.: Nonsmooth Analysis and Control Theory. Graduate Texts in Mathematics, vol. 178. Springer, New York (1998)
6. Delbaen, F.: Coherent risk measures on general probability space. In: Sandmann, K., Schonbucher, P.J. (eds.) Advance in Finance and Stochastics, pp. 1–37. Springer, Berlin (2002)
7. El Karoui, N., Peng, S., Quenez, M.C.: Backward stochastic differential equations in finance. Math. Finance7, 1–71 (1997)
8. Föllmer, H., Schied, A.: Stochastic Finance: An Introduction in Discrete Time. De Gruyter, Berlin (2004) 9. Harrison, J.M., Kreps, D.: Martingales and multiperiod securities market. J. Econ. Theory20, 381–408 (1979)
10. Ji, S., Peng, S.: Terminal perturbation method for the backward approach to continuous time mean-variance portfolio selection. Stoch. Process. Appl.118, 952–967 (2008)
11. Li, D., Ng, W.L.: Optimal dynamic portfolio selection: multiperiod mean variance formulation. Math. Finance10, 387–406 (2000)
12. Li, X., Zhou, X., Lim, A.E.B.: Dynamic mean variance portfolio selection with no shorting constraints. SIAM J. Control Optim.40, 1540–1555 (2001)
13. Lim, A.E.B., Zhou, X.: Mean variance portfolio selection with random parameters. Math. Oper. Res.27, 101–120 (2002) 14. Markowitz, H.: Portfolio selection. J. Finance7, 77–91 (1952)
15. Pardoux, E., Peng, S.: Adapted solution of a backward stochastic differential equation. Syst. Control Lett.14, 55–61 (1990)
16. Peng, S.: Modelling derivatives pricing with their generating functions (2006)arXiv:math/0605599[math.PR] 17. Pliska, S.R.: A discrete time stochastic decision model. In: Fleming, W.H., Gorostiza, L.G. (eds.) Advances in Filtering and
Optimal Stochastic Control, in. Lecture Notes in Control and Information Sciences, vol. 42, pp. 290–304. Springer, New York (1982)
18. Pliska, S.R.: Introduction to Mathematical Finance. Blackwell, Oxford (1997)
19. Rosazza Gianin, E.: Risk measures viag-expectations. Insur. Math. Econ.39, 19–34 (2006)
20. Yong, J., Zhou, X.: Stochastic Controls—Hamiltonian Systems and HJB Equations. Springer, New York (1999) 21. Zhou, X., Li, D.: Continuous time mean variance portfolio selection: a stochastic LQ framework. Appl. Math. Optim.42,
19–33 (2000)
22. Zong, Z.: A comonotonic theorem for backward stochastic differential equations inLpand its applications. Ukr. Math. J.64, 857–874 (2012)