• No results found

arxiv: v1 [math.ap] 3 Sep 2021

N/A
N/A
Protected

Academic year: 2021

Share "arxiv: v1 [math.ap] 3 Sep 2021"

Copied!
25
0
0

Loading.... (view fulltext now)

Full text

(1)

DIOGO GOMES, JULIAN GUTIERREZ, AND RICARDO RIBEIRO

Abstract. We consider a market where a finite number of players trade an asset whose supply is a stochastic process. The price formation problem consists of finding a price process that ensures that when agents act optimally to minimize their trading costs, the market clears, and supply meets demand. This problem arises in market economies, in- cluding electricity generation from renewable sources in smart grids. Our model includes noise in the supply side, which is counterbalanced in the consumption side by storing energy or reducing the demand according to a dynamic price process. By solving a con- strained minimization problem, we prove that the Lagrange multiplier corresponding to the market-clearing condition defines the solution of the price formation problem. For the linear-quadratic structure, we characterize the price process using optimal control techniques, and we include two numerical approaches for the price computation.

1. introduction

Mean-field game theory (MFG) is an approach to study the evolution of a population of competitive rational players. Each player solves an optimal control problem that depends on statistical features of the population rather than one-to-one interactions. The statistical features inform the objective of each agent, determining their dynamics. Adopting a MFG approach, the authors in [17] addressed a deterministic price formation model with a market- clearing condition in which the objectives of a continuum of agents are coupled to the price.

In this paper, we study a price formation model where N agents interact in a market via the price, $, of the commodity they trade and whose supply is random. The agents meet a balance condition that guarantees the supply, Q, of the commodity equals its demand. The novelty of our model consists of considering a random supply, such as electricity generation from sustainable sources.

The randomness in price formation has potential applications in renewable energy pro- duction on smart grids. Small devices in the grid can to store energy that can be sold back to the grid. Changes in weather conditions and network load cause fluctuations on the available supply. Because the agents can sell the surplus of power, they can benefit from load-adaptive pricing ([18], [1]).

The works [3] and [22] addressed price formation when the producer’s revenue is opti- mized. They proposed a Stackelberg game with a linear dependence on the price and a reverse Stackelberg game with non-linear dependence, respectively. In recent works, [13]

presents a linear-quadratic model for intraday electricity markets. In [10], the authors present a Cournot model where the common noise consists of a Brownian motion and a jump process. The contemporary works [23] and [14] study price formation with a market- clearing condition. The former postulates a model for Solar Renewable Energy Certificate Markets. The supply of the energy being priced is controlled. Their approach uses a forward- backward system and variational techniques to formulate a fixed-point problem to prove the existence of a mean-field distribution, from which they obtain the price. The latter assumes a linear-quadratic structure, and the existence of the price comes from the solvability of a

Date: September 6, 2021.

Key words and phrases. Mean Field Games; Price formation; Common noise, Lagrange multiplier.

King Abdullah University of Science and Technology (KAUST), CEMSE Division, Thuwal 23955-6900.

Saudi Arabia. e-mail: [email protected].

King Abdullah University of Science and Technology (KAUST), CEMSE Division, Thuwal 23955-6900.

Saudi Arabia. e-mail: [email protected].

King Abdullah University of Science and Technology (KAUST), CEMSE Division, Thuwal 23955-6900.

Saudi Arabia. e-mail: [email protected].

The authors were partially supported by KAUST baseline funds and KAUST OSR-CRG2017-3452.

1

arXiv:2109.01478v1 [math.AP] 3 Sep 2021

(2)

coupled forward-backward system of the McKean-Vlasov type. More recently, [15] studied a further extension that considers a Major player in the market. Our contribution differs from the previous works since our existence results rely on the calculus of variations approach, whereas we derive forward-backward systems as necessary conditions for such existence.

The case of a finite number of players with a deterministic supply was addressed in [2], were only existence and uniqueness was proved, and no numerical approximation scheme was considered. The model we present here generalizes the deterministic supply case. In [16], we addressed the stochastic supply case from the optimal control perspective, and we provided numerical results for a quadratic Lagrangian depending on the trading rate only. Here, we improve those results by proving existence and uniqueness of solutions in the stochastic framework using the variational approach, and we elaborate on the numerical approximation of those solutions.

Next, we introduce our model. Let T > 0 be the time horizon. In the following, we fix a complete filtered probability space (Ω, F , F, P); that is, F = (Ft)06t6T is the standard filtration generated by t 7→ Wt, a Brownian motion in R (see [21], Definition 3.1.3, and [11], Section 2, for additional details). Here, W plays the role of the common noise in the sense that the supply follows the stochastic differential equation (SDE)

dQt= bS(Qt, t)dt + σS(Qt, t)dWt. (1.1) In our model, the agent’s interaction determines the market equilibrium price of the commodity. All of this commodity produced is consumed entirely. Let N be the number of agents and let the state variable Xti account for the quantity of the commodity held by agent i at time t. Each agent controls their trading rate according to

dXt= vtdt, t ∈ [0, T ], (1.2)

where v : [0, T ] × Ω → R, the control variable, is progressively measurable with respect to F. The optimization problem we consider reads:

Problem 1. Let N be the number of agents. Let the supply, Q, be a stochastic process adapted to F solving (1.1). Let L ∈ C1(R2; R) be a non-negative Lagrangian, and Ψ ∈ C1(R) be a non-negative terminal cost. Assume that at time t = 0, each agent i owns a quantity xi0∈ R of the commodity.

Find a price process $ and control processes vi, all adapted to F, such that for each i, with 1 6 i 6 N , Xi solves (1.2) with the initial condition X0i = xi0, and minimizes the cost functional

E

"

Z T 0

L(Xti, vit) + $tvti dt + Ψ(XTi)

#

, (1.3)

subject to the balance condition 1 N

N

X

i=1

vti= Qt, for 0 6 t 6 T. (1.4)

The functional (1.3) represents the expected cost for each agent on [0, T ]. This cost consists of three parts: the trading at the current price through the linear term $v, the charges related to storage or market impact encoded in L, and the terminal cost; the terminal cost reflects the preferences of players at the terminal time. The balance condition (1.4) guarantees demand consumes all supply. For this problem, we obtain the following result:

Theorem 1.1. Let1Q ∈ HF and suppose that Assumptions 1-5 hold. Then, there exists control processes v∗i, for 1 6 i 6 N , and a price process $ that solve Problem 1. Further- more, under Assumption 6, the price $ and the control processes v∗i, for 1 6 i 6 N , solving Problem 1, are unique.

1See Section 2 for notation and assumptions.

(3)

We prove this theorem in Section 4, where we formulate a problem independent of the price, but the constraint imposed by the balance condition is still present. Using a forward- backward characterization of optimizers, we obtain the price as the Lagrange multiplier corresponding to the constrained problem.

The outline of the paper is as follows: In Section 2, we introduce the main assumptions for the model as well as the notation for the function spaces. We close by recalling an existence result for backward stochastic differential equations. In Section 3, we study the optimization problem that each agent solves when the price is known, and we obtain a forward-backward characterization of the optimal response. Using the single agent result, we prove the existence of a solution to the N agent price formation problem, Problem 1, in Section 4. We specialize our results for a linear-quadratic structure of the model in Section 5. Using optimal control techniques and an extended-state space approach, we obtained semi-explicit expressions for the price. The computation of the price is discussed in Section 6, where we present numerical results for the model of Section 5.

2. Assumptions, notation and previous results

We consider natural assumptions in the context of the calculus of variations (see [9]).

The following conditions are used to prove the existence of minimizers of (3.2), (4.2), and (4.4). In the following, we suppose the Lagrangian L is non-negative.

Assumption 1. The Lagrangian L ∈ C1(R2; R+∪{0}) is convex in (x, v); that is, (x, v) 7→

L(x, v) is convex.

Assumption 2. The terminal cost Ψ ∈ C1(R; R+∪ {0}) is convex.

Because we consider integrals w.r.t. measure spaces, we require compositions of processes with functions to remain in the same class where the process is taken. The following growth conditions guarantee this.

Assumption 3. Ψ ∈ C1(R; R+∪ {0}) satisfies, for some C > 0, Ψ(x) 6 C(1 + |x|2), for all x ∈ R.

Moreover, its derivative, which we denote by Ψ0, satisfies, for some ˜C > 0,

0(x)| 6 ˜C(1 + |x|).

Assumption 4. L ∈ C1(R2; R+∪ {0}), and there exists ˜β, C > 0 such that L(x, v) 6 ˜β(1 + |v|2), for all x ∈ R,

|Lx(x, v)|, |Lv(x, v)| 6 C(1 + |v|), for all (x, v) ∈ R2.

In convex optimization, a natural assumption to obtain the existence of minimizers is the coercivity condition.

Assumption 5. (Coercivity) For some α > 0 and β > 0 α|v|2− β 6 L(x, v), for all x, v ∈ R.

To guarantee the uniqueness of minimizers, we consider next a strong form of convexity.

In turn, this assumption implies the coercivity condition ([4], Corollary 11.17).

Assumption 6. (Uniform convexity) For some θ > 0 v 7→ L(x, v) −θ

2|v|2 is convex for all x ∈ R.

We introduce the Hamiltonian, H, the Legendre transform of L, by H(x, p) = sup

v∈R

{−pv − L(x, v)} . (2.1)

(4)

Recall that when the map v 7→ L(x, v) is convex, H(x, p) is well defined. Furthermore, if v 7→ L(x, v) is strictly convex, L ∈ C2(R2; R), and Assumption 5 holds, there exists a unique value v where the supremum is attained. In addition,

v= −Hp(x, p) if and only if p = −L(x, v), and hence H(x, p) = −pv− L(x, v). (2.2) See [7], Theorem A. 2.5, for the proof of the previous results. For the Hamiltonian, we additionally require no more than linear growth of the gradient in the p component, as we state next.

Assumption 7. The Hamiltonian H satisfies, for some C > 0,

|Hp(x, p)| 6 C(1 + |p|), for all (x, p) ∈ R2.

Now, we set up the notation. Define the space HFas the set of processes v : [0, T ]×Ω → R, that are measurable and adapted w.r.t. F, and satisfy kvk2HF< ∞, where

hv, wiHF:= E

"

Z T 0

vtwtdt

#

, kvk2H

F:= hv, viHF.

This expectation is w.r.t. the measure induced by the Brownian motion. HF is a Hilbert space ([8], Remark 2.2.). Given v ∈ HF, the solution to (1.2) with the initial condition x0∈ R is

Xt= x0+ Z t

0

vsds.

Notice that X ∈ HFbecause kXk2H

F6 2T |x0|2+ 2T2kvk2H

F. For our purposes, we consider trajectories with initial condition x0∈ R.

For N ∈ N, we define HNF, where v = (v1, . . . , vN) ∈ HNF provided vi∈ HF, and hv, wiHN

F :=

N

X

i=1

hvi, wiiHF, kvk2HN F :=

N

X

i=1

kvik2H

F.

In this context, we recall the following result from [11] (Theorem 2.1) for the existence and uniqueness of solutions to Backward Stochastic Differential Equations (BSDE). The differential is in the sense of Itˆo.

Theorem 2.1. (Pardoux-Peng) Let f : Ω × [0, T ] × R2 → R be a Borel measurable map adapted to F. Suppose that it is uniformly Lipschitz; that is, there exists C > 0 such that

|f (ω, t, y1, z1) − f (ω, t, y2, z2)| 6 C(|y1− y2| + |z1− z2|) for all (y1, z1), (y2, z2) ∈ R2 for dP × dt-almost every (ω, t) ∈ Ω × [0, T ]. Let ξ ∈ L2T(R); that is, ξ : Ω → R is FT- measurable and satisfies E|ξ|2 < +∞. Then, the following BSDE

(dYt= f (t, Yt, Zt)dt + ZtdWt YT = ξ

has a unique solution (Y, Z) ∈ HF× HF, on [0, T ], in the following sense:

Yt= ξ + Z T

t

f (s, Ys, Zs)ds + Z T

t

ZsdWs for all 0 6 t 6 T,

where Y is a continuous R-valued process adapted to F, Z is a R-valued predictable process satisfyingRT

0 |Zt|2dt < +∞, P-almost every ω ∈ Ω.

For an SDE of the form

(dYt= b(t, Yt)dt + σ(t, Yt)dWt

Y0= ξ,

we refer the reader to [21], Section 5.2, where the growth and Lipschitz conditions

|b(t, x)| + |σ(t, x)| 6 C(1 + |x|), for all x ∈ R, 0 6 t 6 T,

|b(t, x) − b(t, y)| + |σ(t, x) − σ(t, y)| 6 D|x − y|, for all x, y ∈ R, 0 6 t 6 T,

(5)

for some constants C, D > 0, guarantee the existence and uniqueness of solutions.

The analysis of Problem 1 relies on the results for the single-agent control problem, which we consider in the next section.

3. single agent problem

In this section, we assume that a price, $, is given. We derive a weak formulation for the Euler-Lagrange equation associated with the optimal control problem for one agent. In Section 4, using the single-agent result, we study how the collective actions of the agents determine the price.

Let x0∈ R. Given v ∈ HF, consider the dynamics for the agent (dXt= vtdt, t ∈ [0, T ]

X0= x0. (3.1)

Given a price process $ ∈ HF, the agent selects v ∈ HFaiming to reach

v∈HinfFE

"

Z T 0

L(Xt, vt) + $tvt dt + Ψ(XT)

#

(3.2) subject to X solves (3.1).

Let

I[v] := E

"

Z T 0

L(Xt, vt) + $tvtdt + Ψ(XT)

# ,

where X solves (3.1) for v. In the following, we study the existence and uniqueness of solutions to (3.2). We adopt the direct method of the calculus of variations. Hence, we begin by proving that the functional I[·] is weakly lower semi-continuous.

Proposition 3.1. Let x0∈ R and $ ∈ HF. Under Assumptions 1-4, the functional I[·] is weakly lower semi-continuous in HF.

Proof. We will prove that I[·] is convex and lower semi-continuous, from which weak lower semi-continuity follows ([19] Theorem 7.2.5). First, notice that, by Assumptions 1 and 2, I[·] is convex. To prove lower semi-continuity, let (vk)k∈N in HF be such that vk converges to v. Denote by Xk and X the solutions to (3.1) with the controls vk and v, respectively.

Notice that, because the trajectories Xk and X have the same initial condition, we have kXk− Xk2H

F6 2T2kvk− vk2H

F.

Therefore, Xkconverges to X. The convexity in Assumptions 1 and 2 imply ([4], Proposition 17.7)

Lv(Xt, vt)(vkt − vt) + L(Xt, vt) 6 L(Xt, vtk), (3.3) Ψ0(XT)(XTk − XT) + Ψ(XT) 6 Ψ(XTk). (3.4) Adding L(Xtk, vtk) − L(Xt, vkt) + $tvtto both sides of (3.3), we get

Lv(Xt, vt)(vkt − vt) − L(Xt, vtk) + L(Xtk, vkt) + L(Xt, vt) + $tvt 6 $t(vt− vkt) + L(Xtk, vtk) + $tvtk.

Taking E[RT

0 · dt] in the previous inequality, E[·] in (3.4), and adding both results, we obtain hLv(X, v), vk− viHF+ E

"

Z T 0

L(Xtk, vkt) − L(Xt, vtk) dt

#

+ I[v] + EΨ0(XT)(XTk− XT)

6 h$, v − vkiHF+ I[vk]. (3.5)

By Assumption 4, Lv(X, v) ∈ HF, hence

hLv(X, v), vk− viHF→ 0. (3.6)

By Assumption 3, Ψ0(XT) ∈ HF, and using the representation XTk − XT =RT

0 vkt − vt dt, we obtain

E[Ψ0(XT)(XTk− XT)] → 0. (3.7)

(6)

By Assumption 4, the Cauchy inequality, and the triangle inequality

E

"

Z T 0

L(Xtk, vtk) − L(Xt, vkt) dt

#

6 Ch|Xk− X|, 1 + |vk|iHF

6 CkXk− XkHF T + kvkkHF → 0.

Using the previous inequality, (3.6), (3.7), and the assumption on $, taking lim inf in (3.5), we obtain

I[v] 6 lim inf

k∈N I[vk].

Therefore, I[·] is lower semi-continuous. 

Proposition 3.2. Suppose that Assumptions 1- 5 hold. Given an initial condition x0∈ R and a price process $ ∈ HF, there exists an optimal control v ∈ HF that solves (3.2).

Furthermore, under Assumption 6, v is unique.

Proof. To prove existence, we use the direct method in the calculus of variations. By As- sumption 5, we have

α v + $

2

−$2

4α − β 6 L(x, v) + v$. (3.8)

Since $ ∈ HF, select a and b such that

0 < a < α, 2(α−a)1 k$k2H

F6 b.

Then, for any v ∈ HF, we have 0 6 (α − a)E

"

Z T 0

vt+2(α−a)1 $t2 dt

#

+ b −2(α−a)1 k$k2H

F. The previous inequality, (3.8), and 0 6 Ψ in Assumption 2, imply

akvk2H

F− b − βT 6 E

"

Z T 0

α(vt)2+ $tvt− β dt

# 6 I[v]

for all v ∈ HF. Therefore, v 7→ I[v] is coercive, and in particular, the infimum in (3.2) is finite. Let (vk)k∈N in HF be a minimizing sequence; that is,

lim

k→+∞I[vk] = inf

v∈HF

I[v].

By the coercivity of I[·], (vk)k∈N is bounded in HF. Recall that HF is a Hilbert space, so it is reflexive and, therefore, weakly precompact ([12], Appendix D, Theorem 3). Hence, there exists a subsequence, still denoted by vk, that weakly converges to v∈ HF; that is, for all w ∈ HF

hvk, wiHF→ hv, wiHF. By Proposition 3.1

I[v] 6 lim inf

k→+∞I[vk] = lim

k→+∞I[vk] = inf

v∈HF

I[v].

Therefore, v is a minimizer.

To prove uniqueness, denote by X the solution of (3.1) with the control variable v. Assume that ˜v ∈ HF is a minimizer of (3.2), with trajectory ˜X solving (3.1) for ˜v. Set Y = 12(X+ ˜X), so that Y satisfies (3.1) for the control 12(v+ ˜v). Then, by Assumptions 1 and 6,

I1

2(v+ ˜v) 6 12(I [v] + I [˜v]) −θ4kv− ˜vk2H

F.

It follows that ˜v = v in HF, which implies that ˜X = X.  The following result provides a characterization of minimizers of I[·]. This condition is a weak form of the Euler-Lagrange equation.

(7)

Proposition 3.3. Suppose that Assumptions 3 and 4 hold. Let v∈ HF solve (3.2), with the corresponding trajectory X solving (3.1). Then (X, v) satisfies

E

"

Z T 0



Lx(Xt, vt)δXt+ (Lv(Xt, vt) + $t) δvt



dt + Ψ0(XT)δXT

#

= 0 (3.9) for all δv ∈ HF, where

δXt= Z t

0

δvsds. (3.10)

Proof. Let  > 0 and δv ∈ HF. Consider the control v+ δv in (3.1). The corresponding trajectory is Xt= Xt+ δXt. Because v is a minimizer of I[·], the function

 7→ E

"

Z T 0



L(Xt, vt + δvt) + $t(vt+ δvt)

dt + Ψ(XT)

#

has a minimum at  = 0; that is, d

dE

"

Z T 0



L(Xt, vt+ δvt) + $t(vt+ δvt)

dt + Ψ(XT)

# =0

= 0. (3.11) By Assumption 4, the partial derivatives of L evaluated at (Xt, vt + δvt) are integrable w.r.t. E[RT

0 · dt]. From Assumption 4 and Young’s inequality, we have that L(Xt, vt+ δvt) + $t(vt + δvt) 6 ˜β + ( ˜β + 1

2)|vt+ δvt|2+1 2|$t|2.

In the same way, Assumption 3 guarantees analogous conditions for Ψ at XTε. Hence, we can differentiate under the integral sign in (3.11) ([5], Theorem 16.8), from which the result

follows. 

The formulation presented in Proposition 3.3 corresponds to the classical second-order characterization of minimizers given by the Euler-Lagrange equations. As in Hamiltonian mechanics, this second-order characterization has an equivalent first-order formulation. For this first-order characterization, we use the adjoint equation (see (3.12)).

Proposition 3.4. Suppose L ∈ C1(R2; R) and Assumptions 3 and 6 hold. Given x0∈ R, assume that (v, X) solves (3.9), where v ∈ HF, and X solves (3.1) for v. Then, the backward SDE

(dPt= −Lx(Xt, vt)dt + ZtdWt

PT = Ψ0(XT) (3.12)

has a unique solution (P, Z) on [0, T ], where P, Z ∈ HF. Furthermore,

P = −Lv(X, v) − $, (3.13)

and (X, P, Z) solves, on [0, T ], the forward-backward SDE system









dXt= −Hp(Xt, Pt+ $t)dt X0= x0

dPt= Hx(Xt, Pt+ $t)dt + ZtdWt PT = Ψ0(XT).

(3.14)

Proof. Assumption 3 implies that Ψ0(XT) ∈ L2T(R), and the continuity of Lx guarantees the adaptability of Lx(Xt, vt) w.r.t. F. Notice that this term is independent of P and Z.

Hence, Theorem 2.1 guarantees the existence and uniqueness of (P, Z) solving (3.12).

Let δv ∈ HF and δX according to (3.10). Then, because δX0= 0, using (3.12), we have E[Ψ0(XT)δXT] = E[PTδXT] = E

"

Z T 0

d (PtδXt)

#

= E

"

Z T 0

dPtδXt+ Ptδvtdt

#

= E

"

Z T 0

−Lx(Xt, vt)δXtdt + Ptδvtdt + ZtδXtdWt

# .

(8)

From the previous identity and (3.9), we get E[Ψ0(XT)δXT] = E

"

Z T 0

(Lv(Xt, vt) + $t+ Pt)δvtdt + ZtδXtdWt+ Ψ0(XT)δXT

# .

Recall that Z, δX ∈ HF, which implies that EhRT

0 ZtδXtdWti

= 0. Hence, we conclude that, for all δv ∈ HF,

E

"

Z T 0

(Lv(Xt, vt) + $t+ Pt)δvtdt

#

= hLv(X, v) + $ + P, δviHF= 0.

Therefore, Pt= −Lv(Xt, vt)−$t, from which Assumption 6 and (2.2) imply that (X, P, Z)

solves (3.14). 

Remark 3.5. Notice that (3.14) is independent of the optimal control v. Hence, if (3.14) has a unique solution and (3.13) is invertible, we obtain explicit expressions for the optimal control. This is the case, for instance, when L and Ψ are quadratic, as we illustrate in Section 5.

Next, we give conditions for the converse of Proposition 3.4 to hold.

Proposition 3.6. Assume that L ∈ C2(R2; R) is strictly convex in v, Ψ ∈ C1(R), and Assumptions 5, and 7 hold. Let (X, P, Z) solve (3.14), where X, P, Z ∈ HF. Then, v:= −Hp(X, P + $) and X satisfy (3.9). Furthermore, P = −Lv(X, v) − $.

Proof. From Assumption 7, we have v ∈ HF. The first equation in (3.14) states that X solves (3.1) for the control v. Then, by the strict convexity of L in v and Assumption 5, (2.2) gives that P = −Lv(X, v) − $ and Lx(X, v) = −Hx(X, P + $). Take δv ∈ HF

and δX, as in (3.10), and multiply the previous identities to obtain

Lx(X, v)δX = −Hx(X, P + $)δX, and (Lv(X, v) + $)δv = −P δv. (3.15) Integrating on [0, T ] the relation d

PtδXt

= dPtδXt+ Ptδvtdt, recalling that δX0= 0, and replacing (3.15), we have

PTδXT = Z T

0

dPtδXt− (Lv(Xt, vt) + $t)δvtdt.

Using the third equation in (3.14), the previous expression becomes PTδXT =

Z T 0

Hx(Xt, Pt+ $t)δXtdt + ZtδXtdWt− (Lv(Xt, vt) + $t)δvtdt.

Replacing the terminal condition for P in (3.14), using (2.2), taking expectation and recalling that Eh

RT

0 ZtδXtdWt

i

= 0, we obtain

E

"

Z T 0

Lx(Xt, vt)δXt+ (Lv(Xt, vt) + $t)δvt dt + Ψ0(XT)δXT

#

= 0.

Because δv ∈ HFis arbitrary, (X, v) solves (3.9).  Notice that Proposition 3.4 guarantees the existence of solutions to the system (3.12) and (3.14), but it only states the uniqueness of solutions to the system (3.12). The following proposition states a uniqueness result for (3.14).

Proposition 3.7. Suppose that Assumptions 1, 2, 3, 5 and 7 hold. Assume that L is strictly convex in v. Let H be given by (2.1). Suppose H ∈ C1(R2; R), and either

L is strictly convex in (x, v) or Ψ is strictly convex, and Hp, Hx

are uniformly Lipschitz in (x, p).

Then, the solution to (3.14) is unique.

(9)

Proof. Let (X, P, Z) and ( ˜X, ˜P , ˜Z) solve (3.14). Let

v = −Hp(X, P + $), Hx= Hx(Xt, Pt+ $t), Lx= Lx(X, v), Lv = Lv(X, v).

˜

v = −Hp( ˜X, ˜P + $), H˜x= Hx( ˜Xt, ˜Pt+ $t), L˜x= Lx( ˜X, ˜v), K˜v = Lv( ˜X, ˜v).

By Assumption 7, v, ˜v ∈ HF. Because of the strict convexity of L in v and Assumption 5, using (2.2), we have

Lv= −(P + $), ˜Lv= −( ˜P + $), Hx= −Lx, ˜Hx= − ˜Lx. (3.16) By Assumption 3, Ψ0(XT), Ψ0( ˜XT) ∈ L2T(R) (see Theorem 2.1). Hence, using (3.14) and Itˆo’s product rule, we get

Eh

Ψ0(XT) − Ψ0( ˜XT)

XT − ˜XT

i

(3.17)

= E

"

Z T 0

d

(Pt− ˜Pt)(Xt− ˜Xt)

#

= E

"

Z T 0

(Hx− ˜Hx)(Xt− ˜Xt)dt + (Zt− ˜Zt)(Xt− ˜Xt)dWt− (Pt− ˜Pt)(˜vt− vt)dt

# .

Recalling that Eh RT

0 (Zt− ˜Zt)(Xt− ˜Xt)dWt

i

= 0 and using (3.16), we obtain

E

"

Z T 0

(Hx− ˜Hx)(Xt− ˜Xt)dt + (Zt− ˜Zt)(Xt− ˜Xt)dWt− (Pt− ˜Pt)(˜vt− vt)dt

#

= E

"

Z T 0

( ˜Lx− Lx)(Xt− ˜Xt)dt − ( ˜Lv− Lv)(˜vt− vt)dt

# , and by Assumption 1 ([4], Proposition 17.7)

E

"

Z T 0

( ˜Lx− Lx)(Xt− ˜Xt)dt − ( ˜Lv− Lv)(˜vt− vt)dt

#

6 0. (3.18)

In the same way, the convexity of Ψ (see Assumption 2) implies 0 6 Eh

Ψ0(XT) − Ψ0( ˜XT)

XT − ˜XT

i

. (3.19)

Hence, from (3.17), (3.18) and (3.19), it follows that 0 6 Eh

Ψ0(XT) − Ψ0( ˜XT)

XT − ˜XT

i

(3.20) 6 E

"

Z T 0

( ˜Lx− Lx)(Xt− ˜Xt)dt − ( ˜Lv− Lv)(˜vt− vt)dt

# 6 0.

Now we use a characterization of strict convexity provided in [4], Proposition 17.10: If L is strictly convex in (x, v), (3.20) implies X = ˜X and v = ˜v, from which (3.16) implies P = ˜P and Z = ˜Z follows. On the other hand, if Ψ is strictly convex, (3.20) implies ˜XT = XT. Therefore, both (X, P, Z) and ( ˜X, ˜P , ˜Z) solve the BSDE









dXt= −Hp(Xt, Pt+ $t)dt XT = ˜XT,

dPt= Hx(Xt, Pt+ $t)dt + ZtdWt

PT = Ψ0( ˜XT)

for all t ∈ [0, T ]. The Lipschitz condition in both Hp and Hxallows us to use Theorem 2.1

to conclude that (X, P, Z) = ( ˜X, ˜P , ˜Z). 

Propositions 3.3 and 3.4 show that the existence of solutions to (3.14) is a necessary condition for the existence of solutions to (3.2). In the next result, we consider conditions for (3.14) to be sufficient.

(10)

Proposition 3.8. Suppose that Assumptions 1, 2, 5, and 7 hold. Assume further that L is strictly convex in v. Let (X, P, Z) solve (3.14) and define v= −Hp(X, P + $). Then v solves (3.2).

Proof. By Assumption 7, v∈ HF. Let v ∈ HF and X solve (3.1) for v. From Proposition 3.6, we have Lv(X, v) = −P − $. By the convexity of L in (x, v), we have

L(Xt, vt) + Lx(Xt, vt)(Xt− Xt) + Lv(Xt, vt)(vt− vt) 6 L(Xt, vt) By Assumption 5 and the strict convexity of L in v, from (2.2), we get

Lx(Xt, vt)(Xt− Xt) + Lv(Xt, vt)(vt− vt)

= −Hx(Xt, Pt+ $t)(Xt− Xt) − (Pt+ $t)(vt− vt).

Hence,

L(Xt, vt) + $vt− Hx(Xt, Pt+ $t)(Xt− Xt) − (Pt)(vt− vt) 6 L(Xt, vt) + $tvt. (3.21) Using (3.1) and (3.14), we compute

d (Pt(Xt− Xt)) = dPt(Xt− Xt) + Pt(dXt− dXt)

= Hx(Xt, Pt+ $t)(Xt− Xt)dt + Zt(Xt− Xt)dWt+ Pt(vt− vt).

Taking E[RT

0 · dt] in the previous identity, recalling that Eh RT

0 Zt(Xt− Xt)dWt

i

= 0, and using the terminal condition for P in (3.14) and the initial condition for X and Xin (3.1), we get

E [PT(XT − XT)] = E

"

Z T 0

Hx(Xt, Pt+ $t)(Xt− Xt)dt + Pt(vt− vt)dt

# .

From the previous identity, (3.21), and the convexity of Ψ (see Assumption 2), we conclude that

I[v] 6 I[v]

for arbitrary v. The result follows. 

4. N-agent problem

Here, we introduce a minimization problem that aggregates the costs of all agents con- sidered in the previous section and is constrained by the total supply. By aggregating the costs of all agents, we obtain an equivalent variational problem that is independent of the price. We show the existence of minimizers and obtain the price as the Lagrange multiplier for the supply constraint.

Let x0 ∈ RN be the initial configuration of N agents. Given the controls v ∈ HNF, consider the dynamics for N agents

(dXt= vtdt, t ∈ [0, T ]

X0= x0, (4.1)

where X = (X1, . . . , XN). If the price is known, the functional of the single-agent mini- mization in (3.2) is independent of the actions of other agents. To solve (3.2), each agent looks for their optimal control v, and this control is coupled with the control of other agents through the balance condition (1.4). Hence, as long as the balance condition is satisfied, the vector v := (v∗1, . . . , v∗i), consisting of the optimal controls for each agent, is an optimal control for the following minimization problem

inf

v∈HNF

1 N

N

X

i=1

E

"

Z T 0

L(Xti, vti) + $tvitdt + Ψ(XTi)

#

subject to X solves (4.1).

(11)

Reciprocally, as long as the balance condition is satisfied, any optimal control v of the previous minimization problem provides, through its components v∗i, for 1 6 i 6 N , an optimal control for (3.2). Therefore, Problem 1 is equivalent to the following

inf

v∈HNF

1 N

N

X

i=1

E

"

Z T 0

L(Xti, vti) + $tvitdt + Ψ(XTi)

#

(4.2)

subject to 1 N

N

X

i=1

vi = Q, and X solves (4.1).

Substituting the balance condition into the expression to minimize in (4.2), we get 1

N

N

X

i=1

E

"

Z T 0

L(Xti, vit) + $tQtdt + Ψ(XTi)

#

. (4.3)

Let

IN[v] := 1 N

N

X

i=1

E

"

Z T 0

L(Xti, vti)dt + Ψ(XTi)

# .

Since the expression h$, QiHFin (4.3) is independent of v, we can drop this term and obtain that (4.2) is equivalent to

Problem 2. Find a vector of control processes v∈ HNF that attains the following inf

v∈HNF

IN[v] (4.4)

subject to 1 N

N

X

i=1

vi= Q, and X solves (4.1).

The next proposition shows that this problem has a solution; that is, there exists v∈ HNF such that IN[v] attains the infimum in (4.4) and satisfies the constraints.

Proposition 4.1. Let Q ∈ HF. Suppose that Assumptions 1, 2, 3, 4, and 5 hold. Given an initial condition x0 ∈ RN, there exists an optimal control v ∈ HNF that solves Problem 2. Furthermore, under Assumption 6, v is unique.

Proof. We follow the direct method in the calculus of variations to prove existence. Define the set of admissible controls

C = (

v ∈ HNF : 1 N

N

X

i=1

vi= Q, )

. (4.5)

Notice that C is a convex set. Also, this set is not empty because vi = Q for 1 6 i 6 N is an element of C. The set C is also closed because any sequence (vk)k∈N in C that converges to v in HNF satisfies

1 N

N

X

i=1

vi− Q

2

HF

6 2 N

N

X

i=1

vi− vi,k

2 HF→ 0.

By a similar argument to that used in the proof of Proposition 3.2, Assumption 5 implies

that α

Nkvk2HN

F − βT 6 IN[v],

for all v ∈ HNF. Therefore, v 7→ IN[v] is coercive and bounded from below. In particular, the infimum in (4.4) is finite. Let (vk)k∈N in C be a minimizing sequence of (4.4); that is,

lim

k→+∞IN[vk] = inf

v∈CIN[v].

By coercivity of IN[·], (vk)k∈Nis bounded in HNF. Because HFis a Hilbert space, HNF is also a Hilbert space. Hence, let v∈ HNF be a control for which there is a subsequence, still denoted by vk, that weakly converges to v. Since C is convex and closed, by Mazur’s theorem ([19],

(12)

Theorem 7.2.4), it is weakly closed. Therefore, v∈ C. Arguing as in Proposition 3.1 using Assumptions 1, 2, 3, and 4, we have that IN is weakly lower semi-continuous. Hence,

IN[v] 6 lim inf

k→+∞IN[vk] = lim

k→+∞IN[vk] = inf

v∈CIN[v].

Accordingly, v is a minimizer. The uniqueness of v follows from Assumption 6 and a similar argument to the one in the proof of Proposition 3.2.  The following lemma characterizes the orthogonal complement of the elements in HNF, whose entries add to zero. We will use this lemma to prove the existence of a Lagrange multiplier.

Lemma 4.2. Let Z =n

w ∈ HNF : PN

i=1wi = 0o

. Denote by Z the orthogonal comple- ment of the set Z with respect to h·, ·iHN

F. Then, Z = v ∈ HNF : v = ¯v1N, ¯v ∈ HF , where 1N = (1, . . . , 1) ∈ RN.

Proof. Let v ∈ Z. For δw ∈ HNF, define w = δw −



1 N

N

P

i=1

δwi



1N. Then, w ∈ Z, which implies that hv, wiHN

F = 0. Writing

N

X

i=1



δwi, vi− 1 N

N

X

k=1

vk



HF

= E

"

Z T 0

N

X

i=1

δwit vti− 1 N

N

X

k=1

vkt

!#

= E

"

Z T 0

N

X

i=1

vti δwit− 1 N

N

X

k=1

δwtk

!#

= E

"

Z T 0

N

X

i=1

vitwitdt

#

= hv, wiHN F, the orthogonality between v and w implies that

N

X

i=1



δwi, vi− 1 N

N

X

k=1

vk



HF

= 0.

Because in the previous identity δw is arbitrary, we conclude that ¯v := N1 PN

k=1vk ∈ HF

satisfies vi = ¯v, for 1 6 i 6 N ; that is, v = ¯v1N, where ¯v ∈ HF. On the other hand, let v = ¯v1N, where ¯v ∈ HF, and let w ∈ Z. Then,

hv, wiHN F =

N

X

i=1

hvi, wiiHF =

N

X

i=1

E

"

Z T 0

¯ vtwitdt

#

= E

"

Z T 0

N

X

i=1

wit

!

¯ vtdt

#

= 0,

which implies that v ∈ Z. This completes the proof. 

Next, we prove the existence of a Lagrange multiplier corresponding to the balance con- dition. This Lagrange multiplier uniquely defines the price.

Proposition 4.3. Suppose that Assumptions 3 and 4 hold. Let v∈ HNF solve Problem 2 with the corresponding trajectory X. For 1 6 i 6 N , let Pi, Zi ∈ HF solve, on [0, T ],

(dPti= −Lx(Xti, vti)dt + ZtidWt

PTi = Ψ0(XTi). (4.6)

Then, there exists a unique Π ∈ HF that satisfies

Π = Pi+ Lv(X∗i, v∗i) for 1 6 i 6 N. (4.7) Hence,

Π = 1 N

N

X

i=1

Pi+ Lv(X∗i, v∗i), and ΠT = 1 N

N

X

i=1

Ψ0(XTi) + Lv(XTi, vTi). (4.8)

(13)

Proof. Let δv ∈ Z, and define δX according to (3.10). Then, for all  > 0, according to (4.1), the process X= X+ δX is driven by v+ δv. Notice that v+ δv ∈ C because v satisfies the balance condition, and hence

1 N

N

X

i=1



v∗i+ δvi

= Q.

Thus, the function  7→ IN[v+ δv] attains a minimum at  = 0. Therefore, d

dIN[v+ δv]

=0

= 0.

Proceeding as in the proof of Proposition 3.3, using Assumptions 3 and 4, we conclude that 1

N

N

X

i=1

E

"

Z T 0

Lx(Xti, vti)δXti+ Lv(Xti, vti)δvitdt + Ψ0(XTi)δXTi

#

= 0. (4.9) Now, for 1 6 i 6 N , consider the following BSDE on [0, T ]

(dPti= −Lx(Xti, vti)dt + ZtidWt

PTi = Ψ0(XTi). (4.10)

Assumption 3 guarantees that Ψ0(XTi) ∈ L2T(R), so we use Theorem 2.1, and we denote by (Pi, Zi) the unique solution of (4.10). By applying Itˆo’s product rule to PiδXi, we get

Lx(Xti, vti)δXtidt = Ptiδvtidt − d PtiδXti + δXtiZtidWt. (4.11) Because the process s 7→Rs

0ZtiδXtidWt is a martingale w.r.t. Fsand hence ([21], Corollary 3.2.6)

E

"

Z T 0

ZtiδXtidWt

#

= 0, (4.12)

using 4.11, the definition of δX, and the previous identity, we write (4.9) as 1

N

N

X

i=1

E

"

Z T 0

Pti+ Lv(Xti, vti) δvti dt

#

= 1

NhP + Lv(X, v), δviHN F = 0.

Hence, by Lemma 4.2, there exists Π ∈ HF such that for 1 6 i 6 N Pi+ Lv(X∗i, v∗i) = Π.

Thus, taking the mean over i and using the terminal condition for Pi, we get Π = 1

N

N

X

i=1

Pi+ Lv(X∗i, v∗i), and ΠT = 1 N

N

X

i=1

Ψ0(XTi) + Lv(XTi, vTi).  The following result shows that existence of the price process follows from existence of the Lagrange multiplier associated with the balance condition.

Proof of Theorem 1.1. By Proposition 4.1, let v = (v∗1, . . . , v∗N) be a minimizer of (4.4). From Proposition 4.3, let Π ∈ HFbe the process that satisfies, for 1 6 i 6 N ,

Pi+ Lv(X∗i, v∗i) − Π = 0.

Hence, for δv ∈ HNF and 1 6 i 6 N , we have E

"

Z T 0



Pti+ Lv(Xti, vti) − Πt

 δvitdt

#

= 0.

Applying Itˆo’s product rule to PiδXi, δX as in (3.10), and using (4.6), we rearrange (4.11) to obtain

d PtiδXti = −Lx(Xti, vti)δXtidt + ZtiδXtidWt+ Ptiδvtidt.

References

Related documents