VOL. 15 NO. 2 (1992) 359-366
ANALYTIC
SOLUTION
OF RICCATI
EQUATIONSOCCURRING
IN
OPEN-LOOP
NASH
MULTIPLAYER DIFFERENTIAL GAMES
L.
JDAR
andE. NAVARRODepartamentodeMatemticaAplicada UniversidadPolit6cnicadeValencia
E.T.S.I.I.,
P.O. Box22.012Valencia,Spain
(Received January
2,1991)
ABSTRACT.
In
this paper we present explicit analytic solutions of coupled Riccati matrixdifferential systems appearing in open-loop Nash games. Two different cases are considered.
Firstly, by means of appropriate algebraic transformations the problem is decoupled so that an
explicitsolutionoftheproblemis available. The secondis basedontheexistenceofasolutionofa
rectangularRiccatitype algebraicmatrixequation associatedwiththeproblem.
KEY
WORDS AND PHRASES. Riccatiequations, differential games, optimizationproblem.1991
AMS SUBJECT CLASSIFICATION CODE.
34A341.
INTRODUCTION.
When noncooperative problemsare tackled,a gametheoretic approachis necessary: each control agent(decision
maker orplayer)
tries to optimize his own cost function which conflicts more or less with others.An
equilibrium solution must besought, and the Nashstrategy is a natural choice.
In
this case, a player cannot improve his payoff by deviating unilaterally from his Nash strategy. Due to this noncooperation, the optimization problem ofvarious players are strongly coupled and necessary conditions for open-loop Nash strategy lead to
complextwo-pointboundaryvalueproblems.
ConsideraN-playerslinearquadraticdifferentialgame definedby
N
x’= Ax
+
Biui;
x(0)-
x0(1.1)
i=l withthecost functionals associated withtheplayers
1 T
x!+
(xTOix/
uRijuj)d
Ji
"
x.f
Kiy
0 j=l
x(tl)
xI
(1.2)
where all matrices are nxn symmetric with
Rii
for< <
N,
positive definite. It is wellknown that the open-loop Nash controlmustsatisfy[12]:
ui=
-RIB@i;
=
-Qi
x-AT@i,
@i(tf)=Kifxf,
l<i<N(1.3)
whereqi
is the costate vector associatedwith player "i’. When the transformationi
Kix,
isN is
givenby introduced,for 1g g
N,
theopen-loopNashstrategy(u
)i
*
-R.-:;..1BT..Ki(t)O(t,O)xo,,,
l<i<Nwhere
with
N
K=
-ATKi-KiA-Qi+Ki
y
SjKj;
Ki(tl)=Kif,
<_i<_N(1.5)
j-1
S
BiR’IBTi,
<
<_
N(1.6)
and
q)(t,0)
isthesystem’s
transition matrixsatisfyingN
’(t,O)
(A-
sjgj)@(t,O);
@(t,t)
I
(1.7)
j=l
Notethat thematrices
Rij
for j donot appearinthe necessary conditions dueto the fact that under open-loop strategyassumptions, eachplayeroptimizeshis criterionknowingthatOui/Ox
O,for
_< _<
N. Fortheopen-loopNashstrategyandunder theexistenceofasolutionof thecoupledRiccatisystem, the optimizationproblemhasonlyone solution
[2].
The solution of system
(1.5)
is generally difficult to obtain due to the permanent coupling between the players’ strategies.In
[4]
a series solution of system(1.5)
is proposed but the coefficient areobtained solving several linear matrixequations.In
[14]
a numericalalgorithm for the integration of(1.5)
is given.A
singular perturbation method for solving(1.5)
is proposed in[10].
ForthecaseN 2, aniterativealgorithmforsolving(1.5)
isgivenin[5].
For
thecaseN 2 andQ2- cQ1,
where is ascalar, an analyticsolutionofsystem(1.5)
waspointed out in[1].
In
this paper we obtain an explicit solution of system(1.5)
for a case moregeneral
than the oneproposed in
[1].
Also, a different type condition expressed in terms of the existence of certaincoupled algebraicRiccati matrixsystemisproposed.
2.
ANALYTIC
SOLUTION OFCOUPLED RICCATI SYSTEM BASED
ONALGEBRAIC
TRANSFORMATIONS.
Forconvenience, thenecessary conditions tobe satisfied
(1.1), (1.3),
are rewritteninamatrix formA
-Q1
-AT
-O
o
-0
o
-S
-S2 SN0 0
-A
T 00
A
T(2.1)
z(0)
Zo,Now,
letusintroducethechangeofbasis=T
w21
WN
i(tf)
Kifz
f, 1< <
N(2.2)
I
0 0 00
I
0 00
L
2I
00
L
N 0I
(2.3)
equivalenttothe followingone
wN
A
-S-S2L
SNL
N -S2 SN-Q1
-A
T 0 0L2Q-Q2
L2AT-ATL2
-A
T 0LNQI-QN
LNAT-ATLN
0A
T(2.4)
x(O)
Xo,(t!)
Kfxf,
wj(ty)
(Kj]- LjKlf)Xy,
2<_
j<
N(2.5)
The purpose of this transformation is to find under what conditions the player’s optimization
problemcanbedecoupled.
In
fact,notethat ifL2,L3,...,L
Nsatisfy the systemLjQ
Qj,
LjA
TATLj,
2<
j<
N
(2.6)
the matrix
T-IMT
is reduced to ablocktriangularform andthe costatevectorsq/1,w2,...,w
N, arecoupled onlyviatheterminalconditions(2.2).
Note
that forthecaseN
2 and thematricesQ1
andQ2
areproportional, i.e.,Q2
cQ1,
forome
scalarc, takingL
cl, onegetssolutionsofsystem(2.6)
forN
2. Thus thecase studiedin
[1]
isaparticularcaseof(2.6).
In
order tocharacterizetheexistenceofsolutionsfor the algebraic system(2.6),
we recall theconceptof tensor productofmatrices. If
A,B
arematrices inR
mxnandR
kxs,
respectively, thenthe tensorproductof
A
andB,
denotedA
(R)B,
is definedasthe partitionedmatrixIf
A
ER
raxn,
wedenoteA(R)B=
allB
al2B
alnB
amlB
am2B
amnB
alj
A.j=
l<
j<
n;vecM=
If
M,N
andP
arematricesof suitable dimensions, thenusingthe column lemma[7,
p.410],
wegetvec(MNP)
(pT
(R)M)
vecN(2.7)
Taking intoaccount
(2.7),
ifweapplytensorproducts ineach equation of system(2.6),
itmay bewritten intheequivalentform
where 2
<
j<
NandC
vecLj
vec[0,Qj]
(2.8)
I(R)AT_A(R)
I
1
C
(2.9)
QI(R)I
thesystem
(2.9)
iscompatibleifand onlyifCC
+
vec[0,Q/]
vec[0,Q/]
(2.10)
Furthermore, undercondition
(2.10),
thegeneralsolutionof(2.9)
forLj
isgiven byvecL/=
C+
vec[0,Q2
+
(I -C +
C)z
(2.11)
n2
where
I
denotes the identitymatrixinRn2x
andz isanarbitraryvectorinR
n2.
Werecall thatC
+
maybecomputed byusingMATLAB
[8].
Let us assume the existence ofsolutions
Lj
of(2.8)
for 2<
jgN,
then from(2.4),
(2.5)
itfollows that
-$2
SN
-AT
00
A
Tx(0)
x0@l(t/)
Kl.fX
!
wj(t.f)
(KH
LjKII)X
f 2
<_
j<_
N
(2.12)
where
A
-S1-$2L2
SNL
N-Q1
-AT
Letusconsiderthechange
t(s)
I
s, 0<
s<
tl,
andlet(2.13)
Hence
problem(2.11)
may bewrittenintheformff2
-V
-S
2 SN(t
I)
to,Solving
(2.15)
weobtainql(0)
KII.(0),
tj(0)
(Kjf- LjKIf)(O
),
2<_
j<_
Nffj(s)
exp(sAT)ffj(O),
2<_
j<_
N
From
(2.14)
and(2.16)
wehave(2.15)
(2.16)
where
Thuswehave
+
J=20
JL
0j
j(s)
[I,
OlG(s)(O);
el(S)
[o,
IlG(s)y(o)
(2.18)
(2.19)
(2.20)
Note
that[I,0]G(0)=
I,
and from the continuity ofG,
there exists an interval 0<
s<
6, such that[l,O]G(s)
isinvertible for allsE[0,6]
(2.21)
From
(2.19)
and(2.20)
weobtain(0)
{[I,O]G(s)}
-1(s)
i(s)
[O,I]G(s){[I,O]G(s)}
-l,(s),
0<
s<
6(2.22)
Now,
from(2.3)
and(2.13),
itfollows that,I,(s)
L
,r()
+,(),
2_<
j_<
gand from
(2.15)and (2.19)
Cj(s) {exp(sAT)(Kjl
LjKII
+ Lj[O,I]G(s)}
{[I,O]G(s)}
-1(s)
(2.23)
for 0 <s<6.
From
(2.13)
andthe relationsqj(t)= Kj(t)z(t),
1<
j<
N,
itfollows thatKI(t
[0,
IIG(t!
t)
{[I,0IG(t!
0}
-x
(2.24)
Kj(t)
{exp(AT(tl-
t))[Kj!-
LjKll
+ Lj[O,I]G(t!-
t)}
{[l,O]G(tl-
t)}
-1l-6<t<tf;
2<j<_N(2.25)
whereG
isdefinedby(2.18).
Thus thefollowingresult has beenproved:THEOREM 1. Letusassumethatmatrices
A
andQi
for 1_< _<
N,
satisfy thecondition(2.9)
whereC
is definedby(2.8),
and letLj
be the solution of(2.8)
for 2<
j_<
N. Then thereexistsapositive number 6 such that on the interval
[t1-6, tl]
the unique solution of the coupled RiccatiREMARK
1. Notethatthe caseN 2,Q2
aQ1,
wherea is ascalar,is aparticularcaseof the previous theorem takingL
2 aI. It isimportant to notethat from(2.23)
and(2.24),
wehavethe followingrelationbetween
K1
(t)
andK2(t
):
K2(t
L2KI(t
+exp(AT(tf-
t))[K2l-
L2gly]{[I,O]G(ty-
t)}
-1and since the function
{[I,O]G(tl-t)}-I
is involved in the computation ofgl(t),
thecomputationalcost isreduced because
K2(t
isexpressed interms ofKl(t
).
Finallywerecall thatefficient methods for computingmatrixexponentials and integrals involving them and that appear in the expression of
G(s),
maybe found in[13].
Theseproceduresareextremelyeasy toimplement andyieldanestimationof the approximationerror.3.
ANALYTIC SOLUTION
OFCOUPLED RICCATI DIFFERENTIAL SYSTEM BASED ON
THE
EXISTENCE OFA SOLUTION
OFA
COUPLEDALGEBRAIC RICCATI SYSTEM.
Riccati typematrixequations withrectangularcoefficientsarisefor instanceintheproblemof finding a state estimate feedback controller
[3]
and in the transformation of ill-conditioned linearsystemstoablock diagonal form
[6,9].
An
efficientmethod to findsolutionsofsuch equations may befound in[15].
The aimofthis section is to propose anotherway to find an analyticsolution of the coupled Riccati differential system(1.5).
Note that system(1.5)
may be written in thefollowing compactform
where
and
K’(t)
-Q-
K(t)A
BK(t)
+
K(t)SK(t);
K(tI)
K!
(3.1)
K=
Q=
B=Diag(AT,
AT,...,AT), K]=
(3.2)
s
=[S,S2,...,SN]
Letus assumethat therectangular algebraicRiccatiequation-Q-XA-BX+XSX=O
U.f
K
I
X
oadmitsasolution
X
o 6_CNnxnand let usconsiderthechangeV(t)
K(t)
X
oThenproblem
(3.1)
isequivalent to thefollowingoneU’(t)
BoV(t)
U(t)Ao
+
U(t)SV(t);
where
Bo
XoS-B
Ao
A-SXo
Now,
letusconsiderthe extendedlinearsystem(3.3)
(3.4)
(3.5)
(3.6)
where
V(t)
ECnx,andZ(t)
CNnxn.
Ifwedefine thematrixfunctionexp((t-s)A)
I
exp((t-v)A)Sexp((v-z)B)dv
S(t,s)
ti
(3.9)
0
exp((t-s)Bo)
thenaneasycomputation yields
Ao
-S]
(o/ot)s(t,)
s(t,)
0
Bo
(3.10)
and thus
S(t,s)
is a fundamental matrix of(3.8)
and the unique local solution of(3.8)
in aneighborhoodof
I
isgivenbyS(tI, tf)
I(N
+
1)nNote
thatV(![)=
In
and thusin aneighbourhoodY
containing I,V(t)
is invertible.Now,
letusdefine the
CnXn-values
matrix functionU(t)= Z(t)(V(t))-x
forteJ.
Note
that from(3.8)it
followsthat
V’(t)
AoV(t)- SZ(t)
andZ’(t)
BoZ(t)
Computingitfollows thatU’(t)
Z’(t)(V(t))
-1Z(t)(V(t))
-1V’(t)(V(t))
-1BoU(t)
U(t)Ao
+
U(t)SU(t)
for all (
J.
Hence
K(t)= U(t)+
Xo
is the solution of(3.1),
defined on the intervalJ.
From
(3.10)
itfollows thatV(t)
exp((t
tl)Ao)-
|
exp((t
v)Ao)Sexp((v
tl)Bo)dvU
!
exp((t-
tl)AotI-
.exp((t!
v)Ao)Sexp(vBo)dv
exp(g(t)
exp((t-
ti)Bo)U
!
-1
K(t)
Xo +
exp((t
ti)Bo)U!
I
.exp((tI
v)Ao)Sexp(vBo)dv exp(
exp(
f
t)Ao))
tl
(3.11)
hu if the gebre meeati
muation (3.4)
h solutionXo,
hen theoutionK(t)
of(3.1)
defined by(3.11)
whereUI,
Ao
dBo
e given by(3.7).
he solution is defined in theeighho[
t]
whereV(t)
isivertible.ACKNOWLEDGEMENt.
This pnr hn
ptily supported by theD.G.I.C.Y.T.
PS87-0064d the
NAO
grit900040.REFERENCES
ABOU-KANDIL, H.
andBERTRAND,
P.,
Analytic solution for a class of linear quaAratic2.
BASAR,
T.,
On the uniqueness of the Nash solution in linear quadratic differential games,Internat. J.ofGameTheory, vol.5
(1973),
65-90.3.
ANDERSON, B.D.O.,
The testing of optimality linear systems,Intern.at..
J.
(ontrol No.(I
966),
29-40.4.
CRUZ, J.B.,
JR. andCHEN, C.I.,
Series Nash solution of two-person nonzero sum linearquadratic games,J._:. TheoryandAppl.,
7(1971),
240-257.5.
J(DAR,
L. andABOU-KANDIL, H., A
resolution method for Riccati differential systems coupledin theirquadratic terms,SIAM
J.
Math. Anal.
!9(1988),
1225-1230.6.
KOKOTOVIC, P.V., A
Riccati equation for block-diagonalizationof ill-contditioned systems,IEEE
Trans. Autom. (ontrolA-21(1975),
812-814.7.
LANCASTER,
P. andTISMENETSKY, M.,
The theory of matrices,Second
ed.. AcademicPress,
New York, 1985.8.
MOLER, C.B., MATLAB User’s
guide, TechnicalReport
CS81-1(1980)
Department
ofComputer
Science,Univ.ofNewMexico,Alburquerque, NewMexico, 87131.9.
NARASIMHAMURTHI,
N. andWU,
F.F.,
On the Riccati equation arising from thestudyof singularly perturbed systems,Proc.
of the 1977JointAutom.
Control Conf. 22-24 June, SanFranscisco, Vol.V,
1244-1247.10.
OZGUNER,
U.
andPERKINS,
W.R.,
A
series solution to the Nash strategy for large scaleinterconnectedsystems, Automatica, Vol.._._.
13(1977),
313-315.11.
RAO,
C.R. andMITRA, S.K.,
Generalized Inverse of Matrices and its Applications, John-Wiley,New York, 1971.11/2.
STARR,
A.W. andHO, Y.C.,
Non-zero sum differential games,J.
Optimization Theory and Appl.,3_(1969),
179-197.13.
VAN LOAN, C.F.,
Computing integrals involving thematrix exponential,IEEE Trans.
Aut. (,ontrol.,AC.-2:
(1978. $95-404.14.
SIMAAN,
M. andCRUZ, J.B.,
JR.,
On
the solution ofopen-loop Nash Riccati equations inlinear qualratic differential games,
Internat,
J.
(ontrol,
Vol. 18(1973),
57-63.15.
MEYER, H.B.,
The matrix equationAZ
+
B-ZCZ-ZD 0,SIAM
J.
Appl. Math., Vol.