On the basis of assumptions (2.A.1) and (2.A.2), the dynamic system (2.1)-(2.2) admits a unique solution corresponding to each pair (π , π») β π― Γπ΅ [111]. We denote this solution by π(β β£π , π»). Substituting π(β β£π , π») into (2.5) gives π(β β£π , π»), the predicted system output corresponding to (π , π») β π― Γ π΅. More formally,
π(π‘β£π , π») = π(π(π‘β£π , π»), π»), π‘ β€ π. (2.6)
Suppose that the output from system (2.1)-(2.2) has been measured experimentally at times π‘ = π‘π, π = 1, . . . , π, where each π‘π β [0, π ]. Let Λππ β βπ denote the measured output
at time π‘ = π‘π. Then the problem of identifying the unknown time-delays and system
parameters can be formulated mathematically as follows.
Problem (P). Choose π β π― and π» β π΅ such that the following cost function:
π½(π , π») =
π
β
π=1
π(π‘πβ£π , π») β Λππ2 (2.7)
is minimized, where β£ β β£ denotes the usual Euclidean norm.
Problem (P) is a nonlinear dynamic optimization problem whose decision variables are the delays and system parameters in system (2.1)-(2.2). We need to select optimal values for these delays and parameters so that the predicted system output best ο¬ts the experimental data. There are very few optimization techniques available in the literature for time-delay systems. In the existing literature, the delays are often assumed to be ο¬xed and known (see, for example, [79, 119, 150]). Problem (P) is unique in that the delays are not ο¬xed, but are decision variables to be chosen optimally. The cost function in Problem (P) is also highly non-standard, as it depends on the systemβs state at a set of discrete time points, not just at the terminal time. Such cost functions have been considered in [120, 121] for non-delay systems, and in [93] for systems with ο¬xed delays. However, the computational techniques developed in these references are not applicable to Problem (P) because the time-delays in system (2.1)-(2.2) are decision variables to be identiο¬ed.
2.3 Preliminaries
Throughout this subsection, let π β {1, . . . , π} and (π , π») β π― Γπ΅ be arbitrary but ο¬xed. For simplicity, we write π(π‘) instead of π(π‘β£π , π»), and ππ(π‘) instead of π(π‘β£π +πππ, π»), where
ππ denotes the πth unit basis vector in βπ.
Deο¬ne
Note that πΌ β= β and 0 β πΌ. Clearly,
π β πΌ ββ π + πππβ π― .
For each π β πΌ, deο¬ne
ππ(π‘) = ππ(π‘) β π(π‘), π‘ β€ π,
and
π½π,π(π‘) = ππ(π‘ β π
πβ ππΏππ) β π(π‘ β ππ), π‘ β€ π, π = 1, . . . , π,
where πΏππ denotes the Kronecker delta function. Furthermore, let
π½π(π‘) = [(π½π,1(π‘))β€, . . . , (π½π,π(π‘))β€]β€β βππ, π‘ β€ π.
Clearly,
π½π,π(π‘) = ππ(π‘ β π
π), π‘ β€ π, π β= π, (2.8)
ππ(π‘) = 0, π‘ β€ 0. (2.9)
In the sequel, we will use the notation β
β Λππ to denote partial diο¬erentiation with respect
to the πth delayed state in Λπ(π‘) (i.e. partial diο¬erentiation with respect to π(π‘ β ππ)).
Now, deο¬ne ππ(π‘) = β§ β¨ β© Λπ(π‘), if π‘ β€ 0, π(π‘, ππ(π‘), ππ(π‘ β π 1β ππΏπ1), . . . , ππ(π‘ β ππβ ππΏππ), π»), if π‘ β (0, π ]. (2.10) We immediately see that for almost all π‘ β (ββ, π ],
Λππ(π‘) = ππ(π‘). (2.11)
Let Β―π > 0 be a ο¬xed constant such that
Β―π = max
π=1,...,π{ππ}.
We have the following lemma.
Lemma 2.1. There exists a a positive real number πΏ2 > 0 such that for each π β πΌ,
β£ππ(π‘)β£, β£ππ(π‘)β£ β€ πΏ
2.3 Preliminaries 25
Proof. Recall that π(π ) = π(π ) is given for all π β€ 0. By (2.A.1), π(π ) is twice diο¬eren-
tiable. Clearly, there exist positive numbers πΌ1 and πΌ2 such that
β£π(π )β£ β€ πΌ1, π β [βΒ―π, 0], (2.13)
β£ Λπ(π )β£ β€ πΌ2, π β [βΒ―π, 0]. (2.14)
For brevity, we denote
Λππ(π‘) = [ππ(π‘ β π
1β ππΏπ1)β€, . . . , ππ(π‘ β ππβ ππΏππ)β€]β€.
For each π β [0, π ], we have
ππ(π‘) = π(0) +β« π‘
0 π(π , π
π(π ), Λππ(π ), π»)ππ , π‘ β [0, π ]. (2.15)
Since π» is bounded in π΅, applying (2.A.2) to (2.15) it follows that
β£ππ(π‘)β£ β€ πΌ 1+ β« π‘ 0 πΏ1(1 + β£π π(π )β£ + β£Λππ(π )β£ + β£π»β£)ππ β€ πΌ1+ πΏ1π + β« π‘ 0 πΏ1(β£π π(π )β£ + β£Λππ(π )β£ + β£π»β£)ππ β€ πΌ1+ πΏ1π + β« π‘ 0 πΏ1(β£π π(π )β£ + β£π»β£)ππ +βπ π=1 β« π‘βππ βππ πΏ1β£ππ(π )β£ππ β€ πΏ0+ β« π‘ 0 πΏ1β£π π(π )β£ππ +βπ π=1 β« π‘ βΒ―ππΏ1β£π π(π )β£ππ , π‘ β [0, π ],
where πΏ0 = πΌ1+ πΏ1π + ππΏ1ππ and Β―Β― π = maxπ=1,...,π{ππ}. Therefore, by using (2.13),
β£ππ(π‘)β£ β€ πΏ
0+ ππΌ1Β―π + (π + 1)πΏ1
β« π‘
0 β£π
π(π )β£ππ , π‘ β [0, π ].
Then, by using (2.10) and Gronwall-Bellmanβs lemma [111], we obtain
β£ππ(π‘)β£ β€ π
1exp((π + 1)πΏ1π), π‘ β [0, π ], (2.16)
where π1 = πΏ0+ ππΌ1Β―π.
In addition, for each π β [0, π ], consider (2.10)-(2.11), then by using (2.A.2),
β£ππ(π‘)β£ β€ πΏ
Clearly, β£ππ(π‘)β£ β€ πΏ 1+ ππΏ1π + πΏΒ― 1β£ππ(π‘)β£ + πΏ1 π β π=1 β£ππ(π β π π β ππΏππ)β£. (2.17)
Thus, by applying (2.16) to (2.17), we obtain
β£ππ(π‘)β£ β€ πΏ
2, (2.18)
where πΏ2 = πΏ1+ ππΏ1π + (π + 1)πΒ― 1πΏ1exp((π + 1)πΏ1π). Combining (2.13), (2.14), (2.16)
and (2.18), the conclusion of the lemma follows readily. Deο¬ne
Ξ = {π β βπ : β£πβ£ β€ πΏ
2}. (2.19)
Then, it follows form Lemma 2.1 that ππ(π‘) β Ξ for all π‘ β [βΒ―π, π ] and π β πΌ. We have
the following lemma.
Lemma 2.2. There exists a positive real number πΏ3 > 0 such that for all π β πΌ,
β£ππ(π‘)β£, β£ππ(π‘) β π0(π‘)β£, max
π=1,...,πβ£π½
π,π(π‘)β£ β€ πΏ
3β£πβ£, π‘ β [0, π ]. (2.20)
Proof. For each π‘ β [0, π ], we have
β£ππ(π‘)β£ = β£ππ(π‘) β π(π‘)β£ β€β« π‘
0 β£π
π(π ) β π0(π )β£ππ , π‘ β [0, π ]. (2.21)
By using (2.A.1), the function π is Lipschitz continuous on Ξ Γ Ξ. Hence, there exists a real number π > 0 such that
β£ππ(π ) β π0(π )β£ β€ πβ£ππ(π )β£ + πβπ π=1
β£π½π,π(π )β£, π β [0, π ]. (2.22)
Let π β πΌ be arbitrary but ο¬xed. For each π β [0, π ],
β£π½π,π(π )β£ = β£ππ(π β π πβ π) β π(π β ππ)β£ β€ β£ππ(π β π πβ π) β ππ(π β ππ)β£ + β£ππ(π β ππ) β π(π β ππ)β£. Hence, by (2.11), β£π½π,π(π )β£ β€β« π½(π ) πΌ(π ) β£π π(π)β£ππ + β£ππ(π‘ β π π)β£, π β [0, π ], (2.23)
2.3 Preliminaries 27 where πΌ(π ) = min{π β ππ, π β ππβ π}, π½(π ) = max{π β ππ, π β ππβ π}. Clearly, β£π½(π ) β πΌ(π )β£ = π, π β [0, π ], (2.24) and [πΌ(π ), π½(π )] β [βΒ―π, π ], π β [0, π ]. (2.25) Substituting (2.24)-(2.25) into (2.23), and using Lemma 2.1, it yields
β£π½π,π(π )β£ β€ πΏ
2β£πβ£ + β£ππ(π β ππ)β£, π β [0, π ]. (2.26)
Substituting (2.26) into (2.22) gives
β£ππ(π ) β π0(π )β£ β€ πβ£ππ(π )β£ + πππΏ 2β£πβ£ + π π β π=1 β£ππ(π β π π)β£, π β [0, π ]. (2.27)
Substituting (2.27) into (2.21), it follows that
β£ππ(π‘)β£ β€ πππΏ 2β£πβ£π + β« π‘ 0 πβ£π π(π )β£ππ + πβπ π=1 β« π‘ 0 β£π π(π β π π)β£ππ β€ πππΏ2β£πβ£π + β« π‘ 0 πβ£π π(π )β£ππ + πβπ π=1 β« π‘βππ βππ β£ππ(π )β£ππ β€ πππΏ2β£πβ£π + π β« π‘ 0 β£π π(π )β£ππ + ππβ« π‘ βΒ―πβ£π π(π )β£ππ , π‘ β [0, π ].
Recall that ππ(π ) = 0 for all π β€ 0. Therefore,
β£ππ(π‘)β£ β€ πππΏ
2β£πβ£π + (π + 1)π
β« π‘
0 β£π
π(π )β£ππ , π‘ β [0, π ].
Thus, by Gronwall-Bellmanβs lemma,
β£ππ(π‘)β£ β€ π
2β£πβ£, π‘ β [0, π ],
where π2 = πππΏ2π exp{(π + 1)ππ }. Since ππ(π ) = 0 for all π β€ 0, it follows that
β£ππ(π‘)β£ β€ π
Substituting (2.28) into (2.26) yields
β£π½π,π(π )β£ β€ (πΏ
2+ π2)β£πβ£, π β [0, π ].
Substituting (2.28) into (2.27), we obtain
β£ππ(π ) β π0(π )β£ β€ πππΏ 2β£πβ£ + ππ2β£πβ£ + π π β π=1 π2β£πβ£ = (πππΏ2+ (π + 1)ππ2)β£πβ£, π β [0, π ].
Choose πΏ3 = max{πππΏ2+ (π + 1)ππ2, πΏ2 + π2, π2}. The proof is completed.
To proceed further, we need the following lemma. Lemma 2.3. For almost all π‘ β [0, π ], it holds that
lim
πβ0
π½π,π(π‘) β ππ(π‘ β π π)
π = βπ0(π‘ β ππ). (2.29)
Proof. Let π‘ β [0, π ] β ππ be arbitrary but ο¬xed. Then, for each π β πΌ β 0,
π½π,π(π‘) β ππ(π‘ β π π) = ππ(π‘ β ππβ π) β ππ(π‘ β ππ). Hence, by (2.11), π½π,π(π‘) β ππ(π‘ β π π) π = 1 π β« π‘βππβπ π‘βππ ππ(π )ππ .
We can write this equation as follows: π½π,π(π‘) β ππ(π‘ β π π) π = βπ0(π‘ β ππ) + π0(π‘ β ππ) + 1 π β« π‘βππβπ π‘βππ ππ(π )ππ = βπ0(π‘ β π π) + π(π), (2.30) where π(π) = 1π β« π‘βππβπ π‘βππ { ππ(π ) β π0(π‘ β π π)}ππ .
Then, by using triangle inequality,
β£π(π)β£ β€ 1 β£πβ£ β« π½1 πΌ1 β£ππ(π ) β π0(π )β£ππ + 1 β£πβ£ β« π½1 πΌ1 β£π0(π ) β π0(π‘ β π π)β£ππ , (2.31) where πΌ1 = min{π‘ β ππ, π‘ β ππβ π}, π½1 = max{π‘ β ππ, π‘ β ππβ π}.
2.3 Preliminaries 29 Clearly, π½1β πΌ1 = β£πβ£. Thus, by Lemma 2.2 and (2.31), it follows that, for each π β πΌ,
β£π(π)β£ β€ πΏ3β£πβ£ +β£πβ£1
β« π½1
πΌ1
β£π0(π ) β π0(π‘ β π
π)β£ππ . (2.32)
Since π‘ β= ππ, we need to consider the following two cases. Case 1: π‘ < ππ and Case 2:
π‘ > ππ.
Case 1: π‘ < ππ. Clearly,
π β πΌ, β£πβ£ < ππβ π‘ β [πΌ1, π½1] β [βΒ―π, 0]. (2.33)
Now, since π is continuously diο¬erentiable (recall (2.A.1)), and π» and π are bounded on [βΒ―π, π ] (recall (2.4) and (2.12)), we can show that π0 is Lipschitz continuous on [βΒ―π, 0].
Hence, there exists a real number π1 > 0 such that
β£π0(π ) β π0(π‘ β π
π)β£ β€ π1β£π β π‘ + ππβ£, π β [βΒ―π, 0] (2.34)
It follows from (2.33) and (2.34) that
β£π0(π ) β π0(π‘ β π
π)β£ β€ π1β£π β π‘ + ππβ£ β€ π1(π½1β πΌ1) = π1β£πβ£, π β [πΌ1, π½1],
when π β πΌ is suο¬ciently small. Substituting this inequality into (2.32) gives
β£π(π)β£ β€ (πΏ3+ π1)β£πβ£.
This shows that
lim
πβ0π(π) = 0, π β πΌ β 0, π β [βΒ―π, 0] (2.35)
Case 2: π‘ > ππ
Suppose π‘ > ππ. Clearly,
π β πΌ, β£πβ£ < π‘ β ππ β [πΌ1, π½1] β (0, π ]. (2.36)
Similarly, since π is continuously diο¬erentiable (recall (2.A.1)), and π» and π are bounded on [βΒ―π, π ] (recall (2.4) and (2.12)), we can show that π0 is Lipschitz continuous on (0, π ].
Hence, there exists a real number π2 > 0 such that
β£π0(π ) β π0(π‘ β π
It follows form (2.36) and (2.37) that
β£π0(π ) β π0(π‘ β π
π)β£ β€ π2β£π β π‘ + ππβ£ β€ π2(π½1β πΌ1) = π2β£πβ£, π β [πΌ1, π½1],
when π β πΌ is suο¬ciently small.
Substituting this inequality into (2.32) gives
β£π(π)β£ β€ (πΏ3+ π2)β£πβ£, π β (0, π ].
This shows that
lim
πβ0π(π) = 0, π β πΌ β 0, π β (0, π ] (2.38)
Applying (2.35) and (2.38) to (2.30) completes the proof.
2.4 Gradient computation
Problem (P) involves choosing a ο¬nite number of decision variables to minimize the cost function (2.7). Thus, in principle, Problem (P) can be viewed as a nonlinear programming problem. Standard algorithms for solving nonlinear programming problemsβfor example, sequential quadratic programming or interior-point methods [79]βtypically require the gradient of the cost function, which is diο¬cult to determine in Problem (P) because the delays and parameters inο¬uence (2.7) implicitly through the dynamic system (2.1)-(2.2). The aim of this section is to develop an eο¬cient computational method for computing the gradient of the cost function in Problem (P). This method, which is inspired by earlier works in [117,125,126], can be integrated with a standard nonlinear programming algorithm to solve Problem (P).
2.4.1 State variation with respect to time-delays
The solution of system (2.1)-(2.2) is normally viewed as a function of time, with π and π» being ο¬xed vectors. By ο¬xing π‘ β (ββ, π ] while allowing π and π» to vary, we obtain a new function π(π‘β£β , β ) : π― Γ π΅ β βπ whose value at (π , π») β π― Γ π΅ is π(π‘β£π , π»). In
the following theorem, we show that π(π‘β£β , β ) is diο¬erentiable with respect to the time- delays. This result is central to the development of a computational procedure for solving Problem (P).
Theorem 2.1. Let π‘ β (0, π ] be a ο¬xed time point. Then, π(π‘β£β , β ) is diο¬erentiable with
respect to ππ on π― Γ π΅. In fact, for each (π , π») β π― Γ π΅,
βπ(π‘β£π , π») βππ = Ξ