The Method of Characteristics in Inverse Problems of Dynamics

Nina N. Subbotina∗, Timofey B. Tokmantsev

Institute of Mathematics and Mechanics UrB RAS, Ural Federal University, Ekaterinburg, Russia

∗Corresponding Author: [email protected]

Abstract

Perturbed inverse problems of dynamics are considered for dynamical systems that are linear in the controls, when a sampling history of the trajectory and an estimate of the sampling error are known. Auxiliary optimal control problems are introduced to minimize a regularized integral discrepancy functional in the domain of admissible motions. The trajectories of the system are constructed with the help of characteristics of the Bellman equation in the domain of admissible motions. It is proven that the characteristics minimizing the functional are solutions of the perturbed inverse problem of dynamics.

Keywords

Inverse Problems of Dynamics, Ridge Regression, Dynamic Programming, Method of Characteristics

1 Introduction

Perturbed inverse problems for controlled dynamical systems are considered. A sampling history of the real trajectory is known. This trajectory is generated by a control that is not known. Moreover, the deviation of the samples from the real trajectory satisfies a known estimate of the sampling error. The inverse problem with perturbed (inaccurate) sampling of the trajectory is to reconstruct controls that are close to the control which generates the real trajectory and has the least norm in $L_2$.

A solution of the problem was considered, for example, in [2]. The approach suggested there is based on a feedback constructed in forward time using a dynamical guide. It has roots in the work of N.N. Krasovskii's school on the theory of optimal feedbacks and can be considered as a variation of A.N. Tikhonov's regularization method [6].

In the present work another approach is suggested. It is also based on the theory of optimal feedbacks, applied to problems with regularized integral discrepancy functionals. The approach uses the backward procedures of dynamic programming, the method of characteristics and optimal synthesis [3].

2 Unperturbed inverse problems

We consider the following controlled system

$$\frac{dx(t)}{dt} = g(t, x) + f(t, x)u, \quad t \in [0, T], \tag{1}$$

where $x \in \mathbb{R}^n$ is the state vector and the controls $u \in \mathbb{R}^n$ are restricted, namely

$$u \in U = \{u_i \in [a_i^-, a_i^+],\ a_i^- < a_i^+,\ i = 1, 2, \dots, n\}. \tag{2}$$

We denote the set of admissible controls by the symbol $\mathcal{U}$:

$$\mathcal{U} = \{u(t) \in U,\ t \in [0, T] : u(\cdot) \text{ is measurable}\}.$$

Let $x_*(\cdot) : [0, T] \to \mathbb{R}^n$ be the real trajectory of system (1). The inverse problem of dynamics consists in finding a control $u(\cdot) \in \mathcal{U}$ that generates a trajectory $x(t) = x(t; 0, x_*(0), u(\cdot))$ of system (1) such that $x(t) = x_*(t)$ for all $t \in [0, T]$.

Lemma 1. The set $\mathcal{U}$ of admissible controls is closed in $L_2$.

Proof. Let $u_n(\cdot) \in \mathcal{U}$ and $\lim_{n\to\infty} \|u_n(\cdot) - v(\cdot)\|_{L_2} = 0$, where the symbol $\|\cdot\|_{L_2}$ denotes the norm in $L_2$.

As is known [5], from a sequence that converges strongly in $L_2$ one can extract a subsequence that converges to $v(\cdot)$ almost everywhere. Let the sequence $\{u_{n_k}(t)\}$ converge to $v(t)$ almost everywhere on $[0, T]$. Since $u_{n_k}(t) \in U$ and the set $U$ is closed in $\mathbb{R}^n$, we have $v(t) \in U$ almost everywhere on $[0, T]$, that is, $v(\cdot) \in \mathcal{U}$.

Using this lemma and the strong convexity of the square of the norm $\|u(\cdot)\|_{L_2}$ in $L_2$, it is not difficult to show that the subset $\mathbf{U} \subset \mathcal{U}$ of measurable controls $u(\cdot) : [0, T] \to U$ that solve the inverse problem for system (1), (2) has the following properties.

Lemma 2. The set $\mathbf{U}$ of admissible controls solving the inverse problem is nonempty, convex and bounded in $L_2$. Moreover, there exists a unique element $u_*(\cdot) \in \mathbf{U}$ with the minimal norm in $L_2$.

2.1 Perturbed inverse problems

In a perturbed inverse problem the real trajectory $x_*(\cdot)$ of system (1) is not known. Instead, we know a continuous function $y(\cdot) : [0, T] \to \mathbb{R}^n$, which is the sampling history of the state variable $x_*(t)$. It is also known that the real trajectory of system (1) belongs to the domain $\Omega_\delta$ of admissible errors of the samples:

$$(t, x_*(t)) \in \Omega_\delta = \{(t, x) : t \in [0, T],\ \|x - y(t)\| \le \delta\}, \tag{3}$$

where $\delta > 0$ is the parameter of the sampling error and the symbol $\|z\|$ means the Euclidean norm of a finite-dimensional vector $z$.

The perturbed inverse problem is to reconstruct the real trajectory $x_*(\cdot)$ of the system as accurately as possible, using the sampling history $y(\cdot)$. In other words, we need to build a control $u^\delta(\cdot) : [0, T] \to U$ and a trajectory $x^\delta(\cdot) : [0, T] \to \mathbb{R}^n$ of system (1) that belongs to the compact domain $\Phi(\delta)$ of admissible motions,

$$\Omega_\delta \subset \Phi(\delta) \subset \mathrm{cl}\,\Pi_T = [0, T] \times \mathbb{R}^n,$$

$$\Phi_0(\delta) = \{x_0 : \|x_0 - y(0)\| \le \delta\}, \tag{4}$$

$$\Phi_t(\delta) = \{x = x(t; 0, x_0, u(\cdot)) : x_0 \in \Phi_0(\delta),\ u(\cdot) \in \mathcal{U}\}, \quad t \in [0, T],$$

such that, as $\delta \to 0$,

$$(t, x^\delta(t)) \in \Phi(\delta), \quad \forall t \in [0, T], \tag{5}$$

$$d_H(\Phi(\delta), \mathrm{gr}\,x_*(\cdot)) \to 0, \quad \delta \to 0, \tag{6}$$

$$\|x^\delta(\cdot) - x_*(\cdot)\|_C = \max_{t \in [0, T]} \|x^\delta(t) - x_*(t)\| \to 0, \tag{7}$$

$$\|u^\delta(\cdot) - u_*(\cdot)\|_{L_2}^2 = \int_0^T \|u^\delta(t) - u_*(t)\|^2\,dt \to 0, \tag{8}$$

where $d_H(\cdot)$ is the Hausdorff distance, the symbol $\mathrm{gr}\,x_*(\cdot)$ means the graph of the function $x_*(\cdot)$, $\|\cdot\|_C$ is the norm in the space of continuous functions, and $\|\cdot\|_{L_2}$ is the norm in the space $L_2$.
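On a uniform time grid the two convergence measures (7) and (8) can be evaluated approximately. The following is a minimal sketch, assuming trajectories and controls are stored as lists of coordinate tuples sampled with step `h`; the function names and the left-rectangle quadrature are illustrative choices, not part of the paper.

```python
import math

def sup_norm_error(x_rec, x_true):
    """Discrete analogue of the C-norm in (7): largest Euclidean distance over the grid."""
    return max(math.dist(a, b) for a, b in zip(x_rec, x_true))

def l2_error_squared(u_rec, u_true, h):
    """Left-rectangle approximation of the squared L2 distance in (8), uniform step h."""
    return h * sum(math.dist(a, b) ** 2 for a, b in zip(u_rec, u_true))

# Hypothetical data: a trajectory shifted by 0.01 in each coordinate,
# and a control whose first component is off by 0.1 on [0, 1].
h = 0.01
x_true = [(math.sin(i * h), math.cos(i * h)) for i in range(101)]
x_rec = [(a + 0.01, b + 0.01) for a, b in x_true]
u_true = [(0.0, -9.8)] * 100
u_rec = [(0.1, -9.8)] * 100

print(sup_norm_error(x_rec, x_true))      # about 0.01 * sqrt(2)
print(l2_error_squared(u_rec, u_true, h))  # about 0.01
```

Both quantities should tend to zero along a sequence of reconstructions if (7) and (8) hold.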

2.2 Assumptions

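The problem is considered under the following assumptions.

A. The functions $g_i(t, x)$, $f_{i,j}(t, x)$, $i, j \in \overline{1, n}$, are continuously differentiable in $(0, T) \times \mathbb{R}^n$ and have the sublinear growth property

$$\|f(t, x)\| \le K_1(1 + \|x\|), \qquad \|g(t, x)\| \le K_1(1 + \|x\|),$$

where $K_1 > 0$, $(t, x) \in \mathrm{cl}\,\Pi_T$.

Since the domain $\Phi(\delta)$ is compact, the following bounds hold in the domain $\Phi(\delta)$ (4):

$$\max_{i,j \in \overline{1,n}}\ \max_{(t,x) \in \Phi(\delta)} \{|g_i(t, x)|,\ |f_{i,j}(t, x)|\} \le K_2, \tag{9}$$

$$\max_{i,j,k \in \overline{1,n}}\ \max_{(t,x) \in \Phi(\delta)} \left\{ \left|\frac{\partial g_i(t, x)}{\partial t}\right|,\ \left|\frac{\partial g_i(t, x)}{\partial x_k}\right|,\ \left|\frac{\partial f_{ij}(t, x)}{\partial t}\right|,\ \left|\frac{\partial f_{ij}(t, x)}{\partial x_k}\right| \right\} \le K_3, \tag{10}$$

where $K_2 > 0$, $K_3 > 0$.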

3 A construction of solutions of the perturbed inverse problem

3.1 Regularized optimal control problems

We introduce the auxiliary optimal control problems for system (1), (2) in the domain (4) to minimize the discrepancy functionals

$$I_{t_0, x_0}(u(\cdot)) = \int_{t_0}^{T} \left[ \frac{\|x(t) - y(t)\|^2}{2} + \frac{\alpha}{2}\|u(t)\|^2 \right] dt, \tag{11}$$

where $\alpha > 0$ is a regularizing parameter, $(t_0, x_0) \in \Phi(\delta)$, and $x(t) = x(t; t_0, x_0, u(\cdot))$ is the trajectory of system (1) that starts at the point $x(t_0) = x_0$ and is generated by an admissible control $u(\cdot) : [t_0, T] \to U$.
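For sampled data the functional (11) can be approximated by a quadrature sum. The sketch below assumes a uniform grid with step `h`, samples stored as tuples, and a left-rectangle rule; the function name and the data layout are hypothetical and serve only to make the two terms of (11) concrete.

```python
import math

def discrepancy_functional(x, y, u, h, alpha):
    """Left-rectangle approximation of (11) on a uniform grid with step h.

    x, y, u are lists of coordinate tuples sampled at the same grid nodes.
    """
    total = 0.0
    for x_i, y_i, u_i in zip(x, y, u):
        mismatch = math.dist(x_i, y_i) ** 2 / 2.0        # ||x(t) - y(t)||^2 / 2
        effort = alpha * sum(c * c for c in u_i) / 2.0   # (alpha / 2) ||u(t)||^2
        total += h * (mismatch + effort)
    return total

# Constant hypothetical data on [0, 1]: mismatch term 0.5, effort term 1.0.
value = discrepancy_functional(
    x=[(1.0, 0.0)] * 10, y=[(0.0, 0.0)] * 10, u=[(2.0, 0.0)] * 10,
    h=0.1, alpha=0.5,
)
print(value)  # about 1.5
```

A smaller `alpha` puts more weight on matching the samples `y` and less on keeping the control small.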

The Hamiltonian $H^\alpha(t, x, s)$ for problem (1), (2), (11) has the form

$$H^\alpha(t, x, s) = \min_{u \in U} \left[ \langle s, g(t, x)\rangle + \langle s, f(t, x)u\rangle + \frac{\alpha}{2}\|u\|^2 + \frac{\|x - y(t)\|^2}{2} \right], \tag{12}$$

where the symbol $\langle\cdot,\cdot\rangle$ means the inner product of two finite-dimensional vectors. It is not difficult to see that the following relations are true:

$$H^\alpha(t, x, s) = \langle s, g(t, x)\rangle + \frac{\|x - y(t)\|^2}{2} + \langle s, f(t, x)u^\alpha\rangle + \frac{\alpha}{2}\|u^\alpha\|^2. \tag{13}$$

Here $u^\alpha = (u_1^\alpha, \dots, u_n^\alpha)$:

$$u_i^\alpha = \begin{cases} a_i^-, & \text{if } r_i^\alpha(t, x, s) \le a_i^-, \\ r_i^\alpha(t, x, s), & \text{if } r_i^\alpha(t, x, s) \in [a_i^-, a_i^+], \\ a_i^+, & \text{if } r_i^\alpha(t, x, s) \ge a_i^+, \end{cases} \tag{14}$$

where

$$r_i^\alpha(t, x, s) = -\frac{1}{\alpha} \sum_{j=1}^{n} s_j f_{ji}(t, x), \quad i \in \overline{1, n}.$$

The introduction of the regularizing parameter $\alpha$ in (11) ensures that the minimizer $u^\alpha(t, x, s)$ in the expression for the Hamiltonian $H^\alpha(t, x, s)$ is unique for any vector $s \in \mathbb{R}^n$. It also ensures that the extremal control $u^\alpha(t, x, s)$ is continuous on $\mathrm{cl}\,\Pi_T \times \mathbb{R}^n$.
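Formula (14) is a componentwise projection of the unconstrained minimizer $r^\alpha$ onto the box $U$. A minimal sketch follows, assuming the matrix $f(t, x)$ has already been evaluated and is passed as a list of rows; the function name and argument layout are illustrative assumptions.

```python
def extremal_control(s, f, alpha, a_minus, a_plus):
    """u^alpha from (14): clip r_i = -(1/alpha) * sum_j s_j f[j][i] to [a_i^-, a_i^+].

    s       : adjoint vector (list of n floats)
    f       : matrix f(t, x) evaluated at the current point (list of n rows)
    a_minus : lower bounds a_i^-;  a_plus : upper bounds a_i^+
    """
    n = len(s)
    u = []
    for i in range(n):
        r_i = -sum(s[j] * f[j][i] for j in range(n)) / alpha
        u.append(min(max(r_i, a_minus[i]), a_plus[i]))
    return u

identity = [[1.0, 0.0], [0.0, 1.0]]
# r = (-2, -5): both components fall outside the box and are clipped.
print(extremal_control([0.02, 0.05], identity, 0.01, [0.0, -10.0], [1.0, -9.0]))
# r = (0.5, -9.5): both components lie inside the box and are kept.
print(extremal_control([-0.005, 0.095], identity, 0.01, [0.0, -10.0], [1.0, -9.0]))
```

Because clipping is a continuous operation, this form also makes the continuity of $u^\alpha$ in $(t, x, s)$ visible.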

3.2 The characteristic system

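We consider the characteristic system for problem (1), (2), (11)

$$\frac{dx_i}{dt} = \frac{\partial H^\alpha(t, x, s)}{\partial s_i}, \quad i \in \overline{1, n},\ t \in [0, T], \tag{15}$$

$$\frac{ds_i}{dt} = -\frac{\partial H^\alpha(t, x, s)}{\partial x_i}, \quad i \in \overline{1, n},\ t \in [0, T], \tag{16}$$

with boundary conditions

$$x(T) = \xi, \quad s(T) = 0, \quad \xi \in \Phi_T(\delta). \tag{17}$$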

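Here

$$\frac{\partial H^\alpha(t, x, s)}{\partial s_i} = g_i(t, x) + \sum_{j=1}^{n} f_{ij}(t, x)u_j^\alpha(t, x, s), \tag{18}$$

$$\frac{\partial H^\alpha(t, x, s)}{\partial x_i} = [x_i - y_i(t)] + \langle s, D_{x_i}g(t, x)\rangle + \langle s, D_{x_i}f(t, x)u^\alpha(t, x, s)\rangle + R_i(t, x, s), \tag{19}$$

where

$$R_i(t, x, s) = \langle f^\top(t, x)s + \alpha u^\alpha(t, x, s),\ D_{x_i}u^\alpha(t, x, s)\rangle, \tag{20}$$

$$D_{x_i}g(t, x) = \left( \frac{\partial g_1(t, x)}{\partial x_i}, \dots, \frac{\partial g_n(t, x)}{\partial x_i} \right), \qquad D_{x_i}f(t, x) = \left\{ \frac{\partial f_{kj}(t, x)}{\partial x_i} \right\}, \quad k, j \in \overline{1, n},$$

$$D_{x_i}u^\alpha(t, x, s) = \left( \frac{\partial u_1^\alpha(t, x, s)}{\partial x_i}, \dots, \frac{\partial u_n^\alpha(t, x, s)}{\partial x_i} \right),$$

and the notation $\top$ means the operation of transposition.

The following relations follow from (14) for any $j \in \overline{1, n}$: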

• If the relations

$$u_j^\alpha(t, x, s) = r_j^\alpha(t, x, s) \in [a_j^-, a_j^+]$$

hold on a time interval $[t_1, t_2] \subset [0, T]$, then the $j$-th term of the sum in the inner product that defines $R_i(t, x, s)$, $i = 1, \dots, n$, in (20) is equal to zero, since

$$f_j^\top(t, x)s + \alpha u_j^\alpha(t, x, s) = 0, \tag{21}$$

where the symbol $f_j^\top(t, x)$ means the $j$-th row of the matrix $f^\top(t, x)$.

• If either the relations

$$r_j^\alpha(t, x(t), s(t)) \le a_j^-, \quad u_j^\alpha(t, x(t), s(t)) \equiv a_j^-,$$

or the relations

$$r_j^\alpha(t, x(t), s(t)) \ge a_j^+, \quad u_j^\alpha(t, x(t), s(t)) \equiv a_j^+$$

hold on an interval $(t_1, t_2) \subset [0, T]$, then

$$D_{x_i}u_j^\alpha(t, x(t), s(t)) = 0. \tag{22}$$

Thus, from (20), (21), (22), it follows that

$$R_i(t, x, s) = 0, \quad \forall\, i \in \overline{1, n}. \tag{23}$$

One can summarize the foregoing in the following statement:

Lemma 3. The characteristic system (15), (16) for problem (1), (2), (11) has the form

$$\frac{dx}{dt} = g(t, x) + f(t, x)u^\alpha(t, x, s), \tag{24}$$

$$\frac{ds}{dt} = -\left[ (x - y(t)) + s^\top D_x g(t, x) + s^\top D_x f(t, x)u^\alpha(t, x, s) \right], \tag{25}$$

where

$$s^\top D_x f(t, x)u^\alpha(t, x, s) = \left( s^\top D_{x_1}f(t, x)u^\alpha(t, x, s), \dots, s^\top D_{x_n}f(t, x)u^\alpha(t, x, s) \right).$$
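As an illustration, system (24), (25) can be written out for the two-dimensional system of Section 5, where $g(t, x) = (x_2, 0)^\top$ and $f(t, x)$ is the identity matrix. The specialization below is a sketch derived here from Lemma 3 and formula (14); it is not stated in this form in the text. Since $f$ is constant, the term with $D_x f$ vanishes, and the only nonzero derivative of $g$ is $\partial g_1 / \partial x_2 = 1$.

```latex
\begin{aligned}
\dot x_1 &= x_2 + u_1^\alpha, & \dot x_2 &= u_2^\alpha,\\
\dot s_1 &= -\bigl(x_1 - y_1(t)\bigr), & \dot s_2 &= -\bigl(x_2 - y_2(t)\bigr) - s_1,\\
u_i^\alpha &= \min\bigl\{\max\{-s_i/\alpha,\ a_i^-\},\ a_i^+\bigr\}, & i &= 1, 2.
\end{aligned}
```

The adjoint variables are driven by the mismatch $x - y(t)$, and the control is the clipped feedback $-s_i / \alpha$.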

3.3 Properties of solutions of the characteristic system

We consider the state components $x(t, \xi)$ of the solutions of the characteristic system (15), (16) (hereinafter called state characteristics). We denote by $X(\alpha, \delta)$ the set of state characteristics whose graphs $\{(t, x(t, \xi))\}$, $t \in [0, T]$, belong to the domain $\Phi(\delta)$ (4).

The following statements are true:

Lemma 4. Let conditions A hold in problem (1), (2), (11). Then there exists a constant $K_4 = K_4(\delta, \alpha) > 0$ such that the following relations hold for all characteristics (24), (25) satisfying the inclusions $(t, x(t, \xi)) \in \Phi(\delta)$, $t \in [0, T]$:

$$|x_i(t, \xi)| \le K_4, \quad \left| \frac{dx_i(t, \xi)}{dt} \right| \le K_4, \quad t \in [0, T],\ i \in \overline{1, n}; \tag{26}$$

$$|s_i(t, \xi)| \le K_4, \quad \left| \frac{ds_i(t, \xi)}{dt} \right| \le K_4, \quad t \in [0, T],\ i \in \overline{1, n}. \tag{27}$$

It is easy to prove the following statement.

Lemma 5. Let conditions A hold in problem (1), (2), (11). Then the extremal control $u^\alpha(t, x, s)$ (14) has the following properties:

• The functions $(t, x, s) \to u_i^\alpha(t, x, s)$, $i \in \overline{1, n}$, are continuous in $\Phi(\delta) \times S_\delta$, where

$$S_\delta = \{s \in \mathbb{R}^n : |s_i| \le K_2,\ i \in \overline{1, n}\}. \tag{28}$$

• For any $i \in \overline{1, n}$, $(t', x'), (t'', x'') \in \Phi(\delta)$, $s', s'' \in S_\delta$ the following estimates are true:

$$|u_i^\alpha(t', x', s') - u_i^\alpha(t'', x'', s'')| \le \frac{1}{\alpha}\varphi(|t' - t''|, \|x' - x''\|, \|s' - s''\|), \tag{29}$$

where

$$\varphi(|t' - t''|, \|x' - x''\|, \|s' - s''\|) \to 0 \tag{30}$$

as $|t' - t''| \to 0$, $\|x' - x''\| \to 0$, $\|s' - s''\| \to 0$.

3.4 A solution of the optimal feedback problem

We construct the optimal feedback (the optimal synthesis) $(t, x) \to u^0(t, x) : \Phi(\delta) \to U$ for problem (1), (2), (11). We introduce the set $X^0(\alpha, \delta)$ of optimal trajectories $x^0(\cdot)$ for problem (1), (2), (11) that are generated by this synthesis and satisfy the condition

$$(t, x^0(t)) \in \Phi(\delta).$$

It follows from condition A that the set $X^0(\alpha, \delta)$ is nonempty and compact in $C^n[0, T]$.

It follows from the definition of $\Phi(\delta)$ and the paper [4] that the set $X^0(\alpha, \delta)$ is a subset of the set $X(\alpha, \delta)$ of all such state characteristics.

Finally, we pick the characteristics $x^\delta(\cdot) = x(\cdot, \xi) \in X^0(\alpha, \delta) \subset X(\alpha, \delta)$ and the realizations of extremal feedbacks $u_\delta^\alpha[t] = u^\alpha(t, x^\delta(t), s^\delta(t))$ generating them that satisfy the relations

$$I_{0, x^\delta(0)}(u_\delta^\alpha(\cdot)) = \min_{x(\cdot, \xi) \in X(\alpha, \delta)} I_{0, x(0)}(u^\alpha[\cdot]) = V(\alpha, \delta), \tag{31}$$

$$u^\alpha[\cdot] = u^\alpha(t, x(t, \xi), s(t, \xi)).$$

The main result of this paper is the proof of the fact that these characteristics $x^\delta(\cdot)$ and the realizations of extremal feedbacks $u_\delta^\alpha[t] = u^\alpha(t, x^\delta(t), s^\delta(t))$ generating them give the solution of the perturbed inverse problem of dynamics. The next part of the paper contains this proof.

4 Justification for the solution of the perturbed inverse problems

4.1 Auxiliary results

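Lemma 6. The square of the norm in $L_2$ is weakly lower semicontinuous.

Proof. First we show that the square of the norm in $L_2$ is Gâteaux differentiable.

Let the symbol $\langle x, h\rangle_{L_2}$ denote the inner product in $L_2$. It is easy to see that

$$\frac{\|x + \varepsilon h\|_{L_2}^2 - \|x\|_{L_2}^2}{\varepsilon} = \frac{2\varepsilon\langle x, h\rangle_{L_2} + \varepsilon^2\langle h, h\rangle_{L_2}}{\varepsilon} \to 2\langle x, h\rangle_{L_2}$$

as $\varepsilon \to 0$.

So the derivative of the square of the norm in $L_2$ at the element $x$ in the direction $h$ is equal to $2\langle x, h\rangle_{L_2}$ for each $h \in L_2$. The Gâteaux derivative is equal to $2x$, and it is a monotone operator. Let us consider $\theta(x) = \|x\|_{L_2}^2$, $\theta'(x) = 2x$.

We show that for all $x, y \in L_2$ the following inequality holds:

$$\theta(y) - \theta(x) \ge \langle\theta'(x), y - x\rangle_{L_2}.$$

By the monotonicity of the Gâteaux derivative, for any $t_1, t_2 \in \mathbb{R}$, $t_1 > t_2$, one obtains

$$\langle\theta'(x + t_1(y - x)), y - x\rangle_{L_2} \ge \langle\theta'(x + t_2(y - x)), y - x\rangle_{L_2}.$$

We introduce the function $\omega(t) = \theta(x + t(y - x))$, whose derivative $\omega'(t) = \langle\theta'(x + t(y - x)), y - x\rangle_{L_2}$ is therefore nondecreasing. According to the Lagrange mean value formula, we have

$$\omega(1) - \omega(0) = \theta(y) - \theta(x) = \omega'(\lambda) \ge \omega'(0) = \langle\theta'(x), y - x\rangle_{L_2}, \quad \lambda \in (0, 1).$$

If $x = x_0$, $y = x_n$ and $x_n \to x_0$ weakly as $n \to \infty$, then the previous relations imply

$$\theta(x_0) \le \theta(x_n) + \langle\theta'(x_0), x_0 - x_n\rangle_{L_2}.$$

Let us pass to the lower limit in this relation as $n \to \infty$. Due to the weak convergence of $\{x_n\}$ to $x_0$ we have

$$\langle\theta'(x_0), x_0 - x_n\rangle_{L_2} \to 0.$$

Finally,

$$\|x_0\|_{L_2}^2 \le \liminf_{n \to \infty} \|x_n\|_{L_2}^2.$$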
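The inequality of Lemma 6 can be strict. A standard illustration, added here and not taken from the paper, is the sequence $u_n(t) = \sin(nt)$ on $[0, 2\pi]$: it converges weakly to zero, while $\|u_n\|_{L_2}^2 = \pi$ for every $n$. The sketch below checks this numerically with a midpoint rule; pairing with the single test function $h(t) = t$ only illustrates weak convergence and does not prove it.

```python
import math

def inner(f, g, a=0.0, b=2.0 * math.pi, m=20000):
    """Midpoint-rule approximation of the L2 inner product of f and g on [a, b]."""
    step = (b - a) / m
    total = 0.0
    for k in range(m):
        t = a + (k + 0.5) * step
        total += f(t) * g(t)
    return step * total

def u(n):
    """The function u_n(t) = sin(n t)."""
    return lambda t: math.sin(n * t)

def probe(t):
    """One fixed test function h(t) = t (an arbitrary choice)."""
    return t

# Pairings with the test function shrink like -2*pi/n ...
pairings = [inner(u(n), probe) for n in (1, 10, 100)]
# ... while the squared norms stay at pi, strictly above the limit's norm 0.
norms_sq = [inner(u(n), u(n)) for n in (1, 10, 100)]
print(pairings)
print(norms_sq)
```

This is why the proof of the main theorem needs the separate upper bound (35) in addition to weak convergence.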

We consider the procedure of numerical solution of the characteristic system (15), (16). We denote the step of numerical integration of the characteristic system by $h$,

$$\Gamma = \{t_i = ih\}, \quad i \in \overline{0, N(h)}, \quad (N(h) + 1)h = T.$$

Let $x_h(\cdot)$, $s_h(\cdot)$ be the numerical approximations of the solutions $x(t) = x(t, \xi)$, $s(t) = s(t, \xi)$ of the characteristic system (15), (16) with the boundary conditions

$$x_h(T) = \xi, \quad \xi \in \Phi_T(\delta), \quad s_h(T) = 0$$

and with the restriction

$$(t, x(t)) \in \Phi(\delta), \quad t \in [0, T].$$

The numerical solutions of the characteristic system (15), (16) are called Euler polygonal paths.
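One simple way to build such a polygonal path is to step backward in time from $t = T$. The sketch below does this for the system of Section 5 ($g = (x_2, 0)^\top$, $f$ the identity, $u_1 \in [0, 1]$, $u_2 \in [-10, -9]$), with right-hand sides obtained from (24), (25) and (14). It is an assumption-laden illustration: the paper does not spell out its exact scheme, the function names are hypothetical, and the sketch neither enforces the restriction $(t, x(t)) \in \Phi(\delta)$ nor performs the minimization over $\xi$ in (31).

```python
def clip(value, low, high):
    """Project a scalar onto [low, high], as in (14)."""
    return min(max(value, low), high)

def y(t):
    """Sampling history of Section 5 (free fall)."""
    return (-9.8 * t * t / 2.0 + 480.0, -9.8 * t)

def rhs(t, x, s, alpha):
    """Right-hand sides of (24), (25) for g = (x2, 0) and f = identity."""
    u = (clip(-s[0] / alpha, 0.0, 1.0), clip(-s[1] / alpha, -10.0, -9.0))
    y1, y2 = y(t)
    dx = (x[1] + u[0], u[1])
    ds = (-(x[0] - y1), -(x[1] - y2) - s[0])
    return dx, ds, u

def euler_polygon(xi, T, h, alpha):
    """Step backward from t = T with x(T) = xi and s(T) = 0.

    Returns a list of nodes (t, x, s, u) ordered by increasing t.
    """
    n_steps = round(T / h)
    x, s = tuple(xi), (0.0, 0.0)
    path = []
    for k in range(n_steps, 0, -1):
        t = k * h
        dx, ds, u = rhs(t, x, s, alpha)
        path.append((t, x, s, u))
        # One explicit Euler step in reversed time: z(t - h) = z(t) - h * z'(t).
        x = (x[0] - h * dx[0], x[1] - h * dx[1])
        s = (s[0] - h * ds[0], s[1] - h * ds[1])
    path.append((0.0, x, s, rhs(0.0, x, s, alpha)[2]))
    path.reverse()
    return path

T, h, alpha = 1.0, 0.001, 0.01
path = euler_polygon(y(T), T, h, alpha)
print(len(path), path[-1][1], path[-1][2])
```

Each terminal point $\xi$ gives one candidate path; selecting among them according to (31) is a separate step.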

The results of [3], [4] imply the following statement:

Lemma 7. Let $x_h(t)$, $s_h(t)$ be numerical approximations of exact solutions $x(t)$, $s(t)$ of the characteristic system (15), (16) such that $(t, x(t)) \in \Phi(\delta)$. Then there exist constants $M_1 = M_1(\Phi(2\delta))$, $M_2 = M_2(\Phi(2\delta))$ such that for any $t \in [0, T]$ the following estimates are true:

$$(t, x_h(t)) \in \Phi(2\delta), \quad \|x_h(t) - x(t)\| \le M_1 h, \quad \|s_h(t) - s(t)\| \le M_2 h,$$

$$M_i(\Phi(2\delta)) \le M_i(\Phi(2\delta_0)) \quad \forall \delta \in (0, \delta_0),\ i = 1, 2.$$

4.2 The main result

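We consider the characteristics $x^\delta(\cdot)$ and the realizations of extremal feedbacks $u_\delta^\alpha[t] = u^\alpha(t, x^\delta(t), s^\delta(t))$ generating them that satisfy condition (31). Let $x_h^\delta(\cdot)$, $u_h^\delta(\cdot)$ be the numerical approximations of the solution of the inverse problem of dynamics (1)–(3) that approximate the exact solution $x^\delta(\cdot)$, $u_\delta^\alpha(\cdot)$. We have $u_h^\delta(t) = u^\alpha(t_i, x_h(t_i), s_h(t_i))$ on the interval $t \in [t_i, t_{i+1}]$, $i \in \overline{0, N(h)}$.

We estimate

$$\Delta I = \int_0^T \frac{\alpha}{2}\left[ \|u_h^\delta(t)\|^2 - \|u_*(t)\|^2 \right] dt = \Delta I_1 + \Delta I_2 + \Delta I_3,$$

$$\Delta I_1 = \frac{\alpha}{2}\sum_{i=0}^{N(h)} \int_{ih}^{(i+1)h} \left[ \|u^\alpha(t_i, x_h(t_i), s_h(t_i))\|^2 - \|u^\alpha(t, x_h(t), s_h(t))\|^2 \right] dt,$$

$$\Delta I_2 = \frac{\alpha}{2}\int_0^T \left[ \|u^\alpha(t, x_h(t), s_h(t))\|^2 - \|u^\alpha(t, x(t), s(t))\|^2 \right] dt,$$

$$\Delta I_3 = \frac{\alpha}{2}\int_0^T \left[ \|u^\alpha(t, x(t), s(t))\|^2 - \|u_*(t)\|^2 \right] dt.$$

Using the convexity of the square of the norm and the triangle inequality, we can continue to estimate $\Delta I_1$ and $\Delta I_2$:

$$\Delta I_1 \le \frac{\alpha}{2}\sum_{i=0}^{N(h)} \int_{ih}^{(i+1)h} \left\| u^\alpha(t_i, x_h(t_i), s_h(t_i)) - u^\alpha(t, x_h(t), s_h(t)) \right\|^2 dt,$$

$$\Delta I_2 \le \frac{\alpha}{2}\int_0^T \left\| u^\alpha(t, x_h(t), s_h(t)) - u^\alpha(t, x(t), s(t)) \right\|^2 dt.$$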

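Using the estimates for $\Delta I$, $\Delta I_1$, $\Delta I_2$ and Lemmas 4–7, we prove the following result.

Theorem 1. Let conditions A hold in the perturbed inverse problem of dynamics (1)–(3), and let the parameters of the problem $h = h(\delta) > 0$, $\alpha = \alpha(\delta) > 0$, $\delta > 0$ satisfy the conditions

$$\lim_{\delta \to 0} \frac{2}{\alpha}\left( \phi(\delta, h) + \rho(\delta, h) + T\delta^2 \right) = 0, \tag{32}$$

$$\lim_{\delta \to 0} h(\delta) = 0, \quad \lim_{\delta \to 0} \alpha(\delta) = 0.$$

Then the following relations are true

$$\lim_{\delta \to 0} \|x_{h(\delta)}^\delta - x_*\|_C = 0, \quad \lim_{\delta \to 0} \|u_{h(\delta)}^\delta - u_*\|_{L_2} = 0 \tag{33}$$

for the functions $x_h^\delta(\cdot)$, $u_h^\delta(\cdot)$.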

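Proof. Since the trajectory $x^\delta(\cdot)$ belongs to the domain $\Phi(\delta)$ of admissible motions and Lemma 7 holds, we can estimate the deviation of the numerical approximation $x_{h(\delta)}^\delta(\cdot)$ from the real trajectory $x_*(\cdot)$:

$$\|x_h^\delta(t) - x_*(t)\| \le \|x_h^\delta(t) - x^\delta(t)\| + \|x^\delta(t) - x_*(t)\| \le M_1 h + d_H(\Phi(\delta), \mathrm{gr}\,x_*(\cdot)).$$

If $h \to 0$, $\delta \to 0$, then

$$\|x_{h(\delta)}^\delta(\cdot) - x_*(\cdot)\|_C = \max_{t \in [0, T]} \|x_{h(\delta)}^\delta(t) - x_*(t)\| \le M_1 h + d_H(\Phi(\delta), \mathrm{gr}\,x_*(\cdot)) \to 0. \tag{34}$$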

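We estimate the numerical approximation $\tilde V(\alpha, \delta, h)$ of the value $V(\alpha, \delta)$ (31):

$$\begin{aligned} \tilde V(\alpha, \delta, h) &= \int_0^T \left[ \frac{\|x_h^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt \\ &\le \frac{1}{2}\int_0^T \left[ \|x_h^\delta(t) - x^\delta(t)\|^2 + 2\|x_h^\delta(t) - x^\delta(t)\|\,\|x^\delta(t) - y(t)\| \right] dt \\ &\quad + \int_0^T \left[ \frac{\|x^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt. \end{aligned}$$

Using the inclusion $(t, x^\delta(t)) \in \Phi(\delta)$ and Lemma 7, we get

$$\frac{1}{2}\int_0^T \left[ \|x_h^\delta(t) - x^\delta(t)\|^2 + 2\|x_h^\delta(t) - x^\delta(t)\|\,\|x^\delta(t) - y(t)\| \right] dt \le \frac{T(M_1 h)^2}{2} + T\left(\delta + d_H(\Phi(\delta), \mathrm{gr}\,x_*(\cdot))\right)M_1 h.$$

We introduce the notation

$$\phi(\delta, h) = T M_1 h\left( \frac{M_1 h}{2} + \delta + d_H(\Phi(\delta), \mathrm{gr}\,x_*(\cdot)) \right)$$

and obtain the relations

$$\tilde V(\alpha, \delta, h) = \int_0^T \left[ \frac{\|x_h^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt \le \phi(\delta, h) + \int_0^T \left[ \frac{\|x^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt.$$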

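Using the estimates for $\Delta I_1$, $\Delta I_2$, we obtain

$$\frac{\alpha}{2}\int_0^T \|u_h^\delta(t)\|^2 dt \le \frac{\alpha}{2}\int_0^T \|u_\delta^\alpha[t]\|^2 dt + \frac{\alpha}{2}\int_0^T \|u_h^\delta(t) - u_\delta^\alpha[t]\|^2 dt,$$

where $u_h^\delta(t) = u^\alpha(t_i, x_h^\delta(t_i), s_h^\delta(t_i))$ for $t \in [t_i, t_{i+1}]$. Inserting the intermediate term $u^\alpha(t, x_h^\delta(t), s_h^\delta(t))$, we get

$$\begin{aligned} \frac{\alpha}{2}\int_0^T \|u_h^\delta(t) - u_\delta^\alpha[t]\|^2 dt &\le \frac{\alpha}{2}\sum_{i=0}^{N(h)} \int_{ih}^{(i+1)h} \left\| u^\alpha(t_i, x_h^\delta(t_i), s_h^\delta(t_i)) - u^\alpha(t, x_h^\delta(t), s_h^\delta(t)) \right\|^2 dt \\ &\quad + \frac{\alpha}{2}\int_0^T \left\| u^\alpha(t, x_h^\delta(t), s_h^\delta(t)) - u^\alpha(t, x^\delta(t), s^\delta(t)) \right\|^2 dt \\ &\le \frac{\alpha}{2}\sum_{i=0}^{N(h)} \int_{ih}^{(i+1)h} \left\| u^\alpha(t_i, x_h^\delta(t_i), s_h^\delta(t_i)) - u^\alpha(t, x_h^\delta(t), s_h^\delta(t)) \right\|^2 dt \\ &\quad + \frac{1}{2}\varphi\left(0, \|x_h^\delta(\cdot) - x^\delta(\cdot)\|_C, \|s_h^\delta(\cdot) - s^\delta(\cdot)\|_C\right) \\ &\le \frac{1}{2}T\varphi(h, K_4 h, K_4 h) + \frac{1}{2}\varphi(0, M_1 h, M_2 h) = \rho(\delta, h). \end{aligned}$$

Finally, we have

$$\int_0^T \left[ \frac{\|x_h^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt \le \phi(\delta, h) + \rho(\delta, h) + \int_0^T \left[ \frac{\|x^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_\delta^\alpha[t]\|^2}{2} \right] dt.$$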

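The integral in the last expression coincides with the functional $I_{0, x^\delta(0)}(u_\delta^\alpha(\cdot))$ (31). Since the trajectory $x^\delta(\cdot)$ and the control $u_\delta^\alpha[\cdot]$ are the solution of the optimal control problem (1), (2), (11) and satisfy condition (31), the functional $I_{0, x^\delta(0)}(u_\delta^\alpha(\cdot))$ is not greater than $I_{0, x_*(0)}(u_*(\cdot))$. So the following inequalities are true:

$$\int_0^T \left[ \frac{\|x^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_\delta^\alpha[t]\|^2}{2} \right] dt \le \int_0^T \left[ \frac{\|x_*(t) - y(t)\|^2}{2} + \frac{\alpha}{2}\|u_*(t)\|^2 \right] dt \le \frac{T\delta^2}{2} + \frac{\alpha}{2}\|u_*\|_{L_2}^2.$$

We gather the estimates and obtain

$$\tilde V(\alpha, \delta, h) = \int_0^T \left[ \frac{\|x_h^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt \le \phi(\delta, h) + \rho(\delta, h) + \frac{T\delta^2}{2} + \frac{\alpha}{2}\|u_*\|_{L_2}^2.$$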

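We bound the integral $\tilde V(\alpha, \delta, h)$ from below:

$$\tilde V(\alpha, \delta, h) = \int_0^T \left[ \frac{\|x_h^\delta(t) - y(t)\|^2}{2} + \frac{\alpha\|u_h^\delta(t)\|^2}{2} \right] dt \ge \int_0^T \frac{\alpha\|u_h^\delta(t)\|^2}{2}\,dt = \frac{\alpha}{2}\|u_h^\delta\|_{L_2}^2.$$

As a result, we get

$$\|u_h^\delta\|_{L_2}^2 \le \frac{2}{\alpha}\left( \phi(\delta, h) + \rho(\delta, h) + T\delta^2 \right) + \|u_*\|_{L_2}^2. \tag{35}$$

The family of functions $u_h^\delta(\cdot) \in \mathcal{U}$, $h > 0$, $\delta > 0$, is bounded in $L_2$, so there exists a sequence $\delta_k \to 0$, $h_k = h(\delta_k) \to 0$, $k \to \infty$, such that $u_k(\cdot) = u_{h_k}^{\delta_k}(\cdot)$ converges weakly to an element $v_*(\cdot)$ in $L_2$. Let us prove that $v_*(\cdot)$ belongs to the set $\mathbf{U} = \{u \in \mathcal{U} : x(t, u) = x_*(t)\}$ of admissible controls that solve the inverse problem.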

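The Euler polygonal paths $x_k(\cdot) = x_{h_k}^{\delta_k}(\cdot)$ satisfy the relations

$$x_k(t_i) = x_k(t_{i+1}) - h_k\left[ f(t_i, x_k(t_i))u_k(t_i) + g(t_i, x_k(t_i)) \right],$$

$$x_k(t_i) = x_k(T) - h_k\sum_{j=i}^{N} \left[ f(t_j, x_k(t_j))u_k(t_j) + g(t_j, x_k(t_j)) \right].$$

We pass to the limit as $k \to \infty$. Since $\|x_k(\cdot) - x_*\|_C \to 0$ and $u_k(\cdot)$ converges weakly to $v_*(\cdot)$ in $L_2$, it follows from Lemma 1 and these equalities that $v_*(\cdot) \in \mathcal{U}$ and

$$x_*(t) = x_*(T) - \int_t^T \left[ f(\tau, x_*(\tau))v_*(\tau) + g(\tau, x_*(\tau)) \right] d\tau, \quad t \in [0, T]. \tag{36}$$

Equality (36) means that $v_*(\cdot)$ solves the inverse problem, that is, $v_*(\cdot) \in \mathbf{U}$. According to Lemma 6 we obtain

$$\|v_*(\cdot)\|_{L_2}^2 \le \liminf_{k \to \infty} \|u_k(\cdot)\|_{L_2}^2. \tag{37}$$

From inequalities (35), (37), condition (32) and Lemma 2, we get the following estimates:

$$\|u_*(\cdot)\|_{L_2}^2 \le \|v_*(\cdot)\|_{L_2}^2 \le \liminf_{k \to \infty} \|u_k(\cdot)\|_{L_2}^2 \le \limsup_{k \to \infty} \|u_k(\cdot)\|_{L_2}^2 \le \|u_*(\cdot)\|_{L_2}^2.$$

Therefore,

$$\lim_{k \to \infty} \|u_k(\cdot)\|_{L_2}^2 = \|u_*(\cdot)\|_{L_2}^2 = \|v_*(\cdot)\|_{L_2}^2.$$

Since the normal solution $u_*(\cdot)$ is unique among the controls that solve the inverse problem (see Lemma 2), we have

$$u_*(\cdot) = v_*(\cdot).$$

So the family of functions $u_h^\delta(\cdot)$, $h > 0$, has the only weak limit point $u_*(\cdot)$; moreover,

$$\lim_{h \to 0} \|u_h^\delta(\cdot)\|_{L_2} = \|u_*(\cdot)\|_{L_2}.$$

Then, using this equality and the weak convergence of $u_k(\cdot)$ to $u_*(\cdot)$ in $L_2$, we finally get

$$\|u_h^\delta(\cdot) - u_*(\cdot)\|_{L_2}^2 = \|u_h^\delta(\cdot)\|_{L_2}^2 - 2\langle u_h^\delta(\cdot), u_*(\cdot)\rangle_{L_2} + \|u_*(\cdot)\|_{L_2}^2 \to 0.$$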

5 Example

Consider the system

$$\dot x_1 = x_2 + u_1, \qquad \dot x_2 = u_2,$$

$u_1 \in [0, 1]$, $u_2 \in [-10, -9]$, $t \in [0, 10]$. Let the sampling history be the following one (free fall):

$$y_1(t) = -\frac{9.8\,t^2}{2} + 480, \qquad y_2(t) = -9.8\,t, \quad t \in [0, 10].$$

We put $\delta = 0.01$, $h = 0.001$, $\alpha = 0.01$.

Fig. 1 shows the real trajectory $x_{*1}(t) = y_1(t)$ (red color) and the reconstructed trajectory $x_{1h}^\delta(t)$ (blue color).

Figure 1. The real trajectory $x_{*1}(t) = y_1(t)$ (red color) and the reconstructed trajectory $x_{1h}^\delta(t)$ (blue color); horizontal axis: $t$.

Fig. 2 shows the reconstructed control $u_{2h}^\delta(t)$.

Figure 2. The reconstructed control $u_{2h}^\delta(t)$; horizontal axis: $t$.

We note that the reconstructed controls $u = (u_1(t), u_2(t))$ should be constant, with $u_1(t) = 0$, $u_2(t) = -9.8$, because the given real trajectory $(y_1(t), y_2(t))$ is the trajectory of free fall.
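This claim can be checked directly by substituting the sampling history into the system. The short verification below is added for illustration; it treats $y(\cdot)$ as an exact trajectory, ignoring the sampling error $\delta$.

```latex
\begin{aligned}
\dot y_1(t) &= -9.8\,t = y_2(t) + u_1 &&\Rightarrow\quad u_1 = 0 \in [0, 1],\\
\dot y_2(t) &= -9.8 = u_2 &&\Rightarrow\quad u_2 = -9.8 \in [-10, -9].
\end{aligned}
```

Both values lie inside the control constraints, so the free-fall trajectory is generated by an admissible constant control.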

6 Conclusion

In this paper a new method for solving inverse problems is suggested and justified. It can be useful in problems of modeling and identification of model parameters in mechanics, engineering, economics, biology and so on, when statistical data are given.

The method will be extended to problems with more complex dynamics, to problems with a state space of higher dimension and to problems with incomplete statistical data.

REFERENCES

[1] Krasovsky N.N., Subbotin A.I. Game-Theoretical Control Problems. NY: Springer-Verlag (1987).

[2] Osipov Y.S., Vasiliev F.P., Potapov M.M. Bases of the Dynamical Regularization Method. Moscow: Moscow State University (1999) (in Russian).

[3] Subbotina N.N., Tokmantsev T.B. On Grid Optimal Feedbacks to Control Problems of Prescribed Duration on the Plane // Annals of the ISDG: Advances in Dynamic Games. Vol. 11. Boston: Birkhäuser, 133-147 (2011).

[4] Subbotina N.N. The method of characteristics for Hamilton–Jacobi equations and applications to dynamical optimization. Journal of Mathematical Sciences. Vol. 135(3). Pp. 2955-3091. NY: Springer (2006).

[5] Kolmogorov A.N., Fomin S.V. Elements of the Theory of Functions and Functional Analysis. Dover Publications (1999).