• No results found

Decentralized State Feedback and Near Optimal Adaptive Neural Network Control of Interconnected Nonlinear Discrete-Time Systems

N/A
N/A
Protected

Academic year: 2021

Share "Decentralized State Feedback and Near Optimal Adaptive Neural Network Control of Interconnected Nonlinear Discrete-Time Systems"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

Missouri University of Science and Technology

Missouri University of Science and Technology

Scholars' Mine

Scholars' Mine

Electrical and Computer Engineering Faculty

Research & Creative Works

Electrical and Computer Engineering

01 Dec 2010

Decentralized State Feedback and Near Optimal Adaptive Neural

Decentralized State Feedback and Near Optimal Adaptive Neural

Network Control of Interconnected Nonlinear Discrete-Time

Network Control of Interconnected Nonlinear Discrete-Time

Systems

Systems

Shahab Mehraeen

Jagannathan Sarangapani

Missouri University of Science and Technology, [email protected]

Mariesa Crow

Missouri University of Science and Technology, [email protected]

Follow this and additional works at: https://scholarsmine.mst.edu/ele_comeng_facwork

Part of the Computer Sciences Commons, and the Electrical and Computer Engineering Commons

Recommended Citation

Recommended Citation

S. Mehraeen et al., "Decentralized State Feedback and Near Optimal Adaptive Neural Network Control of Interconnected Nonlinear Discrete-Time Systems," Proceedings of the 49th IEEE Conference on Decision and Control (2010, Atlanta, GA), pp. 114-119, Institute of Electrical and Electronics Engineers (IEEE), Dec 2010.

The definitive version is available at https://doi.org/10.1109/CDC.2010.5717123

This Article - Conference proceedings is brought to you for free and open access by Scholars' Mine. It has been accepted for inclusion in Electrical and Computer Engineering Faculty Research & Creative Works by an authorized administrator of Scholars' Mine. This work is protected by U. S. Copyright Law. Unauthorized use including reproduction for redistribution requires the permission of the copyright holder. For more information, please contact [email protected].

(2)



Abstract— In this paper, first a novel decentralized state

feedback stabilization controller is introduced for a class of nonlinear interconnected discrete-time systems in affine form with unknown subsystem dynamics, control gain matrix, and interconnection dynamics by employing neural networks (NNs). Subsequently, the optimal control problem of decentralized nonlinear discrete-time system is considered with unknown internal subsystem and interconnection dynamics while assuming that the control gain matrix is known. For the near optimal controller development, the direct neural dynamic programming technique is utilized to solve the Hamilton-Jacobi-Bellman (HJB) equation forward-in-time. The decentralized optimal controller design for each subsystem utilizes the critic-actor structure by using NNs. All NN parameters are tuned online. By using Lyapunov techniques it is shown that all subsystems signals are uniformly ultimately bounded (UUB) for stabilization of such systems.

I. INTRODUCTION

In the recent years, there has been a great interest in the decentralized control of interconnected nonlinear systems using neural networks (NNs) [1-8]. In large-scale systems such as power systems, the feedback delays degrade the controller performance thus necessitating more decentralized control techniques. The decentralized state feedback control effort has focused mainly on nonlinear continuous-time systems [1-5] and limited effort in discrete-time case [6-8]. Although for many applications, continuous-time controller design can be considered, in practice discrete-time control approaches are preferred for computer implementations [9] since continuous-time controller designs render unsatisfactory performance when implemented using low sampled hardware [10]. Therefore, decentralized controller development in discrete-time has been explicitly considered.

In [6], the discrete-time NN controller design for a class of interconnected nonlinear systems is considered where the interconnected terms are considered to be over bounded by a constant. Moreover, the control gain matrix is taken to be unity (i.e. g(x)1). In [7][8], a stabilizing robust controller is proposed by assuming the dynamics are known beforehand. Therefore, the NNs are not utilized in the controller design.

On the other hand, the objective of an optimal controller is to minimize a cost function [11] while ensuring stability. In general, the optimal control of linear systems can be

1

Authors are with Department of Electrical and Computer Engineering, Missouri University of Science and Technology, 1870 Miner Circle, Rolla,

obtained by solving the Riccati equation [11]. However, the optimal control of nonlinear discrete time systems often requires solving the nonlinear Hamilton-Jacobi-Bellman (HJB) equation. Although extensive theoretical work has been done on nonlinear optimal for discrete-time systems [12][13], obtaining a closed-form solution for the HJB equations is still extremely hard.

Therefore, approximate solutions referred to as approximate dynamic programming (ADP) have been proposed to solve the HJB equation forward-in-time for discrete-time nonlinear optimal regulation [12][13]. These solutions are based on policy-value iterations for discrete-time nonlinear systems to solve the nonlinear HJB equation offline. Neural networks (NN) are utilized to approximate the unknown nonlinear functions. The drawback with off-line solutions is the need for large data sets for NN training and a long training procedure.

In this work, first a novel decentralized NN state feedback controller is developed for a class of interconnected nonlinear discrete-time system in Brunovsky canonical form where the restrictions in [6] are relaxed while approximating the unknown subsystem internal dynamics and input coefficient matrix via NNs. The interconnected terms are also considered unknown as opposed to [7][8]. A single NN is used to approximate the control gain matrixg(x)as well as the subsystem internal dynamicsf(x)for each subsystem. By using the subsystem state vector, the overall closed-loop stability of the nonlinear system for stabilization is presented.

II. INTERCONNECTED NONLINEAR SYSTEM- FEEDBACK

CONTROLLER DESIGN

Consider the class ofN interconnected subsystems defined in Brunovsky Canonical form as

) ( ) ( ; ) ( )) ( ( )) ( ( ) 1 ( ; ) ( ) 1 ( ; ; ) ( ) 1 ( 1 1 2 1 k x k y x u k x g k x f k x k x k x k x k x i i i i i i i i in in in i i             (1)

where index i represents the subsystem number, N is the number of subsystems, n is the order of the subsystem,

)) ( (x k

fi i , represents subsystem nonlinear internal dynamics, gi(xi(k))is the input gain matrix, i(x) denotes interconnected terms of the subsystem ‘i’ with

T T N T x x x[ 1 ,, ] , xi[xi1,,xin]Tfor 1i N.

Define regulation error as

MO 65409. Contact author: [email protected]. Research Supported in part by NSF ECCS#0624644 and ECCS-0901562..

Decentralized State Feedback and Near Optimal Adaptive Neural

Network Control of Interconnected Nonlinear Discrete-time Systems

S. Mehraeen, S. Jagannathan, and M.L. Crow1

49th IEEE Conference on Decision and Control December 15-17, 2010

(3)

ipd ip

ip k x k x

z ( ) ( ) (2)

for1i N and 1 p n, wherexipd 0is the desired set point for stabilization for the statexip(k) for 1p n. Next, define the filtered regulation error as

) ( ] 1 [ ) (k z k ri  i T i (3) where T in i i i k z k z k z k z( )[ 1( ) 2( ) ( )] andi[i1 i2 i,n1]. The coefficients i1 through ip are selected such that the poles of the characteristic equation(q)i1i2q

1 2 1

,    

in qn qn are inside the unit disc. Before we proceed, the following mild assumptions and definition are needed.

Assumption 1: Let the interconnection terms (weak in nature) in (1) be bounded above in a compact set such that

      N j ij j i N j ij j i(x) 1 (r) 0(x) 1 r (4) where 0iand ijare known small positive constants for

N i  

1 and 1 j nin contrast with [6].

Assumption 2: The input gain of each subsystem in (1) is bounded away from zero and is bounded in the compact set. Without loss of generality, we assume that it satisfies

max min ( ( ))

0gi gi xi k gi (5)

in a compact set where giminand gimaxare positive real constants..

A. State Feedback Controller Design

In this part we develop a NN stabilizing controller which employs the filtered regulation error and NN function approximation capability and a novel NN weight estimate tuning scheme. The stability criterion is then elaborated to show the stability of the filtered regulation error as well as NN weight estimates.

Starting with (3), the filtered error dynamics can be written by using (1) and (2) as

) ( )) ( ( ] 0 [ )) ( ( ) 1 ( ] 1 [ ) 1 (k z k f x k x z g x k u x ri  i T i   i i  ind iT i i i ii . The ideal stabilizing control input can be defined as

) (k r K u u ui i  id ii  where



)) ( ( )) ( (x k 1 f x k g uid  i i  i i



i T i ind z x [0]

 . This results in asymptotically stable dynamics ri(k1)Kiri(k)withKi1 being a positive design constant. However, in practical applications uidis not available since the internal dynamicsfi(xi(k)) and

control gain matrixgi(xi(k))are unknown.

Thus, we employ NN function approximation property to approximate uidas





)) , ( ) , ( ] 0 [ )) ( ( )) ( ( 1 ind i i ind i i T i i T i ind i i i i id x x x x W z x k x f k x g u           (6)

whereWi is the target NN weight matrix, i(.)is the activation function, and i(.)is the approximation error which satisfies i(.) imax. In practice, the target weights

i

W and approximation error i are not available either and only an estimation of the NN weights is available. Thus,

id

u is approximated as uˆidby a NN to obtain the control input u as i ) ( ) , ( ˆ ) ( ˆ Kr k W x x Kr k u ui id  ii  iTi i ind  ii (7) whereWˆiT is the NN weight estimation matrix. Define the weight estimation error asW~i ˆWiWi.

Consequently, by using (7) and adding and subtracting

id

u in (6), the filtered error (3) dynamics becomes



~



( ) )) ( ( ) 1 ( ] 1 [ ) 1 (k z k g x k W Kr x ri   i T i   i i iTii ii i (8)

Define the NN weight update law as

) 1 ( ) ( ˆ ) 1 ( ˆ 1     cW k c r k k WiT i i i ii i (9)

whereci1is a positive design constant. By subtracting the ideal weights from (9), we have

i i i i i i i i T i k cW k c r k I c W W~ ( 1) ~( ) 1  ( 1)(  ) (10)

The presence of the parameterci 1in the above update law provides stability of the weights where (9) becomes a stable system with input ri( k 1), and thus, prevents the NN weight estimates from remaining large after the filtered regulation error becomes small, as opposed to conventional update laws [6][14] whereci 1.

B. Stability analysis

In this part we introduce the following theorem to show that the nonlinear discrete-time interconnected system (1) along with controller (7) and the NN weight update law (9) are stable while the filtered regulation errors ri(k) and weight estimation errors W~i(k) of the individual subsystems are bounded in the presence of unknown internal dynamicsfi(xi(k)) and control gain matrixgi(xi(k)), and unknown interconnection termsi(x)for1i N.

Once the filtered regulation errorri(k)is proven bounded, it is treated as a bounded input for the linear time-invariant system (3) as i1zi1(k) i,n1zi1(kn2) ) ( ) 1 ( 1k n r k

zi    i which yields bounded results for the outputzi1(k) for all1i N .

Theorem 1 (Decentralized State Feedback NN Controller):

Consider the nonlinear discrete-time interconnected system given by (1). Consider that the Assumptions 1 and 2 hold andxipd 0(for all 1i N and 1 p n) and initial conditions for system (1) are bounded in the compact set. Let the unknown nonlinearities in each subsystem be approximated by a NN whose weight update is provided by (9). Then there exist a set of control gains K and filteredi error coefficientsi, associated with the given control inputs (7) such that the filtered errorri(k) as well as the NN weight estimation error W~i are UUB for all 1i N.

Proof. Define the overall Lyapunov function candidate r W L L L  , where





   N i i i r k r k g xk L 1 2 )) 1 ( ( ) ( ) ( and 115

(4)

  N i i T i i W k W kW k L 1 ( ) ~ ) ( ~ 1 )

(  . Then, the first difference of

the Lyapunov function due to the first term becomes









       N i i i N i i i r r k g xk r k g xk L 1 2 1 2 )) 1 ( ( ) ( )) ( ( ) 1 ( (11)

Substituting the filtered error (8) into (11) and expanding the terms, we obtain





                     N i ik i N i i i i i i i T i k i N i ri r g k r x r K x W g L L 1 2 ) 1 ( 1 2 ) ( 1 ) ( ) ( ) ) ( ~ (   (12)

Next, by using (10), the first difference due to the second term in the overall Lyapunov function candidate is obtained



~() ( 1) ( )



1 ~ ()~()

1 cW k c 1 r k I c W2 W kWk

LW i i i  i iii    i i  i iT i

  (13)

Expanding the first difference of the overall Lyapunov function candidateLLrLW by using (11) and (12) and expanding the terms, applying the Cauchy-Schwarz

inequality 2 2 1 ) (a a  an ( ) 2 2 2 2 1 a an a n     ,  i T i i T i WW   ~ ~ W~iTiiTW~i, Assumption 1 , and

   N i1 i i x 2 ) (     



 N i N j ij j i i N N r 1 1 2 2 2 0 ( 1) ) 1 (( 

   2 1 ( 1) 0 N i iN i 2 1 1 2 ) 1 ( i N i N j jN ji r

  

where  i i(xi)is an arbitrary

real-valued function ofx , i Lbecomes

        N i i i iw i irr C W k C C L 1 2 2 ~( ) ) (  (14) where  

                                                    N i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i N c c c g W c I W c c W c c W c k g c g c c k g g C 1 2 0 2 max 2 1 min 2 max 2 2 2 max 2 1 2 2 2 2 max 2 2 2 2 max 1 2 max ) 1 )( 4 ) 1 ( 4 1 2 ( ) ( 2 ) 1 ( ) 1 ( 1 1 ) ( 4 ) 1 ( 4 ) ( 1           ,

                             N j j j j j j j ji i i i i i i i i i i i ir N c c c g K g c g c c g g g C 1 2 2 max 2 1 min 2 2 max 2 max 2 2 max 1 2 max max max ) 1 ( 4 ) 1 ( 4 1 2 4 ) 1 ( 4 1 1     , w i i i iw C c C        1 , and 2 max 2 max 2 max 2 2 max 1 min 4 ) 1 ( 4 i i i i i i i i i w i g c g c c g C                      . Therefore,  L 0in (14) provided the filtered error and weight estimation errors hold for all 1i Nas

ir i

i C C

r    , orWiT(k)  Ci Ciw

~ . (15)

This guaranties the boundedness of the weight estimation error W~i(k)and filtered errorri(k)which in turn shows that the errorszi(k)are UUB for all 1i Nas explained.

The bounds can be reduced through Ci which can be achieved if the constantc is selected to be close to one, i whereasK and i i are selected to be small positive constants for1i N, which, in turn, causes the stability bounds (15) to decrease. Also, the interconnection bound parameters ji and 0i (for1i N ) need to be small which can be achieved when the effect of the interconnection dynamics are small or weak in nature. On the other hand, the NN function approximation error

max i

 in C can be made small by increasing the number of i NN hidden-layer neurons [14] for1i N .

III. DECENTRALIZED OPTIMAL CONTROL

In this section our goal is to find optimal control inputs ))

( (x k

ui i or also denoted here as ui(k)for1i N in order to stabilize the interconnected system (1) while minimizing the infinite horizon cost function

)) 1 ( ( )) ( ( )) ( (x k Q xk u RuJ xk J T (16) with

  N i Qi xi x Q 1 ( ) )

( and Rdiag(R1,,RN) where 

  ) (x

Q is a positive definite function of the overall interconnected system states, RNN is positive definite matrix, and

1 1

( ) ( ( )) [ ( ( )), , ( ( ))]T N N u k u x k  u x k  u x k where ui(xi(k)) is only a function of the ith subsystem states (for 1i N). Defineg(x(k))diag(g1(x1(k)),,, gN(xN(k))), , , 0 [ )) ( (i n1  i x k g gi(xi(k))]T, f(x(k)) [f1(x(k)),, T  , T T N xk f (( ))] , ) ( , ), ( [ )) ( (xk 1 x2k x, 1k fi n i  in ,fi(xi(k)) T i(x ))](k   for 1i N. Also, define the subsystem cost function (17) as

!      ( ( )) ( ( )) ( ( )) ( ( 1)) )) ( (xk Q x k u xk Ru xk J xk Ji i i i T i i i (17)

By stationarity condition [11], the optimal policy that minimizes the cost function Ji(.) can be obtained by

1 1 1 1 2 T i i i i i i u ( x( k )) R g ( x ( k )) J ( x( k )) x ( k )   "  "  (18)

Substituting the subsystem optimal control policy (18) into equation (17) results in the nonlinear partial difference HJB equation where the subsystem optimal policy and cost function are obtained as functions of the overall interconnected system state vector x(k) due to the presence of the interconnection term, whereas the control policy

(.)

i

u has to be a function of subsystem state vector,xi(k), only to be synthesized.

Define the notations “Ji(.)” and “ui(.)” obtained by (17) and (18) to represent the optimal value of the cost function “Ji(.)” and control policy “ui(.)”, respectively. Consequently, the optimal control of the interconnected system can be viewed as the subsystem optimal problems corresponding to the cost functions (17) for1i N. As mentioned earlier, due to unavailability of the overall system state vector, a nearly optimal subsystem policy can be defined which leads to the nearly optimal policy of the overall interconnected system. However, finding a solution for the subsystem optimal policy that minimizes (17) in the presence of unknown interconnection terms is still generally hard.

Consequently, in this paper, we use an approximation of the subsystem cost function and optimal policy which are obtained through subsystem states only. Here, in order to obtain the nearly optimal control policy which is a function

(5)

of subsystem states,ui(.) is used as an approximation of (.)

i

u by using only subsystem states. Moreover, “Ji(.)” is an approximation of Ji(.) when only subsystem states and

(.)





i

u are employed. Now, assume that

)) ( ( )) ( ( )) ( ( ; )) ( ( )) ( ( )) ( (xk J x k 1 xk u xk u x k 2 xk Ji  i i i i  i i i (19) where the terms 1i(x(k)) and 2i(x(k)) reflect the effects of the interconnection terms to be discussed later. Subsequently, the nearly optimal subsystem policy

)) ( (x k

ui i and the corresponding subsystem cost function ))

( (x k

Ji i for 1i N are obtained. From (17) and (19), the subsystem cost function Ji(xi(k)) satisfies

)) ( ( )) 1 ( ( )) ( ( )) ( ( )) ( ( )) ( ( k x k x J k x u R k x u k x Q k x J i i i i i T i i i i i i $              (20) where $(x(k))Ji(x(k))1i(x(k)Qi(xi(k)) 



ui(xi(k))





( ( )) ( ( ))



)) ( ( 2 2i xk TRiui xi k  i xk     



Ji(xi(k1))1i(x(k1))



Note thatx(k1) f(x(k))g(x(k))u(x(k))%(x(k)), and thus,x(k)%1(x(k1)). Hence, the term

) ( ( )) ( ( )) ( (xk Ji xk 1i xk $  ))) ( ( )) ( ( ) ( (x k u x k Ru x k Qi i  i i T i i i  Ji(xi(k1)) can be written as$(x(k))1i(x(k1)).

Assumption 3: Let the terms 1i(x(k)in (20) be bounded above as a function of the interconnection terms in the compact set such that

    N j ij j i i x 10 x 1 x 1( )  ( ) ( )  ,

where1i0(x) 1i0M, with 1i0M and ij are positive constants for1i N.

By using the stationarity condition [11] 0 )) ( ( )) ( ( "  "Ji xi k ui xk , equation (18) yields )) ( ( ) 1 ( )) 1 ( ( )) ( ( 5 . 0 ) 1 ( )) 1 ( ( )) ( ( 5 . 0 )) ( ( )) ( ( )) ( ( 3 1 1 2 k x k x k x J k x g R k x k x J k x g R k x k x u k x u i i i i T i i i i i T i i i i i i i     "   "    "  "            (21) where ) 1 ( )) 1 ( ( )) ( ( 5 . 0 )) ( ( 1 1 3i x k  Ri gi xi k T "i x k "xi k 

Since the HJB equation (20) has no known closed-form solution, we use neural networks (NN), with subsystem states as their inputs, to approximate the cost functionJi(xi(k)) as well as the nearly optimal policy

)) ( (x k

ui i in a forward-in-time manner. In addition, the effect of the interconnection terms is overcome by augmenting a feedforward term similar to the tracking problem [15] while the nearly optimal control design for each subsystem is performed simultaneously.

Define Ji(xi(k))WciT i(xi(k))ci (22) and ui(xi(k))uio(xi(k))uiF(xi(k))WaiT i(xi(k))aiFi(xi(k))

(23)

respectively, where uio(xi(k))WaiT i(xi(k))ai is the optimal policy, uFi(xi(k))Fi(xi(k)) is the feedforward term, Wci and Wai are the target subsystem critic and action NN weights which are assumed to be bounded satisfying

ciM ci W

W  , Wai WaiM, with ci and ai denote approximation errors satisfyingci ciM, ai aiM, with i(.)and i(.)being the NN activation function vectors for the critic and action NNs, respectively [14].

Assumption 4: The gradient of the approximation error satisfies"ci(xi(k1))"xi(k1)cMin for1i N .

Next the stabilizer design is introduced.

IV. ACTOR-CRITIC STABILIZER DESIGN

The central theme of employing an actor-critic design is to use parametric structures, such as neural networks (NNs), to approximate the cost function and optimal control law by assuming that the local states are available for measurement.

A. The Critic Network Design

The objective of the optimal control law is to stabilize the system (1) while minimizing the cost functions (20). Since the cost function (20) is analytically not available it will be approximated by a NN as provided in (22). Consequently, by employing the subsystem states, the cost functionJi(xi(k))is approximated by )) ( ( ) ( ˆ )) ( ( ˆ x k W k x k Ji i  ciT i i (24)

whereWˆci is the estimated weight matrix of the target W . ci Note that the function1i(.)(and Ji(x(k))) in (20) cannot be approximated due to unavailability of overall state vector. Now we construct the critic network by augmenting the individual vectors, we obtain

) , , ( c1 cN cdiagW W W ,Wˆc(k)diag(Wˆc1(k),,WˆcN(k))

&

'

&

'

               ) ( ) 1 ( ) ( ) ( ) 1 ( ) ( ) ( 1 1 1 p k k k p k k k k N N N   , and                )] ( ) 1 ( ) ( [ )] ( ) 1 ( ) ( [ ) ( 1 1 1 p k k k p k k k k cN cN cN c c c c          

wherep1 is the number of past values from the previous time steps. The overall cost function is computed as

)) ( ( ) ( ˆ )) ( ( ˆ x k T k x k c  W J  (25)

whereJˆ(x(k))[Jˆ1(x1(k)),,JˆN(xN(k))]T. Now define the critic error as

)) ( ( ) ( ˆ ) 1 ( ) (k k cT k x k c Q W  E     (26) where (k)(x(k))(x(k1)) (27) and TT N T Q Q k) [ ] (  1  Q where  )) ( ( )) ( ( )) ( ( [Qx k u x k Ru x k Qi i i  i i T i i i Qi(xi(k ))p  ))] ( ( )) ( (x k p Ru x k p

ui i  T i i i  for 1i N. Then, the critic error dynamics become

) 1 ( ) 1 ( ˆ ) ( ) 1 (k  k  cT k  k c Q W  E (28)

By selecting the critic weight update law as [16] 117

(6)







( ) ( )



) 1 ( ) 1 ( ) 1 ( ) 1 ( ˆ 1 k k k k k k T T c c T c Q E    W             (29) the critic error dynamics becomes

) ( ) 1 (k c c k c E E   (30)

wherecdiag(c1,,cN)withcibeing a design constant for1i N . From (20) and (22) we obtain

) 1 ( ) 1 ( ) ( ) ( ) 1 ( ) ( ) 1 ( ) ( 1                k k k k k k k k c T c c c T c T c   W V    W  W Q (31) wherec(k1)c(k1)c(k)V1(x(k)) and  )) ( ( 1 xk V               )] ( ) 1 ( ) ( [ )] ( ) 1 ( ) ( [ 1 1 1 11 11 11 p k k k p k k k N N N          . Define

weight estimation error to beW~cWcWˆc. By using (28), (29) and (31), it can be concluded that



( 1) ˆ () ( )



( 1) ) 1 ( ) 1 ( ~            k k k k k k c cT c T c  Q W   W  (32)

Then, by shifting the time step in (31) to obtain Q( k 1)and substituting it in (32) , we obtain ) 1 ( ) ( ) ( ~ ) ( ) 1 ( ~ ) 1 (         T k Wc k c TkWc k c ck c k .(33) As a result, the dynamics of the weight estimation errors can be obtained as



( )~( ) ( )



( 1)( ( 1) ( 1)) ( 1) )) 1 ( ) 1 ( )( 1 ( ) 1 ( ~ 1 1                        k k k k k k k k k k k c T c i T T c c      W     W  (34)

The dynamics of the weight estimation errors in (34) will be well-defined provided the matrixT(k)(k)is invertible which will be shown next by using the following Lemma.

Lemma 1. Letui(.)be an admissible control for system (1) for1i N. It can be shown that

)) 1 ( ( { )) 1 ( (    

i xi k

i xi k  i(xi(k))}1L is basis. B. The Action Network Design for Stabilization

The action networks in each subsystem generate the nearly optimal control inputs in order to minimize the cost function (16). The action network for subsystem ‘i ’ is defined such that, by using (22), it estimates the nearly optimal control policy (21) as

) 1 ( )) 1 ( ( )) ( ( 2 1 )) 1 ( ( )) ( ( 1 1  "  "      k x k x k x g R k x k x ui i i i i i T i i (35) where          "  "   "  "     ) 1 ( )) 1 ( ( ) ( ) 1 ( )) 1 ( ( )) ( ( 2 1 )) 1 ( ( 1 k x k x k W k x k x k x g R k x i i ci ci i i T i T i i i i i  ,

which is a function of x(k)(sincexi( k 1)is a function of )

(k

x ). In the optimal decentralized control presented herein, the term xi( k 1)is a function of entire state vectorx(k)which is unavailable for measurement and prevents generating the optimal policy in (21). Thus, the dependency of the critic network derivative as well as

)) 1 ( (

1i xk

 in (35) to the overall system state vectorx(k)must be carefully considered. Moreover, an appropriate modification of the control input through an additional (feedforward) termFi(x(k))in (23) is considered such that the stability analysis can be performed in the presence of the interconnection terms.

Next, the following Lemma introduces the dependency of the optimal policy (21) to the interconnection terms.

Lemma 2. Consider the nonlinear interconnected system (1) where the subsystem optimal policies are given in (35). Then, the optimal policy (35) can be represented as

 

(x(k)) (x(k))

ui i i ini( kx( )) i(x(k)) where i(xi(k)) is a function of subsystem statesxi,in is a constant, and

)) ( (x k

i

 is assumed to be small value for1i N . Proof. Proof is performed by using Taylor series expansion and is omitted due to page constraints. Assumption 5: In the compact set, let the termi(x(k))be



( ( 1)) ( 1)



)) ( ( 5 . 0 )) ( ( )) ( (    1 " 1  "  xk i xk Rigi xi k T i xk xi k i    and

bounded above as a function of the interconnection terms

such that

      N j ij j i i(x) 0(x) 1 (x)  , where M i i0(x) 0

   , with i0 M and ij are positive constants for 1i N.

Note that only i(xi)in ui( kx( ))can be approximated by

the action NN

(i.e.i(xi)WaiT(k) i(xi(k)))ai(k)Fi(xi(k)), while the others have to be explicitly considered in the proof. Thus, rearranging the terms in ui( kx( )), we obtain

0 ) 1 ( )) 1 ( ( )) ( ( 2 1 )) 1 ( ( )) ( ( )) ( ( ) ( )) ( ( ) ( )) ( ( ) ( 1 1 0 a a   "  "          g x k xk x k R k x k x k x x k x F k k x k W i i i T i i i i i i i in i i i i i T i     (36)

This step is important while analyzing the action NN error dynamics. Next, the nearly optimal controller (23) is approximated by a NN as )) ( ( )) ( ( ˆ )) ( ( )) ( ( ˆ )) ( ( ˆ x k u x k u x k Wa a x k F x k ui i  io i  iF i  iT i i  i i (37) whereWˆai is the estimated weight matrix of target weights

i Wa whileFi(xi(k))is defined as ) ( ] 0 [ )) ( ( )) ( (x k g x k 1 z k Fi i  i i  i T i (38)

The feedforward term (38) will improve the stability in the presence of the interconnection terms by reducing the stability bound and will be small provided the regulation error is small.

Define the weight estimation error for the action NN as i

i

i W W

W~a  a  ˆa . Then the action error become

) ( ˆ ) 1 ( )) 1 ( ( )) ( ( 2 1 )) ( ( )) ( ( ) ( ˆ ) 1 ( a 1 a W k k x k x k x g R k x F k x k W k e ci i i i T i i i i i i i T i i  "  "      (39)

Subtracting (36) from (39) yields

) 1 ( )) 1 ( ( )) ( ( 2 1 ) ( ~ ) 1 ( )) 1 ( ( )) ( ( 2 1 ) ( ) ( ) ( ) ( ) ( ) ( ~ ) 1 ( 1 1 0 a a a  "  "   "  "            k x k x k x g R k W k x k x k x g R k x x k k k W k e i i i T i i i ci i i i T i i i i i in i i i T i i     . (40)

(7)

Also, define the action NN weight update law as 1 )) ( ( )) ( ( ) 1 ( )) ( ( ) ( ˆ ) 1 ( ˆ a a a a      k x k x k e k x k W k W i i i T i i i i i i i  (41)

which renders the following weight estimation error dynamics 1 )) ( ( )) ( ( ) 1 ( )) ( ( ) ( ~ ) 1 ( ~ a a a a      k x k x k e k x k W k W i i i T i i i i i i i  . (42)

C. Filtered Regulation Error and Stability Analysis

By using the subsystem dynamics (1) and control input (37), the filtered error dynamic defined in (3) is given by

i i i i i T i i i i i i i i i i i k x g k x k W k x g k x u k x g k x k x f k r a a a ( ) ( ( )) ( ( )) ~ )) ( ( )) ( ( )) ( ( )) ( ( )) ( ( ) 1 (            (43) wherefi(xi(k))fi(xi(k))[0i]Tzi(k).

In this part we show that the nonlinear discrete-time interconnected system (1) along with controller (37), critic network (24), NN weights, the filtered error and weight estimation error (43) and (42) of the individual subsystems are bounded, even in the presence of the unknown interconnection termsi(x)for 1i N. A Lemma is introduced before the Theorem.

Lemma 3. Consider the large-scale interconnected system (1). Suppose that there exist ideal action NN weights Wai for1i N which provide the nearly optimal controller (23) for system (1). Let’sBfi(x) figiuii2Then

Bfi fi x

B ( ) as k*! for1i N with Bfibeing a small positive value.

Theorem 2: Consider the nonlinear interconnected discrete-time system given by (1). Let ui(xi(k))be an initial admissible control input for the ith subsystem of the nonlinear interconnected discrete-time system for1i N. Let the Assumptions 1 through 5 hold and that the initial conditions for system (1) are bounded in the compact set. Let the weight tuning for the critic and action networks be provided by (29) and (41), respectively. Then, the critic error (26), the action error (39), and regulation errorri(k) along with the weight estimation errors of the critic and action NNs for 1i Nare all uniformly ultimately bounded (UUB) for all k  k + T0. In addition,u*ubwithbbeing a positive constant.

Proof: Omitted due to space considerations. V. CONCLUSIONS

In this paper, both a decentralized state feedback and an optimal controller were introduced for interconnected nonlinear discrete-time system. For the state feedback controller, the internal dynamics, interconnection terms and the input gain matrix are considered unknown while for the optimal controller, the internal dynamics and interconnection terms are considered unknown. Novel

update laws developed in this work render uniform ultimate boundedness result.

REFERENCES

[1] S.N. Huang, K.K. Tan, and T.H. Lee, "Nonlinear Adaptive Control of Interconnected Systems Using Neural Networks", IEEE Transactions on Neural Networks, vol. 17, no. 1, pp.: 243 - 246, Jan. 2006. [2] Y. Liu and X.Y. Li, “Decentralized Robust Adaptive Control of

Nonlinear Systems with Unmodeled Dynamics”, IEEE Transactions on Automatic Control, vol. 47, no. 5, pp.:848 – 856, May 2002. [3] W. Liu, S. Jagannathan, G. K. Venayagamoorthy, D. C. Wunsch, M.

L. Crow, L. Liu, and D. A. Cartes, “Neural Network Based Decentralized Controls of Large Scale Power Systems”, Proc. of the IEEE Int. Symposium on Intelligent Control, pp.:676 – 681, October 2007.

[4] S. Huang, K.K. Tan, and T.H. Lee, “Decentralized Control Design for Large-Scale Systems With Strong Interconnections Using Neural Networks,” IEEE Tran. on Auto. Control, vol. 48, pp. 805 - 810, 2003.

[5] J.T. Spooner, K.M. Passino, “Decentralized Adaptive Control of Nonlinear Systems Using Radial Basis Neural Networks,” IEEE Tran. on Auto. Control, vol. 44, no. 11, pp. 2050 – 2057, 1999.

[6] S. Jagannathan “Decentralized Discrete-Time Neural Network Controller for a Class of Nonlinear Systems with Unknown Interconnections", Proceedings of the IEEE International Symposium on Intelligent Control, pp.268 – 273, 2005.

[7] E. Gyurkovics and T. Takacs, “Stabilization of Discrete-time

Interconnected Systems Under Control Constraints”, IEE Proceedings on Control Theory and Appl., vol. 147, no. 2, pp.: 137 – 144, 2000.

[8] A. Jokic, M. Lazar, “On decentralized stabilization of discrete-time nonlinear systems,” Proc. of the American Control Conference, pp. 5777 – 5782, 2009.

[9] Y. H. Lin and K. S. Narendra , “A New Error Model for Adaptive Systems” IEEE Tran. on Auto. Contrl, vol. 25, no. 3, pp.585-587, 1980.

[10] F. Cacccavale, F. Pierri, and L. Villani, “Adaptive observer for fault diagnosis in nonlinear discrete-time systems,” ASME Journal of Dynamic Syst. Meas. and Control, vol. 130, no. 2, pp. 1-9, 2008. [11] F. L. Lewis and V. L. Syrmos, Optimal Control, Wiley, NY 1995. [12] Z. Chen and S. Jagannathan, “Generalized Hamilton–Jacobi–Bellman

Formulation-based Neural Network Control of Affine Nonlinear Discrete-time Systems,” IEEE Trans. Neural Network., vol. 10, no. 1, pp. 90–106, Jan. 2008.

[13] A. Al-Tamimi, F.L. Lewis, and M. Abu-Khalaf, “Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof,” IEEE Transactions on Systems, Man, and Cybernetics —Part B, vol. 38, pp. 943-949, Aug. 2008.

[14] S. Jagannathan, Neural Network Control of Nonlinear Discrete-time Systems, CRC Press, April 2006.

[15] G. Toussaint, T. Basar and F. Bullo, “ !

H -optimal Tracking Control Techniques for Nonlinear Underactuated Systems.” Proceedings of IEEE Conference on Decision and Control, pp. 2078 – 2083, 2000. [16] T. Dierks and S. Jagannathan, “Optimal Control of Affine Nonlinear

Discrete-time Systems,” Proc. of the IEEE Mediterranean Symposium on Control and Automation, pp. 1390 – 1395, 24-26 Jun., 2009.

References

Related documents