• No results found

Direct Adaptive Output Feedback CMAC Control with Unknown Control Gain for a Class of Nonlinear Systems

N/A
N/A
Protected

Academic year: 2021

Share "Direct Adaptive Output Feedback CMAC Control with Unknown Control Gain for a Class of Nonlinear Systems"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

Procedia Engineering 29 (2012) 3966 – 3971 1877-7058 © 2011 Published by Elsevier Ltd. doi:10.1016/j.proeng.2012.01.603 Procedia Engineering 00 (2012) 000–000

Procedia

Engineering

www.elsevier.com/locate/procedia

2012 International Workshop on Information and Electronics Engineering (IWIEE)

Direct Adaptive Output Feedback CMAC Control with

Unknown Control Gain for a Class of Nonlinear Systems

Chun-Sheng Chen

Department of Electronic Engineering, China University of Science and Technology, Taipei 115, Taiwan

Abstract

A direct adaptive output feedback CMAC (cerebellar model articulation controller) control based on only output measurements is proposed with unknown control gain function for a class of affine nonlinear systems. Therefore, it is needed to design a state observer to estimate unmeasured states of the systems. By using strictly positive-real Lyapunov theory, the stability of the closed-loop system can be guaranteed. Finally, simulation results for an inverted pendulum system show that the effect of the approximation error on the tracking error can be attenuated efficiently. © 2012 Published by Elsevier Ltd.

Keywords: CMAC neural network; Direct adaptive controller; Observer; Nonlinear systems

1. Introduction

The control of general nonlinear system has been a widely investigated problem because of its wide applications in practical systems. Many practical systems may be so complex or, even, model-free that to construct a mathematical model or identify its parameters is difficult or even impossible. There has been considerable attention over the years on researches using neural networks (NNs) based on human heuristic and learning algorithms [1,2]. Especially, the cerebellar model articulation controller (CMAC) was first developed by Albus in the 1970s [3], and is regarded as a nonfully connected perceptron-like associative memory network with overlapping receptive-fields. Compared with the general multiplayer neural network with back-propagation algorithm, the CMAC has been applied widely to the closed-loop control of complex dynamical systems because of its simple computation, fast learning property and good

Corresponding author.

E-mail address: [email protected].

Open access under CC BY-NC-ND license.

(2)

generalization capability [4]. Therefore, the application of CMAC is not only for the control problem but also for the model-free function approximator.

However, these control methods are based on the assumption that the system states are known or available for feedback. If system states are not available, an observer needs to estimate the unmeasured states and the SPR-Lyapunov design approach has been used to derive the adaptive and control laws [5,6], and the stability of the closed-loop system can be guaranteed.

2. System formulation

Consider the nth-order nonlinear dynamic system described in the following form: d

u g f

y(n) = (x)+ (x) + (1)

where f(x) is an unknown continuous function, g(x)is also unknown but strictly positive function, d is the unknown external bounded disturbance, u and y are the control input and output, respectively. Let

T n T n x x x x x x , , , ] [ , , , ] [ ( 1) 2 1 = − = L &L

x . Rewriting (1) in the state space representation

x x x x x T C y d u g f B A = + + + = ( ( ) ( ) ) & (2) where ⎥ ⎦ ⎤ ⎢ ⎣ ⎡ = − × − × − ) 1 ( 1 1 1 ) 1 ( 0 n n n A 0 I 0 , B=[0,0,L,0,1]T, C=[1,0,L,0]T.

Give the desired trajectory yd and define the output tracking error e=ydy, desired trajectory

vector [ , , , (n 1)]T, d d d

d = y y& L y

y output tracking error vector e= ydx. The control objective is to force the output y for tracking a given bounded desired trajectory yd. If f(x) and g(x) are known and the system is free of the external disturbance d , the certainty equivalent controller can be chosen as

] ) ( [ ) ( 1 ( ) * x x Te c n d f K y g u = − − + (3) where c T n c c c k k k

K =[ 1, 2,L, ] . Let u=u*, substituting (3) into (1), the error dynamics is derived as

0 1 2 ) 2 ( 1 ) 1 ( ) ( + ++ + = − − k e k e k e e k en nc n nc n L c& c (4)

If Kc to be chosen such that the characteristic polynomial is Hurwitz, then it implies that the tracking error trajectory will converge to zero when time tends to infinity, i.e. limte(t)=0.

However, f(x) and g(x) are unknown functions and only the system output is available for measurement, the optimal control (3) cannot be implemented. An observer will be designed to estimate the state vector and determine a direct adaptive CMAC controller such that the closed loop is stable. 3. Design of CMAC system

The architecture of the CMAC system includes input space, association memory space, receptive-field space, weight memory space and output space. The Gaussian function is chosen as the receptive-field basis function which can be expressed as

] / ) ( exp[ ) ( i i ik 2 ik2 ik I I m σ φ = − − for k=1 L,2, ,Nb. (5)

where φik(Ii) represents the kth block receptive-field basis function of the input Ii with the mean mik and the variance σ ;ik Nb is the number of block for each input dimension. The multidimensional receptive-field function bp(I) is associated with pth hypercube and can be expressed as

) ( )

( ni 1 ik i

p I

b I =∏= φ for p=1 L,2, ,Nh where Nh is the number of memory.

In this technique, each state variable is quantized and the problem space is divided into discrete states. A vector of quantized input values specifies a discrete state and is used to generate addresses for retrieving information from memory for this state. The areas formed by blocks (or hypercubes) are the

(3)

addresses that store data. CMAC uses a set of indices as an address in accordance with the current state to extract the stored data. By considering the input vector vk, the output yvk of CMAC with Gaussian basis function can be mathematically expressed as

k k h k v T T v N T k v Diag b b b y =a ( 1, 2,L, )w=ζ w=w ζ (6)

where the association index with Gaussian basis function T vk ζ is defined as ( 1, 2, , ) m k N T k T v a Diag b b L b ζ =

and the weight vector w of CMAC is expressed as T

Nh w w w, , , ] [ 1 2 L = w .

4. Direct adaptive CMAC output feedback controller design

In order to achieve the control objective, the overall controller is chosen a

s

D u u

u

u= + + (7)

where uD is a CMAC-neural controller designed to approximate the optimal control law (3), and us and a

u are employed to compensate the external disturbance and modeling error. Based on the certainty equivalence approach, an optimal control law can be rewritten as

] ˆ ) ( [ ) ( 1 ( ) * x x Te c n r K y f g u = − − + + (8)

Applying (8) and (7) into (2) and after some manipulation, one has the error dynamic equation

e x e e e T a s D T c C e d u u u u g B BK A = − − − − + − = 1 * ) ] )( ( [ ˆ & (9) Design the following observer to estimate the error vector e in (9)

e e e e ˆ ˆ ) ˆ ( ˆ ˆ ˆ 1 1 1 T o T c C e e e K BK A = − + − = & (10) where o T n o o o k k k

K =[ 1, 2,L, ] is the observer gain vector, which is selected such that the characteristic

polynomial of T

oC K

A − is strictly Hurwitz because (C,A) is observable. Define the observation errors

e e e ˆ ~= and 1 1 1 ˆ ~ e e

e = − . Subtracting (10) from (9), the observable error vector is given by

e x x x x x x e e ~ ~ ] ) ˆ ( ˆ ) ˆ ( ˆ )) ˆ ( )( ˆ ( ˆ [ )] ˆ ( ˆ ) ( )( ( [ ~ ) ( ~ 1 * T a s D b T o C e d u g u g u u g B g g x u B C K A = − − − − + − + − = & (11) where ub(x)=u*(x)−uD(xˆ)−usua . By replacing uD(xˆ) and gˆ(xˆ) in (11) with specific CMAC system as (6). Define the optimal parameter vector w*D and w*g as

|] ) | ˆ ( ) ( | sup [ min arg * ˆ , * 2 1 D U U D D w D u u D w x x w x x − = ∈ ∈ Ω

∈ , arg minf[ sup12| ( ) ˆ(ˆ| ) ]| * g U U w g g g g w x x w x x − = ∈ ∈ Ω ∈

where x , xˆ , w and D w belong to compact sets g U1,U2, ΩD and Ωg, respectively, which are defined asU1={x: xM1},U2 ={xˆ: xˆ ≤M2}, ΩD={wD: wDMD} and Ωg ={wg: wgMg}. Lemma 1 [7]: Given an arbitrary ε*>0, there exist ξ(xˆ)=[ξ1(xˆ),L,ξM(xˆ)] and an ideal parameter

vector T gM g g g (w*1,w*2, ,w* ) * = L w such that − =

M= − ξ +ε j j gj gj j g b g g c w w u (x)( (x) ˆ(xˆ|w )) 1 ( * ) (xˆ) , where ε ≤ε*, cj are some positive constants, w*gj and wgj are jth elements of w*g and wg.

According to Lemma 1, (11) can be rewritten as

e w x w x x ξ w w x x e e T a g s g T D g M j j gj j T o C e u g u g g w c B C K A = + − − + + − =

= 1 1 ~ ] ) | ˆ ( ˆ ) | ˆ ( ˆ ) ˆ ( ~ ) | ˆ ( ˆ ) ˆ ( ~ [ ~ ) ( ~& ξ θ (12) where w~D =wD* −wD, gj gj gj w w

w~ = * − , j=1 L, ,M , and the total approximation error is defined as d u u g gD D − + =ε ˆ(xˆ|w )[ *(x) (xˆ|w*)]

(4)

] ) | ˆ ( ˆ ) | ˆ ( ˆ ) ˆ ( ~ ) | ˆ ( ˆ ) ˆ ( ~ )[ ( ~ 1 1=

= ξ + g TDξ − g sg aM j cjwgj j g g u g u s H e x x w w x x w x w (13)

where s denotes the Laplace operator d /dt and H(s)=CT(sI−(AKoCT))−1B is the transfer function of (12). In order to use the SPR-Lyapunov design approach, (13) can be written as

] ) | ˆ ( ˆ ) | ˆ ( ˆ ) ˆ ( ~ ) | ˆ ( ˆ ) ˆ ( ~ )[ ( ) ( ~ 1 1 1 1 1 1 1 =

= ξ + g DTξ − g sg aM j cjwgj j g g u g u s L s H e x x w w x x w x w (14) were L s =sm+bsm−1+L+bm 1 )

( , m < , is chosen such that n L−1(s) is a proper stable transfer function and H(s)L(s) is a proper SPR transfer function, us1=L−1(s)us , ua1=L−1(s)ua , ξ1(xˆ)=L−1(s)ξ(xˆ) andθ1=L−1(s)θ . Because (14) is a SPR dynamic system, there exists P1=P1T >0, 0

2 2 =PT >

P are the

solutions of the following matrix equation

1 1 1 ( ) ) (ABKcT TP +P ABKcT =−Q , (AKoCT)TP2+P2(AKoCT)=−Q2, P2Bc=Cc (15) where T m c b b

B =[0,0,L, 1,L, ] ,Cc=[1,0,L,0]T,Q1=Q1T,Q2 =Q2T is the given positive definite matrix. Theorem 1: Consider the nonlinear system (1) and assume that only output variable is measurable. Construct the adaptive CMAC controller as (6), uD(xˆ|wD)=wTDξD(xˆ) and gˆ(xˆ|wg)=wTgξg(xˆ), and the auxiliary compensation control and the adaptation laws are chosen as

) | ˆ ( ˆ ˆ ) ( oT 1 T 1 g a L s K P g u = ex w , ( )sgn(~ )ˆ 1(ˆ| ) 2 c g T s kL s PB g u = ex w (16) ) ˆ ( ) | ˆ ( ˆ ~ 1 2 x w x e w γ T c g ξ D D =− P B g & , w&g =−γge~TP2Bcξ1(xˆ) (17)

whereγD >0 and γg >0 are positive adaptive gain constants to be designed, P1=P1T >0 and 0

2 2 =PT >

P are the solutions of the matrix equation (15), respectively. The proposed control scheme can ensures that all the closed-loop signals are bounded, and the tracking errors converge to zero.

Proof: Choose the Lyapunov function candidate as

D T D M j j gj gj T T D g c w w P P V eˆ eˆ ~e e~ ~ ~ 21 w~ w~ 1 21 2 2 1 1 2 1 γ γ + + + =

= (18)

Taking the time derivative of V yields

D T D M j j gj gj T T T T D g c w w P P P P

V& eeˆ eˆ eˆ& e~& ~e e~ e~& ~ ~& 1 w~ w~& 1 1 2 2 1 2 2 1 1 2 1 1 2 1 γ γ + + + + + =

= (19)

Substituting (10) and (12) into (19) and the fact w~&D =−w&D, w~&g =−w&g, the above equation becomes )] [( | ~ | ~ ~ ˆ ˆ )] | [(| | ~ | ~ ~ ˆ ˆ ) ) ˆ ( ˆ ( ~ ~ ~ ˆ ˆ 2 2 1 1 2 1 1 2 2 1 1 2 1 1 1 2 2 1 1 2 1 k M PB Q Q k PB Q Q u g PB Q Q V c T T T c T T T s g c T T T − + − − ≤ − + − − ≤ − + − − = e e e e e e e e e e w x e e e e e θ θ & (20) According to k ≥M , then (20) becomes eˆ eˆ 21e~ 2e~

1 2 1 Q Q V&≤− TT .Denoting [ , ] 2 1 Q Q diag = Q and ] ~ , ˆ [ T T T

E = e e , then V&12ETQE<0. Therefore, it imply that

∈ L

e e x

x,ˆ, ,ˆ . Integrating V& from 0 = t to t =T yields

≤ (0)− ( )<∞ 0 2 1E Edt V V T T T

Q . Now because all variables in the right side of (12)

are bounded, ~e&1(t) is bounded. According to Barbalat’s lemma, we have limtE(t)=0. 5. Simulation results

Consider the dynamic equation of the inverted pendulum system [8]

(

)

(

)

u d x m m m l x x m m m l x x mlx x g m m y c c v c + − + + − + − + = 1 2 3 4 1 1 2 3 4 1 1 2 2 1 cos ) ( cos cos ) ( sin cos sin ) ( & & (21)

where y=x1=θ is the angle of the pendulum with respect to the vertical line and x2 =θ&, .mc is the mass of cart, m is the mass of pole, gv=9.8 m/s2 is the acceleration due to gravity, l is the half length of the pole, u is the control input. In this example, m=0.1kg, mc=1kg, l=0.5m. The control

(5)

objective is to maintain the output y to track the desired trajectory with only output is available for measurement. The disturbance d is assumed to be a square wave with the amplitude ±1 and period the π (s). The Gaussian basis functions in each input space are chosen as φik(Ii)=exp[−(Iimik)2/σik2] for

2 , 1 =

i and k=1,2,L,9 , and parameters σik =0.5 , [mi1,mi2,mi3,mi4,mi5,mi6,mi7,mi8,mi9]= ] 8 . 0 , 6 . 0 , 4 . 0 , 2 . 0 , 0 , 2 . 0 , 4 . 0 , 6 . 0 , 8 . 0

[− − − − . Given the positive matrices Q1 = Q2 =diag[10,10], feedback and observer gain vectors are given as T =

[

144,24

]

c

K and T =

[

60,900

]

o

K . The initial states are chosen as

T ] 3 . 0 -, 2 . 0 [ ) 0 ( = −

x , xˆ(0)=[0.2,0.1]T and let adaptation gains as γDg =100.

Let desired trajectory yd =0.1sin(t). Fig.1 shows the system output can track the desired output well. In Fig.2, the trajectory of the estimation state xˆ1 catches up to the trajectory of the system state x1

quickly and well. Fig. 3 shows that the tracking error has been attenuated efficiently.

0 5 10 15 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 time(sec) T racki ng r esp on se ,r ad desired actual 0 5 10 15 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 T racki ng r espo nse, ra d estimated actual 0 5 10 15 -0.05 0 0.05 0.1 0.15 0.2 0.25 0.3 e1 time(sec)

Fig. 1. Tracking response (y:dash, yd:solid) Fig. 2. Tracking response (x1:dash,ˆx1:solid) Fig. 3. The tracking error

6. Conclusions

The stable direct adaptive output feedback CMAC controller has been proposed for the trajectory tracking of uncertain nonlinear systems. This design obtains robustness in the sense that the self-tuning mechanism can automatically adapt the CMAC controller by using a learning algorithm and the global asymptotic stability of the algorithm is established via Lyapunov stability criterion. The simulation example is used to achieve a good trajectory following performance.

References

[1] R. M. Sanner and J. J. E. Slotine. Gaussian networks for direct adaptive control. IEEE Trans. Neural Networks 1992; 3:837–863.

[2] K. S. Narendra and K. Parthasarathy. Identification and control of dynamical systems using neural networks. IEEE Trans.

Neural Networks 1990; 1:4–27.

[3] J. S. Albus. A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Trans. ASME

J. Dyn. Syst., Meas., Contr 1975; 97:220–227.

[4] S. H. Lane, D. A. Handelman, and J. J. Gelfand. Theory and development of higher-order CMAC neural networks. IEEE

Contr. Syst. Mag. 1992; 12:23–30.

[5] Y. G. Leu, T. T. Lee and W. Y. Wang. Observer-based adaptive fuzzy-neural control for unknown nonlinear dynamical systems. IEEE Trans. Systems Man Cybern. B 1999; 29:583–591.

[6] J. H. Park and S. H. Kim. Direct adaptive output-feedback fuzzy controller for a nonaffine nonlinear system. IEE Proc.

Control Theory Application 2004; 151:65-72..

[7] P. A. Phan and T. J. Gale. Direct adaptive fuzzy control with a self-structuring algorithm. Fuzzy Sets and Systems 2008; 159: 871-899.

(6)

3971

Chun-Sheng Chen / Procedia Engineering 29 (2012) 3966 – 3971

objective is to maintain the output y to track the desired trajectory with only output is available for measurement. The disturbance d is assumed to be a square wave with the amplitude ±1 and period the

π (s). The Gaussian basis functions in each input space are chosen as φik(Ii)=exp[−(Iimik)2/σik2] for 2

, 1 =

i and k=1,2,L,9 , and parameters σik =0.5 , [mi1,mi2,mi3,mi4,mi5,mi6,mi7,mi8,mi9]= ] 8 . 0 , 6 . 0 , 4 . 0 , 2 . 0 , 0 , 2 . 0 , 4 . 0 , 6 . 0 , 8 . 0

[− − − − . Given the positive matrices Q1= Q2 =diag[10,10], feedback and observer gain vectors are given as T =

[

144,24

]

c

K and T =

[

60,900

]

o

K . The initial states are chosen as

T ] 3 . 0 -, 2 . 0 [ ) 0 ( = −

x , xˆ(0)=[0.2,0.1]T and let adaptation gains as γDg =100.

Let desired trajectory yd =0.1sin(t). Fig.1 shows the system output can track the desired output well. In Fig.2, the trajectory of the estimation state xˆ1 catches up to the trajectory of the system state x1

quickly and well. Fig. 3 shows that the tracking error has been attenuated efficiently.

0 5 10 15 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 time(sec) T racki ng r esp on se ,r ad desired actual 0 5 10 15 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 T racki ng r espo nse, ra d estimated actual 0 5 10 15 -0.05 0 0.05 0.1 0.15 0.2 0.25 0.3 e1 time(sec)

Fig. 1. Tracking response (y:dash, yd:solid) Fig. 2. Tracking response (x1:dash,ˆx1:solid) Fig. 3. The tracking error

6. Conclusions

The stable direct adaptive output feedback CMAC controller has been proposed for the trajectory tracking of uncertain nonlinear systems. This design obtains robustness in the sense that the self-tuning mechanism can automatically adapt the CMAC controller by using a learning algorithm and the global asymptotic stability of the algorithm is established via Lyapunov stability criterion. The simulation example is used to achieve a good trajectory following performance.

References

[1] R. M. Sanner and J. J. E. Slotine. Gaussian networks for direct adaptive control. IEEE Trans. Neural Networks 1992; 3:837–863.

[2] K. S. Narendra and K. Parthasarathy. Identification and control of dynamical systems using neural networks. IEEE Trans.

Neural Networks 1990; 1:4–27.

[3] J. S. Albus. A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Trans. ASME

J. Dyn. Syst., Meas., Contr 1975; 97:220–227.

[4] S. H. Lane, D. A. Handelman, and J. J. Gelfand. Theory and development of higher-order CMAC neural networks. IEEE

Contr. Syst. Mag. 1992; 12:23–30.

[5] Y. G. Leu, T. T. Lee and W. Y. Wang. Observer-based adaptive fuzzy-neural control for unknown nonlinear dynamical systems. IEEE Trans. Systems Man Cybern. B 1999; 29:583–591.

[6] J. H. Park and S. H. Kim. Direct adaptive output-feedback fuzzy controller for a nonaffine nonlinear system. IEE Proc.

Control Theory Application 2004; 151:65-72..

[7] P. A. Phan and T. J. Gale. Direct adaptive fuzzy control with a self-structuring algorithm. Fuzzy Sets and Systems 2008; 159: 871-899.

[8] L. X. Wang. Adaptive Fuzzy Systems and Control: Design and Stability Analysis. Englewood Cliffs, NJ: Prentice-Hall,

6 Chun-Sheng Chen/ Procedia Engineering 00 (2012) 000–000

References

Related documents

In this paper, a discrete robust adaptive iterative learning control (RAILC) is proposed for a class of uncertain non- linear systems with unknown control direction and random

For output feedback control, a nominal plant in output-feedback form is considered, and the observer backstepping procedure is amended to design a robust controller and an observer

The main purpose of this study is to design a much simpler control approach for a wastewater treatment plant using a data driven direct adaptive predictive controller

N EURAL O UTPUT F EEDBACK A DAPTIVE C ONTROL FOR N ONLINEAR N ONNEGATIVE U NCERTAIN S YSTEMS In this section, we consider the problem of character- izing neural adaptive output

For a special MIMO system class we show that the closed loop is asymptotically stable, and that the output feedback NMPC scheme recovers the performance of the state feedback in

This paper proposes a fuzzy adaptive prescribed performance control scheme for a class of uncertain chaotic systems with unknown control gains and unknown dead-zone inputs1. Firstly,

In this paper, an adaptive Hermite-polynomial-based CMAC neural control (AHCNC) system which is composed of a neural controller and a smooth compensator is proposed to

Decentralised robust sliding mode control for a class of nonlinear interconnected systems by static output feedback. Design of robust static output feedback for