Model reference control using sliding mode with Hamiltonian dynamics

(1)

Model reference control using sliding mode with Hamiltonian dynamics

R. J. Stonier J. Zajaczkowski

^∗

(Received 11 December 2000; revised 24 February 2003)

Abstract

Model reference control and model reference adaptive control has since its inception, found application in a wide range of applications from the control of simple mechanical structures to the more complex robotic manipulators. Sliding mode techniques largely simplify the task of tracking the reference model and are capable of accommodating the uncertainties present in the dynamics of the system. In this paper we are concerned with model tracking in finite time for plant and reference model which are given in Hamiltonian format. The method is applied to nonlinear plant and linear model, with partic- ular application to robot control. We also include the addition of a stabilising supervisory controller in terms of the Hamiltonian of the reference model.

∗School of Mathematics and Decision Science, Central Queensland University, Rockhampton, Queensland 4702, Australia. mailto:[email protected]

Seehttp://anziamj.austms.org.au/V45/E056/home.htmlfor this article, c Aus- tral. Mathematical Soc. 2003. Published December 14, 2003 ISSN 1446-8735

(2)

1 Introduction E2

2 Model reference control

MRC

E5

2.1 Assumptions . . . . E5 2.2 Control law —

MRC

. . . . E6 2.3 Robot illustration —

MRC

. . . . E8 2.4 Control law for robotic example —

MRC

. . . . E12 3 Model reference adaptive control —

MRAC

E16 3.1 Control and adaptation law in

MRAC

. . . . E17 3.2 Convergence . . . . E18 3.3 Robot illustration —

MRAC

. . . . E19 3.4 Control and adaptation law . . . . E20

4 Stability E24

4.1 Supervisory controller . . . . E24 4.2 Stable controller for sliding mode . . . . E26 4.3 Robot illustration . . . . E31 4.4 Computer simulation with supervisory controller . . . . E35 4.5 Asymptotic stability versus sliding mode . . . . E35

5 Conclusions E39

References E40

1 Introduction

Model reference control (mrc) and model reference adaptive control (mrac)

has a wide range of applications from the control of simple mechanical struc-

tures to the control of complex robotic manipulators. Specifically, it has

(3)

proven its practicality in control systems with constant or slowly-varying parameters. A whole range of different formalisms may be used to synthesise an adaptation mechanism. Sliding mode techniques largely simplify the task of tracking the reference model and are capable of accommodating the uncertainties present in the dynamics of the system. We use the Lyapunov method and sliding mode dynamics to determine controllers for (adaptive) tracking in finite time and also give sufficient conditions to ensure that the resulting system is stable.

Using the Hamiltonian formulation immediately provides a control system in standard state space format. Further, position and momentum coordinates are conjugate variables and the Hamiltonian itself is related to the energy of the system (in conservative systems it represents directly total energy of the system). Model reference control in Hamiltonian formulation [3] uses a Hamiltonian function and Lyapunov asymptotic stability techniques for mrc with no reference to sliding mode. (An earlier formulation has been presented by Skowronski [2].)

We begin by setting the Hamiltonian structure for the plant and model.

The plant is described by Hamiltonian canonical equations:

˙

q

_i

= ∂H

p

(q, p, a)

∂p

_i

,

˙

p

_i

= − ∂H

_p

(q, p, a)

∂q

_i

+ Q

^D_i

(q, p, d) + Q

^F_i

(q, p, a, u) , (1) where i = 1, . . . , n (we assume that n = 2) and H

_p

is the Hamiltonian of the system, Q

^D_i

is a damping force, Q

^F_i

is an external force, u is a control vector, and a = [a

₁

, . . . , a

_k

], k ≤ n , is a vector of adjustable plant parameters.

For technical reason we expand the vector a to a dimension n by assuming

a

_j

= 0 for n ≥ j > k . We also assume that damping coefficients represented

by vector d = [d

₁

, . . . , d

_n

] may be polluted by uncertainty, and thus not

known in exact form. The output state is x = [ q p ]

^T

. In mrc we assume

no uncertainty present in the system, and no adaptation parameters.

(4)

The reference model is designed as another Hamiltonian system with Hamiltonian H

_m

, and output x

_m

= [ q

_m

p

_m

]

^T

. In this model the damping force and external force are assumed to be known functions of time and are such that the reference model is Lagrange stable, that is, the model output is bounded:

˙

q

_m_i

= ∂H

m

(q

_m

, p

_m

)

∂p

_m_i

,

˙

p

_m_i

= − ∂H

_m

(q

_m

, p

_m

)

∂q

_m_i

+ Q

^D_m

i

(q

_m

, p

_m

, d

_m

) + Q

^F_m

i

(q

_m

, p

_m

, u

_m

). (2) Define the state error vector e = [ e

_q

e

_p

]

^T

as

e

_q_i

(t) = q

_i

(t) − q

_m_i

(t) ,

e

_p_i

(t) = p

_i

(t) − p

_m_i

(t) , i = 1, . . . , n . (3) The rate of change of e is

˙

e

_qi

= ∂H

_p

(q, p, a)

∂p

_i

− ∂H

_m

(q

_m

, p

_m

)

∂p

_m_i

,

˙

e

_pi

= − ∂H

_p

(q, p, a)

∂q

_i

+ ∂H

_m

(q

_m

, p

_m

)

∂q

_m_i

+ Q

^D_i

(q, p, d)

− Q

^D_m

i

(q

_m

, p

_m

, d

_m

) + Q

^F_i

(q, p, a, u) − Q

^F_m

i

(q

_m

, p

_m

, u

_m

) , i = 1, . . . , n . Note that error dynamics are not given in standard (canonical) Hamiltonian format as described in Skowronski [2].

To attempt to design control and adaptation laws in such a general form

as shown above would be impractical and so we consider a special form of the

dynamic Equations (1) and (2) which holds reasonable generality, in order

to find a suitable control and adaptation law in both mrc and mrac. We

begin first with Model Reference Control (without adjustable parameters a).

(5)

2 Model reference control

^MRC

Let us define the sliding mode variable s for our second order system:

s

_i

= ˙e

_q_i

+ λe

^α/β_q

i

, i = 1, . . . , n , where α and β are odd positive integers, α < β .

Consider the traditional Lyapunov function as a quadratic form of sliding mode variables:

V (s) = 1 2

n

X

i=1

s

²_i

. (4)

Denote by E

_q_i

, E

_p_i

the right-hand side of the Equation (1) and by E

_q_mi

, E

_p_mi

the right-hand side of the Equation (2). Then by definition

˙e

_q_i

= E

_q_i

− E

_q_mi

,

¨

e

_q_i

= E ˙

_q_i

− ˙ E

_q_mi

, i = 1, . . . , n . (5)

2.1 Assumptions

Reference model has its equilibria in the origin. The dynamics of the system satisfies

˙

q

_i

= g

_i

(q)p

_i

,

˙

q

_m_i

= g

_m_i

(q

_m

)p

_m_i

, i = 1, . . . , n , (6) where each g

_i

(·), g

_m_i

(·), for i = 1, . . . , n is a known bounded positive function.

Let us make a further assumption that ˙ E

_q_i

can be expressed as a linear function of E

_p_i

, i = 1, 2, with coefficient functions f

_i¹

and f

_i²

:

E ˙

_q_i

= f

_i¹

(q, p)E

_p_i

+ f

_i²

(q, p) , i = 1, . . . , n , (7)

(6)

and where each f

_i¹

(·), for i = 1, . . . , n is a known bounded positive function:

0 ≤ |f

_i¹

(q, p)| ≤ B , (8)

and B is a positive constant. A wide range of mechanical systems satisfy condition (6) and condition (7). Note that the control force is incorporated in E

_p_i

terms, see Equation (1).

2.2 Control law —

MRC

Now we calculate the time derivative of the Lyapunov function in order to extract the control law:

V (s) = ˙

n

X

i=1

s

_i

˙s

_i

=

n

X

i=1

s

_i

¨ e

_q_i

+ α

β λ ˙e

_q_i

e

^α/β−1_q

i

!

=

n

X

i=1

s

_i^h

f

_i¹

(q, p)E

_p_i

+ f

_i²

(q, p) − ˙ E

_q_mi

+ α

β λe

^α/β−1_q_i

E

_q_i

− E

_q_mi

#

=

n

X

i=1

s

i

"

f

_i¹

(q, p) − ∂H

_p

(q, p)

∂q

_i

+ Q

^D_i

(q, p, d) + Q

^F_i

(q, p, u)

!

+ f

_i²

(q, p) − ˙ E

_q_mi

+ α

β λe

^α/β−1_q_i

E

_q_i

− E

_q_mi

#

. (9)

The control laws are designed as follows:

Q

^F_i

= ∂H

_p

∂q

_i

− Q

^D_i

(q, p, d)

− 1

f

_i¹

f

_i²

(q, p) − ˙ E

_q_mi

+ α

β λe

^α/β−1_q

i

E

_q_i

− E

_q_mi

!

− K

f

_i¹

sgn(s

_i

) , i = 1, . . . , n , (10)

(7)

where K > 0 is constant.

Substituting for Q

^F_i

from the control law (10) into Equation (9), see that the control law selection makes the Lyapunov derivative semi-negative definite:

V (s) = − ˙

n

X

i=1

s

_i

Ksgn(s

_i

) = −K

n

X

i=1

|s

_i

| ≤ −K

n

X

i=1

s

²_i

!1/2

= − √

2KV

^1/2

≤ 0 . (11)

Obviously ˙ V = 0 only if s = 0 . This implies that V reaches the sliding surface in finite time T . Indeed, integrating inequality (11), we find that T must satisfy the inequality:

T ≤ t

₀

+

√ 2 (V (t

₀

))

^1/2

K .

Given the expression (9) for ˙ V , to show that s → 0 it is sufficient to show that ˙ V → 0 . First, we establish that s and ˙s are bounded (that in turn shows that ¨ V remains bounded and according to Barbalat’s lemma we have ˙ V → 0). Given Equation (11) obviously s and ˙s are bounded, see also expression (9). Thus s → 0 as t → ∞ . This in turn implies that error trajectories e

_q_i

, i = 1, . . . , n , tend to 0 as t → ∞ . Now, from (6) and the fact that ˙e

_q_i

, i = 1, . . . , n , tend to 0 as t → ∞ (see definition of s), we see that also e

pi

, i = 1, . . . , n , tend to 0 as t → ∞ . The latter comes from

˙e

_q_i

= g

_i

(q)p

_i

− g

_m_i

(q

_m

)p

_m_i

, i = 1, . . . , n , Because p

_i

= p

_m_i

+ e

_p_i

see that

g

i

(q)e

pi

= ˙e

qi

− (g

i

(q)p

i

− g

mi

(q

_m

)) p

mi

, i = 1, . . . , n , (12)

and this proves e

_p_i

, i = 1, . . . , n , tend to 0 as t → ∞ as the equilibria of the

reference model are at the origin, which requires p

_m_i

→ 0 . We have shown

more: namely that s → 0 in finite time.

(8)

2.3 Robot illustration —

MRC

To illustrate the method consider now the control of a cylindrical robotic manipulator [1], which has one revolute joint and two prismatic joints, see Figure 1.

The arm has length ` and its mass per unit is constant m

_a

/` . The length of the prismatic radial link changes when it slides through the hub. A force opposes the motion of the link and is modelled as a spring with an adjustable parameter k

_s

(we denote it ˆ k

_s

) which imposes zero force at r = 2`/3 . We ignore the vertical motion along the hub. Denote q

₁

= r and q

₂

= θ . The Hamiltonian describing the dynamics of the manipulator is

H(q, p) = 1 2

1 m

a

+ m

`

p

²₁

+ 1

C(q

1

) p

²₂

+ k

_s

q

₁

− 2 3 `

2!

,

where C(q) =

^m_4`^a

(q

³

+ (` − q)

³

) + m

_`

q

²

+ I ; and I is the effective moment of inertia of the rotating masses excluding m

a

and m

`

. We introduce the canonical transformation (q, p) → (Q, P ) .

q

₁

= Q

₁

√ m

_a

+ m

_`

+ 2

3 ` ; p

₁

= √

m

_a

+ m

_`

P

₁

; q

₂

= Q

₂

; p

₂

= P

₂

.

After the canonical transformation the Hamiltonian H(Q, P ) = 1

2 P

₁²

+ 1

C(Q

₁

/ √

m

_a

+ m

_`

+

²₃

`) P

₂²

+ k

_s

m

_a

+ m

_`

Q

²₁

!

. Without loss of generality, we return to the original notation keeping in mind that (p, q) denotes now the new coordinates (P, Q). The dynamics of the manipulator are now

˙

q

₁

= p

₁

,

(9)

(a)

(b)

Figure 1: Cylindrical robot manipulator: (a) top view; (b) side view.

(10)

˙

q

₂

= p

₂

C(q

_c

) ,

˙ p

₁

=

3 4 m

_a

+ m

_`

q

₁

m

_a

+ m

_`

+

1 8 m

_a

+ 2 3 m

_`

`

√ m

a

+ m

`

#

p

₂

C(q

c

)

!2

− k

s

(m

_a

+ m

_`

) q

₁

+ Q

^D₁

+ Q

^F₁

,

˙

p

₂

= Q

^D₂

+ Q

^F₂

, (13)

where

C(q

_c

) = m

_a

4`

q

_c³

+ (` − q

_c

)

³

+ m

_l

q

_c²

+ I , q

_c

= q

₁

√ m

_a

+ m

_`

+ 2

3 ` . (14)

Damping forces are defined by Q

^D_i

(q, p) = −d

_i

p

_i

, i = 1, 2 , and d

_i

are the positive damping coefficients.

Consider a reference model with dynamics

˙

q

_m₁

= p

_m₁

,

˙

q

_m₂

= p

_m₂

,

˙

p

_m₁

= − k

_s_m

m

_m_a

+ m

_m_`

q

_m₁

+ Q

^D_m

1

+ Q

^F_m

1

,

˙

p

_m₂

= −s

_m

q

_m₂

+ Q

^D_m₂

+ Q

^F_m₂

, (15) where analogously Q

^D_m_i

(q

_m

, p

_m

) = −d

mi

p

mi

, i = 1, 2 , and d

mi

are positive damping coefficients. The term s

_m

q

_m₂

represents spring forces. The equilibria of the model coincide with the original system, except that q

_m^e₂

= 0 , unlike the systems’ q

₂^e

that can be arbitrary.

With the given Lyapunov function (4), we find its time derivative

V ˙ = = s

₁

˙s

₁

+ s

₂

˙s

₂

(11)

= s

₁

e ¨

_q₁

+ α

β λe

^α/β−1_q₁

˙e

_q₁

!

+ s

₂

¨ e

_q₂

+ α

β λe

^α/β−1_q₂

˙e

_q₂

!

. In this case

˙e

_q₁

= ˙ q

₁

− ˙q

_m₁

= p

₁

− p

_m₁

= e

_p₁

˙e

_q₂

= p

₂

C(q

_c

) − p

_m₂

. Now we calculate ˙ C(q

_c

) noting that ˙ q

_c

=

^√_m^q^˙¹

a+m_`

. C(q ˙

_c

) = m

_a

4`

3q

²_c

q ˙

_c

− 3(` − q

_c

)

²

q ˙

_c

+ 2m

_`

q

_c

q ˙

_c

= 3 4

m

_a

q ˙

₁

√ m

a

+ m

`

2q

₁

√ m

a

+ m

`

+ 1 3 `

!

+ 2m

_`

q

₁

q ˙

₁

m

_a

+ m

_`

+ 4m

_`

` ˙ q

₁

3 √

m

_a

+ m

_`

. (16)

In the above formula ˙ q

₁

can be replaced by p

₁

. For notational convenience, denote by E

_i

the right-hand side of the system Equations (13), and by M

_i

the right-hand side of the model (15), i = 1, . . . , 4 . Then

˙e

_q_i

= E

_i

− M

_i

, i = 1, 2

˙e

_p_j−2

= E

_j

− M

_j

, j = 3, 4

¨

e

_q₁

= E

₃

− M

₃

,

¨

e

_q₂

= E ˙

₂

− M

₄

, where

E ˙

2

= C(q

_c

) ˙ p

₂

− ˙ C(q

_c

)p

₂

C

²

(q

_c

) . Now, rewrite the expression (9) for ˙ V as

V (s) = s ˙

1

E

3

− M

3

+ α

β λe

^α/β−1_q₁

e

p1

!

+ s

₂

E ˙

₂

− M

₄

α

β λe

^α/β−1_q

2

(E

₂

− M

₂

)

!

(12)

= s

₁

E

₃

− M

₃

+ α

β λe

^α/β−1_q₁

e

_p₁

!

(17) + s

₂

E

₄

C(q

_c

) − C(q ˙

_c

)p

₂

C

²

(q

_c

) − M

₄

+ α

β λe

^α/β−1_q

2

(E

₂

− M

₂

)

!

.

2.4 Control law for robotic example —

MRC

The control force is now incorporated in E

_p₁

and E

_p₂

-terms. We identify, from (9) and (17), the following terms in (10)

f

₁¹

= 1.0 , f

₁²

= 0 , f

₂¹

= 1

C(q

_c

) , f

₂²

= − C(q ˙

_c

)

C

²

(q

_c

) p

₂

, (18) and

E

_q_m1

= p

_m₁

, E

q_m2

= p

m2

,

E ˙

_q_m1

= E

_p_m1

= − k

_s_m

m

_m_a

+ m

_m_`

q

_m₁

+ Q

^D_m₁

+ Q

^F_m₁

,

E ˙

q_m2

= E

p_m2

= −s

m

q

m2

+ Q

^D_m₂

+ Q

^F_m₂

. (19) Note that

∂H

_p

∂q

₁

= −

"

3 4 m

_a

+ m

_`

q

₁

(m

_a

+ m

_`

) +

1 8 m

_a

+ 2 3 m

_`

`

√ m

a

+ m

`

#

p

₂

C(q

c

)

!2

+ k

s

(m

_a

+ m

_`

) q

₁

,

∂H

_p

∂q

₂

= 0 .

(13)

Then the control laws according to (10) are:

Q

^F₁

= −

"

3 4 m

a

+ m

`

q

₁

(m

_a

+ m

_`

) +

1 8 m

_a

+ 2 3 m

_`

`

√ m

a

+ m

`

#

p

₂

C(q

c

!2

+ k

_s

(m

_a

+ m

_`

) q

₁

− Q

^D₁

− 1

f

₁¹

−E

_p_m1

+ α

β λe

^α/β−1_q₁

e

_p₁

!

− K

f

₁¹

sgn(s

₁

) , (20)

Q

^F₂

= −Q

^D₂

− 1

f

₂¹

−E

_p_m2

− C(q ˙

_c

)p

₂

C

²

(q

_c

) + α

β λe

^α/β−1_q₂

(E

_q₂

− E

_q_m2

)

!

− K

f

₂¹

sgn(s

₂

) . (21)

Substituting the control law (20) and (21) into Equation (17) we obtain:

V ˙ = −s

₁

Ksgn(s

₁

) − s

₂

Ksgn(s

₂

) = −K (|s

₁

| + |s

₂

|)

≤ −K

s

²₁

+ s

²₂^1/2

= − √

2KV

^1/2

≤ 0 .

In our computer simulations we assumed the following parameter values for the plant and model: m

_a

= 10.0 , m

_m_a

= 10.5 , m

_`

= 1.25 , m

_m_`

= 1.5 ,

` = 1.0 , `

_m

= 1.3 , I = 1.0 , k

_s

= 100.0 , k

_s_m

= 110.0 , d

₁

= 5.0 , d

_m₁

= 6.5 , d

₂

= 0.001 , d

_m₂

= 0.8 , and s

_m

= 9.1 . The values for the constants were chosen as: K = 4 , λ = 2 , α = 3 and β = 5 . The initial conditions:

q

₁

= 0.1 , q

₂

= 3.0 , p

₁

= 0.0 , p

₂

= 0.0 , q

_m₁

= 0.6 , q

_m₂

= 0.1 , p

_m₁

= 4.0 , and p

_m₂

= 3.2 .

The error trajectory and controller time history for the simulations are

shown on Figure 2. The time history of the Lyapunov and its derivative,

together with the sliding mode variables are shown in Figure 3.

(14)

(a)

(b)

Figure 2: (a) Error convergence for mrc (b) Controller for mrc

(15)

(a)

(b)

Figure 3: (a) Lyapunov function and its derivative (b) Sliding mode vari-

ables

(16)

The convergence times of the error trajectories to within an -envelope of the origin (which is defined as a set of all states that lie within distance of from the origin) are: T

_c

= 9.47 for = 0.1 and T

_c

= 9.74 for = 0.05 .

3 Model reference adaptive control —

MRAC

Similarly we define a Lyapunov function for the adaptive case as a quadratic form of sliding mode variables plus its adaptive terms:

V (s, a, d) = 1 2

n

X

i=1

s

²_i

+ 1 2γ

n

X

i=1

˜ a

²_i

+ 1

2γ

n

X

i=1

d ˜

²_i

, (22)

where γ > 0 is constant and

˜

a

_i

= ˆ a

_i

− a

^∗_i

, d ˜

_i

= ˆ d

_i

− d

^∗_i

,

and where ˆ a

i

and ˆ d

i

are our estimates of uncertain parameters whereas a

^∗_i

and d

^∗_i

are their true values (but they may be unknown). We assume that condition (7 ) holds for mrac. Again we calculate the time-derivative of Lyapunov function in order to extract the control law:

V (s, a, d) = ˙

n

X

i=1

"

s

_i

˙s

_i

+ 1

γ ˙˜a

_i

˜ a

_i

+ 1 γ

d ˙˜

_i

d ˜

_i

#

=

n

X

i=1

"

s

_i

¨ e

_q_i

+ α

β λ ˙e

_q_i

e

^α/β−1_q_i

!

+ 1

γ ˙˜a

_i

˜ a

_i

+ 1 γ

d ˙˜

_i

d ˜

_i

#

=

n

X

i=1

s

_i

f

_i¹

(q, p)E

_p_i

+ f

_i²

(q, p) − ˙ E

_q_mi

+ α

β λe

^α/β−1_q

i

E

_q_i

− E

_q_mi

!

+ 1

γ ˙˜a

_i

˜ a

_i

+ 1 γ

d ˙˜

_i

d ˜

_i

#

=

n

X

i=1

"

f

_i¹

(q, p) − ∂H

_p

(q, p, a)

∂q

_i

+ Q

^D_i

(q, p, d)

(17)

+ Q

^F_i

(q, p, a, u)

+ f

_i²

(q, p) − ˙ E

_q_mi

(23) + α

β λe

^α/β−1_q

i

E

_q_i

− E

_q_mi

#

+ 1 γ

n

X

i=1

˙˜a

_i

˜ a

_i

+ d ˙˜

_i

d ˜

_i

. We assume that plant dynamics can be linearly parametrised in terms of the unknown parameters a

_i

. Note: the damping force Q

^D_i

(for all practical purposes) is linear in its damping coefficient. Let us denote

∂H

p

(q, p, a)

∂q

_i

= a

_i

∂H

_p^`

(q, p)

∂q

_i

,

Q

^D_i

(q, p, d) = d

_i

Q

^D`_i

(q, p) , (24) where superscript ` indicates function after linear parametrisation.

3.1 Control and adaptation law in

MRAC

We design the control law in mrac version:

Q

^F_i

= ˆ a

_i

∂H

_p^`

∂q

_i

− ˆ d

_i

Q

^D`_i

(q, p)

− 1

f

_i¹

− ˙ E

q_mi

+ α

β λe

^α/β−1_q_i

E

qi

− E

q_mi

!

− f

_i²

(q, p) − K

f

_i¹

sgn(s

_i

) , i = 1, . . . , n . (25) After substituting from Equation (25) into Equation (23) we obtain

V (s, a, d) = ˙

n

X

i=1

s

_i

"

(ˆ a

_i

− a

_i

) ∂H

_p^`

∂q

i

−

d ˆ

_i

− d

_i

Q

^D`_i

(q, p)

#

−

n

X

i=1

s

_i

Ksgn(s

_i

) + 1 γ

n

X

i=1

˙˜a

_i

˜ a

_i

+ d ˙˜

_i

d ˜

_i

(18)

=

n

X

i=1

"

˜

a

_i

s

_i

∂H

_p^`

∂q

i

+ 1 γ ˙ˆa

_i

!

+ ˜ d

_i

−s

_i

Q

^D`_i

(q, p) + 1 γ

d ˙ˆ

_i

!#

−

n

X

i=1

s

i

Ksgn(s

i

) . (26)

We now define the adaptation law ˆ ˙

a

_i

= − Kγ

√ γ sgn(˜ a

_i

) − γs

_i

∂H

_p^`

∂q

_i

− γK

√ γ sgn(˜ a

_i

) , (27) d ˙ˆ

_i

= − Kγ

√ γ sgn( ˜ d

_i

) + γs

_i

Q

^D`_i

− γK

√ γ sgn( ˜ d

_i

) . (28)

3.2 Convergence

Now we can return to our evaluation of ˙ V and prove it to be negative semi- definite, and consequently securing asymptotic tracking convergence. Sub- stituting control (25) and adaptation laws (27) and (28) into Equation (26), and using a simple algebraic inequality [4], we obtain

V (s, a, d) = −K ˙

n

X

i=1

"

√ 1

γ ˜ a

_i

sgn(˜ a

_i

) + 1

√ γ

d ˜

_i

sgn( ˜ d

_i

) + s

_i

sgn(s

_i

)

#

= −K

n

X

i=1

"

√ 1

γ |˜ a

_i

| + 1

√ γ | ˜ d

_i

|) + s

_i

sgn(s

_i

)

#

= −K

n

X

i=1

|s

_i

| ≤ −K

n

X

i=1

s

²_i

!1/2

≤ − √

2KV

^1/2

≤ 0 . (29)

Obviously ˙ V = 0 only if s = 0 . This implies that V reaches the origin in a finite time T , that is, V (T ) = 0 ; indeed, integrating inequality (29) see that

T ≤ t

₀

+

√ 2V

^1/2

(t

₀

)

K . (30)

(19)

Analogously, as in mrc section, we show that given the expression ( 23) we have s → 0 . It is sufficient to show that ˙ V → 0 . Clearly s and ˙s are bounded, this in turn shows that ¨ V remains bounded and according to Barbalat’s lemma we have ˙ V → 0 . Given Equation (29) obviously s and ˙s are bounded, see also Equation (23). Thus s → 0 as t → ∞ . This in turn implies that error trajectories e

_q_i

, i = 1, . . . , n , tend to 0 as t → ∞ . By the same argument as in mrc section we can show that also e

pi

, i = 1, . . . , n , tend to 0 as t → ∞ . And again s → 0 in finite time, see Equation (30).

3.3 Robot illustration —

MRAC

We consider the same example of robotic manipulator as in the mrc example but now we consider the damping forces as uncertain due to unknown damping coefficients. Damping forces are defined by Q

^D_i

(q, p, d) = −d

_i

p

_i

, i = 1, 2 , and d

i

are the positive damping coefficients of unknown value, and thus replaced in our control law by adjustable damping coefficients ˆ d

₁

and ˆ d

₂

. Similarly k

_s

is of uncertain value and thus subject to adaptation mechanism.

We modify the previous Lyapunov function to introduce adaptive terms V (s, d

₁

, d

₂

, k

_s

) =

2

X

i=1

s

²_i

+ 1 2γ

k ˜

_s²

+ 1

2γ ( ˜ d

²₁

+ ˜ d

²₂

) . Its time derivative is

V ˙ = s

₁

˙s

₁

+ s

₂

˙s

₂

+ 1

γ ˙˜k

_s

˜ k

_s

+ 1 γ

2

X

i=1

d ˙˜

_i

d ˜

_i

= s

₁

¨ e

_q₁

+ α

β λe

^α/β−1_q

1

˙e

_q₁

!

+ s

₂

e ¨

_q₂

+ α

β λe

^α/β−1_q₂

˙e

_q₂

!

+ 1

γ ˙˜k

_s

˜ k

_s

+ 1 γ

2

X

i=1

d ˙˜

_i

d ˜

_i

. (31)

Now, rewrite the expression for ˙ V using definitions and notation from the mrc section:

V (s) = s ˙

₁

E

_p₁

− E

_p_m1

+ α

β λe

^α/β−1_q

1

e

_p₁

!

(20)

+ s

₂

E ˙

_q₂

− E

_p_m2

α

β λe

^α/β−1_q₂

(E

_q₂

− E

_q_m2

)

!

+ 1

γ ˙˜k

s

˜ k

_s

+ 1 γ

2

X

i=1

d ˙˜

_i

d ˜

_i

= s

₁

E

_p₁

− E

_p_m1

+ α

β λe

^α/β−1_q₁

e

_p₁

!

+ s

₂

E

_p₂

C(q

c

) − C(q ˙

_c

)p

₂

C

²

(q

c

) − E

_p_m2

+ α

β λe

^α/β−1_q

2

(E

_q₂

− E

_q_m2

)

!

+ 1

γ ˙˜k

_s

˜ k

_s

+ 1 γ

2

X

i=1

d ˙˜

_i

d ˜

_i

. (32)

3.4 Control and adaptation law

Again, the control force is incorporated in E

_p₁

and E

_p₂

-terms. According to (25) our control law is

Q

^F₁

= −

3 4 m

_a

+ m

_`

q

₁

m

_a

+ m

_`

+

1 8 m

_a

+ 2 3 m

_`

`

√ m

_a

+ m

_`

#

p

₂

C(q

_c

!2

+

ˆ k

_s

(m

_a

+ m

_`

) q

₁

− ˆ Q

^D₁

− 1

f

₁¹

−E

_p_m1

+ α

β λe

^α/β−1_q₁

e

_p₁

!

− K

f

₁¹

sgn(s

₁

) , (33)

Q

^F₂

= − ˆ Q

^D₂

1 f

₂¹

−E

_p_m2

− C(q ˙

_c

)p

₂

C

²

(q

_c

) + α

β λe

^α/β−1_q₂

(E

_q₂

− E

_q_m2

)

!

− K

f

₂¹

sgn(s

₂

) , (34)

(21)

where K is a positive constant, and ˆ k

_s

, ˆ Q

^D`_i

= ˆ d

_i

Q

^D`_i

, i = 1, 2 , are our estimates of uncertain functions. Other terms are defined as in mrc case.

Substituting the control law (33) and (34) into Equation (32):

V ˙ = s

1

k ˜

s

q

₁

m

_a

+ m

_`

+ s

1

d ˜

1

p

1

+ s

2

d ˜

2

p

₂

C(q

_c

)

− s

₁

Ksgn(s

₁

) − s

₂

Ksgn(s

₂

) + 1

γ ˙ˆk

_s

k ˜

_s

+ 1 γ

2

X

i=1

d ˙ˆ

_i

d ˜

_i

. (35)

Then we formulate our adaptation law:

d ˙ˆ

₁

= − Kγ

√ γ − γs

₁

p

₁

, d ˙ˆ

₂

= − Kγ

√ γ − γs

₂

p

₂

C(q

c

) , ˆ ˙

k

_s

= − Kγ

√ γ − γs

₁

q

₁

m

_a

+ m

_`

. (36)

Then after substituting from Equation (36) into Equation (35) see that V ˙ = s

₁

k ˜

_s

q

₁

m

_a

+ m

_`

+ s

₁

d ˜

₁

p

₁

+ s

₂

d ˜

₂

p

₂

C(q

_c

) − s

₁

Ksgn(s

₁

)

− s

2

Ksgn(s

2

) + 1 γ

−γs

1

q

₁

m

_a

+ m

_`

k ˜

s

+ 1

γ [−γs

₁

p

₁

] ˜ d

₁

+ 1 γ

"

−γs

₂

p

₂

C(q

c

)

#

d ˜

₂

= −K(|s

₁

| + |s

₂

|) ≤ −K

s

²₁

+ s

²₂^1/2

≤ − √

2KV

^1/2

. (37)

In our computer simulations we assumed the parameter values as in mrc,

and we also set ˜ k

s

= 0.0 , ˜ d

1

= 0.0 , ˜ d

2

= 0.0 . The values for the constants

were chosen as K = 4 , γ = 2 , λ = 2 , α = 3 and β = 5 . Initial conditions

are the same as in mrc case. Error trajectory and controller time-history

(22)

(a)

(b)

Figure 4: (a) Error convergence for mrac (b) Controller for mrac

(23)

(a)

(b)

Figure 5: (a) Lyapunov function and its derivative (b) Sliding mode vari-

ables

(24)

are shown on Figure 4. The time history of the Lyapunov and its derivative, together with the sliding mode variables are shown in Figure 5.

The convergence times to the -envelope of the origin are T

_c

= 9.47 for

= 0.1 and T

_c

= 10.65 for = 0.05 . In some cases the convergence times are even shorter than in non-adaptive case despite uncertainty introduced to the system. It shows robustness of the adaptive scheme.

4 Stability

The sliding mode controller that we used up to now does not guarantee stability. We require a new control law that secures stability of the system.

4.1 Supervisory controller

Consider the two level control system with supervisory controller, illustrated on Figure 6.

The idea is to introduce a second-level controller designed to guarantee stability and take advantage of the properties of the sliding mode controller as the main controller without compromising its performance. The second- level controller acts as a supervisory controller, that is, when the sliding mode controller leads to instability of the system, it starts working to return stability of the system. Otherwise it remains idle.

We show here how develop the supervisory controller for mrc, as the controller for mrac would be the same with adaptive terms added.

Denote our control law (10) as u

^sm_i

(x) = Q

^F_i

(x) . Our task then is to de-

sign the second level controller that would guarantee that the control system

(25)

Figure 6: Two level control system.

(26)

is globally stable, that is

kxk ≤ B for every t > 0 ,

where B > 0 is an arbitrary constant chosen by design. To this end we append the sliding mode controller u

^sm_i

, i = 1, . . . , n , with a supervisory controller u

^S_i

(x), i = 1, . . . , n , which is zero inside the ball B

_x

= {x : kxk ≤ B} and is activated only when the system’s trajectory reaches the boundary of B

x

. Define the two-level controller as

u

_i

= u

^sm_i

(x) + I

_s

(x)u

^S_i

(x) , i = 1, . . . , n , (38) where

I

_s

(x) =

(

1 , for kxk ≥ B , 0 , otherwise.

We design u

^S_i

such that kxk ≤ B for all t > 0 .

4.2 Stable controller for sliding mode

Assume that the system and model are defined by (1) and (2), and that the model Hamiltonian has the form

H

m

(q

_m

, p

_m

) = 1 2

n

X

i=1

a

mi

p

²_m_i

+ f

m

(q

_m

) , (39) where a

_m_i

> 0 is a constant, f

_m

(·) is a known function, equivalent to a potential energy function. For the plant Hamiltonian H, the state equation for

˙

q

i

= p

i

/C

i

(q) , (40)

where functions C

_i

(q) 6= 0 and such that there exist estimation functions f

_est_i

(e

_q

, q) satisfying

∂H

_m

(e)

∂e

_q_i

p

_i

C

_i

(q) − a

_m_i

p

_m_i

!

≤ f

_est_i

e

_p_i

. (41)

(27)

Furthermore, the plant’s momenta are in the following form:

˙

p

_i

= −f

_p_i

(p, q, a) + Q

^D_i

+ Q

^F_i

, (42) where f

_p_i

(·) is a known nonlinear function polluted by uncertainty.

Then the plant dynamics for i = 1, . . . , n are

˙

q

i

= p

i

C

_i

(q) ,

˙

p

_i

= − ∂H

_p

(q, p)

∂q

_i

+ Q

^D_i

(q, p) + Q

^F_i

(q, p, u) . (43) The reference model for i = 1, . . . , n is

˙

q

mi

= a

mi

p

mi

,

˙

p

mi

= − ∂f

_m

(q

_m

)

∂q

_m_i

+ Q

^D_m_i

(q

_m

, p

_m

, d

_m

) + Q

^F_m_i

(q

_m

, p

_m

, u

_m

) . (44) Equations (43) and (44) give the following error dynamics:

˙

e

_qi

= p

_i

C

_i

(q) − a

_m_i

p

_m_i

,

˙

e

_pi

= − ∂H

_p

(q, p)

∂q

i

+ ∂f

_m

(q

_m

)

∂q

mi

+ Q

^D_i

(q, p)

−Q

^D_m_i

(q

_m

, p

_m

) + Q

^F_i

(q, p, u)

− Q

^F_m

i

(q

_m

, p

_m

, u

_m

) , i = 1, . . . , n . (45) Consider explicitly the terms of the controller u

_i

, see (10):

u

_i

= ∂H

_p

∂q

_i

− Q

^D_i

(q, p, d)

− 1

f

_i¹

f

_i²

(q, p) − ˙ E

_q_mi

+ α

β λe

^α/β−1_q

i

E

_q_i

− E

_q_mi

!

− K

f

_i¹

sgn(s

_i

) + I

_s

u

^S_i

, i = 1, . . . , n , (46)

Model reference control using sliding mode with Hamiltonian dynamics