Model Reference Adaptive Control for Actuators of a Biped Robot Locomotion

(1)

Abstract — Many robotics problems do not take dynamics of

actuators into account in the formulation of the control solution. The fallacy is in assuming that desired forces/torques can be instantaneously and accurately generated. In practice actuator dynamics may be unknown and can have significant transient effect on the overall results. This paper presents a Model Reference Adaptive Controller for the actuators of a robotic biped walker. The actuators learn and produce the torque required for a walking cycle presented in the form of a torque reference model. The actuators of the biped adapt themselves to match with the desired torque reference model with the aid of Lyapunov stability criterion. This control scheme for the biped robot is simulated on a sagittal plane. The results verify the MRAC scheme for the actuators.

Keywords- Model Reference Adaptive Control, Biped walker,

Actuator Dynamics.

I. INTRODUCTION

Biped walking dynamics is highly non-linear, has many degrees of freedom and requires developing highly complicated model to describe its walking behavior. There are number of novel approaches emerging in the field of biped walking to address this complicated control mechanism. Existing biped walking methods [3]-[5], give precise stability control for walking bipeds. However these methods require highly precise biped walking dynamics. In the recent years, biped walking through imitation has been a promising approach, since it avoids developing complex kinematics of the human walking trajectory and gives the biped a human like walking behavior. These methods combine the conventional control schemes to develop the walking mechanism for the bipeds. Examples include, imitation based on intelligent control methods like genetic algorithm [6], [4], fuzzy logic [7] , neural network approach [8], and other methods such as adaptation of biped locomotion [5], learning to walk through imitation [8] and reinforcement learning [4], [7]. But these methods cannot readily adapt their behavior to the changes in the dynamics of the process and the character of the disturbances [2]. Therefore there are adaptive control approaches [10]-[12].

Manuscript received on July 15th_{, 2009.}

Pavan K. Vempaty (corresponding author, pvempaty@oakland.edu), Ka C. Cheok and Robert N. K. Loh are with the Department of Electrical and Computer Engineering, Oakland University, Rochester, MI 48326 USA.

To address the problem of poorly known actuator dynamic characteristics and unpredictable variations of a biped system, we propose a Lyapunov based model reference adaptive control system (MRAC) method for the biped walking control. In MRAC the presence of the reference

model specifies the plants desired performance. The plant (biped actuators) adapts itself with the reference system (desired signals to the actuators). In this paper, reference system signals are represented by torques derived from a simulated biped walking control parameters [1]. Alternatively, we can also capture the walking control parameters from human walking movements using motion capturing devices. Once we have the walking control parameters either derived or captured, the reference model will be implemented based on this control data. The plant will then adapt itself to provided control data (reference model). Lyapunov’s stability criterion is applied in feedback for tuning the parameters to make the error between the reference and the plant tends to zero. Through this scheme, a robot can learn its behavior through its reference models.

II. PROBLEM DESCRIPTION A. Objective

[image:1.612.321.513.564.644.2]

Consider the objective of controlling a biped robot so that it imitates the movements of a person. Due to uncertainty associated with nonlinear actuator dynamics and unknown parameters, an adaptive control scheme will be used to achieve the objective. Fig.1. shows the concept where a model references adaptive control (MRAC) scheme is proposed for a biped robot to imitate human movement.

Fig. 1 Biped Robot, Its Actuators and Human Movements The study presented in this paper deals with the formulation, simulation and experimental aspects of the MRAC actuator scheme.

Model Reference Adaptive Control for

Actuators of a Biped Robot Locomotion

Pavan K. Vempaty, Ka C. Cheok, and Robert N. K. Loh

MRAC Actuator Scheme m

x

M F

(2)

Equations for describing the dynamics of a biped robot have been introduced in [1], and can be summarized as follows.

( )

(

, , ,

)

q =b

A q q&& q q M F& (1)

where qis the generalized coordinates of the robot, Mthe moments/torques applied to the joints in the robot and F the reaction forces at the contact of the robot’s feet and ground. Many literatures assume that M can be readily generated without considering dynamics of actuators. As an example, if a set of desired torques are calculated asM_d, then it would assume that M=M_d and applied directly as inputs to the robot. However, this is not a valid assumption since in practice the moments M will be generated by actuators. We model the moment as states x_a=Mas follows:

( , , )

a= a a+ a a+ a ext

x_& A x B u d q q_& τ (2) where uaare inputs of actuators and d q qa( , ,& τext) represents disturbance torques to the actuators due to robot movements. τ_ext_{is an external disturbance torque. It is}

assumed that we can monitor torques x_a by measuring the currents in the actuators.

The movements of a person is represented by the states r. The control objective can therefore be expressed as the desire to drive a part of q to r.

B. Adaptive Control Approach

[image:2.612.82.309.520.617.2]

The problem here is that we have to deal with unknowns and uncertainties in the dynamics and parameters of the systems described by (1) & (2). An MRAC scheme based on Lyapunov stability synthesis is proposed for dealing with the issue. Fig. 2.shows the adaptation scheme:

Fig. 2 MRAC scheme for the biped walker. III. PRELIMINARIES

A. Dynamics of Walking Biped

The bipedal model in this project has five links connected by pin joints as shown in the figure 3.a. One link represents the upper body and two links are for each lower

limb. The biped has two hip joints, two knee joints and two ankles at the tips of the lower limbs. There is an actuator located at each join and all of the joints are considered rotating only in the sagittal plane. As the system can move freely in the x-y-plane and contains five links, it has seven degrees of freedom. The corresponding seven coordinates are selected according to fig. 3a.

[

x_o,y_o, ,

α β β γ γ

_L, _R, _L, _R

]

T

=

q (3)

The coordinates

(

x y

_o

,

_o

)

fix the position of the center of mass of the torso, and the rest of the coordinates describe the joint angles. The link lengths are denoted as

(

l

₀

,

l

₁

,

l

₂

)

and masses as

(

m

₀

,

m

₁

,

m

₂

)

. The centers of mass of the links are located at the distances

(

r

₀

,

r

₁

,

r

₂

)

from the corresponding joints.

The model is actuated with four moments two of them acting between the torso and both thighs and two at the knee joints, which is shown in fig. 3.b.

M

=

[

M

_L₁

,

M

_R₁

,

M

_L₂

,

M

_R₂

]

T (4) The walking surface is modeled using external forces that affect both leg tips shown in fig. 3b.

F

=

[

F

_Lx

,

F

_Ly

,

F

_Rx

,

F

_RY

]

(5)

When the leg should touch the ground, the corresponding forces are switched on to support the leg. As the leg rises, the forces are zeroed.

Using Lagrangian mechanics, the dynamic equations for the biped system can be derived as shown in (1) where

7 7

)

(

q

X

A

∈

ℜ

is the inertia matrix and

(

)

7 1

,

q

M

F

X

q

b

&

∈

ℜ

is a vector containing the right hand sides of the seven partial differential equations. The closed form formulas for both A and b are presented in [1].

B. Computation of Desired Moments

The desired human movement is represented by

[

α

_m

β

_{l m}

β

_rm

γ

_lm

γ

_rm]T

= −

r

(6) An output y is constructed from the biped feedback info q as

y

=

[

α

β

_l

−

β

_r

γ

_l

γ

_r

]

T

=

C

_y

q

(7) Desired

Torque Generator

Adaptive Actuator Scheme

Dynamics of Biped Robot

r M_d M F

,

(3)

The desired torque is derived as the output of a controller that operates on the error between r and y. That is

[image:3.612.93.559.63.700.2]

M_d( )s =G_c

( )(

s r( )s −y( )s

)

(8)

Fig. 3a Biped robot with the corresponding 7 co-ordinates q=

[

x_o,y_o, ,

α β β γ γ

_L, _R, _L, _R

]

T.

Fig. 3b Biped robot with the moments

[

]

T

R L R

L

M

=

1

,

1

,

2

,

2 and the walking surface

external forces

F

=

[

F

_Lx

,

F

_Ly

,

F

_Rx

,

F

_RY

]

.

C. Dynamics of Actuators

We note that the actuator statesx_a=Mare the torques that will drive the biped robot. The goal is to find u_asuch that M→M_d. We assume that dc motors are used as actuators. It follows that (2) can be decoupled into individual motors representing the first-order stator-rotor dynamics (a_ai,b_ai) that generates output torque (x_ai) while subjected to disturbance (d_ai). That is

[

]

[

]

1 2 3 4

T

a a a a a

T

a a a a a

x x x x

u u u u

= =

x

u

,

{

}

{

}

1 2 3 4

, , ,

a a a a a

T

a a a a a

diag a a a a

diag b b b b

= =

A

B

[

1 2 3 4

]

( , , ) T

a ext = da da da da d q q_& τ

IV. MODEL REFERENCE ADAPTIVE CONTROL A. Configuration of MRAC Actuator

The adaptive actuator scheme is shown in Fig 4 below, where the reference model is specified by

m = m m+ m m

x& A x B u (9) and the control to the actuator by.

ˆ

a= a+ m

u Lx Nu (10)

where ,

{

}

{

}

1 2 3 4

, , , , , ,

diag l l l l

diag n n n n

= =

L

N

B. Input Estimator

In the MRAC approach, the movements of a person will represent the states r of the reference model

m = m m+ m m

x& A x B u . E.g., xm might be represented by

[

α

m

β

l m−

β

rm

γ

lm

γ

rm]T. However, we would not

know the inputu_m, which is required for the MRAC. Hence a problem in the approach is to estimate the unknown u_m knowingx_m. The unknown u_m can be represented as the output of an waveform shaping model, i.e.,

u u u u

m u u

= +

=

x A x w

u C x &

(11)

where A_u&C_u represents approximate waveform characteristics and wuis a sparse & small shaping input. 2

m

1

m

0

m

1

l

0

l

2

l

2

r

1

r

0

r

α

L

β

R

γ

(

x

0

, y

0

)

x

1 L

M

R1

2 R

M

2

L

M

F

Rx

Lx

F

Ly

F

Ry

(4)

[image:4.612.81.555.64.715.2]

Fig. 4 The MRAC with the input predictor.

E.g., (1) is a so-called

αβ

model if 0 1

0 0

u

 

= 

 

A &

[

1 0

]

u =

C . The reference model and shaping model can be augmented as

mu mu mu mu u

m mu mu

= +

=

x A x B w

x C x &

(12)

where , ,

0

m m m u

mu mu m

u u

     

=  =  = 

 

   

x A B C 0

x A B

x A I

& C_mu =

[

I 0

]

; 0&I are null and identity matrix, respectively, of appropriate dimensions. An estimate of the of u_m can be found using an observer of the form

[

]

[

]

ˆ ˆ ˆ

ˆ ˆ

mu mu mu mu m mu mu

m m mu

= + −

=

x A x K x C x

u 0 C x &

(13)

ˆ_m

u is used as the input to the MRAC as shown in (10)

C. MRAC

The adaptation algorithm in the MRAC adjusts the gains L and N based on Lyapunov stability design. In design, we assume that the disturbances are negligible. It follows from

the Lyapunov design that the gains

{

1, 2, 3, 4

}

diag l l l l

=

L &N=diag n

{

₁, n₂, n₃, n₄

}

are adjusted according to

1 (

)

1 ˆ

(

)

i i di ai ai

ai i

i i di ai mi

ai i

l

p M

x

b q

n

p M

x

u

b r

=

−

= −

−

(14)

where we have used the notation

[

]

[

]

1 2 3 4

ˆ ˆ ˆ ˆ ˆ

T

d d d d d

T

m m m m m

m m m m

u u u u

= =

M

u

(15)

D. Analysis of Error Dynamics

In the ensuing analysis we will assume that the uˆ_m

generated by the input observer converges to the unknown m

u . Define the errors e between the actuator torque a=

x M and the desired torque x_m=M_d as

a m

= −

e x x (16) It can be shown that

[

]

[

]

ˆ ( , , )

m a a m a

a m m a ext

= + − −

+ − +

e A e A B L A x

B N B u d q q τ

&

& (17)

E. Lyapunov Stability Analysis

Define a candidate for a Lyapunov function as

(

) (

)

(

) (

)

T

a a m a a m

T

a m a m

v trace trace = + − − − − + − − e Pe

A B L A Q A B L A

B N B R B N B

(18)

where P=PT >0, Q=QT >0 and R=RT >0 are positive definite matrices. The

(

)

(

)

(

)

(

)

( )

(

)

2 2 T T T

a a m a

T

a m a

v trace trace = + + − − − + −

e Pe e Pe

A B L A Q B L

B N B R B N & & & & & (19)

(

)

(

)

(

)

(

)

(

( )

)

(

)

2 ( , , ) 2

ˆ 2

T T T T

m m m a ext

T T

a a m a a

T T

a m am a

trace trace   = _ + _ + + − − + − + − +

e PA A P e e A d q q τ

A B L A Pex Q B L

B N B Peu R B N &

& &

Reference Model Input

Predictor

N

L

∑ Actuator _Dynamics

Adaptation Mechanism

( , , )

a ext d q q_& τ

a = x M m

u

m = d

x M

ˆ_m

u

(5)

From inspection, we choose 1

1

ˆ

T

a a

T

a am

−

= = −

B L Q Pex

B N R Peu &

& (20)

so that

2 ( , ,

)

T T T T

m m m a ext

v

_&

=

e



_

PA

+

A P e



_

+

e A d q q

_&

τ

₍₂₁₎

We note that expansion of (20) leads to the (14). We next choose an S=ST >0 and solve P from

PA

_m

+

A P

T_m

= −

S

(22)

We arrive at

2 ( , ,

)

T T T

m a ext

v

_&

= −

e Se

+

e A d q q

_&

τ

₍₂₃₎

where

v

&

is negative under the assumption that

2 ( , ,

)

T T T

m a ext

>

e Se

e A d q q

_&

τ

₍₂₄₎

or when error is more significant than the effect of disturbance. Hence we claim the overall dynamic system comprising of (17) and (20) has a candidate function that satisfies the Lyapunov stability criterion under condition (24).

V. SIMULATION RESULTS A. Description of Simulation Test

The proposed control scheme for the 5 link biped model consisting of (1)-(10) and (13)-(14) is simulated. The biped is excited with a walking cycle that is manually developed and provided as the desire human movement r. This walking cycle is repeated for the subsequent steps for every 1.5 seconds. Biped walking is tested for 3seconds. The desired torques M_d =

[

M_dL₁ M_dL₂ M_dL₃M_dL₄

]

T are generated using Eq. (8). Fig.5. shows the output

[

]

T

L L L

L

M

=

₁ ₂ ₃ ₄ of the adaptive

[image:5.612.78.466.54.689.2]

actuator scheme adapting itself with the reference input

M

_d. The first interval, 0 to 1.5 (s), of Fig. 5.1 to Fig 5.4 shows the adapted torques

M

oscillates and adjusts to converge to the desired torques

M

_d. The second interval, 1.5 to 3.0 (s), shows convergence settling smoothly to desired reference. Fig.5.5 illustrates the result of the biped walking behavior from the adapted torques M. Fig. 5.6 plots the height of the biped torso.

Fig. 5.1 Adaptation of M_L₁ to M_dL₁.

Fig. 5.2 Adaptation of M_L₂ to M_dL₂.

Fig. 5.3 Adaptation of ML3 to MdL3.

[image:5.612.327.523.63.652.2]

(6)

Fig. 5.5 A 2-step simulation of biped walking on the sagittal plane with the Y-axis representing the height of the biped

[image:6.612.116.276.161.313.2]

torso

y

_o.

Fig. 5.6 Height of the biped torso

y

_o.

Equation (2) includes disturbance torques d q q_a( , ,_& τ_ext)

[image:6.612.101.292.433.543.2]

which represents torque feedback and other external moments. This causes the dynamics of the actuator to vary. In the simulation, we disturbed the biped walker by introducing two sharp impulses as shown in Fig. 5.7. Fig. 5.8 shows that the biped recovers from the impact due to the external disturbance.

Fig. 5.7 Height of the biped torso with and without disturbance torques d q q_a( , ,_& τ_ext)_{, simulated over 6 (s).}

Fig. 5.8 Simulated external disturbance to (2), at t=0.65(s) and t=1.77(s).

VI. CONCLUSIONS

In this paper, we presented an MRAC technique to ensure that the actuators faithfully produce desired torques necessary for a walking robot. An observer was used to predict an anticipated state of the desired torque, thus cause the adaptive actuators to anticipate motion torques. We provided a proof to show that the MRAC scheme results in a stable system in the sense of Lyapunov when errors between the desired and actuator torques are significant. Simulation results verify that the system is robust when tested with various parameters and unknown coefficients. We plan to implement the MRAC scheme for an actual humanoid biped robot to follow the gait of a person in future work.

REFERENCES

[1] Olli Haavisto, Heikki Hyotyniemi, “Simulation tool of biped walking robot model,” M.S. Thesis, Helsinki University of Technology, Espoo, Finland, 2004.

[2] Karl J. Astrom, Bjorn Wittenmark, “Model-Reference Adaptive Systems,” in Adaptive Control, 2nd_{ed., Boston: Addison-Wesley} Longman, 1994, pp. 206-235.

[3] Shuuji Kajita, Fumio Kanehiro, Kenji Kaneko, Kiyoshi Fujiwara, Kensuke Harada, Kazuhito Yokoi and Hirohisa Hirukawa, “Biped walking pattern generation by using preview control of zero-moment point,” Proc. of IEEE Int. Conf. on Robotics & Automation, vol. 2, pp: 1620-1626, 2003.

[4] Seungsuk Ha, Youngjoon Han, Hernsoo Hahn., “Adaptive gait pattern generation of biped robot based on human’s gait pattern analysis,” Proceedings of the International Journals of Mechanical System Science and Engineering, vol. 1, no. 2, pp. 80-85, 2008. [5] Jun Nakanishi, Jun Morimoto, Gen Endo, Gordon Cheng, Stefen

Schaal, Mitsuo Kawato., “ Learning from demonstration and adaptation of biped locomotion with dynamical movement primitives,” Proceedings of the 4th_{IEEE /RAS International}

Conference on Humanoid Robotics, vol. 2, pp. 925-940, 2004. [6] Kristen Wolff, Peter Nordin, “Evolutionary learning from first

principles of biped walking on a simulated humanoid robot,” Proceedings of the business and Industry Symposium of the Simulation Technologies Conference ASTC’03, pp. 31-36, March 30th – April 3rd 2003.

[7] Changjiu Zhou, Qingchun Meng, “Dynamic balance of a biped robot using fuzzy reinforcement learning agents,” Fuzzy sets and systems archive, vol. 134, issue 1, pp 169-187, 2003.

[8] Poramate Manoonpong, Tao Geng, Toman Kulvicius, Bernd Porr, Florentin Worgotter., “Adaptive, Fast Walking in a Biped robot under neuronal control and learning,” PLos Computational Biology, 2007.

[9] Rawichote Chalodnan, David B. Grimes, Rajesh P. N. Rao., “Learning to walk through imitation,” Proceedings to the 20th International Joint Conference on Artificial Intelligence, pp. 2084-2090, 2007.

[10] Jerry Pratt, Chee-Meng Chew, Ann Torres, Peter Dilworth, Gill Pratt., “Virtual model control: an intuitive approach for bipedal locomotion,” Proceedings of the International Journal of Robotics Research, vol. 20, no. 2, pp. 129-143, 2001.

[11] Eugen Bobasu, Dan Popescu., “Adaptive nonlinear control algorithms of robotic manipulators,” Proceedings of the 7th_WSEAS

International Conference on Automation & Information, pp. 83-88, 2006.