Time-Varying Optimal Control

Another of stating the problem given by Eqs. (9.10) and (9.11) '. th

we WIsh to IS at

-x(k

+

+ +

ruCk)= O. k= 0.1. . ". N. [9.101 This a standard constrained-minima problem which can be solved usin the meth,od of multipliers. There will be one Lagrange vector. we call

+

I).for each value ofk, The proc d .

rewnte Eqs. (9.10) and (9.11) as e ureISto

+

= -x(k

+

+ +

^rUCk)= O. state equations. and [9.IOJ

ax(k) = xT

(k)QI - (k)

+

= O. adjoint equations. (9.14) The set of the equations. the adjoint equations, can be written as the back-ward difference equation

+

Qlx(k).

Restating the results in more convenient fonns, we have from Eq. (9.10) x(k

+

IJ=

+

ru(k).

I If the T, long. a control that moves the state along rapidly as mightbefeasible. Such comrols are called because they the state to a dead stop in most They correspondtoplacement of all poles o.See Problem 8.23.

2 equivalenr ofa number; it ensures thatxTQ1Xand are nonnegative for all possible x and u.

u=-Kx

that was used in Chapter8and illustrated by Eq. (9.2) in Example 9.t.

Given a discrete plant

x(k

+

¹⁾⁼

+

ruCk).

•

we wish to picku(k)so that a cost function

= -I (k)Q1X(k)

+

u^T 2

is minimized. Q, and Q, are symmetric weighting matrices to be selected by the designer. who bases the choice on the relative importance of the various states and controls. Some weight will almost always be selected for the control 0):

otherwise the solution will include large components in the control gains. and the states would be driven to zero at a very fast rate. which could saturate the actuator device.I The must also be nonnegative definite,' which most approximation in the system model used in the estimator: therefore. the estimation errors will still approach zero for stable estimator roots.

In short. it often useful to apply your knowledge of the physical aspects of the system at hand to break the design into simpler and more tractable subsets.

With luck. the whole job can be finished this way. At worst. insight will be gained that will aid in the design procedures to follow and in the implementation and checkout of the control system.

Optimal control methods are attractive because they handle MIMO systems easily and aid in the selection of the desired pole locations for5150systems. They also allow the designer to determine many good candidate values of the feedback gain,K.using very efficient computation tools. We will develop the time-varying optimal control solution first and then reduce it to a steady-state solution in the following section. The result amounts to another method of computingK in the centroIlaw Eq.(8.5)

9.2

cost function

366 Chapter 9 and Optimal Control

which is often rewritten as

where

Now we use Eq, (9.16) forx(k

+

S(k)x(k)=

+ +

ru(k»)

+

Q1x(k).

Next we use Eq. (9.21) for u(k) in the above

S(k)x(k)=

+

1) - rR-1r'S(I:

+

Q,x{k).

and collect all terms on one side

92 Optimal Control 367

[S(k) -

+ + +

^I)fR-'rTS(k

+

- Q,]x(k)= O.

(9.22) Because Eq. (9.22) must hold for anyx(k).the coefficient matrix must be iden-tically zero. from which follows a backward difference equation describing the solutionofS(k)

M(k

+

I)= S(k

+

I) - S(k

+ +

rTS(k

+

l)rr'rTS(k

+

I). (9.25) Equation (9.23) is called the discreteRiccatiequation.Itis not easy to solve because it is nonlinear in S. But note that the matrix to be inverted in Eq. (9.25)

R.has the same dimension as the number of controls. which is usually less than the number of states.

The boundary condition on the recursion relationship forS(k

+

I) is obtained from Eq. (9.19) and Eq, thus

u(k)= -K(k)x(k), where

1. LetSiN)

=

Q, andK(N)

=

2. Letk= N.

and we see now that the problem has been transformed so that the solution is described by the recursion relations Eq. (9.24) and Eq. (9.25) with the single boundary condition given by Eq. (9,26). The recursiou equations must be solved backwards because the boundary condition is given at the endpoint. To solve for u(k).we use Eq. (9.21)toobtain

K(k)=

+

rTS(k

+

and is the desired "optimal" time-varying feedback gain.

Let us now summarize the entire procedure:

Riccati equation

(9.211 (9.2C11

= S(k)x{k)

Solving foruCk),we obtain

u(k)=

+

fTS(k

+

= _R-1rTS(k

+

In Eq. we have defined

+

rTS(k

+

for convenience. If we now substitute Eq. (9,20) into Eq. (9.15) for and

+

I). we eliminate Then we substitute Eq. (9.21) into Eq. (9.16) to elimi-natex(k

+

I) as follows. From Eq. (9,15). we have

+

Q,x(k),

This definition allows the transformation of the two-point boundary-value prob-lem in x and to one in S with a single-point boundary condition. With the definition Eq. (9.20). the control Eq. (9.13) becomes

Q.u(k)= -rTS(k

+

I)x(k

+

= -rTS(k

+ +

fu(k).

and substituting Eq. (9.20). we have

S(k)x(k)=

+

I)x(k

+

Q,x(k).

and from Eq. (9.15) we can describe

+

I) in the forward difference equation form

Equations (9.16), (9.17), and either (9.15) or (9.18) are a set of coupled difference equations defining the optimal solution of x(k), and u(k). provIded the initial (or final) conditions are known. The initial conditions onx(k) must be given: however. usually would not be known. and we are led to the endpoint to establish a boundary condition for From Eq. (9.11) we see that u( 1 should be zero in order'to minimize:J because urN) has no effect on x(N) Eq. (9.IO)]. Thus Eq. (9.13) suggests that

+

I)= O. and Eq. (9.14) shows that a suitable boundary condition is

= Q,x(N). (9,191

A set of equations describing the solution to the optimal control problem is now completely specified.Itconsists of the two difference equations (9,16) and (9.15) with u given by Eq. (9.17). the final condition on given by Eq. (9, 19). and the initial condition on x would be given in the problem statement. The solutlon to this two-point boundary-value problem is not easy.

One method. called the sweep method by Bryson and Ho (l975). is to assume

sweep method

0 3 Q₂=0,(

2 I

0 k

4,5

4.0 ^Q²=0.01

1.0 O.S

•

9.2 Optimal Control 369

3.5 3.0

2.0

81.5

chosen to be 10ngeL the end characteristics would have been identical. and the constam gam portion would have existed for a longer time.

. The fact that the gain over the first portion of the example was constant is typIcal of the. optimal gains for all constant coefficient systems, provided that the problem time IS long enough. This means that Ihe optimal controller over the early. conslant-gain portion identical to the constant-gain cases discussed Chapter 8 and Section 9.1 except that the values of the constant gain.K.are based the mtmmlzatlOn of a cost funclion rather Ihan a computation based on speCIfied root locatIOns. We could also view result as a method to find Figure9.3

Example of control gains versus time, Example

(9.301 (9.291

[9.27J [9.16J

u(k)= -K(k)x(kl.

= 0.01, 0.1. and 1,0.

x(k

+

I)=

+

ru(k).

and plot the resulting time histories ofK.

where

which means the angle state is weighted but not Ihe angular velocity. Choose the control weighting matrix. a scalar in this case because there is a single control input, have three

Solution. Equations through (9.281 need be solvedforthe system transfer function G(s)= 1/ The problem length for purposes of defining:J was chosen to be 51 steps. which.

the sample period ofT=O. I sec, means that the total time 5.1 sec. This time chosen long enough so that it was apparent that the gains were constant over the initiallime period.

Figure9.3contains the resulting gain time histories plotted by the computer. We see from the figure that the problem length affects only the values ofK near the end, and in fact. the first portions of all cases show constant of the gains. If the problem length had been

3. Let M(k)= S(kl -

+

4. LetK(k - I)=

+

5. Store K(k - I).

6. LetS(k - I)=

+

Qj' 7. Letk= - I

S. Go to 3.

For any given initial condition forx.toapply the control. we use the stored gains K(k) and

Note that the optimal gain. K(k). changes at each time step but can be pre-computed and for later use as long as the length. N. of the problem is known. This is so because no knowledge of the initial statex(O)is required for computation of the control gainK(k).

• Example9.3 of

Solve for the time K for the satellite attitude-control example described in Appendi., A. I.Choose the stale weighting matrix to be

368 Chapter 9 Optimal

370 Chapter and Optimal Control

(9.32)

(933)

In document Digital Control of Dynamic Systems (1998) (Page 194-197)