Application of digital computers to on line optimisation

(1)

University of Warwick institutional repository: http://go.warwick.ac.uk/wrap

A Thesis Submitted for the Degree of PhD at the University of Warwick

http://go.warwick.ac.uk/wrap/39027

This thesis is made available online and is protected by original copyright. Please scroll down to view the document itself.

(2)

APPLICATION OF DIGITAL COMPUTERS

TO ON-LINE OPTIMISATION

r r

J. MONK B. Sc. (Eng)

Thesis presented for

the Degree of Ph. D.

School of Engineering Science

I

The University

_{of , Marwick}

(3)

Abstract

This _thesis _{is concerned} _with _hill _climbing _systems which _{may be used for} _parameter _{optimisation.} _{A simple}

system is described _which _{uses the slope} _{of the cost} function _{to approach} _{the zero slope}

condition _correspond- ing _{to a unique} _optimum

of a system characteristic.

A theoretical _analysis _using _estimation _theory _methods

is applied _{to a range of waveforms} _{to assess their} _value in slope estimation _{in the presence} _{of disturbances} _and system irregularities. The principal _signals discussed

are two and three level maximal-length _sequences and square _{waver . £inc waves are used for comparison}

purposes. More complex optimisers _using higher _order models of the cost function _{are also} _considered _with

particular _reference to the use of the special _properties

of three level _{maximal-length} _sequences. The results _of the theoretical _studies _{are confirmed} _{by a series} _of

experiments _using _{an internal} _combustion _engine _test _rig and a 1escription _{of the preparation} _{of the test} _rig is

given. Hill _climbing _{was carried} _{out on the ignition} angle to obtain _{maximum power at full} _throttle _and

(4)

Contents

Chapter 1 Introduction

1.1 _{The Principles} _{of Optimising} _Controllers _1-02

1.2 _{The Choice} _{of Perturbations} 1-03

1.3 _{Summary of Research} 1-06

Chapter

-2

Basic Optimisers

2.1 _{Dynamic Performance} _{of the Optimiser} 2-02 2.2 _Estimation _for _{a Noise} _{Free System} 2-07

2.3 _Effects _{of Noise} 2-11

2.4 _Compensation for _{Low Frequency} Drift 2-21 2.5 _Effect _{of Non-Linearities} _{on Identification} 2-26 2.6 _Effect _{of High Frequency} _{Perturbations} 2-29 2.7 _Application _{of Particular} _Waveforms 2-32

2.8 _Conclusions 2-45

Chapter 3 _Further _Linear _{and Non-Linear} Optimisers

3.1 Higher Order Linear Optimisers 3-02

3.2 _Non-Linear _Optimisers 3-10

3.3 _Conclusions 3-25

Chapter 4 Engine Instrumentation, Modelling _{and Programmes}

4.1 _{Engine Instrumentation} 4-03

4.2 _Modelling _{and Control} _{of Test Rig} 4-10

4.3 _{Engine Operating} _Programmes 4-41

Chapter 5 Experiments _{and Results}

5.1 _General 5-02

5.2 Preliminary Experiments 5-04

(5)

Chapter b Conclusions

Acknowledgements i

References ii

Symbolic Notation vii

Appendix Al Matrix Inversion

A1.1 _Inversion _{of ciI + bJ Matrix} A-01

A1.2 _Inversion by Partitioning A-01

A1.3 Inversion _{of a General Form} A-02

A1.4 Inversion _{of a Second General} Form A-03

Appendix A2 Detailed Operation _{of Instruments}

A2.1 Ignition Timing Unit A-06

A2.2 Torque Transducer A-07

A2.3 Precision Monostable A-11

Appendix A3 Fourth Order Auto-Correlation Function of r"

Three Level Maximal-Length Sequence A-14

Appendix A4 Detailed Results

A4.1 Pseudo-Random Binary Sequences A-16

A4.2 Pseudo-Random Ternary Sequences A-25

(6)

1-01

CUAPTER 1

Introduction

The advent of cheap and reliable digital _computers _suitable for _on-line _process _control _{has led to a new versatility} _{in the}

application _{of control} theory. In addition _{to conventional} _control, these _computers _enable _complicated _process _optimisation _{schemes to} be implemented _with _relative _ease. _{The high working} _{speed charac-}

teristic _{of computers} _{makes it} _possible _{to complete} _{complex modifi-} cations to the process data as it is received _{and then apply}

statistical principles to increase the accuracy of itsanalysis.

Optimisers _{may be designed} _{to modify} _process _parameters in response to changes in operating conditions. In this way, the system para-

meters will be adapted to compensate for lack _{of knowledge,} changes in process behaviour and environmental conditions. Further infor-

mation _about the process _{and the environment,} _{which may not have been} available for _optimiser design, _{can be developed} _on-line by observing

(7)

). -02

1.1 The Principles _{of Optimising} _Controllers

The three _major _procedures _{in an optimising} _controller _are

identification, _decision _{and modification.} _Firstly, _{a model of the}

performance curves is developed by observing the process under

control. The model may be extremely _simple _{and only} _valid over a small range _{of operating} _conditions. In some cases it is possible

to gather sufficient information for the model under normal operating conditions, but more generally, known probing signals are introduced

to perturb the process and the results used in the identification of the system. The amplitude _{of the perturbation} must be small to

minimise any disturbance to the process.

An evaluation of the performance is then made and the decision

procedure uses an adaptive algorithm _{or policy} to locate possible improvements _{in performance.} _Normally, _{the procedure} _{uses the}

current _{and possibly} _previous system models to evaluate a particular function _{of the process} _variables _which in some way represents

efficiency or cost. The maximum or minimum of this function will then represent the optimum performance. Most of the optimisation

methods _{used to locate} this optimum, _{such as conjugate} gradient3l, require the value _{of the first} derivative but this is not always essential32.

Finally _{the modification} _procedure _{makes the necessary} _changes

in the process parameters, providing these lie within the operating

(8)

1-03

1.2 The Choice _{of Perturbations}

Chapter 2 describes _{a simple} _optimiser _which _directly

identifies _{the slope of the cost function} _{at the operating} _point. the

This result is _{then proportional} to/change in operating _point taken to drive _{the system towards} _{the maximum.} Draper _{and Lia have}

obtained the estimate _{of the gain of the system being} _optimised, (which _is

the same as the first derivative _{of the cost} function), by perturbing the input _variable _using _{a sinusoidal} _perturbation _and phase detecting the response. Douce and King29 introduced _square waves as perturbations _{to simplify} the detection _problem. The

necessary multiplication _{and integration} is then reduced to a simple sign change and integration as the square wave has only two discrete

levels. _Roberts28 _{showed that} _{the sine wave is a preferable} _signal to square waves and random telegraph codes for a hill climber. In all these _methods, _{the output} _{of the phase detector} is directly

integrated _{and fed back to the input} _variable _{to achieve} _{optimisation.} Douce and Bond30 have shown that the introduction _{of a sample and}

hold _element _suspending input _variable _adjustments _eliminates

parameter excursions and allows higher loop gains.

Van der Crintenl _{has proposed} _{an alternative} _{method using}

stochastic signals for the perturbation. Correlation techniques are used to give an estimate of the integral of the system impulse

response _which is proportional _{to the required} _{change in parameter} setting. lie has also suggested _using _{a binary} _equivalent _{of noise} but major difficulties _{are encountered} _{due to statistical} fluctua-

(9)

1-04

excessive storage. Douce and Ng2 have proposed eliminating these

errors and restrictions by using pseudo-random binary sequences in

place of the truely random signals. These sequences are periodic

but have an auto-correlation function similar to white noise. They

are easily generated with shift registe_rs4 or the programmed equiv-

alent so that delayed versions need not necessarily be stored.

Pseudo-random binary _sequences _{are a sub-set} _{of the class} of maximal-length sequences. Sequences with a higher number of levels but similar auto-correlation functions may be generated using the

same principles. The impulsive auto-correlation function implies that _{the perturbation} _{of a linear} system and correlation of its

response gives the system impulse response directlyli and that the system gain can be derived by integration for use in the hill

climber. Fig. 1.2.1 shows the auto-correlation obtained with a

first _order _{system using} _two, _three, five _{and seven-level} sequences. For two and three level sequences, the correlation algorithm reduces

(10)

1-05

COAM1ä. ý4t 19N

b4 t11 r

Orr

Z Le-Jc6 SEQKLNtE

LgN4TH _3!.

x

it

SHIF1

x SKous tº+E rtc+wýý or cnosý-coRatt, ýYýoa

Fig. 1.2.1 Auto-correlation _{and results} of cross-correlation experi-

[image:10.2442.233.2245.158.3281.2]

(11)

1-06

1.3 _{Sun'm iry of Research}

The research for this thesis was primarily concerned with the

effect _{of the choice} _{of perturbation} _signal _{on the performance} _{of a} single parameter _optimisation _scheme. Chapter 2 surveys _{and analyzes} the use of a range of waveforms _{as perturbations} for _{a simple} _opti-

miser with _particular _reference to implementation _using _{a digital}

computer. The study is concerned with the effect _{on the system} _gain estimate of non-linearities, the selection _{of higher} frequency

perturbations _{and the presence} _{of steady} state levels, _{slow drifts} and stochastic disturbances _{at the output} _{of the system.} Methods for _{the elimination} _{of errors} _{due to drifts} _{and steady} _state levels are surveyed with particular reference to pseudo-random binary

sequences. A matrix. notation has been introduced into the analysis wherever possible as it provides results in a form directly

applicable to digital algorithms.

Chapter 3 considers some of the alternative optimiser decision

procedures which could be implemented. The effects of linearly filtering _{the estimates} _{of system} _{gain by introducing} _additional

dynamic _terms into _{the optimiser} _computing _elements _{arc investigated.}

Secondly, _{the two derivative} hill _climbers introduced by Godfrey _and Clarke27 _{are discussed.} _{These utilise} _{the special} _properties _of

pseudo-random ternary _sequences _which _allow _{the simultaneous} estimation _{of the first} _{and second derivatives} _{of a quadratic} _cost

function26 _{on a particular} _system _{configuration.} _{A study} _{is made of} the alternative _estimation _{schemes which} _could _{be used in this} typed hill _climber _{and some of the restrictions} _noted. _{The advantages} _and disadvantages _{of extending} _these _principles _{to higher} _order _cost

(12)

1-07

The theoretical _recults _{of these} _{two chapters} _{are then applied} to a working _process. Van der Grinten _{had carried} _{out practical}

studies _{on a gas fired} boiler33 _{and Draper} _{and Lia had obtained}

meaningful results _using _{a special} internal _combustion _rig. The

most suitable _process _available _proved _{to be an internal} _combustion engine test _rig. Several _{new instruments} _{were developed} _{to achieve} the full _{use of the potential} _{speed of a digitally} implemented _opti- miser, and a mathematical _{model of the test} _rig _{was built} _{as an aid}

in designing _{the on-line} _controllers _{and the optimisers.} _Finally _a

complete suite _{of programmes} _{was written} to allow _on-line _running, control and optimisation _{of the engine} test _rig. This work is

described _{in Chapter} _{4 and fig.} _1.3.1 _{shows the fully} _instrumented engine test rig.

Enquiries _within _{the motor industry} _{showed that} _{the adjustment}

of the ignition angle is currently based on a function _{of the speed}

and the inlet _manifold _pressure. The relationship between ignition

advance and speed is found by running the engine at full throttle

for a particular loading _{device setting} _{and manually adjusting} _the

-ignition angle to give maximum power. This is then repeated for

several values of load settings and a mechanical device designed to

adjust the ignition setting according to the results.

An automatic experiment was implemented in which the engine

could be held _{at constant} speed while the throttle _{was opened and then} the power maximised by adjusting the ignition _setting. The relation-

ship between _{the speed and the ignition} _angle _{to give maximum power} was than established by repeating _{the experiment} _{at a series} of

different _speeds. _{The results}

of a series of these experiments

(13)

[image:13.2447.39.2389.93.3450.2]

(14)

1-09

and square wave perturbations _{with many different} _apLimiscr

parameter _settings _{are reported} _{in Chapter 5.}

The principal _{contributions} of this thesis are the survey,

unification _{and amplification} of results concerning the

application _{of specific} waveforms _{- particularly} pseudo-random

sequences _{and square} _{waves - in the identification} of system

parameters. This is achieved, where relevant by the application

of matrix methods and results of estimation theory, the validity

of these methods has been tested by applying the results to a

working process.

(15)

z- 1

CHAPTER 2

Basic _Optimisers

Introduction

In this chapter, a dynamic model of the optimiser is developed

in the form of a feedback control system and by making assumptions

of linearity, conventional control theory is applied to derive

stability criteria, transient _performance _{and the effect} _{of process}

disturbances _{on the system output.} _{The model assumes that the slope}

of the cost function _{can be determined} exactly in the absence of

noise, but it is shown that the presence of low frequency drifts,

non-linearities _{or the use of an incorrect} perturbation frequency may

introduce _errors _into _this _estimate. _{As these errors and those}

caused by process noise are all a function of the perturbation and the estimation method used, a range of specific perturbation signals are studied. The matrix methods previously applied by Clarke9 to

(16)

2- 2

2.1 Dynamic _Performance

of the Optimiser

2.1.1 _{Model of the Optimiser}

A schematic diagram of .1 simple optimiser operating on a

system is given in fig. 2.1.1. By introducing _{a small} _perturbation into _{the system and noting} _its

effect _{at the output,} _{an estimate} _of the slope _{of the cost} function _{may be computed.} _All _past _values _of the estimate _{of the gain of the system are summed to give} _{the latest} operating point _{and on receipt} _{of a new estimate} _{of gain,} the

operating point is incremented. If _{the estimator} is used immediately after _{a change in operating} _point, _{the output} _will _contain _{a trans-}

ient _component _{in addition} _{to the response} _{to the perturbation.} _A true _value _{of the system gain will} _{not be available} _until _{the trans-} ient _{has died away and the performance} _{of the optimiser} _will _be

severely dependent _{on the detailed} dynamic _{characteristics} _{of the} system. The optimisers discussed therefore _{conunence esti.} _ation _of the system gain when the system has settled after a change in

operating _point has occurred. The time interval between _estimates is then not less than the sum of the miuimm system settling time

and the perturbation _period. The model can therefore be treated as a sampled data _{system with} _{a sampling} _{time given} by the period

between _successive _estimates.

For analysis, the system dynamics, _{the cost function} and the

estimation _procedure _{are replaced} by a device which samples the

operating _point _{of the system and produces} _{the gain estimate} _one sampling instant later. This device _will be a time delay and a

(17)

2- 3

PERTUtU _{J TION}

I 14

rCAIXATort

OCCMTýýJG '

? Y; TCM DYNAMICS

PotNT, O. '

H

CoiTýýFUNCTION

AcTu*t. Slope

5LC P1

LSTMATOPs

Fig. 2.1.1 - Schematic diagram of a simple optimiser

Fig. 2.1.2 _{- Optimiser} _{as a control} system

[image:17.2442.341.2089.325.1073.2] [image:17.2442.427.2061.1585.2278.2] [image:17.2442.446.2080.2705.3399.2]

(18)

2- 4

cost functioa. The model of the optimiser _{and the system is shr'-n}

in fig. 2.1.2 _{and has been drawn as an ordinary} _{closed loop control}

system to enable conventional _control _analysis _{to be applied.} The

input _{to the control} _system, _{(the desired} _slope), _will

normally be

zero but may be used to temporarily disturb _{the system from the}

optimum to observe its transient _{response performance.}

2.1.2 _{Dynamic Response}

The analysis _{may be simplified} _{by assuming a quadratic} _cost

function _{and consequently} _{a linear} _relationship _between _{the esti-} mated gain and the operating point. The slope of this transfer

characteristic will be given by the second derivative of the cost

function _{and will} _therefore _{be positive} _{when searching} _for _{a minimum} and negative _{when searching} for _{a maximum.} The linear z transform

model of the optimiser is shown in fig. 2.1.3, _{where the integrator} has gain g and the second derivative _{of the cost} function is K, a

constant. The forward _path transfer function is given by

Kg ýz-1)

and therefore _{the closed loop transfer} function is

Kg

z- (1 + Kg)

which will be stable when the pole at (1 + Kg) lies within the unit

circle, that is

-2< Kg < 0.

(19)

2- 5

fig. 2.1.4. _{When Kg equals -1, only one step is required} to reach

(20)

2- 6

" o$

2.1.4 _Dy

t,,, M*, G _{ýilýiOýM4Ntl.} _Oý 4. _L _?ýsr aLtj,

(21)

2- 7

2.2 Estimation _{for a Noise Free §Istýms}

2.2.1 Basic Estimators

The sampled response <y> of a dynamic system perturbed by a

sequence is given by the convolution integral,

00

y(mA) a1 h(t)u(mX _{- t)dt,} 0

where X is the time between samples and h(t) is the system impulse

response.

If _{the generator} _{of holds} its _value between _sampling intervals, then

co

ym ° jhnum-n,

n=1

nA

where hn

J

h(t. )dt

(n-1) A

and is termed the weighting sequence.

An optimiser requires the steady state gain of the system, which is

given by the final value of the system output in response to a unit

step. This is equivalent to

co

J

h(t)dt r

hn a a, say.

itý1 0

(22)

2- 8

and this corresponds to PJ samples then

N

ym "I

hnunrn

ni*1

which maybe written in matrix notation _as

N1 uN

J2 U1

yN JL UN-1

or more compactly,

y-

Uh.

Solving for it assuming U to be of rank 11,

h°U-

is

or alternatively prenultiplying both sides by U',

U Ih _aU Inu-

(2.2.1)

(2.2.2)

and assuming U'U is of rank 11 gives a matrix form of the Wiener

Hopf equation

hI

_{U'Ul ýý U'; ýý}

UN_i . ui hl

uIy _. U2 h2

.

ui "N hN

(2.2.3)

(23)

2- 9

NI

1

ý0 +1 +2

' ýN-1

4i 40

"

02 _"

"

4o +i

.

ý1J-1 _414o

I

and ým is the autocorrelation function of for a shift _{of m and} is given by,

1N

Om °NI _{anum+n ""} n=1

From the solution of these equations, the final value of the step response, s, is given by

s=1

'h,

, where 1 represents a column vector with unit elements. The compu-

tation _{of a will} be trivial _{when U'U is a scalar} _matrix.

2.2.2 _{The Effect} _{of Non-Zero} Steady State System Output _on

Slope Estimation

because of the non-linear cost function, _variations in the

parameter settings of the optimiser will alter the system gain and

also the steady state output level. For an output level of ap over

the period of perturbation,

(24)

2-10

and from this equations

hah-

_{ap[U'U)-lU'l,}

where the second term represents an error in the estimated dynamic

characteristics.

It has been suggested12 _that for _{p. r. b. s.,} _equation 2.2.4 _may be made independent of ap by subtracting the mean of the sequence

before _correlation. The matrix U1U will then be of rank N-1 however, _{and the solution} _{must be written} _as

U 'Uh - WE.

This will be true for all waveforms with zero mean since U11 is then zero.

As the original equation has only N degrees of freedom and

there are now 1J +1 parameters, additional information is necessary

for a complete solution. One assumption which has been made7 is

that practical systems do not respond instantaneously, implying that

hl is zero. The system equation may then be written as

Y-

-&

where

1 UN

-1

urn-2 ...

ul

1 U11 _{UN- 1...} U2

1 _ul _UN

... U3

Ta

0a

.1 uN-2

ao

h2

h3 and

IIN

0

Ul UN .

(2.2.5)

This may be solved to give

(25)

2-11

2.3 Effects _{of Noise}

2.3.1 Effects _{of noise on optimiser} _performances

The effect _{of noise in the system being optimised} is to

introduce _{an uncertainty} into _{the estimated} _{value of the system gain.}

Consider equation 2.2.5, where the true value of y is now given by

IQa+ E''ý

(2.3.1)

where P. is a vector representing the noise at the output of the

system, and the estimate of 2 is given by

.

in LT 'v

-p+ IT ºT]-1T ºe

The covariance of the parameter will be given by

E(! ýp- _{- 21lp - ýI} ý) _{° (T'T7-1T'E(c')T(T'TI-1}

If the elements of e are serially independent and of zero mean and

variance a2,

E(ce') _{° a2riu}

and covar (p) A a2 (T'T)"' 1 (2.3.2)

The uncertainty in p may be incorporated into the optimiser model by

adding noise to the estimated slope. Without loss of generality,

this _{may be represented} _{as a noise input} to the system given in

fig. _{2.1.3 and its effect} _{at the output may be determined} from the

relationship between the power spectral density _{of the noise,} _{; o(w)}

and tho power spectral density of the system output "to(w), where,

0 oýWý "C _c(w)

(26)

2-12

where G is the z transfer function _{of the}

_system

_and

T is the time between slope estimates.

For the optimiser,

G(ejwT) -

kg

,

e, Jwt- (1+kg)

and therefore,

IG12

k2g2

(1 + kg) 2+1-2

_{(1 + kg) coaw-r}

And the sampled error, Ee will beXzero except for angular frequencies in the range -T/T to n/T. Provided _{the low frequency} drifts in the system output are negligible or compensated for, the errors in

successive inputs will be independent and

Le

will be constant at

Tr2/2n, _-t/t <w< n/T, where r2 is the variance of the slope estimate.

Therefore,

Tr2k2g2/2, _r

it IT

<w<

+ kg) 2+1-2 (I +) _COCWT TT

O iw)

09w> _n/T

W<- ir /

and the total power at the output will be given by

n/T

pp (w) dw a2

'J T/ T _{(Tr2k2g2/2tr)dw}

-, r/, r ₀ (1 + kg) 2+1- 2(l + kg) coot

(27)

2-13

r2lkal

0<I k9 I<2

2-l

k9 I

and the ratio of the variance of the operating point to the variance of the estimated slope for this range of loop gains is plotted in

fig. 2.3.1.

2.3.2 Application _{of Least} _Squares _{to Estimation}

In the previous section it was shown that the variance of the first derivative directly _affects _{the wander of a hill} _climbing

system and if the magnitude of this wander becomes intolerable, it will he necessary to reduce this variance. This may be done

directly _{by taking} _several _periods _{of the perturbation} _{and averaging} the outputs over these periods. The variance will then be reduced

by 1/m for _{an experiment} _{of m periods.} Alternatively _{the period} _of the perturbation may be extended over a greater time, either by

incrementing _{the sequence after} _several _samples _{of output} _{or by}

using a longer sequence of the same class. In equation 2.2.1 it was assumed that the settling time was equal to the perturbation period

but if _{the perturbation} is extended, _{the estimated} _weighting

sequence becomes longer and it is possible to assume that the last

few values of the weighting sequence _{are zero.} The equation set will then be overdeterminate and, by using an alternative estimation

technique, it is possible _{to use the spare equations} to reduce the variance _{of the estimate.}

If it is assumed that the system has settled _after k ordinntes

(28)

N V

t 0

0

N

0

i 6

v

0

.i

Z O

a 0 c

J

W 0

r

w w W

r

s

4

0 0 0 _O _O

p.

V _p

> _i _vi

d- y W

2-14

(29)

2-15

hk+i ° hk+2 - ... « hn m 0,

and the system response may be written as

xe +e

(2.3.3)

where

UN _{UN_1 ....} _{UN-k+l 11} hl

Ul _uN _.... _{UN-k+2 ý1} 121

and 0

hk

ul'J-1 _UN-k _I1

ao

0 If _{the noise} is serially independent, _{of zero mean andA}

ý

riance a2, the principle of least squares5 may be applied and

A

Vy-

and providing X'X has rank k.

Ö= aºX)-1Xºy

and covar(Ä) _{- 02[XºX]'1.}

The estimate of the step response a is given by

1J°(1

0)

eNI

and its least squares estimate is therefore

!? ' 01 [Z

where var(ä) _{° a2(1'} O1(X'X1-1

0

(2.3.4)

(30)

2-16

2.3.3 Structure

_{of the (X'X] matrix}

The structure _{of the (X'X) matrix} is given by

fX

- III

"4o

_+i4

...

ßk_1

-

4i +o

+2

.ü1

...

" +o +t _I

it! ' _I1

(2.3.6)

where u is the mean value of sequence given by

N

nQ1

and 4m is the autocorrelation function for a shift of m and

N

irr °N unum+n"

ný1

2.3.4 _Generalised Least Squares

When it is not possible _{to assume that the noise is serially}

independent, _least _{squarctbecomes} _{an inefficient} _estimator. If the auto-correlation function of the noise is known, generalised least

squares _{may be applied,} but as such results are specific to the noise _present in the experiment, the application has not been

(31)

2-17

2.3.5 _Frequency _Approach

If the effects _{of bias are excluded,} _{the system response may}

be written _as

11 -+e.

a

Applying _{the discrete} _Fourier _transform _operator" _W,

aWUh+We.

where

111

1w _w2

Fl .""

1we 1) w2 (1J-1)

and W- -27j/IV. Now

WUhas,

1

w(iV-1)

p

w

(N-1) (N-1)

(2.3.7 )

where n is the diagonal _matrix _{whose elements are the frequency} com-

ponents of the perturbation and Q is the frequency representation of

the system. An estimate of the step response ä will be given by

sa _l'h

s l'r1.

Q 9o"

the zero frequency response of the system.

(32)

2-18

so that the correponding _equation is absent in the system equations

2.3.7 _{or if any unknown bias term is prosent,} _further _information

about the system will be necessary for a complete solution. The

conditions previously _applied to supply further information _were:

hk+l ° hk+2 ° ... u h11 0

and in the frequency domain this becomes:

1 wk _... _w(N-11 k

1 wk+1

"" _a0 _(2.3.8)

1 wN-1 _... _w(N-1) (N-1)

Any information _still _available _after _replacing _{the missing equations}

may be used to improve the variance _{of the estimates.}

The equation set 2.3.8 may be cut to express a given set of k

frequency _{components in terms of the remaining} _N-k _components.

Let _Eg,

ý where g is a vector of length k

and gis is a vector _{of length} 11-k

ax*

I

where the set is to be expressed in terms of the IL* components

and let the corresponding re-arrangement of the truncated Fi matrix

be fT where

W _{(! ?} _{ET 1}

and that of n be

(33)

2-19

Then the re-arranged _equation _{2.3.8 will} be

+0

and since ET* is square,

g* ei -

Equation _{2.3.7 may then be written}

f- _ýý _a'4) _{_Ifý} I-1ý _{X44 + Wa}

lk

These equations _{may be solved} _using least _squares _{when the error}

terms _{e, and consequently} _{the transformed} _error _terms, _{are indepen-} dent _{and of zero mean.} _{As the quantities} _{in these equations} _are

complex, _{the application} _{of least} _squares is not straightforward but if _frequency _components _{are known to be absent,} _{the equations} _{may be}

used to determine how much additional information _will be required for _{a solution} _{of the equation} _system _{in the time domain.}

2.3.6 _{The Identification} _{of Multivariable} _Systems

Consider _{a linear} _{system with} _{two inputs} _{and one outrut} _where

each input is stimulated by a different _{sequence, and <V>}

respectively. The output <y> will be given by the st _{of the}

separate _outputs _{<yU + yv> and therefore,}

K° : iu + : Lv

-+ +a01 +Q

Assuming the settling _{times of the weighting} _{sequences of the two}

paths h and Q to be ku and kV respectively. _{The equation.} _{may be written,}

(34)

2-20

L- SO+e,

where 9? m [hih2 ... hku 9192 """ 9kv ao)

IYT Ij

'UN UN-1 ... UN-ku+l t VN VN-1 _. _VN-kV+i

L"ý.

I1

U1 .... UN-ku+2 (V... VN-kV+2

Applying least _squares,

o

lSýS1-isýý.

where

4U

I

_.

$uy

S'S a ±vu I ivv

1 1N

andu and jvy are the autocorrelation matrices for and <v>, and

.

acv ° vu

is the cross-correlation matrix between and <v>.

The problem of inverting the S'S matrb _{may, be simplified} by

using uncorrelated _{sequences or using the same perturbation} for both

inputs _{but phase shifting} _{the second ku +1 times with respect} _to

the first. The cross-correlation _matrix _will _{be zero for the former}

and the latter, the four matrices in the top L. U. corner of SAS will

be replaced by the auto-correlation _matrix _{of the sequence used for}

(35)

2-21

2.4 Compensation for low frequency drift

The presence of low frequency drift _{of system output} can cause errors in estimation _{and the methods} _applied for its _removal in

p. r. b. s. cross-correlation _experiments _{are reviewed} in this section. The methods are presented and extended using a matrix notation to generalise their solutions.

2.4.1 _{Use of Reference} _Phase

One of the earliest reports" of errors introduced into

p. r. b. s. correlation experiments by ramp disturbances at the output of the system showed that _{a peak to peak error} of the order of 15% occurred _{when the mean square value} of the ramp was equal to the

mean square _value _{of the system} impulse repponse. It was sub-

sequently found12 that _{when the bias} _{was removed by using} _{a sequence} of zero mean, the cross-correlation between the ramp and the sequence became zero for one particular shift, termed the reference phase.

This _result _{was formalised13} for _{the discrete} _{case and later} _proved for _all _{p. r. b. s.} 14

In matrix form,

1°

Uh + ao2, o + alL'l,

where pm In 111712'n

_...

111

and ai is the amplitude of the linear drift term.

If is of zero mean, correlating gives:

(36)

2-22

and U' Uh n U'i -a1 U'p1

where one element of U'p1 will be zero when is a p. r. b. s.

It was then shown6 that the linear drift _{could be eliminated}

completely by using two periods of the p. r. b. s., starting _{at the}

reference _{phase, as an input and correlating} _{one period} _{of the}

sequence starting _{at the reference} _{phase with successive} blocks

of output. The mth set of equations for blocks _{of N output} _values will be given by

Umh + (alai +m o) + ao, o,

where U. is * the rmth to the (m + N) th rows of

U

V U

and the ramp disturbance is represented by (a1 1+ mEo). Correlating this _equation _with _<ur>, _{the sequence} _commencing _{at the reference}

phase, the bias and drift terms will be zero and therefore,

U

lym _{°U ff,} IUh

where U is given by

ur 1 Ur2 _. ur1J 0... 0

0 _ur1 _ur(N-1) _eil ₀₀

00 _url

.. " urN

and the equation may be written:

U

(1 ' IL 'h_ _UTh.

(37)

2-23

2.4.2 Weightin Methods

Several _authorsl5' 16 have suggested weighting the correlation

function _{and adding the results} _{over several} _periods _{and their}

results have been generalised17.

Consider _{the system equation} for several periods of input,

Uý t1 ý1

h

I rx

h+a.

a

I-

J

where a is the vector of polynomial coefficients and

Q

Jn

1 nN -N+1 (nN -N+ 1) 2 (nN -N+ 1) a

1 nN-N+2 _"

1"

1 _nN _nN2 I1gro'q

q being the highest order polynomial drift of interest. Weighting these equations,

I

V Q1

Qh

ay (2.4.1)

(38)

2-24

coefficients and

V'V *a

Vl 0 0

V2

0 ₀

.

where Vn is an N square diagonal _matrix.

Applying least _{squares to equation} _2.4.1,

0

U Q1

ur uº

_....

[ul

_{uº '. ..}

_U

q2 [h

_vºv;

ý a.

I

vºv

Q1 9J

. Q1 92

(2.4.2)

and the R. H. S. may be written

1u'fvl

_{+ V2 + ... lU}

l2äß + gz 2+...

1ý

h

U' (V1Q1 + 1222 + _... 1 _{, QýV1Q1 + Q'V2Q} _{+ ... )a}

If V is chosen such that

EI LV-1.21 + V292 + _... 1°0,

then equation 2.4.2 may be cut into two independent sets of equations

for It and a respectively. The former may be written,

IU'U' ... IfElL - [U'&V1 + V2 + ... ]UI

And if V is further constrained to give

(39)

2-25

then the estimate of h will be given by the usual cross-correlation

but after the system response has been weighted by V'V.

Any noise on the system output will be transformed by the

weighting *coefficients and if the noise has been assumed to be homo-

scedastic, it will lose this property on transformation and the

application of least squares may lead to an inefficient estimate of

the system and drift parameters.

2.4.3 Ordinary Least Squares

For general polynomial drift, the system equation 2.3.1. will

have the additional term Qla and therefore over one period of the

perturbation,

Laý

where a polynomial drift up to the qth order, q^N-k, may be

eliminated. Applying least squares,

U'U U'Q1 ýl Uý

Although _{this method will} _require _considerable _off-line computation,

only one period of the perturbation is needed and the method applies

(40)

2-26

2.5 Effect _{of non-linearities} _{on identification}

2.5.1 Analysis _{for a class of non-linear} _systems

The presence of a cost function within _{a system implies} a non-

linearity _{and this} _{may introduce} _errors into _{the estimates} _{of system} dynamics _{and system} _gain. Consider _{the simplified} _{system model}

shown in fig. 2.5.1 which assumes that the cost function and the dynamics _{are separable.}

For the simplest system, the cost function _will be a quadratic

ao + alx + a2x2 ,

where x is the instantaneous input to the cost function generator.

The response of the system y(t) to a sequence with amplitude p

will be given by:

00

2J. (ti)

0

{a2Z12(t

- T) + a1Z1(t - T)

+ ao)h2(T)dT,

where

0*

Z1(t - T)

(a + pu(t -T-

z))hl(z)dz

0

and a is the operating point. This may be written,

y(t)

J00

{a2a2

+ al« + ao + `,.

_{al + 2a2a)Z2(t - 'r)}

0

+ a2Z22(t _- _r)}h2(-r)dT,

(41)

J N

w %.

jI

j

y

C

,ý

7 I- 4

r e

v

t

äý ä

3 äßi

P4 Öä

I

w a

r

W s M

0

V

I

" t

N

H M cý

(42)

2-27

22(t

_{- T) -}

_J0

Correlating _with _{the sequence, which has a clock interval} _{of X, and}

normalising with respect to the sequence amplitude p,

li

ýuy(m) _" ta2a2 _{+ lila} _{+ aoI} _-

p

00 00

+ {al + 2a2a}

f

h2(T)

f

hl(z)ýuu(MN--r - z)dzdt

00

w 00

+ pat h2(T)

j

hl(zl) J hl(z) 000

1N

Nx

I

u(na -T-

)u(na -T-

z2)u(nX - nX)dz2d 1dr

N =l

(2.5.1)

The first _{term is a constant} _{and the impulse} _response _{of the system} may be determined from the second when ýuu is known. The error due

to the non-linearity, represented by the third term, is proportional to the amplitude of the sequence and independent of the operating

point. As the optimum is approached, the second term tends to zero and the final term may become dominant.

2.5.2 Effects _{of non-linearitics} in the frequency domain

The perturbation may be considered as the sum of several sine

waves where the effect of passing through linear dynamics will be to alter the phase and amplitude relationship betwaen harmonics. If

(43)

2-28

When the overall response is correlated with the original waveform,

the linear _{path will} be identified but additional terms will appear

at particular frequencies due to the intermodulation introduced by

the non-linearity. If the frequency components of the perturbation

sequence can be chosen so that these intermodulation frequencies do not

exist in the original sequence, there will be no correlation between these _additional terms _{and the sequence} and the error will be

(44)

2-29

2.6 _Effect _{of hi¬h frequency} _{perturbations}

The estimates of system dynamic response and gain have been

obtained in previous sections assuming the settling time was known

and well-defined. However, with an optimising system which contin-

uously changes its _operating point, it is possible that the system dynamics _will _{change to give} _{a longer} _settling _time. The assumption

will not then be valid and errors may be introduced into the estimates.

Consider _{a sequence of N samples perturbing} _{a system which}

settles in q intervals but is assumed to settle in k intervals,

where k is less than q. The true system output vector, in the

absence of noise, will be given by:

!/ 32

UN UN_ 1 ... UN-k+ 1I UN-k ... UN_q+ 1h1

ul UN _... uN-k+2

I UN

k+l

lie

hk

hk.

t

_UN-1

uN-k

uN-k-1

hq

Using the notation of section 2.3.2,

UN-k

UN-k+i

XO+

1

UV-k-

UN q+1

hk+l

hk+2

1'Q

+ ao?

(45)

2-30

Lt ° !? '

0)

ix'X)-1X'X 0

UN_k ...

uN-q

hk+ý

+ ill

01 ! X'X1-lX'

UN-k-i

hq

where the first term represents the true system gain and the second

term is the error.

If q is an integral _{number of perturbation} periods r, the error

term may be written as

E (? ' 2) fX'xi-lX'[L F1

D

where

UN-k _.. ul

2

UN-k u2

UN U

yp

u

uN-k-i UN

and the nth elements of E1 and are given by

r-1 r-1

m10 k+n, J+n and mII

htvV+n,

(46)

2-31

C. 1'ßz + t? '

£1 tXXl~Ixtu

Ei

where the first term is independent of the waveform and X'U may be

further _simplified _to

4k

4k+l

4k+2

ýN-1

N

.

42 _"

4'1

-

ý2

- -- ----

4N-k

---

- - --

(47)

2-32

2.7 Application _{of Particular} Waveforms

Careful _{design of an optimiser} _{should lead to a system where}

the perturbation is of a sufficiently low frequency to eliminate

errors caused by too short a perturbation period. If drift or bias is known to be present at the system outFA, it may be compensated for,

thus removing these sources of error. The effects _{of noise} and non- linearities _cannot _generally be eliminated _{and each class} _{of wave-}

form must be examined to determine the magnitude of the error.

2.7.1 Application _using _{pseudo-random} binary _sequences

11

For _{a p. r. b. s. with} _unit _clock _rate _{and unit} height,

ü=N _{and gym= 1,} _m=0

a- n] 0.

Therefore, _substituting in equation 2.3.6,

(N+ 1)<-< I1

XIX - _---F-

1'

and using the results of Al. 3,

(N - k)+ ý -1

IIT

(N + 1)(N - k) ---1---

-1' _{, N+1-k}

(2.7.1)

The estimates of the step response from equation 2.3.4 and the

(48)

2-33

NNkk

(N + T) (N - k) _n=1 Yn

pý1

un-p -N

and var(0)

NkQ2

(N+1)(N-k)

The effects of the ratio of settling time to perturbation

period on the variance of the estimate for a p. r. b. s. as shown in

figs. 2.7.1 _{and 2.7.2.}

Clarke9 has shown that _{the impulse} _response _obtained _{by this}

least _squares _estimator is equivalent _{to correlating} _{the response} _of the system with the perturbation, and estimating the bias on the

impulse _response _{by averaging} _{over the last} N-k _ordinates. _The weighting sequence is assumed to be zero after the system settling

time of k elements.

2.7.2 P. r. b. s. in non-linear _systems

When the estimation technique developed _{in the last} _section is

applied to the correlation obtained in equation 2.5.1, the estimate of the system gain will also contain an error term due to non-linear effects which in general will be non-zero. An optimiser using this sequence: will therefore settle off the optimum where the error term

is equal but of opposite sign to the system gain. If hl is a pure time delay, the error term is a constant _{and may be removed by test-} ing it _like-the bias _term. However, _{the effects} _{of the non-linearity}

may be removed completely by performing several different cxpcriments'A.

The effect of a quadratic non-linearity _of _{a system composed of two cascad-}

(49)

J tu 1111 ýcý`ý'ýr aýT, ýtMy

`Hinf _i~ry

T1049 _TV

1 0.4

IiiIII

_°5

I

1 as I1I1

IIIIIo.

_z

O

ýOO

SI,

M1Mtt LLNoil84 $00

Fig. 2.7.1 _{- Effect} _{of p. r. b. s. length} on noise sensitivity of gain estimator

N., ta

tiavb

Axyl.. _{" %tt«,}

º+o. Tor%=

'to 0.4ouli94te U$4%tH

Fig. 2.7.2 - Effect of settling time on noise sensitivity of

gain estimator using long p. r. b. s.

[image:49.2482.187.2338.80.3627.2]

(50)

2-35 0 0 ý, . r" ý'ýr. . ". ý'.. :; t,. U v 01% .o r. ca v a)

41 u

1v

. r4 N

1J O

to W

Ci N

U) 4

$

O f4

aa

v, w

a

pb ₀ dC _N

H _I`.

.D

O w O

ro >% u

41 ao

"P4 I. ý C,

", 4

G i4

oH

N

u _G7

"r4 t-1

1I ll

cd u

1.1 _N

'b

td _r4

Cl

au Cl

N w 0)

oa

JJ d

u _.o

ni u

w

wü

W tu

M _C)

" tj

f-. O

(51)

2-36

2.7.3 Impulsive _{autocorrelation} _function _with _{added bites}

Consider _{a sequence <a> formed by adding a constant} _{0 to a}

sequence with zero mean and impulsive _{autocorrelation} function.

For unit power, the autocorrelation _matrix _{for will} _{be given by}

(NIN - ZN). (N - Z)

Now _ýCC"ýuu+R2

and C°ß

therefore _using _{the result} _{of 2.3.6,}

(N

1

₁₎

Nik+

!

i>: 2::

1

N[--

BY 1(1

Inverting _using A1.3,

(N - k)lk +y1- Nß1

N-1

lxs 1-1 ---T--... ---- ---

N2(N - k) - Nß1' 1J

IN

+ ((N - 1)ß2 - 1)kl

I (N-1) J

1

and applying equation 2.3.5,

var(ä) (N - 1)ka2

N(N - k)

If <0> is of unit power, this becomes,

(N - 1)ka2

var(a) n

N(N - k) (1 - ß2) (2.7.2)

Fig. 2.7.4 _{shows the effect} _{on the variance} _of L for a range of

settling times, _{and sequence} lengths. _{Note that} _{the result} 2.7.2

[image:51.2506.266.2428.355.3226.2]

(52)

lo

t _rý

A

6,

t" .1t

2.37

7

_e

le A a( O

is t+ 0 u . ý3 U4 0

o 4J

r ";

N N C, v) . 1-4 0 94 C) x 41 a 0 0 to

N O

", A 14 ü. a 0 cl u d a U4 0

0u

v w U4 W "7 rz N . Qo .,. 4 to

0 N

0 ₀

(53)

2.38

2.7.4 _{Note on Rotation} Symmetry

Rotation _{symmetry of a waveform occurs when alternate} _half

cycles are identical in shape but reversed in sign. Signals

possessing this property have zero mean and contain only odd harmonics.

When the test _signal _{has rotation} _symmetry, _{the matrix} _{X'X will}

have the slightly simpler form:

Uff, TUT I0 X'X d ----l-

0t (N (2.7.3)

where UT is the V matrix truncated to the first k columns. The

inverse _{of the matrix} 2.7.3 may be found by inverting _{the upper and}

lower _square _matrices independently _{and therefore,}

1N

a0

/"1

yn,

a

and Var(ap) - N Q2

for-all _{sequences possessing} _rotational _symmetry. _{The matrix YZ'YX}

will be of rank N/2 and therefore the settling time of the system

must be less than half the perturbation period to obtain a solution.

Two waveform, will be uncorrelatcd _{when they have no common}

frequency _{components, and therefore} _{a sequence with rotation} _symmetry,

and consequently no even harmonics, will be uncorrelated with any

waveform whose fundamental is twice its own. If the second waveform

also has rotation symmetry, the process may be extended to produce

(54)

2-39

tification _{of multivariable} systems (2.3.6)l but ench time the fundn- mental frequency is doubled, the maximum Prttling tim(. which m. _{iy be}

identified is halved _{and experimental} _times _{may become exces!: ive,}

if _{the time constants} _{of the system are of the came order.} _1cthods for _generLiting _sequences _with _rotation _symmetry, impulsive _auto-

correlation functions anä doubling of the fundamental frequency have been given by Briggs and Codfrey19.

For the simple non-linear system described in section 2.5.1,

the error term may be eliminated by using a sequence with rotation

symmetry, as the odd harmonies will produce _{sums and differences} of frequencies _{at the even harmonic} frequencies _{when passed through} _an even order non-linearity, so that none of the intermo-lulation will

correlate with the original sequence.

2.7.5 Three level _m-sequences

A three level m-sequence possesses rotation _{symmetry and with}

levels _{±1 and 0, a power of 2(11 + 1)13..} Therefore

2(N3+ 1

Ik j0 X'X*

-- ---Or ---I

N

where k <C N/2 for a practical solution.

3j

2( N+1 ) '. 1k 0

Of I

I NJ

(55)

2-40

3Nk a

mý1

Jm

ßp1

um-rt 2(N + 1)

and

var(a)

_To

3ka2

or for a sequence with unit power,

var(c) - k°2 _N

Squaring a three level m-sequence gives a two level sequence

of period N/2 and an impulsive autocorrelation function. The

analysis for this waveform is covered in section 2.3.5 where

N3=

when the levels are made ± 1.

2.7.6 Square Wave Perturbations

The N-square circulsnt matrix U for 3 square wave perturbation

with levels ±1 is given by:

-1

11....

111 _{-1 ... -1 -1}

U

-1

1 _...

1

1 _...

-1

.

-1 -1 -1 ... -1 -1 1 1 _... 1 1

1 -1 -1 ... -1 -1 -1 1

... 1 1

1 1 1

... 1 -1 -1 -1 ... -1 1

1 1 1

... 1 _.1 -1 -1 ... -1 -1_

and since the square wave possesses rotation symmetry, it may be

(56)

2.41

N-4 N-8 _N""4(k-1)

N N

N-4

N-8 'U

, N-4(k-1) .

N

N-4

.. 11-4 N

and using the result of A14 and equations 2.3.4 and 2.3.5

N N2

81_ (1J _{- 2k + 2)}

N

bn

h=ZLk _n-k

02 and var (s) - (N

- 2k + 2)

which is plotted in fig. 2.7.5 for a range of sequence lengths and settling times.

2.7.7 Sine Wave Perturbations

The N-square circulant matrix U for a sine wave with unit

amplitude is given by:

Ro1

JI

W -W 1 w2-W 2"""" 4f'-W lý

0v w 1J w -6-1 w(lý-1)-W-(N-1)

w2-w 2 _wV-w-N _W-W..l

2nß

[image:56.2537.506.1919.191.869.2]

(57)

s

i .r

M Lº _rý

rEQ to ,nrp 0 4 (1

0

S jJ 8

Z. n a

(58)

2-43

U 'UT .U

00 +1 02

_0"

4'k_1

+1 40

" 42

"

0

Ok_l

ýi

" ýl 00

where 4m, the autocorreletion function for the sequence for shift m, is defined by:

)

N1 _ýw _{in+n

nJ (m+n)

n=1

J

(c, _{+ w-m)} 4

The matrix F LIT is of rank 2 since the sum of the pth and p- + 2th

row equals (w + cä 1) times the P+ 1th row. Solving for kb 2,

42 'iw+w 1)

lu lE _Zr, _mNiw-w' ₎

2

_(i j l)

Using the result of equations 2.3.4 and 2.3.5,

2N

n 8=

---- ý sin(2n - 1)S Yn

N con- n-i

202

and var(ä) - _tt

N cos y

and for unit power,

Q2 var(c)

N cos2N

[image:58.2506.480.1596.235.883.2]

(59)

2-44

9 ₀

r"

r;

_z

t

to 1W

äFN

00

.o

n

w

O

.4 U . 1-4

N

C, U) 0

93

.Q

14 .9

a

0

O O O.

ici

d ci

OW

I-ý 0

41

u d w w

W

n N