Analysis of a cross section of time series using structural time series models


ANALYSIS OF A CROSS-SECTION OF TIME SERIES USING STRUCTURAL TIME SERIES MODELS

by

Pablo Marshall Rivera

London School of Economics and Political Science

1990


ABSTRACT

This study deals with multivariate structural time series models and, in particular, with the analysis and modelling of cross-sections of time series. In this context, no cause and effect relationships are assumed between the time series, although they are subject to the same overall environment.

The main motivations in the analysis of cross-sections of time series are (i) the gains in efficiency in the estimation of the irregular, trend and seasonal components; and (ii) the analysis of models with common effects.

The study contains essentially two parts. The first one considers models with a general specification for the correlation of the irregular, trend and seasonal components across the time series. Four structural time series models are presented, and the estimation of the components of the time series, as well as the estimation of the parameters which define these components, is discussed.

The second part of the study deals with dynamic error components models where the irregular, trend and seasonal components are generated by common, as well as individual, effects. The extension to models for multivariate observations of cross-sections is also considered.

Several applications of the methods studied are presented.


LIST OF TABLES

2.3.1: Coefficients of the Eigenvalues of the Powers of F to Form Xc, t-p

4.3.1: Estimates of Price Effects

4.3.2: Slopes of the Biases in Technical Progress

4.3.3: Error Structure of the Models

4.3.4: Test of Hypotheses

4.3.5: Demand Elasticities

4.4.1: Estimates in Total Quantity of Energy Models

4.4.2: Demand Forecasts for 1987

4.4.3: Absolute Value of Prediction Errors for 1987

5.4.1: Eigenvalues and Eigenvectors of $\Sigma_\varepsilon$ and $\Sigma_\eta$

6.2.1: Parameter Values in the Evaluation of the Efficiency

6.2.2 to 6.2.11: Relative Efficiency of Estimators Using a Single Time Series Compared to Estimators Using n Time Series

6.4.1: Estimation Results for Univariate Models

6.4.2: Estimation Results for the Multivariate Model

7.2.1: Parameter Values in the Evaluation of the Efficiency

7.2.2 to 7.2.11: Relative Efficiency of Estimators Using a Single Time Series Compared to Estimators Using n Time Series

7.4.1: Ordinary Least Squares Regressions

7.4.2: Estimation Results for Univariate Models

7.4.3: Estimation Results for the Multivariate Model

8.3.1: Number of Parameters in Variance Covariance Matrices

8.3.2: Number of Parameters in Variance Covariance Matrices in Model Type II

8.4.1: Estimates of Price Effects

8.4.2: Slopes of the Biases in Technical Progress

8.4.3: Error Structure of the Model

8.4.4: Demand Elasticities


LIST OF FIGURES

4.3.1: Biases in Technical Progress: Other Industry Sector

4.3.2: Biases in Technical Progress: Domestic Sector

4.3.3: Biases in Technical Progress: Other Final Users Sector

4.3.4: Biases in Technical Progress: Transport Sector

4.4.1: Price Index of Energy (1971 Q1 = 100)

6.4.1: Labour Cost Time Series (in logarithms)


ACKNOWLEDGMENTS

First of all I wish to express my gratitude to my supervisor, Professor Andrew C. Harvey, for his constant support and guidance at all stages of this work.

I am grateful to the participants at the Econometrics Workshop and the members of the Statistics Department at the London School of Economics for their comments and suggestions.

I am also grateful to Guido del Pino, Osvaldo Ferreiro, German Rodriguez and Alvaro Vial, who encouraged me to undertake postgraduate studies.

Financial contributions by the British Council, the Pontificia Universidad Catolica de Chile and the Instituto Nacional de Estadisticas de Chile are acknowledged. A teaching assistantship at the London School of Economics is also acknowledged.


INTRODUCTION

It has long been recognised that many economic time series can be decomposed as a sum of trend, seasonal and irregular components,

(1) $y_t = \mu_t + \gamma_t + \varepsilon_t, \qquad t = 1, \dots, T,$

where $y_t$ denotes, perhaps after some transformation, the t-th observation and $\mu_t$, $\gamma_t$ and $\varepsilon_t$ are the trend, seasonal and irregular components.

Structural time series modelling deals with the specification and estimation of these components, and of the parameters which define them. The historical development of structural time series models and a review of the specifications usually considered in the literature can be found in Harvey and Durbin (1986) and Harvey (1990). In the basic structural model, the trend and seasonal components are defined by

(2) $\mu_t = \mu_{t-1} + \beta_{t-1} + \eta_t,$

(3) $\beta_t = \beta_{t-1} + \delta_t,$

(4) $\gamma_t = -\gamma_{t-1} - \gamma_{t-2} - \dots - \gamma_{t-s+1} + \omega_t,$

where $\mu_t$ is the level of the trend at time t, $\beta_t$ is its slope, $\gamma_t$ is the seasonal component at time t, s is the seasonal period, and $\eta_t$, $\delta_t$, $\omega_t$, and the irregular component $\varepsilon_t$ in (1), are random shocks assumed to be mutually and serially uncorrelated, with expected values equal to zero and variances $\sigma^2_\eta$, $\sigma^2_\delta$, $\sigma^2_\omega$ and $\sigma^2_\varepsilon$ respectively. Thus, the trend and seasonal effects in the time series are assumed to be random variables changing over time. If the variances of the random shocks $\eta_t$, $\delta_t$ and $\omega_t$ are equal to zero, model (1) collapses to a deterministic linear trend [...]


[...] greater than zero gives flexibility to the trend and seasonal components to evolve over time. Model (1)-(4) can be written in state space form and, as a result, the Kalman filter algorithm can be used to estimate the unobserved components. The variances of the random shocks, usually called hyperparameters, can be estimated by maximum likelihood. Although the above specification may be extended to include cyclical components as in Harvey (1985a), or general ARMA structures as defined by Box and Jenkins (1976), there is substantial empirical evidence supporting specifications like the one presented above, or particular cases of it, for many economic time series; see Harvey and Todd (1983), Kitagawa and Gersch (1984), Harvey and Durbin (1986), Fernandez-Macho (1986), and Fernandez-Macho and Harvey (1989).
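As an illustration of the univariate specification (1)-(4), the following minimal sketch simulates a basic structural model. The function name `simulate_bsm`, its arguments and the starting values are hypothetical choices made for this example and are not part of the original study; setting all the standard deviations to zero reproduces the deterministic trend and fixed seasonal pattern mentioned above.

```python
import numpy as np

def simulate_bsm(T, s, sd_eps, sd_eta, sd_delta, sd_omega,
                 mu0=0.0, beta0=0.0, gamma0=None, seed=0):
    """Simulate y_t = mu_t + gamma_t + eps_t following equations (1)-(4)."""
    rng = np.random.default_rng(seed)
    # `past` holds the s-1 most recent seasonal effects, newest first.
    past = np.zeros(s - 1) if gamma0 is None else np.asarray(gamma0, dtype=float).copy()
    mu, beta = mu0, beta0
    y, level, seas = np.empty(T), np.empty(T), np.empty(T)
    for t in range(T):
        mu = mu + beta + sd_eta * rng.standard_normal()          # (2), uses beta_{t-1}
        beta = beta + sd_delta * rng.standard_normal()           # (3)
        gamma = -past.sum() + sd_omega * rng.standard_normal()   # (4)
        past = np.concatenate(([gamma], past[:-1]))
        level[t], seas[t] = mu, gamma
        y[t] = mu + gamma + sd_eps * rng.standard_normal()       # (1)
    return y, level, seas

# Stochastic example with quarterly seasonality (s = 4); values are illustrative.
y, level, seas = simulate_bsm(T=40, s=4, sd_eps=1.0, sd_eta=0.5,
                              sd_delta=0.1, sd_omega=0.2, seed=3)
```

With non-zero variances the level, slope and seasonal pattern drift over time, which is the flexibility referred to in the text.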

This study is concerned with the construction of structural time series models for cross-sections of time series. In this context, an n-dimensional multivariate version of model (1)-(4) can be naturally constructed. It is assumed that there are no cause and effect relationships between the n time series. However, as these time series are subject to the same overall environment, the multivariate white noise random shocks which drive the irregular, trend and seasonal components, $\varepsilon_t$, $\eta_t$, $\delta_t$ and $\omega_t$, have variance covariance matrices $\Sigma_\varepsilon$, $\Sigma_\eta$, $\Sigma_\delta$ and $\Sigma_\omega$ respectively.

The two main motivations in the study of multivariate models for cross-sections of time series are (i) the gains in efficiency in the estimation of the trend and seasonal components, and (ii) the analysis of models with common trend and seasonal effects.

Although, as mentioned earlier, the specification (1)-(4) can be extended straightforwardly to a multivariate model, the estimation and the analysis of a model with more than four or five time series becomes [...]


[...] formulation and estimation of multivariate models is the specification and testing of hypotheses which reduce the dimensionality of the problem and simplify the analysis and the interpretation. Related to these ideas is the development of simple estimation procedures. That is also an important objective in the study of multivariate models.

Previous studies on multivariate structural time series models include Jones (1966), Enns et al. (1982), Harvey (1985b), Fernandez-Macho (1986), Fernandez-Macho, Harvey and Stock (1987), Fernandez-Macho and Harvey (1989), Fernandez-Macho (1989), and Harvey (1990). Harvey (1990) and Fernandez-Macho (1986) are excellent reviews.

Jones (1966) considered a multivariate smoothing model which corresponds to the multivariate version of (1)-(4) but with no slope or seasonal terms. Enns et al. (1982) studied the same model as Jones and assumed that the two variance covariance matrices in the model, $\Sigma_\varepsilon$ and $\Sigma_\eta$, were proportional; this is called the homogeneity hypothesis. They proposed a method to estimate the parameters of the model by maximum likelihood and showed how estimates of the trends could be computed by the Kalman filter. Harvey (1985b) extended the multivariate exponential smoothing model to include slope and seasonal components as in (1)-(4), studied the estimation of the unobserved components by the Kalman filter, and formed the likelihood function by using the Kalman filter and the prediction error decomposition. Fernandez-Macho (1986) and Fernandez-Macho (1989) developed the frequency domain maximum likelihood estimation of the parameters in the multivariate version of (1)-(4), and in particular cases of it. Fernandez-Macho and Harvey (1989) developed a test for the homogeneity hypothesis used by Enns et al. (1982). Finally, Fernandez-Macho, Harvey and Stock (1987) and also Fernandez-Macho (1986) studied dynamic factor analysis models.


[...] which includes the first four chapters, is concerned with the multivariate structural time series models defined above. The main contributions here are (i) a more detailed study of the model with exogenous variables, as is usually the case in analysing economic time series; (ii) a new expression for the likelihood function; and (iii) an econometric application where the unobserved trend components have an economic interpretation. The second part of the study includes chapters 5 to 8 and is concerned with the formulation and the estimation of dynamic error components models. Although this kind of model has received significant attention in the econometric literature, the approach developed here is new.

The following lines present a detailed account of the content of this study, with special reference to the new material.

Chapter 1 defines a multivariate regression model where the residuals follow a structural time series model. Four time series models, and their basic statistical properties, are presented in Section 1.2. Section 1.3 presents the standard Kalman filter and Section 1.4 the diffuse Kalman filter. These two filters are alternative algorithms to obtain estimates of the unobserved components in the model. They differ in the way they deal with the initial estimates used to start the recursions of the filter. The main contributions of this chapter are the development of formulas to start the standard Kalman filter in Section 1.3, and the simple and direct derivation of the diffuse Kalman filter in Section 1.4.

In Chapter 2, Section 2.1 presents three asymptotically equivalent [...]


[...] alternative expression for the likelihood function which is simple and has some advantages over the more standard approaches. Section 2.4 compares the new expression with the frequency domain likelihood.

Chapter 3 deals with the maximum likelihood estimation of the parameters in the regression model defined in Chapter 1. Section 3.2 considers the estimation of the vector of coefficients of the exogenous variables, and Section 3.3 the estimation of the parameters in the variance covariance matrices of the random shocks. Section 3.4 analyses the estimation strategy and Section 3.5 presents results for the asymptotic behaviour of the estimators. Most of the results in this chapter have already been developed in the literature, although the results in Section 3.2 are considerably more general than the ones found in previous studies.

Chapter 4 presents an econometric study of the demand for energy in the U.K. for the period 1971-1986. The study illustrates the techniques and procedures of the first three chapters. Using a translog cost equation, Section 4.2 presents an econometric model where the technical progress takes the factor augmenting form. Assuming that these factors follow stochastic trends, the reduced form of the model has the form defined in Chapter 1. Section 4.3 estimates the model separately for each of four economic sectors (other industry, domestic, other final users, and transport), which use mainly four fuels: gas, electricity, oil and coal. Section 4.4 obtains forecasts of the individual demands.

Chapter 5 formulates dynamic error components models. In Section 5.1 the standard specification of static error components models considered in the literature is extended to dynamic models. Two kinds of models are defined depending on whether the time series share the same [...]


[...] concept of cointegration. Basic statistical properties of the models are presented in Section 5.2, while Sections 5.3 and 5.4 generalise the specifications to multivariate observations and factor analysis models respectively. The contents of this chapter, as well as of those which follow, constitute a completely new approach to dynamic error components models.

Chapter 6 studies the estimation of error components models of type I, where the time series are not assumed to share the same trend and seasonal components. Chapter 7 deals with the estimation of error components models of type II, where the time series have the same trend and seasonal components. Finally, Chapter 8 presents results for the estimation of multivariate error components models.

Some final comments with respect to the presentation of the material follow. Equations, tables and figures are numbered according to the section. The chapter number is omitted except when referring to an equation in another chapter. Finally, the presentation makes extensive use of definitions and results in matrix algebra. A good reference [...]


CHAPTER 1: MULTIVARIATE STRUCTURAL TIME SERIES MODELS

1.1 Introduction

This chapter defines a multivariate regression model with stochastic trend and seasonal components, reviews basic statistical properties of multivariate structural time series models, and presents the Kalman filter algorithm for the estimation of the unobserved components. For most of the definitions and results stated, Fernandez-Macho (1986) and Harvey (1990) are good references.

The multivariate regression model with stochastic trend and seasonal components is defined as

(1.1a) $y_t = B z_t + \alpha_t, \qquad t = 1, \dots, T,$

where $y_t$ is an (n x 1) vector of observations, $z_t$ is an (r x 1) vector of exogenous variables, B is an (n x r) matrix of fixed parameters which satisfy the restrictions

(1.1b) $\mathrm{vec}(B) = S \beta,$

and $\alpha_t$ is a residual component which follows a structural time series model. In (1.1b), S is a known (nr x k) selection matrix and $\beta$ is a (k x 1) vector representing the functionally independent parameters in B. Alternatively, the model can be written as

(1.2) $y_t = X_t \beta + \alpha_t,$

with $X_t$ the (n x k) matrix $(z_t' \otimes I_n) S$, where $\otimes$ represents the Kronecker product and $I_n$ is the identity matrix of order n. For the residuals $\alpha_t$, [...]


[...] model. Although detailed definitions are presented in Section 1.2, all the mentioned models for the vector $\alpha_t$ can be written in the state space form

(1.3a) $\alpha_t = (Z \otimes I_n)\, \theta_t + \varepsilon_t, \qquad t = 1, \dots, T,$

(1.3b) $\theta_t = (T \otimes I_n)\, \theta_{t-1} + (R \otimes I_n)\, \kappa_t,$

where Z, T and R are time invariant and known matrices of dimension (1 x p), (p x p) and (p x u) respectively, $\theta_t$ is the state vector which contains the trend and seasonal components, and $\varepsilon_t$ and $\kappa_t$ are (n x 1) and (nu x 1) dimensional random shocks assumed to be serially and mutually uncorrelated, normally distributed, with expected values equal to zero and variance covariance matrices $\Sigma_\varepsilon$ and $\Sigma_\kappa$ respectively. The random shock $\kappa_t$ has u subcomponents of dimension (n x 1) each, which correspond to the random shocks of the trend and seasonal effects. These u subcomponents are also assumed to be mutually uncorrelated, and the matrix $\Sigma_\kappa$ is then block diagonal. In the state space representation (1.3), (1.3a) is called the measurement equation and (1.3b) the transition equation.

Model (1.3) is said to be homogeneous if

$\Sigma_\kappa = D_\kappa \otimes \Sigma_\varepsilon,$

where $D_\kappa$ is a diagonal matrix of dimension u. Thus, the model is homogeneous if the variance covariance matrix of the irregular random shock $\varepsilon_t$ and the variance covariance matrices of the u subcomponents in the vector $\kappa_t$ are proportional.
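The homogeneity restriction can be illustrated numerically. The following sketch, with purely illustrative values for the irregular covariance matrix and for the diagonal matrix of ratios (both are assumptions made for this example), builds the variance covariance matrix of the stacked shocks by a Kronecker product and extracts its (n x n) blocks.

```python
import numpy as np

n, u = 2, 2
Sigma_eps = np.array([[2.0, 0.5],
                      [0.5, 1.0]])           # irregular covariance, illustrative values
D_kappa = np.diag([0.3, 0.1])                # one variance ratio per subcomponent, illustrative

# Homogeneous model: covariance of kappa_t is D_kappa (Kronecker) Sigma_eps, of order nu.
Sigma_kappa = np.kron(D_kappa, Sigma_eps)

# Split into its u x u grid of (n x n) blocks.
blocks = [[Sigma_kappa[i * n:(i + 1) * n, j * n:(j + 1) * n] for j in range(u)]
          for i in range(u)]
```

Each diagonal block is a scalar multiple of the irregular covariance matrix and the off-diagonal blocks are zero, so the matrix is block diagonal as required.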

Apart from this introduction, the chapter contains three more sections and an appendix. Section 1.2 defines the four time series


[...] the state space model (1.3), with some additional results concerning (i) the initialisation of the algorithm, (ii) the Kalman filter for a homogeneous model, and (iii) the treatment of the vector of coefficients $\beta$, are presented in Section 1.3. Section 1.4 deals with the diffuse Kalman filter, which extends the standard filter to handle state space models with very general diffuse or semi diffuse initial [...]


1.2 Definitions and Basic Statistical Properties of the Models

This section defines four structural time series models for the (n x 1) vector of residuals $\alpha_t$ defined in Section 1.1. The models are: local level, local linear trend, seasonal local level, and basic structural model. Basic statistical properties are also presented. Specific analyses for each model are considered in what follows.

Local Level Model

The local level model is defined as

(2.1a) $\alpha_t = \mu_t + \varepsilon_t,$

(2.1b) $\mu_t = \mu_{t-1} + \eta_t,$

where $\mu_t$ is an (n x 1) vector representing the level or trend of $\alpha_t$, and $\varepsilon_t$ and $\eta_t$ are (n x 1) random shocks assumed to be normally distributed, serially and mutually uncorrelated, with expected values equal to zero and variance covariance matrices $\Sigma_\varepsilon$ and $\Sigma_\eta$ respectively.

The model is in the state space form (1.3) with p = u = 1, and $\kappa_t = \eta_t$. Model (2.1) is not stationary given the presence of a random walk trend; however, the stationary form of the model is obtained by taking first differences in (2.1a). That gives

(2.2) $\nu_t \equiv (1-L)\, \alpha_t = \eta_t + (1-L)\, \varepsilon_t,$

where L is the lag operator. From this expression, the autocovariance function of $\nu_t$ is given by


(2.3b) $\Gamma(\pm 1) = -\Sigma_\varepsilon,$

and

(2.3c) $\Gamma(\pm k) = 0, \quad \text{if } k > 1.$

It then follows that the differences of $\alpha_t$ can be written as a restricted MA(1) model, which is called the reduced form of the local level model. The autocovariance generating function evaluated at $e^{-i\lambda}$, which is equal to $2\pi$ times the spectral density evaluated at the frequency $\lambda$, is given by

(2.4) $\Sigma_\eta + (2 - 2\cos\lambda)\, \Sigma_\varepsilon, \qquad -\pi \le \lambda \le \pi.$

Thus, the spectral density is real and the model is strictly invertible if $\Sigma_\eta$ is positive definite.
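The expression (2.4) can be checked numerically by summing the autocovariances of the differenced series. In the sketch below the function name `agf_local_level` and the covariance values are assumptions made for this example; the lag zero autocovariance is taken to be the sum of the level covariance and twice the irregular covariance, which is what (2.2) implies.

```python
import numpy as np

def agf_local_level(lam, Sigma_eta, Sigma_eps):
    """Sum of Gamma(k) * exp(-i * lam * k) for nu_t = eta_t + (1 - L) eps_t."""
    gamma0 = Sigma_eta + 2.0 * Sigma_eps      # lag 0, implied by (2.2)
    gamma1 = -Sigma_eps                       # lags +1 and -1, as in (2.3b)
    return gamma0 + gamma1 * (np.exp(-1j * lam) + np.exp(1j * lam))

Sigma_eps = np.array([[1.0, 0.3], [0.3, 2.0]])    # illustrative values
Sigma_eta = np.array([[0.5, 0.1], [0.1, 0.4]])

# At frequency zero only the level covariance remains, so its smallest
# eigenvalue decides whether the generating function is positive definite there.
min_eig_at_zero = np.linalg.eigvalsh(agf_local_level(0.0, Sigma_eta, Sigma_eps).real).min()
```

At frequency zero the irregular term vanishes, which is why positive definiteness of the level covariance matrix is the condition stated for strict invertibility.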

The local level model can be extended to include a fixed slope in the transition equation (2.1b). In that situation,

(2.5) $\mu_t = \mu_{t-1} + \beta + \eta_t,$

and, by repeated substitutions, the trend $\mu_t$ can be expressed as

(2.6) $\mu_t = \mu^*_t + t\, \beta,$

with $\mu^*_t$ satisfying (2.1b). Replacing (2.6) in (2.1a) shows that the model with a fixed slope $\beta$ has the standard form (2.1) with t as an exogenous variable and $\beta$ as its coefficient. Fixed seasonal components can also be included in the model as exogenous variables in the measurement equation (2.1a).

Local Linear Trend Model


(2.7a) $\alpha_t = \mu_t + \varepsilon_t, \qquad t = 1, \dots, T,$

(2.7b) $\mu_t = \mu_{t-1} + \beta_{t-1} + \eta_t,$

(2.7c) $\beta_t = \beta_{t-1} + \delta_t,$

where the (n x 1) vector $\mu_t$ is the trend of $\alpha_t$ and the (n x 1) vector $\beta_t$ is its slope. The (n x 1) dimensional random shocks $\varepsilon_t$, $\eta_t$ and $\delta_t$ are assumed to be normally distributed, serially and mutually uncorrelated, with expected values equal to zero and variance covariance matrices $\Sigma_\varepsilon$, $\Sigma_\eta$ and $\Sigma_\delta$ respectively.

The model is in the state space form (1.3), with p = u = 2, and with

(2.8a) $Z = \begin{bmatrix} 1 & 0 \end{bmatrix},$

(2.8b) $T = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix},$

(2.8c) $R = I_2,$

(2.8d) $\kappa_t' = (\eta_t', \delta_t'),$

and

(2.8e) $\theta_t' = (\mu_t', \beta_t').$

That is, Z is a (1 x 2) matrix, T and R are (2 x 2) matrices, and $\kappa_t$ and $\theta_t$ are (2n x 1) vectors. Notice that in the state vector $\theta_t$, the first n elements correspond to the levels or trends $\mu_t$ while the last n elements correspond to the slopes $\beta_t$. That is the way the components [...]


[...] taking second differences in (2.7a). That is,

(2.9) $\nu_t \equiv (1-L)^2 \alpha_t = \delta_{t-1} + (1-L)\, \eta_t + (1-L)^2 \varepsilon_t, \qquad t = 3, \dots, T.$

From this, the autocovariance function of $\nu_t$ is given by

(2.10a) $\Gamma(0) = \Sigma_\delta + 2\, \Sigma_\eta + 6\, \Sigma_\varepsilon,$

(2.10b) $\Gamma(\pm 1) = -\Sigma_\eta - 4\, \Sigma_\varepsilon,$

(2.10c) $\Gamma(\pm 2) = \Sigma_\varepsilon,$

and

(2.10d) $\Gamma(\pm k) = 0, \quad \text{if } k > 2.$

It follows that the reduced form of the local linear trend model is a restricted MA(2). The autocovariance generating function evaluated at $e^{-i\lambda}$ is given by

(2.11) $\Sigma_\delta + (2 - 2\cos\lambda)\, \Sigma_\eta + (2 - 2\cos\lambda)^2\, \Sigma_\varepsilon,$

where $-\pi \le \lambda \le \pi$. The local linear trend model is strictly invertible if the matrix $\Sigma_\delta$ is positive definite.

Notice that when $\Sigma_\delta$ is equal to zero and the component $\beta_t$ at time zero is defined as a fixed parameter, the local linear trend model reduces to the local level model with a fixed slope in the trend $\mu_t$; see equation (2.5).
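The state space form of the local linear trend model and its stationary form can be checked against each other by simulation. The sketch below assumes Z = [1 0], T = [[1, 1], [0, 1]] and R equal to the identity of order two, with the levels stacked before the slopes in the state vector; the shock values, the zero starting state and all variable names are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n, T_obs = 2, 50

Z = np.array([[1.0, 0.0]])
T = np.array([[1.0, 1.0], [0.0, 1.0]])
R = np.eye(2)
I_n = np.eye(n)

# Row t of each array holds the shock dated t (row 0 is unused).
eps = rng.standard_normal((T_obs + 1, n))
eta = rng.standard_normal((T_obs + 1, n))
delta = rng.standard_normal((T_obs + 1, n))

theta = np.zeros(2 * n)                      # state at time zero, set to zero for simplicity
alpha = np.zeros((T_obs + 1, n))
for t in range(1, T_obs + 1):
    kappa = np.concatenate([eta[t], delta[t]])
    theta = np.kron(T, I_n) @ theta + np.kron(R, I_n) @ kappa   # transition equation
    alpha[t] = np.kron(Z, I_n) @ theta + eps[t]                 # measurement equation

# Second differences of alpha_t for t = 3, ..., T_obs ...
nu = alpha[3:] - 2 * alpha[2:-1] + alpha[1:-2]
# ... against slope shock lagged once, differenced level shock, twice differenced irregular.
rhs = delta[2:-1] + (eta[3:] - eta[2:-1]) + (eps[3:] - 2 * eps[2:-1] + eps[1:-2])
```

The two arrays agree up to rounding error, which confirms that the second differences depend only on current and lagged shocks and therefore follow a moving average of order two.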

Seasonal Local Level Model

The seasonal local level model, with seasonal period s, is defined


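(2.12a) $\alpha_t = \mu_t + \gamma_t + \varepsilon_t, \qquad t = 1, \dots, T,$

(2.12b) $\mu_t = \mu_{t-1} + \eta_t,$

(2.12c) $S(L)\, \gamma_t = \omega_t,$

where the (n x 1) vector $\mu_t$ is the level or trend of $\alpha_t$, and the (n x 1) vector $\gamma_t$ the seasonal component. The polynomial in the lag operator L, S(L), is defined as $S(L) = (1 + L + \dots + L^{s-1})$; and the (n x 1) random shocks $\varepsilon_t$, $\eta_t$ and $\omega_t$ are assumed to be normally distributed, serially and mutually uncorrelated, with expected values equal to zero and variance covariance matrices $\Sigma_\varepsilon$, $\Sigma_\eta$ and $\Sigma_\omega$ respectively.

The model is transformed into the state space form (1.3) by defining auxiliary components $\gamma_{1t}, \dots, \gamma_{s-2,t}$, and by setting p = s, u = 2, and

(2.13a) $Z = \begin{bmatrix} 1 & 1 & 0 & \cdots & 0 \end{bmatrix},$

(2.13b) $T = \begin{bmatrix} 1 & 0 & 0 & \cdots & 0 & 0 \\ 0 & -1 & -1 & \cdots & -1 & -1 \\ 0 & 1 & 0 & \cdots & 0 & 0 \\ 0 & 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & \cdots & 1 & 0 \end{bmatrix},$

(2.13c) $R = \begin{bmatrix} I_2 \\ 0 \end{bmatrix},$

(2.13d) $\kappa_t' = (\eta_t', \omega_t'),$

and

(2.13e) $\theta_t' = (\mu_t', \gamma_t', \gamma_{1t}', \dots, \gamma_{s-2,t}').$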


That is, Z is a (1 x s) matrix, T is an (s x s) matrix, R is an (s x 2) matrix, $\kappa_t$ is a (2n x 1) vector, and the state vector $\theta_t$ has dimension (sn x 1). The auxiliary components $\gamma_{1t}, \dots, \gamma_{s-2,t}$ correspond to the seasonal effects at times (t-1), ..., (t-s+2) respectively. The stationary form of the model is obtained by taking seasonal differences in (2.12a) and using the fact that $(1-L)\, S(L) = (1 - L^s)$. That is,

(2.14) $\nu_t \equiv (1 - L^s)\, \alpha_t = S(L)\, \eta_t + (1-L)\, \omega_t + (1 - L^s)\, \varepsilon_t,$

for t = s+1, ..., T. The autocovariance function of $\nu_t$ is then given by

(2.15a) $\Gamma(0) = s\, \Sigma_\eta + 2\, \Sigma_\omega + 2\, \Sigma_\varepsilon,$

(2.15b) $\Gamma(\pm 1) = (s-1)\, \Sigma_\eta - \Sigma_\omega,$

(2.15c) $\Gamma(\pm 2) = (s-2)\, \Sigma_\eta,$

...

(2.15d) $\Gamma(\pm (s-1)) = \Sigma_\eta,$

(2.15e) $\Gamma(\pm s) = -\Sigma_\varepsilon,$

and

(2.15f) $\Gamma(\pm k) = 0, \quad \text{if } k > s.$

It follows that the reduced form of the seasonal local level is a restricted MA(s) model. The autocovariance generating function evaluated at $e^{-i\lambda}$ is

(2.16) $\begin{cases} s^2\, \Sigma_\eta, & \text{if } \lambda = 0, \\ (c_s / c_1)\, \Sigma_\eta + c_1\, \Sigma_\omega + c_s\, \Sigma_\varepsilon, & \text{if } \lambda \neq 0, \end{cases}$

with $c_r = 2 - 2\cos(\lambda r)$.
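The state space matrices of the seasonal local level model can be built for any seasonal period and checked by simulation. The sketch below is for a single series (n = 1); the helper name `seasonal_local_level_matrices` is hypothetical, and the layout assumed is a state holding the level, the current seasonal effect and the s-2 previous seasonal effects, with the level and seasonal shocks stacked in that order.

```python
import numpy as np

def seasonal_local_level_matrices(s):
    """Z (1 x s), T (s x s), R (s x 2) for the state (mu_t, gamma_t, ..., gamma_{t-s+2})."""
    Z = np.zeros((1, s))
    Z[0, 0] = 1.0
    Z[0, 1] = 1.0
    T = np.zeros((s, s))
    T[0, 0] = 1.0              # random walk level
    T[1, 1:] = -1.0            # gamma_t = -(gamma_{t-1} + ... + gamma_{t-s+1}) + omega_t
    for i in range(2, s):
        T[i, i - 1] = 1.0      # move each stored seasonal effect one period back
    R = np.zeros((s, 2))
    R[0, 0] = 1.0              # level shock
    R[1, 1] = 1.0              # seasonal shock
    return Z, T, R

rng = np.random.default_rng(1)
s, n_steps = 4, 40
Z, T, R = seasonal_local_level_matrices(s)
theta = rng.standard_normal(s)             # arbitrary starting state
gammas, omegas = [], []
for _ in range(n_steps):
    kappa = rng.standard_normal(2)         # (eta_t, omega_t)
    theta = T @ theta + R @ kappa
    gammas.append(theta[1])
    omegas.append(kappa[1])
gammas, omegas = np.array(gammas), np.array(omegas)

# Sum of s consecutive seasonal effects, i.e. S(L) applied to gamma_t.
seasonal_sums = np.convolve(gammas, np.ones(s), mode="valid")
```

Once s-1 simulated seasonal effects are available, each sum of s consecutive effects equals the seasonal shock of the latest period, as the seasonal equation of the model requires.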

As in the local level model, a fixed slope $\beta$ can be added to the trend component $\mu_t$ in (2.12b), and that is equivalent to including time as an exogenous variable with $\beta$ as its coefficient. On the other hand, if $\Sigma_\omega$ is equal to zero and the seasonal components at time zero are defined as fixed parameters, the model reduces to the local level model with fixed seasonal dummies as exogenous variables.

Basic Structural Model

The basic structural model, with seasonal period s, is defined as

(2.17a) $\alpha_t = \mu_t + \gamma_t + \varepsilon_t, \qquad t = 1, \dots, T,$

(2.17b) $\mu_t = \mu_{t-1} + \beta_{t-1} + \eta_t,$

(2.17c) $\beta_t = \beta_{t-1} + \delta_t,$

(2.17d) $S(L)\, \gamma_t = \omega_t,$

where the (n x 1) vector $\mu_t$ is the trend of $\alpha_t$, the (n x 1) vector $\beta_t$ is its slope and the (n x 1) vector $\gamma_t$ is the seasonal component. $S(L) = (1 + L + L^2 + \dots + L^{s-1})$, with L the lag operator; and the n dimensional random shocks $\varepsilon_t$, $\eta_t$, $\delta_t$ and $\omega_t$ are assumed to be normally distributed, serially and mutually uncorrelated, with expected values equal to zero and variance covariance matrices $\Sigma_\varepsilon$, $\Sigma_\eta$, $\Sigma_\delta$ and $\Sigma_\omega$ respectively.

The model is written in the state space form by defining auxiliary components $\gamma_{1t}, \dots, \gamma_{s-2,t}$ as in the seasonal local level model. In terms of the general form (1.3), p = (s+1), u = 3, and


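(2.18b) $T = \begin{bmatrix} 1 & 1 & 0 & 0 & \cdots & 0 & 0 \\ 0 & 1 & 0 & 0 & \cdots & 0 & 0 \\ 0 & 0 & -1 & -1 & \cdots & -1 & -1 \\ 0 & 0 & 1 & 0 & \cdots & 0 & 0 \\ 0 & 0 & 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & 0 & \cdots & 1 & 0 \end{bmatrix},$

(2.18c) $R = \begin{bmatrix} I_3 \\ 0 \end{bmatrix},$

(2.18d) $\kappa_t' = (\eta_t', \delta_t', \omega_t'),$

and

(2.18e) $\theta_t' = (\mu_t', \beta_t', \gamma_t', \gamma_{1t}', \dots, \gamma_{s-2,t}').$

That is, Z is a (1 x (s+1)) matrix, T is an ((s+1) x (s+1)) matrix, R is an ((s+1) x 3) matrix, $\kappa_t$ is a (3n x 1) vector and the state vector $\theta_t$ has dimension (n(s+1) x 1). The stationary form of the basic structural model is obtained by taking seasonal and regular differences in (2.17a), and using also the relation $(1-L)\, S(L) = (1 - L^s)$. That is,

(2.19) $\nu_t \equiv (1-L)(1-L^s)\, \alpha_t = S(L)\, \delta_{t-1} + (1-L^s)\, \eta_t + (1-L)^2 \omega_t + (1-L)(1-L^s)\, \varepsilon_t,$

for t = s+2, ..., T.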

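From this, the autocovariance function of $\nu_t$ is given by

(2.20a) $\Gamma(0) = s\, \Sigma_\delta + 2\, \Sigma_\eta + 6\, \Sigma_\omega + 4\, \Sigma_\varepsilon,$

(2.20b) $\Gamma(\pm 1) = (s-1)\, \Sigma_\delta - 4\, \Sigma_\omega - 2\, \Sigma_\varepsilon,$

(2.20c) $\Gamma(\pm 2) = (s-2)\, \Sigma_\delta + \Sigma_\omega,$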


(2.20e) $\Gamma(\pm (s-1)) = \Sigma_\delta + \Sigma_\varepsilon,$

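(2.20f) $\Gamma(\pm s) = -\Sigma_\eta - 2\, \Sigma_\varepsilon,$

(2.20g) $\Gamma(\pm (s+1)) = \Sigma_\varepsilon,$

and

(2.20h) $\Gamma(\pm k) = 0, \quad \text{if } k > s+1.$

Hence, the differences of $\alpha_t$ follow a restricted MA(s+1) model. The autocovariance generating function evaluated at $e^{-i\lambda}$ becomes

(2.21) $\begin{cases} s^2\, \Sigma_\delta, & \text{if } \lambda = 0, \\ (c_s / c_1)\, \Sigma_\delta + c_s\, \Sigma_\eta + c_1^2\, \Sigma_\omega + c_1 c_s\, \Sigma_\varepsilon, & \text{if } \lambda \neq 0, \end{cases}$

where $c_r = (2 - 2\cos(\lambda r))$ and $-\pi \le \lambda \le \pi$. The model is strictly invertible if both $\Sigma_\delta$ and $\Sigma_\omega$ are positive definite matrices.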
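The weights attached to each covariance matrix in the autocovariances of the differenced basic structural model can be recovered from the lag polynomials of its stationary form. The sketch below does this for s = 4; the helper names `ma_weights` and `gamma_weights` are assumptions made for this example, and the slope shock is treated as entering through S(L), since a one period shift does not change autocovariances.

```python
import numpy as np

def ma_weights(poly, k):
    """Sum over j of c_j * c_{j+k} for a lag polynomial with coefficients `poly` (lag 0 first)."""
    poly = np.asarray(poly, dtype=float)
    if k >= len(poly):
        return 0.0
    return float(poly[:len(poly) - k] @ poly[k:])

s = 4
one_minus_L = np.array([1.0, -1.0])
S = np.ones(s)                                    # S(L) = 1 + L + ... + L^(s-1)
one_minus_Ls = np.convolve(one_minus_L, S)        # (1 - L) S(L) = 1 - L^s

# Lag polynomial multiplying each shock in the stationary form of the model.
polys = {
    "delta": S,
    "eta": one_minus_Ls,
    "omega": np.convolve(one_minus_L, one_minus_L),
    "eps": np.convolve(one_minus_L, one_minus_Ls),
}

def gamma_weights(k):
    """Coefficient of each covariance matrix in the lag-k autocovariance."""
    return {name: ma_weights(p, k) for name, p in polys.items()}

lag0, lag1, lag_s = gamma_weights(0), gamma_weights(1), gamma_weights(s)
```

For s = 4 the lag zero weights are 4, 2, 6 and 4 for the slope, level, seasonal and irregular covariance matrices, in agreement with (2.20a), and all weights vanish beyond lag s+1.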

When $\Sigma_\delta$ is equal to zero, and the component $\beta_t$ at time zero is defined as a fixed parameter, the basic structural model reduces to the seasonal local level with time as an exogenous variable. If also $\Sigma_\omega$ [...]


1.3 The Kalman Filter

This section presents the standard Kalman filter formulas for the state space model (1.3). The filter is used to obtain estimates of the unobserved components $\theta_t$ and to form the likelihood of the model. The general formulas are presented assuming that the vectors $\alpha_1, \dots, \alpha_T$ are observed and that the variance covariance matrices of the random shocks are known. These assumptions are of course unrealistic, but the rationale is the following. To obtain estimates of the unknown parameters in the model, the likelihood function is maximised using a nonlinear optimisation procedure which requires the evaluation of this function. This evaluation is provided by the Kalman filter and, obviously, initial values for the parameters are needed. On the other hand, once the parameters of the model have been estimated, the Kalman filter presented in this section yields the estimates of the unobserved trend and seasonal components in the state vector $\theta_t$, as well as their mean square errors.

The section first presents general formulas for the Kalman filter, and studies the construction of the initial quantities required in the filter recursions. Particular cases of interest are considered later: the Kalman filter for a homogeneous model and the treatment of the vector of coefficients $\beta$ of the exogenous variables when the vectors $\alpha_t$, t = 1, ..., T, are not observed.

To present the general Kalman filter recursions, the following definitions are required:

(3.1a) $\Psi_t = \{\alpha_1, \dots, \alpha_t\},$


(3.1c) $P_{t|t-1} = V(\theta_t / \Psi_{t-1}),$

(3.1d) $P_t = V(\theta_t / \Psi_t),$

(3.1e) $v_t = \alpha_t - E(\alpha_t / \Psi_{t-1}),$

(3.1f) $F_t = V(v_t / \Psi_{t-1}),$

where E(x) and V(x) represent the expected value and variance covariance matrix of the random variable x. That is, $\Psi_t$ is the information set up to time t, $m_t$ is the estimator of the state vector at time t with information up to time t, while $v_t$ is the one step ahead prediction error of $\alpha_t$. $P_{t|t-1}$, $P_t$ and $F_t$ are conditional variances. With these definitions, the Kalman filter formulas for the general model (1.3) are

(3.2a) $P_{t|t-1} = (T \otimes I_n)\, P_{t-1}\, (T' \otimes I_n) + (R \otimes I_n)\, \Sigma_\kappa\, (R' \otimes I_n),$

(3.2b) $F_t = (Z \otimes I_n)\, P_{t|t-1}\, (Z' \otimes I_n) + \Sigma_\varepsilon,$

(3.2c) $P_t = P_{t|t-1} - P_{t|t-1}\, (Z' \otimes I_n)\, F_t^{-1}\, (Z \otimes I_n)\, P_{t|t-1},$

(3.2d) $m_t = (T \otimes I_n)\, m_{t-1} + P_{t|t-1}\, (Z' \otimes I_n)\, F_t^{-1}\, v_t,$

and

(3.2e) $v_t = \alpha_t - (Z T \otimes I_n)\, m_{t-1};$

see for example Anderson and Moore (1979, sec. 3.1) or Harvey (1990, sec. 3.3). Under normality, the Kalman filter yields the minimum mean square error estimator of $\theta_t$ and $\alpha_t$ conditional on an information set.
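The recursions (3.2) translate almost line by line into code. The following is a minimal sketch, assuming known variance covariance matrices and given starting values; the function name `kalman_filter`, its argument names and the returned dictionary are hypothetical choices for this example and not the implementation used in the study.

```python
import numpy as np

def kalman_filter(alpha, Z, T, R, Sigma_eps, Sigma_kappa, m0, P0):
    """Run recursions (3.2); rows of `alpha` are the observed vectors alpha_1, ..., alpha_T."""
    n = alpha.shape[1]
    I_n = np.eye(n)
    Zn, Tn, Rn = np.kron(Z, I_n), np.kron(T, I_n), np.kron(R, I_n)
    m, P = np.asarray(m0, dtype=float), np.asarray(P0, dtype=float)
    out = {"m": [], "P": [], "v": [], "F": []}
    for a_t in alpha:
        P_pred = Tn @ P @ Tn.T + Rn @ Sigma_kappa @ Rn.T     # (3.2a)
        F = Zn @ P_pred @ Zn.T + Sigma_eps                   # (3.2b)
        v = a_t - Zn @ Tn @ m                                # (3.2e)
        K = P_pred @ Zn.T @ np.linalg.inv(F)                 # gain appearing in (3.2c)-(3.2d)
        m = Tn @ m + K @ v                                   # (3.2d)
        P = P_pred - K @ Zn @ P_pred                         # (3.2c)
        for key, val in zip("mPvF", (m, P, v, F)):
            out[key].append(val)
    return {key: np.array(val) for key, val in out.items()}

# Univariate local level example (Z = T = R = 1) with illustrative values.
one = np.array([[1.0]])
res = kalman_filter(np.array([[3.0]]), one, one, one,
                    Sigma_eps=np.array([[2.0]]), Sigma_kappa=np.array([[1.0]]),
                    m0=np.array([0.0]), P0=np.array([[1.0]]))
```

In the example the predicted state variance is 2, the prediction error variance is 4 and the gain is 0.5, so a single observation of 3 gives a filtered level of 1.5 with mean square error 1.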


[...] equations minimise the mean square errors within the class of linear estimators. Formulas for smoothing and forecasting are found in Anderson and Moore (1979, ch. 7) and Harvey (1990, sec. 3.3).

Two important questions with respect to the recursions (3.2) arise at this point. These are the existence of a steady state Kalman filter and the specification of the initial values $m_0$ and $P_0$ needed to start the filter. Harvey (1990, sec. 3.3) showed that the models presented here always satisfy necessary conditions for a steady state Kalman filter. That is, if the above formulas are started with a positive semi definite matrix $P_0$, then

(3.3) $\lim_{t \to \infty} P_t = P,$

where P is unique and independent of $P_0$. If, in addition, the stationary form of the model is strictly invertible, the convergence is exponentially fast.
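The steady state property can be seen in the simplest case. The sketch below iterates the variance recursions for a univariate local level model from two very different starting values; the function name `filtered_variance_path` and the variances used (1 for the level shock, 2 for the irregular) are illustrative assumptions.

```python
import numpy as np

def filtered_variance_path(P0, sigma2_eta, sigma2_eps, steps):
    """Iterate (3.2a)-(3.2c) for a univariate local level model; returns P_1, ..., P_steps."""
    P, path = float(P0), []
    for _ in range(steps):
        P_pred = P + sigma2_eta            # (3.2a) with T = R = 1
        F = P_pred + sigma2_eps            # (3.2b) with Z = 1
        P = P_pred - P_pred ** 2 / F       # (3.2c)
        path.append(P)
    return np.array(path)

path_a = filtered_variance_path(0.0, 1.0, 2.0, 50)      # started from a known state
path_b = filtered_variance_path(100.0, 1.0, 2.0, 50)    # started from a vague state
```

With these values the fixed point of the recursion is 1, since a filtered variance of 1 gives a predicted variance of 2, a prediction error variance of 4, and an updated variance of 2 - 4/4 = 1. Both paths settle on this value within a few iterations.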

With respect to the initial values $m_0$ and $P_0$, as no prior information for these quantities is, in general, available, a diffuse prior for the state vector at time zero is defined. That is, the state vector at time zero is assumed to have a normal distribution with expected value zero and variance covariance matrix $(\tau I_{np})$, where $\tau$ is a large number. On the other hand, it may be the case that only some linear combinations of the components of the state vector are defined as diffuse, while for other linear combinations a proper prior at time zero can be defined. In that situation the state vector is said to have a semi diffuse prior.

The general solution to deal with the Kalman filter under a diffuse or semi diffuse prior specification is presented in the next section, and that involves some modifications to the standard Kalman filter [...]


general, is presented in Harvey (1990, sec. 3.3) and may be simpler in many practical situations. Harvey showed that, for the models considered here, a proper prior can be formed using the first $p$ vectors of observations, where $p$ is the number of components in the state vector for each time series. The procedure is the following. From (1.3a), the first $p$ vectors $\alpha_1, \ldots, \alpha_p$ can be written in terms of $\theta_p$, by repeated substitution for $\theta_t$ from (1.3b). This yields

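(3.4a)
$$
\begin{bmatrix} \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_{p-1} \\ \alpha_p \end{bmatrix}
= \left( \begin{bmatrix} Z T^{1-p} \\ Z T^{2-p} \\ \vdots \\ Z T^{-1} \\ Z \end{bmatrix} \otimes I_n \right) \theta_p
- \left( \begin{bmatrix} Z T^{1-p} R \\ Z T^{2-p} R \\ \vdots \\ Z T^{-1} R \\ 0 \end{bmatrix} \otimes I_n \right) \kappa_p
- \left( \begin{bmatrix} Z T^{2-p} R \\ Z T^{3-p} R \\ \vdots \\ 0 \\ 0 \end{bmatrix} \otimes I_n \right) \kappa_{p-1}
- \cdots
- \left( \begin{bmatrix} Z T^{-1} R \\ 0 \\ \vdots \\ 0 \\ 0 \end{bmatrix} \otimes I_n \right) \kappa_2
+ \begin{bmatrix} \varepsilon_1 \\ \varepsilon_2 \\ \vdots \\ \varepsilon_{p-1} \\ \varepsilon_p \end{bmatrix},
$$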

which, with obvious notation, can be written as

(3.4b)  $\alpha = (H \otimes I_n)\,\theta_p + (H_p \otimes I_n)\,\kappa_p + \cdots + (H_2 \otimes I_n)\,\kappa_2 + \varepsilon = (H \otimes I_n)\,\theta_p + e,$

where $\alpha$, $\theta_p$ and $\varepsilon$ are vectors of dimension $np$, $H$ is a $(p \times p)$ matrix, $H_p, \ldots, H_2$ are $(p \times u)$ matrices and $e$ is a vector of dimension $np$ and expected value zero. The generalised least squares estimator of $\theta_p$ is then given by

(3.5)  $m_p = (H^{-1} \otimes I_n)\,\alpha,$

and the variance covariance matrix of the estimation error by

(3.6)  $P_p = (H^{-1} \otimes I_n)\,V(e)\,(H'^{-1} \otimes I_n),$


where $V(e)$ is the variance covariance matrix of the vector $e$; see also Duncan and Horn (1972). These quantities are then used to start the recursive formulas (3.2) from $t = (p+1)$. The problem, however, is to find analytic expressions for $H$ and $V(e)$ in order to evaluate $m_p$ and $P_p$. Harvey presented the solution for the local level model. In the following lines the solutions for the local linear trend, seasonal local level and basic structural models are also presented. The proofs, which require tedious matrix algebra, use equations (3.4) and the matrix algebra results presented in Appendix 1.1.

In the local level model the value of $p$ is unity,

(3.7a)  $m_1 = \alpha_1,$

and

(3.7b)  $P_1 = \Sigma_\varepsilon.$

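In the local linear trend model the value of $p$ is two,

(3.8a)  $m_2 = \begin{bmatrix} \alpha_2 \\ \alpha_2 - \alpha_1 \end{bmatrix},$

and

(3.8b)  $P_2 = \begin{bmatrix} \Sigma_\varepsilon & \Sigma_\varepsilon \\ \Sigma_\varepsilon & 2\Sigma_\varepsilon + \Sigma_\eta + \Sigma_\zeta \end{bmatrix}.$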
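The construction of the starting values from the first $p$ observations can be checked numerically. The sketch below assumes a univariate series ($n = 1$), so that the Kronecker products with $I_n$ drop out, and assumes a state equation of the form $\theta_t = T\theta_{t-1} + R\kappa_t$ with the variance covariance matrix of $\kappa_t$ given by `D_kappa`. The function name `initial_conditions` and all numerical values are illustrative. It builds $H$ and $V(e)$ by repeated substitution and applies (3.5) and (3.6).

```python
import numpy as np

def initial_conditions(Z, T, R, D_kappa, var_eps, alpha):
    """Return (m_p, P_p) for a univariate model from the first p observations."""
    p = T.shape[0]
    T_inv = np.linalg.inv(T)
    power = np.linalg.matrix_power
    # Row r of H (r = 1, ..., p) is Z T^{r-p}.
    H = np.vstack([Z @ power(T_inv, p - r) for r in range(1, p + 1)])
    # V(e) = var_eps * I + sum over j of H_j D_kappa H_j'.
    V = var_eps * np.eye(p)
    for j in range(2, p + 1):
        H_j = np.zeros((p, R.shape[1]))
        for r in range(1, j):
            # kappa_j enters alpha_r (r < j) with coefficient -Z T^{r-j} R
            H_j[r - 1, :] = -(Z @ power(T_inv, j - r) @ R)[0]
        V += H_j @ D_kappa @ H_j.T
    H_inv = np.linalg.inv(H)
    return H_inv @ alpha, H_inv @ V @ H_inv.T

# Local linear trend model with assumed variances 1.0, 0.5 and 0.1
# for the irregular, level and slope disturbances.
Z = np.array([[1.0, 0.0]])
T = np.array([[1.0, 1.0], [0.0, 1.0]])
R = np.eye(2)
D_kappa = np.diag([0.5, 0.1])
m2, P2 = initial_conditions(Z, T, R, D_kappa, 1.0, np.array([3.0, 5.0]))
print(m2)
print(P2)
```

For these values the level estimate is the second observation, the slope estimate is the difference of the two observations, and the slope variance is twice the irregular variance plus the two trend variances.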


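In the seasonal local level model the value of $p$ is $s$,

(3.9a)  $m_s = \begin{bmatrix} \bar\alpha_s \\ \alpha_s - \bar\alpha_s \\ \vdots \\ \alpha_2 - \bar\alpha_s \end{bmatrix},$

where $\bar\alpha_s = (\alpha_1 + \cdots + \alpha_s)/s$. The matrix $P_s$ is formed with (3.6) and

(3.9b)  $V(e) = (I_s \otimes \Sigma_\varepsilon) + (A_1 \otimes \Sigma_\eta) + (A_2 \otimes \Sigma_\omega),$

where the $(s \times s)$ matrices $H^{-1}$ and $A_1$ are given by

(3.9c)  $H^{-1} = (1/s) \begin{bmatrix} 1 & 1 & \cdots & 1 & 1 \\ -1 & -1 & \cdots & -1 & (s-1) \\ -1 & -1 & \cdots & (s-1) & -1 \\ \vdots & & & & \vdots \\ -1 & (s-1) & \cdots & -1 & -1 \end{bmatrix}$

and

(3.9d)  $A_1 = \begin{bmatrix} (s-1) & (s-2) & (s-3) & \cdots & 1 & 0 \\ (s-2) & (s-2) & (s-3) & \cdots & 1 & 0 \\ (s-3) & (s-3) & (s-3) & \cdots & 1 & 0 \\ \vdots & & & & & \vdots \\ 1 & 1 & 1 & \cdots & 1 & 0 \\ 0 & 0 & 0 & \cdots & 0 & 0 \end{bmatrix},$

while the $(s \times s)$ matrix $A_2$ has all its elements equal to zero with the exception of the element in the first column and first row, which is unity.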
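The closed forms for the seasonal local level model can be compared with a direct numerical construction. The sketch below is univariate ($n = 1$) and assumes a dummy-variable seasonal with state $(\mu_t, \gamma_t, \ldots, \gamma_{t-s+2})$, $s = 4$, and hypothetical variances; the helper names are illustrative. It takes $H^{-1}$ to have first row $1/s$ and remaining rows that subtract the mean from $\alpha_s, \ldots, \alpha_2$, and $A_1$ to have element $s - \max(i, j)$, which is how (3.9c) and (3.9d) are read here.

```python
import numpy as np

def gls_pieces(Z, T, R, D_kappa, var_eps):
    """Return H^{-1} and V(e) for a univariate model by repeated substitution."""
    p = T.shape[0]
    T_inv = np.linalg.inv(T)
    power = np.linalg.matrix_power
    H = np.vstack([Z @ power(T_inv, p - r) for r in range(1, p + 1)])
    V = var_eps * np.eye(p)
    for j in range(2, p + 1):
        H_j = np.zeros((p, R.shape[1]))
        for r in range(1, j):
            # kappa_j enters alpha_r (r < j) with coefficient -Z T^{r-j} R
            H_j[r - 1, :] = -(Z @ power(T_inv, j - r) @ R)[0]
        V += H_j @ D_kappa @ H_j.T
    return np.linalg.inv(H), V

def seasonal_local_level(s):
    """System matrices for the state (mu_t, gamma_t, ..., gamma_{t-s+2})."""
    T = np.zeros((s, s))
    T[0, 0] = 1.0                    # random walk level
    T[1, 1:] = -1.0                  # seasonal effects sum to a disturbance
    T[2:, 1:-1] = np.eye(s - 2)      # shift the lagged seasonal effects
    Z = np.zeros((1, s))
    Z[0, :2] = 1.0
    R = np.zeros((s, 2))
    R[0, 0] = R[1, 1] = 1.0          # eta drives the level, omega the seasonal
    return Z, T, R

s, var_eps, var_eta, var_omega = 4, 1.0, 0.5, 0.2
Z, T, R = seasonal_local_level(s)
H_inv, V = gls_pieces(Z, T, R, np.diag([var_eta, var_omega]), var_eps)

# Closed forms as read from (3.9b) to (3.9d).
H_inv_closed = -np.ones((s, s))
H_inv_closed[0, :] = 1.0
for i in range(1, s):
    H_inv_closed[i, s - i] = s - 1.0
H_inv_closed /= s
idx = np.arange(1, s + 1)
A1 = (s - np.maximum.outer(idx, idx)).astype(float)
A2 = np.zeros((s, s))
A2[0, 0] = 1.0
V_closed = var_eps * np.eye(s) + var_eta * A1 + var_omega * A2
print(np.allclose(H_inv, H_inv_closed), np.allclose(V, V_closed))
```

Agreement for one value of $s$ is only a spot check of the algebra, not a substitute for the proofs referred to in the text.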


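In the basic structural model the value of $p$ is $(s+1)$,

(3.10a)  $m_{s+1} = \begin{bmatrix} \bar\alpha_s \\ \delta \\ \alpha_{s+1} - \bar\alpha_s \\ \alpha_s + \delta - \bar\alpha_s \\ \vdots \\ \alpha_3 + (s-2)\delta - \bar\alpha_s \end{bmatrix},$

where,

(3.10b)  $\delta = (\alpha_{s+1} - \alpha_1)/s$

and

(3.10b)  $\bar\alpha_s = (1/s)\,[\alpha_{s+1} + (\alpha_s + \delta) + (\alpha_{s-1} + 2\delta) + \cdots + (\alpha_2 + (s-1)\delta)].$

The matrix $P_{s+1}$ is formed using (3.6) and

(3.10c)  $V(e) = (I_{s+1} \otimes \Sigma_\varepsilon) + (A_1 \otimes \Sigma_\eta) + (A_2 \otimes \Sigma_\zeta) + (A_3 \otimes \Sigma_\omega),$

where the $(s+1 \times s+1)$ matrices $H^{-1}$, $A_1$, $A_2$ and $A_3$ are given by

(3.10d)  $H^{-1} = (1/2s) \begin{bmatrix} -(s-1) & 2 & 2 & \cdots & 2 & 2 & (s+1) \\ -2 & 0 & 0 & \cdots & 0 & 0 & 2 \\ (s-1) & -2 & -2 & \cdots & -2 & -2 & (s-1) \\ (s-3) & -2 & -2 & \cdots & -2 & 2(s-1) & -(s-1) \\ (s-5) & -2 & -2 & \cdots & 2(s-1) & -2 & -(s-3) \\ \vdots & & & & & & \vdots \\ -(s-3) & -2 & 2(s-1) & \cdots & -2 & -2 & (s-5) \end{bmatrix},$

(3.10e)  $A_1 = \begin{bmatrix} s & (s-1) & (s-2) & \cdots & 1 & 0 \\ (s-1) & (s-1) & (s-2) & \cdots & 1 & 0 \\ (s-2) & (s-2) & (s-2) & \cdots & 1 & 0 \\ \vdots & & & & & \vdots \\ 1 & 1 & 1 & \cdots & 1 & 0 \\ 0 & 0 & 0 & \cdots & 0 & 0 \end{bmatrix},$

(3.10f)  $A_2 = \begin{bmatrix} s & (s-1) & (s-2) & \cdots & 2 & 1 & 0 \\ (s-1) & (s-2) & (s-3) & \cdots & 1 & 0 & 0 \\ (s-2) & (s-3) & (s-4) & \cdots & 0 & 0 & 0 \\ \vdots & & & & & & \vdots \\ 1 & 0 & 0 & \cdots & 0 & 0 & 0 \\ 0 & 0 & 0 & \cdots & 0 & 0 & 0 \end{bmatrix}^{2}$

and

(3.10g)  $A_3 = \begin{bmatrix} 2 & -1 & 0 & \cdots & 0 \\ -1 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 0 & \cdots & 0 \\ \vdots & & & & \vdots \\ 0 & 0 & 0 & \cdots & 0 \end{bmatrix}.$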
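A numerical spot check of the basic structural model results can be carried out in the same way. The sketch below is univariate ($n = 1$) and assumes a local linear trend plus dummy-variable seasonal with state $(\mu_t, \beta_t, \gamma_t, \ldots, \gamma_{t-s+2})$, $s = 4$, and hypothetical observations and variances; all names are illustrative. It takes $\delta = (\alpha_{s+1} - \alpha_1)/s$ as the slope estimate, $A_1$ with element $(s+1) - \max(i, j)$, $A_2$ as the square of the matrix with element $\max(s + 2 - i - j, 0)$, and $A_3$ with upper left block $[2, -1; -1, 1]$, which is how (3.10) is read here.

```python
import numpy as np

def gls_pieces(Z, T, R, D_kappa, var_eps):
    """Return H^{-1} and V(e) for a univariate model by repeated substitution."""
    p = T.shape[0]
    T_inv = np.linalg.inv(T)
    power = np.linalg.matrix_power
    H = np.vstack([Z @ power(T_inv, p - r) for r in range(1, p + 1)])
    V = var_eps * np.eye(p)
    for j in range(2, p + 1):
        H_j = np.zeros((p, R.shape[1]))
        for r in range(1, j):
            # kappa_j enters alpha_r (r < j) with coefficient -Z T^{r-j} R
            H_j[r - 1, :] = -(Z @ power(T_inv, j - r) @ R)[0]
        V += H_j @ D_kappa @ H_j.T
    return np.linalg.inv(H), V

def basic_structural(s):
    """System matrices for the state (mu_t, beta_t, gamma_t, ..., gamma_{t-s+2})."""
    p = s + 1
    T = np.zeros((p, p))
    T[0, 0] = T[0, 1] = T[1, 1] = 1.0    # local linear trend block
    T[2, 2:] = -1.0                      # seasonal effects sum to a disturbance
    T[3:, 2:-1] = np.eye(s - 2)          # shift the lagged seasonal effects
    Z = np.zeros((1, p))
    Z[0, 0] = Z[0, 2] = 1.0
    R = np.zeros((p, 3))
    R[0, 0] = R[1, 1] = R[2, 2] = 1.0    # eta, zeta and omega
    return Z, T, R

s = 4
var_eps, var_eta, var_zeta, var_omega = 1.0, 0.5, 0.1, 0.2
Z, T, R = basic_structural(s)
H_inv, V = gls_pieces(Z, T, R, np.diag([var_eta, var_zeta, var_omega]), var_eps)

alpha = np.array([2.0, 3.5, 1.0, 4.0, 2.5])      # alpha_1, ..., alpha_{s+1}
delta = (alpha[s] - alpha[0]) / s
level = np.mean([alpha[j - 1] + (s + 1 - j) * delta for j in range(2, s + 2)])
seasonal = [alpha[s - k] + k * delta - level for k in range(s - 1)]
m_closed = np.array([level, delta] + seasonal)

idx = np.arange(1, s + 2)
A1 = (s + 1 - np.maximum.outer(idx, idx)).astype(float)
L = np.maximum(s + 2 - np.add.outer(idx, idx), 0).astype(float)
A2 = L @ L
A3 = np.zeros((s + 1, s + 1))
A3[:2, :2] = [[2.0, -1.0], [-1.0, 1.0]]
V_closed = var_eps * np.eye(s + 1) + var_eta * A1 + var_zeta * A2 + var_omega * A3
print(np.allclose(H_inv @ alpha, m_closed), np.allclose(V, V_closed))
```

The comparison uses a single seasonal period and one set of values, so it only indicates that the closed forms and the direct substitution are consistent under the stated assumptions.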

Homogeneous Models

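It can be shown, using an induction principle, that under the homogeneity restriction (1.4),

(3.11a)  $P_{t|t-1} = Q_{t|t-1} \otimes \Sigma_\varepsilon,$

(3.11b)  $P_t = Q_t \otimes \Sigma_\varepsilon,$

and

(3.11c)  $F_t = f_t\,\Sigma_\varepsilon, \quad t = p, \ldots, T,$

where $Q_{t|t-1}$, $Q_t$ and $f_t$ have dimensions $(p \times p)$, $(p \times p)$ and $(1 \times 1)$ respectively; and they are evaluated, from $t = (p+1)$, according to the recursions

(3.12a)  $Q_{t|t-1} = T\,Q_{t-1}\,T' + R\,D_\kappa\,R',$

(3.12b)  $Q_t = Q_{t|t-1} - Q_{t|t-1}\,Z'\,f_t^{-1}\,Z\,Q_{t|t-1},$

and

(3.12c)  $f_t = Z\,Q_{t|t-1}\,Z' + 1,$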

which are exactly equivalent to the Kalman filter recursions of a univariate model with the variance of the irregular random shock $\varepsilon_t$ equal to unity and the variance covariance matrix of $\kappa_t$ equal to $D_\kappa$. The estimates of the state vector $\theta_t$, $t = p+1, \ldots, T$, are then obtained from

(3.13)  $m_t = (T \otimes I_n)\,m_{t-1} + (Q_{t|t-1}\,Z'\,f_t^{-1} \otimes I_n)\,v_t.$

The vector $m_t$ in (3.13) can also be computed by running the Kalman filter equations for each time series in turn, with the variance of $\varepsilon_t$ equal to unity and the variance covariance matrix of $\kappa_t$ equal to $D_\kappa$. Thus, the Kalman filter recursions for a homogeneous model can be computed separately for each time series as if the model were univariate. The full matrices $P_{t|t-1}$, $P_t$ and $F_t$ are then formed using (3.11).
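The Kronecker structure of a homogeneous model can be verified numerically. The sketch below assumes a bivariate local linear trend model ($n = 2$, $p = 2$) in which the variance covariance matrix of $\kappa_t$ is $D_\kappa \otimes \Sigma_\varepsilon$; the starting matrix `Q`, the matrix `Sigma_eps` and the entries of `D_kappa` are hypothetical. The full multivariate variance recursions are run side by side with the univariate ones and compared at every step.

```python
import numpy as np

Z = np.array([[1.0, 0.0]])
T = np.array([[1.0, 1.0], [0.0, 1.0]])
R = np.eye(2)
D_kappa = np.diag([0.5, 0.1])                  # relative variances (assumed)
Sigma_eps = np.array([[1.0, 0.3], [0.3, 2.0]])
n = Sigma_eps.shape[0]
I_n = np.eye(n)

Q = np.array([[1.0, 1.0], [1.0, 2.6]])         # assumed starting value Q_p
P = np.kron(Q, Sigma_eps)                      # homogeneous starting value P_p

ZI, TI, RI = np.kron(Z, I_n), np.kron(T, I_n), np.kron(R, I_n)
Sigma_kappa = np.kron(D_kappa, Sigma_eps)

checks = []
for _ in range(5):
    # Univariate recursions with unit irregular variance.
    Q_pred = T @ Q @ T.T + R @ D_kappa @ R.T
    f = (Z @ Q_pred @ Z.T).item() + 1.0
    Q = Q_pred - Q_pred @ Z.T @ Z @ Q_pred / f
    # Full multivariate recursions.
    P_pred = TI @ P @ TI.T + RI @ Sigma_kappa @ RI.T
    F = ZI @ P_pred @ ZI.T + Sigma_eps
    K = P_pred @ ZI.T @ np.linalg.inv(F)
    P = P_pred - K @ ZI @ P_pred
    checks.append(
        np.allclose(P_pred, np.kron(Q_pred, Sigma_eps))
        and np.allclose(P, np.kron(Q, Sigma_eps))
        and np.allclose(F, f * Sigma_eps)
        and np.allclose(K, np.kron(Q_pred @ Z.T / f, I_n))
    )
print(checks)
```

The last comparison shows that the gain has the form of a univariate gain Kronecker-multiplied by $I_n$, which is why each series can be filtered separately.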

Exogenous Variables

To run the Kalman filter using equations (3.2), it was assumed that the vector of exogenous variable coefficients $\beta$ was known. An important result proved by Kohn and Ansley (1985) for the univariate case, $n = 1$, is that the filter can be run conditional on $\beta$. That is, if the vector of coefficients $\beta$ is unknown, the prediction errors and the estimators of the state vectors can be expressed as explicit functions of $\beta$. The idea can be extended to the multivariate case, $n > 1$, in a straightforward form. Define the $(np \times 1)$ vector $m_{yt}$ and the $(np \times k)$ matrix $M_{xt}$ by

(3.14a)  $m_{yt} = (T \otimes I_n)\,m_{y,t-1} + K_t\,[\,y_t - (Z T \otimes I_n)\,m_{y,t-1}\,],$

and

(3.14b)  $M_{xt} = (T \otimes I_n)\,M_{x,t-1} + K_t\,[\,X_t - (Z T \otimes I_n)\,M_{x,t-1}\,],$

for $t = p+1, \ldots, T$; where $K_t$ is the gain matrix defined as $K_t = [\,P_{t|t-1}\,(Z' \otimes I_n)\,F_t^{-1}\,]$. Then, from (3.2d) and (3.2e), the estimator of the state vector at time $t$, $m_t$, can be written as

(3.15)  $m_t = m_{yt} - M_{xt}\,\beta,$

provided (3.15) holds for $t = p$; and that is immediate from (3.5). Using (3.2e), the prediction error $v_t$ can be written as

(3.16)  $v_t = v_{yt} - V_{xt}\,\beta, \quad t = p+1, \ldots, T,$

where $v_{yt}$ is an $(n \times 1)$ vector of pseudo innovations after running the Kalman filter over the time series $y_t$, and $V_{xt}$ is an $(n \times k)$ matrix of pseudo innovations after running the filter over each column of $X_t$. Notice that the recursions for the estimators of the state vectors and

1.4 The Diffuse Kalman Filter

This section presents alternative and more general Kalman filter formulas for the situation in which the state vector in a state space model is defined as diffuse or semi diffuse. The results were developed by De Jong (1988, 1989), although the presentation here is somewhat different and more direct. The main advantage of the diffuse Kalman filter over the standard formulas presented in the previous section is that the problem of the initial conditions needed to start the recursions is solved by considering an extended filter. The disadvantage is that the recursions are slightly more complicated.

Consider the state space model (1.3) with initial state vector given by

(4.1)  $\theta_0 = a + A\,\xi,$

where $a$ is an $(np \times 1)$ normal random variable with expected value zero and variance covariance matrix $\Sigma_a$, $A$ is an $(np \times r)$ matrix of known fixed values, and $\xi$ is an $(r \times 1)$ normal random variable with expected value zero and variance covariance matrix $\Sigma_\xi$. Furthermore, assume that $a$ and $\xi$ are uncorrelated with each other and with $\varepsilon_t$ and $\kappa_t$ defined in (1.3). The vector $\xi$ is said to be diffuse if $\Sigma_\xi^{-1}$ converges to zero in the Euclidean norm. In general, (4.1) specifies a semi diffuse state vector at time zero; but if $a = 0$ and $A = I_{np}$, $\theta_0$ is said to be diffuse and the results presented here coincide with the ones in the previous section.

Consider the idea of running the Kalman filter conditional on $\xi$. Define $m^{\xi}_t$, $P^{\xi}_{t|t-1}$, $P^{\xi}_t$, $v^{\xi}_t$ and $F^{\xi}_t$ exactly as in (3.1) but conditional on $\xi$, with

(4.2)  $P^{\xi}_0 = \Sigma_a.$

Using an algebraic manipulation similar to the one applied to the model with exogenous variables, it can be shown that $m^{\xi}_t$ and $v^{\xi}_t$ can be expressed as

(4.3a)  $m^{\xi}_t = m_{1t} - M_{2t}\,\xi, \quad t = 1, \ldots, T,$

and

(4.3b)  $v^{\xi}_t = v_{1t} - V_{2t}\,\xi, \quad t = 1, \ldots, T,$

where,

(4.4a)  $m_{1t} = (T \otimes I_n)\,m_{1,t-1} + K^{\xi}_t\,v_{1t},$

(4.4b)  $M_{2t} = (T \otimes I_n)\,M_{2,t-1} + K^{\xi}_t\,V_{2t},$

(4.4c)  $v_{1t} = \alpha_t - (Z T \otimes I_n)\,m_{1,t-1},$

(4.4d)  $V_{2t} = -(Z T \otimes I_n)\,M_{2,t-1},$

$m_{10} = 0$, $M_{20} = -A$, and $K^{\xi}_t$ is the gain matrix defined as $K^{\xi}_t = [\,P^{\xi}_{t|t-1}\,(Z' \otimes I_n)\,(F^{\xi}_t)^{-1}\,]$; see also Rosenberg (1973). Thus, $m_{1t}$ and $v_{1t}$ are obtained after running the standard Kalman filter over the time series $\alpha_t$ with $\xi = 0$; while $M_{2t}$ and $V_{2t}$ are obtained after running the same filter over each column of an $(n \times r)$ matrix of zeroes with the initial estimate for the state vector given by the columns of $-A$.
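The decomposition in (4.3) and (4.4) can be illustrated in the simplest possible setting. The sketch below assumes a univariate local level model, so that $Z = T = 1$ and $n = p = r = 1$, with $A = 1$ and hypothetical variances and data; the function name is illustrative. A filter started at the conditional mean $A\xi$ is compared with the combination of the two auxiliary filters started at $0$ and $-A$.

```python
def run_filters(alpha, xi, A=1.0, var_a=0.4, var_eps=1.0, var_eta=0.5):
    """Return the largest gap between the direct and the decomposed filter."""
    m_direct, P = A * xi, var_a      # filter conditional on xi
    m1, M2 = 0.0, -A                 # starting values m_10 = 0 and M_20 = -A
    gaps = []
    for a_t in alpha:
        P_pred = P + var_eta
        F = P_pred + var_eps
        K = P_pred / F               # the gain does not depend on xi
        v_direct = a_t - m_direct
        v1, V2 = a_t - m1, -M2       # (4.4c) and (4.4d) with Z = T = 1
        m_direct = m_direct + K * v_direct
        m1, M2 = m1 + K * v1, M2 + K * V2
        P = P_pred - K * P_pred
        gaps.append(abs(v_direct - (v1 - V2 * xi)))
        gaps.append(abs(m_direct - (m1 - M2 * xi)))
    return max(gaps)

worst_gap = run_filters([1.2, 0.7, 1.9, 2.4, 1.1], xi=3.0)
print(worst_gap)
```

Because the variance recursions and the gain are free of $\xi$, the two auxiliary filters need to be run only once, whatever value $\xi$ takes.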

At this stage, two general results concerning random variables are required. If $\xi$ and $z$ are random variables, then

(4.5a)  $E(z) = E_\xi\,E(z / \xi),$

(4.5b)  $V(z) = E_\xi\,V(z / \xi) + V_\xi\,E(z / \xi).$
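The two results in (4.5) can be confirmed exactly on a small discrete example. The joint distribution below is hypothetical and is chosen only so that all quantities are rational numbers; the helper names are illustrative.

```python
from fractions import Fraction as Fr

# Joint distribution of (xi, z): each entry is (probability, xi, z).
joint = [(Fr(1, 4), 0, 1), (Fr(1, 4), 0, 3), (Fr(1, 6), 1, 2), (Fr(1, 3), 1, 5)]

def mean(pairs):
    """Mean of a list of (weight, value) pairs, normalising the weights."""
    total = sum(p for p, _ in pairs)
    return sum(p * x for p, x in pairs) / total

def variance(pairs):
    """Variance of a list of (weight, value) pairs, normalising the weights."""
    mu = mean(pairs)
    total = sum(p for p, _ in pairs)
    return sum(p * (x - mu) ** 2 for p, x in pairs) / total

marginal = [(p, z) for p, _, z in joint]
xi_values = sorted({xi for _, xi, _ in joint})
prob_xi = {x: sum(p for p, xi, _ in joint if xi == x) for x in xi_values}
cond = {x: [(p, z) for p, xi, z in joint if xi == x] for x in xi_values}

cond_means = [(prob_xi[x], mean(cond[x])) for x in xi_values]
cond_vars = [(prob_xi[x], variance(cond[x])) for x in xi_values]

lhs_mean, rhs_mean = mean(marginal), mean(cond_means)
lhs_var, rhs_var = variance(marginal), mean(cond_vars) + variance(cond_means)
print(lhs_mean == rhs_mean, lhs_var == rhs_var)
```

Exact rational arithmetic is used so that the equalities hold without rounding error.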

From (4.5) and (4.3) it follows that the estimators of the state vector unconditional on $\xi$, the prediction errors unconditional on $\xi$, and their conditional variance covariance matrices are given by

(4.6a)  $E(\theta_t / \psi_t) = m_{1t} - M_{2t}\,E(\xi / \psi_t),$

(4.6b)  $\alpha_t - E(\alpha_t / \psi_{t-1}) = v_{1t} - V_{2t}\,E(\xi / \psi_{t-1}),$

(4.6c)  $V(\theta_t / \psi_t) = P^{\xi}_t + M_{2t}\,V(\xi / \psi_t)\,M'_{2t},$

and

(4.6d)  $V(\alpha_t / \psi_{t-1}) = F^{\xi}_t + V_{2t}\,V(\xi / \psi_{t-1})\,V'_{2t},$

for $t = 1, \ldots, T$. Thus, these quantities can be obtained from the formulas given by the conditional Kalman filter together with expressions for $E(\xi / \psi_t)$ and $V(\xi / \psi_t)$. To ob
