ESTIMATING THE CONTINUOUS TIME CONSUMPTION BASED ASSET PRICING MODEL
Sanford J. Grossman Angelo Melino Robert J. Shiller
Working Paper No. 1643
NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue
Cambridge, MA 02138 June 1985
The research reported here is part of the NBEP's research program in Financial Markets and Monetary Economics. Any opinions
expressed are those of the authors and not those of the National Bureau of Economic Research.
June 1985
Estimating the Continuous Time Consumption Based Asset Pricing Model
ABSTRACT
The consumption based asset pricing model predicts that excess yields are determined in a fairly simple way by the market's degree of relative risk aversion and by the pattern of covariances between per capita consumption growth and asset returns. Estimation and testing is complicated by the fact that the model's predictions relate to the instantaneous flow of consumption and point—in—time asset values, but only data on the integral or unit average of the consumption flow is available. In our paper, we show how to estimate the parameters of interest consistently from the available data by maximum likelihood. We estimate the market's degree of relative risk aversion and the instantaneous covariances of asset yields and consumption using six different data sets. We also test the model's overidentifying restrictions.
Sanford J. Grossman Angelo Melino Robert J. Shiller Department of Economics Department of Economics Cowles Foundation for Dickinson Hall University of Toronto Research in Economics
Princeton University 150 St. George Street Yale University
Princeton, N.J. 08544 Toronto, Canada M5S lAl Box 2125 Yale Station
(609) 452—4035 (416) 978—6541 New Haven CT 06520
I.
JrodLtction
In this
paper, we provide an
empirical test of the continuous time intertemporal capital asset pricing model, first proposedby
Merton[1971].
The model as clarified by Breederi[1979] implies thatan asset
willbe
priced so that the expected return required will increase with its covariabilitywith per capita consumption
growth.
Previous tests of this theory (e.g. Grossman—ShillerEl9BOJ, Harsen—SincUetorE19833) have examined discrete time versionsa-F
the
model under the assumption that the timing interval ofthe
model matches exactly the sampling interval for available
data
on per capita consumption. That is, if we have data on quarter1 y consumpt ion,then the time period is
assumed to be1—quarter
of a
year. We show that if thetrue model is a
continuoustime
model,and time averaged
data (such as quarterlyconsumption)
is used to test
it,then substantial biases
maybe introduced
unless the estimation procedure is corrected to take account
of the
effectsof time averaging.
We providea procedure for
obtainiri consistent estimates
withtime averaged
data. Wethen estimate and test the model using data on
percapita
consumption
and the
cumulatedreal returns to holding portfolios
o-f stocks, bonds, and short—term paper.
II
The ModelIt
is usefulto review the Merton model. Our discussion
follows
closely the exposition of its generalization inGrossman—Shiller[1982J. In a discrete time model, each consumer
is assumed to maximize a time—additive utility function over a single consumption good
.j =0
where T is his time horizon. c(t) is consumption at time t and is the discount •factor between utility at time t arid t+h.
For the purposes of this paper, we assume that the period utility functior, is o-f the constant relative risk aversion (or isoelastic)
-f or ir
(2.2> Lt(C)
c'/(1—Pi)
Let v (t) denote the value o-f asset i at time t including
any accrued cash disbursements (such as dividends or coupons)
earned between t—h and t. Assume that asset i is freely tradeable. A standard argument shows that
(2.3) Etu' (c (t+h> )v (t+h) =
u
Cc (t) )v1 Ct)where the expectation is conditioned on all the information possessed by the trader at time t. Using (2.2> arid iterating
(2.3)
, we
can write(2.4>
Et(c(7)) vi(T) =
(l)t
for 7= t+h.t+2h,...
c(t)
v(t)
I-f we take the limit to continuous time and apply Ito's Lemma,
we obtain
(2.5) Etdv1
+ 1*A(A+1)*Var(dc)
—A*Etdc
+lndt
2 c c
=
A*Cov(dc,dv)
c
v
where Var and Coy denote the variance and covariance operators. Note that (2.5) holds for an individual. Under various assumptions about heterogeneity of information and wealth. (2.5)
can be aqregated over individuals so that
ccan be interpreted
as per capita cortsumption and A is replaced by a particular
weighted
averaqe of theindividual consumer's A (see
Grossmart—ShillerEl982)).
Clearly (2.5) holds for alltradeable
assets.
Ifis
defined as the excess rate of return of asseti over say short—term paper, then (2.5) can be used for these two assets to yield
(2.6)
ER A*Cov(R,dc/c)
The aggregate
parameter of relative r i sk aversi or, can be
computed by (2.6) given data
or mean excess returns and thecovariances
between excessreturns
and per capita consumption growth. Table I provides some estimates of A based on the descriptive statistics from Table 7. The various data sets and variable definitions are described more fully in Section III. At this point, we simply wish to draw attention to one of the important empirical anomalies associated with the model and the potential role for- time averaging as an explanation.The table shows that the mean excess return on stocks is associated with a relatively small covariance with consumption changes.
Therefore this can be justified only by an implausibly high estimate of the risk aversion parameter. Similiar conclusions are reached by examining the excess returns on bonds.
One explanation for this is based on the idea that a time averaged variable is smoother than the same point sampled variable.
In particular, if the true model holds in continuous time then the instantaneous rates of change in consumption can be more
var-i
ahi e (and
also
covari able with returns) than is
theaveraue
consumption chariqe across years or quarters.
Art emp1e
To understand this e-ffect, consider the following
verysimple process for v, the value of asset i
inexcess of asset
1, and consumption:
(2.7a)
dc =
pdt
+ d€
(2..7b)
dv =
J1dt
+ d'ri
where are correlated Brownian motions with Cov(d€,d)Thdt.
Let E(t) and
(t)
be the time averaged values of c(t) andv(t) ,i.e..,
E(t) =
T1j
c(t+s)ds
=T'f
v(t+s)ds-We will show that
(2.8> Cov(E,) EE(E(t)—E(t—T))((t)—(t—T))] —
pT.LT
— —, i'"r
— _l._)I O••
If
we normalize T=1, then the covariance of time averagedconsurnptions changes and price changes is 2/3 of the instantaneous value
o Roughly speaking,
this wouldlead
us to overestimateA
by
To understand (2.8)
just note
that(2.9) E(t) — E(t—T) =
pds + T'14_T
d€(7)ds
= pT +
T1f4+ST
dE(7)ds+
T1f45
dE(7)ds=
pT + T'ft_T
(T—t+s)d€(s)A similiar expression may be
derived
for (t)—(t—T). Hence,
(2.10> E[(E(t)—E(t—T))((t)—(t—-T))]
=pTu1T +
T2E[4_T(T—t+s>2d€d
+Equation
(2.8) is easily derived from the last expression.The purpose of this example is to give the reader a relatively simple view of the effect of time averaging in generatinq a
stochastic
process which is " smoother"
thanthe instantaneous
process. This suggests the possibility that assets appear to
have a
low risk (i.e.low covariaruce with consumption changes)
because measured consumption
changes are less van able than instantaneousconsumpti on changes. Since it
i s the coven ance withinstantaneous consumption changes that
is the relevant measureof an assets risk, this leads us to overestimate A.
In our simple example, A is overestimated by SOY.. As we shall
see below, for certain processes, the bias can be arbitrarily
large.
Multivariate Model
In our empirical work, we postulate a slightly more complicated
stochastic
process for consumption and asset values. DefineY(t) according to in
c(t)
—kc — gt
—C(t)
1 (2.11)Y(t) =
in
v1 (t) k1 — g1t = V1 (t) In v(t) —k, — g.t
V9(t)
In
v-.(t) — k3 —gt
V-.(t)We assume that Y(t) satisfies the stochastic differential equation
where B
and 2.. are (4x4) matrices and Z(t) is a vector o-fstandard
independent
Wiener processes. is assumedto be
symmetricand positive definite. Without any loss of Qenerality,
can be taken to be I ower
tn anqul ar
with positive diagonalelements. Let a denote the vector of nontrivial parameters
—'4 1,—s
in
Switching
to logarithms and applying Ito's Lemma we can rewrite (2.5) in terms of the V(t) process as(2.13) EEdVj(t) —
*dC(t))
+ —*g
+lnS)dt
+
1/2*P2*Var(dC)
—*Cov(dC.dV)
+1/2*Var(dV)
=
0.If
this is to hold at all points in time in m.s.. then (2.14) EtEdV(t) —*dC(t))
= C).The reason is that, according to our assumptions, the remaining terms in the expression are not functions of information. Since the model is homogeneous, the only way this sum can be constant is if it is zero.
Therefore (2.5) imposes the following restrictions on our nodel
(2.lSa)
J1B = 0(2.15b)
—+ 1/2*J1L3
+lru
= 0 i=1,2,3where
e)
and is the vector with unity in component i and zero elsewhere.Suppose that the process Y(t) is sampled at regular intervals. It is straightforward (see BergstromEl984)) to show that the
point sampled process has the representation
where = eB, the matrix
exponential
of B, and u(t) is the randomvariable 4_
et5l2dZs.
Let 7(t) denote the time average of the Y(t) process, i.e. V(t)=4_1 Y(s)ds. Upon integrating both sides of (2.16) we obtain(2.17) 7(t) = ØV(t—1)
+ (t)
where (t) is the random variable 4_1x_1
eB)dZs>dl.
Let + and q denote two 'smooth" real—valued functions and (s) a univariate Wiener process. Using the definition of the Ito integral, the following two results can be established:
(2.18)
f
f(t)Ef
g(s)d(s))dt =JE4f(t)dt](s)d(s)
(2.19)
ECftf(s)d(s)]Efq(7)d(7)J =
Jf(s)g(s)ds
where M Et1,t2]fl[t-.,t4J
and where the equality is understood in the mean square sense. Applying (2.18) element by element and other standard properties of the Ito integral allow us to write
(2.20)
[(t)
=4:4' e
d'rdZ(s + il-.iiled7dZs.
Define E(t)(t—'r) and F(r,w)
eB'ZeB'W. Applying(2.19)
and standard change of
variablerules, we obtain
(2.21a)
fl() 444
F(r,w)drdwds +
fJJ
F(r,w)drdwds
(2.21b) =
x4i F(r,w)drdwds
(2.21c) = 0 'r2.We conclude that '7(t) is a vector ARFIA(1,1)
process.
PhillipsEl97B]and BergstromCl984] develop similiar results although the latter only considers the case where B is invertible. We can therefore write
where the innovations
€(t)
have mean zero andcovariance
matrixS.
and e is
a matrix with spectral radius not exceedinq unity.Define y(t) (In c(t) in v1(t) in v2(t) in v.(t))'.
and let t) be its unit averaQe. Eq (2.22) can be rewritten
(2.23)
(t)
= + + s(t—1) + €(t) + $€(t—1) where= (IS)k+Øq. '( = (I—ø)g,
and t =(t+s)ds.
The restrictions(2.15) are
easily shown to imply 3i1 =J(I—ø)
=
0.
In particular, it also follows that the vector k cannot be identified uniquely. Jetherefore
impose the identification restriction k1 =A*kc
in our estimation. tedious arqumentalso shows that
(2.24) (J—J) ((t)—(t—1)) =
3i3jO
+ (J—J)€(t)
+
so that the time averaaed excess returns on asset i over j follows an M(1) process with coefficient .268.
To gain further intuition about the possible consequences of time averaqing suppose B =
diag()1,?2,?,X4).
Then it can be shown that :>(,j) =h(,2)Z(i,i)
,where
=
()E1
+ —2*(?+7)(1_eM3>
+ (1 + e>(1 —
+ (1 + e)(1 —
e)/X)
and equal to the obvious limits as or X goes to 0. Our simple example corresponds to the case h(0,0)= 2/3.
If the process were stationary around trend, the elgenvalues of B would have negative real parts. Sampling a few values, we see that h(—. 1,—. 1) =.60,
h(—.5,—.5) =.45,
h(—1,—1) =.28,
and h goesto
zero as
aridboth go to minus infinity. The bias in
the estimate of A using time averaged data and (2.5) car therefore
be arbitrarily larqeiIII DATA ANALYSIS
Data
DescriptionThe data are fully described in an appendix to this
paper
which is available from the authors. Here we shall give onlya broad description of the data to indicate how they were assembled and to show that they correspond as much as possible to the
concepts represented in the model above.
Six separate data sets were prepared, each intended to represent a series of observations on the four—element vector
.
The data sets differ in sample period, sources and assumptions about taxation. Table 2 summarizes the important differences.Data sets ore and two are long historical annual
time
series beginning in the year l89O These data sets are based on those used in Grossman and Shiller[1981] and described also inShillerEl9B2J Data sets three through six are quarterly time series. Data sets three an.d four
begin in the
second quarterof 1953. Data sets five and six
begin
in the second quarterof 1947. The use of annual and quarterly time series was dictated
more relevant comparison might be the ratio of A that would be obtained using time averaged data to that using point sampled data. Although details differ, it is easily shown that this ratio also can be arbitrarily large.
by the existinQ consumption data. Long time series data or consLtmption are available only on an annual basis. Quarterly consumption data are available only for the post—war period. Monthly consumption data are available startinQ in l99. We
did
not usethose
data here because of someconcern
as to the accuracyof
the monthly data and because of the somewhat shorter sample period that such data would impose.In all data sets, the first element o-f 9
is
the log ofreal per capita seasonally—adjusted consumption on nondurables and services. For years beginning with 1929 these data are from the National Income and Product Accounts of the United States. Earlier data are the Kuznets—Kendrick series. Since the published consumption series are total consumption over the period, the first element of 9
departs
somewhat from that hypothesized in the paper: it is the log of the integral ratherthan the integral of the 1og Note that we use a physical
measure of consumption directly and do not deflate nominal consumption by a price index that is averaged over the year,
which would have introduced another departure from the assumptions of our
model.
In all data sets the second element of 9
is
a measure of the interval averaged log cumulated real return on corporate stocks,the third element is
a measure of the interval averagedlog
cumulated
real return on short debt and the fourth elementSome Monte Carlo simulations indicate that the biases introduced by using the log of the average instead of the average of the
i s a measure of the interval averaged log cumul ated real return
or long—term bonds. The even—numbered data sets are based on
after—tax returns. In constructing these series, the (after—tax
in even—numbered cases) nominal returns were first computed
on a monthly basis. At that point, a choice had to be made
whether to use
the consumption deflator toconvert nominal returns
to real returns or to use one of the monthly price indices for
this purpose. The consumption deflator has the advantace that
it corresponds to the measure of consumption that is supposed
to enter the utility function. The monthly price indices have
the advantage that we can use them to produce a monthly real
series,
so that our interval average will correspond more closelyto the integral of the log of the real
portfolio value as
represented
in our model. It was decided to usethe
consumption deflator for data sets one through four and the monthly consumer price index for data sets five and six. ThUS, for example,the
second through fourth elements o-f the
vector in dataset two were
constructed by first producing monthly series
representing the cumulated after—tax nominal returns of the
assets. Each series represented the nominal value of the portfolio
of
an individual who reinvests all after—tax income from the asset in the same asset.- The average for the year of the logof
the
monthly portfolio values was used to construct an annualseries. Finally, the log of the consumption deflator was subtracted
Let (l+rjm) denote the monthly after—tax nominal return on asset i, and let ViL denote the cumulated after—tax return in month L.
from
each series to convert to a real series. With data set five, the first step in the construction ofthe
second through fourthelements of
was essentially the same. We first produced a monthly series of cumulated returns of the assets. However, jr data set five, this monthly series was subsequently deflated by dividing by the consumer price index, and a quarterly series wasproduced as
the average for the threemonths of
the quartero-f
the ion o-F this
monthly real series.With data sets five and six another adjustment was also
made
before the average log cumulated real
portfolio value wasentered
into the vector . In constructing the series, there
was great
concern that the data be
alignedproperly. The
Ibbotson—Sinquefiel d returns data for each
month are measuredfrom
the
end of the preceding month to the end of the currentmonth.
This provides
four point sampled observations on thelog cumulated real portfolio for each quarter. These were connected
by
straight lines and the integral under the
straight lineinterpolation
was used to estimate the corresponding component
of .
For
data sets one and two, the return on corporate stocks is computed from the Standard and Poors Composite Stock Price Index and associated dividend series. The return on short—term debt is computed from the prime commercial paper rate and the reEurn on long—term debt is computed using the Macaulay railroad bond yield data for the first part of the sample and the Moody Aaa bond yield average for the years after l93.For
data sets three and four,
allreturn
data come from serieson the CITIESE
data library. Stockreturns are aqain
computed usinq
themonthly Standard and Poor's Composite Stock
Price
Index, while thereturn on short debt is taken from the
return
on three—month treasury bills and the return on longdebt
is based on
yields of twenty—year treasury notes.For
data sets
five and six,return
datacome from Ibbotson
and
Sinquefieid[1982]. The stock return series is their series
common stocks, total returns; the short debt series is their
series
U.S. Treasury bills, total returns the lona debt returnseries
is their series loriq—term corporate bonds, total returns.
For after—tax
series, the assumed marginal income taxrate
for
1918 to 1980was that implicit in the spread
between municipal and corporate bond yields.. Before 1918, the marginal incometax
rate was set to zero. Since
the Ibbotson and Sinquefielddata
do
not allow a decomposition of returns into capital gainsand
income components, it was assumed for data set six that
all returns were taxed each month
asincome. For data sets
two and four, however, capital gains were assumed taxed each
month at a long—term capital gains rate. For the years l94
to
1978,the effective rate on long—term capital gains was one—half
the marginal income tax rate. For earlier years, the effective
rate on long—term capital gains was computed from the marginal income tax rate using tax rate data in Seltzer[l951J.
Frel
iminarie
Be-fore
considering
formal
estimation and
testing, it
isuseful
to
review some of the broadfeatures o-f
the
sixdata
sets which our model must eXplain. Some descriptive statistics are provided in Table 3.
For all six data sets, we observe that stock portfolios gave the highest average real return, approximately 6 p.8. on
a pre—tax basis or 47. a-f
ter—tax. Short—term paper vi
el ds averagedabout
27. p.a. on a
pre—taxbasis over
ourlongest historical
sample, but. the average yield fell to about zero in the
post-war period. After—tax real
returnsto
holdingshort—term paper
have
beersslightly negative. Long—term bonds, by contrast,
have
averagedessentially a zero real return over the last
century,on both a
pre—and after—tax basis. During the post—war period,
however,
pre—tax returns have beenslightly negative. On an
after—tax
basis •bondhol
ders have seen the real value of theirportfolios shrink by over 27. p.a.
According to the consumption based asset pricing model, these persistent differences in average yields must be accounted for by the insurance provided by the different portfolios against events which impinge adversely on consumption. Useful evidence about this hypothesis is obtained by looking at the covariance structure of measured portfolio yields and changes in consumption. Some caution is necessary since the model 's predictions pertain to the covari ance structure of the instantaneous returns and
our data
are constructed from di-ffererices of unit averaged values.However, if BQ.
the
latter can provide a reliable guide tothe sign and order of magnitude of the instantaneous covariance matrix.
Several empirical regularities emerge. As measured by
the variance, the change in consumption is the smoothest series, followed closely by
the yield
on short—term paper. Long—term bond yields have been fairly stable over our longest sample, whereas the variance of returns to holding a portf ol i
o of stockshas been several
orders of
magnitudelarger. In the post—war
period, real
returnsto
holdinglong—term bonds have been much
more volatile with a variance almost as large as the return to holding common stocks.
Of more interest are the covariarce properties. According to our
model,
it is not the variance but the covariance with consumption that is the relevant measure of a portfolio's risk. We find, uniformly across the six data sets, that stock yields have the largest covariance with changes in consumption, followed by short—term paper yields and then yields on long—term bonds. £ualitatively, this is exactly what the model requires given the ordering of the average yields. It indicates that the basicidea that insurance against adverse movements in consumption can account for observed yield differentials has some empirical promi se.
Evidence of potential
difficulties is
provided by the autocovariancestructure of
excess returnson bonds and stocks
over
short—term paper. Given ourassumptions
about the probabilistic structure o-f consumption arid portfolio valuesarid the -form o-f
preferences,
we expect the point sampled differencein
vi ci ds between any two portfol i
os to he serially uncorrel ated.As equation (2.24) shows, the
time averaged difference in yields
shoul
d have an MA ( 1) component with coefficient about .268.
This particular prediction is independent of the mean or covariance of returns or the deree of relative risk aversion.
Table 3 shows that
it is
important to take into account the consequences, of time averaging. The Box—Ljung statistics clearly indicate that the excess yieldsthat
are constructed from our data are not white noise. The adjusted excess returns referred to in Table 3 are filtered to remove the time dependencethat
is induced
by unit averaging. Judging from the Box—Ljungstatistics,
the
adjusted excess returns are indeed less seriallycorrelated
.Nonetheless,
the autocorrelations of the adjustedexcess returns to stocks remain statistically significant from zero in four of the six data sets.
Some Econometric Issues
It is demonstrated above that the vector of time averaged observations
has a representation of the form
(3.1)
(t) =
Y0(u)
++ Ø(c)(t—l)
+€(t)
+$()€(t—l)
where the disturbances €(t) are distributed independentlyset
,a) ,
where B1 denotes thefirst
row of the B matrix.Linear Gaussian processes have been studied extensively by econornetricians arid statisticians. Nonetheless, there are
several features of our
model which
put it outside of the standard assumptionsin the literature used to prove laws of large numbers
or
central limit theorems.First, the model contains a time
trend
so that sample autocovariances of the exogenous variabLes,i.e.
T1LXtXt_ where X. = (1 t),
do riot converge to welldefined
limits.
Secondly, the modelimposes restrictions not
only across the autoregressive and moving average matrices,
but
across theseand the contemporaneous
covariance matrix aswell. Finally, our model imposes the restriction that B be of rank one, so that Ø() will have three eiaenvalues on the
unit
circle. To
our knowledge, there are no lawsof large
numbers or central limit theorems that cover all three of these features. Application of the standard largesample
procedures to estimate and test our model must beconsidered
tentative.Although all the features of
our
model havenot
been treated together in the literature, we can use available results toform a reasonable guess about the sampling properties of the approximate (conditional) maximum likelihood estimator described
below. For example, it
appears
that alaw
oflarge
numbers which would allow for all three of the features noted abovewould
be a modest extension of the literature. Hannan et al.
El980] provide a law of large numbers for vector ARMAX models allowingfor
very
general restrictions and, in particular, dependenceacross
the covari ance
matrixof innovati oris
andthe
other parametersof
the model. Their assumptions about the error process are
clearly
satisfiedby our model, but they rule out
time trendsas regressors and require all roots of the autoregressive polynomial to be outside the unit circle. In the absence of complicated
restrictions or unit roots, the assumption that sample covariances
convergE to well
defined limits
can bereplaced
by the weakerGrenander conditions (see HannanEl97l J) which do allow for time trends as regressors. Similarly, in the absence of time trends
and other restrictions, strong laws of large numbers can be established even i-f the autoregressive process is explosive.
Individually, therefore, each o-f
the
three features of the modelhighlighted above
is
not an impediment to establishing a lawo-f
large
numbers.It is
well
known that unrestricted estimates o-f0 will
not be asymptotically normal if
there
are unit roots in the autoregressive polynomial. A case for a central limit theorem can be made only if the estimation procedure exploits the prior knowledgeo-f the structure of 0. Our restrictions imply that
is a co—integrated process (see Granger—Engletl9B2J>. These
processes have had a long history in
applied empirical researchunder
the
name of "error—correction" models. However, onlyrecently has there been any serious investigation o-f the sampling properties of the MLE or its approxirnants. Available theorems do not allow for a time trend or moving average terms but these
complications
do not appear topresent
any conceptual difficulties. The main result is that the inteqratingfactor4 is estimated
consistently by ML with a
sampling
error thatis
o(T'2).
The ML
estimators
for the remaininq parameters are consistentand asymptotically normal with a covariance matrix that is estimated consistently by the usual formula. In our model, the integrating
-factor is just B1. ,
appropriately
scaled. Since we are never concerned with testing restrictions on the components of B1 the rapid convergence of the estimated integrating factor does not appear to present a problem.We will proceed formally as if the standard large sample
procedures for in-fererice are valid under the maintained hypothesis
that
B is of rank one. s the preceding di scussi on makes
clear, however, some scepticism is in order.
Estimation Strategy
Several
strategies for the estimation of models with M errors have been proposed.5 In the time domain, it is natural to consider the maximum likelihood estimator, or one of its various approximants.Put e(l)O and
-For any
admissable o define e(t)
recursively
according to
4A nonstochastic vector c such
thatc't is stationary is called
an integrating factor. In our application,
it is any normalized
basis
vector for the row space o-f 1—0.(3.2) e(t) =
(t)
— iç)(LY.) —— Ø(i(t—1)
—E(c.)e(t—l).
Following Wilson (1973), we choose as our estimatorthe adrnissable vector which maximizes the approximate
(conditional) ion likelihood function
L(c)
(T—i) lnIS(cv)j — 1tr S(cM
T
where M
L e(t)e(t)'.t=2
Since 0
bets unit roots, we have little choice but to conditionon the first observation (1).. The assumption that e(l)0.
by contrast, is made solely out of convenience.
If B=0, the
spectral radius of E) is about .268, so the samplingdistribution
o-F
will
not be very sensitive to this assumption aboutthe initial innovation. Putting e(1)=0 does simplify the
computations somewhat. In particular, analytic derivatives can be easily and quickly computed using the method of adjoints
and a straightforward application of the chain rule..
Several
features
ofLa('.) make the evaluation of
cl.achallenging.. As with
any model with MA
errors,it
isnot possible
to
reducethe data through sufficient statistics and we
haveto
deal
witha likelihood function
that is not guaranteed tobe globally
concave. Our model poses
severaldifficulties in
addition
to these standard ones. For example, it is not possibleto concentrate out the covariance matrix, since S is functionally related to the regression parameters of (3.1). Also, some effort is required to evaluate (Ø(.>,S(.),$(ci). Details are provided
in MelinoEl985J, so we will give only a brief overview here. Define
the
matrices—B 1
0 0
F 6 H
K(3.4) C 0 —B L
ol
e
0 F.-)GH-,
oo o
i
0 0 F-
G— L)
C) B J C) C) C) —F
-Put
C = —C and denote the blocks of eC by 'etc.
it can be shown that(3.5) = F4K1 +
KjF4
—K11
—= H163 + K1F1 +
F1Kj.
It is also useful to note that 0 F4 Although the expressions appear to be unappetiinq, they are straightforward to implement given an algorithm for computing the matrix exponential. We used a routine based on a diagonal Fade approximation that has very nice numerical properties.6
Solving for (S.E3) given turned out to be much easier
than conjectured by Bergstromtl9B4j. WilsonEl972J provides a general algorithm for factoring the autocovariance function of a multivariate MA process. We adapted his suggestion to our special case and applied Newtons method to find the matrix E with spectral radius no greater than unity which is a root of the polynomial
(3.7)
0 +
(ft1e
=0.
Given an initial guess, E3(0), this leads to the iterative
scheme
6We
would
like to thank Dr. R.C. Ward of the Union Carbide Laboratory(.8)
(fl+l)[fl>_ç1*- (n)]— E(n)fl1E3'(n+1> =
n)fl1E3' (ri>.This
scheme exhibits quadratic convergence and turns outto
be
quite fast. On average, less than three iterations were
required to find E3 given (fl-).fl1).
In fact, we found that this
scheme rarely required more than 5 iterations. Siven E, it
is
straightforwardto solve
for S Using S —-Evaluation
of La(cY.) and its analytic derivatives is fairlyquick and
easy.
The maindifficulty in computing ca turned
out
to be the extraordinary large number of iterations required to refine its location.Par ameter Esti mates
Table 4 presents the estimated parameters o-f the constrained
model for each o-f the the six data sets.7 The estimates obtained using be-fore— and our constructed after—tax yields are remarkably
sirniliar, but there are considerable differences in the estimates across the three different sample periods.
Consider first the estimates of L, the covariarce matrix of the instantaneous innovations. Once again, correlations
are
displayed above the diagonal, and the lower triangular elements
are covariances. The estimates of
from the quarterly data
sets
are all similiar. However,there are some sharp
contrasts7Estimates were obtained using the GQOPT3 package provided by Professor Ouandt o-f Princeton University. Various algorithms were required to refine the location of The reported standard errors, however, are always calculated by inverting the matrix
o-f second
derivatives evaluated at the optimum. The Hessian
was computed using symmetrical numerical differences of analytic
with
the estimatesfrom
the annual samples which cast doubton
our assumption that L has been constant over time. Consumption
innovations appear
to have had a much smaller variance in thepost—war
period, as
havehad the innovations to the value
ofshort—term
paper.
By contrast, the innovations tostock market
Values have been slightly smoother, and those for long—term
bonds are roughly comparable. The covariances of the innovations
to portfolio values with consumpti on have the same ranki rig in
all six data sets, but they are much smaller in thepost—war
period.
ll six data sets yield small estimates o-f B1.
,the first
row o-F the B matrix. This indicates that the change in consumption
has only a very small predictable component, aside from trend.
The trend in consumption is estimated to be about 3%
p.a.using
the two long historical samples, about 2.5% using data sets
three and four, and about 1.6% pa. using data sets five and
six.
The corresponding point estimates for indicate,respectively,
a substantial preference for present consumption,a substantial preference for
future
consumption, and indifference. These apparent differences can't be taken too seriously sincethe estimated standard errors indicate substantial uncertainty. The differences in the
estimated
parameters o-frelative
risk aversion are extremely interesting. Using our two longest historical samples, we obtain estimates of A of just over 20. This is too large to be plausible. Nonetheless, as anticipated,accountinc -for unit averaQiriq
of consumption results in asubstantial
reduction.8
Data sets -five and six produce
a very plausible estimateof
A of
justover 2. Ey
contrast,
data sets three and four
produce an estimate of A over 150!
The difference ofthe parameter
estimates
obtained using thesevery simiuiar post—war quarterly
data
sets is very large,and
some clarification is in order. The estimates of A presented in Table 1 are derived fromrestrictions
which relate the unconditional means to the covariances
of consumption changes and portfolio returns. However, the
model provides us with further sources of information about
A..
Equation (2.14) tells us that the predictable change in
the
value o-f any portfolio is equal to amultiple of
the predictablechange in consumption, up to a constant. Since the multiple is just the parameter of relative risk aversion, this gives us another estimate a-f A based on the conditional information
in the sample. The maximum
likelihood estimator is usefully
viewed as suitably pooling the disparate estimates based on conditional and unconditional information..It turned out that the predictable change in consumption around
its
mean using the lagged information in data sets 1—4was essentially zero. As a result, the maximum likelihood estimate of A closely reflects the estimates in Table 1 adjusted for
unit averaging.. In data sets 5 and 6, however, the predictable
8For data set 1, we also estimated the model as if the data was
actually point sampled. tie obtained an estimate A =
27.24,
chariqe in consumption about its mean, while still small, was larqe enouqh to provide a fairly accurate estimate of . The
maximum likelihood estimate reported for these data sets reflects the conditional irforrratior in the sample.9
Our model imposes restrictions or the time averaged
representation of . In turn, the time averaged representation imposes additional structure on the parameters of the ARM(1,1) representation. Table 5 contains the lo likelihoods, denoted
L1, L,, arid L- respectively, for the fully restricted time averaged
estimates (Model 1) ,
the
unconstrained time averaged estimates(Model 2), and the unconstrained ARMA(1,1> estimates (Model 3). For the reasons discussed above, Models 2 and 3 were estimated under the maintained hypothesis that B is of rank one and 0
is the sum of the identity and a rank one matrix. For completeness, the log likelihoods for the totally unconstrained time averaged
and ARM(1,1) models, L and L respectively, are also reported.
The tests of the overidentifying restrictions imposed by the model are rejected with very high confidence when compared against either Model 2 or Model 3.
Curiously,
data sets three and four which produced the least plausible parameter estimates,9lmposing only the restrictions implied by (2.14), we obtained for the six data sets:
A =
663.54
949.86 439.67 983.73 2.07 2.60(s.e. ) (370.56) (370.56) ( * ) ( * ) (0.92) (1.21)
Because the Hessian was singular, we are unable to provide standard errors for the estimates from data sets three and four.
Hansen—Singleton[1983] also report a very sharp difference in the estimate of A depending upon whether or not conditioning information is used. (See their Table 5)
provide the weakest
evi. derce against the oven deriti fyi nq
restrictions. Finally
a comparison of L2and L- indicates
that there is some di-fficulty in accounting for the autocovariances
o-f
by time averaqing a
firstorder
process.Why
is the model rejected?
There are strona a priori reasons for linking consumption
and portfolio choices. Moreover, the sample means and cova— riances of portfolio yields and changes in consumption lend quaiitatve support to the notion of assets being priced inaccordance with the insurance they afford against adverse movements in consumptort. Yet the various goodness of fit tests reported above as well as the implausibly high estimates of relative
risk
aversion
from data sets one through four appear to constitute an overwhelming rejection of the model. What should we conclude?A response that cannot be dismissed is that the assumed
distribution of the goodness o-f
fit
tests is simply misleading.As we noted above, we cannot rely on the standard central limit theorems to establish the asymptotic distribution. Moreover, even if the large sample results obtain, as we conjecture, there is no guarantee that the asymptotic distribution provides a
close approximation for samples of the size we have examined. Unfortunately, establishing the small sample distribution either analytically or by Monte Carlo methods is infeasible. We choose
to take the
evidence against the model seriously and to focus attention on the specific sources of predictive failure.One is naturally led to examine more closely the various auxiliary assumptions that are
being tested joi riti y alongside
the hypothesis that agents behave as described by (2.3).
Thetwo most obvious are the stochastic process assumed to descri be
the evolution o-f
consumptionand portfolio values and the specific
form o-f
pr€ferences. Wewill concentrat.e on the
former.The stochastic differential equation (2.12) imposes many
overidentifying restrictions. One o-f
themis that the
timeaveraged vector has an RMA(pq) representation with pq=1.
To test this, the autoc:orrelatioris of the prediction errors
from Model 3 were calculated. Box—Liung tests did not indicate
any need forconsidering a higher order process.
lthoucih
the evidence suggests that an ARMA(1,1) representationfor '
is
a reasonable approximation, there are problems in accepting the restrictions that time averaging a first—order process imposes on this representation. Fhillips(1978) shows that if BO then ØI+B and E3.26B(I+(B—XBZ1)/4). Our unconstrained F<MPi(1 ,1)estimates of suggest that B is indeed small. There is little difficulty in accepting the restrictions which a small B matrix and time averaging impose on .
However,
this combination imposes a great deal o-f structure on E3 which is at odds with the data.Fr all six data sets, we found that both the constrained and unconstrained time averaged models produced estimates of
E3.2681.
The unconstrained ARMc(1,1) estimates of E3 differed from .2681in
LE-veral respects.
The most noticeable discrepancy was that the unrestricted estimate of the row of the moving averaqe matrix pErtainin to the consumpton equation was essentially zero.in all six dat.a sets. In fact, in data sets 1—2 and 5—6, the MA coef-ficierit for the innovation in consumption was more than
two standard deviations below .268. Failure to explain the
MA component
o-F consumption in arid of itself would lead to rejecti on
of
the model at the 5/.level
forthese data sets.
One possible explanation
for the apparent absence of amoving average component in the consumption equation is measurement
error. Suppose
the unit average o-f consumption
is measuredwith an
error that is serially uncorrelated and independentof
the true consumption process.
I-f the flow-of consumption
is
truly a random walk,the measured consumption series will
be an ARMA(1.1) process but with an MA coe-f-ficiènt less than .268. If one half of the variance of the change in measured consumption
is due to measurement
error, the MA coefficientwould
be predicted to fall to just
.127.As
pointed out earlier, our model predicts that the excess
returns of stocks and bonds over the yield on short—term paper
should be unpredictable. The time averaged excess returns should
therefore have an MA(1) structure with a coefficient of about
.268. These overidentifying predictions can be tested regardless
of the quality of the consumption data by simply regressing
the adjusted excess returns on various information sets. Moreover,
there is no problem in justifying the standard procedures to
test these orthoqonality restrictions. The results ar-c reported
in Table 6. The likelihood ratio test statisticq X, and theR
-for
eachof the
inudi vi dual rearessi ons is also reported.
The individual F are remarkably hih and the orthoqonality
restrictions are rejected with very high confidence. Since
yielddata that are point sampled are readily available, we
also tested these restrictions using the monthly point sampled
yields corresponding to data sets 1—6. Because a monthly price
index
wasnot available for our longest samples, we used the
log cumulated
nominalreturns, vt. in the information set.
These results are reported in the lower half of Table 6. Although
the individual R2 are much lower, as we would expect, the rejection
a-f the orthogonality restrictions is even more pronounced.
These
results are very similiar to those reported in Hansen and Sinaleton[1983J.One explanation for this predictive failure is simply that
the
covariance matrix c-f
the instantaneous innovations is notconstant but is state
dependent. This seems
extremelyplausible
and could also account for the noted differences in the estimates of Z -from different sample periods. However, taking account
c-f state dependent variances would make estimation and testing o-f the model practically impossible. Because our model imposes restrictions across the drift and diffusion parameters, making the latter state dependent would force us to abandon the linear constant coefficient model of the drift as well. We would be
(3.9) dy =
B(t,y)dt
+1(t,y)dZ.
The restrictions across the drift and diffusion effectively
rule out
any 0+ the convenient functional forms -for
B() aridand
the solution of the likelihood for even the point
sampled process
is difficult to implement. Computing the likelihood-function for the unit
averaged
process that solves (3.9) seemsunimaginable,
with current technology.
IV
Conclusions
The notion of insurance against events which impinge unfavourably on consumption choices can be used to rationalize, at least
qualitatively, the systematic differences in average yields
a-fforded by portfolios of stocks, bonds, and short—term paper. The sample means and covariances of portfolio returns and per
capita consumption growth indicate that the quantitative dif-ferences in averace yields can be rationalized only by implausibly high
aversion to risk. Taking account of the fact that measured consumption is unit averaged substantially reduces the degree of relative risk aversion required to rationalize the data.
Nonetheless, there remains considerable evidence that
casts doubt on this view o-F the world. In particular, it is
difficult to reconcile the importance of unit averaging of the consumption flow with the fact that the measured logarithm of detrended real per capita consumption has essentially no moving average component. lso, although the model allows the average
return on different portfolios to diverge due to different insurance characteristics, the particular
specification that
we examinedrequires that expected excess returns should be time invariant. This orthogoriality property is forcefully rejected by the data.
Addressing these particular predictive failures while taking account of unit
averaging
constitutes a formidable challenge for futureresearch.
excess excess on stocks over short term paper. on bonds over short term paper. Table 1 Estimates of A Using Unconditional Means and Covariances Data Set
ER
Cov(R' dc/c) A5ER
Cov(R' dc/c) 1 Annual 3 Quarterly 5 Quarterly .039 .012 .012 .0229 .049 .062 13.76 139.13 198.56 -.013 -.005 -.002 -.O8O -.0l3-.060
159.10 376.41 398.17 return returnTable 2 Important Differences Across Data Sets (1) )ata Set 1 2 3 (2) Sample
knnual Annual Quarterly
(3) Price Index Consumption Deflator Consumption Deflator Consumption Deflator (4) Marginal Tax Rate 0 Munic 0 (5) Capital Gains Taxed As 0 Long--Term Gain 0 (6) Averaging 1 1 1 4 5 6
Quarterly Quarterly Quarterly
Consumption Deflator CPI CPI Munic 0 Munic Long—Term Gain 0 Income 1 2 2 Notes: Column (3): To convert nominal returns to real returns, we used either the consumption deflator for non— durables and services or the consumer price index. Column (4): A 0 indicates pretax returns. Munic indicates that the marginal personal income tax rate implicit in the spread between municipal and corporate bond yields was used to construct after—tax returns. Column (6): Averaging method 1 takes the log monthly cumulated returns, forms a simple average for the period and subtracts the log price index. Averaging method 2 takes log monthly cumulated returns, subtracts the monthly log price index and computes the integral of the linear interpolation of the end of month values.
Table 3 Some Descriptive
Statistics
Variable Mean Correlation/Covariance' . - Mean Box-Ljung Statistic to Lag:—
6 12 18 Data Set 1 1890-1981 92 ObservationsMnc Alnv1 Amy2 Amy3 .032 .058 .019 .0260 .0225 .0225 -.040 —.048 .289 .029 .094 .0237 -.142 .099 .0231 .0229 -.128 .286 .692 .0256
R
R
R
tR
.039 -.013 .039 -.013 11.17 29.97** 9.10 13.48* 22.59* 48.07** 19.45 25.03* 32.27* 55 l1** 30.01* 30.90* 4303** 59.27** 42.68* 35.69 Data Set 2 1890-1980 91 ObservationsAlnc Amy1 Amy2 Amy3
.032 .048 .013 .0218 .0225 .0221 -.053 -.059 .261 .026 .o2i .0238 -.185 .159 Q233 .0233 -.160 .318 .780 .0254
R
R
tR
tR
.035 -.011 .035 -.011 13.02* 26.28** 11.58 11.21 25.01* 44.22** 23.50* 22.31* 36.72* 48.91** 36.69** 26.48 47.67** 54.10** 50.l1** 31.50 Data Set 3 1953: 3—1 983 :1 119 ObservationsAinc Amy1 Amy2 Amy3
.0248 .015 .0224 .027 .088 .0655 -.o13 .273 .0238 .026 .O51 .020 .077 .O29 .O51 -.067 .216 .247 .0215
R
R
tR
fR
.012 .012 24.84** 10.16 7.93 4.42 31 .52** 25.80* 15.66 15.50 54.91** 28.70 31.10 17.15 62.76** 34.28 37.91 21.78Table 3 (continued) Variable Mean 1 Correlation/Covariance
-
Van-
able3 Mean Box—hungStatistic
to Lag:——
6 12 18 Data Set 4 1953:3-1980:' 110 ObservationsAinc Mnv1 Amy2 Amy3
.0251 .011 .028 .080 .057 -.O67 .282 .0229 .073 .O23 .247 .311 .O19 .O45 -.042 .142 .348 .O88
R
R
1 1R
.013 .013 22.l3** 10.25 7.44 8.15 28.66** 14.17 14.30 10.30 50.82** 19.42 29.28* 13.45 56.77** 27.55 3439 22.94 Data Set 5 1947 :2-1981:' 139 Observations Data Set 6 1947 :2_i 980:L 135 ObservationsAinc Amy1 Amy2 Amy3 Ainc Amy1
.0243 .015 -
.0l6
.0244 .0293 .O37 .095 .O41O .080 .037 .073 .266 .O35 .061 .046 .263 .0221 .224 .140 .O54 .081 .242 .287 .041 .241 .344 .02i0 .033 .279R
R
tR
R
R
.015 .015 .012 27.24** l9.71** 8.03 10.33 27.81** 19.47** 34.88** 23.29* 14.67 12.15 35.38** 24.17* 80.04** 27.62 42.66** 15.12 77.86** 31.34* 88.38** 37.08* 50.62** 24.51 85.16** 42.06* Amy2 Amy3 .0230 .o254 .01l .o5o .O310.o
.O59 .093 .487 .062t
R
tR
.012 .0224 8.83 14.35* 15.24 16.58 41.86** 48.80** 21.08 133.00 mCorrelations are displayed above the diagonal.2
Denotes significant at 5% level. **Denotes significant at 1% level. tDenotes adjusted to remove the effects of time averaging.Table 4 Estimates for the Fully Constrained Model1 Data Set
K
g
A S B1 E2 1 (1890-1981) .12 (.06) .03 (.01) 21.23 (9.51) .79 (.37) .080 -.02l (.0223) (.098) (.0214) (.0231) .0240 .225 -.004 -.008 .0231 .047 .202 .374 .0226 .0235 744 -.050
.0275 .0240 .0286 2 (1890-1980) .14 (.07) .03 (.01) 21.40 (9.86) .77 (.39) .0211 -.0i7 (.0230) (.0212) (.0214) (.0233) .0240 .214 -.003 -.008 .0228 .041 .223 .378 —.o13 .0227 .0235 .790 -.045
.0267 .0241 Q277 3 (1953:3 - 1983:1) -4.61 (6.99) .026 (.022) 154.47 (3.39) 1.27 (.15).o2s
.042 .021 -.026 (.o16) (.013) (.031) (.035) .041 .234 .282 -.046 .0252 .199 .176 .0 83 .0 66 .0 21 .220 -.O14 fl3 fl447 .0221 4 (1953:3 - 1980:4) .14 (1.60) .025 (.og) 185.38 (29.85) 1.23 (.20) -.021 .042 .049 -.097 (.O12) (.016) (.077) (.040) .042 .228 .334 -.068 .091 .0238 .220 .049 .o82 .05l .014 .392 -.o16 .O311 .042 .0 5 (1947:2 — 1981 :4) -.23 (.71) .024 (.07) 2.12 (.98) 1.01 (.024) —.039 .0228 Q245 (.027) (.0216) (.0293) (.0236) .057 .224 .111 .003 .0l2 .0249 .155 .328 .O6O .079 .052 .155 .0687 .042 .O9l .0215Table 4 (continued) Data Set 6 (1947:2 - 1980:4)
K
.39 (.80)9
.0236 (.092) A 2.69 (1.34) S 1.01 (.026) B1 -.032 .0232 .0212 (.022) (.0221) (.0279)(.0251) E2.057
.194 .130 -.024 .O79 .0229 .213 .070 .o8l -.o52
.026
.080
1. Figures in parentheses are the estimated standard errors. 2. Correlations are displayed above the diagonal.Table 5 Goodness of Fit Tests Data Set Li (NP=18) L2 (NP=25) L2 (NP=34) L3 (NP4i) • L3 (NP50) Test of Model lvs Model 2 Test of Model lvs Model 3
Test Model Model
1 822.95 838.34 849.33 865.79 877.57 30.78 85.68 54.90 2 832.53 845.04 856.74 871.85 885.13 25.02 78.64 53.62 3 1794.79 1803.07 1835.54 1823.07 1852.67 16.56 56.56 40.00 4 1724.42 1746.34 1755.34 1755.68 1776.96 43.84 62.52 18.68 5 2044.57 2056.02 2073.78 2098.20 2123.08 22.90 107.26 84.36 6 2064.79 2078.36 2102.61 2119.85 2141.57 27.14 110.12 82.98 2os(7) 205(23) x2.os6) NP denotes the number of free parameters. =14.07 =35.17 26.30
x2.ol
x2.0l23 x2.0106) =18.48 =41.64 =32.00I number of observations.
R
R2 for the excess return on stocks equation. for the excess return on bonds equation. A Likelihood ratio test statistics z (-q)/v' where q is the number of restrictions under test. Table 6 Predictability of Excess Returns I. Adjusted Excess Returns (Time Averaged Data) Information Set ={
1, (t-l), £}
(10 restrictions) DataSet IR
R
A 1 91 .17 29 43.00 2 90 .17 .27 39.33 3 118 .26 .18 6.56 4 109 .22 .16 44.66 5 138 .18 .13 42.75 6 134 .18 .16 45.15 7.86 II. Monthly Returns (Point Sampled Information Set ={
Data) lnv?,t_i1nv1
1nv,_1 ,t
} (8 restrictions) Notes: DataSet IR
R
A 1 2 3 4 5 6 1102 1090 355 328 418 406 .20 .21 .16 .15 .05 .06 .20 .21 .13 .13 .04 .04 419.88 439.28 121.04 121.90 62.80 76.67 76.70 80.37 20.23 20.39 9.22 11.84Re-frences
BerqstrornqA..R. 'Continuous Time Stochastic Models and Issues
of
AqgreQation
Over Time,' in Z. Briliches and M. Intriligator(eds) Handbook
of Ecor,ometrics,Amsterdam:North—Holland,
(19B4)Breeden,D.T. 'Ar Irterternporal Asset Pricing Model with Stochastic
Consumption and Investment Opportunities, '
Journal
of FinancialEconomics, 7(1979) :265—296
Granqer,C.W.J. and RE. Engle 'Dynamic Model Specification with Equilibrium Constraints: Co—integration and Error Correction,'
Department of Economics WP.#85—18, University of
California,
San
Dieqo,
revised May 1985Grossman,S.J. and R.J. Shiller 'Preliminary Results on the
Determinants of Stock Market Variability,' unpublished
mimeo, June 1980
—'Or the Determinants of Stock Market Variability, ' American
Economic Review, 7(1981):222—227
—
'Consumption
Correlatedness and Risk Measurement in Economieswith Non—traded Assets and Heterogeneous Information,'
Journal of Financial Economics,10(1982): 195—210
Hannan,E.J. Multiple Time Series, New York:Wiley,1970
Hannan,E..J., W.,T.M. Dunsmuir and M. Deistler 'Estimation of
Vector AR'MAX Models,' Journal of Multivariate Analysis,
10(198W :275—295
Hansen,L.P. and K.J.
Singleton
'Stochastic Consumption,RiskAversion, and the Temporal Behaviour of Asset Returns,
Ibbotsori,R.,G. and
R.A.
Sinquefield Stocks, Bonds, Bills andInflation:The Past. and the
FLiture,
Charlottesville: Financial Analysts Research Foundation,1982 Melino,A. 'Estimation of Unit Averaqed Diffusion Processes,'
Department of Economics—Institute for Policy Analysis WP. #8502, University of Toronto, March 1985
Merton,R.C. 'Optimum Consumption and Portfolio Rules in a Continuous Time Mode1' Journal of Economic Theory. 3(1971):373—413
Osborne,D.R. 'Exact arid Approximate Maximum Likelihood Estimators
for Vector Moving Average Frocesses, '
Journal
of the RoyalStatistical Society, Series B, 39(1977):114—118
Phillips,P..C.,B.. 'The Treatment of Flow Data in the Estimation
of Continuous Time Systems, '
in
Bergstrom et.al (eds),Stability and Inflation, New York:Wiley, 1978
Seltzer,LH. The Nature and Tax Treatment of Capital Gains and Losses, New York:National Bureau of Economic Research,1951 Shiller,R.J 'Consumption, Asset Markets and Macroeconomic
Fluctuations,' Carnegie—Rochester Conference Series, 17(1982) 4ilson,S.T. 'The Factorization of Matricial Spectral Densities,'
SIAM Journal of Applied Mathematics, 23(1972):420—426 —
'The
Estimation of Parameters in Multivariate Time SeriesModels,' Journal of the Royal Statistical Society, Series B,