Estimation in Models of the Instantaneous Short Term Interest Rate By Use of a Dynamic Bayesian Algorithm

(1)

Estimation in Models of the Instantaneous Short Term

Interest Rate by use of a Dynamic Bayesian Algorithm

Ramaprasad Bhar

School of Banking and Finance The University of New South Wales

Sydney 2052 AUSTRALIA [email protected]

Carl Chiarella

School of Finance and Economics University of Technology Sydney PO Box 123, Broadway

NSW 2007 AUSTRALIA Fax. +61 2 9514 7711 [email protected]

Wolfgang J. Runggaldier

Dipartimento di Matematica Pura ed Applicata Universit´a di Padova

Via Belzoni 7 35131-Padova, ITALY [email protected]

(2)

Abstract

This paper considers the estimation in models of the instantaneous short interest rate from a new perspective. Rather than using discretely compounded market rates as a proxy for the instantaneous short rate of interest, we set up the stochastic dy-namics for the discretely compounded market observed rates and propose a dynamic Bayesian estimation algorithm (i.e. a ﬁltering algorithm) for a time-discretised ver-sion of the resulting interest rate dynamics. The ﬁlter solution is computed via a further spatial discretization (quantization) and the convergence of the latter to its continuous counterpart is discussed in detail. The method is applied to simulated data and is found to give a reasonable estimate of the conditional density function and to be not too demanding computationally.

1 Introduction

Literature on estimation in models of the instantaneous spot rate of interest has bur-geoned since the seminal contribution of Chan et al. (1992) (henceforth CKLS). The sustained interest in this topic is due to the great deal of activity, both amongst aca-demics and practitioners, in pricing interest rate related securities.

CKLS applied the generalised method of moments (GMM) to estimate the parameters of a one factor model of the instantaneous spot rate of interest that is mean-reverting in the drift term and has a diﬀusion term that is of constant elasticity in the spot rate. Their estimate for US data of about 1.5 for the elasticity in the diﬀusion term provoked much discussion as it was much higher than the value of 0.5 for the popular Cox, Ingersoll and Ross (1985) model.

Subsequent contributions either extended the basic CKLS formulation and/or consid-ered alternative estimation procedures.

Longstaﬀ and Schwartz (1992), Brenner et al. (1996), Andersen and Lund (1997) and Koedijk et al. (1997) in various ways add volatility dynamics to the model for interest rate dynamics.

As far as estimation methodology is concerned, GMM has remained the work-horse for most of the empirical studies cited. However Nowman (1997, 1998) applied the Gaus-sian estimation techniques developed by Bergstrom (1990) for continuous time stochastic differential equations. In contrast to GMM the Gaussian estimation methodology has the advantage of producing an exact maximum likelihood estimator. Episcopos (2000) subse-quently applied this methodology to estimate the parameters of the CKLS specification for the short-term interest rate for a number of countries. Interestingly he obtained esti-mates for the elasticity in the diffusion term that are much lower than those obtained by CKLS. The fact that two well established estimation methodologies can yield widely dif-fering parameter estimates suggests a need to look at the estimation issue from a different perspective and that is one of the motivations for the current paper.

Irrespective of the estimation methodology one employs, another signiﬁcant issue re-lates to what data is used to proxy the instantaneous spot rate of interest that itself is not an observed quantity. Proxy variables that have been used include US one-month Treasury bill rates (CKLS) and one-month interbank rates (Episcopos). It seems strange

(3)

that the literature has not developed in the direction of deriving and including in the estimation procedure, the stochastic diﬀerential equations that the assumed dynamics of the instantaneous spot rate imply for these market observed rates. Certainly Chapman et al. (1999) have provided some evidence that use of the indicated proxy variables may not induce a great deal of error. However given that the choice of appropriate estimation procedures is not yet an entirely settled issue, it would seem useful to establish a frame-work that removes entirely any potential for errors or biases due to choice of the proxy variable. The current paper provides such a framework.

In this paper we assume for the instantaneous spot rate of interest the stochastic differential equation specification used by CKLS. We then apply Ito’s Lemma to de-rive the stochastic differential equation that this specification implies for the discretely compounded rates observed in financial markets. The result is a set of two stochastic differential equations, one for the unobserved instantaneous spot rate the other for the observed discretely compounded rate. Since we are then dealing with a partially observed system the paper proposes a filtering methodology based on a dynamic Bayesian algo-rithm as a tool to estimate the short rate as well as the parameters in the model from observations of discretely compounded rates.

The plan of the paper is as follows: in section 2 we derive the stochastic dynamic system followed by the instantaneous spot rate and discretely compounded rates. We also show how one can set up the model in a way that the short rate takes values that are positive and belong to a compact interval. Since the data are observed in discrete time, in section 3 we outline the way in which the continuous time stochastic diﬀerential equation system is discretised with particular attention being given to ensuring that the discretised system generates only positive interest rates with the same support as the continuous time counterpart. In this section we also outline the dynamic Bayesian algorithm for the calculation of the distribution of the spot rate and the parameters, conditional on the observations. Section 4 describes the manner in which the dynamic Bayesian algorithm is made computable by an approximation via quantization (spatial discretisation) so that the calculations reduce to a sequence of recursive matrix operations. We also demonstrate the convergence of the approximations. In section 5 we discuss implementation issues and apply the approximating dynamic Bayesian algorithm to some simulated data. Section 6 concludes and makes suggestions for future research.

2 The Stochastic Process Implied for Discretely

Com-pounded Rates

Let r(t) denote the instantaneous spot rate of interest. We shall assume that r(t) is driven by a continuous time diﬀusion process having the general mean-reverting form

dr = K(¯r− r)dt + g(r)dw, (1)

where ¯r is the long-run instantaneous spot rate of interest, K is the speed of mean reversion, g is some suﬃciently well behaved function and w(t) is a Wiener process.

(4)

Subsequently we shall specialise the function g to have the form

g(r) = σrδ, (2)

for σ > 0 and δ > 0 constant. The estimation problem is to obtain estimates of r as well as of the parameters K, ¯r, σ and δ based on relevant market information.

Since r is the instantaneous spot rate of interest, money invested at this rate will com-pound continuously. Thus one dollar invested at time zero at this rate and continuously reinvested will grow by time t to the amount

A_∞(t) = exp( t

0

r(s)ds). (3)

In ﬁnancial markets we never observe the instantaneous spot rate of interest r (unless of course we equate r with overnight rates, but these are usually considered too noisy for empirical work). Rather we observe discretely compounded rates such as US Treasury bill one-month rates, one-month interbank rates, six-month LIBOR rates and so forth. All of these rates we refer to as discretely compounded rates. To our knowledge Sandmann and Sondermann (1997) were the ﬁrst to relate the dynamics of continuously compounded and discretely compounded rates.

In all of the empirical studies referred to earlier some form of one-month rate is used to proxy r(t) in equation (1). In this section we shall derive the stochastic diﬀerential equation followed by discretely compounded rates given that the instantaneous rate r(t) is driven by equation (1).

We use lm(t) to denote the level at time t of an interest rate that is compounded m

times per year. Thus l₁₂(t) would refer to a one-month rate.1 If one dollar is invested at this rate at time zero and then continuously re-invested at this rate then by time t the amount accumulated will be (see Appendix A.1.)

Am(t) = 1 + 1

m t

0

lm(s)ds. (4)

We seek the process for lm(t) that will make the continuously compounded amount

(A_∞(t)) and discretely compounded amount (Am(t)) the same at time t; that is such that

1 + 1 m t 0 lm(s)ds = exp( t 0 r(s)ds), (5)

given that r(t) follows the diﬀusion process (1).

The relationship (5) involves the integrals of both interest rate processes up to time t, in other words it involves the path history of both process. We therefore might expect that this path dependence could cause the process for lm(t) to be path-dependent, or in

other words to be a non-Markovian process. This indeed turns out to be so, however using the now well-established practice of expanding the state space (see e.g. Cheyette (1992), Ritchken and Sankarasubramanian (1995), Bhar and Chiarella (1997) and Inui and Kijima (1998)) it is possible to describe the dynamics of lm(t) as a Markovian system. We can

in fact show:

1_{Here we are using a 360 day count convention. Obvious adjustments could be made to allow for}

(5)

Proposition 2.1 The relationship (5) with r(t) driven by the diﬀusion process (1) implies

that lm(t) is driven by the two-dimensional Markovian System.

dr = K(¯r− r)dt + g(r)dw, (6)

dlm = lm[K(r¯

r − 1) + r]dt + lm g(r)

r dw. (7)

Proof : See Appendix A.2.

Notice that the second (7) makes sense only if we make sure that r, given by (6), is bounded away from zero. (To this eﬀect see the discussion at the end of this section).

Of course it would also be possible to take as our starting point a mean-reverting process for lm(t), viz.

dlm = K(¯lm− lm)dt + h(lm)dw, (8)

and indeed this is in fact the starting point of the empirical literature cited earlier. In this case the relationship (5) linking the process for lm(t) and r(t) will impose a certain

structure on the process for r(t). In fact we have

Proposition 2.2 The relationship (5) with lm(t) driven by the diﬀusion process (8)

im-plies that r(t) is driven by the two-dimensional Markovian System

dlm = K(¯lm− lm)dt + h(lm)dw, (9) dr = r[K( ¯ lm lm − 1) − r]dt + r h(lm) lm dw. (10)

Proof: Manipulations very similar to those yielding the proof of Proposition 2.1. Notice, again, that the second relation (10) makes sense only if lm, solution of (9), is

bounded away from zero.

It is also important to stress that if we were to make the diﬀusion process (8) the starting point of a theory of the term-structure of interest rates (e.g. a Vasicek model) then we would need to make the derivative price depend on the two state variables lm(t)

and r(t) since the instantaneous continuously compounded spot rate r(t) is the one that enters into the continuous hedging argument underpinning the derivation of the derivative pricing equations.

Returning then to the system (6) and (7), the one with which we shall work, we see from Proposition 2.1 that this linked two-dimensional system drives the dynamics of lm(t).

In this system lm(t) is observed in the market whereas r(t) is not observed. Hence we are

dealing with a partially observed system for which a ﬁltering methodology turns out to be an appropriate estimation tool.

To conclude this section we shall now discuss the issue that equation (6) does not guarantee positivity of its solution for any choice of g(·) according to (2). Besides the fact that positivity of its solution is a desirable feature for a model of the instantaneous spot rate, as mentioned after Proposition 2.1, we also need r to be bounded away from zero

(6)

for equation (7) to make sense. To get positivity of r, more precisely r > > 0, we may apply the transformation ρ = T (r), where,

T (r) := r, if r ≥ + η, ( + η) + 2η_π arctan π 2η(r− − η) , if r < + η, (11)

with 0 < η < , and use the variable ρ instead of r to describe the instantaneous spot rate. In fact, T (·) maps the real line into (, +∞); it also has the property of mapping the half line (−∞, + η] into (, + η]. We have now

Proposition 2.3 If rt satisﬁes (6), then ρt= T (rt) satisﬁes

dρt = F (ρt) dt + G(ρt) dwt (12) where, with T−1(ρ) = ρ 1_{ρ≥+η}+ ( + η) + 2η π tan π 2η (ρ− − η) 1_{<ρ<+η} (13) ˙ T T−1(ρ)= 1_{ρ≥+η}+ 1 1 + tan π 2η(ρ− − η) 2 1{<ρ<+η} (14) ¨ T T−1(ρ)= π η tan π 2η(ρ− − η) 1 + tan π 2η(ρ− − η) ₂2 1{<ρ<+η} (15) one has F (ρ) = ˙T T−1(ρ) K ¯r− T−1(ρ)+ 1 2 ¨ T T−1(ρ) g2 T−1(ρ), (16) and G(ρ) = ˙T T−1(ρ) g T−1(ρ). (17)

Proof : Relation (13)-(15) follows immediately from the deﬁnition of T (r) in (11) (see

also Chiarella, Pasquali and Runggaldier (2001)), while (12) is an easy consequence of the application of Ito’ s lemma to T (rt) when rt satisﬁes (6).

In addition to its positivity, it is desirable to have also boundedness of the solution of (12) and this not only in view of the interpretation of the solution, but also for theoretical purposes as will become clear from the time discretized model below. To obtain a solution ρt of (12) on a compact support, say [, H + ] ( > 0 small, H > large), it suﬃces to

multiply drift and diﬀusion coeﬃcients in (12) by χ(ρt) where

χ (ρ) =    1 if < ρ≤ H 0 if ρ > H + H+−ρ if H < ρ < H + (18)

(7)

where the third expression on the right corresponds to a Lipschitz interpolation that guarantees existence and uniqueness of a strong solution of the resulting equation (no matter what the value of δ > 0 in (2) is).

Summarizing, we shall take as our partially observable system for the (unobservable) instantaneous spot rate ρtand for the (observable) logarithm λm := log lmof the discretely

compounded rate lm, the following

     dρ = F (ρ) χ(ρ) dt + G(ρ) χ(ρ) dw, dλm = K ¯r ρ− 1 + ρ− g_2ρ2(ρ)2 dt + g(ρ)_ρ dw (19)

3 Discretisation of the Continuous Time System

(Dy-namic Bayes Algorithm)

Considering the continuous-time model (19), notice that, in reality, one cannot observe λmin continuous time but rather in discrete time. On the other hand, if one could observe

λm in continuous time, one could observe its quadratic variation, i.e. g(ρ)_ρ , that is an

invertible function of ρ, and so the ﬁltering problem (dynamic Baysian algorithm) would degenerate.

We thus perform a time discretization with time step ∆ > 0 of (19), using the Euler-Maruyama scheme and where yn := (λm)n− (λm)n−1 are the discrete time observations

and ∆wn= w(n+1)∆ − wn∆. Thus we consider the discrete time system

       ρn = ρn−1+ F (ρn−1) χ(ρn−1)∆ + G(ρn−1) χ(ρn−1) ∆wn−1, yn = K ¯ r ρ_n−1 − 1 + ρ_n−1− g 2_(ρ n−1) 2(ρ_n−1)2 ∆ + g(ρn−1) ρ_n−1 ∆wn−1. (20)

Putting, as in (2), g(ρ) = σρδ (δ > 0), the model parameters are θ = (K, σ, δ, ¯r) and we shall assume from the outset that they take only a ﬁnite number of possible values.

Since {∆wn} form an i.i.d. sequence of Gaussian random variables, the sequence ρn

generated by (20) is Markovian with transition kernel

¯

pθ(ρn| ρn−1) =N ρn; ρn−1+ F (ρn−1)χ(ρn−1)∆, G2(ρn−1)χ2(ρn−1)∆ (21)

where N (x; m, σ2) denotes a Gaussian random variable x with mean m and variance σ2 and where the subscript θ in ¯pθ refers to the fact that the transition kernel is conditional

on the choice of a parameter vector θ.

While the continuous time process ρt lives on the compact set [, H + ], its time

discretization may (even if only with small probability) take values outside of [, H + ]. We thus renormalize ¯pθ(ρn| ρn−1) to the set (, H + ] by putting

pθ(ρn| ρn−1) =

N (ρn; ρn−1+ F (ρn−1)χ(ρn−1)∆, G2(ρn−1)χ2(ρn−1)∆)

P {N (ρ ; ρ_n−1+ F (ρ_n−1)χ(ρ_n−1)∆, G2(ρ_n−1)χ2(ρ_n−1)∆)∈ (, H + ]}1ρn∈(,H+]. (22)

(8)

On the other hand, from (20) we also have p (yn| ρn−1, θ) =N yn; K ¯ r ρ_n−1 − 1 + ρ_n−1− g 2_(ρ n−1) 2(ρ_n−1)2, g2(ρ_n−1) (ρ_n−1)2 . (23)

Notice that, according to (20), ∆w_n−1 aﬀects the transition from ρ_n−1 to ρn as well as

the observation yn of ρn−1. This implies that, given ρn−1, the random variable ρn is

independent of ym for m < n. This then leads to the following dynamic/recursive Bayes

formula to compute the conditional (on the observations) joint distribution of ρn and θ

p(ρn, θ | yn)∝

_H+

p (yn| ρn−1, θ)· pθ(ρn| ρn−1)· p(ρn−1, θ | yn−1) dρn−1, (24)

where ∝ denotes proportional to and yn := (y₁,· · · , yn). Notice that the ﬁrst available

observation is in fact y₁ = (λm)1− (λm)0; notice also that θ is constant over time so that, for this component of the state, the dynamic Bayes formula acts as an ordinary Bayes formula. In our setting, in which ρn ∈ (, H + ] and θ takes a ﬁnite number of possible

values, the right hand side in (24) is indeed summable over ρn and θ and coincides with

the joint conditional distribution (the filter distribution) p(ρn, θ | yn) of ρn and θ given

yn modulo the proportionality (normalization) factor

ν = θ _H+ _H+ p (yn| ρn−1, θ)· pθ(ρn| ρn−1)· p(ρn−1, θ | yn−1) dρn−1dρn −1 . (25) On the basis of p(ρn, θ | yn) one can then derive the two marginal conditional

distri-butions p(ρn| yn) = θ p(ρn, θ | yn), (26) and p(θ | yn) = _H+ p(ρn, θ | yn) dρn. (27)

To start the recursions (24) one needs the initial joint distribution of ρ₀ and θ, for which one may assume independence of ρ₀ and θ so that

p(ρ₀, θ| y0) = p(ρ₀, θ) = p(ρ₀) p(θ) (28) with p(ρ₀) and p(θ) the individual initial distributions ρ₀ and θ.

4 Approximation of the Bayesian Algorithm by

Quan-tization; Convergence of the Approximations

To actually compute the recursions in (24), we shall introduce a quantization, namely we discretize the values of ρn ∈ (, H + ] (recall that θ takes already from the outset

(9)

a partition R1,· · · , RM of (, H + ] of the form Rh = (αh, βh] (h = 1,· · · , M) and pick representative elements rh ∈ Rh. Due to the property of the transformation T (r) in (11) of mapping (−∞, + η] into (, + η], one may choose α1 = , β1 = + η. Similarly, due to the form of χ(r) in (18), we shall choose αM = H, βM = H + .

Denoting then by P_[i,h](n,θ) the transition probability from ρ_n−1 = ri to ρn = rh, given

the parameter vector θ, from (22) we have, for i = 1,· · · , M − 1, P_[i,h](n,θ) := Pθρn ∈ Rh| ρn−1 = ri= P N (ρ; ri+ F (ri)∆, G2(ri)∆) ∈ Rh P {N (ρ; ri+ F (ri)∆, G2(ri)∆)∈ (, H + ]} = Φ βh_−ri_{−F (r}i_)∆ G(ri₎√_∆ − Φαh−r_G(ri−F (ri₎√_∆i)∆ Φ H+−ri_{−F (r}i_)∆ G(ri₎√_∆ − Φ−r_G(ri−F (ri₎√_∆i)∆ (29) with Φ(·) the cumulative standard Gaussian distribution function. For i = M we obtain the same expression as in (29) except that F (rM) and G(rM) have to be multiplied by (H + − rM)−1, which corresponds to the truncation factor χ(rM).

The conditional distribution p(ρn, θ | yn) in (24) can now be approximated (weakly)

by the conditional probability vector ˜p(ρ = rh, θ | yn) (h = 1,· · · , M), that is obtained from the following approximating dynamic recursive Bayes formula that corresponds to (24):-˜ p(ρn= rh, θ | yn)∝ M i=1 p(yn| ρn−1 = ri, θ)· P_[i,h](n,θ)· ˜p(ρn−1 = ri, θ | yn−1) (30) where ˜p(ρ₀ = ri, θ) = P{ρ₀ ∈ Ri} · p(θ) = p(θ) Ri p(ρ₀)dρ₀ with p(ρ₀) and p(θ) as in (28). We shall rewrite (30) in matrix form as

pθ_n ∝ P(n,θ)T Lθ_npθ_n−1 (31) where P(n,θ) is the matrix with the elements P_[i,h](n,θ) in (29) and where, for any given θ, pθ_n is the M−vector with entries p(ρn = rh, θ | yn); Lθn is the M × M−diagonal matrix

with entry in the i−th position given by p(yn| ρn−1 = ri, θ). Notice that (31) is an easily

computable recurrence relation.

In order that the proposed approximations via quantization are meaningful, one has to prove the convergence, in some sense, of the discretized conditional distributions ˜p(ρn =

rh, θ | yn) to the original continuous counterpart p(ρn, θ | yn). We shall actually prove

their weak convergence in the sense as specified in Theorem 4.1 below. For this purpose, given M , we use ρM_n to denote for the generic period n a random variable that takes the values rh (h = 1,· · · , M) corresponding to the given quantization. From the construction of this quantization, obtained by taking partitions and picking representative elements, one has that, provided the partition becomes finer and finer as M → ∞ (which, in the case of the choice of R1 = (, + η] and RM = (H, H + ] requires also → 0), for any continuous function F (ρ), for any period n, for any ρ_n−1 ∈ (, H + ], and any value of θ

(10)

one has

EθF (ρMn ) | ρn−1 −→ Eθ{F (ρn)| ρn−1} , for M → ∞ (32)

The conditional expectation on the right in (32) is computed on the basis of the transition kernel pθ(ρn| ρn−1) in (22), while the one on the left corresponds to its quantized

coun-terpart in (29). Notice also that any continuous function F of ρM_n or ρn is automatically

bounded since ρM_n and ρn take values in the compact set [, H + ].

Theorem 4.1 For any continuous function F (ρ), for any period n, for any sequence of

observations yn= (y₀,· · · , yn), and for any value of θ one has

EθF (ρMn ) | yn

−→ Eθ{F (ρn)| yn} , for M → ∞. (33)

where, for given θ, the expectation on the left is computed according to ˜p(ρn, θ | yn) (see

(30)) and the one on the right according to p(ρn, θ | yn) (see(24)).

Proof : See Appendix A.3.

5 Implementation

5.1 Computing the conditional probabilities in the

approximat-ing model

By computing the right hand side of the recursive formula (31) we obtain, for each period n, a vector πθ_n of un-normalized conditional probabilities for the various possible values rh (h = 1,· · · , M) of ρn as well as of those of θ, given the observations yn.

For later convenience, let us write this vector as

π_nθ = [π₁θ(n),· · · , · · · , π_Mθ (n)]

where the period n appears as an argument and the subscript refers to the various possible quantized values of ρn that are M in number.

With a slight notational change with respect to (31) and assuming that the possi-ble values for θ are K in number, denote by pθk

h (n)(h = 1,· · · , M; k = 1, · · · , K) the

normalized conditional probabilities for the various values of ρn= rh and θ = θk, namely

pθk

h (n) := P{ρn= rh, θ = θk|yn}.

The relationship between the πθk

h (n) and pθhk(n) is given by pθk j (n) = πθk h (n) K j=1 M i=1π θ_j i (n) .

The marginal conditional probabilities for the various values θk of the parameters can

now simply be obtained as

P{θ = θk|yn} = M h=1 P{ρn = rh, θ = θk|yn} = M h=1πhθk(n) K j=1 M i=1π θj i (n) .

(11)

Analogously, for the marginal conditional probabilities of the various values of ρn = rh we have P{ρn= rh|yn} = K k=1 P{ρn= rh, θ = θk|yn} = K k=1πhθk(n) K j=1 M i=1π θ_j i (n) .

5.2 Estimates of the conditional distribution

We have simulated the system (20) over 65 time steps using the parameter set laid out in Table 1. We have chosen this parameter set around empirical estimates for the U.S. obtained by Episcopos (2000). The dynamic Bayesian algorithm as outlined in Section 4 was then applied to generate the conditional distributions p(ρn = rh, θ|yn). Table 2

sets out the values of the parameters , η and H used in the T (r) (equation 11) and χ(ρ) (equation 12) transformations.

The entire sequence of distributions is displayed in the three-dimensional graph in figure 1a. One axis on the plane displays the range of r values, the other axis on the plane indicates the set of θ vectors. The elements of each θ vector are laid out in Table 3. Thus the value θ = 10 for instance indicates that the values of the elements of θ are drawn from the vector Theta 10 in Table 3. Note that the true value of the parameters occurs at the vector Theta 45, whilst the peak of the distribution occurs at around Theta 39, so that the algorithm has found a reasonable estimate of the parameter set. It should be borne in mind that we have used a very coarse discretisation of the θ vector. The distribution of relevance occurs at the final interest rate, which for the simulation run reported has occurred at r₂₁. This distribution is plotted in figure 1b, again we see that the true theta vector is close to the global maximum of the distribution.

κ σ δ r¯ ∆

0.02 0.01 0.42 0.065 1/104

Table 1: Simulation Parameters

η H

1.0× 10−7 5.0× 10−7 0.3 Table 2: Transformation Parameters

(12)

Theta kappa sigma delta r bar Theta 1 0.02 0.002 0.412 0.065 Theta 2 0.02 0.002 0.413 0.065 Theta 3 0.02 0.002 0.414 0.065 Theta 4 0.02 0.002 0.415 0.065 Theta 5 0.02 0.002 0.416 0.065 Theta 6 0.02 0.002 0.417 0.065 Theta 7 0.02 0.002 0.418 0.065 Theta 8 0.02 0.002 0.419 0.065 Theta 9 0.02 0.002 0.420 0.065 Theta 10 0.02 0.004 0.412 0.065 Theta 11 0.02 0.004 0.413 0.065 Theta 12 0.02 0.004 0.414 0.065 Theta 13 0.02 0.004 0.415 0.065 Theta 14 0.02 0.004 0.416 0.065 Theta 15 0.02 0.004 0.417 0.065 Theta 16 0.02 0.004 0.418 0.065 Theta 17 0.02 0.004 0.419 0.065 Theta 18 0.02 0.004 0.420 0.065 Theta 19 0.02 0.006 0.412 0.065 Theta 20 0.02 0.006 0.413 0.065 Theta 21 0.02 0.006 0.414 0.065 Theta 22 0.02 0.006 0.415 0.065 Theta 23 0.02 0.006 0.416 0.065 Theta 24 0.02 0.006 0.417 0.065 Theta 25 0.02 0.006 0.418 0.065 Theta 26 0.02 0.006 0.419 0.065 Theta 27 0.02 0.006 0.420 0.065 Theta 28 0.02 0.008 0.412 0.065 Theta 29 0.02 0.008 0.413 0.065 Theta 30 0.02 0.008 0.414 0.065 Theta 31 0.02 0.008 0.415 0.065 Theta 32 0.02 0.008 0.416 0.065 Theta 33 0.02 0.008 0.417 0.065 Theta 34 0.02 0.008 0.418 0.065 Theta 35 0.02 0.008 0.419 0.065 Theta 36 0.02 0.008 0.420 0.065 Theta 37 0.02 0.010 0.412 0.065 Theta 38 0.02 0.010 0.413 0.065 Theta 39 0.02 0.010 0.414 0.065 Theta 40 0.02 0.010 0.415 0.065 Theta 41 0.02 0.010 0.416 0.065 Theta 42 0.02 0.010 0.417 0.065 Theta 43 0.02 0.010 0.418 0.065 Theta 44 0.02 0.010 0.419 0.065 Theta 45 0.02 0.010 0.420 0.065 Theta 46 0.02 0.012 0.412 0.065 Theta 47 0.02 0.012 0.413 0.065 Theta 48 0.02 0.012 0.414 0.065 Theta 49 0.02 0.012 0.415 0.065 Theta 50 0.02 0.012 0.416 0.065 Theta 51 0.02 0.012 0.417 0.065 Theta 52 0.02 0.012 0.418 0.065 Theta 53 0.02 0.012 0.419 0.065 Theta 54 0.02 0.012 0.420 0.065 Theta 55 0.02 0.014 0.412 0.065 Theta 56 0.02 0.014 0.413 0.065 Theta 57 0.02 0.014 0.414 0.065 Theta 58 0.02 0.014 0.415 0.065 Theta 59 0.02 0.014 0.416 0.065 Theta 60 0.02 0.014 0.417 0.065 Theta 61 0.02 0.014 0.418 0.065 Theta 62 0.02 0.014 0.419 0.065 Theta 63 0.02 0.014 0.420 0.065 Theta 64 0.02 0.016 0.412 0.065 Theta 65 0.02 0.016 0.413 0.065 Theta 66 0.02 0.016 0.414 0.065 Theta 67 0.02 0.016 0.415 0.065 Theta 68 0.02 0.016 0.416 0.065 Theta 69 0.02 0.016 0.417 0.065 Theta 70 0.02 0.016 0.418 0.065 Theta 71 0.02 0.016 0.419 0.065 Theta 72 0.02 0.016 0.420 0.065 Theta 73 0.02 0.018 0.412 0.065 Theta 74 0.02 0.018 0.413 0.065 Theta 75 0.02 0.018 0.414 0.065 Theta 76 0.02 0.018 0.415 0.065 Theta 77 0.02 0.018 0.416 0.065 Theta 78 0.02 0.018 0.417 0.065 Theta 79 0.02 0.018 0.418 0.065 Theta 80 0.02 0.018 0.419 0.065 Theta 81 0.02 0.018 0.420 0.065

(13)

1 11 21 31 41 51 61 71 81 1 11 21 31 41 51 61 0 0.001 0.002 0.003 0.004 0.005 0.006 0.007 0.008 0.009 θ r

(a) The overall 3-dimensional distribution

0 10 20 30 40 50 60 70 80 θ 0.0000 0.0002 0.0004 0.0006 0.0008 0.0010 0.0012 0.0014 Prb

(14)

6 Conclusion

Taking as a starting point the stochastic differential equation for the instantaneous spot rate of interest used by CKLS, we have derived the stochastic differential equation for the discretetly compounded interest rates that are observed in financial markets. The resulting system of stochastic differential equations for the unobserved instantaneous rate and the market observed discretely compounded interest rates is used as the basis of an estimation procedure that does not require the use of proxy variables for the unobserved rate. The estimation procedure proposed here is a dynamic Bayesian algorithm (stochastic filtering algorithm) that is approximated by a quantization procedure, for which conver-gence results have been demonstrated. The algorithm has been applied to simulated data based on some parameter values obtained in earlier empirical studies. We have obtained quite satisfactory estimates via the updated conditional posterior distributions.

In addition to avoiding the use of proxy variables, the methodology proposed here has the advantage that the updated posterior distribution may be used to calculate a range of statistical quantities of interest. A further advantage is that a number of available discretely compounded rates may be used as the observed quantities. In this way one could obtain the one-factor model of the instantaneous spot rate most consistent with a set of discretely compounded rates whose maturities are of most relevance to the application at hand.

Future research should focus on applying the methodology of this paper to estimate the parameters of the instantaneous spot rate of interest process for data in a number of markets.

7 Appendix 1 : Accumulation at the Discretely

Com-pounded Rate

Consider discrete time intervalst. One dollar invested at time 0 at the discrete rate lm(0) grows to

a₁ = 1 + 1

mlm(0)t

at time t. The amount a₁ is then reinvested at the rate lm(t) over the next time

internal and grows to

a₂ = a₁(1 + 1

mlm(t)t) by time 2t.

Continuing this reasoning, by time t(≡ T t) the amount accumulated is aT =

(T −1)

i=0

(1 + 1

mlm(it)t). By proceeding to the limit T → ∞, t → 0 we obtain

lim T →∞,t→0aT = Am(t)≡ 1 + 1 m t 0 lm(s)ds.

(15)

8 Appendix 2 : Proof of Proposition 2.1

First we take the diﬀerential of equation (5) to obtain 1

mlm(t) = r(t)A∞(t). (34)

We note that the quantity A_∞(t) satisﬁes

dA_∞(t) = r(t)A_∞(t)dt. (35)

By applying Ito’s lemma to equation (34) (this application of Ito’s lemma is simplified by the fact that the stochastic differential equation (35) for A_∞(t) contains no diffusion term) we find that

dlm = mA∞[κ(¯r− r) + r2]dt + mA∞g(r)dw. (36)

Using (34), if r is bounded away from zero, we can eliminate mA_∞(t) to obtain

dlm = lm[κ(r¯

r − 1) + r]dt + lm g(r)

r dw. (37)

The dynamics of the interest rate processes are governed by the two stochastic diﬀerential equation (1) for r(t) and (37) for lm(t). Proposition 2.1 then follows.

9 Appendix 3 : Proof of Theorem 4.1

We proceed by induction on n. For n = 0 we have by construction

EθF (ρM₀ )| y0= EθF (ρM₀ ) → Eθ{F (ρ0)} = EθF (ρ0)| y0 (38) Assume the statement is true for n− 1.

Consider ﬁrst the function

ΦF(r; θ, yn) :=

_H+

F (ρ) p(yn| r, θ) pθ(ρ| r)dρ. (39)

With this function ΦF and the use of the dynamic Bayes’ formula (24) with (25) the right

hand side of (33) can be expressed as

Eθ{F (ρn) | yn} = _H+ F (ρ) p(ρ, θ| yn) dρ = _H+ F (ρ) _H+ p(yn| r, θ) pθ(ρ| r) p(r, θ | yn−1)dr dρ _H+ _H+ p(yn| r, θ) pθ(ρ| r) p(r, θ | yn−1)dr dρ = _H+ ΦF(r; θ, yn) p(r, θ| yn−1)dr _H+ Φ₁(r; θ, yn) p(r, θ| yn−1)dr = Eθ{ΦF(ρn−1; θ, yn) | y n−1_} Eθ{Φ1(ρn−1; θ, yn)| yn−1}. (40)

(16)

Analogously, let ΦM_F (ri; θ, yn) := M h=1 F (rh)p(yn| ri, θ) P_[i,h](n,θ) = M h=1 F (rh)p(yn| ri, θ) Rh pθ(ρ| ri) dρ (41)

where ri, rh are the representative elements ri ∈ Ri and rh ∈ Rh respectively (i, h = 1,· · · , M) and P_[i,h](n,θ) are the transition probabilities in (29). Analogously to (40), with ΦM_F and with the approximating dynamic Bayes’ formula (30), the left hand side in (33) can be expressed as EθF (ρMn )| yn = M h=1 F (rh)˜p(rh, θ| yn) = M h=1 F (rh) M h=1 p(yn| ri, θ) P_[i,h](n,θ)p(r˜ i, θ| yn−1) M h=1 M h=1 p(yn| ri, θ) P_[i,h](n,θ)p(r˜ i, θ| yn−1) = M h=1 ΦM_F (ri; θ, yn) ˜p(ri, θ| yn−1) M h=1 ΦM₁ (ri; θ, yn) ˜p(ri, θ| yn−1) = Eθ ΦM_F ρM_n−1; θ, yn | yn−1 EθΦM₁ ρM_n−1; θ, yn | yn−1. (42)

Notice now that F (·) is continuous and so uniformly continuous on each of the sets Rh (h = 1,· · · , M); furthermore, p(yn| r, θ) is bounded uniformly in yn, r, θ. From (39)

and (41) we then have that, given η > 0, there is M_F0, depending on F , such that, for M > M_F0 and uniformly in r∈ [, H + ], one has

| ΦM F (r; θ, yn)− ΦF(r; θ, yn)| ≤ M h=1 Rh| F (r h_{) p(y} n| r, θ) − F (ρ) p(yn| r, θ) | pθ(ρ| r) dρ ≤ η M h=1 Rh pθ(ρ| r) dρ. (43)

Next, due to the boundedness of F and of p(yn| r, θ) (uniformly in its arguments), the

continuity of p(yn| r, θ) and pθ(ρ| r) in r, and the, uniform in r, integrability of pθ(ρ| r)

with respect to ρ, we can use dominated convergence to obtain the continuity in r of ΦF(r; θ, yn) as deﬁned in (39). By the induction hypothesis we then have that, given

(17)

F , such that, for M > M_F1(yn, yn−1, θ),

| EθΦF(ρM_n−1; θ, yn) | yn−1− EθΦF(ρM_n−1; θ, yn) | yn−1 | < η. (44)

From (43) and (44) it then follows that

| EθΦMF (ρMn−1; θ, yn)| yn−1− Eθ{ΦF(ρn−1; θ, yn)| yn−1} |

≤ | EθΦMF (ρMn−1; θ, yn)| yn−1− EθΦF(ρM_n−1; θ, yn)| yn−1 |

+| EθΦF(ρM_n−1; θ, yn)| yn−1− Eθ{ΦF(ρn−1; θ, yn)| yn−1} | < 2η

(45)

for M > max{M_F0, M_F1(yn, yn−1, θ)} so that, for each θ, yn, yn−1 and each continuous

function F (·), lim

M→∞Eθ

ΦM_F (ρM_n−1; θ, yn)| yn−1 = EθΦF(ρn−1; θ, yn)| yn−1. (46)

Combining this with (40) and (42), we have shown the induction step and thus proved the theorem.

10 References

Andersen, T.and Lund, J. (1997): Estimating continuous-time stochastic volatility models of the short-term interest rate. Journal of Econometrics 77, 343-377.

Bali, T.G.(1999): An empirical comparison of continuous time models of the short-term interest rate. The Journal of Futures Markets 19, 777-797.

Bergstrom, A. (1990): Continuous time econometric modelling, Oxford University Press.

Bhar, R.and Chiarella, C.(1997): Transformation of Heath-Jarrow-Morton mod-els to Markovian systems. European Jounal of Finance 3, 1-26.

Brenner, R., Harjes, R. and Kroner, K. (1996): Another look at models of the short-term interest rate. Journal of Financial and Quantitative Analysis 31, 85-107.

Chan, K., Karolyi, A., Longstaff, F. and Sanders, A. (1992): An empirical comparison of alternative models of the short-term interest rate. Journal of Finance 47, 1209-1227.

Chapman, D., Long, J.and Pearson, N.(1999): Using proxies for the short rate: when are three months like an instant? Review of Financial Studies 12(4), 763-806.

Chiarella, C., Pasquali, S. and Runggaldier, W.J. (2001) : On Filtering in Markovian Term Structure Models (an approximation approach), University of Technol-ogy, Sydney, Quantitative Finance Research Group, Working paper 65.

Cheyette, O.(1992): Term structure dynamics and mortgage valuation. The Jour-nal of Fixed Income pp. 28-41.

Cox, J., Ingersoll, J. and Ross, S. (1985): A theory of the term structure of interest rates. Econometrica 53, 385-407.

(18)

Episcopos A. (2000): Further evidence on alternative continuous time models of the short-term interest rate. Journal of International Financial Markets, Institutions & Money 10, 199-212.

Inui, K. and Kijima, M.(1997): A Markovian framework in multi-factor Heath-Jarrow-Morton models. Journal of Financial and Quantitative Analysis 33(3), 423-440.

Koedijk, K., Nissen, F., Schotman, P. and Wolff, C. (1997): The dynamics of the short-term interest rate volatility reconsidered. European Finance Review 1, 105-130. Longstaff, F. and Schwartz, E. (1992): Interest rate volatility and the term structure of interest rates. Journal of Finance 47, 1259-1282.

Nowman, K.(1997): Gaussian estimation of single-factor continuous time models of the term structure of interest rates. Journal of Finance 52, 1695-1706.

Nowman, K. (1998): Continuous-time short term interest rate models. Applied Financial Economics 8, 401-407.

Ritchken, P. and Sankarasubramanian, L.(1995): Volatility structures of for-ward rates and the dynamics of the term structure. Mathematical Finance 5(1), 55-72.

Sandmann, K, and Sondermann, D.(1997): A note on the stability of lognormal interest rate models and the pricing of Eurodollar futures. Mathematical Finance 7(2),119-125.

Vasicek, O.(1997): An equilibrium characterization of the term structure. Journal of Financial Economics 5, 177-188.