Periodic autoregressive conditional duration

(1)

Munich Personal RePEc Archive

Periodic autoregressive conditional

duration

Aknouche, Abdelhakim and Almohaimeed, Bader and

Dimitrakopoulos, Stefanos

University of Science and Technology Houari Boumediene, Qassim

University, Leeds University

8 July 2020

Online at

https://mpra.ub.uni-muenchen.de/101696/

(2)

Periodic autoregressive conditional duration

Abdelhakim Aknouche

*

, Bader Almohaimeed

**

, and Stefanos

Dimitrakopoulos

1***

*_{Department of Mathematics, College of Science, Qassim University (Saudi Arabia) & Faculty of}

Mathematics, University of Science and Technology Houari Boumediene (Algeria)

**_{Department of Mathematics, College of Science, Qassim University, Saudi Arabia.}

***_{Division of Economics, Leeds University, UK}

Abstract

We propose an autoregressive conditional duration (ACD) model with periodic time-varying parameters and multiplicative error form. We name this model periodic autore-gressive conditional duration (PACD). First, we study the stability properties and the moment structures of it. Second, we estimate the model parameters, using (profile and two-stage) Gamma quasi-maximum likelihood estimates (QMLEs), the asymptotic prop-erties of which are examined under general regularity conditions. Our estimation method encompasses the exponential QMLE, as a particular case. The proposed methodology is illustrated with simulated data and two empirical applications on forecasting Bitcoin trad-ing volume and realized volatility. We found that the PACD produces better in-sample and out-of-sample forecasts than the standard ACD.

Keywords: Positive time series, autoregressive conditional duration, periodic time-varying models, multiplicative error models, exponential QMLE, two-stage Gamma QMLE.

(3)

1 Introduction

Recent research in time series analysis tends to avoid transforming original data prior to mod-eling and prefers to represent them directly through models that take into account the actual support of their distributions. Such an approach parallels to that of generalized linear models

(GLM) for independent data (McCullag and Nelder, 1989). In this way, numerous time se-ries models with “specific values” have, recently, received great interest, such as integer-valued models, including count and binary specifications, and positive-valued models.

A well-known model for positive-valued time series data is the autoregressive conditional

duration (ACD), introduced by Engel and Russell (1998). Originally designed to model

du-rations between financial events in high-frequency microstructure markets, the ACD model is also useful for modeling a broad range of data, such as regularly-spaced return range series (Chou, 2005), daily realized volatility (Lanne, 2006; Zheng et al, 2015; Aknouche and Francq, 2019) and trading volume (Li, 2019; Aknouche and Francq, 2020). Various generalizations of the ACD model have been proposed to take into account additional facts of positive time series data (Pacurar, 2008; Hautsch, 2012; Bhogal and Variyam, 2019).

As in the case of GARCH models, it has been documented that the high persistence observed in empirical studies utilizing the standard ACD specification, is in fact artificial and can be avoided by considering ACD models with time-varying parameters (Diebold, 1986; Andersen and Bollerslev, 1997; Mikosch and Starica, 2004; Hejer and Veltic 2007; Caporin et al, 2017; Gallo and Ortanto, 2018). In this paper, we extend the literature on time-varying ACD models, by proposing an ACD model, the parameters of which are allowed to evolve periodically over time. We name this model periodic autoregressive conditional duration (PACD).

Such a model aims to represent seasonally varying positive-valued series. The observed pro-cess is defined as the product of a unit meanindependent and periodically distributed (henceforth

ipdS) innovation process with the conditional mean of the model having a GARCH-type

spec-ification with periodic time-varying parameters. We first study the stability properties of the PACD model, such as the existence of periodically stationary and ergodic solutions with finite moments or log-moments. Such properties are needed in the estimation stage, which is the second contribution of this paper.

(4)

is used, since it is well-adapted to the support of the distribution of the data, and it does not require specifying a distribution for the periodically distributed innovation sequence. However, because of the periodicity of that sequence, the EQMLE may be less efficient than the Gamma QMLE (GQMLE) which, in fact, accounts for the periodicity of the model innovation.

Consequently, we propose a two-stage Gamma QMLE (2S-GQMLE) which i) utilizes the EQMLE (or a profile GQMLE) in the first stage, ii) estimates the variance innovations, and then iii) uses the latter as a by-product in the second stage of the computation of the GQMLE. Consistency and asymptotic normality (CAN) of the proposed QMLEs are established and the relative efficiency of the 2S-GQMLE is studied for some specific conditional distributions.

The PACD can be used to model various seasonal positive-valued phenomena (realized volatility, trading volumes and transaction rates). The day-of-the-week pattern may be present in all these phenomena, which means that each day of the week may have its own distribution (Franses and Paap, 2000; Boynton et al, 2009; Tsiakas, 2006; Charles, 2010). In that sense, a time-invariant ACD model for daily data is just an average model that does not take into account the specificities of the underlying measures across days. Other examples of non-financial intraday series that may be characterized by periodicity are wind power and wind speed series (Ambach and Croonenbroeck, 2015; Ambach and Schmid, 2015; Ziel et al, 2016).

Our empirical applications concern Bitcoin trading volume data and the UN realized volatil-ity. Both series are characterized by the day-of-the-week effect and we show that the PACD produces better in-sample and out-of-sample forecasts than the benchmark ACD.

The rest of this paper is outlined as follows. In Section 2 we define the PACD and some special cases of it, and describe the link/relationship between the PACD and the periodic GARCH of Bollerslev and Ghysels (1996). In Section 3 we derive the stability conditions of our model. In Section 4, various Gamma QMLEs are proposed and their asymptotic properties are studied. In section 5 we conduct a simulation study and in section 6 we present the empirical results from two series (Bitcoin trading volume and UN realized volatility). All the proofs are given in the Appendix. A Supplementary material accompanies this paper.

(5)

2 Periodic Autoregressive Conditional Duration model

All random variables and processes in this paper are defined on a probability space (Ω,F_{, P}₎

and valued in the set of positive real numbers R₊ _{= (0}_,_∞_{), which is endowed with the Borel}

field B₍R_{+). Let} _S _≥ _{1 be a positive integer called the period, and} _ω_t_{, α}_t₁_{, ..., α}_tq_{, β}_t₁_{, ..., β}_tp

(p, q _∈N₌_{₀_,₁_{, ...}_}_{) be positive real parameters} _S_{-periodic over time, i.e.} _ω_t ₌_ω_t₊_kS_, _α_ti ₌

αt+kS,i (i= 1, ..., q) andβtj =βt+kS,j (j = 1, ..., p) for all integerskandt. Let also{ξt, t∈Z}be

a sequence of positive random variables with E(ξt) = 1 for allt, and a finite Var(ξt) = σt2 >0.

Assume that _{ξt, t∈Z} is ipdS in the sense that ξt D

=ξt+S for all t, where D

= denotes equality in distribution.

A positive-valued stochastic process _{Yt, t ∈Z} is said to be a MEM (multiplicative error

model; Engle, 2002) periodicautoregressive conditional durationwith orderspandq(henceforth PACD(p, q)) if Yt is given for all t∈Z by

Yt =ψtξt (2.1a) and ψt=ωt+ q X i=1 αtiYt₋i+ p X j=1 βtjψt₋j (2.1b)

where the innovation term ξt is independent of ψt₋j for all j ≥ 1. To ensure the almost sure

(a.s.) positivity of ψt, it is assumed that ωt >0, αti ≥ 0, and βtj ≥ 0, for all t ∈ Z, i= 1, .., q

and j = 1, ..., p. To emphasize the periodicity of the model, let t = nS +v for n _∈ Z _and

1_≤v _≤S. Then, equation (2.1b) can be written as follows

ψv+nS =ωv+ q X i=1 αviYv₋i+nS + p X j=1 βvjψv₋j+nS, n∈Z, 1≤v ≤S,

where by season or channel v (1 _≤ v _≤ S) we denote the set _{..., v₋S, v, v+S, v+ 2S, ..._}

with corresponding parameters ωv, αvi, βvi and σv2 = V ar(ξv+nS). Let Ft be the σ-Algebra

generated by _{Yt−i, i≥0}. The conditional mean and conditional variance of the model (2.1)

are given respectively by

(6)

and

V ar(Yt|Ft₋1) = σ2tψ

2

t. (2.2b)

The PACD model, thus, follows the quadratic variance-to-mean relationship (i.e. the condi-tional variance is proporcondi-tional to the squared condicondi-tional mean), where σ2

t >0 is the variance

of ξt and is S-periodic by construction (from the ipdS property of the innovation sequence {ξt, t∈Z}). The specification (2.1) is a multiplicative error model (MEM) in the sense of

Engle (2002), but the conditional mean equation (2.1b) has rather periodic time-varying coef-ficients. For S = 1, model (2.1) reduces to the standard autoregressive conditional duration (ACD in short) of Engel and Russell (1998). No specification for the distribution of _{ξt, t∈Z}

is imposed apart the semiparametric quadratic variance-to-mean function (2.2b). However, a useful family of conditional distributions satisfying (2.2b) is the Gamma distribution with shape

1 σ2 t and scale 1 σ2 tψt, that is Yt|Ft−1 ∼Γ 1 σ2 t, 1 σ2 tψt , (2.3)

where ψt satisfies (2.1b). In the latter case, the innovation term ξt in (2.1) will be marginally

Gamma distributed ξt∼Γ 1 σ2 t, 1 σ2 t , (2.4)

and the process defined by (2.3) is called Gamma PACD(p, q). A notable particular case of model (2.3) appears when the variance σ2

t ≡ 1 is constant, so ξt ∼ Γ (1,1), which corresponds

to theexponential PACD. As in the time-invariant case, the periodic ACD model can be seen as a squared periodic GARCH (PGARCH) model as proposed by Ghysels and Bollerslev (1996). Indeed, consider the following real-valued PGARCH(p, q) process given by

Xt= p htηt (2.5a) and ht=ωt+ q X i=1 αtiXt2₋i+ p X j=1 βtjht−j (2.5b)

where _{ηt, t∈Z} is an ipdS sequence with mean zero and unit variance, and the parameters

ωt, αti and βtj are defined as above. It is clear that the squared PGARCH process defined by

(7)

let _{Yt, t∈Z} be a PACD model given by (2.1), and assume {zt, t∈Z} is an independent

and identically distributed (iid) sequence uniformly distributed in _{−1,1_} (see also Francq and Zakoian, 2019 for the non-periodic case S = 1). Assume _{zt, t ∈Z} and {ξt, t∈Z} are

independent and define the process _{Xt, t∈Z} by

Xt =zt

p

Yt=

p

htηt

where ht=ψt satisfies (2.5b) and ηt =zt√ξt is a term of an ipdS sequence. Hence {Xt, t∈Z}

is a PGARCH model in the sense of (2.5). Note finally that a PACD model admits a weak periodic ARMA (PARMA) (Lund and Basawa, 2000; Francq etal, 2011). SettingYt=ψt+εt,

the process_{Yt, t∈Z} may be written in the following PARMA

Yt=ωt+ max(_Xp,q) i=1 (αti+βtj)Yt−i+εt− p X j=1 βtjεt−j where εt=Yt−E(Yt|Ft₋1) =ψt(ξt−1) (2.6)

is a zero-mean term of a martingale difference sequence with a finite periodic varianceE(ε2

t) =

E(ψ2

t)E(ξt−1)2 =E(ψt2)σ2t.

A more general PACD, which is not necessarily MEM is defined through a conditional distribution of the form

Yt|Ft₋1 ∼Fψt (2.7)

whereFψ is a cumulative probability distribution (with positive support) with meanψ, andψt

is given by (2.1b).

3 Periodic ergodicity and finite moment conditions

We now give necessary and/or sufficient conditions for model (2.1) to be strictly periodically stationary andperiodically ergodic. Such properties are recalled in the Supplementary material. We also consider conditions for the existence of finite moments. Combining (2.1a) and (2.1b)

(8)

we obtain the following stochastic recurrence equation (SRE)

Yt=AtYt₋1+Bt (3.1)

driven by the ipdS sequence {(At, Bt), t∈Z}, where Yt = (Yt, ..., Yt₋q+1, ψt, ..., ψt₋p+1)′, Bt =

ωtξt,0(q₋1)×1, ωt,0(p₋1)×1 ′ , and At=                       αt1ξt · · · αt,q−1ξt αtqξt βt1ξt · · · βt,p−1ξt βtpξt 1 _{· · ·} 0 0 0 _{· · ·} 0 0 ... . .. ... ... ... . .. ... ... 0 _{· · ·} 1 0 0 _{· · ·} 0 0 αt1 · · · αt,q₋1 αtq βt1 · · · βt,p₋1 βtp 0 _{· · ·} 0 0 1 _{· · ·} 0 0 ... . .. ... ... ... . .. ... ... 0 _{· · ·} 0 0 0 _{· · ·} 1 0                       ,

0m_×n being the null matrix of dimension m×n. Let

γS = inf

1

nElogkAnS...A2A1k, n≥1

be the top Lyapunov exponent associated with the ipdS-driven SRE (3.1) (Aknouche et al,

2020). Let also βt=          βt1 · · · βt,p₋1 βtp 1 _{· · ·} 0 0 ... ... ... ... 0 _{· · ·} 1 0          ,

and denote by ρ(A) the spectral radius of the squared matrix A, i.e. the maximum modulus of the eigenvalues of A. The following result gives the conditions for equation (3.1) to have a unique strictly periodically stationary and periodically ergodic solution.

Theorem 3.1 i) Assume E(log (ξv)) <∞ for all 1 ≤ v ≤ S. A necessary and sufficient

(9)

periodically ergodic solution is that

γS _<₀_. ₍₃_.₂₎

Such a solution is given for all t_∈Z _by

Yt= ∞ X j=0 j−1 Y i=0 At₋iBt₋j, (3.3)

where the series in the right hand side of (3.3) converges absolutely almost surely.

ii) If (2.1)admits a strictly periodically stationary solution then

ρ S_Y−1 v=0 βS−v ! <1. (3.4)

In the special case where p = q = 1, the periodic stationarity condition (3.2) is simplified as follows S X v=1 E(log (αvξv−1+βv))<0, while (3.4) reduces to S Q v=1 βv <1.

Conditions for the existence of moments of the P ACD(p, q) process are given as follows.

Theorem 3.2 Assume E(ξv)<∞for all 1≤v ≤S. A sufficient condition for the process

given by (2.1) to be strictly periodically stationary and periodically ergodic with E(Yt)<∞ is

that ρ S_Y−1 v=0 E(AS−v) ! <1. (3.5)

Some remarks are in order:

- In the case, where S = 1, the conditional mean coefficients are time-invariant, that is

ωtj =ω, αtj =αj andβtj =βj. Therefore, using a similar device by Chen and An (1998), (3.5)

reduces to the following stationarity in mean condition

q X i=1 αi+ p X j=1 βj <1

as provided by Engle and Russell (1998).

(10)

following condition

S Y v=1

(αv +βv)<1. (3.6)

Theorem 3.3 i)Under (3.2) there exists κ >0 such that for all 1_≤v _≤S

E(ψvκ)<∞ and E(Y

κ

v )<∞. (3.7)

ii) Let _{Yt, t∈Z} be a strictly periodically stationary solution of (2.1) and assume that

E(ξm

v ) (m∈N∗) is finite for all 1≤v ≤S. A sufficient condition for E(Yvm) to be finite (for

all 1_≤v _≤S) is that ρ S_Y₋1 v=0 E A⊗m S₋v ! <1 (3.8)

where A⊗m _{is the Kronecker product:} _A_⊗_A_{⊗ · · · ⊗}_A _with _m _factors_.

In the special case of Gamma PACD with p=q = 1, explicit conditions equivalent to (3.8) can be given. These conditions are also necessary for the existence of finite moments.

Proposition 3.1 The Gamma P ACD(1,1) model (2.3) admits a unique nonanticipative

periodically ergodic solution _{Yt, t∈Z} such that:

i) E(Yv)<∞ (1≤v ≤S) if and only if (3.6) holds.

ii) E(Y2

v)<∞ ( 1≤v ≤S) if and only if E(ξv2)<∞ (1≤v ≤S), (3.6) and

S Y v=1 α2_vE ξ_v2₋₁+ 2αvβv +βv2 <1. (3.9) iii) E(Y3

v)<∞ (1≤v ≤S) if and only if E(ξv3)<∞ ( 1 ≤v ≤S), (3.6), (3.9) and

S Y v=1 E ξ_v3₋₁α3_v+ 3 σ_v2₋₁+ 1α2_vβv+ 3αvβv2+β 3 v <1. (3.10) iv) E(Y4 v) < ∞ (1≤v ≤S) if and only if E(ξv4) < ∞ ( 1 ≤v ≤S), (3.6), (3.9), (3.10)

and the following hold

S Y v=1 E ξv4₋1 α4v + 4 1 +σ2v₋1 1 + 2σ2v₋1 α3vβv+ 6 1 +σv2₋1 α2vβv2+ 4αvβv3+βv4 <1. (3.11) For the particular exponential PACD(1,1) model, Yt|Ft−1 ∼ Γ

1, 1

ψt

(11)

Proposition 3.1 the moments E(ξ2v), E(ξv3) and E(ξv4) by 1, 6 and 24 respectively, and σv2

by 1 for all 1_≤v _≤S.

4 Gamma quasi-maximum likelihood estimates

LetY1, Y2, ..., YT be a series generated from the PACD(p, q) model, which we can rewrite in the

following form YnS+v =ψnS+vξnS+v, ψnS+v =ψnS+v(θ0) = ω0v+ q P i=1 α0 viYnS+v−i+ p P j=1 β0 vjψnS+v−j, 1_≤v _≤S, n_∈Z ₍₄_.₁₎

where the true parameter θ0 = (θ01′, θ20′, ..., θS0′)′ with θv0 = (ωv0, α0v1, ..., αvq0 , βv01, ..., βvp0 )′ (1≤v ≤

S) belongs to a parameter space Θ _⊂ (0,_∞)_×[0,_∞)(p+q)S. The true innovation variance parameter σ2

0 = (σ201, ..., σ20S)′ withσ02v =V ar(ξnS+v) (1≤v ≤S) also belongs to a parametric

space ∆ _⊂ RS

+. The sample size T = N S (N ≥ 1) is assumed without loss of generality a multiple ofS. Given initial valuesY0, ..., Y1−q,ψe0, ...,ψe1−p and a generic parameterθ∈Θ define

e ψnS+v(θ) = ωv+ q X i=1 αviYnS+v₋i+ p X j=1 βvjψenS+v₋j(θ), 1≤v ≤S, n≥0, (4.2a)

as an observable proxy for ψnS+v(θ). The latter is defined as a periodically stationary solution

of the following generic model (θ _∈Θ)

ψnS+v(θ) =ωv+ q X i=1 αviYnS+v₋i+ p X j=1 βvjψnS+v₋j(θ), 1≤v ≤S, n∈Z. (4.2b)

4.1 Exponential and profile Gamma QMLEs

The true conditional distribution of (4.1) is unknown due to the unpecification of the law of

ξv (1 ≤ v ≤S). Thus, a quasi-maximum likelihood estimate (QMLE) which does not require

any precise knowledge of the conditional distribution is suitable for estimating the parameter

θ0 involved in the conditional mean. Among many possible QMLEs, the one computed on the basis of the exponential distribution (EQMLE in short) is especially useful for positive duration data because it reduces to the maximum likelihood estimate when ξv is exponentially

(12)

distributed (Aknouche and Francq, 2020). A more general QMLE, which can be more efficient than the EQMLE in the periodic time-varying innovation context is the one computed on the basis of the Gamma distribution with arbitrary fixed variance parameters. Let (σ2

t)t be fixed

known positive numbers, S-periodic over t, i.e. σ2

v+kS = σv2, for all k ∈ {0, ..., N −1}. The

profile Gamma likelihood associated with σ2 _{= (}_σ2

1, ..., σS2)′ >0 is given, ignoring constants, by

e LT (θ) =_T1 T X t=1 elt(θ), (4.3a) elt(θ) =_σ12 t Yt e ψt(θ)+ log e ψt(θ) , t_≥1. (4.3b)

The profile Gamma QMLE (GQMLE) θbG of θ0 is, then, the minimizer ofLeT (θ) over Θ, b

θG = arg min

θ∈ΘLeT (θ) . (4.4)

When σ2 _{= (1}_{, ...,}₁₎′_{, the GQMLE defined by (4}_._{4) reduces to the EQMLE and is denoted}

byθbE (Aknouche and Francq, 2020).

LetγS₍_A0_{) be the top Lyapunov exponent associated with (}_A0

t, t∈Z) where the matrixA0t

is just At defined in (3.1) with θ0 in place of θ. To establish the strong consistency of θbG we

need to the following assumptions.

A1 γS₍_A0₎_<_{0 and} _∀_θ _∈_Θ_{, ρ} _S₋₁ Q v=0 βS₋v <1. A2 θ0 ∈Θ and Θ is compact. A3 The polynomials α0 v(z) = q P i=1 α0 vizi and βv0(z) = 1− p P j=1 β0

vjzj have no common root,

α0

v(1)6= 0, and αvq0 +βvp0 6= 0 for all 1≤v ≤S.

A4 ξv is non-degenerate for all 1≤v ≤S.

As seen in Section 3,γS₍_A0₎_<_{0 in}_A1_{ensures periodic stationarity and periodic ergodicity} of the PACD model (4.1). The condition ρ

_S₋₁ Q v=0

βS₋v

< 1 is imposed for the invertibility of equation (4.2b) for any θ_∈Θ. The compactness assumption A2 is standard while A3 and A4

are made to guarantee the identifiability of the model.

Theorem 4.1 Let θbG

be a sequence of EQMLEs defined by (4.3). Under A1-A4,

b

(13)

Turn now to the asymptotic normality property of bθG. The following assumptions are to be

considered.

A5 θ0 belongs to the interior of Θ.

A6 The matrices I θ0, σ2 = S X v=1 σ2 0v σ4 v E 1 ψ2 v(θ0) ∂ψv(θ0) ∂θ ∂ψv(θ0) ∂θ′ , J θ0, σ2 = S X v=1 1 σ2 vE 1 ψ2 v(θ0) ∂ψv(θ0) ∂θ ∂ψv(θ0) ∂θ′ (4.5)

are finite, and J(θ0, σ2) is nonsingular for all σ2 >0.

Theorem 4.2 Under A1-A6 we have

√ NθbG−θ0 _D →N ₍₀_,_{Σ) as}_N _{→ ∞} _{for all} _σ2 _>₀ ₍₄_.₆_a₎ where Σ =J θ0, σ2− 1 I θ0, σ2J θ0, σ2− 1 (4.6b)

is block-diagonal and _→D stands for convergence in distribution.

Remark 4.1

i) When σ2 _{= (1}_{, ...,}₁₎′ _:= ₁_{, the EQMLE has a covariance matrix in a ”sandwich” form}

and is, in general, not asymptotically efficient unlessσ2

0 =1and the conditional distribution is exponential.

ii) For the special exponential PACD(p, q) model corresponding to V ar(ξv) = 1 for all

1_≤ v _≤ S, if we set σ2 ₌₁ _then _J₍_θ

0,1) =I(θ0,1) and the asymptotic covariance matrix of the EQMLE reduces to Σ =J(θ0,1)−1. The EQMLE is thus asymptotically efficient.

iii) If ξv has a constant variance, i.e. σ02v =V ar(ξv) =σ20 for all 1 ≤v ≤S, then it suffices to take σ2 _{= (1}_{, ...,}₁₎′ _{and apply the EQMLE. We would have} _I₍_θ

0,1) = σ02J(θ0,1) and the covariance matrix would be equal to Σ = σ2

0J(θ0,1)−1. In this case, the EQMLE is the best QMLE among all QMLEs belonging to the linear exponential family.

iv) For the non-periodic ACD corresponding to S= 1 and then σ2

0v =σ02 for all 1≤v ≤S, it is natural to take σ2

v = σ2 for all 1≤ v ≤ S. In this case, the profile likelihood (4.3) would

be given by Let(θ) = _σ12_T1 T P t=1 Yt e ψt(θ) + log e ψt(θ)

and the resulting GQMLE then reduces to

maximizing _T1 T P t=1 Yt e ψt(θ) + log e ψt(θ)

(14)

is why, in general, the EQMLE is the most used QMLE for non-periodic ACD even when the latter is strictly (conditionally) Gamma distributed.

v) When the profile varianceσ2_{coincides with the true variance}_σ2

0we would haveJ(θ0, σ02) =

I(θ0, σ02) and Σ =J(θ0, σ20)− 1

, where the GQMLE is the most efficient among all QMLEs be-longing to the exponential family. As σ2

0 is generally unknown, a crucial step is to get a consistent estimate _bσ2

0 and construct with it an estimated (profile) log-likelihood from which a new Gamma QMLE, called the two-stage Gamma QMLE (2S-GQMLE), is computed. The resulting estimate would have the aforementioned efficiency property.

vi) Theorem 4.1 and 4.2 also hold for the non-MEM PACD (2.7). It suffices to replace the assumptions A1-A4 by the following:

A1’ The process _{Yt, t∈Z} is strictly periodically stationary and periodically ergodic.

A2’E Yt1+ǫ

<_∞ for someǫ >0.

A3’ψt(θ) =ψt(θ0) a.s. ⇒θ =θ0.

4.2 Estimating the innovation variances

To estimate the unknown variances σ2

0 under the MEM constraint recall (2.1)-(2.2) and let

ut = (Yt−ψt)2−V ar(Yt|Ft₋1) =ψ2t (ξt−1)2−σ02t . Then, (Yt−ψt)2 ψ2 t =σ 2 0t+vt (4.7a) where vt = _ψut2 t = (ξt−1) 2 − σ2

0t. The sequence (vt) is thus zero-mean iid with variance

E (ξt−1)2−σ2₀t

2

, which is finite under the following assumption.

A7 E(ξv4)<∞for all v = 1, ..., S.

Since ψt=ψt(θ0) depends on the unknown parameter θ0, the regressand in (4.7a) is unob-servable. If we replace θ0 by a consistent estimate, say the GQMLE in (4.4), then we get the following approximate regression but with observable regressand

(Yt−ψbt) 2 b ψ2 t =σ2₀t+bvt, (4.7b)

(15)

where ψbt=ψt

b

θG

. From (4.7b) a feasible OLS estimate (OLSE) ofσ2

0 is given by b σ2_v = 1 N N_X−1 n=0 (Yv+nS−ψbv+nS) 2 b ψ2 v+nS , for all v = 1, ..., S. (4.8)

The following result shows that the OLSE σ_b2

v (1≤v ≤S) is consistent and asymptotically

Gaussian.

Theorem 4.3 Under A1-A4

b

σ2

v →σ02v a.s. asN → ∞, for all v = 1, ..., S. (4.9a)

If in addition A7 holds then for all v = 1, ..., S

√ N _bσv2−σ20v D →N ₍₀_,_Λ_v_{) as} _N _{→ ∞} ₍₄_.₉_b₎ where Λv =E (ξv −1)2−σ02v 2 .

A consistent estimate of the limiting variance Λv in (4.9b) is given by

b Λv = _N12 N_X₋1 n=0 b ξv+nS−1 2 −bσv2 2 , v = 1, ..., S, (4.10) where ξbv+nS = Yv+nS b

ψv+nS is the residual of model (2.1). With (4.9b) and (4.10), the asymptotic

matrices in (4.5) may also be estimated. A consistent estimate of Σ is

b Σ =Jb−1Ib−1Jb−1 (4.11) where b J = _N1 N_X₋1 n=0 S X v=1 1 σ2 vψv2+nS(θbG) ∂ψ_ψv₊_nS(θbG) ∂θ ∂ψ_ψv₊_nS(θbG) ∂θ′ , Ib= 1 N N_X₋1 n=0 S X v=1 b σ2 v σ4 vψv2+nS(θbG) ∂ψv+nS(bθG) ∂θ ∂ψv+nS(θbG) ∂θ′ .

4.3 Two-stage Gamma QMLE

We have seen above that the asymptotic distribution and then the asymptotic efficiency of the profile GQMLE depend on the choice of the profile varianceσ2_{. To improve the efficiency of the}

(16)

GQMLE, we can replace in (4.4) the profile variancesσ2 by the OLS estimatesσ_b2 = (_bσ21, ...,bσ2S)′

given by (4.8). The resulting estimate is denoted by 2S-GQMLE and is given by the following steps.

Algorithm 4.1 Two-stage GQMLE

i) Fix an arbitrarily σ2 _>₀_{, for example} _σ2 _{= (1}_{, ...,}₁₎′_.

ii) Get the profile GQMLE θbG from (4.4).

iii) Estimate the variance innovation σ2

0 using bσ2 in (4.8).

iv) Consider the 2S-GQMLE as a solution of the following problem

b θ∗G= arg min θ∈Θ N_X₋1 n=0 S X v=1 _Y v+nS b σ2 vψev+nS(θ) + 1 b σ2 vlogψev+nS(θ) . (4.12)

Consistency of asymptotic normality of bθ∗

G are a by-product of Theorems 4.1-4.2.

Corollary 4.1 Under A1-A4

√

Nθb∗_G₋θ0

_D

→N ₀_{, J θ}₀_{, σ}2−1 _as _N _{→ ∞}_{for all} _σ2 _>₀_.

The latter result shows that whatever the distribution of (ξv)_v is, the 2S-GQMLE bθ∗G is

asymptotically the most efficient one among all QMLEs belonging to the linear exponential family (cf. Gourieroux etal, 1984; Wooldridge, 1999). In particular, θb∗

Gin never asymptotically

less efficient than the profile GPQMLEθbG and therefore than the EQMLE bθE.

5 Simulation study

We examine the finite-sample behavior of the Gamma QMLEs, as defined above, using many simulated PACD(1,1) series with sample size T = 2000. We consider two distributions for the innovation ξt in (2.1), namely i) the exponential distribution (ξt ∼ E (1) ≡ Γ (1,1)) so that

Yt|Ft−1 ∼ Γ (1,1/ψt), and ii) the Gamma distribution (ξt ∼ Γ σ0−t2, σ0−t2

) so that Yt|Ft−1 ∼ Γ σ₀−t2, σ−0t2/ψt

, whereσ₀−v2 (1≤v ≤S).

For these two cases we take S = 5, which is representative of many real daily trading measurements, such as trading volumes and realized volatilities. The true conditional mean parameters, which are reported in Tables 5.1 and 5.2, are chosen so that the PACD model to

(17)

be stable in the sense of Section 3, while implying fairly persistent series that are in accordance with the empirical evidence. For each case and for each series we compute the EQMLE and the two-stage GQMLE (2S-GQMLE), using 1000 Monte Carlo replications.

The starting parameter value in the nonlinear optimization routines (4.4) and (4.12) is set to the true value, while the unobservable starting valuesY0 andψ0(θ) of the PACD(1,1) equation are set to the intercept ω0

0. The two-stage GQMLE is calculated, with the EQMLE computed in the first stage.

EQMLE 2S-GQMLE v θ0 v ω0v α0v βv0 σ20v ωv0 α0v βv0 1 True Mean Std 0.5 0.5191 0.3163 0.6 0.5981 0.0709 0.35 0.3484 0.0698 1 0.9817 0.0968 0.5 0.5191 0.3168 0.6 0.5980 0.0710 0.35 0.3484 0.0699 2 True Mean Std 0.9 0.8941 0.3624 0.4 0.4003 0.0723 0.5 0.5039 0.0921 1 0.9917 0.1001 0.9 0.8940 0.3628 0.4 0.4001 0.0723 0.5 0.5040 0.0924 3 True Mean Std 1.5 1.4558 0.4795 0.5 0.5006 0.0784 0.5 0.5063 0.1010 1 0.9837 0.0992 1.5 1.4559 0.4806 0.5 0.5005 0.0783 0.5 0.5063 0.1011 4 True Mean Std 0.45 0.4521 0.4000 0.45 0.4490 0.0635 0.45 0.4453 0.0795 1 0.9859 0.1041 0.45 0.4525 0.4007 0.45 0.4492 0.0636 0.45 0.4452 0.0798 5 True Mean Std 0.7 0.6797 0.3828 0.55 0.5512 0.0726 0.40 0.4051 0.0804 1 0.9792 0.0939 0.7 0.6793 0.3832 0.55 0.5510 0.0728 0.40 0.4053 0.0806 Table 5.1. EQMLE and 2S-GQMLE results for 1000 PACD(1,1) series withn = 2000

generated from the exponential Γ (1,1/ψt) distribution.

Means and standard deviations of the estimates bθE and θb∗G over the 1000 replications are

reported in Table 5.1 for the exponential PACD(1,1) model and in Table 5.2 for the homolog Gamma PACD(1,1) model. It can be observed from Tables 5.1-5.2 that the results are consistent

(18)

with asymptotic theory. They are indeed almost identical in the exponential case with a slight superiority of the EQMLE over the 2S-GQMLE (cf. Table 5.1). The two estimates are, in fact, asymptotically efficient in this case but the EQMLE is much simpler to compute. For the Gamma PACD model in Table 5.2, θb∗

G outperforms θbE in terms of bias and variability, as

expected. In all cases, the 2S-GQMLE is the least risky one in the misspecification case.

EQMLE 2S-GQMLE v θ0 v ω0v α0v βv0 σ20v ωv0 α0v βv0 1 True Mean Std 0.2 0.2099 0.1573 0.4 0.4022 0.0408 0.5 0.4991 0.0768 0.5 0.4976 0.0428 0.2 0.2020 0.1550 0.4 0.4012 0.0399 0.5 0.5006 0.0754 2 True Mean Std 0.9 0.8843 0.1462 0.3 0.3045 0.0654 0.6 0.6046 0.0891 0.3 0.2981 0.0240 0.9 0.8934 0.1452 0.3 0.3036 0.0553 0.6 0.6009 0.0798 3 True Mean Std 0.3 0.3341 0.2856 0.5 0.5030 0.1130 0.4 0.3815 0.1240 1.5 1.4729 0.1582 0.3 0.3417 0.2826 0.5 0.5058 0.1100 0.4 0.3777 0.1201 4 True Mean Std 0.4 0.4040 0.2645 0.45 0.4526 0.0702 0.45 0.4534 0.0971 1 0.9809 0.0935 0.4 0.3995 0.2599 0.45 0.4525 0.0681 0.45 0.4541 0.0933 5 True Mean Std 0.5 0.4997 0.2729 0.55 0.5546 0.0921 0.35 0.3494 0.1002 2 1.9424 0.2247 0.5 0.4923 0.2487 0.55 0.5546 0.0834 0.35 0.3533 0.0838 Table 5.2. EQMLE and 2S-GQMLE results for 1000 PACD(1,1) series with T = 2000

generated from the Gamma Γ (1/σ2

0t,1/σ20tψt) distribution.

6 Empirical applications

6.1 Application to Bitcoin trading volume data

In our application, we fit the PACD(1,1) model to the daily Bitcoin trading volume (BTV). The dataset was obtained from the webpage www.blockchain.com. This series spans from July,

(19)

Figure 6.1: Daily Bitcoin trading volume (BTV).

3, 2017 to June, 26, 2020, with a total ofT = 1092 = 7_×156 observations. Figure 6.1 displays the time series plot of the data.

In the context of Bitcoin prices, Mbanga (2019) found evidence of the presence of the day-of-the-week pattern. Our aim here is to show that the Bitcoin volume data are also characterized by the day-of-the-week effect, which implies a period of S = 7. Such a case is different from the data usually encountered in non-cryptocurrency returns (such as stocks, exchange rates), which are characterized by a periodicity of S = 5, due to the existence of non-trading days at each week (Franses and Paap, 2000; Tsiakas, 2006).

Table 6.1 provides some descriptive statistics for the full sample and for each day of the week separately. The mean of BVT series is clearly different from one day to another. The difference is more pronounced for the Kurtosis and skewness across the days. Also, the estimated kernel densities of the data across the days are visually different (see Supplementary material). In that regard, we suspect that the day-of-the week effect may characterize the Bitcoin trading volume series.

Day Full series Mon Tue Wed Thu Fri Sat Sun Mean 40.8394 33.3621 41.3257 42.9304 46.3669 46.7979 44.2866 30.8063 Std 47.2340 40.9090 41.2095 49.5528 52.3069 57.7127 48.9096 34.3056 Kurtosis 17.2929 9.1734 6.6888 16.2964 11.3676 25.8981 10.0401 5.8247 Skewness 2.9736 2.3813 1.9404 3.1176 2.5808 3.8368 2.3894 1.8163

(20)

We first estimate a standard ACD(1,1) model (i.e. PACD with S = 1), using the EQMLE as recommended in Remark 4.1, (iv). This model is used as a competitor to our PACD(1,1). The initial parameter values are set to θ(0) ₌ _ω(0)_{, α}(0)_{, β}(0) _{= (0}_.₁_,₀_.₃_,₀_._{5) and the starting} values of the conditional mean equation are fixed toY0 =ψ0 =ω(0). The estimated parameters and their asymptotic standard errors (ASE) in parentheses, obtained from Theorem 4.2-4.3, are reported in Table 6.2. In particular, the ASE of σ_b2 _{is computed from (4}_._{10). The persistence} parameter estimate, α_b+βb= 0.9865, indicates a strong persistence in the series, as expected.

b

ω α_b βb _bσ2 _α_b₊_βb _IMSFE _IMAFE 0.8293

(0.2062) 0(0..46150270) 0(0.5250.0296) 0(0.3351.0019) 0.9865 701.9662 14.01243 Table 6.2. EQML estimates for the ACD(1,1); BTV series.

Table 6.2 also displays the in-sample mean square (one-step ahead) forecast error (IMSFE) and the in-sample mean absolute forecast error (IMAFE) given by IMSFE= 1

T T P t=1 (Yt−ψbt)2 and IMAFE= _T1 T P t=1 Yt−ψbt

, respectively. Unreported sample autocorrelations of the residu-als consolidate the validity of the estimated ACD(1,1). Since this model does not take into account the day-of-the-week effect, we fit a 7-periodic PACD(1,1) to the BVT series. To this end, we utilize the 2S-GQMLE by starting from the EQMLE in the first stage with the fol-lowing initial parameter values for the optimization routine: ω(0) _{= (0}_.₆_,₀_.₂₅_,₀_.₃₅_,₃_,₂_,₄_,₁_._5),

α(0) _{= (0}_.₂₅_,₀_.₁₅_,₀_.₁_,₀_.₃_,₀_.₃_,₀_.₄_,₀_._{35) and} _β(0) _{= (0}_.₇_,₀_.₈_,₀_.₈₅_,₀_.₄_,₀_.₅_,₀_.₂_,₀_._{45). These} ini-tial values are arbitrary but we checked that the estimates are robust even with other iniini-tial

(21)

values. Day v σ_b2 v ωbv αbv βbv αbv +βbv IMSFE IMAFE Mon 1 0.2526 (0.0050) 0(0..01653537) 0(0.5205.0421) 0(0.5645.0481) 1.0850 Tue 2 0.2716 (0.0102) 2(0..99454361) 0(0.5423.0379) 0(0.7173.0430) 1.2597 Wed 3 0.3682 (0.0180) 0(0..31092705) 0(0.1544.0414) 0(0.8710.0428) 1.0244 Thu 4 0.2839 (0.0075) 0(0..01256255) 0(0.4951.0690) 0(0.5641.0772) 1.0591 Fri 5 0.3299 (0.0092) 0(0..28895360) 0(0.3663.0655) 0(0.6358.0714) 1.0021 Sat 6 0.2129 (0.0014) 1(0..30405645) 0(0.4422.0724) 0(0.4786.0810) 0.9207 Sun 7 0.2372 (0.0043) 0(0..35134721) 0(0.4339.0781) 0(0.2256.0914) 0.659 56 All 7 Q v=1 (α_bv +βbv) 0.9025 651.9418 13.1164

Table 6.3. 2S-GQML estimates for the PACD(1,1); BTV series.

The 2S-GQML estimates and their ASEs in parentheses are reported in Table 6.3. We observe that the estimates are quite different across the days. The persistence parameters over the days show locally explosive behaviors except for Saturday and Sunday. However, the whole persistence parameter, Q7_v₌₁(α_bv +βbv) = 0.9025 (also called the monodromy estimate) is, as

expected, considerably smaller than the one given by the estimated standard ACD(1,1). All results have been obtained irrespective of any distributional specification of the models.

Note that the ASE of estimates for the PACD are larger than those obtained for the ACD. This is due to the fact that for the PACD the ASEs are computed for lower channel series with sample size T

S = 156. To get the same precision as with the ACD we should consider larger

series with the sample size multiplied at least by 7. Nevertheless, in term of in-sample forecast ability (IMSFE and IMASE), the PACD model outperforms the standard ACD.

To compare the out-of-sample forecasting performance of the two models, we estimate the two competing models on the basis of the first Tf observations of the series, where 1 < Tf < T.

Then, we compute the one-step ahead forecast on the period (Tf + 1, ..., T) based on

b

(22)

We finally calculate for each model the following three criteria: i) the mean square forecast error MSFE= 1 T₋Tf T P t=Tf+1

(Yt −ψbt)2, ii) the mean absolute forecast error MAFE =

1 T₋Tf T P t=Tf+1 Yt−ψbt

, and iii) the mean QLIKE (cf. Patton, 2011; Aknouche and Francq, 2019) MQLI= _T₋1_T f T P t=Tf+1 (logψbt+ Y_ψ_bt t).

Table 6.4 shows these computed values of these criteria for the two models and for various truncated series with sample sizeTf. It can be observed that irrespective of the chosen sample

size, the PACD yields better out-of-sample forecasts with regard to the aforementioned criteria. Overall, the PACD(1,1) outperforms the ACD(1,1), both in terms of in-sample and out-of-sample forecasting power.

Tf 500 600 700 800 900 1000 ACD MSFE MAFE MQLI 207.8341 8.6659 3.8983 165.8957 7.8318 3.7735 89.6771 6.9127 3.6879 88.3917 6.6050 3.6052 94.4021 6.9080 3.6959 104.8815 7.6159 3.8487 PACD MSFE MAFE MQLI 193.1508 8.0187 3.8759 151.0290 7.1074 3.7495 79.2446 6.1546 3.6605 80.8776 5.9031 3.5754 80.4284 6.02684 3.6635 93.9384 6.6908 3.8144 Table 6.4. Out-of-sample forecasting performance of the PACD and ACD; BVT series.

6.2 Application to the UN realized volatility

The second dataset is the daily UN realized volatility (RV) that covers the sample period from January 04, 1999 to December, 31, 2008 with a total of T = 2489 observations. The plot of the index series is displayed in Figure 6.2.

Table 6.5 reports some descriptive statistics concerning the whole series and the subseries corresponding to the five trading days. It can be easily seen that these statistics strongly indicate that the distributions of realized volatility are significantly different across the trading days. This is also confirmed by the estimated kernel density of each trading day (Supplementary

(23)

Figure 6.2: Daily UN realized volatility UN (UN-RV).

material). These facts suggests using a 5-periodic PACD(1,1) model for these data.

Day Full series Mon Tue Wed Thu Fri Sample size 2489 469 511 514 504 491 Mean 1.3085 1.1674 1.2528 1.3028 1.3631 1.4511 Std 1.7699 1.4919 1.4628 1.5890 1.9073 2.2648 Kurtosis 47.1196 26.3686 13.9103 23.4364 37.6800 55.2210 Skewness 5.1689 3.9488 2.9500 3.7675 5.0477 6.0455

Table 6.5. Day-of-the-week pattern in the UN-RV series.

As a reference model, we first estimate a standard ACD(1,1) for the data. Table 6.6 shows the EQML estimates and their asymptotic standard errors in parenthesis. The results signal a high persistence near to instability.

b

ω α_b βb _bσ2 _α_b₊_βb _{IMSFE IMAFE} 0.0109

(0.0031) 0(0..28490157) 0(0..70840162) 0(0.2841.0005) 0.9933 1.1122 0.4842 Table 6.6. EQML estimates for the ACD(1,1); UN-RV series.

Table 6.7 displays the 2S-GQML estimates of the PACD(1,1) based on the UN-RV series. These estimates are quite different across the days and are all significant. Also, the persistence parameter, given byQ7_v₌₁(α_bv+βbv) = 0.8897, is significantly smaller than that obtained from the

the ACD. The ASEs of the estimates for the PACD are smaller than in the first application, since the series here is quite longer. Moreover, the PACD model outperforms the standard

(24)

ACD, according to the IMSFE and IMASE criteria. Day v σ_b2 v ωbv αbv βbv αbv +βbv IMSFE IMAFE Mon 1 0.3805 (0.0082) 0(0..01640098) 0(0.2374.0390) 0(0.6702.0417) 0.9076 Tue 2 0.2597 (0.0017) 0(0..01540084) 0(0.3320.0291) 0(0.6903.0348) 1.0222 Wed 3 0.2552 (0.0012) 0(0..00150088) 0(0.4023.0327) 0(0.6521.0354) 1.0544 Thu 4 0.2670 (0.0019) 0(0..01260078) 0(0.3052.0359) 0(0.7065.0388) 1.0117 Fri 5 0.2682 (0.0023) 0(0..08840132) 0(0.4138.0390) 0(0.4851.0442) 0.8989 All 7 Q v=1 (α_bv +βbv) 0.8897 1.0570 0.4757

Table 6.7. 2S-GQML estimates for the PACD(1,1); UN-RV series.

We finally compare the out-of-sample forecasting performance of the two models, using the same devices as before. From Table 6.8 it can be concluded that for all truncated series (with sample size Tf), the PACD gives more accurate forecasts, in terms of the MSFE and MAFE

values. Regarding the mean QLIKE criterion, the PACD is clearly the best one (except for

Tf = 1100 and Tf = 1200, where the models are almost comparable).

Tf 1100 1200 1500 1600 1800 2000 ACD MSFE MAFE MQLI 1.0803 0.3662 0.4485 1.1501 0.3644 0.4534 1.4973 0.4219 0.5903 1.6645 0.4548 0.6690 2.1461 0.5509 0.9126 2.9891 0.6966 1.1470 PACD MSFE MAFE MQLI 0.9729 0.3656 0.4564 1.0299 0.3564 0.4547 1.3543 0.4094 0.5893 1.4932 0.4353 0.6686 1.9275 0.5189 0.9074 2.7234 0.6536 1.1404 Table 6.8. Out-of-sample forecasting values for the PACD and ACD (UN-RV series).

(25)

7 Conclusion and future research

A GARCH-like model for positive-valued data with seasonal behavior has been proposed. The model consists of an ACD model with parameters evolving periodically over time. In our methodology for studying and building such a model, we considered QML estimates that are distribution free and are consistent and asymptotically Gaussian under general conditions. In particular, our proposed two-stage Gamma QMLE takes into account the periodicity of the innovation sequence, giving more accurate results compared to the exponential QMLE. The proposed estimates are also consistent and asymptotically normal for more general non-MEM forms.

The model was applied to two daily financial series with different periods (S = 7 and

S = 5) in an attempt to capture the day-of-the-week effect. A third application to daily S&P 500 volumes (with S = 5) is displayed in the Supplementary material and leads to the same conclusions.

Our model can be applied to other data frequencies, such as monthly data with S = 12 and quarterly data withS = 4. Moreover, it may also be utilized as an approximate model for count time series data with large values, such as the daily number of transactions in a market. Although our model is named periodic ACD in reference to the ACD proposed by Engel and Russell (1998), it is not recommended to model intraday durations, which are rather characterized by a (stochastic) time-varying period, due to the irregularly-spaced nature of durations. Furthermore, modeling intraday positive-valued series generally requires very large periods for which the estimation of the parameters becomes very challenging.

For models with large periods, some basis functions for reducing the number of parameters, such as Fourier approximation (Bollerslev et al, 2000; Rossi and Fantazani, 2015; Bracher and Held, 2017), periodic B-splines (Ziel et al, 2015) or periodic wavelets (Ziel et al, 2016) could be adapted to our model. These issues are left for future research.

(26)

Proofs

Proof of Theorem 3.1 i) The ipdS-driven SRE (3.1) can be embedded in the following

system of S SREs

YnS+v =AnS+vY₍n₋1)S+v+BnS+v, n ∈Z, v ∈ {0, ..., S−1}, (A.1)

where A_nS₊_v ₌ QS−1

i=0 AnS+v−i and BnS+v = PSj=0−1

Qj−1

i=0 AnS+v−iBnS+v−j are such that the

sequence _{(A_nS₊_v_,B_nS₊_v₎_{, n} _∈Z_} _is _iid _{for all} _v _{∈ {}₀_{, ..., S} ₋₁_}_{. The standard top Lyapunov}

exponentγv(S) associated with theiid-driven SRE (A.1) is given for all v ∈ {0, ..., S −1}by (cf.

Bougerol and Picard, 1992a)

γ(S) v = inf ₁ nElog A_nS₊_vA₍_n −1)S+v...AS+v , n _≥1 (A.2) = lim n_→∞ 1

nlogkAnS+vAnS+v−1...Av+1k a.s.

Since Eξv <∞ it follows that Elog+kAvk< ∞ and Elog+kBvk <∞ for all 0 ≤ v ≤ S−1.

Therefore, by Theorem 2.5 of Bougerol and Picard (1992a), equation (A.1) admits a unique nonanticipative strictly stationary and ergodic solution YnS+v, n∈Z under γ

(S)

v <0. That

solution is given for all v _{∈ {}0, ..., S₋1_} by

Y_nS₊_v = ∞ X j=0 j₋1 Y i=0 A₍_n −i)S+vB(n₋j)S+v, n∈Z, v ∈ {0, ..., S−1} (A.3)

which is exactly (3.3), where the series in equality (A.3) converges absolutely a.s. This shows that _{Yt, t∈Z} is the unique causal strictly periodically stationary and periodically ergodic

solution of (3.1). By a sandwiching argument, it is easily seen that for all v _{∈ {}0, ..., S₋1_}

γ(S)

v = lim_n

→∞

1

nlogkAnS+vAnS+v−1...Av+1k= lim_n_→∞

1

nlogkAnSAnS−1...A1k:=γ

(S)_.

ii) If model (3.1) admits a nonanticipative strictly periodically stationary solution_{Yt, t∈Z}

then from the non-negativity of the coefficients ofAt in (3.1) it follows that for all k >1,

Yv ≥ k X j=0 j−1 Y i=0 Av−iBv−j, a.s.,

(27)

so the series P∞

j=0

j_Q₋1

i=0

Av−iBv−j converges a.s. Therefore j_Q₋1

i=0

Av−iBv−j → 0 a.s. as j → ∞, from

which we have to show that

j_Q−1

i=0

Av₋i →0 a.s. as j → ∞.This holds whenever

lim

j_→∞ j₋1

Y i=0

Av₋iem = 0, a.s. for all 1≤m≤r, (A.4)

wherer =p+qand (em)₁_≤_m_≤_ris the canonical basis ofRr. SinceAthas the same zero-structure

as the matrix At in Bougerol and Picard (1992b), then (A.4) follows from their results using

the same argument.

ii) By the nonnegativity of _{At, t∈Z} we have

γS₍_A₎ ≥γS₍_β_{) := log}_ρ S_Y₋1 v=0 βS₋v ! . (A.5)

If (3.1) admits a strictly periodically stationary solution then γS₍_A₎_< _{0. In view of (}_A._{5), it}

follows that

γS₍_β₎_<₀ ₍_A.₆₎

which in turn implies (3.4).

Proof of Theorem 3.2 Theorem 3.2 is a particular case of Theorem 3.3, ii).

Proof of Theorem 3.3 i) The proof is similar to the one of Lemma 2.3 of Berkes et al

(2003). See Supplementary material. ii) Define nYet, t∈Z o by      e Yt=AtYet₋1+Bt t≥1 e Yt= 0 t ≤0, (A.7) and let Y(v) ₍₀

≤v _≤S₋1) be a random variable having the same distribution as the term

YnS+v of the unique periodically stationary solution given by (3.1). By construction YenS+v

L

→

Y(v)asn _{→ ∞}. Letm = 2. From the weak convergence theory (Billingsley, 1968), to show that

EvecY(v)Y(v)′is finite for allv, it is sufficient to show that lim inf

n→∞E vecYe′ nS+vYenS+v <

∞ for all v. Set VnS+v =E

vecYe′

nS+vYenS+v

(28)

S-periodic difference equation VnS+v =E A⊗v2 VnS+v−1+ [E(Av⊗Bv) +E(Bv ⊗Av)]E e YnS+v +vec(E(BvBv′)) (A.8) where E A⊗t2

, E(At⊗Bt) and vec(E(BtBt′)) are finite S-periodic matrices over t. Since, the

last two terms of the right-hand side of (A.8) are bounded, it follows that lim

n_→∞VnS+v exists for

every 1_≤v _≤S as long as (3.8) holds, from which follows the proof form= 2. For general m, the proof is similar.

Before giving the proof of Proposition 3.1, we need to state the following well-known result on linear ordinary periodic difference equations. Let

yt=atyt₋1+bt, t∈Z, (A.9)

be an ordinary difference equation with S-periodic positive coefficients at = at+S > 0 and

bt=bt+S >0 for all t∈Z.

Lemma 1 The real-valued ordinary difference equation (A.9) admits a unique solution

{yt, t∈Z} if and only if

S Y v=1

av <1.

Proof of Proposition 3.1 It is well-known that if Yt|Ft₋1 ∼ Γ

1 σ2 t, 1 σ2 tψt then the conditional moments up to the fourth order are given by

E(Yt|Ft−1) = ψt (A.10a) E Y_t2_|F_t₋₁₌ _{1 +}_σ2 t ψ_t2 (A.10b) E Yt3|Ft₋1 = 1 +σ2t 1 + 2σt2 ψt3 (A.10c) E Yt4|Ft−1= 1 +σt2 1 + 2σ2t 1 + 3σt2 ψt4. (A.10d)

In view of (A.10) it turns out that the conditional moment of Yt of order i is a polynomial

in ψt with degree i (i= 1,2,3,4). HenceE(Yti)<∞ if and only if E(ψti)<∞ (i= 1,2,3,4),

conditions for which are given as follows.

(29)

S-periodic difference equation

E(ψt) = (αt+βt)E(ψt−1) +ωt, t∈Z. (A.11)

By Lemma 1, there is a unique solution of (A.11) if and only if (3.6) holds. ii) For the existence of the second moments E(Y2

v) (1 ≤ v ≤ S), expanding E(ψ2t) using

(2.1b) and (3.10a)-(3.10b), we find the following linear periodic difference equation

E ψ2t = α2tE ξt2₋1 + 2αtβt+βt2 E ψt2₋1 +Kt(1), t∈Z, (A.12) where Kt(1) = (2αtωt+ 2βtωt)E(ψt−1) +ωt2

is finite if and only if E(ψt−1) < ∞, and thus if and only if (3.6) holds. By Lemma 1, there

exists a unique solution to (A.12) if and only if (3.6) and (3.9) are satisfied. iii) Expanding E(ψ3

t) using (2.1b) and (A.10a)-(A.10c) we get the following linear periodic

difference equation E ψ3_t= α_t3E ξ_t3₋₁+ 3α2_tβt σ2t−1+ 1 +β_t3+ 3αtβt2 E ψ_t3₋₁+Kt(2), t∈Z (A.13) where Kt(2) = 3α2tωtE ξt2−1 + 6αtβtωt+ 3βt2ωt E ψ_t2₋₁+ 3βtω2t + 3αtωt2 E(ψt₋1) +ωt3

is finite if and only if E ψ2

t₋1

<_∞ and E(ψt−1)<∞. By Lemma 1, equation (A.13) admits

a unique solution if and only if (3.6), (3.9) and (3.10) hold. iv) Expanding E(ψ4

t) using (2.1b) and (A.10a)-(A.10d) we get the following linear periodic

difference equation E ψ4t = α4tE ξt4₋1 + 4α3tβt 1 +σ2t₋1 1 + 2σt2₋1 + 6α2tβt2 1 +σt2₋1 + 4αtβt3+βt4 E ψt4₋1 +Kt(3) (A.14)

(30)

where Kt(3) = 4α3tωtE Yt3₋1 + 12α2 tβtωtE Yt2₋1ψt₋1 + 12αtβt2ωtE Yt₋1ψt2₋1 +4β3 tωtE ψ3t₋1 + 6α2 tωt2E Yt2₋1 + 12αtβtωt2E(Yt₋1ψt₋1) + 6βt2ωt2E ψt2₋1 +4αtωt3E(Yt₋1) + 4βtωt3E(ψt₋1) +ωt4

is finite under (3.6), (3.9) and (3.10). By Lemma 1, equation (A.14) admits a unique positive solution if and only if (3.6), (3.9), (3.10) and (3.11) hold.

Proof of Theorem 4.1 Theorem 4.1 will be proved by showing several lemmas below. In what follows M >0 and ρ_∈ (0,1) stand for constants that are not necessarily the same when appearing in different terms. Let LT (θ) and lt(θ) be defined in the same way as LeT(θ) and elt(θ) in (4.3a) and (4.3b), respectively, with ψt(θ) in place of ψet(θ).

Lemma 1 Under A1 and A2 we have

sup θ_∈Θ LT (θ)−LeT (θ) →0 a.s. asT _{→ ∞}.

Proof Rewrite (4.2b) in a vector form as follows

ψ_t=βtψ_t −1+αt, t∈Z, (A.15) whereψ_t= (ψt(θ), ψt₋1(θ), ..., ψt₋p+1(θ))′ and αt= ωt+ q P i=1 αtiYt₋i,0, ...,0 ′ 1×p . By A1 and the assumption A2 of compactness of Θ it follows that

sup θ_∈Θ ρ S_Y₋1 v=0 βS₋v ! <1. (A.16) Iterating (A.15) gives ψ_t= t−1 X k=0 k_Y−1 i=0 βt−iαt−k+ t Y i=0 βt−iψ₀, t∈Z.

Denote by ψe_tand α_e_t the vectors obtained fromψ_t and α_t, respectively, while replacing ψt₋j(θ)

byψet₋j with fixed initial values. From (A.15) and (A.16) we thus get

sup θ∈Θ ψ_t₋ψe_t= sup θ∈Θ t−1 X k=t₋q k−1 Y i=0 βt−i αt₋k−αet₋k + t−1 Y i=0 βt−i ψ₀₋ψe₀ ≤M ρ t_. ₍_A.₁₇₎

(31)

Using the inequality log_xy _≤ _min(|y−_y,xx|₎ for positive x and y (cf. Francq and Zakoian, 2019) it follows that sup θ∈Θ LT (θ)−LeT (θ) ≤ T1 T X t=1 sup θ∈Θ 1 σ2 t hψet−ψt(θ) e ψtψt(θ) Yt+ logψt(θ) e ψt(θ) i ≤ max 1≤v≤Ssup_θ_∈_Θ ω−2 v σ2 v M T T X t=1 ρtYt+ max 1≤v≤Ssup_θ_∈_Θ ω−2 v σ2 v M T T X t=1 ρt. The existence ofE Yδ t

(cf, Theorem 3.3, i)) implies, by the Borel-Cantelli lemma, thatρt_Y t→

0 a.s. and the conclusion follows by C´esaro’s lemma.

Lemma 2Under A1-A4there is t_∈Z _{such that} _ψ_t₍_θ_{) =} _ψ_t₍_θ₀₎_a.s._{if and only if} _θ ₌_θ_0.

Proof From the assumption ρ

_S₋₁ Q v=0

βS−v

<1 inA1, the polynomials (βv(L))v are

invert-ible for all 1 _≤ v _≤ S and all θ _∈ Θ. Assume ψt(θ) = ψt(θ0) a.s. for some t ∈ Z. Using the

second equality in (4.1) and (4.2b) we have

_α v(L) βv(L) − α0 v(L) β0 v(L) Yv+nS = _ω0 v β0 v(L)− ωv βv(L) for all 1_≤v _≤S. If αv(L) βv(L) 6= α0 v(L) β0

v(L) for some 1 ≤ v ≤ S then there exists a deterministic periodic time-varying

combination of Yt₋j, j ≥ 1. This contradicts A4 which assumes (ξt, t ∈Z) non-degenerate,

since by (2.6) we have Yt=E(Yt|Ft₋1) +ψt(ξt−1). Therefore,

αv(z) βv(z) = α0 v(z) β0 v(z) ∀ |z| ≤1 and ω0 v β0 v(L) − ωv βv(L) for all 1≤v ≤S,

and by the assumption A3 of no common roots between α0

v(z) and βv0(z) it follows that

αv(z) = α0v(z), βv(z) =βv0(z) and ωv =ω0v for all 1≤v ≤S.

Lemma 3 Under A1 S X v=1 E(lv(θ0))<∞, and PS_v₌₁E(lv(θ)) is minimized at θ =θ0.

Proof By Jensen’s inequality and Theorem 3.3, ii) we have

S X v=1 E(logψv(θ0)) = 1_δ S X v=1 Elogψv(θ0) δ ≤ 1 δ S X v=1 logEψv(θ0) δ <_∞.

(32)

Hence S X v=1 E(lv(θ0)) = S X v=1 1 σ2 vE[ξv + logψv(θ0)] = S X v=1 1 σ2 v + S X v=1 1 σ2 vE(logψv(θ0))<∞.

Using the inequality log (x)_≤x₋1 we have for all θ_∈Θ

S X v=1 E(lv(θ))− S X v=1 E(lv(θ0)) = S X v=1 1 σ2 vE h logψv(θ) ψv(θ0) + ψv(θ0) ψv(θ) −1 i ≥ S X v=1 1 σ2 vE h log ψv(θ) ψv(θ0) + log ψv(θ0) ψv(θ) i = 0, (A.18)

showing that PS_v₌₁E(lv(θ)) is minimized at θ0.

Lemma 4 For any θ₆=θ0 there exists a neighborhood V (θ) such that

lim N_→∞inf inf_θ_∈V₍θ) e LN S θ > 1 S S X v=1 E(lv(θ0)).

Proof For all θ _∈Θ and any positive integer k, let V_k₍_θ_{) be the open ball of center} _θ _and

radius 1/k. In view of Lemma 1 we have

lim N→∞infθ∈Vinf_k₍_θ₎_∩_Θ e LT θ ≥ lim N→∞infθ∈Vinf_k₍_θ₎_∩_ΘLT θ − lim N→∞inf infθ∈Θ LT (θ)−LeT (θ) ≥ lim N→∞inf 1 N N_X₋1 n=0 1 S S X v=1 inf θ∈V_k(θ)∩Θlv+nS θ .

By the ergodic theorem for the stationary sequencenPS_v₌₁lv+nS θ o nwithE PS v=1lv+nS θ ∈

R_{∪ {∞}} _{(cf, Billingsley 1995, p. 495) it follows that}

lim N_→∞inf 1 N N₋1 X n=0 1 S S X v=1 inf θ∈V_k(θ)∩Θlv+nS θ = _S1 S X v=1 E inf θ∈V_k(θ)∩Θlv θ .

Beppo-Levi’s theorem (e.g. Billingsley, 1995, p. 219) yields

1 S S X v=1 E inf θ∈V_k(θ)∩Θlv+nS θ → S1 S X v=1 E(lv(θ)) as k → ∞,

(33)

Proof of Theorem 4.1 The proof of the theorem is completed by standard compactness arguments using Lemmas 2-4.

Proof of Theorem 4.2 The proof of Theorem 4.2 is based on a Taylor expansion of ∂LeT(θ)

∂θ

at θ0 which, byA5 and the strong consistency ofθbG, yields

0 = √N∂LeT_∂θ(θbG) =√N∂LT(θ0) ∂θ + √ N∂2LT(θ∗) ∂θ∂θ′ b θG−θ0 +√N ∂LeT(θbG) ∂θ − ∂LT(θbG) ∂θ where θ∗ _{is between} _θ_b

G and θ0. The derivatives ∂L_∂θT(θ) and ∂

2_L T(θ) ∂θ∂θ′ are given by ∂LT(θ) ∂θ = 1 T T X t=1 ∂lt(θ) ∂θ = 1 T T X t=1 1₋ Yt ψt(θ) 1 σ2 tψt(θ) ∂ψt(θ) ∂θ ∂2_L T(θ) ∂θ∂θ′ = 1 T T X t=1 h 1₋ Yt ψt(θ) 1 σ2 tψt(θ) ∂2_ψ t(θ) ∂θ∂θ′ + 2Yt ψt(θ) −1 1 σ2 tψ 2 t(θ) ∂ψt(θ) ∂θ ∂ψt(θ) ∂θ′ i .

Therefore, the asymptotic normality result (4.6) follows whenever the following lemmas are established. Lemma 5 Under A1-A2 i) sup θ_∈V(θ0) √ N∂LT(θ) ∂θ − ∂LeT(θ) ∂θ _N→p →∞ 0, ii) θ_∈supV(θ0) √ N∂2LT(θ) ∂θ∂θ′ − ∂2_L_e T(θ) ∂θ∂θ′ _N→p →∞ 0

for some neighborhood V (θ0) of θ0.

Proof Following the same lines of Francq and Zakoian (2019, Section 7.4), it is easily seen that under A1, Eθ0 σ2 1 vψv(θ) ∂ψv(θ0) ∂θ <1,Eθ0 σ2 1 vψv2(θ) ∂2_ψ v(θ0) ∂θ∂θ′ <1,Eθ0 σ2 1 vψ2v(θ) ∂ψv(θ0) ∂θ ∂ψv(θ0) ∂θ′ <1. (A.19)

By (A.17), the compactness of Θ (cf. A2) and the fact thatρ

_S₋₁ Q v=0 βS₋v <1 (cf. A1) we have ψet1(θ) − 1 ψt(θ) ≤ ψM ρt(θt) ψt(θ) e ψt(θ)(1 +M)ρ t_, sup θ_∈Θ ∂ψt(θ) ∂θ − ∂ψet(θ) ∂θ ≤ M ρt_{, sup} θ_∈Θ ∂2ψt(θ) ∂θ∂θ′ − ∂2_ψ_e t(θ) ∂θ∂θ′ ≤M ρt_,

(34)

and sup θ_∈V(θ0) √ N∂LT(θ) ∂θ − ∂LeT(θ) ∂θ = sup θ_∈V(θ0) 1 S√N T X t=1 h 1 e ψt(θ) − 1 ψt(θ) Yt σ2 tψt(θ) ∂ψt(θ) ∂θ +_σ12 t 1₋ Yt ψt(θ) 1 ψt(θ) − 1 e ψt(θ) _∂ψ t(θ) ∂θ + 1₋ Yt ψt(θ) 1 σ2 tψet(θ) _∂ψ t(θ) ∂θ − ∂ψet(θ) ∂θ i ≤ M S√N S X v=1 N_X₋1 n=0 ρn(1 +ξv+nS) 1 + _ψ 1 t(θ0) ∂ψt(θ0) ∂θ . (A.20) Therefore, (A.20) and the Markov inequality implies that for allε >0,

P √1 N S X v=1 N₋1 X n=0 ρn(1 +ξv+nS) 1 + _ψ 1 t(θ0) ∂ψt(θ0) ∂θ > ε ! ≤ 2ε 1 +Eθ0 1 + _ψ 1 t(θ0) ∂ψt(θ0) ∂θ √1 N N_X₋1 n=0 ρn_→0 as N _→0,

from which the result i) follows. The same argument shows result ii). Lemma 6 Under A1-A6, ∂2_L T(θ∗) ∂θ∂θ′ p → N→∞ 1 sJ θ0, σ 2

for any θ∗ _between _θ_b

G and θ0.

Proof LetVk(θ0) (k ∈N∗) be the open ball with centerθ0 and radius 1/k. Assume thatn is large enough so that θ∗ _{belongs to} _V

k(θ0). By periodic stationarity and periodic ergodicity

of nsupθ_∈Vk(θ0) ∂2lt(θ) ∂θi∂θj −E _∂2_l t(θ0) ∂θi∂θj o we have ∂2LT(θ∗) ∂θi∂θj −J θ0, σ 2 i,j = ∂2LT(θ∗) ∂θi∂θj −E ∂2_L T(θ0) ∂θi∂θj ₌ 1 T T X t=1 ∂2_l t(θ∗) ∂θi∂θj −E ∂2_l t(θ0) ∂θi∂θj ≤ N S1 T X t=1 sup θ_∈Vk(θ0) ∂2lt(θ) ∂θi∂θj −E _∂2_l t(θ0) ∂θi∂θj a.s. → N→∞ 1 S S X v=1 E sup θ_∈Vk(θ0) ∂2lv(θ) ∂θi∂θj −E _∂2_l v(θ0) ∂θi∂θj !_.

The Lebesgue dominated convergence theorem yields

lim k_→∞E _θ_∈sup_V k(θ0) ∂2lv(θ) ∂θi∂θj −E _∂2_l v(θ0) ∂θi∂θj !₌_E _lim k_→∞_θ_∈sup_V k(θ0) ∂2lv(θ) ∂θi∂θj −E _∂2_l v(θ0) ∂θi∂θj !_{= 0}_,

(35)

Lemma 7 Under A1-A6 √ N∂LT(θ0) ∂θ L → N_→∞ N 0, 1 S2I θ0, σ 2_.

Proof It is clear that √N∂LT(θ0)

∂θ = T P t=1 1 S√N ∂lt(θ0)

∂θ is a term of a square integrable periodically

ergodic Martingale. Since by the periodic ergodic theorem (cf. Supplementary material)

1 N T X t=1 ∂lt(θ0) ∂θ ∂lt(θ0) ∂θ′ = 1 N T X t=1 (1₋ξt)2 _σ4 1 tψ2t(θ0) ∂ψt(θ) ∂θ ∂ψt(θ) ∂θ′ a.s. → N_→∞I θ0, σ 2_,

the result thus follows from the martingale central limit theorem (e.g. Billingsley, 1995). Proof of Theorem 4.3 Set Uv,n(θ) = (

Yv+nS−ψv+nS(θ))2

ψ2

v+nS(θ) , and denote by oa.s.(1) a term

converging almost surely to 0 as N _{→ ∞}. If we show

b σv2 = N1 N₋1 X n=0 Uv,n(θ0) +oa.s.(1) (A.21)

then the result (4.9a) would follow from standard arguments. Now a Taylor expansion of Uv,n

b θG around θ0 yields b σv2 = N1 N_X₋1 n=0 Uv,n b θG = _N1 N₋1 X n=0 Uv,n(θ0) + b θG−θ0 1 N N_X₋1 n=0 ∂Uv,n(θ∗) ∂θ (A.22) where θ∗ _{is between} _θ_b

G and θ0. Note that

∂Uv,n(θ) ∂θ = 2 _Y2 v+nS ψ2 v+nS(θ) − Yv+nS ψv+nS(θ) 1 ψv+nS(θ) ∂ψv+nS(θ) ∂θ .

Following the same lines of Francq and Zakoian (2019, p. 197) it can be easily seen that

E sup θ∈V(θ0) Y2 v+nS ψ2 v+nS(θ) ! <_∞,E sup θ∈V(θ0) Yv+nS ψv+nS(θ) ! <_∞, E sup θ∈V(θ0) ψv+nS1 (θ) ∂ψv+nS(θ) ∂θ ! <_∞

(36)

we get lim sup N_→∞ N− 1 N_X₋1 n=0 ∂Uv,n(θ∗) ∂θ ≤ lim supN_→∞ N−1 N_X₋1 n=0 sup θ∈V(θ0) ∂Uv,n(θ) ∂θ = E sup θ∈V(θ0) ∂Uv,n(θ) ∂θ ! <_∞. Thus b θG−θ0 1 N N_X₋1 n=0 ∂Uv,n(θ∗) ∂θ =oa.s.(1),

and in view of (A.22) we obtain (A.21). Result (4.9b) follows by a similar argument.

References

Aknouche, A. and Francq, C. (2020). Count and duration time series with equal conditional stochastic and mean orders. Econometric Theory, forthcoming.

Aknouche, A. and Francq, C. (2019). Two-stage weighted least squares estimator of the conditional mean of observation-driven time series models. MPRA paper 97382.

Aknouche, A., Demmouche, N., Dimitrakopoulos, S. and Touche, N. (2020). Bayesian analysis of periodic asymmetric power GARCH models. Studies in Nonlinear Dynamics and

Econometrics, forthcoming.

Ambach, D. and Croonenbroeck, C. (2015). Obtaining superior wind power predictions from a periodic and heteroscedastic wind power prediction tool. In Stochastic Models, Statistics and

Their Applications, edt, 225–232.

Ambach, D. and Schmid, W. (2015). Periodic and long range dependent models for high frequency wind speed data. Energy 82, 277–293.

Andersen, T. and Bollerslev, T. (1997). Intraday periodicity and volatility persistence in financial markets. Journal of Empirical Finance 4, 115–158.

Bhogal, S.K. and Variyam, R.T. (2019). Conditional duration models for high-frequency data: A review on recent developments. Journal of Economic Surveys 33, 252–273.

Billingsley, P. (1999). Convergence of probability measures. 2nd edition, Wiley, New York. Billingsley, P. (1995). Probability and measure. 3rd edition, Wiley, New York.

(37)

Bollerslev, T., Cai, J. and Song, F.M. (2000). Intraday periodicity, long memory volatil-ity, and macroeconomic announcement effects in the US Treasury bond market. Journal of

Empirical Finance 7, 37–55.

Bougerol, P. and Picard, N. (1992a). Strict stationarity of generalised autoregressive pro-cesses. Annals of Probability 20, 1714–1730.

Bougerol, P. and Picard, N. (1992b). Stationarity of GARCH processes and some nonneg-ative time series. Journal of Econometrics 52, 115–127.

Boynton, W., Oppenheimer, H.R. and Reid, S.F. (2009). Japanese day-of-the-week return patterns: New results. Global Finance Journal 20, 1–12.

Bracher, J. and Held, L. (2017), Periodically stationary multivariate autoregressive models.

arXiv preprint, arXiv:1707.04635.

Caporin, M., Rossi, E., and Santucci De Magistris, P. (2017). Chasing volatility: a persistent multiplicative error model with jumps. Journal of Econometrics 198, 122–145.

Charles, A. (2010). The day-of-the-week effects on the volatility: The role of the asymmetry.

European Journal of Operational Research 202, 143–152.

Chen, M. and An, H.Z. (1998). A note on the stationarity and the existence of moments of the GARCH model. Statistica Sinica 8, 505-510.

Chou, R.Y. (2005). Forecasting financial volatilities with extreme values: The conditional autoregressive range (CARR) Model. Journal of Money, Credit, and Banking 37, 561–582.

Diebold, F. (1986). Modeling the persistence of conditional variances: A comment.

Econo-metric Reviews 5, 51–56.

Engle, R. (2002). New frontiers for Arch models. Journal of Applied Econometrics 17, 425–446.

Engle, R. and Russell, J. (1998). Autoregressive conditional duration: A new model for irregular spaced transaction data. Econometrica 66, 1127–1162.

Francq, C. and Zakoian, J.-M. (2019). GARCH models: structure, statistical inference and

financial applications. John Wiley & Sons, 2nd edt.

Francq, C., Roy, R. and Saidi, A. (2011). Asymptotic Properties of Weighted Least Squares Estimation in Weak PARMA Models. Journal of Time Series Analysis 32, 699–723