Robust GMM Tests for Structural Breaks

(1)

Working Paper Series

_______________________________________________________________________________________________________________________

National Centre of Competence in Research Financial Valuation and Risk Management

Working Paper No. 66

Robust GMM Tests for Structural Breaks

Patrick Gagliardini Fabio Trojani Giovanni Urga

First version: May 2003 Current version: June 2003

This research has been carried out within the NCCR FINRISK project on

“Interest Rate and Volatility Risk”.

___________________________________________________________________________________________________________

(2)

First Version: May 2003. This Version: June 2003

Robust GMM Tests for Structural Breaks

^∗

Abstract

We propose a class of new robust GMM tests for endogenous structural breaks. The tests are based on supremum and average statistics derived from robust GMM estimators with a bounded influence function.

They imply a bounded linearized asymptotic bias of size and power under local model misspecifications.

We illustrate the trade-oﬀ between robustness and eﬃciency of the new robust tests in some Monte Carlo simulation where we find that the performance of classical GMM tests is highly unstable under local model misspecifications. In particular, the loss in power can be very substantial in some models. Similarly, the size of the classical tests can be very unstable, especially in models based on nonlinear orthogonality functions. Robust testing procedures yield important power improvements under a misspecified model while under an exact reference model they yield a performance comparable to the one of classical tests.

Moreover, robust procedures stabilize the size under a model misspecification for those models where important size instabilities of the classical tests have been observed. Finally, classical testing procedures based on the supremum functional seem to be less stable than those based on the average, implying larger power improvements of robust procedures in this case when compared with the case of an average statistic.

Keywords: Robust Tests, Structural Breaks, Monte Carlo J.E.L. Classification Numbers: C10, C12, C13, C15.

Patrick Gagliardini, Institute of Finance, University of Southern Switzerland, Via Buﬃ 13,

CH-6900 Lugano, e-mail: [email protected].

Fabio Trojani, Institute of Finance, University of Southern Switzerland, Via Buﬃ 13,

CH-6900 Lugano, e-mail: [email protected].

Giovanni Urga, Faculty of Finance, Cass Business School, 106 Bunhill Row, London EC1Y 8TZ (U.K.).

Tel.: +/44/20/70408696, Fax.: ++/44/20/70408885, e-mail: [email protected]

http://www.staﬀ.city.ac.uk/~giourga.

∗We wish to thank Loriano Mancini, Claudio Ortelli, Paolo Porchia and participants to the International Con- ference on MODELLING STRUCTURAL BREAKS, LONG MEMORY AND STOCK MARKET VOLATILITY (held in London at Cass Business School, 6-7 December 2002) for discussions and helpful comments. G. Urga wishes to thank ESRC under Grant number R000238145 (”New methods for detecting multiple structural breaks in time series ”) for funding this research. Patrick Gagliardini and Fabio Trojani gratefully acknowledge the financial support of the Swiss National Science Foundation (grant 12-65196.01 and NCCR FINRISK).

(3)

1 Introduction

We propose a class of new Generalized Method of Moments (GMM, Hansen (1982)) tests for endogenous structural breaks (Andrews (1993)) that are robust to local model misspecifications of a given reference model. GMM based test statistics defining tests for structural breaks are typically obtained as the supremum, the average or some related functional of sequences of quadratic GMM statistics (see also Andrews and Ploberger (1994)), each being asymptotically chi-square distributed under the null of no break¹ . Such GMM functionals are typically evaluated at some GMM model parameter estimate, conditionally on a given break date. A general GMM statistical functional (as for instance a GMM estimator or the size/power of a GMM test) is robust to local model misspecifications if and only if it is based on a GMM model with a bounded orthogonality function. Moreover, non robust quadratic GMM statistics can be robustified by applying a weighted orthogonality function that bounds optimally the influence of a general model misspecification (see Ronchetti and Trojani (2001)). This defines robust GMM (RGMM) estimation and testing procedures of simple parametric hypotheses in a fairly general GMM setting. In this paper, we propose a class of robust supremum and average tests for structural breaks, which are defined over sequences of quadratic RGMM statistics based on a bounded orthogonality function.

The need for robust statistical procedures for estimation and testing has been stressed by many authors and is now widely recognized (see for instance, Hampel (1974), Koenker and Bassett (1978), Huber (1981), Peracchi (1990, 1991), Heritier and Ronchetti (1994), Krishnakumar and Ronchetti (1997), Ronchetti and Trojani (2001), Ortelli and Trojani (2002) and Mancini, Ronchetti and Trojani (2003)). In particular, it is now well-known that a necessary robustness requirement for a Maximum-Likelihood (ML)-type GMM test based on an asymptotically chi-square distributed statistic is a bounded influence function (Hampel (1974)) of the GMM estimator defining the statistic. Thus, GMM estimators with unbounded influence function do not imply robust inference

1 Asymptotic critical values for GMM tests of structural breaks are derived by the Functional Central Limit Theorem and have typically to be computed by simulation; see for instance Andrews (1993) and Andrews (2003).

Analytical approximations have been proposed in Hansen (1997).

(4)

procedures for tests based on ML-type, asymptotically chi-square distributed, GMM statistics.

This implies that tests for structural breaks defined via functionals of sequences of such statistics cannot ensure the local robustness of inferences on structural breaks. Therefore, we construct supremum and average tests for breaks using sequences of RGMM statistics with bounded influence function. In the GMM setting, a bounded influence function of a test statistic is equivalent to the boundedness of the given orthogonality function. Therefore, RGMM statistics are obtained by truncating appropriately the unbounded orthogonality function of a nonrobust GMM model setting; see again Ronchetti and Trojani (2001).

We quantify the performance of our robust testing procedures for structural breaks in some Monte Carlo experiments of a few GMM model settings. This issue has been so far largely unexplored in the literature. For the estimation problem, Fiteni (2002) derived the asymptotic properties of a robust break date estimator defined through the supremum functional over a sequence of robustified loss functions. These results apply to a standard linear regression model setting. To our knowledge, the robust testing problem has been not addressed so far in the literature. We propose a class of general RGMM tests, applying to linear and nonlinear model settings, and quantify the contribution of robustness when testing for breaks in a few Monte Carlo simulations.

Our findings are as follows. First, the performance of classical GMM tests is highly unstable under local model misspecifications. In particular, the loss in power in the presence of local departures from conditional normality can be substantial in some models. Similarly, the size of the classical GMM tests can be very unstable, especially in GMM models based on nonlinear orthogonality functions. Second, robust testing procedures for structural breaks yield important power improvements under a locally misspecified model. At the same time, under an exact reference model, they have power properties comparable to those of the classical tests. Third, robust procedures stabilize the size under a model misspecification for those models where important size instabilities of the classical tests have been observed. Finally, classical testing procedures

(5)

based on the supremum functional seem to be less stable than those based on the average, implying larger improvements of robust procedures upon classical methodologies, when applied to supremum functionals.

The rest of the paper is organized as follows. Section 2 reviews Andrews (1993) GMM testing setting for structural breaks. Section 3 introduces robust GMM tests for structural breaks and studies formally their local robustness properties in neighborhoods of a reference model. Section 4 analyzes by Monte Carlo simulations the empirical properties of the new robust tests in linear and nonlinear GMM testing settings, while Section 5 concludes and gives suggestions for further developments.

2 GMM tests for structural breaks

We briefly review Andrews (1993) GMM testing setting for structural breaks and we write the relevant statistics as functionals on a suitable set of probability distributions. This formalism will allow us to analyze in a second step the robustness properties of GMM tests for structural breaks. In the following we adopt the symbol =⇒ to denote weak convergence in the sense of Pollard (1984, pp. 64-66) for sequences of random elements of a space of bounded Euclidean valued cadlag functions on Π ⊂ [0, 1], equipped by the supremum norm topology and by the corresponding Borel sigma algebra. The symbol →d denotes convergence in distribution, ∇ denotes the gradient operator, B¡

R^k¢

is the Borel sigma algebra on R^k, k·k is the Euclidean norm.

2.1 Hypotheses of structural changes

We consider a parametric model indexed by parameters (β_t, δ0) ∈ Θ = B × ∆ ⊂ R^p× R^q, for t = 1, 2, .., and test the null hypothesis of parameter stability:

H₀: β_t= β₀ for all t ≥ 1 and some β0∈ B ⊂ R^p . (1)

(6)

Several alternative hypotheses may be of interest in the present setting. The simple one time change alternative with known change point² π ∈ Π ⊂ (0, 1) is given by:

H1T(π) : β_t=









β₁(π) for t = 1, .., T π β₂(π) for t = T π + 1, .., T

, (2)

for some constant vectors β₁(π), β₂(π) ∈ B. A natural alternative where the change point π ∈ Π ⊂ (0, 1) is unknown is:

HA(Π) = [

π∈Π

H1T(π) .

In this case one tests for the presence of a break in the known interval Π. Finally, when applying tests for structural breaks as general diagnostics tools, a natural alternative may be

H₁: β_s6= βt for some s, t ≥ 1 .

Although this hypothesis is more general than ∪Π⊂(0,1)H_A(Π) the robust GMM tests for structural breaks considered in this paper have power³ also against H₁.

2.2 Full sample GMM estimators

Let W = {Wt: t ≥ 1} be a stochastic process with values in W ⊂ R^k, defined on a measur- able space (Ω, F), and let m : R^k × Θ → R^υ be an orthogonality function⁴ . GMM tests for structural breaks can be defined based on full sample or partial sample GMM estimators (see An- drews (1993)). Since in Section 2.3 below we will focus on testing procedures based on Lagrange Multiplier (LM) statistics, we can consider here for brevity only the first class of GMM estimators.

A full sample GMM (FS-GMM) estimator eθ = µeβ

0

, eδ

0¶0

is the asymptotic functional solution of the orthogonality condition

0 = lim

T→∞

1 T

XT t=1

EP[m (Wt, β, δ)] , (3)

2For technical reasons Π is assumed to be a closed set (see also Andrews (1993)).

3See Andrews (1993) and Andrews and Ploberger (1994).

4 For brevity and easy notation we focus on the exactly identified case (ν = p + q), even if the overidentified case can be treated along the same lines.

(7)

where P is a probability measure on (Ω, F) such that the solution of (3) is well-defined and unique under the null (1)⁵ . In the following it will be convenient to work with the finite dimensional distributions Pt of Wt, defined by

P_t(A) = P (W_t∈ A) , for any A ∈ B¡

R^k¢

and t ≥ 1. Defining P^T = _T¹ XT t=1

Pt, we impose for the rest of the paper the following asymptotic stationarity assumption, ensuring existence of a weak limit P_∞ for the sequence©

P_T : T ≥ 1ª .

Assumption 1 There exists a probability measure P_∞ on¡ R^k, B¡

R^k¢¢

such that P_∞is the weak limit of©

PT : T ≥ 1ª :

PT → P∞ , weakly as T → ∞.

If the functional Q 7→ EQ[m (W, β, δ)] is weakly continuous for any (β, δ) ∈ Θ, equation (3) can be written in the equivalent form

0 = E_P

∞[m (W, β, δ)] , (4)

where P_∞= limT→∞PT. A suﬃcient condition for the weak continuity of Q 7→ E^Q[m (W, β, δ)]

is the boundedness of the orthogonality function m. In fact, boundedness of m is precisely the condition that is required in order to ensure the local robustness of a general GMM estimator; see also Section 3 below. The solution eθ¡

P_∞¢ :=³

eβ¡ P_∞¢⁰

, eδ¡ P_∞¢⁰´0

of (4) for any suitable P_∞ defines the asymptotic functional structure of the FS-GMM estimator eθ =

µeβ

0

, eδ

0¶₀ .

To define a FS-GMM estimator associated with a sample WT := {Wt: 1 ≤ t ≤ T } let PW^T :=

1 T

PT

t=1δWt be the empirical distribution of W^T, where δWt is the measure with point mass at Wt. Under standard regularity conditions one has P_W_T → P∞ weakly, P -almost surely, as T → ∞. The finite sample FS-GMM estimator is bθ :=

µβb

0

, bδ

0¶0

:=³ βe¡

P_W_T¢⁰ , eδ¡

P_W_T¢⁰´0

, i.e.

the solution of (4) for P_W_T. Specifically, this gives the finite sample estimating equations 0 = E_P

WT

h m³

W, bβ, bδ´i

= 1 T

XT t=1

m³

Wt, bβ, bδ´ .

5This defines implicitly a domain for the functional eθ.

(8)

Under the null hypothesis (1) of no break, weak convergence under P of a finite sample FS-GMM estimator bθ to eθ¡

P_∞¢

can be ensured with Assumption 1 in Andrews (1993). We report this result in the next theorem using a functional notation.

Theorem 2 Under regularity conditions on (Θ, m, P ) and under the null hypothesis (1) (see An- drews (1993), Assumption 1 p. 830), every sequencen

bθ : T ≥ 1o

of FS-GMM estimators satisfies

√T³ bθ − eθ¡

P_∞¢´

→^dN⁰¡ P_∞¢−1

S (P )¹²B (1) , where the matrix

N¡ P_∞¢

=£ M¡

P_∞¢ M_δ¡

P_∞¢ ¤ , is given by

M¡ P_∞¢

= E_P

∞

h∇β⁰m³ W, eθ¡

P_∞¢´i , M_δ¡

P_∞¢

= E_P

∞

h∇δ⁰m³ W, eθ¡

P_∞¢´i ,

and where {B (π) : π ∈ [0, 1]} is a standard Brownian motion in R^υ. The matrix S (P ) is given by:

S (P ) = lim

T→∞V ar_P Ã 1

√T XT t=1

m³ W_t, eθ¡

P_∞¢´! .

For simplicity of exposition we assume in the sequel that the processn m³

Wt, eθ¡ P_∞¢´

: t ≥ 1o is a martingale diﬀerence⁶ under P , implying

S(P ) = S(P_∞) := E_P

∞

· m³

W, eθ¡ P_∞¢´

m³ W, eθ¡

P_∞¢´⁰¸ ,

if m is bounded and under the null (1) of no break. Under standard conditions, consistent estimators of S(P_∞) and M¡

P_∞¢ are S¡

P_W_T¢

and M¡ P_W_T¢

, respectively.

The next section introduces Andrews (1993) LM-GMM tests for structural breaks. They are obtained from the FS-GMM estimators defined above.

2.3 Test statistics

Consistent asymptotically equivalent GMM test statistics for testing H₀against H_1T(π) are Wald- type, Lagrange Multiplier-type (LM) or Likelihood ratio-type statistics. Without loss of generality

6This assumption is satisfied in many applications, such as for instance those of Section 4.

(9)

we focus in the sequel on LM test functionals. In our setting a LM test can be defined by means of the statistic (see also Andrews (1993), p. 837)

LMdT(π) = T

π (1 − π)· LM¡

π, P_W_T¢

= T

π (1 − π)U⁰¡

π, P_W_T¢ U¡

π, P_W_T¢ ,

where

U¡

π, P_W_T¢

= H¡ P_∞¢¹₂

E_P

WT ,π

hm³

W, bβ, bδ´i

= H¡

P_∞¢¹₂ 1 T

XT π t=1

m³

W_t, bβ, bδ´ ,

with

H¡ P_∞¢

= S¡ P_∞¢−1

M¡ P_∞¢

Σ¡ P_∞¢

M⁰¡ P_∞¢

S¡ P_∞¢−1

, Σ¡

P_∞¢

= h

M⁰¡ P_∞¢

S¡ P_∞¢−1

M¡

P_∞¢i⁻¹ ,

and with the empirical finite measure P_W_T_,π= _T¹ PT π t=1δ_W_t.

The LM statistic dLMT(π) is particularly simple to compute, since it requires only the computation of a single FS-GMM estimator. This is a clear advantage when working with RGMM statistics, because RGMM estimators for time series are typically more computationally intensive than classical ones (see Section 3 below).

As P_W_T_,π→ πP∞weakly when T → ∞, P almost surely, the asymptotic functional structure of the functional U is determined as

U¡ π, P_∞¢

= H¡ P_∞¢¹₂

E_πP

∞

hm³ W, eθ¡

P_∞¢´i

. (5)

Therefore, the asymptotic functional structure of LM¡

·, P∞¢

is given by

LM¡ π, P_∞¢

= U⁰¡ π, P_∞¢

U¡ π, P_∞¢

. (6)

In this paper we focus on a class of supremum and average statistics bξ^sup_T and bξ^ave_T , respectively, to test H0 against alternatives of the form HA(Π) or H1. The test statistic bξ^sup_T is defined by

bξ^sup_T := sup

π∈Π

d

LM_T(π) = T · eξ^sup¡ P_W_T¢

, (7)

(10)

where

eξ^sup¡ P_W_T¢

= sup

π∈Π

µ 1

π (1 − π)LM¡

π, P_W_T¢¶ . The test statistic bξ^ave_T is defined by

bξ^ave_T :=

Z

Π

LMdT(π) dλ (π) = T · eξ^ave¡ P_W_T¢

, (8)

where

eξ^ave¡ P_W_T¢

= Z

Π

µ 1

π (1 − π)LM¡

π, P_W_T¢¶

dλ (π) ,

and λ is the Lebesgue measure on Π. The asymptotic functional structure of eξ^sup and eξ^ave is obtained from (7), (8) and (6) as

eξ^sup¡ P_∞¢

= sup

π∈Π

µ 1

π (1 − π)LM¡

π, P_∞¢¶

, eξ^ave¡ P_∞¢

= Z

Π

1

π (1 − π)LM¡ π, P_∞¢

dλ (π) . (9) From (6) and (9) we deduce that the robustness properties of the functionals eξ^sup(·) and eξ^ave(·) are completely determined by the ones of the functional U . Therefore, one can expect to obtain a class of tests for structural breaks with better robustness properties when working with quadratic functionals based on a robust functional U . This in turn will require working with GMM test statistics and estimators based on a GMM setting with a bounded orthogonality function m; see also Section 3 below.

In a likelihood setting, statistics of the form (8) define an optimal test in terms of a weighted average power criterion based on an uniform prior for the break date π ∈ Π (see Andrews and Ploberger (1994)). Specifically, average type tests can be interpreted as the optimal test for structural breaks in the case of alternative hypotheses very near to the null⁷ . When constructing robust tests for structural breaks in a likelihood setting we can therefore expect robust versions of the bξ^ave_T statistic to produce the highest weighted power at the model, when compared for instance with robust versions of bξ^sup_T .

7 To test for more global structural break alternatives, optimal tests of the average exponential form can be applied (Andrews and Ploberger (1994)). Robust versions of such tests can be constructed readily with the same procedure used in Section 3, where robust versions of bξ^sup_T and bξ^ave_T are obtained.

(11)

The null asymptotic distribution of dLM_T(·) as a process indexed by π implies, by the Func- tional Central Limit theorem, the one of (7) and (8). This is summarized by the next result due to Andrews (1993).

Theorem 3 Under regularity conditions on (Θ, m, P ) and under the null hypothesis (1) (see Andrews (1993), Assumption 1 p. 830 and Assumption 3 p. 835) it follows:

1. dLMT(·) =⇒ Q^p(·) as a process indexed by π ∈ Π, where Qp(π) := 1

π (1 − π)Jp(π)⁰Jp(π) :=(Bp(π) − πB^p(1))⁰(Bp(π) − πB^p(1))

π (1 − π) ,

and Bp(·) is a p−dimensional standard Brownian motion on [0, 1].

2. bξ^sup_T →^dsup_π_∈ΠQp(π) under P . 3. bξ^ave_T →d

R

ΠQ_p(π) dλ (π) under P .

Based on this result, critical values for bξ^sup_T , bξ^ave_T can be computed by simulation of the process Qp(·). Results analogous to the one in Theorem 3 can be derived for the asymptotic distribution of dLMT(·) under appropriate sequences of parametric local alternatives to the null hypothesis (1); see Theorem 3 in Andrews (1993) for details.

3 Robust GMM tests for structural breaks

Statistical robustness deals with inference procedures that are based on smooth statistical functionals. In this paper we focus on inference procedures for structural breaks which are locally robust in nonparametric neighborhoods of a parametric reference model, that is procedures that are not excessively sensible to small deviations from a reference model.

Our goal is to develop smooth test functionals for breaks around some fix parametric reference model belonging to a (nonparametric) neighborhood of relevant model distributions. Therefore, a minimal robustness requirement is continuity of such functionals. A second stronger requirement is their Fréchet diﬀerentiability (see for instance Bednarsky (1993)). Based on (9), we can expect the power and level functionals of robust tests for breaks based on the supremum or the average to satisfy the first or the second requirement, respectively, if and only if the statistical functional

(12)

U in (5) satisfies the first or the second requirement, respectively. Therefore, a first focus is on working in GMM settings where such statistical functionals are Fréchet diﬀerentiable.

3.1 Fréchet diﬀerentiability

Let P be a probability measure on (Ω, F) satisfying the conditions for Theorem 2 and 3 to hold.

This will be the reference model in our robust setting⁸ . The boundedness of the orthogonality function m is a necessary condition for a general GMM statistic like the FS-GMM estimator eθ or the functional U in (5) to be Fréchet diﬀerentiable. More specifically, an orthogonality function m is unbounded if and only if the influence function of a GMM statistic is unbounded.

Unboundedness of the influence function implies an unbounded asymptotic bias of a GMM statistic in a neighborhood of P_∞(see for instance Ronchetti and Trojani (2001)). For the rest of the paper, we therefore assume a GMM setting based on a bounded orthogonality function m.

Assumption 4 The orthogonality function m is such that kmk_∞:= sup

(w,θ)∈W×Θkm (w, θ)k < ∞ .

Under Assumption 4 and further regularity conditions which are typically a subset of those suffi- cient for Theorem 2 and 3 to hold, Fréchet differentiability of the GMM functionals eθ and U at P_∞can be implied (see for instance Clarke (1986) and Heritier and Ronchetti (1994)). Therefore, we assume in the sequel their Fréchet differentiability.

Assumption 5 The functionals eθ and U are Fréchet diﬀerentiable.

The important property of Fréchet diﬀerentiable testing functionals for robust inference purposes is their uniform convergence in distribution over sequences of distributional asymptotic neighborhoods of the given reference model. This feature provides a way to compute uniform asymptotic expansions where the linearized asymptotic bias under contamination in the level and the power of the test can be formally bounded over neighborhoods of the reference model. We address this issue in the next section for our robust supremum and average tests for structural breaks.

8Typically, this will be a parametric distribution; see also the examples in Section 4 below.

(13)

3.2 Uniform convergence

To study the robustness properties of GMM testing procedures for structural breaks we consider sequences U^ε := {U^ε,T : T ≥ 1} of local neighborhoods of P . Denote by M the vector space of finite signed measures on (Ω, F) and by |·|V the variation norm on this space. A local neighborhood U_ε,T of P is defined by

Uε,T =n

Q^ε,T ∈ M : Q^ε,T is probability measure :¯¯Q^ε,T − P¯¯

V < ε/√ To

.

Let further Q^ε,T :=n

Q^ε,T_t : t = 1, .., To

be the corresponding sequence of finite dimensional distributions of {W^t: t = 1, ..T } under Q^ε,T, i.e.

Q^ε,T_t (A) = Q^ε,T(Wt∈ A) ,

for any A ∈ B¡ R^k¢

. Uniform weak convergence of U¡

·, PW^T

¢ as a process indexed by π over

sequences Uεof asymptotic neighborhoods of the reference model is defined next.

Definition 6 The sequence © U¡

·, PWT

¢: T ≥ 1ª

converges weakly as a process indexed by π to Jp(·), uniformly over the sequence U^ε, if

L^ε,T³√

T³ U¡

·, PWT

¢− U³

·, Q^ε,TT

´´´

=⇒ J^p(·) (T → ∞)

uniformly in Q^ε,T, where L^ε,T is the process distribution under Q^ε,T whereas Jp(·) is the Brownian Bridge process in Theorem 3.

Notice that, by construction,¯¯¯Q^ε,Tt − Pt

¯¯

¯V < ε/√

T for any t, implying¯¯¯Q^ε,TT − PT

¯¯

¯V < ε/√ T , where for brevity we denote now by |·|V the variation norm on the space of finite signed measures on¡

R^k, B¡ R^k¢¢

. Therefore, U_ε,T neighborhoods of probability measures on (Ω, F) translate in a natural way into neighborhoods of probability measures on¡

R^k, B¡ R^k¢¢

.

Uniform convergence in distribution over a sequence U^εgives us a way to study the robustness properties of the asymptotic size and power functionals of bξ^sup_T and bξ^ave_T under a sequence of local contaminations Q^ε,T_T , T ≥ 1. For brevity we focus on the robustness of the asymptotic size functionals lim_T_→∞α^sup(·) and limT→∞α^ave(·) of supremum and average tests for structural

(14)

breaks, respectively. They are defined under the null (1) by

Tlim→∞α^sup³ Q^ε,T_T ´

= lim

T→∞Q^ε,T_T ³

bξ^sup_T > ξ^sup₀ ´ ,

Tlim→∞α^ave³ Q^ε,T_T ´

= lim

T→∞Q^ε,T_T ³

bξ^ave_T > ξ^ave₀ ´ ,

where ξ^sup₀ and ξ^ave₀ are the critical values of the tests for a given nominal size α0. Given the Fréchet diﬀerentiability Assumption 5 we can assume uniform convergence in distribution of the sequence©

U¡

·, PWT

¢: T ≥ 1ª

; see also Clarke (1986) and Heritier and Ronchetti (1994) for more details on the relation between Fréchet diﬀerentiability and uniform convergence in distribution.

Assumption 7 The sequence© U¡

·, PW^T

¢: T ≥ 1ª

converges weakly as a process indexed by π to J_p(·), uniformly over the sequence Uε.

The Fréchet diﬀerentiability Assumption 5 and Assumption 7 imply a uniformly bounded bias of power and level of tests for structural breaks, over asymptotic distributional neighborhoods of the reference model. This gives formally the robustness of a testing procedure for structural breaks under local misspecifications of a reference model.

3.3 Asymptotic level expansions of tests for structural breaks

We next provide an expansion of the asymptotic size functional of robust supremum and average tests for structural breaks over sequences of asymptotic neighborhoods of the reference model P . For simplicity of exposition we work in the sequel with asymptotic ε−contaminated neighborhoods.

Let M∞ be a set of asymptotically stationary measures defined by

M∞=©

Q ∈ M : Q is a probability measure and QT → Q_∞weakly as T → ∞ª .

An ε−contaminated neighborhood Uε,T of P is defined by:

U_ε,T =

½ Q^η,T =

µ 1 − η

√T

¶ P + η

√TQ : Q ∈ M∞and η < ε

¾ .

The asymptotic behaviour of the size of tests for structural breaks under local model misspecifications is characterized by the next theorem⁹ .

9The proof is in the Appendix.

(15)

Theorem 8 Under Assumption 4, 7 and under the null hypothesis (1) it follows, for any η < ε:

Tlim→∞α^sup³ Q^η,T_T ´

= α₀+ η²µ^sup· d²¡

P_∞, Q_∞¢ + o¡

η²¢ , and

Tlim→∞α^ave³ Q^η,T_T ´

= α₀+ η²µ^ave· d²¡

P_∞, Q_∞¢ + o¡

η²¢ , where d²¡

P_∞, Q_∞¢

is given by d²¡

P_∞, Q_∞¢

= E_Q

∞

³ m³

W, eθ¡

P_∞¢´´⁰ H¡

P_∞¢ E_Q

∞

³ m³

W, eθ¡ P_∞¢´´

,

µ^sup= − ∂L^sup(ξ^sup₀ , x)

∂x

¯¯

x=0

, µâve= − ∂Lâve(ξâve₀ , x)

∂x

¯¯

x=0

, and L^sup(·, x), L^ave(·, x), are the cumulative distribution function of

sup

π∈Π

J_p^∗(π, x)⁰J_p^∗(π, x) π (1 − π) ,

Z

Π

J_p^∗(π, x)⁰J_p^∗(π, x)

π (1 − π) dλ (π) , respectively, where the process J_p^∗(·, x) is defined by

J_p^∗(π, x) = Jp(π) − π(1 − π) · x .

Theorem 8 shows that the asymptotic linearized bias in the level of tests for structural breaks is proportional to d²¡

P_∞, Q_∞¢

. In particular, if m is bounded the bias in the level of supremum and average tests for breaks is bounded over sequences of asymptotic neighborhoods of the reference model. This implies formally the robustness of GMM tests for breaks based on bounded orthogonality functions.

Theorem 8 can be also used to give asymptotic bounds on the maximal bias in the level of tests based on a bounded orthogonality function. In particular, an orthogonality function such that

sup

(w,θ)∈W×Θ

¯¯m (w, θ)⁰H¡ P_∞¢

m (w, θ)¯¯ < c² , (10) for some constant c >√

υ, implies up to terms of order o¡ η²¢

:

¯¯

¯ lim_T

→∞α^sup³ Q^η,T_T ´

− α0

¯¯

¯ ≤ η²µ^sup· c² , ¯¯¯ lim_T

→∞α^ave³ Q^η,T_T ´

− α0

¯¯

¯ ≤ η²µ^ave· c² . (11)

For testing purposes the bounds (11) can be used to choose the constant c in dependence of the maximal amount of contamination expected (ε) and the maximal bias in the asymptotic level

(16)

which a researcher is willing to accept. In this case, the derivatives µ^sup, µ^ave, will have to be computed numerically, by simulating the distribution of

sup

π∈Π

J_p^∗(π, x)⁰J_p^∗(π, x) π (1 − π) ,

Z

Π

J_p^∗(π, x)⁰J_p^∗(π, x) π (1 − π) ,

for several values of x in a neighborhood of 0. For instance, the local robustness of tests for structural breaks in dependence of model parameters like α₀ or Π could be studied, thereby producing information about the degree of robustness required in a particular model setting. Notice that in the case where Π = {π0} and λ = δπ0 (the Dirac distribution at π₀) the above tests collapse to a test for a break at a known break date. In this case (but only in this case) the results of Theorem 8 coincide with those of Theorem 1 in Ronchetti and Trojani (2001), where the local robustness of the size functional for standard Maximum Likelihood-type GMM tests has been characterized.

Since in this case the distribution of the random variable ¡

J_p^∗(π0, x)⁰J_p^∗(π0, x)¢

/ (π0(1 − π⁰)) is noncentral chis-quare, full analytical expressions for µ^sup= µ^ave become available.

The bound (10) is satisfied by RGMM estimators with bounded self-standardized sensitivity, i.e. such that¹⁰

sup

(w,θ)∈W×Θ

¯¯

¯m (w, θ)⁰S¡ P_∞¢−1

m (w, θ)¯¯¯ < c² .

Therefore, we consider supremum and average tests for breaks based on such RGMM estimators and their orthogonality functions. Details on the definition and the computation of such GMM estimators and their orthogonality functions in a general GMM setting are provided in Ronchetti and Trojani (2001), p. 45-48.

1 0Indeed:

¯¯m (w, θ)⁰H¡ P_∞¢

m (w, θ)¯

¯ ≤¯¯¯m (w, θ)⁰S¡

P_∞¢−1m (w, θ)¯¯¯ , since the matrix

S¡ P_∞¢¹₂

H¡ P_∞¢

S¡ P_∞¢¹₂ is an orthogonal projection matrix.

(17)

4 Monte Carlo simulations

This section studies by means of Monte Carlo simulation the size and power properties of the above robust GMM tests for breaks in diﬀerent model settings. We compare the performance of the RGMM tests with the one of the classical GMM tests by focusing on the eﬃciency under ideal conditions and the robustness under local departures from a conditionally normal reference model.

4.1 Testing for structural breaks in a linear regression model

We first consider tests for a break in the slope coeﬃcient of a linear regression model with an autoregressive regressor. The model is given by:

yt= γ + β_txt+ σut

xt= α + ρxt−1+ σεεt

, (12)

where:

β_t=









β₁, for t = 1, ..., T π₀ β₂, for t = T π0+ 1, ...T

,

for some π₀ ∈ Π. The error term εt in the process x_t is i.i.d.N (0, 1) distributed. For the error term u_t in the linear regression model (12) we simulate a set of diﬀerent distributions according to Model 1a-1e below. Specifically, we set:

• Model 1a: u^t∼ i.i.d. N (0, 1) ,

• Model 1b: u^t∼ i.i.d. t⁵/p 5/3,

• Model 1c: u^t∼ i.i.d. t³/√ 3,

• Model 1d: ut∼ i.i.d. CN (0.05, 0, 3) /√ 1.4,

• Model 1e: ut∼ i.i.d. CN (0.1, 0, 3) /√ 1.8,

where tnis a Student distribution with n degrees of freedom and CN (x, 0, 3) is a standard normal distribution contaminated with probability x by a further zero mean normal distribution having

(18)

standard deviation 3. All error distributions have been standardized. The standard orthogonality conditions for a least squares estimation of the linear regression model (12) are based on an orthogonality function given by

ψ(Wt, β, δ) =







ψ₁(Wt, β, δ) ψ₂(W_t, β, δ)





 =











 1 xt





 (y^t− γ − βxt)

(y_t− γ − βxt)²/σ²− 1





 ,

where Wt = (yt, xt)⁰, δ = ¡ γ, σ²¢0

. Classical FS-GMM estimators are obtained by using such orthogonality conditions. Since ψ is unbounded these estimators are not robust. Hence, also tests based on LM-type GMM statistics derived from such estimators are not robust. Robust FS-RGMM estimators of (12) can be constructed by applying orthogonality conditions based on a truncated orthogonality function given by

m(W_t, β, δ) =







m1(Wt, β, δ) m2(Wt, β, δ)





 =







A1ψ₁(Wt, β, δ)wc1(A1[ψ₁(Wt, β, δ)]) A2[ψ₂(Wt, β, δ) − τ²] wc2(A2[ψ₂(Wt, β, δ) − τ²])





 , (13) where w_c(z) := min(1, c/ kzk), defines a set of Huber’s weights that downweight observations which are influential (in terms of asymptotic bias) for a classical least squares estimation of the model; see also Hampel et al. (1986), Section 4.4., for more details. The constants c1 > √

2, c2> 1 are tuning constants that control the amount of robustness in the estimation of (γ, β)⁰ and σ², respectively. The matrix A1∈ R²^×2 and the scalars A2, τ2 are determined as the solution of the implicit equations:

0 = EP0[m(Wt, β₀, δ0)] , I = EP0

h

m(Wt, β₀, δ0)m(Wt, β₀, δ0)⁰i ,

where P0 is the reference model distribution of a linear regression model (12) with normally distributed error terms ut. Specifically, since under P0 the random variable (yt− γ0− β0xt)²/σ²₀ is X1² distributed, irrespectively of (β₀, δ0), the correction constanta A2, τ2 have to be computed

(19)

only once, before starting the robust estimation algorithm.

Table 1a-1e present the results of our Monte Carlo simulations for Models 1a-1e, respectively, in the given linear regression setting. The break date is fixed as t0= 0.5 · T , where T = 200,300, respectively. In the simulations for T = 200 we have set α = γ = 0, β₁ = 1, β₂ = 1, 1.1, 1.2, 1.3, σu = σe = 1, ρ = 0.2, 0.5, 0.8. In the simulations for T = 300 the values of β₂− β1 applied for T = 200 have been multiplied by a factor p

2/3 in order to obtain comparable local alternatives across the diﬀerent sample sizes. The tuning constants for the RGMM test have been set at c₁ = 3, c₂ = 3. We also fixed Π = [p₀, 1 − p0], where p₀ = 0.25. The nominal level of the tests is α₀ = 0.05. Finally, for the classical tests we provide the results of two GMM statistics. The first (OLS) uses a restricted estimate of S¡

P_∞¢

which uses explicitly the independence of (xt) and (ut). The second statistic (GMM) is the standard one which uses an unrerstricted estimate of S¡

P_∞¢

. RGMM is the robust GMM statistic based on an unrestricted estimate of S¡ P_∞¢

using the bounded orthogonality function (13).

Table 1a shows that the power loss under normality of supremum and average RGMM tests for breaks is moderate, with losses relatively to the classical GMM tests that are typically below 10%. As expected, the power of classical and robust average tests is above the one of tests based on the supremum functional. Table 1b shows that already under a t5 error distribution the power curves of classical and robust GMM tests are virtually undistinguishable. Notice, however, that classical GMM average tests are in this setting systematically oversized, especially for the cases T = 200 and ρ = 0.2, 0.5. This eﬀect seems to derive from the estimation of the asymptotic covariance matrix S¡

P_∞¢

of m. Indeed, the GMM average tests using an OLS based estimation of S¡

P_∞¢

present virtually no such oversize. However, their power curves are systematically below the ones of the robust average tests. In Table 1c, under a t₃error distribution, these issues are more pronounced with a clearly higher power of robust GMM tests relatively to standard procedures. Notice that in this model setting the systematic oversize of classical average tests is particularly pronounced. For example, the empirical sizes of the classical GMM tests are for all

(20)

parameter choices ρ = 0.2, 0.5, 0.8 and T = 200 above 0.075. Table 1d shows similar patterns as for a t5 distribution when simulating under a CN (0.05, 0, 3) error distribution: the power curves of classical and robust GMM tests under such a setting are virtually undistinguishable and the classical average test shows a tendence to a slight oversize when ρ = 0.2, 0.5, T = 200 and when T = 300. Under the CN (0.1, 0, 3) setting in Table 1e these issues are more pronounced with the power curves of supremum and average tests that are above the ones of classical procedures and with again an oversize of classical average tests for breaks.

4.2 Testing for structural breaks in an ARCH model

To investigate the properties of classical and robust GMM tests for structural breaks in nonlinear models, we now consider an ARCH model setting (Engle (1982)). We analyze two types of tests for breaks in this setting by testing for a break in the intercept and the slope, respectively, of the conditional variance equation in an ARCH(1) model.

4.2.1 Breaks in the autoregressive parameter of the conditional variance equation

The model specification is given by:

y_t = σ_tu_t,

σ²_t = α0+ α1,ty²_t₋₁,

where

α1,t=









α_1,1, for t = 1, ..., T π₀ α2,1, for t = T π0+ 1, ...T

,

for some π₀ ∈ Π. For utwe simulate again a set of distributions near to a standard normal one, according to Models 2a-2e below:

• Model 2a: u^t∼ i.i.d. N (0, 1) ,

• Model 2b: u^t∼ i.i.d. t⁷/p 7/5

• Model 2c: u^t∼ i.i.d. t⁵/p 5/3

(21)

• Model 2d: ut∼ i.i.d. CN (0.05, 0, 3) /√ 1.4

• Model 2e: u^t∼ i.i.d. CN (0.1, 0, 3) /√ 1.8

All error distributions have been standardized. The orthogonality conditions for a classical FS- GMM estimation of the model are defined by an orthogonality function given by:

ψ(Wt, β, δ) =





 1 y²_t₋₁





 1 σ²_t

µy²_t σ²_t − 1

¶ ,

where σ²_t = α₀ + α₁y_t²₋₁, W_t = (y_t, y_t₋₁), β = α₁, δ = α₀. This orthogonality function is unbounded, so that the implied GMM estimators are not robust. The orthogonality function for a robust GMM estimation of the model is given by:

m(W_t, β, δ) = A [ψ(W_t, β, δ) − τ (yt−1)] w_c(A [ψ(W_t, β, δ) − τ (yt−1)]) , (14)

for some tuning constant c > √

2. The matrix A ∈ R² and the vector function τ are defined by the implicit equations:

0 = EP0

h

m(Wt, β₀, δ0) | y^t−1

i , I = EP0

h

m(Wt, β₀, δ0)m(Wt, β₀, δ0)⁰i ,

(15)

where P0 is the reference model distribution of an ARCH(1) model with conditional normally distributed error terms ut. The shift factors τ (yt−1), t = 1, ..., T , can be computed by using an analytical Laplace approximation of the integrals involved in the solution of (15), as proposed in Mancini, Ronchetti and Trojani (2003). This avoids the numerical computation of such integrals and largely reduces the computation time of robust GMM estimators in the present and related settings. Details on the computation of τ (y_t₋₁) and the corresponding robust FS-GMM estimator for the moment conditions (15) are given in the Appendix.

Table 2a-2e present the results of our Monte Carlo simulations for Models 2a-2e, respectively, in the given ARCH(1) model setting. The break date is fixed as t0= 0.5 · T , where T = 500, 1000.

In the simulations for T = 1000 we have set α0 = 0.01, α1 = 0.2, α2 = 0.2, 0.3, 0.4, 0.5. In the

(22)

simulations for T = 500 the values of β₂− β1 applied for T = 1000 have been multiplied by a factor√

2 in order to obtain comparable local alternatives across the diﬀerent sample sizes. The tuning constants for the RGMM test have been set at c = 6.18. We also fixed Π = [p0, 1 − p⁰], where p0 = 0.45. The nominal level of the tests is α0 = 0.05. Finally, for the classical tests we provide the results of two GMM statistics. The first (OLS) is based on a restricted estimate of S¡

P_∞¢

which uses explicitly the independence of u_tand u_t₋₁. The second statistic (GMM) is the standard one which uses an unrestricted estimate of S¡

P_∞¢

. RGMM is the robust GMM statistic based on an unrestricted estimate of S¡

P_∞¢

using the bounded orthogonality function (14).

Table 2a shows that the power loss under normality of average RGMM test for breaks is moderate and always below 10%. Moreover, even under normality, the power of RGMM supremum tests is above the one of classical GMM procedures. Finally, classical GMM procedures based on the supremum imply a too conservative size behavior while RGMM tests based on the supremum do correct such a size distorsion in the right direction. As expected, the power of classical and robust average tests is above the one of tests based on the supremum functional. Table 2b shows that under a t₇ error distribution the gain in power of RGMM tests based on the supremum is very large, with increases that are sometimes around 80%-100%. Also in the case of the average statistic, RGMM procedures do produce clear power increases in this setting. Both supremum and average classical statistics yield a too conservative size behavior that is corrected in the right direction by RGMM tests for breaks. Such issues are even more apparent in Table 2c, under a t5distribution, where also in the case of the average statistic RGMM procedures yield very large power improvements. The results in Tables 2d and 2e (for a CN (0.05, 0, 3) and a CN (0.1, 0, 3) error distribution, respectively) are qualitatively similar to the ones of Table 2b and 2c, with eﬀects that are however quantitatively even larger than in the case of a Student t error distribution.

(23)

4.2.2 Breaks in the intercept parameter of the conditional variance equation

In this case the model is defined by:

y_t = σ_tu_t,

σ²_t = α0,t+ α1y_t²₋₁ ,

where:

α0,t=









α_0,1, for t = 1, ..., T π₀ α_0,2, for t = T π₀+ 1, ...T

,

for some π0 ∈ Π. The set of distributions simulated for u^t is the same as the one of the last section, and will be listed according to Models 3a-3e, respectively. The functions ψ, m defining the orthogonality function for a classical and a robust FS-GMM estimation of the model are as above with the relevant arguments that are now β = α₀and δ = α₁. In particular, the matrix M used for the computation of the LM tests is now diﬀerent from the one used in Section 4.2.1.

Table 3a-3e present the results of our Monte Carlo simulations for Models 3a-3e, respectively, in the given ARCH(1) model setting. In the simulations for T = 1000 we have set α0,1= 0.01, α0,2= 0.01, 0.0125, 0.015 and α2= 0.2. In the simulations for T = 500 the values of α0,2− α^0,1 applied for T = 1000 have been multiplied by a factor√

2 in order to obtain comparable local alternatives across the diﬀerent sample sizes. All further parameters are the same as the ones of Section 4.2.1.

Table 3a shows that the power loss under normality of average RGMM test for breaks is in this case virtually negligible. Moreover, the classical average test presents a slight tendence to an oversize which is corrected in the right direction by the robust average test. As for the tests of Section 4.2.1, the power under normality of RGMM supremum tests is above the one of classical GMM procedures. Similarly, classical GMM procedures based on the supremum imply a too conservative size behavior while RGMM tests based on the supremum do correct such a size distorsion in the right direction. As expected, the power of classical and robust average tests is again above the one of tests based on the supremum functional. Table 2b shows that under a t₇

(24)

error distribution the gain in power of RGMM tests based on the supremum is large, with increases that are sometimes around 30%-40%. In the case of the average statistic, RGMM procedures do produce power increments of about 10% in this setting. Finally, the too conservative behavior of classical supremum tests is corrected in the right direction by RGMM tests for breaks. In Table 3c, under a t5 distribution, both supremum and average RGMM procedures yield large power improvements relatively to their classical counterparts. Again, the size distorsions of classical tests based on the supremum are corrected in the right direction by RGMM tests for breaks.

The results in Tables 3d and 3e (for a CN (0.05, 0, 3) and a CN (0.1, 0, 3) error distribution, respectively) are similar to the findings in Table 3b and 3c, with eﬀects that are however again quantitatively larger than in the case of a Student t error distribution.

5 Conclusions

We proposed a class of new supremum and average RGMM tests for structural breaks, which imply a bounded asymptotic bias of size and power under local model misspecifications. Robustness of the new tests is obtained by computing the supremum or the average over a sequence of GMM Lagrange Multiplier statistics in a setting based on a bounded orthogonality function. Monte Carlo simulation showed the new robust GMM tests to perform very well already under slight deviations from the reference model, both in terms of the eﬃciency and the robustness of the inference procedure, when compared with standard (nonrobust) GMM tests for structural breaks.

Due to the intrinsic diﬃculty in developing perfectly specified parametric models for econometric applications, it is expected that RGMM tests for breaks can help in providing some more robust and consistent evidence on the presence of breaks in the statistical analysis of economic data series.

(25)

6 Appendix 1: Proof of Theorem 8

By Assumption 7:

√T³ U¡

·, PWT

¢− U³

·, Q^η,TT

´´

=⇒ J^p(·) ,

uniformly over the sequence U^ε as T → ∞. Moreover, a von Mises (1947) expansion under the null hypothesis (1) gives up to terms of order O³

η/√ T´

:

√T · H¡ P_∞¢−¹2

U³

·, Q^η,TT

´

= √

T E_πQ^η,T

T

³ m³

W, eθ³

Q^η,T_T ´´´

= ηE_πP

T

³∇θ⁰m³ W, eθ¡

P_T¢´´

D³ eθ´ ¡

P_T, Q_T¢ +ηE_πQ

T

³ m³

W, eθ¡ PT

¢´´· π

where

Q_T 7−→ D³ eθ´ ¡

PT, Q_T¢

is the Fréchet derivative of eθ at PT in direction Q_T−P^T, under the null hypothesis (1). Consistency of eθ at the model implies as T → ∞ under further mild regularity conditions

√T · H¡

P_∞¢−1/2

U³ Q^η,T_T ´

→ η · b¡

π, P_∞, Q_∞¢ ,

where

b¡

π, P_∞, Q_∞¢

= πh E_P

∞

³

∇θ⁰m³ W, eθ¡

P_∞¢´´

D³ eθ´ ¡

P_∞, Q_∞¢ + πE_Q

∞

³ m³

W, eθ¡

P_∞¢´´i

= πh N⁰¡

P_∞¢ D³

eθ´ ¡

P_∞, Q_∞¢ + πE_Q

∞

³m³ W, eθ¡

P_∞¢´´i

. (16)

To compute b¡

π, P_∞, Q_∞¢

note that under the null hypothesis (1) one has that the full sample moment conditions

0 = E_P

∞

³m³ W, eθ¡

P_∞¢´´

, (17)

hold for the FS-GMM estimator eθ. The boundedness of m then gives the Fréchet derivative D³

eθ´ ¡

P_∞, Q_∞¢

by implicitly diﬀerentiating (17) in the direction Q_∞− P∞ to get

0 = E_P

∞

³

∇^θ⁰m³ W, eθ¡

P_∞¢´´

D³ eθ´ ¡

P_∞, Q_∞¢ + E_Q

∞

³ m³

W, eθ¡ P_∞¢´´

,