Detecting changes in cross-sectional dependence in multivariate time series

(1)

arXiv:1206.2557v4 [math.ST] 21 May 2014

Detecting changes in cross-sectional dependence in

multivariate time series

Axel B¨

ucher

Institut f¨ur Angewandte Mathematik Universit¨at Heidelberg

Im Neuenheimer Feld 294, 69120 Heidelberg, Germany [email protected]

Ivan Kojadinovic, Tom Rohmer

Laboratoire de math´ematiques et applications, UMR CNRS 5142 Universit´e de Pau et des Pays de l’Adour

B.P. 1155, 64013 Pau Cedex, France

{ivan.kojadinovic,tom.rohmer_}@univ-pau.fr

Johan Segers

Institut de statistique, biostatistique et sciences actuarielles Universit´e catholique de Louvain

Voie du Roman Pays 20, B-1348 Louvain-la-Neuve, Belgium [email protected]

May 22, 2014

Abstract

Classical and more recent tests for detecting distributional changes in multi-variate time series often lack power against alternatives that involve changes in the cross-sectional dependence structure. To be able to detect such changes better, a test is introduced based on a recently studied variant of the sequential empirical copula process. In contrast to earlier attempts, ranks are computed with respect to relevant subsamples, with beneficial consequences for the sensitivity of the test. For the computation of p-values we propose a multiplier resampling scheme that takes the serial dependence into account. The large-sample theory for the test statistic and the resampling scheme is developed. The finite-sample performance of the pro-cedure is assessed by Monte Carlo simulations. Two case studies involving time series of financial returns are presented as well.

(2)

1 Introduction

Given a sequence X₁_{, . . . ,}X_n_of_d_{-dimensional observations, change-point detection aims}

at testing

H0 : ∃F such that X1, . . . ,Xn have c.d.f. F (1.1)

against alternatives involving the nonconstancy of the c.d.f. UnderH0and the assumption

that X₁_{, . . . ,}X_n _{have continuous marginal c.d.f.s} _F₁_{, . . . , Fd}_{, we have from the work of}

Sklar (1959) that the common multivariate c.d.f. F can be written in a unique way as

F(x_{) =}_C_{_F₁₍_x₁₎_{, . . . , Fd}₍_xd₎_}_, x_∈Rd_,

where the function C : [0,1]d _→ _[0_,_{1] is a} _copula _{and can be regarded as capturing the}

dependence between the components of X₁_{, . . . ,}X_n_{. It follows that} _H₀ _{can be rewritten}

as H0,m∩H0,c, where

H0,m: ∃F1, . . . , Fd such that X1, . . . ,Xn have marginal c.d.f.s F1, . . . , Fd, (1.2)

H0,c : ∃C such thatX1, . . . ,Xn have copula C. (1.3)

Classical nonparametric tests for H0 are based on sequential empirical processes; see

e.g. Bai (1994), Cs¨org˝o and Horv´ath (1997, Section 2.6) and Inoue (2001). For mod-erate sample sizes, however, such tests appear to have little power against alternative hypotheses that leave the margins unchanged but that involve a change in the copula, i.e., when H0,m∩(¬H0,c) holds. Empirical evidence of the latter fact can be found in

Holmes et al. (2013, Section 4). For that reason, nonparametric tests for change-point detection particularly sensitive to changes in the dependence structure are of practical interest.

Several tests designed to capture changes in cross-sectional dependence structure were proposed in the literature. Tests based on Kendall’s tau were investigated by Gombay and Horváth (1999) (see also Gombay and Horváth, 2002), Quessy et al. (2013) and Dehling et al. (2013). Although these have good power when the copula changes in such a way that Kendall’s tau changes as well, they are obviously useless when the copula changes but Kendall’s tau does not change or only very little. Tests based on functionals of sequential empirical copula processes were considered in Rémillard (2010), Bücher and Ruppert (2013), van Kampen and Wied (2013) and Wied et al. (2013). How-ever, the power of such tests is often disappointing; see Section 5 for some numerical evidence.

It is our aim to construct a new test forH0 that is more powerful than its predecessors

against alternatives that involve a change in the copula. The test is based on sequential empirical copula processes as well, but the crucial difference lies in the computation of the ranks. Whereas in R´emillard (2010) and subsequent papers, ranks are always computed with respect to the full sample, we propose to compute the ranks with respect to the relevant subsamples; see Section 2 for details. The intuition is that in this way, the copulas of those subsamples are estimated more accurately, so that differences between copulas of disjoint subsamples are detected more quickly. The phenomenon is akin to the one observed in Genest and Segers (2010) that the empirical copula, which is based on

(3)

pseudo-observations, is often a better estimator of a copula than the empirical distribution function based on observations from the copula itself. For another illustration in the context of tail dependence functions, see B¨ucher (2013a).

The paper is organized as follows. The test statistic is presented in Section 2, and its asymptotic distribution under the null hypothesis is found in Section 3. Next, Section 4 contains a detailed description of the multiplier resampling scheme and its asymptotic validity under the null hypothesis. The results of a large-scale Monte Carlo simulation study are reported in Section 5, and two brief case studies are given in Section 6. Sec-tion 7 concludes. Proofs and details regarding the simulaSec-tion study are deferred to the Appendices.

In the rest of the paper, the arrow ‘ ’ denotes weak convergence in the sense of Definition 1.3.3 in van der Vaart and Wellner (2000). Given a set T, let ℓ∞₍_T_{) denote}

the space of all bounded real-valued functions on T equipped with the uniform metric.

2 Test statistic

We now describe our test statistic and highlight the difference with the one in R´emillard (2010) and B¨ucher and Ruppert (2013). LetX₁_{, . . . ,}X_n_{be random vectors. For integers}

1_≤k_≤l _≤n, let Ck:l be the empirical copula of the sample Xk, . . . ,Xl. Specifically, Ck:l(u) = 1 l₋k+ 1 l X i=k 1( ˆUk:l i ≤u), (2.1) for u_∈_[0_,_1]d_{, where} ˆ Uk:l i = 1 l₋k+ 1(R k:l i1, . . . , Rkid:l), i∈ {k, . . . , l}, (2.2) with Rk:l ij = Pl

t=k1(Xtj ≤Xij) the (maximal) rank of Xij among Xkj, . . . , Xlj. (Because

of serial dependence, there can be ties, even if the marginal distribution is continu-ous; think for instance of a moving maximum process.) An important point is that the ranks are computed within the subsample X_k_{, . . . ,}X_l _{and not within the whole sample} X₁_{, . . . ,}X_n_{. As we continue, we adopt the convention that} _Ck_:_l _{= 0 if}_{k > l}_.

Write ∆ =_{(s, t)_∈[0,1]2 _:_s _≤_t_}_{. Let} _λn₍_{s, t}_{) = (}_⌊_nt_{⌋ − ⌊}_ns_⌋₎_/n _{for (}_{s, t}₎_∈_{∆. Our}

test statistic is based on the difference process, D_n_{, defined by}

D_n₍_s,u_{) =}√_{n λn}₍₀_{, s}₎_λn₍_s,₁₎_{_C_1:_⌊_ns_⌋₍u₎₋_C_⌊_ns_⌋_+1:_n₍u₎_} _(2.3)

for (s,u₎ _∈ _[0_,_1]d+1_{. For every} _s _∈ _[0_,_{1], it gives a weighted difference between the}

empirical copulas atu _{of the first} _⌊_ns_⌋ _{and the last}_n_{− ⌊}_ns_⌋_{points of the sample. Large}

absolute differences point in the direction of a change in the copula. To aggregate over u_{, we consider the Cram´er–von Mises statistic}

Sn,k =

Z

[0,1]d{

(4)

The test statistic for detecting changes in cross-sectional dependence is then Sn = max 1≤k≤n−1Sn,k = sup_s_∈_[0_,_1] Z [0,1]d{ D_n₍_s,u₎_}2_d_C_1:_n₍u₎_. _(2.4)

Other aggregating functions can be thought of too, leading for instance to Kolmogorov– Smirnov and Kuiper statistics. In numerical experiments, the resulting tests were found to be less powerful than the one based on the Cram´er–von Mises statistic and hence are not considered further in this paper.

The null hypothesis of a constant distribution is rejected when Sn is large. The p-values are determined by the null distribution ofSn, whose large-sample limit is derived in Section 3. To estimate the p-values from the data, a multiplier bootstrap method is proposed in Section 4.

Finally, if H0 is rejected, there could be one or several abrupt or smooth changes

in the joint distribution. Moreover, the change(s) could concern one or more marginal distributions, the copula, or both. In the case where there is just a single (abrupt) change-point k∗ _{∈ {}₁_{, . . . , n}₋₁_}_{, one can for instance estimate it by}

k_n⋆ = arg max

1≤k≤n−1

Sn,k. (2.5)

We do not pursue the issue of single or multiple change-point estimation nor the diagnosis of the nature of the change-point.

Our test statistic Sn differs from the one considered in R´emillard (2010, Section 5.2) and B¨ucher and Ruppert (2013, Section 3.2) in the way the copulas of the subsamples

X_{k, . . . ,}X_l _{are estimated. Rather than the empirical copula} _Ck_:_l_{, these authors propose}

to use Ck:l,n(u) = 1 l₋k+ 1 l X i=k 1( ˆU1:n i ≤u), u ∈[0,1]d, (2.6)

with the convention that Ck:l,n = 0 if k > l. In comparison with Ck:l in (2.1), the ranks

for the subsampleX_{k, . . . ,}X_l _{are computed relative to the complete sample}X₁_{, . . . ,}X_n_.

The estimators Ck:l,n yield the difference process

DR

n(s,u) = √

n λn(0, s)λn(s,1)_{C1:⌊ns⌋,n(u)−C⌊ns⌋+1:n,n(u)} (2.7)

for (s,u₎ _∈ _[0_,_1]d+1_{. The process} DR

n is to be compared with the process Dn in (2.3).

The difference lies in the use of Ck:l,n rather than Ck:l. From the process DRn, one obtains

the test statistic

SnR= sup s∈[0,1] Z [0,1]d _D_R n(s,u) 2 dC1:n(u), (2.8)

which is the analogue ofSn in (2.4).

In the Monte Carlo simulation experiments (Section 5), we will see that Sn is usually more powerful than SR

n for detecting changes in the cross-sectional copula. Intuitively,

the reason is that the empirical copula Ck:l in (2.1) is often a better copula estimator

(5)

is also equal to the empirical copula of ˆU1:n

k , . . . ,Uˆl1:n, of which Ck:l,n is the empirical

distribution function.

In Genest and Segers (2010), situations are identified where the empirical copula of an independent random sample drawn from a given bivariate copula has a lower asymptotic variance than the empirical distribution function of that sample. Of course, the situation here is different from the one in the cited paper: multivariate rather than bivariate, serial dependence rather than independence. But still, we suspect the same mechanisms to be active.

3 Large-sample distribution

The asymptotic distribution underH0 of our test statistic Sn in (2.4) can be obtained by

writing it as a functional of the two-sided sequential empirical copula process studied in B¨ucher and Kojadinovic (2013). Let X₁_,X₂_{, . . .} _{be a strictly stationary} _d_{-variate time}

series with stationary c.d.f.F having continuous marginsF1, . . . , Fdand copulaC. Recall

Ck:l in (2.1) and ˆUik:l in (2.2). The two-sided sequential empirical copula process, Cn, is

defined by C_n₍_{s, t,}u_{) =} √_{n λn}₍_{s, t}₎_{_C_⌊_ns_⌋_+1:_⌊_nt_⌋₍u₎₋_C₍u₎_} _(3.1) = _√1 n ⌊nt⌋ X i=⌊ns⌋+1 n 1( ˆU⌊ns⌋+1:⌊nt⌋ i ≤u)−C(u) o , (3.2)

for (s, t,u₎ _∈ _∆_×_[0_,_1]d_{. The link of} C_n _{to our test statistic} _Sn _{in (2.4) is that, under}

H0, the difference processDn in (2.3) can be written as

D_n₍_s,u_{) =}_λn₍_s,₁₎C_n₍₀_{, s,}u₎₋_λn₍₀_{, s}₎C_n₍_s,₁_,u₎_, _(3.3)

for (s,u₎_∈_[0_,_1]d+1_.

Before focusing on the weak limit of the process D_n _under _H₀_{, let us briefly recall}

the notion ofstrongly mixing sequence. For a sequence of d-dimensional random vectors (Y_i₎_i_∈_{Z, the} _σ_{-field generated by (}Y_i₎_a_≤_i_≤_b_,_{a, b}_∈Z_{∪ {−∞}_,₊_∞}_{, is denoted by}_Fb

a. The

strong mixing coefficients corresponding to the sequence (Y_i₎_i_∈_Z _{are defined by} αr = sup p∈Z sup A∈F−∞p ,B∈F +∞ p+r |P(A_∩B)₋P(A)P(B)_|

for positive integer r. The sequence (Y_i₎_i_∈_Z _{is said to be} _{strongly mixing} _if _αr _→ _{0 as} r_{→ ∞}.

The weak limit of the two-sided empirical copula process C_n _{defined in (3.2) under}

strong mixing was established in B¨ucher and Kojadinovic (2013) under the following two conditions:

Condition 3.1. With probability one, there are no ties in each of the d component series X1j, X2j, . . ., where j ∈ {1, . . . , d}.

(6)

Condition 3.2. For any j _{∈ {}1, . . . , d_}, the partial derivatives Cj˙ =∂C/∂uj exist and are continuous on Vj =_{u_∈_[0_,_1]d_:_uj _∈₍₀_,₁₎_}_.

Condition 3.1 was considered in B¨ucher and Kojadinovic (2013) as continuity of the marginal distributions isnot sufficient to guarantee the absence of ties when the observa-tions are serially dependent (see e.g. B¨ucher and Segers, 2013, Example 4.2). One of the contributions of this work is to show that it actually can be dispensed with. Condition 3.2 was proposed in Segers (2012) and is nonrestrictive in the sense that it is necessary for the candidate weak limit of C_n _{to exist pointwise and have continuous trajectories.}

As we continue, for any j _{∈ {}1, . . . , d_}, we define ˙Cj to be zero on the set _{u _∈

[0,1]d _: _uj _{∈ {}₀_,₁_}} _{(see also Segers, 2012; B¨}_{ucher and Volgushev, 2013). Also, for any} j _{∈ {}1, . . . , d_} and any u_∈_[0_,_1]d_, u(j) _{is the vector of [0}_,_1]d _{defined by}_u(j)

i =uj if i=j

and 1 otherwise.

The weak convergence of the process C_n _{defined in (3.2) actually follows from that of}

the process B_n₍_{s, t,}u_{) =} _√1 n ⌊nt⌋ X i=⌊ns⌋+1 {1(U_i _≤u₎₋_C₍u₎_}_, ₍_{s, t,}u₎_∈_∆_×_[0_,_1]d_, _(3.4)

where U₁_{, . . . ,}U_n _{is the unobservable sample obtained from} X₁_{, . . . ,}X_n _{by the}

proba-bility integral transforms Uij = Fj(Xij), i _{∈ {}1, . . . , n_}, j _{∈ {}1, . . . , d_}, and with the convention that B_n₍_{s, t,}_·_{) = 0 if} _⌊_nt_{⌋ − ⌊}_ns_⌋_{= 0.}

IfU₁_{, . . . ,}U_n_{is drawn from a strictly stationary sequence (}U_i₎_i_∈_Z_{whose strong mixing}

coefficients satisfy αr =O(r−a_{) with} _{a >}_{1, we have from B¨}_{ucher (2013b) that} _B

n(0,·,·)

converges weakly inℓ∞([0,1]d+1) to a tight centered Gaussian processZ_C _{with covariance}

function

cov_{Z_C₍_s,u₎_,Z_C₍_t,v₎_}_{= min(}_{s, t}₎X k∈Z

cov_{1(U₀ _≤u₎_,₁₍U_k_≤v₎_}_.

The latter is actually a consequence of Lemma 2 in B¨ucher (2013b) stating thatB_n₍₀_,_·_,_·₎

is asymptotically uniformly equicontinuous in probability, which in turn implies thatZ_C

has continuous trajectories with probability one. As a consequence of the continuous mapping theorem, B_n B_C _in _ℓ∞_(∆_×_[0_,_1]d_{), where}

B_C₍_{s, t,}u_{) =} Z_C₍_t,u₎₋Z_C₍_s,u₎_, ₍_{s, t,}u₎_∈_∆_×_[0_,_1]d_. _(3.5)

The following result is a consequence of Theorem 3.4 of B¨ucher and Kojadinovic (2013) and the arguments used in the proof of Lemma A.2 of B¨ucher and Segers (2013). Its proof is given in Appendix A.

Proposition 3.3. Let X₁_{, . . . ,}X_n _{be drawn from a strictly stationary sequence} ₍X_i₎_i_∈_Z with continuous margins and whose strong mixing coefficients satisfyαr =O(r−a₎_, _{a >}₁_. Then, provided Condition 3.2 holds,

sup (s,t,u)∈∆×[0,1]d Cn(s, t,u)−C˜n(s, t,u) P →0, (3.6)

(7)

where ˜ C_n₍_{s, t,}u_{) =}B_n₍_{s, t,}u₎₋ d X j=1 ˙ Cj(u₎B_n₍_{s, t,}u(j)₎_. _(3.7)

Consequently, C_n C_C _in _ℓ∞_(∆_×_[0_,_1]d₎_{, where, for} ₍_{s, t,}u₎_∈_∆_×_[0_,_1]d_,

C_C₍_{s, t,}u_{) =}B_C₍_{s, t,}u₎₋ d X j=1 ˙ Cj(u)BC(s, t,u(j)). (3.8)

In view of (3.3), the weak limit of D_n _under _H₀ _{is a mere corollary of Proposition 3.3}

and the continuous mapping theorem.

Corollary 3.4. Under the conditions of Proposition 3.3, D_n D_C _in _ℓ∞_([0_,_1]d+1₎_,

where, for any (s,u₎_∈_[0_,_1]d+1_,

D_C₍_s,u_{) =} C_C₍₀_{, s,}u₎₋_sC_C₍₀_,₁_,u₎_, _(3.9) with C_C _{defined in} _(3.8)_{. As a consequence,}

Sn S = sup

s∈[0,1]

Z

[0,1]d{

D_C₍_s,u₎_}2_d_C₍u₎_. _(3.10)

The covariance function of D_C _{can be expressed in terms of the one of}C_C _by

cov_{D_C₍_s,u₎_,D_C₍_t,v₎_}₌_{_min(_{s, t}₎₋_st_} _cov_{C_C₍₀_,₁_,u₎_,C_C₍₀_,₁_,v₎_}_.

4 Resampling

In order to compute p-values forSn based on (3.10), we propose to use resampling meth-ods. Tracing back the definition of SnviaD_n _toC_n _{in (3.1) and using the approximation}

via ˜C_n _{in (3.7), we find that it suffices to construct a resampling scheme for} B_n _defined

in (3.4) and to estimate the first-order partial derivatives, ˙Cj, ofC.

4.1 Multiplier sequences

In the case of i.i.d. observations, Scaillet (2005) proposed to use amultiplier approach in the spirit of van der Vaart and Wellner (2000, Chapter 2.9) to resample B_n_{. When the}

first-order partial derivatives ofCare estimated by finite-differencing as in R´emillard and Scaillet (2009), the resulting resampling scheme for C_n _{is frequently referred to as a} _multiplier

bootstrap. In a nonsequential setting based on independent observations, B¨ucher and Dette (2010) compared the finite-sample behavior of the various resampling techniques proposed in the literature and concluded that the multiplier bootstrap of R´emillard and Scaillet (2009) has, overall, the best finite-sample properties. This technique was revisited the-oretically by Segers (2012) who showed its asymptotic validity under Condition 3.2. A sequential generalization of the latter result will be stated later in this section. In the case of independent observations, the multiplier bootstrap is based on i.i.d. multiplier sequences. We say that a sequence of random variables (ξi,n)i∈Z is an i.i.d. multiplier

(8)

(M0) (ξi,n)i∈Zis i.i.d., independent ofX₁, . . . ,X_n, with distribution not changing withn, having mean 0, variance 1, and being such that R∞

0 {P(|ξ0,n|> x)}

1/2_d_{x <}_∞_.

Starting from the seminal work of Bühlmann (1993, Section 3.3), Bücher and Ruppert (2013) and Bücher and Kojadinovic (2013) have studied adependent multiplier bootstrap for C_n _{which extends the multiplier bootstrap of Rémillard and Scaillet (2009) to the}

sequential and strongly mixing setting. The key idea in B¨uhlmann (1993) is to replace i.i.d. multipliers by suitably serially dependent multipliers that will capture the serial dependence in the data. In the rest of the paper, we say that a sequence of random variables (ξi,n)i∈Z is a dependent multiplier sequence if:

(M1) The sequence (ξi,n)i∈Z is strictly stationary with E(ξ₀_,n) = 0, E(ξ₀2_,n) = 1 and sup_n_≥₁E(_|ξ0,n|ν) < ∞ for all ν ≥ 1, and is independent of the available sample X₁_{, . . . ,}X_n_.

(M2) There exists a sequence ℓn _{→ ∞} of strictly positive constants such thatℓn=o(n) and the sequence (ξi,n)i∈Z is ℓn-dependent, i.e., ξi,n is independent of ξi+h,n for all h > ℓn and i∈N.

(M3) There exists a function ϕ : R _→ _[0_,_{1], symmetric around 0, continuous at 0,}

satisfying ϕ(0) = 1 andϕ(x) = 0 for all_|x_|>1 such that E(ξ0,nξh,n) =ϕ(h/ℓn) for

all h_∈Z_.

Ways to generate dependent multiplier sequences are mentioned in Section 5 and Appendix C.

4.2 Computing p-values via resampling

Let M be a large integer and let (ξi,n(1))i∈Z, . . . ,(ξ_i,n(M))_i_∈Z be M independent copies of the same multiplier sequence. We will define two multiplier resampling schemes for the process B_n _{in (3.4). These will lead to two resampling schemes for the test statistic} _Sn

in (2.4), on the basis of which approximate p-values can be computed.

Recall that Ck:l in (2.1) is the empirical copula ofXk, . . . ,Xl, which is the empirical

distribution of the vectors of rescaled ranks ˆUk:l

i in (2.2). For any m ∈ {1, . . . , M} and

(s, t,u₎_∈_∆_×_[0_,_1]d_{, let} ˆ B(m) n (s, t,u) = 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m)_{1( Û1:n i ≤u)−C1:n(u)}, (4.1) and ˇ B(m) n (s, t,u) = 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m)_{1( Û⌊ns⌋+1:⌊nt⌋ i ≤u)−C⌊ns⌋+1:⌊nt⌋(u)} = _√1 n ⌊nt⌋ X i=⌊ns⌋+1 (ξ(_i,nm)₋ξ¯_⌊(_nsm)_⌋_+1:_⌊_nt_⌋)1( Û⌊ns⌋+1:⌊nt⌋ i ≤u), (4.2)

(9)

where ¯ξ_k(m_:_l) is the arithmetic mean of ξ_i,n(m) for i _{∈ {}k, . . . , l_}. By convention, the sums are zero if _⌊ns_⌋ = _⌊nt_⌋. Note that the ranks are computed relative to the complete sampleX₁_{, . . . ,}X_n _{for ˆ}B(_nm)₍_{s, t,}_·_{), whereas they are computed relative to the subsample}

X_⌊_ns_⌋₊₁_{, . . . ,}X_⌊_nt_⌋ _{for ˇ}B(_nm)₍_{s, t,}_·_).

In order to get to resampling versions of ˜C_n _{in (3.7), we need estimators of the}

first-order partial derivatives of C. A simple estimator based on X_{k, . . . ,}X_l _{consists of finite}

differencing at a bandwidth of h _≡ h(k, l) = min_{(l₋k+ 1)−1/2_,₁_/₂_}_{. Varying slightly}

upon the definition in R´emillard and Scaillet (2009) and following Kojadinovic et al. (2011a, Section 3), we put

˙

Cj,k:l(u) =

Ck:l(u+hej)−Ck:l(u−hej)

min(uj+h,1)₋max(uj ₋h,0)

foru _∈_[0_,_1]d_{, where}e_j _{is the}_j_{th canonical unit vector in}Rd_{. Note that if}_h_≤_uj _≤₁₋_h_,

the denominator is just 2h. The more general form of the denominator corrects for boundary effects (uj close to 0 or 1). Proceeding for instance as in Kojadinovic et al. (2011a, proof of Proposition 2), we find that the previous estimator is uniformly bounded. The resampling versions ˆB(_nm) _{and ˇ}B(_nm) _of B_n _{then lead to the following resampling}

versions for ˜C_n_{: for (}_{s, t,}u₎_∈_∆_×_[0_,_1]d_,

ˆ C(m) n (s, t,u) = ˆB(nm)(s, t,u)− d X j=1 ˙ Cj,1:n(u) ˆBn(m)(s, t,u(j)), ˇ C(m) n (s, t,u) = ˇB(nm)(s, t,u)− d X j=1 ˙ Cj,⌊ns⌋+1:⌊nt⌋(u) ˇBn(m)(s, t,u(j)). (4.3)

Recall that λn(s, t) = (_⌊nt_{⌋ − ⌊}ns_⌋)/n. The difference process D_n _{is to be resampled by}

one of the following two methods: ˆ D(m) n (s,u) = λn(s,1) ˆC(nm)(0, s,u)−λn(0, s) ˆC(nm)(s,1,u) = ˆC(m) n (0, s,u)−λn(0, s) ˆC(nm)(0,1,u), ˇ D(m) n (s,u) = λn(s,1) ˇC(nm)(0, s,u)−λn(0, s) ˇCn(m)(s,1,u).

For resampling the test statistic, one has the choice between ˆ S_n(m) = sup s∈[0,1] Z [0,1]d{ ˆ D(m) n (s,u)}2dC1:n(u), (4.4) ˇ S_n(m) = sup s∈[0,1] Z [0,1]d{ ˇ D(m) n (s,u)}2dC1:n(u). (4.5)

Finally, approximate p-values of the observed test statisticSncan be computed via either

1 M M X m=1 1Sˆ_n(m) _≥Sn or 1 M M X m=1 1 Sˇ_n(m)_≥Sn . (4.6)

(10)

The null hypothesis is rejected if the estimated p-value is smaller than the desired signif-icance level.

By comparison, note that for the test statistic SR

n in (2.8) based on the process DRn

in (2.7), an approximate p-value can be computed using the multiplier processes

DR,(m)

n (s,u) = ˆB(nm)(0, s,u)−λn(0, s) ˆB(nm)(0,1,u), (4.7)

where ˆB(_nm)_{is defined in (4.1); see also R´emillard (2010, Section 5.2) and B¨}_{ucher and Ruppert}

(2013, Section 3.2).

4.3 Asymptotic validity of the resampling scheme

We establish the asymptotic validity of the multiplier resampling schemes described above under the null hypothesis. First, we need to impose conditions on the data generating process X₁_{, . . . ,}X_n _{and the multiplier sequences (}_ξ(m)

i,n )i∈Z for m∈ {1, . . . , M}. Condition 4.1. One of the following two conditions holds:

(i) The random vectors X₁_{, . . . ,}X_n _{are i.i.d. and} ₍_ξ(1)

i,n)i∈Z, . . . ,(ξ_i,n(M))_i_∈Z are

indepen-dent copies of a multiplier sequence satisfying (M0).

(ii) The random vectors X₁_{, . . . ,}X_n _{are drawn from a strictly stationary sequence}

(X_i₎_i_∈Z whose strong mixing coefficients satisfyαr =O(r−a)for somea >3+3d/2,

and (ξ(1)_i,n)i∈Z, . . . ,(ξ_i,n(M))_i_∈Z are independent copies of a dependent multiplier

se-quence satisfying (M1)–(M3) with ℓn =O(n1/2−γ₎ _{for some} ₀_{< γ <}₁_/₂_.

In both cases, the stationary distribution of X_i _{has continuous margins and a copula} _C satisfying Condition 3.2.

If the random vectors X₁_{, . . . ,}X_n _{are i.i.d., they can also be considered to be drawn}

from a strongly mixing, strictly stationary sequence. Hence, for the multiplier sequences (ξ(_i,nm))i∈Z, one could either assume (M0) or (M1)–(M3): both should work. However, as

discussed in B¨ucher and Kojadinovic (2013, Section 2), the use of dependent multipliers in the case of independent observations is likely to result in an efficiency loss. This is illustrated in the Monte Carlo simulations reported in B¨ucher and Ruppert (2013, Section 3) and carried out for the test based on the statistic SR

n defined in (2.8) which

is resampled using multiplier processes asymptotically equivalent to those given in (4.7): the use of dependent multipliers in the case of serially independent data usually results in a loss of power and in a slightly more conservative test. Thus, in finite samples, if there is no evidence against serial independence, it appears more sensible to work under (M0). We can now state the asymptotic distributions of the multiplier resampling schemes under the null hypothesis of a constant distribution. We provide two propositions, one for the resampling scheme based on ˆB(_nm) _{in (4.1) and another one for the scheme based}

(11)

Proposition 4.2. If Condition 4.1 holds, then C_n,Cˆ(1) n , . . . ,Cˆ(nM) C_C_,C(1) C , . . . ,C (M) C

in _{ℓ∞(∆_×[0,1]d)_}M+1, where C_C _{is defined in} _(3.8)_{, and}C(1)

C , . . . ,C

(M)

C are independent copies of C_C_{. As a consequence, also}

D_n,Dˆ(1) n , . . . ,Dˆ(nM) D_C_,D(1) C , . . . ,D (M) C , in _{ℓ∞_([0_,_1](d+1)₎_}M+1_{, where} _D C is defined in (3.9) and D(1)C , . . . ,D (M) C are independent copies of D_C_{. Finally,} Sn,Sˆn(1), . . . ,Sˆn(M) S, S(1), . . . , S(M)

where S is defined in (3.10) and S(1)_{, . . . , S}(M) _{are independent copies of} _S_.

Under Condition 4.1(i), the above result can be easily proved by starting from Theo-rem 1 of Holmes et al. (2013) and adapting the arguments used in Segers (2012, proof of

Proposition 3.2). Under Conditions 4.1(ii) and 3.1, the result was obtained in B¨ucher and Kojadinovic (2013, Proposition 4.2). The additional arguments allowing to avoid Condition 3.1 will

be given in the proof of the next result.

Proposition 4.3. If Condition 4.1 holds, then the conclusions of Proposition 4.2 also hold with Cˆ(_nm) _{replaced by} Cˇ_n(m)_, Dˆ(_nm) _{replaced by} Dˇ(_nm)_{, and} _Sˆ_n(m) _{replaced by} _Sˇ_n(m)_.

The proof of Proposition 4.3 is somewhat involved and is given in detail in Appendix B.

Combining the last claims of Propositions 4.2 and 4.3 with Proposition F.1 in B¨ucher and Kojadinovic (2013), we obtain that a test based on Sn whose p-value is computed using one of the

two approaches in (4.6) will hold its level asymptotically asn _{→ ∞}followed byM _{→ ∞}.

5 Simulation study

Large-scale Monte Carlo experiments were carried out in order to study the finite-sample performance of the derived tests for detecting changes in cross-sectional dependence. The main questions addressed by the study are the following:

(i) How well do the tests hold their size under the null hypothesis H0 in (1.1) of no

change?

(ii) What is the power of the tests against the alternativeH1,c of a single change in

cross-sectional dependence at constant margins? Specifically, the alternative hypothesis is H1,c∩H0,m with H0,m in (1.2) and H1,c defined by

H1,c :∃distinct C1 and C2, and k⋆ ∈ {1, . . . , n−1} such that

(12)

(iii) What happens if the change in distribution is only due to a change in the margins, the copula remaining constant? Specifically, the alternative hypothesis isH1,m∩H0,c

with H0,c given in (1.3) andH1,m defined by

H1,m:∃ distinct F1,1, F1,2 as well as F2, . . . , Fd and k1⋆ ∈ {1, . . . , n−1}

such that X₁_{, . . . ,}X_k⋆

1 have marginal c.d.f.s F1,1, F2, . . . , Fd

and X_k⋆

1+1, . . . ,Xn have marginal c.d.f.sF1,2, F2, . . . , Fd. (5.2)

In addition to the three questions above, many others can be formulated, involving other alternative hypotheses for instance. The problem is complex and there are countless ways of combining factors in the experimental design. In our study, the settings were chosen to represent a wide and hopefully representative variety of situations, in function of the three questions above. The main factors of our experiments are summarized below:

• Test statistics:

– Our statistic Sn in (2.4) with p-values computed via resampling using ˆSn(m)

or ˇSn(m) in (4.4) and (4.5), respectively. As we continue, we shall simply talk

about the test based on ˆSn or ˇSn, respectively, to distinguish between these

two situations. – The statistic SR

n in (2.8) of B¨ucher and Ruppert (2013), with p-values

com-puted according to the resampling method for DR

n in (4.7). • Sample size: n _{∈ {}50,100,200_}.

• Number of samples per setting: 1 000.

• Cross-sectional dimension: d_{∈ {}2,3_}.

• Significance level: α= 5%.

• Serial dependence: The data were generated either as being serially independent or via two time-series models, an autoregressive process and a multivariate version of the exponential autoregressive model considered in Auestad and Tjøstheim (1990) and Paparoditis and Politis (2001, Section 3.3). Independent standard normals were used as multipliers for independent observations, while for the serially depen-dent datasets, the dependepen-dent multiplier sequences were generated from initial inde-pendent standard normal sequences using the “moving average approach” proposed initially in B¨uhlmann (1993) and revisited in some detail in B¨ucher and Kojadinovic (2013, Section 6.1). The value of the bandwidth parameter ℓn defined in

Condi-tion (M2) was chosen automatically using the approach described in B¨ucher and Kojadinovic (2013, Section 5). See Appendix D for details.

• Margins: in all but one setting, the margins were kept constant, i.e., H0,m in (1.2)

was assumed. In one case (see Table 5), a break as in H1,m in (5.2) was assumed,

the marginal distribution of the first component changing from the N(0,1) to the

N(µ,1) distribution.

• Copulas: Clayton, Gumbel–Hougaard, Normal, Frank, with positive or negative (in-sofar possible) association, as well as asymmetric versions obtained via Khoudraji’s device (Khoudraji, 1995; Genest et al., 1998; Liebscher, 2008).

• Alternative hypotheses involving a single change-point occurring at time k⋆ ₌_⌊_nt_⌋

with t_{∈ {}0.1,0.25,0.5,0.75_}:

– H0,m∩H1,c with a change of the parameter within a copula family.

(13)

– H1,m∩H0,c, i.e., a change of one of the margins rather than of the copula.

– For the serially dependent case, a change in the copula of the innovations, leading to a gradual change of the copula of the marginal distributions of the observables.

The experiments were carried out in theRstatistical system (R Development Core Team, 2013) using thecopulapackage (Hofert et al., 2013). To allow us to reuse previously writ-ten code, the rescaled ranks in (2.2) were computed by dividing the ranks by l₋k+ 2 instead of l₋k+ 1. Because (4.4) only involves rescaled ranks computed from the entire sample, the test based on ˆSn can be implemented to be substantially faster than the one

based on ˇSn for larger sample sizes. The corresponding routines are available in the R

package npcp (Kojadinovic, 2014).

For the sake of brevity, only a representative subset of the results is reported here. Specifically, the following tables are provided in Appendix D:

• Size of the tests under the null hypothesis H0:

– Table 1: Percentage of false rejections when data are serially independent. – Table 2: Percentage of false rejections when data are serially dependent.

• Power of the tests against specific alternatives:

– Table 3: Power againstH0,m∩H1,c involving a change of the copula parameter

within a copula family and at serial independence.

– Table 4: Power against H0,m∩H1,c involving a change of copula family at a

constant value of Kendall’s tau and at serial independence.

– Table 5: Power against H1,m∩H0,c involving a change in one of the margins

and at serial independence..

– Table 6: Power against _¬H0 when data are serially dependent and the change

occurs in the copula of the innovations.

Besides findings of a more anecdotical nature, the following conclusions may be drawn from the results:

• All tests hold their level reasonably well in the case of serial independence (Table 1), with minor fluctuations depending on sample size, test statistic, copula parameter and copula family.

• In case of serial dependence, the test based on ˆSnis too conservative for the sample sizes under consideration (Table 2). In line with this observation, the test based on

ˇ

Sn appears to be more powerful than the one based on ˆSn (Table 6).

• For alternative hypotheses involving a change in the copula, the tests based on ˆSn

and ˇSn have a higher power thanSR

n (Tables 3 and 4). When the copula changes in

such a way that Kendall’s tau remains constant, the power of SR

n is especially low.

With respect to that last setting, note that distinguishing copulas on the basis of low amounts of data is known to be difficult (Genest et al., 2009; Kojadinovic et al., 2011b). The fact that the change-point is unknown makes the problem even harder.

(14)

• For alternative hypotheses involving a change in one of the margins, it is the test statisticSR

n that is substantially more powerful thanSn(Table 5). The weak power

of Sn can be explained by the fact that it is designed for detecting changes in the copula. Another tentative reading of the results is that the test based on Sn, regarded as a procedure for testing H0,c, is relatively robust against small changes

in one margin. In contrast, the test based on SR

n behaves as an all-purpose test

for the hypothesis of a constant distribution rather than as a test for a constant copula.

6 Case studies

As an illustration, we first applied the test based on ˇSnto bivariate financial data consist-ing of daily logreturns computed from the DAX and the Standard and Poor 500 indices. Following Dehling et al. (2013, Section 7), attention was restricted to the years 2006– 2009. The corresponding closing quotes were obtained from http://quote.yahoo.com

using the get.hist.quote function of the tseries R package (Trapletti and Hornik, 2013), which resulted in n = 993 bivariate logreturns. Dependent multiplier sequences were generated as explained in Appendix C. An approximate p-value of 0.04 was obtained, providing some evidence against H0. The conclusion is in line with the results reported

in Dehling et al. (2013). Of course, as discussed earlier, it is only under the assumption thatH0,m in (1.2) holds that it would be fully justified to decide to rejectH0,c in (5.1) on

the basis of the previous approximate p-value. The value of the change- point estimator

k⋆

n in (2.5) is 529, corresponding to February 22nd, 2008.

As a second illustration, we followed again Dehling et al. (2013) and considered n = 504 bivariate logreturns computed from closing daily quotes of the Dow Jones Industrial Average and the Nasdaq Composite for the years 1987 and 1988. The former quotes, not being available on http://quote.yahoo.com anymore, were taken from the R package

QRM (Pfaff and McNeil, 2013). This two-year period is of interest because it contains October 19th, 1987, known as “black Monday” (see Dehling et al., 2013, Figure 4). An approximate p-value of 0.59 was obtained. Hence, despite the extreme events that oc-curred during the period under consideration, the test based on ˇSn detects no evidence againstH0 in the data, which is in line with the results reported in Dehling et al. (2013).

7 Conclusion

We have demonstrated that the sensitivity of rank-based tests for the null hypothesis of a constant distribution against changes in cross-sectional dependence can be improved if ranks are computed with respect to relevant subsamples. In this way, the test we propose achieves in many cases a higher power than the one proposed in Bücher and Ruppert (2013). The limit distribution of the test statistic under the null hypothesis is unwieldy, but approximate p-values can still be computed via a multiplier resampling scheme. To deal with potential serial dependence, we make use of dependent multiplier sequences, an idea going back to Bühlmann (1993) and revisited in Bücher and Kojadinovic (2013).

(15)

Here are some potential avenues for further research:

– Once the null hypothesis has been rejected, the nature of the nonstationary needs to be investigated further: is there a single change-point or is there more than one? Or maybe the change is gradual rather than sudden? And does the change concern the margins or the copula?

– Can one detect a change in the copula without the hypothesis that the margins are constant?

– The procedure is computationally intensive because the ranks have to be recomputed for every k _{∈ {}1, . . . , n₋1_}. Efficient algorithms for reutilizing calculations from one value ofk to the next one might speed up the computations.

Acknowledgments

The authors are grateful to Mark Holmes, Jean-Fran¸cois Quessy and Martin Ruppert for fruitful discussions, and to an anonymous referee for pointing out that Condition 3.1 might be dispensed with.

The research by A. B¨ucher has been supported in parts by the Collaborative Research Center “Statistical modeling of nonlinear dynamic processes” (SFB 823, Project A7) of the German Research Foundation (DFG), which is gratefully acknowledged.

J. Segers gratefully acknowledges funding by contract “Projet d’Actions de Recher-che Concert´ees” No. 12/17-045 of the “Communaut´e fran¸caise de Belgique” and by IAP research network Grant P7/06 of the Belgian government (Belgian Science Policy).

References

B. Auestad and D. Tjøstheim. Identification of nonlinear time series: First order charac-terization and order determination. Biometrika, 77:669–687, 1990.

J. Bai. Weak convergence of the sequential empirical processes of residuals in ARMA models. The Annals of Statistics, 22(4):2051–2061, 1994.

A. B¨ucher. A note on nonparametric estimation of bivariate tail de-pendence. Statistics and Risk Modeling, page in press, 2013a. URL

http://www.ruhr-uni-bochum.de/imperia/md/content/mathematik3/publications/taildep_buec

A. B¨ucher. A note on weak convergence of the sequential multivariate empirical process under strong mixing. Journal of Theoretical Probability, page in press, 2013b.

A. B¨ucher and H. Dette. A note on bootstrap approximations for the empirical copula process. Statistics and Probability Letters, 80(23–24):1925–1932, 2010.

A. B¨ucher and I. Kojadinovic. A dependent multiplier bootstrap for the sequential em-pirical copula process under strong mixing. arXiv:1306.3930, 2013.

(16)

A. B¨ucher and M. Ruppert. Consistent testing for a constant copula under strong mixing based on the tapered block multiplier technique. Journal of Multivariate Analysis, 116: 208–229, 2013.

A. B¨ucher and J. Segers. Extreme value copula estimation based on block maxima of a multivariate stationary time series. arXiv:1311.3060, 2013.

A. B¨ucher and S. Volgushev. Empirical and sequential empirical copula processes under serial dependence. Journal of Multivariate Analysis, 119:61–70, 2013.

P. B¨uhlmann. The blockwise bootstrap in time series and empirical processes. PhD thesis, ETH Z¨urich, 1993. Diss. ETH No. 10354.

M. Cs¨org˝o and L. Horv´ath. Limit theorems in change-point analysis. Wiley Series in Probability and Statistics. John Wiley & Sons, Chichester, UK, 1997.

H. Dehling, D. Vogel, M. Wendler, and D. Wied. An efficient and robust test for a change-point in correlation. arXiv:1203.4871, 2013.

C. Genest and J. Segers. On the covariance of the asymptotic empirical copula process.

Journal of Multivariate Analysis, 101:1837–1845, 2010.

C. Genest, K. Ghoudi, and L.-P. Rivest. Discussion of “Understanding relationships using copulas”, by E. Frees and E. Valdez. North American Actuarial Journal, 3:143–149, 1998.

C. Genest, B. R´emillard, and D. Beaudoin. Goodness-of-fit tests for copulas: A review and a power study. Insurance: Mathematics and Economics, 44:199–213, 2009.

E. Gombay and L. Horv´ath. Change-points and bootstrap. Environmetrics, 10(6), 1999. E. Gombay and L. Horv´ath. Rates of convergence for U-statistic processes and their bootstrapped versions. Journal of Statistical Planning and Inference, 102:247–272, 2002.

M. Hofert, I. Kojadinovic, M. M¨achler, and J. Yan. copula: Multivariate dependence with copulas, 2013. URL http://CRAN.R-project.org/package=copula. R package version 0.999-7.

M. Holmes, I. Kojadinovic, and J-F. Quessy. Nonparametric tests for change-point de-tection `a la Gombay and Horv´ath. Journal of Multivariate Analysis, 115:16–32, 2013. A. Inoue. Testing for distributional change in time series. Econometric Theory, 17(1):

156–187, 2001.

A. Khoudraji. Contributions à l’étude des copules et à la modélisation des valeurs extrêmes bivariées. PhD thesis, Université Laval, Québec, Canada, 1995.

I. Kojadinovic. npcp: Some nonparametric tests for change-point detection in (multi-variate) observations, 2014. URL http://CRAN.R-project.org/package=npcp. R package version 0.0-1.

(17)

I. Kojadinovic, J. Segers, and J. Yan. Large-sample tests of extreme-value dependence for multivariate copulas. The Canadian Journal of Statistics, 39(4):703–720, 2011a. I. Kojadinovic, J. Yan, and M. Holmes. Fast large-sample goodness-of-fit for copulas.

Statistica Sinica, 21(2):841–871, 2011b.

E. Liebscher. Construction of asymmetric multivariate copulas. Journal of Multivariate Analysis, 99:2234–2250, 2008.

E. Paparoditis and D.N. Politis. Tapered block bootstrap. Biometrika, 88(4):1105–1119, 2001.

B. Pfaff and A. McNeil. QRM: Provides R-language Code to Examine Quantitative Risk Management Concepts, 2013. URL http://CRAN.R-project.org/package=QRM. R package version 0.4-9.

J.-F. Quessy, M. Sa¨ıd, and A.-C. Favre. Multivariate Kendall’s tau for change-point detection in copulas. The Canadian Journal of Statistics, 41:65–82, 2013.

R Development Core Team. R: A Language and Environment for Statistical Com-puting. R Foundation for Statistical Computing, Vienna, Austria, 2013. URL

http://www.R-project.org. ISBN 3-900051-07-0.

B. R´emillard. Goodness-of-fit tests for copulas of multivariate time series. Social Science Research Network, 1729982:1–32, 2010. URLhttp://ssrn.com/abstract=1729982. B. R´emillard and O. Scaillet. Testing for equality between two copulas. Journal of

Multivariate Analysis, 100(3):377–386, 2009.

J.P. Romano and M. Wolf. A more general central limit theorem form-dependent random variables with unbounded m. Statistics and Probability Letters, 47:115–124, 2000. O. Scaillet. A Kolmogorov-Smirnov type test for positive quadrant dependence.Canadian

Journal of Statistics, 33:415–427, 2005.

J. Segers. Asymptotics of empirical copula processes under nonrestrictive smoothness assumptions. Bernoulli, 18:764–782, 2012.

A. Sklar. Fonctions de répartition à n dimensions et leurs marges. Publications de l’Institut de Statistique de l’Université de Paris, 8:229–231, 1959.

A. Trapletti and K. Hornik. tseries: Time series analysis and computational finance, 2013. URLhttp://CRAN.R-project.org/package=tseries. R package version 0.10-32.

A.W. van der Vaart and J.A. Wellner. Weak convergence and empirical processes. Springer, New York, 2000. Second edition.

M. van Kampen and D. Wied. A nonparametric constancy test

for copulas under mixing conditions. working paper, 2013. URL

(18)

D. Wied, H. Dehling, M. van Kampen, and D. Vogel. A fluctuation test for constant Spearman’s rho with nuisance-free limit distribution. Computational Statistics and Data Analysis, page in press, 2013.

A

Proof of Proposition 3.3

Let us first introduce additional notation. For integers 1_≤k _≤l_≤n, let Hk:l denote the

empirical c.d.f. of the unobservable sample U_{k, . . . ,}U_l _{and let} _Hk_:_l,j_{, for} _j _{∈ {}₁_{, . . . , d}_}_,

denote its margins. The empirical quantile functions are

H_k−_:_l,j1 (u) = inf_{v _∈[0,1] :Hk:l,j(v)≥u}, u∈[0,1],

which are collected in a vector via

H−1 k:l(u) = H −1 k:l,1(u1), . . . , H −1 k:l,d(ud) , u _∈_[0_,_1]d_.

By convention, the previously defined quantities are all taken equal to zero if k > l. From the proof of Theorem 3.4 in B¨ucher and Kojadinovic (2013), we have that (3.6) holds with C_n _{replaced by} Calt

n , where Calt n (s, t,u) = 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 [1_{U_i _≤H−1 ⌊ns⌋+1:⌊nt⌋(u)} −C(u)], (s, t,u)∈∆×[0,1] d_.

To show (3.6), it remains therefore to prove that sup (s,t,u)∈∆×[0,1]d Calt_n (s, t,u)−C_n(s, t,u) P →0. (A.1)

To do so, we adapt the arguments used in Lemma A.2 of B¨ucher and Segers (2013). Fix 1 _≤ k _≤ l _≤ n and u _∈ _[0_,_1]d_{. For} _i _{∈ {}_{k, . . . , l}_}_{, the} _d _{components of ˆ}Uk:l

i defined

in (2.2) can be expressed as ˆUk:l

ij =Hk:l,j(Uij), j ∈ {1, . . . , d}. Next, notice that

1 Uij _≤H_k−_:_l,j1 (uj) ₋1 Uˆ_ijk:l _≤uj =1 Uij < H_k−_:_l,j1 (uj) +1 Uij =H_k−_:1_l,j(uj) −1 Uˆk:l ij < uj −1 Uˆk:l ij =uj =1 Uij =H_k−_:_l,j1 (uj) ₋1 Uˆ_ijk:l =uj ,

as x < H−1₍_u_{) if and only} _H₍_x₎ _{< u} _{for any distribution function} _H_{. Since ˆ}_Uk:l ij = Hk:l,j(Uij) =uj implies Uij =Hk−:1l,j(uj), we obtain that

0_≤1

Uij _≤H_k−_:_l,j1 (uj) ₋1 Uˆ_ijk:l _≤uj

≤1

Uij =H_k−_:1_l,j(uj) .

Combining the previous inequality with the decomposition 1_{U_i _≤H−1 k:l(u)} −1( Ûik:l ≤u) = d X p=1 " Y 1≤j≤p 1_{Uij _≤H_k−_:_l,j1 (uj)_} Y p<j≤d 1( Û_ijk:l_≤uj) − Y 1≤j≤p−1 1_{Uij ≤Hk−:1l,j(uj)} Y p−1<j≤d 1( Û_ijk:l_≤uj) # ,

(19)

we obtain that 0_≤1_{U_i _≤H−1 k:l(u)} −1( ˆUik:l ≤u)≤ d X j=1 1 Uij =H_k−_:_l,j1 (uj) . (A.2)

It follows that the supremum in (A.1) is smaller than

d X j=1 sup (s,t,u)∈∆×[0,1] 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 1 Uij =H_⌊−_ns1_⌋_+1:_⌊_nt_⌋_,j(u) _≤ d X j=1 sup u∈[0,1] 1 √ n n X i=1 1(Uij =u).

Using the fact that 1 Uij =u

≤1 Uij _≤u

−1 Uij _≤u₋1/n

, the latter is smaller

d sup

u,v∈[0,1]d

ku−vk1≤n−1

|B_n₍₀_,₁_,u₎₋B_n₍₀_,₁_,v₎_|₊_dn−1/2_,

where B_n _{is defined in (3.4). Using the asymptotic uniform equicontinuity in probability}

ofB_n_{established in Lemma 2 of B¨}_{ucher (2013b), we finally obtain (A.1), which completes}

the proof.

B

Proof of Proposition 4.3

We shall only prove the result in the case of strongly mixing observations, that is, when Condition 4.1(ii) is assumed. The proof is similar but simpler when Condition 4.1(i) is assumed instead.

It is sufficient to show the statement involving ˇC(_nm)_{. The statements for ˇ}D(_nm) _and

ˇ

Sn(m) then follow from the continuous mapping theorem.

For any m_{∈ {}1, . . . , M_} and (s, t,u₎_∈_∆_×_[0_,_1]d_{, put}

B(m) n (s, t,u) = 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m)_{1(U_i _≤u₎₋_C₍u₎_}_, _(B.1) C(m) n (s, t,u) = B(nm)(s, t,u)− d X j=1 ˙ Cj(u₎B(m) n (s, t,u(j)).

[Recall that u(j) _{= (1}_{, . . . ,}₁_{, u}_j_,₁_{, . . . ,}₁₎ _∈_[0_,_1]d_{, with} _u_j _{appearing at the} _j_-th

coordi-nate.] From Theorem 2.1 in B¨ucher and Kojadinovic (2013), we have that

B_n,B(1) n , . . . ,B(nM) _B C,B(1)_C , . . . ,B(_CM) in_{ℓ∞_(∆_×_[0_,_1]d₎_}M+1_{, where} _B(1) C , . . . ,B (M)

C are independent copies of BC in (3.5), and

thus, from the continuous mapping theorem and (3.6), we find that

C_n,C(1) n , . . . ,C(nM) C_C_,C(1) C , . . . ,C (M) C

(20)

in _{ℓ∞_(∆_×_[0_,_1]d₎_}M+1_{. It is therefore sufficient to show that} sup (s,t,u)∈∆×[0,1]d ( ˇC(_nm)−C(_nm))(s, t,u) P →0 (B.2)

for every m_{∈ {}1, . . . , M_}. Below, we will show the following two assertions: first, sup (s,t,u)∈∆×[0,1]d (ˇB(_nm)−B(_nm))(s, t,u) P →0, (B.3)

and second, for every δ_∈(0,1/2) and every ε_∈(0,1), sup u∈[0,1]d δ≤uj≤1−δ sup (s,t)∈[0,1]2 t−s≥ε Cj,˙ _⌊_ns_⌋_+1:_⌊_nt_⌋(u)−Cj˙ (u) P →0. (B.4)

In view of the structure of ˇC(_nm) _{in (4.3), the assertions (B.3) and (B.4) imply (B.2), as}

we show next. Clearly,

( ˇC(_nm)−C(_nm))(s, t,u) ≤ (ˇB(_nm)−B(_nm))(s, t,u) + d X j=1 C˙_j,_⌊_ns_⌋_+1:_⌊_nt_⌋(u) (ˇB(_nm)−B_n(m))(s, t,u(j)) + d X j=1 Cj,˙ _⌊_ns_⌋_+1:_⌊_nt_⌋(u)−Cj˙ (u) B(_nm)(s, t,u(j)) . (B.5)

Taking suprema over (s, t,u₎_∈_∆_×_[0_,_1]d_{, the first and the second term on the right-hand}

side of (B.5) converge to zero in probability because of assertion (B.3) and uniform bound-edness of ˙Cj,k:l(see Kojadinovic et al., 2011a, proof of Proposition 2). The third term on

the right-hand side of (B.5) converges to zero in probability because of assertion (B.4) and the fact that (s, t,u₎_7→B(_nm)₍_{s, t,}u(j)_{) vanishes as soon as}_s₌_t_or_uj _{∈ {}₀_,₁_}_{, and is}

asymptotically uniformly equicontinuous in probability as a consequence of Lemma A.3 in B¨ucher and Kojadinovic (2013).

It remains to show (B.3) and (B.4). The proof of the latter assertion is simplest and is given first.

Proof of (B.4). Observe that

C⌊ns⌋+1:⌊nt⌋(u) =C(u) +

1

√

n λn(s, t)Cn(s, t,u).

Fixδ _∈(0,1/2) andε_∈(0,1). Without loss of generality, assume that n is large enough so that the bandwidth h=hn(s, t) = 1/p_⌊nt_{⌋ − ⌊}ns_⌋ is less than δ whenever t₋s_≥ε. Then, for t₋s_≥ε and u_∈_[0_,_1]d _with _δ_≤_uj _≤₁₋_δ_{, we have}

˙ Cj,⌊ns⌋+1:⌊nt⌋(u) = 1 2h{C(u+hej)−C(u−hej)} + 1 2h√n λn(s, t){Cn(s, t,u+hej)−Cn(s, t,u−hej)}.

(21)

By the assumption of existence and continuity of ˙Cj onVj (see Condition 3.2), and since 0_≤Cj˙ _≤1, it follows from the mean-value theorem that

sup u∈[0,1]d δ≤uj≤1−δ 1 2h{C(u+hej)−C(u−hej)} −Cj˙ (u) → 0, h_→0.

Using (3.6) and the fact thatB_n_{is asymptotically uniformly equicontinuous in probability,}

it can be verified that C_n _{is asymptotically uniformly equicontinuous in probability as}

well. It follows that sup u∈[0,1]d δ≤uj≤1−δ sup (s,t)∈[0,1]2 t−s≥ε C_n(s, t,u+he_j)−C_n(s, t,u−he_j) P →0. Finally, 1 2h√n λn(s, t) = 1 2pλn(s, t) ≤ 1 2pε₋1/n.

Combine the four previous displays to arrive at the desired conclusion.

The proof of (B.3) is more complicated. Using the notation introduced in Appendix A, let us define the auto-centered version of the process B(_nm) _{in (B.1) as}

˚B(m) n (s, t,u) = 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m)_{1(U_i _≤u₎₋_H_⌊_ns_⌋_+1:_⌊_nt_⌋₍u₎_} = √1 n ⌊nt⌋ X i=⌊ns⌋+1 (ξ(_i,nm)₋ξ¯_⌊(m_ns)_⌋_+1:_⌊_nt_⌋)1(U_i _≤u₎_, _(B.6)

with the usual convention that empty sums are zero.

Proof of (B.3). Consider the decomposition

(ˇB(_nm)−B(_nm))(s, t,u) ≤ Bˇ(_nm)(s, t,u)−˚B_n(m)(s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u)) + ˚B(_nm)(s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u))−B(_nm)(s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u)) + B(_nm)(s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u))−B(_nm)(s, t,u) .

Write out the definitions of the processes B(_nm)_{, ˚}B_n(m) _{and ˇ}B(_nm) _{in (B.1), (B.6) and (4.2),}

respectively, and take suprema over (s, t,u₎_∈_∆_×_[0_,_1]d _{to obtain}

sup (s,t,u)∈∆×[0,1]d (ˇB(_nm)−B(_nm))(s, t,u) ≤ sup (s,t,u)∈∆×[0,1]d Bˇ(_nm)(s, t,u)−˚B_n(m)(s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u)) (B.7) + sup (s,t,u)∈∆×[0,1]d 1 √_n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m) H_⌊_ns_⌋_+1:_⌊_nt_⌋(u)−C(u) (B.8) + sup (s,t,u)∈∆×[0,1]d B(_nm)(s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u))−B(_nm)(s, t,u) . (B.9)

(22)

1- The term (B.8): Let δ _∈ (0,1/2), to be specified later. We split the supremum into two parts, according to whether t₋s is smaller or larger thanan =n−1/2−δ_:

An,1 = sup (s,t,u)∈∆×[0,1]d t−s≤an 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m) H_⌊_ns_⌋_+1:_⌊_nt_⌋(u)−C(u) , An,2 = sup (s,t,u)∈∆×[0,1]d t−s≥an 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m) H_⌊_ns_⌋_+1:_⌊_nt_⌋(u)−C(u) .

We will show that both An,1 and An,2 converge to zero in probability.

(a) Since both Hk:l and C take values in [0,1], a crude bound forAn,1 is

An,1 ≤

1

√

n(n an+ 1) max1≤i≤n|ξ

(m)

i,n | ≤2n−δ ₁max_≤_i_≤_n|ξi,n|.

Using the fact that, from (M1), for any ν _≥ 1, supn≥1E[|ξ (m)

1,n|ν] < ∞, we have

that, for every α >0 andν _≥1 such that ν > 1/α, P max 1≤i≤n|ξ (m) i,n | ≥nα ≤nP _|ξ(₁m_,n)_{| ≥}nα ≤n1−να E[_|ξ₁(m_,n)_|ν]_→0.

Apply the previous display with α _∈ (0, δ) to find that An,1 converges to zero in

probability.

(b) Recall B_n _{in (3.4). Observe that}

B_n₍_{s, t,}u_{) =} ⌊nt⌋ − ⌊_√ ns⌋ n {H⌊ns⌋+1:⌊nt⌋(u)−C(u)}. We have An,2 = sup (s,t,u)∈∆×[0,1]d t−s≥an 1 ⌊nt_{⌋ − ⌊}ns_⌋ ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m) B_n(s, t,u) ≤ sup ⌊ns⌋<⌊nt⌋ ⌊nt⌋+1−⌊ns⌋≥nan 1 ⌊nt_{⌋ − ⌊}ns_⌋ ⌊nt⌋ X i=⌊ns⌋+1 ξ_i,n(m) sup (s,t,u)∈∆×[0,1]d B_n(s, t,u) ≤ max 1≤k≤l≤n l−k≥nan 1 l₋k+ 1 l X i=k ξ(_i,nm) sup (s,t,u)∈∆×[0,1]d B_n(s, t,u) .

By weak convergence B_n B_C _in _ℓ∞_(∆ _×_[0_,_1]d_{), the supremum at the end of}

the previous display is bounded in probability. Writing bn = nan, it is sufficient to show that max 1≤k≤l≤n l−k≥bn 1 l₋k+ 1 l X i=k ξ_i,n(m) P →0, n _{→ ∞}.

(23)

Fixη >0. The probability that the previous maximum exceeds η is bounded by X 1≤k≤l≤n l−k≥bn P 1 l₋k+ 1 l X i=k ξ_i,n(m) > η . (B.10)

Fix ν _≥ 2, to be specified later. By stationarity and Markov’s inequality, the previous expression is bounded by

X 1≤k≤l≤n l−k≥bn η−ν(l₋k+ 1)−ν E l−k+1 X i=1 ξ_i,n(m) ν ≤η−νn X bn≤r≤n r−ν E r X i=1 ξ(_i,nm) ν .

Recall that the sequence (ξ_i,n(m))i∈Z isℓ_n-dependent from (M2) and assume thatnis sufficiently large so thatn _≥2ℓn+bn. Then, by Corollary A.1 in Romano and Wolf (2000), there exists a constant Cν, depending only on ν, such that

E r X i=1 ξ_i,n(m) ν ≤C_νν(4ℓnr)ν/2 E[_|ξ₁(m_,n)_|ν].

Using the fact that, from (M1), sup_n_≥₁E[_|ξ₁(_,nm)_|ν_]_<_∞_{, and up to a multiplicative}

constant, the expression in (B.10) is bounded by

n X bn≤r≤n r−ν(ℓnr)ν/2 _≤n2b−nν/2ℓν/n 2 =O n2−(1/2−δ)ν/2+(1/2−γ)ν/2 =O n2+(δ−γ)ν/2 .

The right-hand side converges to zero if we chooseδ=γ/2 and then ν >8/γ. 2- The term (B.9): We have to show that, for every η, λ >0,

P " sup (s,t,u)∈∆×[0,1]d B(_nm){s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u)} −B(_nm)(s, t,u) > λ # ≤η,

for all sufficiently large n.

Fix η, λ > 0. Since (B.9) is smaller than 2 sup(s,t,u)∈∆×[0,1]d|B

(m)

n (s, t,u)|, and using

the fact that B(_nm) _{vanishes on the diagonal} _s ₌ _t _{and is asymptotically uniformly}

equicontinuous in probability, there exists ε _∈ (0,1) such that, for all n sufficiently large, P    sup (s,t,u)∈∆×[0,1]d t−s<ε B(_nm){s, t,H_⌊−_ns1_⌋_+1:_⌊_nt_⌋(u)} −B(_nm)(s, t,u) > λ   ≤η/2.

(24)

Setting ζn = sup(s,t,u)∈∆×[0,1]d t−s≥ε k

H−1

⌊ns⌋+1:⌊nt⌋(u)−uk1, we shall now show that, for alln

sufficiently large, An= P      sup (s,t,u,v)∈∆×[0,1]2d t−s≥ε,ku−vk1≤ζn B(_nm)(s, t,u)−B(_nm)(s, t,v) > λ      ≤η/2,

which will complete the proof. Using again the asymptotic uniformly equicontinuity in probability of B(_nm)_{, there exists} _µ_∈₍₀_,_{1) such that, for all} _n _{sufficiently large,}

An,1 = P      sup (s,t,u,v)∈∆×[0,1]2d t−s≥ε,ku−vk1≤µ B(_nm)(s, t,u)−B(_nm)(s, t,v) > λ      ≤η/4.

We then boundAnbyAn,1+An,2, whereAn,2 = P(ζn> µ). From the weak convergence

of B_n _toB_C _in_ℓ∞_(∆_×_[0_,_1]d_{), we have that} sup (s,t,u)∈∆×[0,1]d t−s≥ε |H⌊ns⌋+1:⌊nt⌋(u)−C(u)| ≤ sup (s,t,u)∈∆×[0,1]d |B_n₍_{s, t,}u₎_{| ×}_n−1/2_× _sup (s,t)∈∆ t−s≥ε {λn(s, t)_}−1 P_→0.

Using the fact that sup_u_∈_[0_,_1]_|H⌊ns⌋+1:⌊nt⌋,j(u)−u|= supu∈[0,1]|H⌊−ns1⌋+1:⌊nt⌋,j(u)−u| for j _{∈ {}1, . . . , d_} (for instance, by symmetry arguments on the graphs of H⌊ns⌋+1:⌊nt⌋,j

and H_⌊−_ns1_⌋_+1:_⌊_nt_⌋_,j), we immediately obtain that ζn _→P 0, which implies that, for all n

sufficiently large, An,2 ≤η/4, and thus that, for all n sufficiently large, An ≤η/2.

3- The term (B.7): For the following arguments, it is sufficient to assume that the sequence (ξ(_i,nm))i∈Z appearing in ˇB(nm) and ˚B(nm) satisfies only (M1) with E[{ξ0(,nm)}2]>0

not necessarily equal to one.

LetK >0 be a constant and let us first suppose that, for anyn _≥1 andi_{∈ {}1, . . . , n_},

ξ_i,n(m) _{≥ −}K. With (A.2) in mind, the term (B.7) is smaller thanAn,1+An,2, where

An,1 = sup (s,t,u)∈∆×[0,1]d 1 √ n ⌊nt⌋ X i=⌊ns⌋+1 (ξ_i,n(m)+K)h1_{U_i _≤H−1 ⌊ns⌋+1:⌊nt⌋(u)} −1( ˆU ⌊ns⌋+1:⌊nt⌋ i ≤u) i and An,2 = sup (s,t,u)∈∆×[0,1]d K+ ¯ξ_⌊(m_ns)_⌋_+1:_⌊_nt_⌋ √ n ⌊nt⌋ X i=⌊ns⌋+1 h 1_{U_i _≤H−1 ⌊ns⌋+1:⌊nt⌋(u)} −1( ˆU ⌊ns⌋+1:⌊nt⌋ i ≤u) i .

Let us first show that An,1 P

→0. Plugging (A.2) into the expression ofAn,1, we bound

An,1 by An,1,1+· · ·+An,1,d, where An,1,j = sup (s,t,u)∈∆×[0,1] 1 √_n ⌊nt⌋ X i=⌊ns⌋+1 (ξ_i,n(m)+K)1 Uij =u .