Validity of the Parametric Bootstrap Procedure

6.4 Parametric Bootstrap Procedure for Goodness-of-Fit Testing

6.4.2 Validity of the Parametric Bootstrap Procedure

Motivation

Consider testing the appropriateness of various dependence structures on the basis of a random sample X₁ = (X₁₁, . . . , X_1d), . . . , X_n = (X_n1, . . . , X_nd) from a continuous random vector X with cumulative distribution function F. Denote by F₁, . . . , F_d the univariate marginal distribution functions of X and let C : [0, 1]^d → [0, 1] be the copula associated with F. Then C is the cumulative distribution function of U = ξ(X), where ξ : R^d → R^d is defined for all x₁, . . . , x_d ∈ R by

ξ(x₁, . . . , x_d) = (F₁(x₁), . . . , F_d(x_d)).

The vectors U₁ = ξ(X₁), . . . , U_n = ξ(X_n) are only observed if the marginals F₁, . . . , F_d are known. However, F_j can be estimated by

F_jn(t) = (1/(n + 1)) Σ_{i=1}^{n} 1{X_ij ≤ t},

for all t ∈ R and j ∈ {1, . . . , d}.
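As an illustration, evaluating the rescaled empirical margins F_jn at the sample points amounts to dividing componentwise ranks by n + 1. The following minimal sketch assumes NumPy and a sample without ties (which holds almost surely for continuous X); the function name `pseudo_observations` is hypothetical.

```python
import numpy as np

def pseudo_observations(x):
    """Return the n x d array with entries F_jn(X_ij) = rank(X_ij) / (n + 1)."""
    x = np.asarray(x, dtype=float)
    n = x.shape[0]
    # Applying argsort twice yields 0-based ranks within each column (no ties assumed).
    ranks = np.argsort(np.argsort(x, axis=0), axis=0) + 1
    return ranks / (n + 1.0)

# Toy sample with n = 3 observations in dimension d = 2.
x = np.array([[0.3, 10.0],
              [1.2, -4.0],
              [0.7, 2.5]])
u_hat = pseudo_observations(x)
```

Dividing by n + 1 rather than n keeps every coordinate strictly inside (0, 1), which avoids boundary problems when a copula density or distribution function is evaluated at these points.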

Letting ξ_n(x₁, . . . , x_d) = (F_1n(x₁), . . . , F_dn(x_d))^T, for all x₁, . . . , x_d ∈ R, we can base a test of the hypothesis

H₀ : C ∈ C = {C_θ : θ ∈ O}

on the pseudo-observations

Û₁ = ξ_n(X₁), . . . , Û_n = ξ_n(X_n).

One way is to base the test on a statistic

S_n= φ(G^Cn) (6.102)

with

G^Cn = n^1/2(C_n− C_θ_n),

where C_θ_n is a parametric estimate of C_θ derived from an estimate θ_n = T(X₁, . . . , X_n) of θ under H₀, while C_n is the empirical copula defined for all u ∈ [0, 1]^d by

C_n(u) = (1/n) Σ_{i=1}^{n} 1{Û_i ≤ u}.
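The empirical copula and one possible functional φ can be sketched as follows. The Cramér-von Mises form S_n = Σ_i (C_n(Û_i) − C_θ_n(Û_i))² is an assumption chosen for illustration, since the text only requires φ to be continuous. The function names are hypothetical, and the independence copula stands in for the fitted C_θ_n.

```python
import numpy as np

def empirical_copula(u_hat, points):
    """C_n(u) = proportion of pseudo-observations that are <= u componentwise."""
    u_hat = np.asarray(u_hat, dtype=float)
    points = np.atleast_2d(np.asarray(points, dtype=float))
    # below[k, i] is True when U_hat_i <= points[k] in every coordinate.
    below = np.all(u_hat[None, :, :] <= points[:, None, :], axis=2)
    return below.mean(axis=1)

def cvm_statistic(u_hat, copula_cdf):
    """Sum of squared gaps between C_n and a hypothesised copula at the U_hat_i."""
    gap = empirical_copula(u_hat, u_hat) - copula_cdf(u_hat)
    return float(np.sum(gap ** 2))

def independence_cdf(u):
    """Independence copula, used here only as a stand-in for C_theta_n."""
    return np.prod(np.asarray(u, dtype=float), axis=1)

u_hat = np.array([[0.25, 0.75], [0.75, 0.25], [0.5, 0.5]])
c_half = empirical_copula(u_hat, [[0.5, 0.5]])
s_n = cvm_statistic(u_hat, independence_cdf)
```

Evaluating the squared gap at the pseudo-observations themselves corresponds to integrating the squared process n^1/2(C_n − C_θ_n) with respect to C_n.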

Another way is to base the test on Kendall's distribution, i.e., the distribution function K of the probability integral transformation W = F(X). Since W can be written in the form W = C(U), a consistent estimator of K is given by the empirical distribution function K_n of the pseudo-observations Ŵ₁ = C_n(Û₁), . . . , Ŵ_n = C_n(Û_n), defined by

K_n(w) = (1/n) Σ_{i=1}^{n} 1{Ŵ_i ≤ w}.
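A minimal sketch of this estimator, assuming NumPy and hypothetical function names, first computes Ŵ_i = C_n(Û_i) and then the empirical distribution function of the Ŵ_i. The comonotone toy sample is only meant to make the values easy to check by hand.

```python
import numpy as np

def kendall_pseudo_observations(u_hat):
    """W_hat_i = C_n(U_hat_i): share of points dominated componentwise by U_hat_i."""
    u_hat = np.asarray(u_hat, dtype=float)
    # dominated[k, i] is True when U_hat_i <= U_hat_k in every coordinate.
    dominated = np.all(u_hat[None, :, :] <= u_hat[:, None, :], axis=2)
    return dominated.mean(axis=1)

def empirical_kendall(w_hat, w):
    """K_n(w) = proportion of W_hat_i that are <= w, for each requested w."""
    w_hat = np.asarray(w_hat, dtype=float)
    w = np.atleast_1d(np.asarray(w, dtype=float))
    return (w_hat[None, :] <= w[:, None]).mean(axis=1)

# Perfectly ordered (comonotone) pseudo-observations with n = 4.
u_hat = np.array([[0.2, 0.2], [0.4, 0.4], [0.6, 0.6], [0.8, 0.8]])
w_hat = kendall_pseudo_observations(u_hat)
k_n = empirical_kendall(w_hat, [0.5, 1.0])
```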

Therefore, if K_θ denotes the distribution function of W when C = C_θ for some θ ∈ O, and if K_θ_n is a parametric estimate of K_θ derived from θ_n = T(X₁, . . . , X_n) under the subsidiary hypothesis

H₀^K : K ∈ K = {K_θ : θ ∈ O},

a goodness-of-fit test can be based on a continuous functional

S_n = φ(G^Kn) (6.103)

with

G^Kn = n^1/2(K_n − K_θ_n).

Whether H₀^C is tested using G^Cn or H₀^K is tested using G^Kn, the limiting distribution of the test statistic S_n depends not only on the unknown parameter θ but possibly also on the nuisance parameters F₁, . . . , F_d. Although a parametric bootstrap may help to find valid P-values, its validity cannot be deduced from the results of Stute et al. [52] (see [23]), because the pseudo-observations Û₁, . . . , Û_n and Ŵ₁, . . . , Ŵ_n are dependent. It then becomes necessary to establish the validity of the parametric bootstrap in situations where the hypothesis to be tested

Chapter 6. A Review of Goodness-of-Fit Test Statistics 79

concerns the distribution P of an unobservable s-variate random vector U , viz.

H₀ : P ∈ P = {P_θ : θ ∈ O},

where O is an open subset of R^p, and U = ξ(X) for some function ξ : R^d→ R^s of an observable d-variate random vector X.

In order to encompass procedures based on G^Cn and G^Kn as special cases, suppose that a test of H₀ is to be derived from a continuous functional

S_n = φ(G^An) (6.104)

of an abstract empirical process of the form

G^An = n^1/2(A_n− A_θ_n),

where A_θ_n and A_n are, respectively, parametric and nonparametric estimates of an abstract quantity A that depends on P. More generally, A is a function mapping a closed rectangle T ⊂ [−∞, ∞] into R^s, and A_θ denotes the form taken by A when P = P_θ for some θ ∈ O. In particular, T = [0, 1]^d, s = d and A_θ = C_θ for the test based on G^Cn, while T = [0, 1], s = 1 and A_θ = K_θ for the test based on G^Kn.

In order to show that the parametric bootstrap yields a valid approximation to the null distribution of the empirical process G^An under appropriate conditions, the processes

Θ_n = n^1/2(θ_n − θ)

and

𝔸_n = n^1/2(A_n − A)

need to converge weakly, as n → ∞, respectively to a centered random variable Θ and a centered process 𝔸 in the space D(T, R^s) of càdlàg processes from T to R^s.

Symbolically, we write

Θ_n = n^1/2(θ_n − θ) ⇝ Θ (6.105)

and

𝔸_n = n^1/2(A_n − A) ⇝ 𝔸. (6.106)

Validity of the One-Level Parametric Bootstrap

Let U₁, . . . , U_n be a random sample from some distribution P , and assume that we want to test the hypothesis

H₀ : P ∈ P = {P_θ : θ ∈ O},

where P is a family of probability measures on R^d indexed by the parameter θ living in an open set O ⊂ R^p. Assume that P is identifiable, i.e.,

θ ≠ θ′ ⇒ P_θ ≠ P_θ′.

Let T ⊂ [−∞, ∞] be a closed rectangle and suppose that the test of H₀ is to be based on an abstract mapping A : T → R^s. Suppose that A = A_θ when P = P_θ, and let 𝒜 = {A_θ : θ ∈ O}.

Identifiability is then ensured if, for each ε > 0,

inf{ sup_{t∈T} ‖A_θ(t) − A_θ₀(t)‖ : θ ∈ O and |θ − θ₀| > ε } > 0.

Furthermore, assume that the mapping θ ↦ A_θ is Fréchet differentiable with derivative Ȧ_θ, i.e., for all θ₀ ∈ O,

lim_{‖h‖→0} sup_{t∈T} ‖A_{θ₀+h}(t) − A_θ₀(t) − Ȧ_θ₀(t) h‖ / ‖h‖ = 0. (6.107)
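To make condition (6.107) concrete, the following sketch checks it numerically for the bivariate Clayton family C_θ(u, v) = (u^−θ + v^−θ − 1)^(−1/θ), which is an example chosen here and not one singled out by the text. The supremum over T is replaced by a maximum over a finite interior grid, so this is only an illustration and not a proof. The function names and the value θ₀ = 2 are assumptions.

```python
import numpy as np

def clayton(u, v, theta):
    """Bivariate Clayton copula, defined for theta > 0."""
    return (u ** -theta + v ** -theta - 1.0) ** (-1.0 / theta)

def clayton_dtheta(u, v, theta):
    """Analytic derivative of the Clayton copula with respect to theta."""
    s = u ** -theta + v ** -theta - 1.0
    # neg_ds equals minus the derivative of s with respect to theta.
    neg_ds = u ** -theta * np.log(u) + v ** -theta * np.log(v)
    return clayton(u, v, theta) * (np.log(s) / theta ** 2 + neg_ds / (theta * s))

grid = np.linspace(0.05, 0.95, 19)
u, v = np.meshgrid(grid, grid)
theta0 = 2.0

def remainder_ratio(h):
    """Grid version of sup_t |A_{theta0+h}(t) - A_{theta0}(t) - Adot(t) h| / |h|."""
    rem = (clayton(u, v, theta0 + h) - clayton(u, v, theta0)
           - clayton_dtheta(u, v, theta0) * h)
    return float(np.max(np.abs(rem)) / abs(h))

ratios = [remainder_ratio(h) for h in (1e-1, 1e-2, 1e-3)]
```

The ratios should shrink roughly in proportion to h, which is the behaviour required by (6.107) for a scalar parameter.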

Finally, let θ_n = T_n(U₁, . . . , U_n) be a consistent estimate of θ and assume that the D(T, R^s)-valued process A_n = Υ_n(U₁, . . . , U_n) estimates A consistently. Suppose specifically that the processes Θ_n = n^1/2(θ_n − θ) and 𝔸_n = n^1/2(A_n − A) have centered Gaussian limits as n → ∞, as in (6.105) and (6.106).

Before giving the conditions under which the weak limits of the processes G^An = n^1/2(A_n − A_θ_n) and G^An* = n^1/2(A*_n − A_θ*_n) are independent and identically distributed, which in turn guarantees that a parametric bootstrap based on the process 𝔸_n is valid, let us give the following definitions (see [23]).

Definition 6.2. A family P = {P_θ : θ ∈ O} is said to belong to the class S(λ) for a given measure λ (independent of θ) if

1. The measure P_θ is absolutely continuous with respect to λ for all θ ∈ O.

2. The density p_θ = dP_θ/dλ admits first- and second-order derivatives with respect to all components of θ ∈ O. The gradient (row) vector with respect to θ is denoted ṗ_θ, and the Hessian

converges weakly in D(T, R^s) × R^p to a centered Gaussian pair (𝔸, W_P) and the Fréchet derivative Ȧ of A defined by Equation (6.107) satisfies

Ȧ(t) = E[𝔸(t) W_P^T]

and

𝔸*_n = n^1/2(A*_n − A).

The conditions under which the weak limits of the processes G^An = n^1/2(A_n − A_θ_n) and G^An* = n^1/2(A*_n − A_θ*_n) are independent and identically distributed are given by the following theorem, which thereby guarantees that a parametric bootstrap based on the process 𝔸_n is valid.

Theorem 6.15. Assume that P ∈ S(λ) and that, as n → ∞,

(𝔸_n, Θ_n, W_P,n) ⇝ (𝔸, Θ, W_P) in D(T, R^s) × (R^p)^⊗2,

where the limit is a centered Gaussian process.

Let Γ = E[Θ W_P^T] and set a(t) = E[𝔸(t) W_P^T] for every t ∈ T. Then, as n → ∞,

(𝔸_n, 𝔸*_n, Θ_n, Θ*_n) ⇝ (𝔸, 𝔸*, Θ, Θ*) in D(T, R^s)^⊗2 × (R^p)^⊗2.

In the limit, 𝔸* = 𝔸^⊥ + aΘ and Θ* = Θ^⊥ + ΓΘ are defined in terms of an independent copy (𝔸^⊥, Θ^⊥) of (𝔸, Θ). If, in addition, (A_n, θ_n) is P-regular for 𝒜 × O, then

(G^An, G^An*) ⇝ (𝔸 − ȦΘ, 𝔸^⊥ − ȦΘ^⊥) in D(T, R^s)^⊗2, as n → ∞.
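In practice, the theorem justifies comparing the observed statistic with statistics recomputed on samples drawn from the fitted model. The sketch below is one possible instance under several assumptions not taken from the text: a bivariate Clayton family, estimation of θ by inversion of Kendall's tau, a Cramér-von Mises functional, rank-based pseudo-observations in place of the unobservable U_i, and an approximate P-value computed as the proportion of bootstrap statistics exceeding S_n. All function names are hypothetical.

```python
import numpy as np

def pseudo_obs(x):
    """Componentwise ranks divided by n + 1 (no ties assumed)."""
    return (np.argsort(np.argsort(x, axis=0), axis=0) + 1) / (x.shape[0] + 1.0)

def kendall_tau(u):
    """Sample Kendall's tau of a bivariate sample, O(n^2) version."""
    sx = np.sign(u[:, None, 0] - u[None, :, 0])
    sy = np.sign(u[:, None, 1] - u[None, :, 1])
    n = u.shape[0]
    return float((sx * sy).sum() / (n * (n - 1)))

def fit_clayton(u):
    """theta_n = 2 tau / (1 - tau); tau is clipped as an ad hoc safeguard."""
    tau = min(max(kendall_tau(u), 1e-3), 0.95)
    return 2.0 * tau / (1.0 - tau)

def clayton_cdf(u, theta):
    return (u[:, 0] ** -theta + u[:, 1] ** -theta - 1.0) ** (-1.0 / theta)

def sample_clayton(n, theta, rng):
    """Frailty (Marshall-Olkin) sampler with a Gamma(1/theta, 1) frailty."""
    frailty = rng.gamma(shape=1.0 / theta, scale=1.0, size=(n, 1))
    expo = rng.exponential(size=(n, 2))
    return (1.0 + expo / frailty) ** (-1.0 / theta)

def cvm_stat(u_hat, theta):
    """Sum over i of (C_n(U_hat_i) - C_theta(U_hat_i))^2."""
    emp = np.all(u_hat[None, :, :] <= u_hat[:, None, :], axis=2).mean(axis=1)
    return float(np.sum((emp - clayton_cdf(u_hat, theta)) ** 2))

def one_level_bootstrap(x, n_boot=200, seed=0):
    rng = np.random.default_rng(seed)
    u_hat = pseudo_obs(x)
    theta_n = fit_clayton(u_hat)
    s_n = cvm_stat(u_hat, theta_n)
    s_star = np.empty(n_boot)
    for k in range(n_boot):
        # Draw from the fitted model, then redo every step on the new sample.
        u_star = pseudo_obs(sample_clayton(x.shape[0], theta_n, rng))
        s_star[k] = cvm_stat(u_star, fit_clayton(u_star))
    return s_n, float(np.mean(s_star > s_n))

x = sample_clayton(100, 2.0, np.random.default_rng(1))
s_n, p_value = one_level_bootstrap(x, n_boot=50)
```

Re-estimating θ on each bootstrap sample is essential: it reproduces the −ȦΘ^⊥ term in the limit of the bootstrapped process, which would be lost if θ_n were reused.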

A Two-Level Parametric Bootstrap

When performing a goodness-of-fit test based on a continuous functional S_n = φ(G^An) of a process G^An = n^1/2(A_n − A_θ_n), we have to compute A_θ_n at various points, but this is not always easy. For tests based on the empirical copula, we have A_θ_n = C_θ_n, and many copula families do not have a closed-form expression. In this case, a simple way to solve the problem is to generate a random sample V*₁, . . . , V*_m from the probability measure Q_θ_n with distribution function C_θ_n and, for u ∈ [0, 1]^d, to approximate C_θ_n(u) by

Č*_n(u) = (1/m) Σ_{j=1}^{m} 1{V*_j ≤ u}.
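The Monte Carlo approximation can be sketched as follows. The bivariate Clayton family is used purely as an assumption for illustration: it does have a closed form, which makes it possible to check the approximation error. The function names, m = 20000 and θ = 2 are likewise hypothetical choices.

```python
import numpy as np

def sample_clayton(m, theta, rng):
    """Frailty (Marshall-Olkin) sampler for the bivariate Clayton copula."""
    frailty = rng.gamma(shape=1.0 / theta, scale=1.0, size=(m, 1))
    expo = rng.exponential(size=(m, 2))
    return (1.0 + expo / frailty) ** (-1.0 / theta)

def mc_copula_cdf(points, v_star):
    """Proportion of simulated vectors V*_j that are <= u, for each row u of points."""
    points = np.atleast_2d(np.asarray(points, dtype=float))
    below = np.all(v_star[None, :, :] <= points[:, None, :], axis=2)
    return below.mean(axis=1)

rng = np.random.default_rng(0)
theta = 2.0
v_star = sample_clayton(20000, theta, rng)

pts = np.array([[0.3, 0.6], [0.5, 0.5], [0.9, 0.2]])
approx = mc_copula_cdf(pts, v_star)
# Closed-form Clayton values, available here only because of the chosen family.
exact = (pts[:, 0] ** -theta + pts[:, 1] ** -theta - 1.0) ** (-1.0 / theta)
max_error = float(np.max(np.abs(approx - exact)))
```

Each value of the approximation is a binomial proportion, so its standard error is at most 1/(2 m^1/2), which is about 0.0035 for m = 20000.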

In other words, we replace A_θ_n by an approximation Ǎ*_n = Ψ_m(V*₁, . . . , V*_m) built from a random sample V*₁, . . . , V*_m from

Q_θ_n ∈ Q = {Q_θ : θ ∈ O}.

For the approach to make sense, we assume that if A = A_θ₀ and Ǎ_n = Ψ_m(V₁, . . . , V_m) for a random sample V₁, . . . , V_m from Q = Q_θ₀, then

𝔸̌_n = n^1/2(Ǎ_n − A) ⇝ 𝔸̌ ∈ D(T, R^s), (6.108)

as n → ∞ (and hence m → ∞).

Given that such a process exists, the following method can be used to circumvent the lack of a closed form for A_θ_n in the computation of the test statistic S_n.

1. Compute θ_n = T_n(U₁, . . . , U_n) and let A_n = Υ_n(U₁, . . . , U_n).

2. Given U₁, . . . , U_n, generate a random sample V*₁, . . . , V*_m from Q_θ_n.

3. Let Ǎ*_n = Ψ_m(V*₁, . . . , V*_m) and compute

S_n = φ(G^Ǎn*), (6.109)

where

G^Ǎn* = n^1/2(A_n − Ǎ*_n). (6.110)

A second parametric bootstrap procedure is necessary to approximate the distribution of S_n. To this end (see [23]), take N large and repeat the following steps for every k ∈ {1, . . . , N}:

1. Given U₁, . . . , U_n and V*₁, . . . , V*_m, generate a random sample U*_{1,k}, . . . , U*_{n,k} from P_θ_n.

2. Compute θ*_{n,k} = T_n(U*_{1,k}, . . . , U*_{n,k}) and let A*_{n,k} = Υ_n(U*_{1,k}, . . . , U*_{n,k}).

3. Given U₁, . . . , U_n, V*₁, . . . , V*_m and U*_{1,k}, . . . , U*_{n,k}, generate a random sample V**_{1,k}, . . . , V**_{m,k} from Q_θ*_{n,k}.

4. Let Ǎ**_{n,k} = Ψ_m(V**_{1,k}, . . . , V**_{m,k}) and compute

S*_{n,k} = φ(G^Ǎn,k**), (6.111)

where

G^Ǎn,k** = n^1/2(A*_{n,k} − Ǎ**_{n,k}). (6.112)
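The two sets of steps above can be sketched as follows. As assumptions made only for illustration, the sketch uses a bivariate Clayton family, inversion of Kendall's tau for T_n, a Cramér-von Mises functional for φ, rank-based pseudo-observations in place of the U_i, and the proportion of S*_{n,k} exceeding S_n as an approximate P-value (a common convention, assumed here). All names, as well as the values of m and N, are hypothetical.

```python
import numpy as np

def pseudo_obs(x):
    """Componentwise ranks divided by n + 1 (no ties assumed)."""
    return (np.argsort(np.argsort(x, axis=0), axis=0) + 1) / (x.shape[0] + 1.0)

def fit_clayton(u):
    """theta = 2 tau / (1 - tau), with tau the sample Kendall's tau (clipped)."""
    sx = np.sign(u[:, None, 0] - u[None, :, 0])
    sy = np.sign(u[:, None, 1] - u[None, :, 1])
    n = u.shape[0]
    tau = min(max(float((sx * sy).sum() / (n * (n - 1))), 1e-3), 0.95)
    return 2.0 * tau / (1.0 - tau)

def sample_clayton(size, theta, rng):
    """Frailty (Marshall-Olkin) sampler, playing the role of Q_theta and P_theta."""
    frailty = rng.gamma(shape=1.0 / theta, scale=1.0, size=(size, 1))
    return (1.0 + rng.exponential(size=(size, 2)) / frailty) ** (-1.0 / theta)

def step_cdf(sample, points):
    """Empirical distribution function of `sample`, evaluated at rows of `points`."""
    return np.all(sample[None, :, :] <= points[:, None, :], axis=2).mean(axis=1)

def two_level_stat(u_hat, v_sim):
    """Sum over i of (A_n(U_hat_i) - A_check(U_hat_i))^2 with a simulated A_check."""
    return float(np.sum((step_cdf(u_hat, u_hat) - step_cdf(v_sim, u_hat)) ** 2))

def two_level_bootstrap(x, m=400, n_boot=100, seed=0):
    rng = np.random.default_rng(seed)
    n = x.shape[0]
    u_hat = pseudo_obs(x)
    theta_n = fit_clayton(u_hat)                       # first list, step 1
    v_star = sample_clayton(m, theta_n, rng)           # first list, step 2
    s_n = two_level_stat(u_hat, v_star)                # first list, step 3
    s_star = np.empty(n_boot)
    for k in range(n_boot):
        u_star = pseudo_obs(sample_clayton(n, theta_n, rng))   # second list, step 1
        theta_star = fit_clayton(u_star)                       # second list, step 2
        v_2star = sample_clayton(m, theta_star, rng)           # second list, step 3
        s_star[k] = two_level_stat(u_star, v_2star)            # second list, step 4
    return s_n, float(np.mean(s_star > s_n))

x = sample_clayton(60, 2.0, np.random.default_rng(2))
s_n, p_value = two_level_bootstrap(x, m=300, n_boot=40)
```

Drawing a fresh sample V** from Q_θ*_{n,k} at every replication matters, because the simulation noise in Ǎ*_n contributes its own term to the limit and must be reproduced in each bootstrap statistic.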

Under the convention that large values of S_n lead to the rejection of H₀, and under regularity

In order to establish conditions for the validity of the preceding two-level parametric bootstrap, Genest and Rémillard [23] first introduced the following notation. Let U*₁, . . . , U*_n and V*₁, . . . , V*_m be independent random samples from P_θ_n and Q_θ_n, respectively.

2. Given U*₁, . . . , U*_n, V*₁, . . . , V*_m and θ*_n = T_n(U*₁, . . . , U*_n), the random vectors V**₁, . . . , V**_m

The following result (see [23]) gives the conditions under which the weak limits of the processes

G^Ǎn* = n^1/2(A_n − Ǎ*_n)

and

G^Ǎn** = n^1/2(A*_n − Ǎ**_n)

are independent and identically distributed, and thereby establishes the validity of the two-level parametric bootstrap.

Theorem 6.16. Assume that P ∈ S(λ), Q ∈ S(ν) and that, as n → ∞,

(𝔸_n, 𝔸̌_n, Θ_n, W_P,n, W_Q,n) ⇝ (𝔸, 𝔸̌, Θ, W_P, W_Q),

where the limit is a centered Gaussian process in D(T, R^s)^⊗2 × (R^p)^⊗3. Let Γ = E[Θ W_P^T] and set a(t) = E[𝔸(t) W_P^T] and ǎ(t) = E[𝔸̌(t) W_Q^T] for every t ∈ T. Then, as n → ∞,

(𝔸_n, 𝔸*_n, 𝔸̌_n, 𝔸̌*_n, 𝔸̌**_n, Θ_n, Θ*_n) ⇝ (𝔸, 𝔸*, 𝔸̌, 𝔸̌*, 𝔸̌**, Θ, Θ*)

in D(T, R^s)^⊗5 × (R^p)^⊗2. In the limit, 𝔸* = 𝔸^⊥ + aΘ, Θ* = Θ^⊥ + ΓΘ, 𝔸̌* = 𝔸̌^⊥ + ǎΘ and 𝔸̌** = 𝔸̌^⊥⊥ + ǎΘ*, where (𝔸^⊥, Θ^⊥) is an independent copy of (𝔸, Θ). In addition, the processes 𝔸̌, 𝔸̌^⊥ and 𝔸̌^⊥⊥ are mutually independent and identically distributed, as well as independent of 𝔸, 𝔸^⊥, Θ and Θ^⊥. Moreover, if (A_n, θ_n) is P-regular for 𝒜 × O and Ǎ_n is Q-regular for 𝒜, then

(G^Ǎn*, G^Ǎn**) ⇝ (𝔸 − 𝔸̌^⊥ − ȦΘ, 𝔸^⊥ − 𝔸̌^⊥⊥ − ȦΘ^⊥) in D(T, R^s)^⊗2 as n → ∞.

In document Aspects of copulas and goodness-of-fit (Page 88-96)