• No results found

Bootstrapping in non-regular smooth function models

N/A
N/A
Protected

Academic year: 2021

Share "Bootstrapping in non-regular smooth function models"

Copied!
16
0
0

Loading.... (view fulltext now)

Full text

(1)

Contents lists available atSciVerse ScienceDirect

Journal of Multivariate Analysis

journal homepage:www.elsevier.com/locate/jmva

Bootstrapping in non-regular smooth function models

Mihai C. Giurcanu

University of Florida, United States

a r t i c l e i n f o Article history:

Received 23 April 2011 Available online 9 May 2012 AMS subject classifications: 62G09

62G15 62G20 Keywords:

Smooth function model Non-regular estimators Standard bootstrap m-out-of-nbootstrap Oracle bootstrap Confidence intervals

a b s t r a c t

We study the large sample behavior of the standard bootstrap, them-out-of-nbootstrap, and the oracle bootstrap (Giurcanu and Presnell, 2009) [14] percentile confidence intervals in non-regular smooth function models. We show that the oracle bootstrap percentile confidence intervals are consistent while the standard bootstrap and the m

-out-of-nbootstrap confidence intervals are inconsistent. Further analysis of coverage probabilities reveals that, for large samples, the iterated oracle bootstrap percentile confidence intervals are more accurate than their non-iterated versions. We also describe the large sample local behavior of the bootstrap confidence intervals for parameter values near the points of inconsistency of the standard bootstrap. In a simulation study, we describe the finite sample local behavior of various bootstrap confidence intervals.

©2012 Elsevier Inc. All rights reserved.

1. Introduction

Babu [1] shows that, in non-regular smooth function models, the standard bootstrap estimator of thedistributionof a smooth function of the sample mean is inconsistent for some values of the mean parameter vector, while Shao [25] shows that them-out-of-nbootstrap estimator is consistent over the entire parameter space. In this paper, we show that both the standard bootstrap and them-out-of-nbootstrap percentileconfidence intervalsare inconsistent for some values of the mean parameter vector, while the test inversion and them-out-of-nbootstrap test inversion confidence regions are consistent over the entire parameter space. Moreover, asymptotic expansions of coverage probabilities reveal that, for large samples, the test inversion confidence regions are more accurate than them-out-of-nbootstrap test inversion confidence regions.

Giurcanu and Presnell [14] propose the oracle bootstrap as an alternative to them-out-of-nbootstrap for consistent estimation of the distributions of many non-regular estimators, including the square of the sample mean, the Hodges and Stein estimators, and sparse estimators such as the LASSO. In this paper, we focus on the consistency and the higher order properties of oracle-bootstrap percentile confidence intervals and their iterated versions in non-regular smooth function models. We show that the oracle-bootstrap percentile confidence intervals are consistent over the entire parameter space, and that iteration increases their accuracy by an order of magnitude for large sample sizes.

It is well known that fixed parameter consistency may not adequately describe the large sample behavior of bootstrap estimators. For example, while the standard parametric bootstrap inconsistently estimates the distribution of both the Hodges and Stein estimators over a subset of the parameter space, it nevertheless performs better than the m -out-of-nbootstrap and a parametric version of the oracle bootstrap near the points of inconsistency [24]. More generally, a pointwise-consistent bootstrap estimator of the distribution of a non-locally asymptotically equivariant estimator is typically not locally uniform convergent at the points of inconsistency of the standard bootstrap [2,21]. In our local asymptotics analysis, we show that if the local parameter is close to a point of inconsistency, then the oracle bootstrap percentile confidence intervals are consistent, and that both the standard bootstrap and them-out-of-nbootstrap percentile

E-mail address:[email protected].

0047-259X/$ – see front matter©2012 Elsevier Inc. All rights reserved.

(2)

confidence intervals are inconsistent. Furthermore, if the local parameter is moderately close to a point of inconsistency, then all bootstrap confidence intervals are inconsistent. Finally, if the local parameter is moderately far from a point of inconsistency, then all bootstrap confidence intervals are consistent.

We conclude this section with an outline. In Section2, we present the asymptotic behavior of the standard bootstrap percentile confidence intervals in non-regular models. In Section3, we present the large sample results for them

-out-of-nbootstrap confidence intervals and for them-out-of-nbootstrap test inversion confidence regions. In Section4, we present the asymptotic behavior of the oracle bootstrap percentile confidence intervals and their iterated versions and in Section5, we study the large sample local behavior of the bootstrap confidence intervals. In Section6, we provide some computational details and the results of an empirical study which describe the finite sample local behavior of the bootstrap confidence intervals. The proofs of the theoretical results can be found in theAppendix.

2. Standard bootstrap estimation

Consider thesmooth function model(see, e.g., [4,15]) given by a random sampleX1:n

=

X1

, . . . ,

Xnof i.i.d. observations

from a distributionPonRdsuch thatE

(

X

)

=

µ

and var

(

X

)

=

Σ, whereX

PandΣis positive definite. Let

θ

=

f

(µ)

be the parameter of interest, wheref

:

Rd

Ris asmoothfunction, and let

θ

ˆ

n

=

f

(

X

¯

n

)

be itsnaturalestimator, where

¯

Xnis the sample mean. The smooth function model is calledregular if

f

(µ)

̸=

0 for all

µ

Rdandnon-regularif there exists a

µ

Rdsuch that

f

(µ)

=

0, where

f

(µ)

is the gradient off

(µ)

. LetΘ

=

f

(µ)

:

µ

Rd

,

2f

(µ)

̸=

0

, S

=

µ

Rd

: ∇

f

(µ)

=

0

, andΘ0

=

f

(µ)

:

µ

S

be the image set ofSunderf, where

2f

(µ)

is the Hessian matrix off

(µ)

. By a two-term Taylor expansion off

(

X

¯

n

)

about

µ

, we have

f

(

X

¯

n

)

=

f

(µ)

+

γ

T

(

X

¯

n

µ)

+

(

X

¯

n

µ)

TΓ

(

X

¯

n

µ)

+

oP

(

n

−1

),

where

γ

= ∇

f

(µ)

andΓ

=

(

1

/

2

)

2f

(µ)

. If

θ

Θ

\

Θ

0, thenn1/2

(

θ

ˆ

n

θ)

N

(

0

, γ

TΣ

γ )

, where denotes convergence

in distribution. If

θ

Θ0, thenn

(

θ

ˆ

n

θ)

d

j=1

λ

jUj2, where

λ

1

, . . . , λ

dare the eigenvalues ofΣ1/2Γ Σ1/2andU1

, . . . ,

Ud

are i.i.d.

N

(

0

,

1

)

. LetF

(

x

)

=

Pr

TZ

x

)

andG

(

x

)

=

Pr

(

ZTΓZ

x

)

, whereZ

N

(

0

,

Σ

)

.

The standard bootstrap confidence intervals were introduced by Efron [13] and their higher order properties were characterized by Hall [16]. LetX

1:n

=

X

1

, . . . ,

X

n be a standard bootstrap resample, i.e.,X

1:nis a with replacement random

sample of sizenfromX1:n. Let

θ

ˆ

n

=

f

(

X

¯

n

)

be the bootstrap version of

θ

ˆ

n, and letL

ˆ

θˆ∗

n

(

x

)

=

Pr

ˆ

θ

n

− ˆ

θ

n

x

|

X1:n

,F

ˆ

θˆ∗ n

(

x

)

=

ˆ

Lθˆ∗ n

(

x

/

n 1/2

)

, andG

ˆ

ˆ θ∗

n

(

x

)

= ˆ

Lθˆn

(

x

/

n

)

be the bootstrap estimators ofLθˆn

(

x

)

=

Pr

(

ˆ

θ

n

θ

x

)

,Fθˆn

(

x

)

=

Pr

n1/2

(

θ

ˆ

n

θ)

x

, and Gθˆn

(

x

)

=

Pr

n

(

θ

ˆ

n

θ)

x

, respectively. Note that if

θ

Θ

\

Θ0andLθˆn

(

x

)

is continuous atLθˆ1

n

(

1

α)

, whereLˆ1 θn

(

1

α)

is the

(

1

α)

quantile ofLθˆn

(

x

)

, an exact

α

level upper confidence limit for

θ

is

θ

ˆ

n

L

−1 ˆ θn

(

1

α)

. Let

ˆ

Lθˆ∗1 n

(

1

α)

be the

(

1

α)

quantile of

ˆ

Lθˆ∗

n

(

x

)

, then the

α

level bootstrap percentile upper confidence interval for

θ

is

ˆ

In

(α)

=

−∞

,

θ

ˆ

n

− ˆ

Lˆ1 θ∗ n

(

1

α)

. Similarly, the bootstrap percentile lower, equal-tailed, and symmetric confidence intervals for

θ

areI

ˆ

L

n

(α)

=

ˆ

θ

n

− ˆ

Lθˆ∗1 n

(α),

,

ˆ

InET

(α)

=

ˆ

θ

n

−ˆ

L −1 ˆ θ∗ n

((

1

+

α)/

2

),

θ

ˆ

n

−ˆ

L −1 ˆ θ∗ n

((

1

α)/

2

)

, andI

ˆ

S n

(α)

=

ˆ

θ

n

−ˆ

L1 ˆ θ∗ n

(α),

ˆ

θ

n

L1 ˆ θ∗ n

(α)

, where

ˆ

L′ ˆ θ∗ n

(

x

)

=

Pr

| ˆ

θ

n

− ˆ

θ

n

| ≤

x

|

X1:n

(see, e.g., [15]).

Recall that an

α

level confidence intervalIn

(α)

is consistent if its asymptotic coverage probability equals the nominal

level, i.e., limn→∞Pr

In

(α))

=

α

, and is inconsistent otherwise (see, e.g., [26, p. 329]). The following theorem shows

thatI

ˆ

n

(α)

is consistent for

θ

Θ

\

Θ0and is inconsistent for

θ

Θ0. Part (ii) of this theorem gives the asymptotic coverage probability ofI

ˆ

n

(α)

for

θ

Θ0. Similar results can be readily derived for the other types of confidence intervals.

Theorem 1.Suppose f is two times continuously differentiable and E

(

X

2

) <

.

(i) If

θ

Θ

\

Θ0, thenlimn→∞Pr

∈ ˆ

In

(α))

=

α

.

(ii)If

θ

Θ0, thenlimn→∞Pr

∈ ˆ

In

(α))

=

1

Pr

ZTΓZ

FN(10,Σ)

(

1

α

;

Z

) <

0

, where FN(10,Σ)

(

1

α

;

y

)

is the

(

1

α)

quantile of the distribution of 2yTΓZ

+

ZTΓZ .

3. Them-out-of-nbootstrap estimation

Them-out-of-nbootstrap estimation was introduced by Bickel and Freedman [5] and its large sample and higher order properties were analyzed by Bickel et al. [6]. Cheung et al. describe them-out-of-nbootstrap confidence intervals for

θ

under the constraint that

f

(µ)

=

0. However, solving

f

(µ)

=

0 for

µ

, one may be able to determine the parameter

θ

, and thus, the confidence interval estimation may not be suitable under such constraints on the parameters.

LetX

1:m

=

X

1

, . . . ,

X

mbe anm-out-of-nbootstrap resample, i.e.,X

1

, . . . ,

X

mis a with replacement random sample

of sizemfromX1:n, withm

=

o

(

n

)

and m

→ ∞

. Let

θ

ˆ

m

=

f

(

X

¯

m

)

be them-out-of-n bootstrap version of

θ

ˆ

n, and

letL

ˆ

θˆ∗ m

(

x

)

=

Pr

ˆ

θ

m

− ˆ

θ

n

x

|

X1:n

,F

ˆ

θˆ∗ m

(

x

)

= ˆ

Lθˆm

(

x

/

m 1/2

)

, andG

ˆ

ˆ θ∗

(3)

percentile upper confidence interval for

θ

isI

ˆ

m

(α)

=

−∞

,

θ

ˆ

n

(

m

/

n

)

1/2L

ˆ

θˆ∗1

m

(

1

α)

. Similarly, them-out-of-nbootstrap percentile lower, equal-tailed, and symmetric confidence intervals for

θ

areI

ˆ

L

m

(α)

=

ˆ

θ

n

(

m

/

n

)

1/2

ˆ

Lθˆ∗1 m

(α),

,I

ˆ

ET m

(α)

=

ˆ

θ

n

(

m

/

n

)

1/2L

ˆ

θˆ∗1 m

((

1

+

α)/

2

),

θ

ˆ

n

(

m

/

n

)

1/2L

ˆ

θˆ∗1 m

((

1

α)/

2

)

, andI

ˆ

S m

(α)

=

ˆ

θ

n

(

m

/

n

)

1/2

ˆ

L1 ˆ θ∗ m

(α),

ˆ

θ

n

+

(

m

/

n

)

1/2

ˆ

L1 ˆ θ∗ m

(α)

, whereL

ˆ

′ ˆ θ∗ m

(

x

)

=

Pr

| ˆ

θ

m

− ˆ

θ

n

| ≤

x

|

X1:n

. The following theorem shows that them-out-of-nbootstrap percentile upper confidence interval is consistent for

θ

Θ

\

Θ0and is inconsistent for

θ

Θ0.

Theorem 2. Suppose f is two times continuously differentiable, E

(

X

2

) <

,

λ

j

0for all j

=

1

, . . . ,

d, m

=

o

(

n

)

, and

m

→ ∞

. Thenlimn→∞Pr

∈ ˆ

Im

(α))

=

α

if

θ

Θ

\

Θ0andlimn→∞Pr

∈ ˆ

Im

(α))

=

0if

θ

Θ0.

Bootstrap confidence regions can be derived by inverting bootstrap test statistics in analogy with the classical theory (see, e.g., [9, Section 9.2.1]). Consider testingH0

:

θ

=

θ

0versusHa

:

θ > θ

0using the test statisticTn

(

X1:n

, θ

0

)

=

n1/2

(

θ

ˆ

n

θ

0

)

if

θ

0

Θ

\

Θ

0andTn

(

X1:n

, θ

0

)

=

n

(

θ

ˆ

n

θ

0

)

if

θ

0

Θ0. UnderH0,Tn

(

X1:n

, θ

0

)

γ

TZif

θ

0

Θ

\

Θ0andTn

(

X1:n

, θ

0

)

ZTΓZ

if

θ

0

Θ0. An

α

level test inversion confidence region for

θ

is defined as In

(α)

=

θ

Θ

\

Θ0

: ˆ

Fn−1

(

1

α)

n1/2

(

θ

ˆ

n

θ)

θ

Θ0

: ˆ

Gn1

(

1

α)

n

(

θ

ˆ

n

θ)

,

whereF

ˆ

n

(

x

)

=

Pr

(

γ

ˆ

nTΣ

ˆ

1/2 n U

x

|

X1:n

)

andG

ˆ

n

(

x

)

=

Pr

(

UTΣ

ˆ

1/2 n Γ

ˆ

nΣ

ˆ

1/2

n U

x

|

X1:n

)

are conditional distribution functions

givenX1:n,U

N

(

0

,

I

)

is independent ofX1:n,

γ

ˆ

n

= ∇

f

(

X

¯

n

)

ˆ

n

=

(

1

/

2

)

2f

(

X

¯

n

)

, andΣ

ˆ

n

=

n−1

ni=1

(

Xi

− ¯

Xn

)(

Xi

− ¯

Xn

)

T.

The correspondingm-out-of-nbootstrap test inversion confidence region for

θ

is

ˆ

Jm

(α)

=

θ

Θ

\

Θ0

: ˆ

Fˆ1 θ∗ m

(

1

α)

n1/2

(

θ

ˆ

n

θ)

θ

Θ0

: ˆ

Gˆ1 θ∗ m

(

1

α)

n

(

θ

ˆ

n

θ)

.

The following corollary shows that the test inversion confidence regions and the m-out-of-n bootstrap test inversion confidence regions are consistent over the entire parameter space.

Corollary 1. Suppose f is two times continuously differentiable, E

(

X

2

) <

, m

=

o

(

n

)

, and m

→ ∞

. Then, for all

θ

Θ, limn→∞Pr

In

(α))

=

α

andlimn→∞Pr

∈ ˆ

Jm

(α))

=

α

.

SupposeE

X

5

<

and that the characteristic function ofXsatisfies the Cramer condition lim sup∥t∥→∞

|

Eexp

(

itTX

)

|

<

1, and thus, the Edgeworth expansion of the distribution functionFZn

(

x

)

ofZn

=

(

Z

(1)

n

, . . . ,

Zn(d)

)

T

=

n1/2

(

X

¯

n

µ)

holds (see, e.g., [4, Theorem 2, p. 436]):

FZn

(

x

)

=

I

(

z

x

)

1

+

n−1/2p1

(

z

)

+

n−1p2

(

z

)

φ

Σ

(

z

)

dz

+

O

(

n−3/2

),

(1)

where

φ

Σ

(

z

)

is the density function of N

(

0

,

Σ

)

,pi

(

z

)

are odd/even polynomials inz for odd/eveni

=

1

,

2 (see, e.g.,

[15, pp. 162–167]), and I

(

A

)

is the indicator function of an event A. Let H

(

x

)

=

E

p1

(

Z

)

I

TZ

x

)

and J

(

x

)

=

E

ZTΓZI

TZ

x

)

. Furthermore, letf(i1···ik)

=

k

/∂µ

i1

· · ·

∂µ

ik

f

(µ)

fork

2,Ξi

=

(

1

/

3

!

)

f(ijk)

:

j

,

k

=

1

, . . . ,

d

Rd×d

=

1

, . . . ,

Ξd

)

Rd×d 2 ,Π

=

1

, . . . ,

Πd

)

Rd×d3,Πi

=

i1

, . . . ,

Πid

)

Rd×d 2 ,Πij

=

(

1

/

4

!

)

f(ijkl)

:

k

,

l

=

1

, . . . ,

d

Rd×d,K

(

x

)

=

E

p2

(

Z

)

I

(

ZTΓZ

x

)

,M

(

x

)

=

E

p1

(

Z

)

ZTΞ(Z

Id

)

ZI

(

ZTΓZ

x

)

,N

(

x

)

=

E

ZTΠ(Z

Id2

)(

Z

Id

)

ZI

(

ZTΓZ

x

)

,P

(

x

)

=

(

1

/

2

)

E

(

ZTΞ(Z

I d

)

Z

)

2I

(

ZTΓZ

x

)

, where

denotes the Kronecker product. The following lemma provides the Edgeworth and the Cornish–Fisher expansions ofFθˆn

(

x

)

,F

−1 ˆ θn

(α)

,Gθˆn

(

x

)

, andG −1 ˆ θn

(α)

, respectively. Similar expansions have been obtained by Cheung et al. [10] for

θ

Θ0.

Lemma 1. Assume that f is five times continuously differentiable, E

X

5

<

, andlim sup∥t∥→∞

|

Eexp

(

itTX

)

|

<

1.

(i) If

θ

Θ

\

Θ0, then Fθˆn

(

x

)

=

F

(

x

)

+

n−1/2

H

(

x

)

J

(

x

)

+

O

(

n−1

)

and Fˆ−1 θn

(α)

=

F−1

(α)

n−1/2

F

(

F−1

(α))

−1

H

(

F−1

(α))

J

(

F−1

(α))

+

O

(

n−1

)

; (ii) If

θ

Θ0, then Gθˆn

(

x

)

=

G

(

x

)

+

n−1

K

(

x

)

M

(

x

)

N

(

x

)

+

P′′

(

x

)

+

O

(

n−3/2

)

and G−1 ˆ θn

(α)

=

G−1

(α)

n−1

G

(

G−1

(α))

−1

K

(

G−1

(α))

M

(

G−1

(α))

N

(

G−1

(α))

+

P′′

(

G−1

(α))

+

O

(

n−3/2

)

. LetZm

=

(

Zm( 1)

, . . . ,

Zm( d)

)

T

=

m1/2

(

X

¯

m

− ¯

Xn

)

, then the Edgeworth expansion ofF

ˆ

Z

m

(

x

)

, the conditional distribution function ofZm∗givenX1:n, holds:

ˆ

FZm

(

x

)

=

I

(

z

x

)

1

+

m−1/2p

ˆ

1n

(

z

)

+

m−1p

ˆ

2n

(

z

)

φ

Σˆn

(

z

)

dz

+

OP

(

m−3/2

),

(2)

where

φ

Σˆn

(

z

)

is the conditional density function ofN

(

0

,

Σ

ˆ

n

)

givenX1:n,p

ˆ

in

(

z

)

are odd/even polynomials inzfor odd/even

i

=

1

,

2, the sample versions of p1

(

z

)

andp2

(

z

)

defined by(1). LetH

ˆ

n

(

x

)

=

E

ˆ

p1n

(

Z

)

I

(

γ

ˆ

nTZ

x

)

|

X1:n

,

ˆ

Jn

(

x

)

=

E

ZTΓ

ˆ

nZI

(

γ

ˆ

nTZ

x

)

|

X1:n

, andQ

ˆ

n

(

x

)

=

E

ˆ

p1n

(

Z

)

n1/2

γ

ˆ

nTZI

(

ZTΓ

ˆ

nZ

x

)

|

X1:n

(4)

lemma provides the empirical Edgeworth and the empirical Cornish–Fisher expansions of the m-out-of-n bootstrap estimatorsF

ˆ

θˆ∗ m

(

x

)

,

ˆ

Fˆ1 θ∗ m

(α)

,G

ˆ

θˆ∗ m

(

x

)

, and

ˆ

Gˆ1 θ∗ m

(α)

, respectively.

Lemma 2. Suppose f is five times continuously differentiable, m

=

o

(

n

)

, m

→ ∞

, E

X

5

<

, andlim sup∥t∥→∞

|

Eexp

(

itTX

)

|

<

1. (i) If

θ

Θ

\

Θ0, thenF

ˆ

θˆ∗ m

(

x

)

=

ˆ

Fn

(

x

)

+

m−1/2

ˆ

Hn

(

x

)

− ˆ

Jn

(

x

)

+

OP

(

m−1

)

and F

ˆ

θ−ˆ∗1 m

(α)

=

F

ˆ

−1 n

(α)

m −1/2

ˆ

Fn

(

F

ˆ

−1 n

(α))

−1

ˆ

Hn

(

F

ˆ

n−1

(α))

− ˆ

Jn

(

F

ˆ

−1 n

(α))

+

OP

(

m−1

)

. (ii)If

θ

Θ0, then G

ˆ

θˆ∗ m

(

x

)

=

ˆ

Gn

(

x

)

n−1/2Q

ˆ

n

(

x

)

+

OP

m−1

+

mn−1

and G

ˆ

ˆ1 θ∗ m

(α)

=

G

ˆ

−1 n

(α)

+

n −1/2

ˆ

Gn

(

G

ˆ

−1 n

(α))

−1

ˆ

Qn

(

G

ˆ

−1 n

(α))

+

OP

m−1

+

mn−1

.

Next theorem shows that the test inversion confidence regions are more accurate than them-out-of-nbootstrap test inversion confidence regions.

Theorem 3.Suppose f is five times continuously differentiable, m

=

o

(

n

)

, m

→ ∞

, E

X

5

<

, andlim sup∥t∥→∞

|

Eexp

(

itTX

)

|

<

1. (i) If

θ

Θ

\

Θ0, thenPr

θ

In

(α)

=

α

+

O

(

n−1/2

)

andPr

θ

∈ ˆ

Jm

(α)

=

α

+

O

(

m−1/2

)

. (ii)If

θ

Θ0, thenPr

θ

In

(α)

=

α

+

O

(

n−1

)

andPr

θ

∈ ˆ

Jm

(α)

=

α

+

O

m−1

+

mn−1

.

4. Oracle bootstrap estimation

The oracle bootstrap is a particular type of the empirical likelihood bootstrap [8] and of the intentionally-biased bootstrap [18] that adapts a parametric bootstrap proposed by Putter and van Zwet [22] to nonparametric setting. First, the empirical distribution is embedded in a parametric family of weighted empirical distributions of the sample and then Putter and van Zwet’s parametric bootstrap procedure is applied to this family. To construct the parametric family

{ ˆ

Pυ

:

υ

Rd

}

associated with the smooth function model, we choose a vector of weights

(w

1

, . . . , w

n

)

to minimize

n−1

n

i=1log

(

n

w

i

)

,

the Kullback–Leibler divergence of

(w

1

, . . . , w

n

)

to the vector of uniform weights

(

n−1

, . . . ,

n−1

)

, subject to the constraints

n

i=1

w

iXi

=

υ

,

n

i=1

w

i

=

1, and

w

i

0 for alli

=

1

, . . . ,

n. The solution of this constraint optimization problem coincides

with the empirical likelihood weights for testing the null hypothesisH0

:

µ

=

υ

[20]. If

υ

is not in the convex hull ofX1:n,

then we arbitrarily takeP

ˆ

υ

= ˆ

Pn, whereP

ˆ

nis the empirical distribution of the sample.

Let

µ

˜

nbe a sequence ofn1/2-consistent estimators of

µ

such that Pr

˜

µ

n

S

1 whenever

µ

S, i.e.,

µ

˜

nhas the ‘‘oracle property’’ for the setS. A typical oracle bootstrap resampleX1Ď:n

=

X1Ď

, . . . ,

XnĎ is a sequence ofnconditionally

i.i.d. random draws fromP

ˆ

µ˜

n and

µ

˜

nplays the role of

µ

in the oracle bootstrap world. Let

θ

ˆ

Ď

n

=

f

(

X

¯

nĎ

)

and

θ

˜

n

=

f

(

µ

˜

n

)

be the oracle bootstrap versions of

θ

ˆ

nand

θ

, respectively, whereX

¯

nĎis the sample mean of the oracle bootstrap resample.

Then,

˜

LθˆnĎ

(

x

)

=

Pr

ˆ

θ

Ď

n

− ˜

θ

n

x

|

X1:n

,F

˜

θˆnĎ

(

x

)

= ˜

LθˆnĎ

(

x

/

n1/2

)

, andG

˜

θˆnĎ

(

x

)

= ˜

LθˆnĎ

(

x

/

n

)

are the oracle bootstrap versions

ofLθˆn

(

x

)

,Fθˆn

(

x

)

, andGθˆn

(

x

)

, respectively. Thus, the

α

level oracle bootstrap percentile upper confidence interval for

θ

is

˜

In

(α)

=

−∞

,

θ

ˆ

n

− ˜

Lˆ1 θĎ n

(

1

α)

. Similarly, the oracle bootstrap percentile lower, equal-tailed, and symmetric confidence intervals for

θ

are defined asI

˜

nL

(α)

=

ˆ

θ

n

− ˜

L −1 ˆ θĎ n

(α),

,I

˜

ETn

(α)

=

ˆ

θ

n

− ˜

L −1 ˆ θĎ n

((

1

+

α)/

2

),

θ

ˆ

n

− ˜

L −1 ˆ θĎ n

((

1

α)/

2

)

, and

˜

InS

(α)

=

ˆ

θ

n

− ˜

L1 ˆ θĎ n

(α),

ˆ

θ

n

+ ˜

L1 ˆ θĎ n

(α)

, respectively, where

˜

Lˆ θĎ n

(

x

)

=

Pr

| ˆ

θ

nĎ

− ˜

θ

n

| ≤

x

|

X1:n

.

Note that there are many estimators

µ

˜

n which have the oracle property for the setS. For example, ifS

= {

µ

0

}

is a singleton, let

µ

˜

n

=

µ

0

+

(

X

¯

n

µ

0

)

I

(

|∇

f

(

X

¯

n

)

| ≥

an

)

, where ‘‘

’’ is the component-wise vector to vector multiplication

(Hadamard product),I

(

|∇

f

(

X

¯

n

)

| ≥

an

)

=

I

(

|∇

f

(

X

¯

n

)

(j)

| ≥

an

)

:

j

=

1

, . . . ,

d

,an

0 andn1/2an

→ ∞

, e.g.,an

=

n−1/2log

(

log

(

n

))

. Another option is to use a LASSO-type estimator which has the oracle property for the setS, such as an adaptive LASSO-type estimator (see, e.g., [28]):

˜

µ

n

=

argmin b

n

i=1

(

Xi

b

)

T

(

Xi

b

)

+

λ

n d

j=1

| ¯

Xn(j)

µ

(0j)

|

−1

|

b(j)

µ

(0j)

|

,

wheren−1/2

λ

n

0 and

λ

n

→ ∞

. If the non-empty setSis not a singleton, then for all

ν

S, let

µ

˜

n

=

ν

+

(

X

¯

n

ν)

I

|∇

f

(

X

¯

n

)

| ≥

an

. In the case of the adaptive LASSO estimator, let

˜

µ

n

=

argmin b

n

i=1

(

Xi

b

)

T

(

Xi

b

)

+

λ

n d

j=1

| ¯

Xn(j)

ν

(j)

|

−1

|

b(j)

ν

(j)

|

.

Then, set

µ

˜

n

= ˜

µ

n,νˆn, where

ν

ˆ

n

=

argminν∈S

∥ ˜

µ

n

− ¯

Xn

2and

∥ · ∥

is the Euclidean norm. Next theorem shows that the

(5)

Theorem 4. Assume that f is two times continuously differentiable and E

(

X

2

) <

. Thenlimn→∞Pr

∈ ˜

In

(α))

=

α

for all

θ

Θ. LetAn

= ∩

dj=1

|∇

f

(

X

¯

n

)

(j)

| ≥

nr

, wherer

(

0

,

1

/

2

)

. Next lemma shows that ifan

=

nr and

f

(µ)

(j)

̸=

0 for

allj

=

1

, . . . ,

d, then there exists a constantc

>

0 such that Pr

(

An

)

=

1

O

exp

(

cn1−2r

)

. Since

µ

˜

n

= ¯

XnonAn, it

follows that the oracle bootstrap and the standard bootstrap are equivalent except on an event of probability exponentially small. If

θ

Θ0, next lemma shows that there exists a constantc

>

0 such that Pr

(

Bn

)

=

1

O

exp

(

cn1−2r

)

, where Bn

= {∥∇

f

(

X

¯

n

)

nr

}

and

∥ · ∥

∞ is the infinity norm. Therefore,

f

(

µ

˜

n

)

=

0 except on an event of probability

exponentially small.

Lemma 3. Suppose f is twice continuously differentiable, E

X

2

<

, andlogE

exp

(

tTX

)

<

in a neighborhood of

0

Rd.

1. If

f

(µ)

(j)

̸=

0for all j

=

1

, . . . ,

d, there exists c

>

0such thatPr

(

A

n

)

=

1

O

exp

(

cn1−2r

)

.

2. If

θ

Θ0, there exists a constant c

>

0such thatPr

(B

n

)

=

1

O

exp

(

cn1−2r

)

.

Suppose

θ

Θ

\

Θ0. Similarly to them-out-of-nbootstrap, under the conditions ofLemma 4, the following empirical Edgeworth expansion ofF

˜

ZĎ n

(

x

)

holds:

˜

FZĎ n

(

x

)

=

I

(

z

x

)

1

+

n−1/2p

˜

1n

(

z

)

+

n−1p

˜

2n

(

z

)

φ

Σ˜n

(

z

)

dz

+

OP

(

n−3/2

),

(3)

whereZnĎ

=

n1/2

(

X

¯

nĎ

− ˜

µ

n

)

, and givenX1:n,

φ

Σ˜n

(

z

)

is the conditional density ofZĎ

N

(

0

,

Σ

˜

n

)

˜

n

=

ni=1

w

˜

i

Xi

− ˜

µ

n

)(

Xi

˜

µ

n

)

T,

w

˜

i,i

=

1

, . . . ,

n, are the oracle bootstrap weights corresponding toP

ˆ

µ˜n,p

˜

in

(

z

)

are odd/even polynomials inzfor odd/eveni

=

1

,

2, the oracle bootstrap versions ofp1

(

z

)

andp2

(

z

)

defined by(1). Let

γ

˜

n

= ∇

f

(

µ

˜

n

)

˜

n

=

(

1

/

2

)

2f

(

µ

˜

n

)

Rd×d

˜

n

=

(

Ξ

˜

1n

, . . . ,

Ξ

˜

dn

)

Rd×d 2 ,Ξ

˜

in

=

(

1

/

3

!

)

f

˜

n(ijk)

:

j

,

k

=

1

, . . . ,

d

Rd×d

˜

n

=

(

Π

˜

1n

, . . . ,

Π

˜

dn

)

Rd×d 3 ,

˜

Πin

=

(

Π

˜

i1n

, . . . ,

Π

˜

idn

)

Rd×d 2 ,Π

˜

ijn

=

(

1

/

4

!

)

f

˜

n(ijkl)

:

k

,

l

=

1

, . . . ,

d

Rd×d, andf

˜

(i1···ik) n

=

f(i1···ik)

(

µ

˜

n

)

, wherek

2. Let furtherF

˜

n

(

x

)

=

Pr

(

γ

˜

nTZĎ

x

|

X1:n

)

,H

˜

n

(

x

)

=

E

(

p

˜

1n

(

ZĎ

)

I

(

γ

˜

nTZĎ

x

)

|

X1:n

)

,

˜

Jn

(

x

)

=

E

(

ZĎTΓ

˜

nZĎI

(

γ

˜

nTZĎ

x

)

|

X1:n

)

,G

˜

n

(

x

)

=

Pr

ZĎTΓ

˜

nZĎ

x

|

X 1:n

,K

˜

n

(

x

)

=

E

˜

p2n

(

ZĎ

)

I

(

ZĎTΓ

˜

nZĎ

x

)

|

X1:n

,M

˜

n

(

x

)

=

E

˜

p1n

(

ZĎ

)

ZĎTΞ

˜

n

(

ZĎ

Id

)

ZĎI

(

ZĎTΓ

˜

nZĎ

x

)

|

X1:n

,

˜

Nn

(

x

)

=

E

ZĎTΠ

˜

n

(

Z

˜

Ď

I d2

)(

ZĎ

Id

)

ZĎI

(

ZĎTΓ

˜

nZĎ

x

)

|

X1:n

, andP

˜

n

(

x

)

=

(

1

/

2

)

E

(

ZĎTΞ

˜

n

(

ZĎ

I d

)

ZĎ

)

2I

(

ZĎTΓ

˜

nZĎ

x

)

|

X1:n

. Next lemma gives the empirical Edgeworth and the empirical Cornish–Fisher expansions ofF

ˆ

θˆĎ

n

(

x

)

,

ˆ

F−1 ˆ θnĎ

(α)

,G

ˆ

θˆĎ n

(

x

)

, and

ˆ

Gˆ1 θĎ n

(α)

, respectively.

Lemma 4. Assume that f is five times continuously differentiable, E

X

5

<

, andlim sup∥t∥→∞

|

Eexp

(

itTX

)

|

<

1.

(i) If

θ

Θ

\

Θ0, then F

˜

θˆnĎ

(

x

)

=

F

˜

n

(

x

)

+

n−1/2

˜

Hn

(

x

)

− ˜

Jn

(

x

)

+

OP

(

n−1

)

and F

˜

θˆ−Ď1 n

(α)

=

F

˜

n−1

(α)

n−1/2

˜

Fn

(

F

˜

−1 n

(α))

−1

˜

Hn

(

F

˜

n−1

(α))

− ˜

Jn

(

F

˜

−1 n

(α))

+

OP

(

n−1

)

; (ii) If

θ

Θ0, then G

˜

θˆnĎ

(

x

)

= ˜

Gn

(

x

)

+

n−1

˜

Kn

(

x

)

− ˜

Mn

(

x

)

− ˜

Nn

(

x

)

+ ˜

P ′′ n

(

x

)

+

OP

(

n−3/2

)

andG

˜

θˆĎ1 n

(α)

= ˜

Gn1

(α)

n−1

˜

Gn

(

G

˜

−1 n

(α))

−1

˜

Kn

(

G

˜

n1

(α))

− ˜

Mn

(

G

˜

−1 n

(α))

− ˜

Nn

(

G

˜

−1 n

(α))

+ ˜

P ′′ n

(

G

˜

−1 n

(α))

+

OP

(

n−3/2

)

.

Bootstrap iteration is known to further reduce the size distortion of a bootstrap test, to further reduce the bias of an estimator, and to further improve the coverage probability of a bootstrap confidence interval [3,17,10]. Bootstrap iteration is usually used to estimate a parameter of the distribution of a statistical quantity defined in terms of a bootstrap estimator. For example, to obtain a (coverage calibrated) iterated bootstrap confidence interval, one first estimates the coverage probability of the bootstrap confidence interval using the iterated bootstrap, and then, the nominal level of the bootstrap confidence interval is re-adjusted based on this estimate. Specifically, letX1Ď:n

=

X1Ď

, . . . ,

XnĎbe an oracle bootstrap resample, and let

ˆ

θ

n

− ˜

L −1 ˆ θĎ n

(

1

α)

be the

α

level oracle bootstrap percentile upper confidence limit, and let

π

n

(α)

=

Pr

θ

≤ ˆ

θ

n

− ˜

Lˆ1 θĎ

n

(

1

α)

be the coverage probability of the confidence interval. A typical iterated oracle bootstrap corresponding toX1Ď:nis a collection

X1ĎĎ:n

=

X1ĎĎ

, . . . ,

XnĎĎofnconditionally i.i.d. draws fromP

ˆ

µ˜Ďn. Let

ˆ

θ

ĎĎ

n be the iterated oracle bootstrap version of

θ

ˆ

n. The iterated

oracle bootstrap estimate of

π

n

(α)

is

π

˜

n

(α)

=

Pr

˜

θ

n

≤ ˆ

θ

nĎ

− ˜

LĎ −1 ˆ θnĎĎ

(

1

α)

|

X1:n

, where

˜

LĎ ˆ θnĎĎ

(

x

)

=

Pr

ˆ

θ

ĎĎ n

− ˜

θ

nĎ

x

|

X1Ď:n

. Thus, the

α

level coverage calibrated confidence interval isI

˜

Ďn

(α)

=

−∞

,

θ

ˆ

n

− ˜

L−1 ˆ θnĎ

(

1

− ˜

π

−1 n

(α))

. Similarly, the iterated standard bootstrap and them-out-of-nbootstrap confidence intervals areI

ˆ

n

(α)

=

−∞

,

θ

ˆ

n

− ˆ

Lˆ1 θ∗ n

(

1

− ˆ

π

n−1

(α))

and

ˆ

I∗ m

(α)

=

−∞

,

θ

ˆ

n

(

m

/

n

)

1/2

ˆ

L−1 ˆ θ∗ m

(

1

− ˆ

π

−1 m

(α))

, respectively, where

π

ˆ

n

(α)

=

Pr

θ

ˆ

n

θ

ˆ

n

− ˆ

L ∗−1 ˆ θ∗∗ n

(

1

α)

|

X1:n

,

ˆ

π

m

(α)

=

Pr

ˆ

θ

n

≤ ˆ

θ

m

(

l

/

m

)

1/2

ˆ

L ∗−1 ˆ θ∗∗ l

(

1

α)

|

X1:n

(6)

Next theorem shows that iterated oracle bootstrap percentile confidence intervals are more accurate than their non-iterated versions by an order of magnitude for large sample sizes.

Theorem 5.Assume that f is five times continuously differentiable, E

X

5

<

, andlim sup∥t∥→∞

|

exp

(

itTX

)

|

<

1.

(i) If

θ

Θ

\

Θ0, thenPr

θ

∈ ˜

In

(α)

=

α

+

O

(

n−1/2

)

andPr

θ

∈ ˜

n

(α)

=

α

+

O

(

n−1

)

. (ii)If

θ

Θ0, thenPr

θ

∈ ˜

In

(α)

=

α

+

O

(

n−1

)

andPr

θ

∈ ˜

n

(α)

=

α

+

O

(

n−3/2

)

.

In practice, the bootstrap confidence bounds do not have closed form expressions, and thus, we usually approximate them by Monte-Carlo methods. Specifically, letX1Ď:1n

, . . . ,

X1Ď:BnbeBoracle bootstrap resamples, and let

˜

n,B

(

1

α)

denote

the

(

1

α)

Bth order statistic of

(

θ

ˆ

nĎb

− ˜

θ

n

)

, for b

=

1

, . . . ,

B, where

θ

ˆ

nĎb is the bootstrap version of

θ

ˆ

n onX1Ď:bn. For

eachb

=

1

, . . . ,

B, letX1ĎĎ:nb1

, . . . ,

XĎĎ bC

1:n beC iterated oracle bootstrap re-resamples and let

˜

nĎ,b,C

(

1

α)

denote the

(

1

α)

Cth order statistic of

(

θ

ˆ

nĎĎbc

− ˜

θ

nĎb

)

, forc

=

1

, . . . ,

C, where

θ

ˆ

nĎĎbc is the bootstrap version of

θ

ˆ

nonX1ĎĎ:nbc. Since

˜

LĎˆ−1 θĎĎ n

(

1

α)

(

θ

ˆ

nĎ

− ˜

θ

n

)

⇐⇒

(

1

α)

≤ ˜

LĎˆ θĎĎ n

(

ˆ

θ

Ď n

− ˜

θ

n

)

,

π

˜

n

(α)

=

1

Pr

˜

LĎˆ θĎĎ n

(

ˆ

θ

Ď n

− ˜

θ

n

) < (

1

α)

|

X1:n

. The Monte-Carlo approximation of

π

˜

n

(α)

is (see also, e.g., [11, p. 224])

˜

π

BC n

(α)

=

1

B −1 B

b=1 I

C−1 C

c=1 I

(

θ

ˆ

nĎĎbc

− ˜

θ

nĎb

≤ ˆ

θ

nĎb

− ˜

θ

n

) < (

1

α)

.

Due to the parametric structure of the oracle bootstrap, we can use a bootstrap recycling algorithm to approximate

π

˜

n

(α)

. Similar algorithms have been proposed by Newton and Geyer [19] for the parametric bootstrap. To this end, we generateC

independent oracle bootstrap resamplesX1Ď:1n

, . . . ,

X1Ď:Cn, and use the importance resampling identity to reweight the terms of the Monte-Carlo approximations of the iterated bootstrap expectations as follows:

Pr

ˆ

θ

ĎĎ n

− ˜

θ

Ď b n

≤ ˆ

θ

Ď b n

− ˜

θ

n

|

X1Ď:bn

C−1 C

c=1 I

ˆ

θ

Ďc n

− ˜

θ

Ď b n

≤ ˆ

θ

Ď b n

− ˜

θ

n

dP

ˆ

µ˜Ďb n

(

X Ďc 1:n

)

dP

ˆ

µ˜ n

(

X Ďc 1:n

)

=

C−1 C

c=1 I

ˆ

θ

Ďc n

− ˜

θ

nĎb

≤ ˆ

θ

nĎb

− ˜

θ

n

˜

w

bc

,

where

˜

w

bc

=

n

i=1

˜

w

Ďb i

˜

w

i

m(ic)

,

and m(ic)

=

#

{

j

:

XcjĎ

=

Xi

}

,

#Adenotes the number of elements of a setA,

w

˜

iĎband

w

˜

iare the oracle bootstrap weights corresponding toP

ˆ

˜ µĎb

n and

ˆ

Pµ˜n, respectively. Note thatC−1

C

c=1

w

˜

bc

Pr

1 asC

→ ∞

. For finiteC, (C−1

w

bc

:

c

=

1

, . . . ,

C) are not valid weights since some

bootstrap recycling estimates of

π

˜

n

(α)

may be greater than 1. A remedy to this problem is to use the following approximation instead: Pr

ˆ

θ

ĎĎ n

− ˜

θ

nĎb

≤ ˆ

θ

nĎb

− ˜

θ

n

|

X1Ď:bn

C

c=1 I

ˆ

θ

Ďc n

− ˜

θ

nĎb

≤ ˆ

θ

nĎb

− ˜

θ

n

¯

w

bc

,

where

w

¯

bc

=

w

˜

bc

/

C

c=1

w

˜

bc. In fact, this version of the oracle bootstrap recycling algorithm is implemented in the

simulation study carried out in Section6. 5. Local asymptotics

In this section, we study the large sample behavior of the bootstrap percentile confidence intervals when the mean parameter vector is in a local neighborhood of the set of points of inconsistency of the standard bootstrap. Specifically, assume thatX1:nn

=

X1n

, . . . ,

Xnnis a sample ofni.i.d. observations from a distributionPnonRd, such thatE

(

X1n

)

=

µ

n

and var

(

X1n

)

=

Σ, where thelocal parameter

µ

nis defined as

µ

n

=

µ

+

hn−δwith

δ >

0,

µ

S, andh

Rd

\ {

0

}

is a nonzero constant vector. The behavior of the bootstrap confidence intervals depends on how far is the local parameter from S; specifically, we consider three cases for

δ

, namely

δ <

1

/

2,

δ

=

1

/

2, and

δ >

1

/

2.

Local parameters ‘‘close’’ toS. Consider first the case when

δ >

1

/

2, that is, when the local parameter is ‘‘close’’ toS. By a Taylor expansion off

(

X

¯

n

)

about

µ

n, since

f

n

)

=

O

(

n−δ

)

, it follows thatn

(

θ

ˆ

n

θ

n

)

=

ZnTΓZn

+

oP

(

1

)

and

n1/2

f

(

X

¯

n

)

=

Zn

+

oP

(

1

)

, where

θ

n

=

f

n

)

andZn

=

n1/2

(

X

¯

n

µ

n

)

. Similarly to the proof ofTheorem 1, we obtain

lim n→∞Pr

θ

n

∈ ˆ

In

(α)

=

1

Pr

ZTΓZ

FN(10,Σ)

(

1

α

;

Z

) <

0

,

References

Related documents

Interchangeable (IC) Drug Products Added Addition of the following IC Drug Product to the ADBL has resulted in the creation of a New IC Grouping, effective September 1, 2013:

[r]

I was supplied in that case with voluminous information both in terms of medical records from the Veterans Administration and reports of individuals who knew

Sun J, Hunter P, Tyler A &amp; Willby N (2018) The influence of hydrological and land use indicators on macrophyte richness in lakes – A comparison of catchment

The Effects of Contingent Convertible (CoCo) Bonds on Insurers' Capital. Requirements under

Radio Frequency (RF) energy from wireless communications devices can interact with some electronic devices. For this reason, the FDA helped develop a detailed test method to

The pdeb prom oter region and mapping o f transcriptional start sites. CHAPTER 5: THE PDEB PROMOTER REGION MAPiWG

As an effective tool in this thesis we present a Principal Components Analysis (PCA) based technique to analyze the Capacitive Lead Frame test data and to detect