Contents lists available atSciVerse ScienceDirect
Journal of Multivariate Analysis
journal homepage:www.elsevier.com/locate/jmva
Bootstrapping in non-regular smooth function models
Mihai C. Giurcanu
University of Florida, United States
a r t i c l e i n f o Article history:
Received 23 April 2011 Available online 9 May 2012 AMS subject classifications: 62G09
62G15 62G20 Keywords:
Smooth function model Non-regular estimators Standard bootstrap m-out-of-nbootstrap Oracle bootstrap Confidence intervals
a b s t r a c t
We study the large sample behavior of the standard bootstrap, them-out-of-nbootstrap, and the oracle bootstrap (Giurcanu and Presnell, 2009) [14] percentile confidence intervals in non-regular smooth function models. We show that the oracle bootstrap percentile confidence intervals are consistent while the standard bootstrap and the m
-out-of-nbootstrap confidence intervals are inconsistent. Further analysis of coverage probabilities reveals that, for large samples, the iterated oracle bootstrap percentile confidence intervals are more accurate than their non-iterated versions. We also describe the large sample local behavior of the bootstrap confidence intervals for parameter values near the points of inconsistency of the standard bootstrap. In a simulation study, we describe the finite sample local behavior of various bootstrap confidence intervals.
©2012 Elsevier Inc. All rights reserved.
1. Introduction
Babu [1] shows that, in non-regular smooth function models, the standard bootstrap estimator of thedistributionof a smooth function of the sample mean is inconsistent for some values of the mean parameter vector, while Shao [25] shows that them-out-of-nbootstrap estimator is consistent over the entire parameter space. In this paper, we show that both the standard bootstrap and them-out-of-nbootstrap percentileconfidence intervalsare inconsistent for some values of the mean parameter vector, while the test inversion and them-out-of-nbootstrap test inversion confidence regions are consistent over the entire parameter space. Moreover, asymptotic expansions of coverage probabilities reveal that, for large samples, the test inversion confidence regions are more accurate than them-out-of-nbootstrap test inversion confidence regions.
Giurcanu and Presnell [14] propose the oracle bootstrap as an alternative to them-out-of-nbootstrap for consistent estimation of the distributions of many non-regular estimators, including the square of the sample mean, the Hodges and Stein estimators, and sparse estimators such as the LASSO. In this paper, we focus on the consistency and the higher order properties of oracle-bootstrap percentile confidence intervals and their iterated versions in non-regular smooth function models. We show that the oracle-bootstrap percentile confidence intervals are consistent over the entire parameter space, and that iteration increases their accuracy by an order of magnitude for large sample sizes.
It is well known that fixed parameter consistency may not adequately describe the large sample behavior of bootstrap estimators. For example, while the standard parametric bootstrap inconsistently estimates the distribution of both the Hodges and Stein estimators over a subset of the parameter space, it nevertheless performs better than the m -out-of-nbootstrap and a parametric version of the oracle bootstrap near the points of inconsistency [24]. More generally, a pointwise-consistent bootstrap estimator of the distribution of a non-locally asymptotically equivariant estimator is typically not locally uniform convergent at the points of inconsistency of the standard bootstrap [2,21]. In our local asymptotics analysis, we show that if the local parameter is close to a point of inconsistency, then the oracle bootstrap percentile confidence intervals are consistent, and that both the standard bootstrap and them-out-of-nbootstrap percentile
E-mail address:[email protected].
0047-259X/$ – see front matter©2012 Elsevier Inc. All rights reserved.
confidence intervals are inconsistent. Furthermore, if the local parameter is moderately close to a point of inconsistency, then all bootstrap confidence intervals are inconsistent. Finally, if the local parameter is moderately far from a point of inconsistency, then all bootstrap confidence intervals are consistent.
We conclude this section with an outline. In Section2, we present the asymptotic behavior of the standard bootstrap percentile confidence intervals in non-regular models. In Section3, we present the large sample results for them
-out-of-nbootstrap confidence intervals and for them-out-of-nbootstrap test inversion confidence regions. In Section4, we present the asymptotic behavior of the oracle bootstrap percentile confidence intervals and their iterated versions and in Section5, we study the large sample local behavior of the bootstrap confidence intervals. In Section6, we provide some computational details and the results of an empirical study which describe the finite sample local behavior of the bootstrap confidence intervals. The proofs of the theoretical results can be found in theAppendix.
2. Standard bootstrap estimation
Consider thesmooth function model(see, e.g., [4,15]) given by a random sampleX1:n
=
X1, . . . ,
Xnof i.i.d. observationsfrom a distributionPonRdsuch thatE
(
X)
=
µ
and var(
X)
=
Σ, whereX∼
PandΣis positive definite. Let
θ
=
f(µ)
be the parameter of interest, wheref:
Rd→
Ris asmoothfunction, and let
θ
ˆ
n=
f(
X¯
n)
be itsnaturalestimator, where¯
Xnis the sample mean. The smooth function model is calledregular if
∇
f(µ)
̸=
0 for allµ
∈
Rdandnon-regularif there exists aµ
∈
Rdsuch that∇
f(µ)
=
0, where∇
f(µ)
is the gradient off(µ)
. LetΘ=
f
(µ)
:
µ
∈
Rd,
∇
2f(µ)
̸=
0
, S=
µ
∈
Rd: ∇
f(µ)
=
0
, andΘ0
=
f
(µ)
:
µ
∈
S
be the image set ofSunderf, where∇
2f(µ)
is the Hessian matrix off(µ)
. By a two-term Taylor expansion off(
X¯
n)
aboutµ
, we havef
(
X¯
n
)
=
f(µ)
+
γ
T(
X¯
n−
µ)
+
(
X¯
n−
µ)
TΓ(
X¯
n−
µ)
+
oP(
n−1
),
whereγ
= ∇
f(µ)
andΓ=
(
1/
2)
∇
2f(µ)
. Ifθ
∈
Θ\
Θ0, thenn1/2
(
θ
ˆ
n−
θ)
N(
0, γ
TΣγ )
, where denotes convergencein distribution. If
θ
∈
Θ0, thenn(
θ
ˆ
n−
θ)
dj=1
λ
jUj2, whereλ
1, . . . , λ
dare the eigenvalues ofΣ1/2Γ Σ1/2andU1, . . . ,
Udare i.i.d.
∼
N(
0,
1)
. LetF(
x)
=
Pr(γ
TZ≤
x)
andG(
x)
=
Pr(
ZTΓZ≤
x)
, whereZ∼
N(
0,
Σ)
.The standard bootstrap confidence intervals were introduced by Efron [13] and their higher order properties were characterized by Hall [16]. LetX∗
1:n
=
X∗
1
, . . . ,
X∗
n be a standard bootstrap resample, i.e.,X
∗
1:nis a with replacement random
sample of sizenfromX1:n. Let
θ
ˆ
n∗=
f(
X¯
∗
n
)
be the bootstrap version ofθ
ˆ
n, and letLˆ
θˆ∗n
(
x)
=
Pr
ˆ
θ
∗ n− ˆ
θ
n≤
x|
X1:n
,Fˆ
θˆ∗ n(
x)
=
ˆ
Lθˆ∗ n(
x/
n 1/2)
, andGˆ
ˆ θ∗n
(
x)
= ˆ
Lθˆn∗(
x/
n)
be the bootstrap estimators ofLθˆn(
x)
=
Pr(
ˆ
θ
n−
θ
≤
x)
,Fθˆn(
x)
=
Pr
n1/2(
θ
ˆ
n−
θ)
≤
x
, and Gθˆn(
x)
=
Pr
n(
θ
ˆ
n−
θ)
≤
x
, respectively. Note that if
θ
∈
Θ\
Θ0andLθˆn(
x)
is continuous atL−θˆ1n
(
1−
α)
, whereL−ˆ1 θn(
1−
α)
is the(
1−
α)
quantile ofLθˆn(
x)
, an exactα
level upper confidence limit forθ
isθ
ˆ
n−
L−1 ˆ θn
(
1−
α)
. Letˆ
L−θˆ∗1 n(
1−
α)
be the(
1−
α)
quantile ofˆ
Lθˆ∗n
(
x)
, then theα
level bootstrap percentile upper confidence interval forθ
isˆ
In(α)
=
−∞
,
θ
ˆ
n− ˆ
L−ˆ1 θ∗ n(
1−
α)
. Similarly, the bootstrap percentile lower, equal-tailed, and symmetric confidence intervals forθ
areIˆ
Ln
(α)
=
ˆ
θ
n− ˆ
L−θˆ∗1 n(α),
∞
,ˆ
InET(α)
=
ˆ
θ
n−ˆ
L −1 ˆ θ∗ n((
1+
α)/
2),
θ
ˆ
n−ˆ
L −1 ˆ θ∗ n((
1−
α)/
2)
, andIˆ
S n(α)
=
ˆ
θ
n−ˆ
L ′−1 ˆ θ∗ n(α),
ˆ
θ
n+ˆ
L ′−1 ˆ θ∗ n(α)
, whereˆ
L′ ˆ θ∗ n(
x)
=
Pr
| ˆ
θ
∗ n− ˆ
θ
n| ≤
x|
X1:n
(see, e.g., [15]).Recall that an
α
level confidence intervalIn(α)
is consistent if its asymptotic coverage probability equals the nominallevel, i.e., limn→∞Pr
(θ
∈
In(α))
=
α
, and is inconsistent otherwise (see, e.g., [26, p. 329]). The following theorem showsthatI
ˆ
n(α)
is consistent forθ
∈
Θ\
Θ0and is inconsistent forθ
∈
Θ0. Part (ii) of this theorem gives the asymptotic coverage probability ofIˆ
n(α)
forθ
∈
Θ0. Similar results can be readily derived for the other types of confidence intervals.Theorem 1.Suppose f is two times continuously differentiable and E
(
∥
X∥
2) <
∞
.(i) If
θ
∈
Θ\
Θ0, thenlimn→∞Pr(θ
∈ ˆ
In(α))
=
α
.(ii)If
θ
∈
Θ0, thenlimn→∞Pr(θ
∈ ˆ
In(α))
=
1−
Pr
ZTΓZ
−
FN−(10,Σ)(
1−
α
;
Z) <
0
, where FN−(10,Σ)(
1−
α
;
y)
is the(
1−
α)
quantile of the distribution of 2yTΓZ+
ZTΓZ .3. Them-out-of-nbootstrap estimation
Them-out-of-nbootstrap estimation was introduced by Bickel and Freedman [5] and its large sample and higher order properties were analyzed by Bickel et al. [6]. Cheung et al. describe them-out-of-nbootstrap confidence intervals for
θ
under the constraint that∇
f(µ)
=
0. However, solving∇
f(µ)
=
0 forµ
, one may be able to determine the parameterθ
, and thus, the confidence interval estimation may not be suitable under such constraints on the parameters.LetX∗
1:m
=
X∗
1
, . . . ,
X∗
mbe anm-out-of-nbootstrap resample, i.e.,X
∗
1
, . . . ,
X∗
mis a with replacement random sample
of sizemfromX1:n, withm
=
o(
n)
and m→ ∞
. Letθ
ˆ
m∗=
f(
X¯
∗
m
)
be them-out-of-n bootstrap version ofθ
ˆ
n, andletL
ˆ
θˆ∗ m(
x)
=
Pr
ˆ
θ
∗ m− ˆ
θ
n≤
x|
X1:n
,Fˆ
θˆ∗ m(
x)
= ˆ
Lθˆm∗(
x/
m 1/2)
, andGˆ
ˆ θ∗percentile upper confidence interval for
θ
isIˆ
m(α)
=
−∞
,
θ
ˆ
n−
(
m/
n)
1/2Lˆ
−θˆ∗1m
(
1
−
α)
. Similarly, them-out-of-nbootstrap percentile lower, equal-tailed, and symmetric confidence intervals forθ
areIˆ
Lm
(α)
=
ˆ
θ
n−
(
m/
n)
1/2ˆ
L−θˆ∗1 m(α),
∞
,Iˆ
ET m(α)
=
ˆ
θ
n−
(
m/
n)
1/2Lˆ
−θˆ∗1 m((
1+
α)/
2),
θ
ˆ
n−
(
m/
n)
1/2Lˆ
−θˆ∗1 m((
1−
α)/
2)
, andIˆ
S m(α)
=
ˆ
θ
n−
(
m/
n)
1/2ˆ
L ′−1 ˆ θ∗ m(α),
ˆ
θ
n+
(
m/
n)
1/2ˆ
L ′−1 ˆ θ∗ m(α)
, whereLˆ
′ ˆ θ∗ m(
x)
=
Pr
| ˆ
θ
∗ m− ˆ
θ
n| ≤
x|
X1:n
. The following theorem shows that them-out-of-nbootstrap percentile upper confidence interval is consistent for
θ
∈
Θ\
Θ0and is inconsistent forθ
∈
Θ0.Theorem 2. Suppose f is two times continuously differentiable, E
(
∥
X∥
2) <
∞
,λ
j≥
0for all j=
1, . . . ,
d, m=
o(
n)
, andm
→ ∞
. Thenlimn→∞Pr(θ
∈ ˆ
Im(α))
=
α
ifθ
∈
Θ\
Θ0andlimn→∞Pr(θ
∈ ˆ
Im(α))
=
0ifθ
∈
Θ0.Bootstrap confidence regions can be derived by inverting bootstrap test statistics in analogy with the classical theory (see, e.g., [9, Section 9.2.1]). Consider testingH0
:
θ
=
θ
0versusHa:
θ > θ
0using the test statisticTn(
X1:n, θ
0)
=
n1/2(
θ
ˆ
n−
θ
0)
ifθ
0∈
Θ\
Θ0andTn
(
X1:n, θ
0)
=
n(
θ
ˆ
n−
θ
0)
ifθ
0∈
Θ0. UnderH0,Tn(
X1:n, θ
0)
γ
TZifθ
0∈
Θ\
Θ0andTn(
X1:n, θ
0)
ZTΓZif
θ
0∈
Θ0. Anα
level test inversion confidence region forθ
is defined as In(α)
=
θ
∈
Θ\
Θ0: ˆ
Fn−1(
1−
α)
≤
n1/2(
θ
ˆ
n−
θ)
∪
θ
∈
Θ0: ˆ
G−n1(
1−
α)
≤
n(
θ
ˆ
n−
θ)
,
whereFˆ
n(
x)
=
Pr(
γ
ˆ
nTΣˆ
1/2 n U≤
x|
X1:n)
andGˆ
n(
x)
=
Pr(
UTΣˆ
1/2 n Γˆ
nΣˆ
1/2n U
≤
x|
X1:n)
are conditional distribution functionsgivenX1:n,U
∼
N(
0,
I)
is independent ofX1:n,γ
ˆ
n= ∇
f(
X¯
n)
,Γˆ
n=
(
1/
2)
∇
2f(
X¯
n)
, andΣˆ
n=
n−1
ni=1(
Xi− ¯
Xn)(
Xi− ¯
Xn)
T.The correspondingm-out-of-nbootstrap test inversion confidence region for
θ
isˆ
Jm(α)
=
θ
∈
Θ\
Θ0: ˆ
F−ˆ1 θ∗ m(
1−
α)
≤
n1/2(
θ
ˆ
n−
θ)
∪
θ
∈
Θ0: ˆ
G−ˆ1 θ∗ m(
1−
α)
≤
n(
θ
ˆ
n−
θ)
.
The following corollary shows that the test inversion confidence regions and the m-out-of-n bootstrap test inversion confidence regions are consistent over the entire parameter space.
Corollary 1. Suppose f is two times continuously differentiable, E
(
∥
X∥
2) <
∞
, m=
o(
n)
, and m→ ∞
. Then, for allθ
∈
Θ, limn→∞Pr(θ
∈
In(α))
=
α
andlimn→∞Pr(θ
∈ ˆ
Jm(α))
=
α
.SupposeE
∥
X∥
5
<
∞
and that the characteristic function ofXsatisfies the Cramer condition lim sup∥t∥→∞|
Eexp(
itTX)
|
<
1, and thus, the Edgeworth expansion of the distribution functionFZn(
x)
ofZn=
(
Z(1)
n
, . . . ,
Zn(d))
T=
n1/2(
X¯
n−
µ)
holds (see, e.g., [4, Theorem 2, p. 436]):
FZn
(
x)
=
I
(
z≤
x)
1+
n−1/2p1(
z)
+
n−1p2(
z)
φ
Σ(
z)
dz+
O(
n−3/2),
(1)where
φ
Σ(
z)
is the density function of N(
0,
Σ)
,pi(
z)
are odd/even polynomials inz for odd/eveni=
1,
2 (see, e.g.,[15, pp. 162–167]), and I
(
A)
is the indicator function of an event A. Let H(
x)
=
E
p1(
Z)
I(γ
TZ≤
x)
and J(
x)
=
E
ZTΓZI(γ
TZ≤
x)
. Furthermore, letf(i1···ik)=
∂
k/∂µ
i1· · ·
∂µ
ik
f(µ)
fork≥
2,Ξi=
(
1/
3!
)
f(ijk):
j,
k=
1, . . . ,
d
∈
Rd×d,Ξ=
(Ξ
1, . . . ,
Ξd)
∈
Rd×d 2 ,Π=
(Π
1, . . . ,
Πd)
∈
Rd×d3,Πi=
(Π
i1, . . . ,
Πid)
∈
Rd×d 2 ,Πij=
(
1/
4!
)
f(ijkl):
k,
l=
1, . . . ,
d
∈
Rd×d,K(
x)
=
E
p2(
Z)
I(
ZTΓZ≤
x)
,M(
x)
=
E
p1(
Z)
ZTΞ(Z⊗
Id)
ZI(
ZTΓZ≤
x)
,N(
x)
=
E
ZTΠ(Z⊗
Id2)(
Z⊗
Id)
ZI(
ZTΓZ≤
x)
,P(
x)
=
(
1/
2)
E
(
ZTΞ(Z⊗
I d)
Z)
2I(
ZTΓZ≤
x)
, where
⊗
denotes the Kronecker product. The following lemma provides the Edgeworth and the Cornish–Fisher expansions ofFθˆn(
x)
,F−1 ˆ θn
(α)
,Gθˆn(
x)
, andG −1 ˆ θn(α)
, respectively. Similar expansions have been obtained by Cheung et al. [10] forθ
∈
Θ0.Lemma 1. Assume that f is five times continuously differentiable, E
∥
X∥
5
<
∞
, andlim sup∥t∥→∞|
Eexp(
itTX)
|
<
1.(i) If
θ
∈
Θ\
Θ0, then Fθˆn(
x)
=
F(
x)
+
n−1/2
H(
x)
−
J′(
x)
+
O(
n−1)
and Fˆ−1 θn(α)
=
F−1(α)
−
n−1/2
F′(
F−1(α))
−1
H(
F−1(α))
−
J′(
F−1(α))
+
O(
n−1)
; (ii) Ifθ
∈
Θ0, then Gθˆn(
x)
=
G(
x)
+
n−1
K(
x)
−
M′(
x)
−
N′(
x)
+
P′′(
x)
+
O(
n−3/2)
and G−1 ˆ θn(α)
=
G−1(α)
−
n−1
G′(
G−1(α))
−1
K(
G−1(α))
−
M′(
G−1(α))
−
N′(
G−1(α))
+
P′′(
G−1(α))
+
O(
n−3/2)
. LetZ∗ m=
(
Z ∗ m( 1), . . . ,
Z∗ m( d))
T=
m1/2(
X¯
∗m
− ¯
Xn)
, then the Edgeworth expansion ofFˆ
Z∗m
(
x)
, the conditional distribution function ofZm∗givenX1:n, holds:ˆ
FZ∗ m(
x)
=
I(
z≤
x)
1+
m−1/2pˆ
1n(
z)
+
m−1pˆ
2n(
z)
φ
Σˆn(
z)
dz+
OP(
m−3/2),
(2)where
φ
Σˆn(
z)
is the conditional density function ofN(
0,
Σˆ
n)
givenX1:n,pˆ
in(
z)
are odd/even polynomials inzfor odd/eveni
=
1,
2, the sample versions of p1(
z)
andp2(
z)
defined by(1). LetHˆ
n(
x)
=
E
ˆ
p1n(
Z∗)
I(
γ
ˆ
nTZ ∗≤
x)
|
X1:n
,ˆ
Jn(
x)
=
E
Z∗TΓˆ
nZ∗I(
γ
ˆ
nTZ ∗≤
x)
|
X1:n
, andQˆ
n(
x)
=
E
ˆ
p1n(
Z∗)
n1/2γ
ˆ
nTZ ∗ I(
Z∗TΓˆ
nZ∗≤
x)
|
X1:n
lemma provides the empirical Edgeworth and the empirical Cornish–Fisher expansions of the m-out-of-n bootstrap estimatorsF
ˆ
θˆ∗ m(
x)
,ˆ
F−ˆ1 θ∗ m(α)
,Gˆ
θˆ∗ m(
x)
, andˆ
G−ˆ1 θ∗ m(α)
, respectively.Lemma 2. Suppose f is five times continuously differentiable, m
=
o(
n)
, m→ ∞
, E
∥
X∥
5
<
∞
, andlim sup∥t∥→∞|
Eexp(
itTX)
|
<
1. (i) Ifθ
∈
Θ\
Θ0, thenFˆ
θˆ∗ m(
x)
=
ˆ
Fn(
x)
+
m−1/2
ˆ
Hn(
x)
− ˆ
Jn′(
x)
+
OP(
m−1)
and Fˆ
θ−ˆ∗1 m(α)
=
Fˆ
−1 n(α)
−
m −1/2
ˆ
F′ n(
Fˆ
−1 n(α))
−1
ˆ
Hn(
Fˆ
n−1(α))
− ˆ
J ′ n(
Fˆ
−1 n(α))
+
OP(
m−1)
. (ii)Ifθ
∈
Θ0, then Gˆ
θˆ∗ m(
x)
=
ˆ
Gn(
x)
−
n−1/2Qˆ
n′(
x)
+
OP
m−1+
mn−1
and Gˆ
−ˆ1 θ∗ m(α)
=
Gˆ
−1 n(α)
+
n −1/2
ˆ
G′ n(
Gˆ
−1 n(α))
−1ˆ
Q′ n(
Gˆ
−1 n(α))
+
OP
m−1+
mn−1
.Next theorem shows that the test inversion confidence regions are more accurate than them-out-of-nbootstrap test inversion confidence regions.
Theorem 3.Suppose f is five times continuously differentiable, m
=
o(
n)
, m→ ∞
, E
∥
X∥
5
<
∞
, andlim sup∥t∥→∞|
Eexp(
itTX)
|
<
1. (i) Ifθ
∈
Θ\
Θ0, thenPr
θ
∈
In(α)
=
α
+
O(
n−1/2)
andPr
θ
∈ ˆ
Jm(α)
=
α
+
O(
m−1/2)
. (ii)Ifθ
∈
Θ0, thenPr
θ
∈
In(α)
=
α
+
O(
n−1)
andPr
θ
∈ ˆ
Jm(α)
=
α
+
O
m−1+
mn−1
.4. Oracle bootstrap estimation
The oracle bootstrap is a particular type of the empirical likelihood bootstrap [8] and of the intentionally-biased bootstrap [18] that adapts a parametric bootstrap proposed by Putter and van Zwet [22] to nonparametric setting. First, the empirical distribution is embedded in a parametric family of weighted empirical distributions of the sample and then Putter and van Zwet’s parametric bootstrap procedure is applied to this family. To construct the parametric family
{ ˆ
Pυ:
υ
∈
Rd}
associated with the smooth function model, we choose a vector of weights
(w
1, . . . , w
n)
to minimize−
n−1
ni=1log
(
nw
i)
,the Kullback–Leibler divergence of
(w
1, . . . , w
n)
to the vector of uniform weights(
n−1, . . . ,
n−1)
, subject to the constraints
ni=1
w
iXi=
υ
,
ni=1
w
i=
1, andw
i≥
0 for alli=
1, . . . ,
n. The solution of this constraint optimization problem coincideswith the empirical likelihood weights for testing the null hypothesisH0
:
µ
=
υ
[20]. Ifυ
is not in the convex hull ofX1:n,then we arbitrarily takeP
ˆ
υ= ˆ
Pn, wherePˆ
nis the empirical distribution of the sample.Let
µ
˜
nbe a sequence ofn1/2-consistent estimators ofµ
such that Pr
˜
µ
n∈
S
→
1 wheneverµ
∈
S, i.e.,µ
˜
nhas the ‘‘oracle property’’ for the setS. A typical oracle bootstrap resampleX1Ď:n=
X1Ď, . . . ,
XnĎ is a sequence ofnconditionallyi.i.d. random draws fromP
ˆ
µ˜n and
µ
˜
nplays the role ofµ
in the oracle bootstrap world. Letθ
ˆ
Ď
n
=
f(
X¯
nĎ)
andθ
˜
n=
f(
µ
˜
n)
be the oracle bootstrap versions of
θ
ˆ
nandθ
, respectively, whereX¯
nĎis the sample mean of the oracle bootstrap resample.Then,
˜
LθˆnĎ(
x)
=
Pr
ˆ
θ
Ďn
− ˜
θ
n≤
x|
X1:n
,F
˜
θˆnĎ(
x)
= ˜
LθˆnĎ(
x/
n1/2)
, andG˜
θˆnĎ(
x)
= ˜
LθˆnĎ(
x/
n)
are the oracle bootstrap versionsofLθˆn
(
x)
,Fθˆn(
x)
, andGθˆn(
x)
, respectively. Thus, theα
level oracle bootstrap percentile upper confidence interval forθ
is˜
In(α)
=
−∞
,
θ
ˆ
n− ˜
L−ˆ1 θĎ n(
1
−
α)
. Similarly, the oracle bootstrap percentile lower, equal-tailed, and symmetric confidence intervals forθ
are defined asI˜
nL(α)
=
ˆ
θ
n− ˜
L −1 ˆ θĎ n(α),
∞
,I˜
ETn(α)
=
ˆ
θ
n− ˜
L −1 ˆ θĎ n((
1+
α)/
2),
θ
ˆ
n− ˜
L −1 ˆ θĎ n((
1−
α)/
2)
, and˜
InS(α)
=
ˆ
θ
n− ˜
L ′−1 ˆ θĎ n(α),
ˆ
θ
n+ ˜
L ′−1 ˆ θĎ n(α)
, respectively, where˜
L′ˆ θĎ n(
x)
=
Pr
| ˆ
θ
nĎ− ˜
θ
n| ≤
x|
X1:n
.Note that there are many estimators
µ
˜
n which have the oracle property for the setS. For example, ifS= {
µ
0}
is a singleton, letµ
˜
n=
µ
0+
(
X¯
n−
µ
0)
◦
I(
|∇
f(
X¯
n)
| ≥
an)
, where ‘‘◦
’’ is the component-wise vector to vector multiplication(Hadamard product),I
(
|∇
f(
X¯
n)
| ≥
an)
=
I
(
|∇
f(
X¯
n)
(j)| ≥
an)
:
j=
1, . . . ,
d
,an
→
0 andn1/2an→ ∞
, e.g.,an=
n−1/2log
(
log(
n))
. Another option is to use a LASSO-type estimator which has the oracle property for the setS, such as an adaptive LASSO-type estimator (see, e.g., [28]):˜
µ
n=
argmin b
n
i=1(
Xi−
b)
T(
Xi−
b)
+
λ
n d
j=1| ¯
Xn(j)−
µ
(0j)|
−1|
b(j)−
µ
(0j)|
,
wheren−1/2
λ
n→
0 andλ
n→ ∞
. If the non-empty setSis not a singleton, then for allν
∈
S, letµ
˜
n,ν=
ν
+
(
X¯
n−
ν)
◦
I
|∇
f(
X¯
n)
| ≥
an
. In the case of the adaptive LASSO estimator, let
˜
µ
n,ν=
argmin b
n
i=1(
Xi−
b)
T(
Xi−
b)
+
λ
n d
j=1| ¯
Xn(j)−
ν
(j)|
−1|
b(j)−
ν
(j)|
.
Then, set
µ
˜
n= ˜
µ
n,νˆn, whereν
ˆ
n=
argminν∈S∥ ˜
µ
n,ν− ¯
Xn∥
2and∥ · ∥
is the Euclidean norm. Next theorem shows that theTheorem 4. Assume that f is two times continuously differentiable and E
(
∥
X∥
2) <
∞
. Thenlimn→∞Pr(θ
∈ ˜
In(α))
=
α
for allθ
∈
Θ. LetAn= ∩
dj=1
|∇
f(
X¯
n)
(j)| ≥
n−r
, wherer
∈
(
0,
1/
2)
. Next lemma shows that ifan=
n−r and∇
f(µ)
(j)̸=
0 forallj
=
1, . . . ,
d, then there exists a constantc>
0 such that Pr(
An)
=
1−
O
exp
(
−
cn1−2r)
. Since
µ
˜
n= ¯
XnonAn, itfollows that the oracle bootstrap and the standard bootstrap are equivalent except on an event of probability exponentially small. If
θ
∈
Θ0, next lemma shows that there exists a constantc>
0 such that Pr(
Bn)
=
1−
O
exp
(
−
cn1−2r)
, where Bn= {∥∇
f(
X¯
n)
∥
∞≤
n−r}
and∥ · ∥
∞ is the infinity norm. Therefore,∇
f(
µ
˜
n)
=
0 except on an event of probabilityexponentially small.
Lemma 3. Suppose f is twice continuously differentiable, E
∥
X∥
2
<
∞
, andlogE
exp(
tTX)
<
∞
in a neighborhood of0
∈
Rd.1. If
∇
f(µ)
(j)̸=
0for all j=
1, . . . ,
d, there exists c>
0such thatPr(
An
)
=
1−
O
exp
(
−
cn1−2r)
.2. If
θ
∈
Θ0, there exists a constant c>
0such thatPr(B
n)
=
1−
O
exp
(
−
cn1−2r)
.Suppose
θ
∈
Θ\
Θ0. Similarly to them-out-of-nbootstrap, under the conditions ofLemma 4, the following empirical Edgeworth expansion ofF˜
ZĎ n(
x)
holds:˜
FZĎ n(
x)
=
I(
z≤
x)
1+
n−1/2p˜
1n(
z)
+
n−1p˜
2n(
z)
φ
Σ˜n(
z)
dz+
OP(
n−3/2),
(3)whereZnĎ
=
n1/2(
X¯
nĎ− ˜
µ
n)
, and givenX1:n,φ
Σ˜n(
z)
is the conditional density ofZĎ∼
N(
0,
Σ˜
n)
,Σ˜
n=
ni=1w
˜
i
Xi
− ˜
µ
n)(
Xi−
˜
µ
n)
T,w
˜
i,i=
1, . . . ,
n, are the oracle bootstrap weights corresponding toPˆ
µ˜n,p˜
in(
z)
are odd/even polynomials inzfor odd/eveni=
1,
2, the oracle bootstrap versions ofp1(
z)
andp2(
z)
defined by(1). Letγ
˜
n= ∇
f(
µ
˜
n)
,Γ˜
n=
(
1/
2)
∇
2f(
µ
˜
n)
∈
Rd×d,Ξ
˜
n=
(
Ξ˜
1n, . . . ,
Ξ˜
dn)
∈
Rd×d 2 ,Ξ˜
in=
(
1/
3!
)
f˜
n(ijk):
j,
k=
1, . . . ,
d
∈
Rd×d,Π˜
n=
(
Π˜
1n, . . . ,
Π˜
dn)
∈
Rd×d 3 ,˜
Πin=
(
Π˜
i1n, . . . ,
Π˜
idn)
∈
Rd×d 2 ,Π˜
ijn=
(
1/
4!
)
f˜
n(ijkl):
k,
l=
1, . . . ,
d
∈
Rd×d, andf˜
(i1···ik) n=
f(i1···ik)(
µ
˜
n)
, wherek≥
2. Let furtherF˜
n(
x)
=
Pr(
γ
˜
nTZĎ≤
x|
X1:n)
,H˜
n(
x)
=
E(
p˜
1n(
ZĎ)
I(
γ
˜
nTZĎ≤
x)
|
X1:n)
,˜
Jn(
x)
=
E(
ZĎTΓ˜
nZĎI(
γ
˜
nTZĎ≤
x)
|
X1:n)
,G˜
n(
x)
=
Pr
ZĎTΓ˜
nZĎ≤
x|
X 1:n
,K˜
n(
x)
=
E
˜
p2n(
ZĎ)
I(
ZĎTΓ˜
nZĎ≤
x)
|
X1:n
,M˜
n(
x)
=
E
˜
p1n(
ZĎ)
ZĎTΞ˜
n(
ZĎ⊗
Id)
ZĎI(
ZĎTΓ˜
nZĎ≤
x)
|
X1:n
,˜
Nn(
x)
=
E
ZĎTΠ˜
n(
Z˜
Ď⊗
I d2)(
ZĎ⊗
Id)
ZĎI(
ZĎTΓ˜
nZĎ≤
x)
|
X1:n
, andP˜
n(
x)
=
(
1/
2)
E
(
ZĎTΞ˜
n(
ZĎ⊗
I d)
ZĎ)
2I(
ZĎTΓ˜
nZĎ≤
x)
|
X1:n
. Next lemma gives the empirical Edgeworth and the empirical Cornish–Fisher expansions ofFˆ
θˆĎn
(
x)
,ˆ
F−1 ˆ θnĎ(α)
,Gˆ
θˆĎ n(
x)
, andˆ
G−ˆ1 θĎ n(α)
, respectively.Lemma 4. Assume that f is five times continuously differentiable, E
∥
X∥
5
<
∞
, andlim sup∥t∥→∞|
Eexp(
itTX)
|
<
1.(i) If
θ
∈
Θ\
Θ0, then F˜
θˆnĎ(
x)
=
F˜
n(
x)
+
n−1/2
˜
Hn(
x)
− ˜
Jn′(
x)
+
OP(
n−1)
and F˜
θˆ−Ď1 n(α)
=
F˜
n−1(α)
−
n−1/2
˜
F′ n(
F˜
−1 n(α))
−1
˜
Hn(
F˜
n−1(α))
− ˜
J ′ n(
F˜
−1 n(α))
+
OP(
n−1)
; (ii) Ifθ
∈
Θ0, then G˜
θˆnĎ(
x)
= ˜
Gn(
x)
+
n−1
˜
Kn(
x)
− ˜
Mn′(
x)
− ˜
N ′ n(
x)
+ ˜
P ′′ n(
x)
+
OP(
n−3/2)
andG˜
−θˆĎ1 n(α)
= ˜
G−n1(α)
−
n−1
˜
G′ n(
G˜
−1 n(α))
−1
˜
Kn(
G˜
−n1(α))
− ˜
M ′ n(
G˜
−1 n(α))
− ˜
N ′ n(
G˜
−1 n(α))
+ ˜
P ′′ n(
G˜
−1 n(α))
+
OP(
n−3/2)
.Bootstrap iteration is known to further reduce the size distortion of a bootstrap test, to further reduce the bias of an estimator, and to further improve the coverage probability of a bootstrap confidence interval [3,17,10]. Bootstrap iteration is usually used to estimate a parameter of the distribution of a statistical quantity defined in terms of a bootstrap estimator. For example, to obtain a (coverage calibrated) iterated bootstrap confidence interval, one first estimates the coverage probability of the bootstrap confidence interval using the iterated bootstrap, and then, the nominal level of the bootstrap confidence interval is re-adjusted based on this estimate. Specifically, letX1Ď:n
=
X1Ď, . . . ,
XnĎbe an oracle bootstrap resample, and letˆ
θ
n− ˜
L −1 ˆ θĎ n(
1
−
α)
be theα
level oracle bootstrap percentile upper confidence limit, and letπ
n(α)
=
Pr
θ
≤ ˆ
θ
n− ˜
L−ˆ1 θĎn
(
1−
α)
be the coverage probability of the confidence interval. A typical iterated oracle bootstrap corresponding toX1Ď:nis a collection
X1ĎĎ:n
=
X1ĎĎ, . . . ,
XnĎĎofnconditionally i.i.d. draws fromPˆ
µ˜Ďn. Letˆ
θ
ĎĎn be the iterated oracle bootstrap version of
θ
ˆ
n. The iteratedoracle bootstrap estimate of
π
n(α)
isπ
˜
n(α)
=
Pr
˜
θ
n≤ ˆ
θ
nĎ− ˜
LĎ −1 ˆ θnĎĎ(
1−
α)
|
X1:n
, where˜
LĎ ˆ θnĎĎ(
x)
=
Pr
ˆ
θ
ĎĎ n− ˜
θ
nĎ≤
x|
X1Ď:n
. Thus, theα
level coverage calibrated confidence interval isI˜
Ďn(α)
=
−∞
,
θ
ˆ
n− ˜
L−1 ˆ θnĎ(
1− ˜
π
−1 n(α))
. Similarly, the iterated standard bootstrap and them-out-of-nbootstrap confidence intervals areI
ˆ
∗n(α)
=
−∞
,
θ
ˆ
n− ˆ
L−ˆ1 θ∗ n(
1− ˆ
π
n−1(α))
andˆ
I∗ m(α)
=
−∞
,
θ
ˆ
n−
(
m/
n)
1/2ˆ
L−1 ˆ θ∗ m(
1− ˆ
π
−1 m(α))
, respectively, whereπ
ˆ
n(α)
=
Pr
θ
ˆ
n≤
θ
ˆ
n∗− ˆ
L ∗−1 ˆ θ∗∗ n(
1−
α)
|
X1:n
,ˆ
π
m(α)
=
Pr
ˆ
θ
n≤ ˆ
θ
m∗−
(
l/
m)
1/2ˆ
L ∗−1 ˆ θ∗∗ l(
1−
α)
|
X1:n
Next theorem shows that iterated oracle bootstrap percentile confidence intervals are more accurate than their non-iterated versions by an order of magnitude for large sample sizes.
Theorem 5.Assume that f is five times continuously differentiable, E
∥
X∥
5
<
∞
, andlim sup∥t∥→∞|
exp(
itTX)
|
<
1.(i) If
θ
∈
Θ\
Θ0, thenPr
θ
∈ ˜
In(α)
=
α
+
O(
n−1/2)
andPr
θ
∈ ˜
IĎn(α)
=
α
+
O(
n−1)
. (ii)Ifθ
∈
Θ0, thenPr
θ
∈ ˜
In(α)
=
α
+
O(
n−1)
andPr
θ
∈ ˜
IĎn(α)
=
α
+
O(
n−3/2)
.In practice, the bootstrap confidence bounds do not have closed form expressions, and thus, we usually approximate them by Monte-Carlo methods. Specifically, letX1Ď:1n
, . . . ,
X1Ď:BnbeBoracle bootstrap resamples, and letℓ
˜
n,B(
1−
α)
denotethe
(
1−
α)
Bth order statistic of(
θ
ˆ
nĎb− ˜
θ
n)
, for b=
1, . . . ,
B, whereθ
ˆ
nĎb is the bootstrap version ofθ
ˆ
n onX1Ď:bn. Foreachb
=
1, . . . ,
B, letX1ĎĎ:nb1, . . . ,
XĎĎ bC1:n beC iterated oracle bootstrap re-resamples and let
ℓ
˜
nĎ,b,C(
1−
α)
denote the(
1−
α)
Cth order statistic of(
θ
ˆ
nĎĎbc− ˜
θ
nĎb)
, forc=
1, . . . ,
C, whereθ
ˆ
nĎĎbc is the bootstrap version ofθ
ˆ
nonX1ĎĎ:nbc. Since˜
LĎˆ−1 θĎĎ n(
1−
α)
≤
(
θ
ˆ
nĎ− ˜
θ
n)
⇐⇒
(
1−
α)
≤ ˜
LĎˆ θĎĎ n(
ˆ
θ
Ď n− ˜
θ
n)
,π
˜
n(α)
=
1−
Pr
˜
LĎˆ θĎĎ n(
ˆ
θ
Ď n− ˜
θ
n) < (
1−
α)
|
X1:n
. The Monte-Carlo approximation ofπ
˜
n(α)
is (see also, e.g., [11, p. 224])˜
π
BC n(α)
=
1−
B −1 B
b=1 I
C−1 C
c=1 I(
θ
ˆ
nĎĎbc− ˜
θ
nĎb≤ ˆ
θ
nĎb− ˜
θ
n) < (
1−
α)
.
Due to the parametric structure of the oracle bootstrap, we can use a bootstrap recycling algorithm to approximate
π
˜
n(α)
. Similar algorithms have been proposed by Newton and Geyer [19] for the parametric bootstrap. To this end, we generateCindependent oracle bootstrap resamplesX1Ď:1n
, . . . ,
X1Ď:Cn, and use the importance resampling identity to reweight the terms of the Monte-Carlo approximations of the iterated bootstrap expectations as follows:Pr
ˆ
θ
ĎĎ n− ˜
θ
Ď b n≤ ˆ
θ
Ď b n− ˜
θ
n|
X1Ď:bn
≃
C−1 C
c=1 I
ˆ
θ
Ďc n− ˜
θ
Ď b n≤ ˆ
θ
Ď b n− ˜
θ
n
dPˆ
µ˜Ďb n(
X Ďc 1:n)
dPˆ
µ˜ n(
X Ďc 1:n)
=
C−1 C
c=1 I
ˆ
θ
Ďc n− ˜
θ
nĎb≤ ˆ
θ
nĎb− ˜
θ
n
˜
w
bc,
where˜
w
bc=
n
i=1
˜
w
Ďb i˜
w
i
m(ic),
and m(ic)=
#{
j:
XcjĎ=
Xi}
,
#Adenotes the number of elements of a setA,
w
˜
iĎbandw
˜
iare the oracle bootstrap weights corresponding toPˆ
˜ µĎbn and
ˆ
Pµ˜n, respectively. Note thatC−1
Cc=1
w
˜
bcPr
−
→
1 asC→ ∞
. For finiteC, (C−1w
bc
:
c=
1, . . . ,
C) are not valid weights since somebootstrap recycling estimates of
π
˜
n(α)
may be greater than 1. A remedy to this problem is to use the following approximation instead: Pr
ˆ
θ
ĎĎ n− ˜
θ
nĎb≤ ˆ
θ
nĎb− ˜
θ
n|
X1Ď:bn
≃
C
c=1 I
ˆ
θ
Ďc n− ˜
θ
nĎb≤ ˆ
θ
nĎb− ˜
θ
n
¯
w
bc,
wherew
¯
bc=
w
˜
bc/
Cc=1
w
˜
bc. In fact, this version of the oracle bootstrap recycling algorithm is implemented in thesimulation study carried out in Section6. 5. Local asymptotics
In this section, we study the large sample behavior of the bootstrap percentile confidence intervals when the mean parameter vector is in a local neighborhood of the set of points of inconsistency of the standard bootstrap. Specifically, assume thatX1:nn
=
X1n, . . . ,
Xnnis a sample ofni.i.d. observations from a distributionPnonRd, such thatE(
X1n)
=
µ
nand var
(
X1n)
=
Σ, where thelocal parameterµ
nis defined asµ
n=
µ
+
hn−δwithδ >
0,µ
∈
S, andh∈
Rd\ {
0}
is a nonzero constant vector. The behavior of the bootstrap confidence intervals depends on how far is the local parameter from S; specifically, we consider three cases forδ
, namelyδ <
1/
2,δ
=
1/
2, andδ >
1/
2.Local parameters ‘‘close’’ toS. Consider first the case when
δ >
1/
2, that is, when the local parameter is ‘‘close’’ toS. By a Taylor expansion off(
X¯
n)
aboutµ
n, since∇
f(µ
n)
=
O(
n−δ)
, it follows thatn(
θ
ˆ
n−
θ
n)
=
ZnTΓZn+
oP(
1)
andn1/2
∇
f(
X¯
n)
=
2ΓZn+
oP(
1)
, whereθ
n=
f(µ
n)
andZn=
n1/2(
X¯
n−
µ
n)
. Similarly to the proof ofTheorem 1, we obtainlim n→∞Pr