R E S E A R C H
Open Access
Difference-based M-estimator of generalized
semiparametric model with NSD errors
Fangning Fu
1, Zhen Zeng
1and Xiangdong Liu
1**Correspondence:
1Department of Statistics, Jinan
University, Guangzhou, P.R. China
Abstract
In this paper, we consider the generalized semiparametric model (GSPM)
yi=h
(
xTiβ)
+f(ti) +ei, 1≤i≤n,whereh(·) is a known function,eiare dependent errors. We obtain an estimator of the parametric component
β
for the model by a difference-based M-estimator. In addition, we prove the asymptotic normality of the proposed estimator and investigate the weak convergence rate of the wavelet estimator off(·). Furthermore, we apply these results to a partially linear model with dependent errors.MSC: 60F05; 62F12; 62G05
Keywords: Generalized semiparametric model; NSD random variables; M-estimator; Asymptotic normality; Weak convergence rate
1 Introduction
Consider the generalized semiparametric model
yi=h
xTiβ+f(ti) +ei, 1≤i≤n, (1)
whereyiare scalar response variables,h(·) is a continuously differentiable known function,
the superscriptT denotes the transpose, xi= (xi1, . . . ,xid)Tare explanatory variables,βis
ad-dimensional unknown parameter,f(·) is an unknown function, and 0≤t1≤t2≤ · · · ≤ tn≤1. Some authors commented that the assumption of independence is a serious
restric-tion (see Huber [1] and Hampel [2]); so for the errorsei, we confine ourselves to negatively
superadditive dependent (NSD) errors. NSD random variables have been introduced by Hu [3] and are widely used in statistics; see [4–12].
The theory of the GSPM is an extension of the classical theory of partially linear mod-els; the component of the generalized parametrich(xT
iβ) for GSPM includes the linear
parametric component xT
iβ, exponential parametric componente
xTiβ, and so on.
As is well known, the generalized partially linear model and partially linear single-index model (h(·) is an unknown link function) are also derived from the partially linear model. There is a substantial amount of work for generalized partially linear model (see [13–
18] and, for a partially linear single-index model, [19–24]); this research is devoted to
presenting various methods to obtain estimators ofβ andf(ti) and investigating some
large-sample properties of these estimators.
In this paper, we consider a difference-based estimator method to estimate the unknown parametric componentβ. This difference-based estimator is optimal in the sense that the estimator of the unknown parametric component is asymptotically efficient. For example, Tabakan et al. [25] studied a difference-based ridge in a partially linear model. Wang et al. [26] obtained a difference-based approach to the semiparametric partially linear model. Zhao and You [27] used a difference-based estimator method to estimate the parametric component for partially linear regression models with measurement errors. Duran et al. [28] investigated the difference-based ridge and Liu-type estimators in semiparametric re-gression models. Wu [29] discussed a restricted difference-based Liu estimator in partially linear models. Hu et al. [30] presented a difference-based Huber–Dutter (DHD) estima-tor to obtain the root varianceσ and parameterβfor a partially linear model. However, Most of the results rely on the independence errors. Wu [31] studied the difference-based ridge-type estimator of parameters in a restricted partial linear model with correlated er-rors, but this paper just focuses on estimating the linear component. Zeng and Liu [32] used a difference-based and ordinary least-square method to obtain the estimator of an unknown parametric component, but this paper ignores the fact that a difference-based estimator may cause greater bias in moderately sized samples than other estimators. In-spired by these papers, we propose a difference-based M-estimator (DM) methods for generalized semiparametric model with NSD errors. The M-estimator is a most famous robust estimator, which was introduced by Huber [33]. In addition, onceβis estimated, we can estimatef(·) by a variety of nonparametric techniques. In this paper, the estimator off(·) is obtained by the wavelet method.
The paper has the following structure. In Sect.2, we present the estimation procedure. In Sect.3, we establish the main results. The proofs of the main results are provided in the
Appendix.
2 Estimation method
2.1 Notation
Throughout the paper, Z is the set of integers, N is the set of natural numbers, R is the set of real numbers. A sequence of random variablesηnis said to be of smaller order in
probability than a sequencedn(denoted byηn=oP(dn)) ifηn/dnconverges to 0 in
prob-ability, andηn=OP(dn) ifηn/dnis bounded in probability. Convergence in distribution is
denoted byHn D
→H. For any arbitrary functionh(·),h(·),h(·), andh(·) are the first, second, and third derivatives of h(·), respectively.xis the Euclidean norm of x, and
x=max{k∈Z:k≤x}. LetC0,C1,C2,C3,C4be positive constants, and letβ0be the
true parameter. LetΘ={β:|β–β0| ≤C0}.
2.2 Difference-based M-estimation
Lety˜i=
m
q=0dqyi+q,h˜i(β) =
m
q=0dqh(xTi+qβ),f˜(ti) =
m
q=0dqf(ti+q), ande˜i=
m
q=0dqei+q,
whered0,d1, . . . ,dmsatisfy the conditions m
q=0 dq= 0,
m
q=0
Theny˜i,h˜i(β),˜f(ti), ande˜ican be seen as themth-order differences ofyi,h(xTiβ),f(ti), and
ei, respectively. Hence, applying the differencing procedures, model (1) becomes
˜
yi=h˜i(β) +f˜(ti) +e˜i, 1≤i≤n–m. (3)
From Yatchew [34] we find that the application of differencing procedures in model (1) can remove the nonparametric effect in large samples, so we ignore the presence off˜(·). Thus (3) becomes
˜
yi=h˜i(β) +e˜i 1≤i≤n–m. (4)
Letρbe a convex function. Assume thatρhas a continuous derivativeψand there isa
such thatψ(a) = 0. We can propose the difference-based M-estimator given by minimizing
Q(β) =
n–m
i=1
ρy˜i–h˜i(β) +a
. (5)
Let ad×1 vectorβˆnbe the minimizer of (5) andβˆn∈Θ. Writeh˜i(β) =
m
q=0dq×
h(xT
i+qβ)xi+q,h˜ik(β) =
m
q=0dqh(xTi+qβ)x(i+q)k, with 1≤k≤d,h˜i(β) =
m
q=0dqh(xTi+qβ)× xi+qxTi+q, andh˜i(β)h˜jT(β) =
m
q=0dqh(xTi+qβ)xi+q
m
q=0dqh(xTj+qβ)xTj+q. Then the estimator
satisfies
∂Q(βˆn)
∂β = –
n–m
i=1
ψ(eˆ˜i+a)h˜i(βˆn) = 0 (6)
witheˆ˜i=y˜i–h˜i(βˆn). The convexity ofρguarantees the equivalence of (5) and (6) and the
asymptotic uniqueness of the solution; otherwise, it is unimportant.
We estimate the nonparametric functionf(·) by the wavelet method. The formal defini-tion of the wavelet method is the following.
Suppose that there exist a scaling functionφ(·) in the Schwartz spaceSland a
multires-olution analysis{Vm˜}in the concomitant Hilbert spaceL2(R) with the reproducing kernel
Em˜(t,s) given by
Em˜(t,s) = 2m˜E0
2m˜t, 2m˜s= 2m˜
k∈Z
φ2m˜t–kφ2m˜s–k.
LetAi= [si–1,si] denote intervals that partition [0, 1] withti∈Aifor 1≤i≤n. Then the
estimator of the nonparameterf(t) is given by
ˆ fn(t) =
n
i=1
yi– xTiβˆn Ai
Em˜(t,s)ds. (7)
3 Main results
We now list some conditions used to obtain the main results.
(C1) max1≤i≤nxi=O(1), and the eigenvalues ofn–1
n
i=1xixTi are bounded above and away from zero.
(C2) b,bc–d2> 0, whereb=E{ψ(η)},c=E{η2ψ(η)},d=E{ηψ(η)}withη=e˜
(C3) Eψ(e˜i+a) = 0.
(C4) The functionρis assumed to be convex, not monotone, and possessing bounded derivatives of sufficiently high order in a neighborhood of the pointxT
iβ0. In
particular,ψ(t)should be continuous and bounded in a neighborhood ofxT iβ0.
(C5) h(·)is assumed to possess bounded derivatives of sufficiently high order in a neighborhood of pointxT
iβ0.
(C6) f(·)∈Hα(Sobolev space) for someα> 1/2.
(C7) f(·)is a Lipschitz function of orderγ > 0.
(C8) φ(·)belongs toSl, which is a Schwartz space forl≥α, is a Lipschitz function of order 1, and has a compact support, in addition to| ˆφ(ξ) – 1|=O(ξ)asξ→0, whereφˆdenotes the Fourier transform ofφ.
(C9) si,1≤i≤n, satisfymax1≤i≤n(si–si–1) =O(n–1), and2m˜ =O(n1/3).
Remark1 Condition (C1) is often imposed in M-estimation theory of regression models. Condition (C2) is used by Silvapullé [35] for HD estimation. In this paper, this condition is also necessary for M-estimation. Condition (C3) is used by Wu [36] and Zeng and Hu [37] witha= 0. We require this in order that the expectation of (5) reaches its minimum at the true valueβ0. For Condition (C4), higher-order derivatives are technically
conve-nient (Taylor expansions), but their existence is hardly essential for the results to hold; see Huber [1]. Condition (C5) is quite mild and can be easily satisfied. Conditions (C6)–(C9) are used by Hu et al. [38].
Remark2 The assumption ofψ(a) = 0 and Condition (C4) are serious restrictions, which shows that the M-estimator in our paper is a particular case of the classical M-estimator. However, in our study, these conditions are necessary.
Theorem 3.1 Let{en,n≥1}be a sequence of NSD random variables with Een= 0,and let
for someδ> 0,
sup n≥1E|en|
2+δ<∞. (8)
Suppose that
sup j≥1
i:|i–j|≥u
cov(ei,ej)→0 as u→ ∞. (9)
Set˜ei=
m
q=0dqei+q,where{dq, 1≤q≤m}are defined in(2).Let{ci, 1≤i≤n–m}be an
array of constants satisfyingmax1≤i≤n–m|ci|=O(1),and suppose thatψ(a) = 0and
Condi-tions(C3)and(C4)hold.Then
(n–m)–1/2τ–1
n–m
i=1
ciψ(e˜i+a) D
→N(0, 1), (10)
provided that
τ2= lim n→∞(n–m)
–1
n–m
i=1
c2iVarψ(e˜i+a)
+ 2
n–m
i=1
n–m
j=i+1 cicjCov
ψ(e˜i+a),ψ(e˜j+a)
Theorem 3.2 Let{en,n≥1}be a sequence of NSD random variables with Een= 0
satisfy-ing conditions(8)and(9).Assume that conditions(C1)–(C5)hold.Then
(n–m)–1/2τβ–1E
∂2Q(β0) ∂β∂βT
(βˆn–β0)
D
→N(0,Id), (11)
provided that
τβ2= lim n→∞
1
n–m
n–m
i=1 ˜
hi(β0)h˜iT(β0)Var
ψ(˜ei+a)
+ 2
n–m
i=1
n–m
j=i+1 ˜
hi(β0)h˜jT(β0)Cov
ψ(˜ei+a),ψ(e˜j+a)
is a positive definite matrix,where Idis the identity matrix of order d.
Corollary 3.1 Let h(xTiβ) = xTiβ,and let{en,n≥1}be a sequence of NSD random
vari-ables with Een= 0 satisfying conditions(8)and(9).Assume that Conditions(C1)–(C4)
hold.Then
(n–m)–1/2τβ–1E
∂2Q(β 0) ∂β∂βT
(βˆn–β0)
D
→N(0,Id), (12)
provided that
τβ2= lim n→∞
1
n–m
n–m
i=1 ˜
xix˜Ti Var
ψ(e˜i+a)
+ 2
n–m
i=1
n–m
j=i+1 ˜
xix˜Tj Cov
ψ(˜ei+a),ψ(e˜j+a)
is a positive definite matrix.
Corollary 3.2 Let{en,n≥1}be a sequence of NSD random variables with Een= 0
sat-isfyingCov|i–j|>m¯(ei,ej) = 0 withm¯ <∞.Assume that Condition(C1)–(C5)and(8)hold.
Then
(n–m)–1/2τβ–1E
∂2Q(β0) ∂β∂βT
(βˆn–β0)
D
→N(0,Id),
provided that
τβ2= lim n→∞
1
n–m
n–m
i=1 ˜
hi(β0)h˜iT(β0)Var
ψ(˜ei+a)
+ 2
¯ m
k=1
n–m–k
i=1 ˜
hi+k(β0)h˜iT(β0)Cov
ψ(˜ei+k+a),ψ(e˜i+a)
is a positive definite matrix.
Corollary 3.3 (Zeng and Liu [32]) Letρ(t) =t2,h(xT
iβ) = xTiβ,and let {en,n≥1} be a
sequence of NSD random variables with Een= 0satisfying conditions(8)and(9).Assume
that conditions(C1)–(C2)hold.Then
(n–m)–1/2τβ–1
n–m
i=1 ˜
xix˜Ti(βˆn–β0)
D
→(0,Id),
provided that
τβ2= lim n→∞(n–m)
–1
n–m
i=1 ˜
xix˜Ti Var(˜ei) + 2 n–m
i=1
n–m
j=i+1 ˜
xix˜Tj Cov(e˜i,e˜j)
is a positive definite matrix.
Corollary 3.4 Letρ(t) =t2,h(xT iβ) =ex
T
iβ,and let{en,n≥1}be a sequence of NSD random
variables with Een= 0satisfying conditions(8)and(9).Assume that conditions(C1)–(C2)
hold.Then
(n–m)–12τ–1 β
n–m
i=1
m
q=0 dqe
xT i+qβ0x
i+q
2
(βˆn–β0)
D
→(0,Id),
provided thatτ2
β=limn→∞(n–m)–1Var(ni=1–me˜i
m
q=0dqex
T i+qβ0x
i+q)is a positive definite
matrix.
Theorem 3.3 Under the conditions of Theorem3.2,assume that Conditions(C6)–(C9)
hold.Then
sup
0≤t≤1
fˆn(t) –f(t)=OP
n–γ+O
P(τm˜) +OP
n–1/3M
n
as n→ ∞, (13)
where Mn→ ∞in arbitrary slowly rate,andτm˜ = 2–m˜(α–1/2)if1/2 <α< 3/2,τm˜ =
√ ˜ m2–m˜
ifα= 3/2,andτm˜ = 2–m˜ ifα> 3/2.
Appendix
A.1 Lemmas
In this section, we present the proofs of the main results. We first need some lemmas.
Lemma 1 Under Conditions(C1), (C4),and(C5),suppose that eisatisfies(8).Then
∂2Q(β 0) ∂β∂βT –E
∂2Q(β 0) ∂β∂βT =OP
(n–m)12 (14)
and
supn–3/2Rnl(β˜)=sup
n–3/2 ∂
3
∂β∂βT∂β l
Q(β˜)→0 as n→ ∞, (15)
Proof We have
∂2
∂β∂βTQ(β0) –E
∂2
∂β∂βTQ(β0)
=
n–m
i=1
ψ(e˜i+a)h˜i(β0)h˜iT(β0) +
n–m
i=1
ψ(e˜i+a)h˜i(β0)
–
E
n–m
i=1
ψ(˜ei+a)h˜i(β0)h˜iT(β0)
+E
n–m
i=1
ψ(˜ei+a)h˜i(β0)
=
n–m
i=1
ψ(e˜i+a)h˜i(β0)h˜iT(β0) –E
n–m
i=1
ψ(e˜i+a)h˜i(β0)h˜iT(β0)
+
n–m
i=1
ψ(e˜i+a)h˜i(β0) –
n–m
i=1
Eψ(˜ei+a)h˜i(β0)
:=I1+I2.
From (8) we have
sup i≥1
j:|i–j|≥u
covψ(ei+a),ψ(ej+a)→0 asu→ ∞.
Therefore, for a fixed smallε, there exists a positive integerδ=δεsuch that
sup i≥1
j:|i–j|≥δ
covψ(ei+a),ψ(ej+a)<ε
for 1≤k1,k2,l1,l2≤d, and thus
n–m
i=1
j:|i–j|≥1 ˜
hik1(β0)h˜il1(β0)h˜
jk2(β0)h˜
jl2(β0) ×Eψ(e˜i+a) –Eψ(e˜i+a)
ψ(e˜j+a) –Eψ(e˜j+a)
=
j:1≤|i–j|<δ
n–m
i=1 ˜
hik1(β0)h˜il1(β0)h˜
jk2(β0)h˜
jl2(β0) ×Eψ(e˜i+a) –Eψ(e˜i+a)
ψ(e˜j+a) –Eψ(e˜j+a)
+
n–m
i=1
j:|i–j|≥δ ˜
hik1(β0)h˜il1(β0)h˜
jk2(β0)h˜
jl2(β0) ×Eψ(e˜i+a) –Eψ(e˜i+a)
ψ(e˜j+a) –Eψ(e˜j+a)
≤2δ max
1≤i≤n–m
h˜i(β0)2
× max
1≤i,j≤n–mE
ψ(e˜i+a) –Eψ(e˜i+a)
ψ(e˜j+a) –Eψ(e˜j+a)
n–m
i=1
h˜i(β0)2
+ max
1≤i≤n–mh˜ i(β0)
×
n–m
i=1
i:|i–j|≥δ
Eψ(˜ei+a) –Eψ(e˜i+a)
ψ(˜ej+a) –Eψ(e˜j+a)
≤2δ max
1≤i≤n–m
h˜i(β0)2
× max
1≤i,j≤n–mE
ψ(e˜i+a) –Eψ(e˜i+a)
ψ(e˜j+a) –Eψ(e˜j+a)
n–m
i=1
h˜i(β0) 2
+ max
1≤i≤n–m
h˜i(β0) 4
(n–m)ε.
By Condition (C5) we have that max1≤i≤n{qm=1|dqh(xTi+qβ˜)|}, max1≤i≤n{mq=1|dq ×
h(xT
i+qβ˜)|}, andmax1≤i≤n{
m
q=1|dqh(xTi+qβ˜)|}are bounded by some constantC1. Then
by (C4) it follows that, for same constantM> 0,
(n–m)–2E
n–m
i=1
ψ(e˜i+a)h˜ik(β0)h˜il(β0) –E
n–m
i=1
ψ(˜ei+a)h˜ik (β0)h˜il(β0) 2
= (n–m)–2
n–m
i=1 ˜
hik2(β0)h˜il2(β0)E
ψ(e˜i+a) –Eψ(e˜i+a)
2
+
n–m
i=1
j:|j–i|≥1 ˜
hik1(β0)h˜il1(β0)h˜
jk2(β0)h˜
jl2(β0)
×Eψ(e˜i+a) –Eψ(e˜i+a)
ψ(e˜j+a) –Eψ(e˜j+a)
≤(n–m)–1 max
1≤i≤n–m
h˜i(β0) 2
(n–m)–1
n–m
i=1
h˜i(β0) 2
M
+ 2δ(n–m)–1 max
1≤i≤n–mh˜ i(β0)
2
(n–m)–1
n–m
i=1
h˜i(β0) 2
M
+ (n–m)–1 max
1≤i≤n–m
h˜i(β0) 4
ε
≤(2δ+ 1)(n–m)–1 max
1≤i≤n–m
h˜i(β0) 2
(n–m)–1
n–m
i=1
h˜i(β0) 2
M
+ (n–m)–1 max
1≤i≤n–m
h˜i(β0)4ε
= (2δ+ 1)(n–m)–1 max 1≤i≤n–m
m q=0 dqh
xiT+qβ0
xi+q
2
×(n–m)–1
n–m
i=1 m q=0 dqh
xTi+qβ0
xi+q
2 M
+ (n–m)–1 max
1≤i≤n–m
m q=0 dqh
xTi+qβ0
xi+q
≤(n–m)–1
(2δ+ 1)C21 max
1≤i≤n–m m
q=0
xi+q2(n–m)–1C12
n–m
i=1
m
q=0
xi+q2M
+C41
max
1≤i≤n–m m
q=0 xi+q2
2 ε ,
and from (C1) it follows that
(n–m)–1Var(I1) =O(1). (16)
By the Chebyshev inequality it suffices to verify thatI1=OP((n–m)
1
2). In the same way,
we easily obtain thatI2=OP((n–m)
1
2). Consequently, ∂2
∂β∂βTQ(β0) –E
∂2
∂β∂βTQ(β0) =OP
(n–m)12. (17)
Note that, for 1≤l≤d,
∂3 ∂β∂βT∂β
l
Q(β˜)
=
n–m
i=1 3ψ ˜ yi–
m
q=0
dqh(xi+qβ˜) +a
m
q=0
dqh(xi+qβ˜) m
q=0
dqh(xi+qβ˜)x(i+q)lxi+qxTi+q
–ψ
˜ yi–
m
q=0
dqh(xi+qβ˜) +a
m
q=0
dqh(xi+qβ˜)
3
x(i+q)lxi+qxTi+q
–ψ
˜ yi–
m
q=0
dqh(xi+qβ˜) +a
m
q=0
dqh(xi+qβ˜)x(i+q)lxi+qxTi+q .
By Conditions (C1), (C4), and (C5), for 1≤k,l,s≤dand some constantM> 0, we have
sup n–3/2
n–m
i=1 3ψ ˜ yi–
m
q=0
dqh(xi+qβ˜) +a
×
m
q=0
dqh(xi+qβ˜) m
q=0
dqh(xi+qβ˜)x(i+q)lx(i+q)kx(i+q)s
–ψ
˜ yi–
m
q=0
dqh(xi+qβ˜) +a
m
q=0
dqh(xi+qβ˜)
3
x(i+q)lx(i+q)kx(i+q)s
–ψ
˜ yi–
m
q=0
dqh(xi+qβ˜) +a
m
q=0
dqh(xi+qβ˜)x(i+q)lx(i+q)kx(i+q)s
≤(n–m)–1/2MC21+C13+C1
max
1≤i≤n–m m
q=0
xi+q(n–m)–1 n–m
i=1
m
q=0 xi+q2
→0, n→ ∞.
Lemma 2 If (C1)–(C5)hold,then √
n(βˆn–β0) =Op(1). (18)
Proof We can prove Lemma2by an argument similar to Lemma 4 of Silvapullé [35], so
we omit the details.
Lemma 3(Zhou and You [39]) If Condition(C8)holds,then
(a1) |E0(t,s)| ≤(1+C|t–ks|)k,|Em˜(t,s)| ≤
2m˜C
(1+2m˜|t–s|)k (wherek∈N,andC=C(k)is a constant
depending onkonly); (a2) sup0≤s≤1|Em˜(t,s)|=O(2m˜); (a3) supt01|Em˜(t,s)|ds≤C2;
(a4) 01Em˜(t,s)ds→1,n→ ∞.
A.2 Proof of Theorem3.1
By Condition (8) we have
sup n≥1
Ee2n<∞ and lim x→∞supn≥1Ee
2
nI
|en|>x
= 0,
from which it follows that
C3:=sup
n>m
(n–m)–1
n–m
i=1
m
q=0
Var(dqei+q) <∞,
and for allε> 0,
(n–m)–1
n–m
i=1
m
q=0
E(dqei+q)2I
|dqei+q| ≥
√
n–mε→0 asn→ ∞.
Then we can find a positive number sequence{εn,n≥1}withεn→0 such that
(n–m)–1
n–m
i=1
m
q=0
E(dqei+q)2I
|dqei+q| ≥
√ n–mεn
→0 asn→ ∞.
Now we define the integers:m0= 0 and forj= 0, 1, 2, . . . ,
m2j+1=min
m:m≥m2j, (n–m)–1 m
i=m2j+1
m
q=0
Var(dqei+q) >√εn ,
m2j+2=m2j+1+
1
εn
+m.
Denote
wherel=l(n) is the number of blocks of indicesIj. Then
l√εn≤(n–m)–1 l
j=1
i∈Ij
m
q=0
Var(dqei+q)≤(n–m)–1 n–m
i=1
m
q=0
E(dqei+q)2≤C3, (19)
and hence we havel≤C3/√εn. If the remainder term is not zero, then as the construction
ends, we put all the remainder terms into a block denoted byJl. Hence, by the Lagrange
mean value theorem, 1
√ n–m
n–m
i=1
ciψ(˜ei+a)
=√ 1
n–m
n–m
i=1
ciψ(e˜i+a) –
1
√ n–m
n–m
i=1 ciψ(a)
=√ 1
n–m
n–m
i=1
ciψ(ξi)e˜i, (20)
whereξi=t˜ei+afor somet∈[0, 1].
Moreover, settingai=τ–1ciψ(ξi), we have
1
√ n–m
n–m
i=1
ciψ(˜ei+a)
=√ 1
n–m
n–m
i=1 ai˜ei
=√ 1
n–m
l
j=1
i∈Ij
aie˜i+
1
√ n–m
l
j=1
i∈Jj
aie˜i
:=I+J.
By the argument in the proof of Theorem 4.1 in Zeng and Liu [32] we have
(n–m)–1/2
n–m
i=1 ai˜ei
D
→N(0, 1), (21)
which implies
(n–m)–1/2τ–1
n–m
i=1
ciψ(e˜i+a) D
→N(0, 1). (22)
The proof is completed.
A.3 Proof of Theorem3.2
Now we will use Theorem3.1to prove Theorem3.2. Expanding ∂β∂Q(βˆn) aboutβ0, we
have
∂
∂βQ(βˆn) = ∂
∂βQ(β0) + ∂2
∂β∂βTQ(β0)(βˆn–β0) +
1 2
Rnl(β˜,βˆn,β0)
whereβ˜=sβˆn+ (1 –s)β0for somes∈[0, 1], and
Rnl(β˜,βˆn,β0)
1≤l≤d
=(βˆn–β0)TRnl(β˜)(βˆn–β0), (βˆn–β0)TRn2(β˜)(βˆn–β0), . . . ,
(βˆn–β0)TRnd(β˜)(βˆn–β0)
T
.
From (6) we have
∂2
∂β∂βTQ(β0)(βˆn–β0) = –
∂
∂βQ(β0) –
1 2
Rnl(β˜,βˆn,β0)
1≤l≤d (23)
and, by Lemma1and Lemma2,
(n–m)–12
E ∂ 2
∂βT∂βQ(β0) +OP
(n–m)12(βˆn–β0)
= –(n–m)–12
∂
∂βQ(β0) +
1 2
Rnl(β˜,βˆn,β0)
1≤l≤d
= –(n–m)–12 ∂
∂βQ(β0) +oP(1)
= (n–m)–12
n–m
i=1
ψ(e˜i+a)h˜i(β0) +oP(1).
We now show that 1
√ n–mτβ
n–m
i=1
ψ(˜ei+a)h˜i(β0)
D
→N(0,Id). (24)
Letube a 1×dsuch thatu= 1. By the Cramér–Wold theorem it suffices to verify that 1
√ n–mτ
n–m
i=1
ψ(e˜i+a)uh˜i(β0)
D
→N(0, 1), (25)
whereτ2=lim
n→∞Var((n–m)–1/2ni=1–mψ(e˜i+a)uh˜i(β0)); by the definition ofτβ2,τ2> 0.
By Theorem3.1, (25) follows fromuh˜i(β0) =O(1). The proof is completed.
A.4 Proof of Theorem3.3
By (7) we have
ˆ
fn(t) –f(t) = n
i=1
yi–h
xTiβˆn Ai
Em˜(t,s)ds–f(t)
=
n
i=1
hxTiβ+f(ti) +ei–h
xTiβˆn Ai
Em˜(t,s)ds–f(t)
=
n
i=1
hxTiβ–hxTi βˆn Ai
+
n
i=1 f(ti)
Ai
Em˜(t,s)ds–f(t) + n
i=1 ei
Ai
Em˜(t,s)ds
:=I1+I2+I3.
By the argument in the proof of Theorem 3.2 in Hu [30] we have
I2=O
n–γ+O(τm˜) (26)
and
I3=OP
n–13Mn. (27)
By Lemma3, (C1), and (C5) we assume that
max
1≤i≤n
h(ξi)xTisup t
n
i=1
Ai
Em˜(t,s)ds≤C4,
whereξi=rxiTβ+ (1 –r)xTi βˆn,r∈[0, 1]. It follows that
I3≤sup
t
n
i=1
hxTiβ–hxTi βˆn Ai
Em˜(t,s)ds
≤sup t
n
i=1
h(ξi)xTi (β–βˆn) Ai
Em˜(t,s)ds
≤max
1≤i≤n
h(ξi)xiTsup t
n
i=1
Ai
Em˜(t,s)dsβ–βˆn
≤C4β–βˆn.
By Lemma2we get
I3=Op
n–1/2. (28)
Then Theorem3.2follows from (26), (27), and (28).
Acknowledgements
This authors would like to thank a referee and an Associate Editor for their comments and suggestions. Funding
The research is supported Support by National Natural Science Foundation of China [grant number 71471075]. Availability of data and materials
Not application. Competing interests
The authors declare that they have no competing interests. Authors’ contributions
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. Received: 6 December 2018 Accepted: 4 March 2019
References
1. Huber, P.J.: Robust regression: asymptotics, conjectures and Monte Carlo. Ann. Stat.1(5), 799–821 (1973) 2. Hampel, F.R., Ronchetti, E.M., Rousseeuw, P.J., Stahel, W.A.: Robust Statistics. The Approach Based on Influence
Functions. Wiley, New York (1986)
3. Hu, T.Z.: Negatively superadditive dependence of random variables with applications. Chinese J. Appl. Probab. Statist. 16(2), 133–144 (2000)
4. Shen, Y., Wang, X.J., Yang, W.Z., Hu, S.H.: Almost sure convergence theorem and strong stability for weighted sums of NSD random variables. Acta Math. Sin. Engl. Ser.29(4), 743–756 (2013)
5. Xue, Z., Zhang, L.L., Lei, Y.J., Chen, Z.J.: Complete moment convergence for weighted sums of negatively superadditive dependent random variables. J. Inequal. Appl.2015, Article ID 117 (2015)
6. Wang, X.J., Deng, X., Zheng, L.L., Hu, S.H.: Complete convergence for arrays of rowwise negatively superadditive dependent random variables and its applications. Statistics48(4), 834–850 (2014)
7. Wang, X.J., Shen, A.T., Chen, Z.Y., Hu, S.H.: Complete convergence for weighted sums of NSD random variables and its application in the EV regression model. Test24, 166–184 (2015)
8. Wang, X.J., Wu, Y., Hu, S.H.: Complete moment convergence for double-indexed randomly weighted sums and its applications. Statistics52(3), 503–518 (2018)
9. Meng, B., Wang, D., Wu, Q.: On the strong convergence for weighted sums of negatively superadditive dependent random variables. J. Inequal. Appl.2017, Article ID 269 (2017)
10. Eghbal, N., Amini, M., Bozorgnia, A.: On the Kolmogorov inequalities for quadratic forms of dependent uniformly bounded random variables. Stat. Probab. Lett.81, 1112–1120 (2011)
11. Shen, A.T., Zhang, Y., Volodin, A.: Applications of the Rosenthal-type inequality for negatively superadditive dependent random variables. Metrika78, 295–311 (2015)
12. Shen, A.T., Xue, M.X., Volodin, A.: Complete moment convergence for arrays of rowwise NSD random variables. Stochastics88(4), 606–621 (2016)
13. Boente, G., He, X., Zhou, J.: Robust estimates in generalized partially linear models. Ann. Stat.34, 285–2878 (2016) 14. Cheng, G., Zhou, L., Huang, Z.J.: Efficient semiparametric estimation in generalized partially linear additive models for
longitudinal/clustered data. Bernoulli20(1), 141–163 (2014)
15. He, X., Fung, W., Zhu, Z.: Robust estimation in generalized partial linear models for clustered data. J. Am. Stat. Assoc. 100, 1176–1184 (2005)
16. Graciela, B., Daniela, R.: Robust inference in generalized partially linear models. Comput. Stat. Data Anal.54(12), 2942–2966 (2010)
17. Qin, G., Zhu, Z., Fung, W.K.: Robust estimation of generalized partially linear model for longitudinal data with dropouts. Ann. Inst. Stat. Math.68, 977–1000 (2016)
18. Lin, H., Fu, B., Qin, G., Zhu, Z.: Doubly robust estimation of generalized partial linear models for longitudinal data with dropouts. Biometrics73(4), 1132–1139 (2017)
19. Yu, Y., Ruppert, D.: Penalized spline estimation for partially linear single-index models. J. Am. Stat. Assoc.97, 1042–1054 (2002)
20. Xia, Y., Hardle, W.: Semi-parametric estimation of partially linear single-index models. J. Multivar. Anal.97, 1162–1184 (2006)
21. Wang, J.L., Xue, L.G., Zhu, L.X., Chong, Y.S.: Estimation for a partial-linear single index model. Ann. Stat.38(1), 246–274 (2010)
22. Huang, Z.S.: Statistical inferences for partially linear single-index models with error-prone linear covariates. J. Stat. Plan. Inference141(2), 899–909 (2011)
23. Lian, H., Liang, H., Carroll, R.: Variance function partially linear single-index models. J. R. Stat. Soc. B77(1), 171–194 (2015)
24. Yang, J., Lu, F., Yang, H.: Statistical inference on asymptotic properties of two estimators for the partially linear single-index models. Statistics52(6), 1193–1211 (2018)
25. Tabakan, G., Akdeniz, F.: Difference-based ridge estimator of parameters in partial linear model. Stat. Pap.51, 357–368 (2010)
26. Wang, L., Brown, L.D., Cai, T.T.: A difference based approach to the semiparametric partial linear model. Electron. J. Stat.5, 619–641 (2011)
27. Zhao, H., You, J.: Difference based estimation for partially linear regression models with measurement errors. J. Multivar. Anal.102, 1321–1338 (2011)
28. Duran, E.A., Hädle, W.K., Osipenko, M.: Difference based ridge and Liu type estimators in semiparametric regression models. J. Multivar. Anal.105(1), 164–175 (2012)
29. Wu, J.: Restricted difference-based Liu estimator in partially linear model. J. Comput. Appl. Math.300, 97–102 (2016) 30. Hu, H.C., Yang, Y., Pan, X.: Asymptotic normality of DHD estimators in a partially linear model. Stat. Pap.57(3), 567–587
(2016)
31. Wu, J.: Difference based ridge type estimator of parameters in restricted partially linear model with correlated errors. SpringerPlus5, 178 (2016)
32. Zeng, Z., Liu, X.D.: Asymptotic normality of difference-based estimator in partially linear model with dependent errors. J. Inequal. Appl.2018, Article ID 267 (2018)
33. Huber, P.J.: Robust estimation of a location parameter. Ann. Math. Stat.35, 73–101 (1964) 34. Yatchew, A.: An elementary estimator for the partial linear model. Econ. Lett.5, 135–143 (1997)
35. Silvapullé, M.J.: Asymptotic behavior of robust estimators of regression and scale parameter with fixed carriers. Ann. Stat.13(4), 1490–1497 (1985)
37. Zeng, Z., Hu, H.C.: Weak linear representation of M-estimation in GLMs with dependent errors. Stoch. Dyn.17, 1750034 (2017).https://doi.org/10.1142/S0219493717500344
38. Hu, H.C., Cui, H.J., Li, K.C.: Asymptotic properties of wavelet estimators in partially linear errors-in-variables models with long-memory errors. Acta Math. Appl. Sin. Engl. Ser.34(1), 77–96 (2018)