• No results found

ASYMTOTIC NORMALITY OF ESTIMATORS IN HETEROSCEDASTIC ERRORS-IN-VARIABLES MODEL FOR NA SAMPLES Ting Wang & Jing-jing Zhang

N/A
N/A
Protected

Academic year: 2022

Share "ASYMTOTIC NORMALITY OF ESTIMATORS IN HETEROSCEDASTIC ERRORS-IN-VARIABLES MODEL FOR NA SAMPLES Ting Wang & Jing-jing Zhang"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

ASYMTOTIC NORMALITY OF ESTIMATORS IN HETEROSCEDASTIC ERRORS-IN-VARIABLES MODEL FOR NA SAMPLES

Ting Wang & Jing-jing Zhang*

College of Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China.

Correspondence: [email protected] ABSTRACT

This article is concerned with the estimating problem of heteroscedastic partially linear errors-in-variables models.

We derive the asymptotic normality for estimators of the slope parameter and the nonparametric component in the case of known error variance with NA(negatively associated) random errors. Also, when the error variance is unknown, the asymptotic normality for the estimators of the slope parameter and the nonparametric component as well as variance function is considered under independent assumptions. Finite sample behavior of the estimators is investigated via simulations too.

𝐊𝐞𝐲𝐰𝐨𝐫𝐝𝐬: Partially linear errors-in-variables model, Negatively associated, Asymptotic normality, Heteroscedastic, Least-squares estimator.

𝑀𝑆𝐶: 62J12 ⋅ 62E20 1. INTRODUCTION

Consider the following heteroscedastic partially linear errors-in-variables (EV) model {𝑦𝑖= 𝜉𝑖𝛽 + 𝑔(𝑡𝑖) + 𝜀𝑖,

𝑥𝑖= 𝜉𝑖+ 𝜇𝑖. (1)

where 𝜀𝑖= 𝜎𝑖𝑒𝑖, 𝜎𝑖2= 𝑓(𝑢𝑖), (𝜉𝑖, 𝑡𝑖, 𝑢𝑖) are nonrandom design points, (𝑡𝑖, 𝑥𝑖, 𝑦𝑖) are observed samples, 𝛽 is an unknown parameter to be estimated, {𝜉𝑖} are the potential variables cannot be observed, {𝑦𝑖} are the response variables, {𝑥𝑖} are observed with measurement errors {𝜇𝑖} and with 𝐸𝜇𝑖= 0, and {𝑒𝑖} are random errors and with 𝐸𝑒𝑖= 0. Assume that there is a function ℎ(⋅) defined on closed interval [0,1] satisfying

𝜉𝑖= ℎ(𝑡𝑖) + 𝑣𝑖. (2)

where {𝑣𝑖} are also unknown design points.

Model (1) and its special cases have been widely studied by many authors. When the {𝜉𝑖} can be observed, 𝜎𝑖2= 𝜎2, and the errors {𝑒𝑖} are independent identically distribution(i.i.d), the model reduces to the homoscedastic partially linear regression model, which was studied by Engle et al (1986)[1]. When 𝑔(𝑡) ≡ 0, 𝜎𝑖2= 𝑓(𝑢𝑖), the model becomes into heteroscedastic partially linear regression model, which was extensively studied by Carroll (1982)[2], Robinson (1987)[3]. In addition, when 𝑔(𝑡) ≠ 0 , and the {𝜉𝑖} can not de directly observed, the model (1) degenerates into partially linear EV model, which can be seen in Cui and Li (1998)[4], Wang (1999)[5], Liang (1999)[6] and so on.

In recent decades, semi-parametric EV models have been widely concenred. Miao, Zhang and Wang(2013)[7] considered the strong consistency and asymptotic normality for the least square estimators in a linear EV regression model; Liu and Chen(2005)[8] discussed the consistency of estimators and derived the equivalence relation of weak or strong consistency for the estimators; Cui(2006)[9] summarized the T regression estimate and EM arithmetic in a linear EV regression model; Many of early results of the study of EV model can be seen in Fuller (1987)[10], Cheng and Van Ness (1999)[11] and Carrol (1995)[12].

In this paper, we consider the estimation problem for model (1) under the errors {𝑒𝑖, 1} being mean zero negatively associated(NA) random variables. A finite family of random variables {𝑋𝑖, 1} is said to be NA random variables if for every pair of disjoint subsets A and B of {1,2,...,n}, we have

𝐶𝑜𝑣(𝑓1(𝑋𝑖, 𝑖 ∈ 𝐴), 𝑓2(𝑋𝑗, 𝑗 ∈ 𝐵))0

whenever 𝑓1 and 𝑓2 are coordinatewise increasing function and such that the covariance exists. An infinite family of random variables is NA if every finite subfamily is NA.

The NA view was introduced by Alam and Saxena (1981)[13], and Joag-Dev and Proschan (1983)[14]

discovered the the character of multivariate distribution of NA sequence and discovered fundamental properties; Liang (2000)[15] discovered complete convergence; NA sequence not only has been applied in the multivariat statistical analysis, but also in the oceans, weather, and other engineering fields, risk analysis and time series analysis just as the same as other positive and negative dependent sequence. However, there are few asymptotic results for the estimators of parametric and nonparametric components in partial linear EV model regressions under NA error’s structure.

The paper is organized as follows. In Section 2, we list some assumptions. The main results are given in

(2)

Section 3. A simulation study is presented in section 4. Some preliminary lemmas are stated in Section 5. Proofs of the main results are provided in Sections 6.

2. ASSUMPTIONS

First, we assume that {𝑡𝑖, ℎ𝑖, 𝑣𝑖, 𝑔𝑖, 𝜀𝑖, 𝜇𝑖, 𝜉𝑖, 1} satisfy model (1), and that 𝑊𝑛𝑖(⋅)(1) are some weight functions defined on 𝐼 and set ℎ̃𝑖= ℎ(𝑡𝑖) − ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)ℎ(𝑡𝑗) , 𝑣̃𝑖= 𝑣𝑖− ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝑣𝑗 , 𝑔̃𝑖= 𝑔(𝑡𝑖) −

𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝑔(𝑡𝑗) , 𝜀̃𝑖= 𝜀𝑖− ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜀𝑗 , 𝜇̃𝑖= 𝜇𝑖− ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗 and 𝜉̃𝑖= 𝜉𝑖− ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜉𝑗. Then, we shall list some conditions, which will be used in the paper.

- Let {𝑒𝑖, 1} be a sequence of NA random variables with mean zero, and let {𝜇𝑖, 1} be a sequence of independent random variables with mean zero, and {𝑒𝑖, 1} is independent with {𝜇𝑖, 1}. Assume that 𝐸𝑒𝑖2= 1, sup𝑖𝐸|𝑒𝑖|𝑝< ∞, for some 𝑝 > 4, sup𝑖𝐸|𝜇𝑖|𝑝< ∞, for some 𝑝 > 4, and the 𝐸𝜇𝑖2= Ξ𝜇2> 0 is known.

- Let both of {𝑒𝑖, 1} and {𝜇𝑖, 1} be sequences of independent random variables with mean zero, 𝐸𝑒𝑖2= 1, 𝐸𝜇𝑖2= Ξ𝜇2> 0 and sup𝑖𝐸𝑒𝑖6+ sup𝑖𝐸𝜇𝑖6< ∞. {𝜇𝑖, 1} is independent of {𝑒𝑗, 1}.

• Let {𝑣𝑖, 1} in condition (1.2) be a sequence satisfying - lim𝑛→∞𝑛−1𝑛𝑖=1𝑣𝑖2= Σ0(0 < Σ0< ∞);

- lim𝑛→∞sup𝑛(√𝑛log𝑛)−1⋅ max1| ∑𝑚𝑖=1𝑣𝑗𝑖| < ∞.

- 0 < 𝑚0min1𝑓(𝑢𝑖)max1𝑓(𝑢𝑖)0< ∞;

- 𝑓(⋅), 𝑔(⋅) and ℎ(⋅) are continuous function and satisfy the first-order Lipschitz condition on 𝐼.

• The probability weight functions 𝑊𝑛𝑗(𝑡𝑖) be weight functions defined on [0,1] and satisfy - max1𝑛𝑖=1𝑊𝑛𝑗(𝑡𝑖) = 𝑂(1);

- max1𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝐼(|𝑡𝑖− 𝑡𝑗| > 𝑛−1/4) = 𝑜(𝑛−1/4);

- max1,𝑗𝑊𝑛𝑗(𝑡𝑖) = 𝑜(𝑛−1/2log−1𝑛), - max1,𝑗𝑊𝑛𝑗(𝑡𝑖) = 𝑂(𝑛−𝑠).

• Let 𝑊̂𝑛𝑖(⋅)(1) be weight functions defined on 𝐼. Conditons A3(i)(ii)(iv) are satisfied replacing 𝑡𝑖 and 𝑊𝑛𝑖 by 𝑢𝑖 and 𝑊̂𝑛𝑖, respectively.

Remark 2.1 Conditions (A0)-(A3) are standard regularity conditions and used commonly in the literature, see Gao et al.(1994)[16] and Chen et al.(1998)[17];

Remark 2.2 Under some mild conditions, the following two weight functions satisfy hypothesis (A3):

𝑊𝑛𝑖(1)(𝑡) =1

𝑠𝑠𝑖

𝑖−1𝐾(𝑡−𝑠

𝑛)𝑑𝑠, 𝑊𝑛𝑖(2)(𝑡) = 𝐾(𝑡−𝑡𝑖

𝑛)[∑𝑛𝑗=1𝐾(𝑡−𝑡𝑖

𝑛)]−1.

where 𝑠𝑖= (𝑡𝑖+ 𝑡𝑖−1)/2, 𝑖 = 1,2, . . . , 𝑛 − 1, 𝑠0= 0, 𝑠𝑛= 1, 𝐾(⋅) is the Parzen-Rosenblatt kernel function, we can see Parzen(1962)[18], and the ℎ𝑛 is a bandwidth parameter.

3. MAIN RESULTS

For model (1), we want to seek the estimator of 𝛽 and 𝑔(⋅). Firstly, when the error are homoscedastic and the 𝜉𝑖 can

(3)

be observed, we can apply the least squares estimation method to estimate the parameter 𝛽. On the hand, we assume the parameter 𝛽 is known, and then to estimate 𝑔(⋅); for each given 𝛽, we have 𝑔(𝑡𝑖) = 𝐸(𝑦𝑖− 𝑥𝑖𝛽),1. Therefore, based on the (𝑥𝑖, 𝑡𝑖, 𝑦𝑖), we can define the estimator of 𝑔(⋅), that is 𝑔𝑛(𝑡, 𝛽) = ∑𝑛𝑖=1𝑊𝑛𝑖(𝑡)(𝑦𝑖− 𝑥𝑖𝛽). Then, based on the model (1), we can also define the LSE of 𝛽 by following formula:

𝑛𝑖=1[𝑦𝑖− 𝑥𝑖𝛽 − 𝑔𝑛(𝑡𝑖, 𝛽)]2− Ξ𝜇2𝛽2= 𝑚𝑖𝑛!

On the other hand, under this condition of partially linear EV model, Liang et al.(1999)[?] improved the LSE on the basis of the usually partially linear model, and employ the estimator of parameter 𝛽, write that

𝛽̂𝐿= [∑𝑛𝑖=1(𝑥̃𝑖2− Ξ𝜇2)]−1𝑛𝑖=1𝑥̃𝑖𝑦̃𝑖. (1)

where 𝑥̃𝑖= 𝑥𝑖− ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝑥𝑗, 𝑦̃𝑖= 𝑦𝑖− ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝑦𝑗.

Secondly, when the errors are heteroscedastic, we consider two different cases according to 𝑓(⋅). If 𝜎𝑖2= 𝑓(𝑢𝑖) are known, then the 𝛽̂𝐿 is medified to be the weighted least-squares estimator (WLSE)

𝛽̂𝑊1 = [∑𝑛𝑖=1𝜎𝑖−2(𝑥̃𝑖2− Ξ𝜇2)]−1𝑛𝑖=1𝜎𝑖−2𝑥̃𝑖𝑦̃𝑖. (2) In fact, the 𝜎𝑖2= 𝑓(𝑢𝑖) are unknown and must be estimated. In the case, suppose that 𝐸𝑒𝑖2= 1, we have 𝐸[𝑦𝑖− 𝜉𝑖𝛽 − 𝑔(𝑡𝑖)]2= 𝑓(𝑢𝑖). Therefore, the estimator of 𝑓(𝑢𝑖) can be defined by

𝑓̂𝑛(𝑢𝑖) = ∑𝑛𝑗=1𝑊̂𝑛𝑗(𝑢𝑖)(𝑦̃𝑗− 𝑥̃𝑗𝛽̂𝐿)2− Ξ𝜇2𝛽̂𝐿2. (3) For convenience, we assume that min1𝑓̂𝑛(𝑢𝑖) > 0. Then we can define a nonparametric estimator of 𝜎𝑖2, 𝜎̂𝑛𝑖2 = 𝑓̂𝑛(𝑢𝑖).

In consequence, when the errors are heteroscedastic and unknown, the WLSE of 𝛽 is

𝛽̂𝑊2 = [∑𝑛𝑖=1𝜎̂𝑛𝑖−2(𝑥̃𝑖2− Ξ𝜇2)]−1𝑛𝑖=1𝜎̂𝑛𝑖−2𝑥̃𝑖𝑦̃𝑖. (4) Meanwhile, using 𝛽̂𝐿, 𝛽̂𝑊1, 𝛽̂𝑊2, we can define the three estimators for 𝑔(⋅):

𝑔̂𝐿(𝑡) = ∑𝑛𝑖=1𝑊𝑛𝑖(𝑡)(𝑦𝑖− 𝑥𝑖𝛽̂𝐿), (5)

𝑔̂𝑊1(𝑡) = ∑𝑛𝑖=1𝑊𝑛𝑖(𝑡)(𝑦𝑖− 𝑥𝑖𝛽̂𝑊1), (6)

𝑔̂𝑊2(𝑡) = ∑𝑛𝑖=1𝑊𝑛𝑖(𝑡)(𝑦𝑖− 𝑥𝑖𝛽̂𝑊2). (7)

In this paper, we provide some notions and a definition that will be used in the process of proof.

𝜂𝑖= 𝜀𝑖− 𝜇𝑖𝛽, 𝑆𝑛2= ∑𝑛𝑖=1𝜉̃𝑖2, 𝑇𝑛2= ∑𝑛𝑖=1𝜎𝑖−2𝜉̃𝑖2,

𝑆1𝑛2 = ∑𝑛𝑖=1(𝑥̃𝑖2− Ξ𝜇2), Σ1𝑛2 = V𝑎𝑟[∑𝑛𝑖=1𝜎𝑖−2(𝜉̃𝑖+ 𝜇𝑖)(𝜀𝑖− 𝜇𝑖𝛽)],

Γ𝑛2(𝑡) = V𝑎𝑟[∑𝑛𝑖=1𝑊𝑛𝑖(𝑡)(𝜀𝑖− 𝜇𝑖𝛽)], Δ2𝑛(𝑢) = ∑𝑛𝑖=1𝑊̂𝑛𝑖2(𝑢)V𝑎𝑟[(𝜀𝑖− 𝜇𝑖𝛽)2]. (8)

Definition 3.1 Let {𝑋𝑡, 𝑡 = 0, ±1, ±2, ⋯ } be a strictly stationary time series. For 𝑛 = 1,2, ⋯, define

𝜌(𝑛) = sup

𝑋∈𝐿2(𝐹−∞0 ),𝑌∈𝐿2(𝐹𝑛)

|𝐶𝑜𝑟𝑟(𝑋, 𝑌)|

where 𝐹𝑖𝑗 denotes the 𝜎 -algebra generated by {𝑋𝑡, 𝑖}, and 𝐿2(𝐹𝑖𝑗) consists of 𝐹𝑖𝑗-measurable random variables with finite second moment.

When 𝑓(⋅) is known, we give tne asymptotic normality for least-sqares estimators and weighted least- squares estimators of 𝛽 and 𝑔(⋅).

Theorem 3.1 Suppose that (A0)(i), (A1), (A2) and (A3) are satisfied. Then we have • If Σ𝑛2, then 𝑆𝑛2(𝛽̂𝐿− 𝛽)/Σ𝑛𝐷 𝑁(0,1);

• If Σ1𝑛2 , then 𝑇𝑛2(𝛽̂𝑊1− 𝛽)/Σ1𝑛𝐷 𝑁(0,1).

Theorem 3.2 Suppose that (A0)(i), (A1), (A2) and (A3) are satisfied. If 𝑛𝛤𝑛2(𝑡) → ∞ and ∑𝑛𝑖=1𝑊𝑛𝑖2(𝑡) = 𝑂(𝛤𝑛2(𝑡)), then we have

• [𝑔̂𝐿(𝑡) − 𝐸𝑔̂𝐿(𝑡)]/Γ𝑛(𝑡) →𝐷 𝑁(0,1);

• [𝑔̂𝑊1(𝑡) − 𝐸𝑔̂𝑊1(𝑡)]/Γ𝑛(𝑡) →𝐷 𝑁(0,1).

Remark 3.1 According to Zhang and Liang(2013) Remark (3.1), we think ∑2𝑛𝐶𝑛, ∑1𝑛𝐶𝑛𝑎𝑛𝑑𝑛𝛤𝑛2(𝑡) → ∞ is reasonable.

(4)

When 𝑓(⋅) is unknown, we give tne asymptotic normality for the estimators of 𝛽, 𝑔(⋅) and 𝑓(⋅) under the {𝑒𝑖, 1} is an independent sequence. And the proof of the Theorem 3.3, 3.4 and 3.5 , we can reference the Zhang and Liang (2013)[19].

Theorem 3.3 Suppose that (A0)(ii), (A1), (A2) and (A4) and (A3)(i)(ii)(iv) for some 1/2 < 𝑠 < 1 are satisfied.

Then 𝑇𝑛2(𝛽̂𝑊2− 𝛽)/𝛴1𝑛𝐷 𝑁(0,1).

Theorem 3.4 Suppose that (A0)(ii), (A1), (A2), (A4) and (A3)(i)(ii)(iv) for some 5/8 < 𝑠 < 1 are satisfied.

Assume that 𝑚𝑎𝑥1|𝑣𝑖| = 𝑂(𝑛1/3). For each 𝑡 ∈ [0,1], if 𝑛 ∑𝑛𝑖=1𝑊𝑛𝑖2(𝑡) → ∞, then we have [𝑔̂𝑊2(𝑡) − 𝐸𝑔̂𝑊2(𝑡)]/𝛤𝑛(𝑡) →𝐷 𝑁(0,1).

Theorem 3.5 Suppose that (A0)(ii), (A1), (A2), (A4) and (A3)(i)(ii)(iv) for some 𝑠 = 1/2 are satisfied. Assume that 𝑠𝑢𝑝𝑖𝐸𝜇𝑖8< ∞ and 𝑖𝑛𝑓𝑖𝑉𝑎𝑟[(𝜀𝑖− 𝜇𝑖𝛽)2] > 0. For each 𝑢 ∈ [0,1], if 𝑛 ∑𝑛𝑖=1𝑊̂𝑛𝑖2(𝑢) → ∞, then we have [𝑓̂𝑛(𝑢) − 𝐸𝑓̂𝑛(𝑢)]/𝛥𝑛(𝑢) →𝐷 𝑁(0,1).

4. SIMULATION STUDY

In this section, we carry out a simulation to study the finite sample performance of the proposed estimators. In particular, We examine how good the asymptotic normality is for the estimators of 𝛽, 𝑔(⋅) by Q-Q plot.

Observations are generated from {𝑦𝑖= 𝜉𝑖𝛽 + 𝑔(𝑡𝑖) + 𝜀𝑖,

𝑥𝑖= 𝜉𝑖+ 𝜇𝑖, 𝑖 = 1,2, ⋯ , 𝑛,

where 𝛽 = 1, 𝑔(𝑡) = sin(2𝜋𝑡), 𝜎𝑖2= 𝑓(𝑢𝑖), 𝑓(𝑢) = [1 + 0.5cos(2𝜋𝑢)]2, 𝑡𝑖= (𝑖 − 0.5)/𝑛 and 𝑢𝑖= (𝑖 − 1)/

𝑛 , 𝜉𝑖= 𝑡𝑖2+ 𝑣𝑖 with 𝑣𝑖= sin(𝑖)/(𝑛1/3) . {𝜇𝑖, 1} is an i.i.d. 𝑁(0,0. 22) sequence. { 𝑒𝑖, 1 } are subjected to multivariate normal distribution with 𝐸(𝑒1, ⋯ , 𝑒𝑛) = (0, ⋯ ,0) , 𝐶𝑜𝑣(𝑒𝑖, 𝑒𝑗) = −4−(𝑗−𝑖)−1𝑓𝑜𝑟𝑖 ≠ 𝑗 and 𝑉𝑎𝑟(𝑒𝑖) = 0. 52𝑓𝑜𝑟1. For the proposed estimators, the weight functions are taken as

𝑊𝑛𝑖(𝑡) = 𝐾((𝑡−𝑡𝑖)/ℎ𝑛)

𝑛𝑗=1𝐾((𝑡−𝑡𝑗)/ℎ𝑛), 𝑊̂𝑛𝑖(𝑢) = 𝐾((𝑢−𝑢𝑖)/𝑏𝑛)

𝑛𝑗=1𝐾((𝑢−𝑢𝑗)/𝑏𝑛). where 𝐾(⋅) is a Gaussian kernel function, ℎ𝑛 and 𝑏𝑛 are two bandwidth sequences.

It is well known that an important issue is the selection of an appropriate bandwidth sequences. This issue has been extensively stuied in the context of nonparametric regression. One of bandwidth selection rules is the delete- one cross-validation rule. It is noted that our estimators may involve two bandwidths. Hence, it is somewhat complicated to select appropriate bandwidths for our estimatos. we state the procedure in the following three steps:

• Select ℎ𝑛 by minimizing 𝐶𝑉1(ℎ𝑛) =1

𝑛𝑛𝑖=1(𝑦𝑖− 𝑥𝑖𝛽̂𝐿,−𝑖− 𝑔̂𝐿,−𝑖(𝑡𝑖))2 where 𝛽̂𝐿,−𝑖 and 𝑔̂𝐿,−𝑖(𝑡𝑖) are "Leave one out" versions of 𝛽̂𝐿 and 𝑔̂𝐿(𝑡𝑖).

• Select ℎ′𝑛 by minimizing 𝐶𝑉2(ℎ′𝑛) =1

𝑛𝑛𝑖=1(𝑦𝑖− 𝑥𝑖𝛽̂𝑊1,−𝑖− 𝑔̂𝑊1,−𝑖(𝑡𝑖))2 where 𝛽̂𝑊1,−𝑖 and 𝑔̂𝑊1,−𝑖(𝑡𝑖) are "Leave one out" versions of 𝛽̂𝑊1 and 𝑔̂𝑊1(𝑡𝑖).

• Select 𝑏𝑛 by minimizing 𝐶𝑉3(𝑏𝑛) =1

𝑛𝑛𝑖=1(𝑦𝑖− 𝑥𝑖𝛽̂𝑊2,−𝑖− 𝑔̂𝑊2,−𝑖(𝑡𝑖))2 where 𝛽̂𝑊2,−𝑖 and 𝑔̂𝑊2,−𝑖(𝑡𝑖) are "Leave one out" versions of 𝛽̂𝑊2 and 𝑔̂𝑊2(𝑡𝑖).

We found by calculation the corresponding optimal bandwidths ℎ1= 0.38 and ℎ2= 0.14.

We give the Q-Q plot for the estimator of 𝛽 and 𝑔(⋅) under the condition that 𝑓(⋅) is known. In Figure 1, we give the Q-Q plot for 𝛽̂𝐿 and 𝛽̂𝑊1 with 𝑛 = 100,300 and 500, respectively. In Figure 2, we provide the Q-Q plot for 𝑔̂𝐿(⋅) and 𝑔̂𝑊1(⋅) with 𝑛 = 100,300 and 500, respectively.

From Figure 1-2, we can see that:

• The asymptotic normality of 𝛽̂𝐿 or 𝛽̂𝑊1 is obvious, so does the asymptotic normality of 𝑔̂(⋅);

• The normality becomes more obvious with increasing sample size 𝑛.

(5)

Figure 1: The Q-Q plots for 𝛽̂𝐿 and 𝛽̂𝑊1 with N=500, n=100,300 and 500, respectively.

Figure 2: The Q-Q plots for 𝑔𝐿(⋅) and 𝑔𝑊1(⋅) with N=500, n=100,300 and 500, respectively.

(6)

5. PRELIMINARY LEMMAS

In the sequel, let 𝑐, 𝑐1, ⋯ and 𝐶, 𝐶1, ⋯ are some finite positive constants, whose values are unimportant and may change. 𝑎𝑛= 𝑂(𝑏𝑛) means |𝑎𝑛||𝑏𝑛|, while 𝑎𝑛= 𝑜(𝑏𝑛) means 𝑎𝑛/𝑏𝑛→ 0. 𝑎+= max(0, 𝑎), 𝑎= max(0, −𝑎).

And let {𝑒𝑖, 1} be a sequence of zero mean stationary NA random errors. Now, we introduce several lemmas, which will be used in the proof of the main results.

Lemma 5.1 (Baek and Liang (2006)and Baek(2006), Lemma 3.1) Let 𝛼 > 2. Assume that {𝑎𝑛𝑖, 1, 𝑛1} is a triangular array of real numbers with 𝑚𝑎𝑥1|𝑎𝑛𝑖| = 𝑂(𝑛−1/2) and ∑𝑛𝑖=1𝑎𝑛𝑖2 = 𝑜(𝑛−2/𝛼(𝑙𝑜𝑔𝑛)−1). If 𝑠𝑢𝑝𝑖𝐸|𝑒𝑖|𝑝<

∞ for some 𝑝 > 2𝛼/(𝛼 − 1). Then

𝑛𝑖=1𝑎𝑛𝑖𝑒𝑖= 𝑜(𝑛−1/𝛼)𝑎. 𝑠.

Remark 5.1 In Lemma 5.1, it is quite clear that 𝑝 > 2 as 𝛼 → ∞ and ∑𝑛𝑛𝑖𝑎𝑛𝑖𝑒𝑖= 𝑜(1) a.s.; and 𝑝 > 4 when 𝛼 > 4 and ∑𝑛𝑛𝑖𝑎𝑛𝑖𝑒𝑖= 𝑜(𝑛−1/4) a.s. In addition, if all of the "o" is changed into "O", then the conclusion is also right.

Lemma 5.2 (Hardle et al. (2000)(2000), Lemma A.3) Let 𝑉1, ⋯ , 𝑉𝑛 be independent random variables with 𝐸𝑉𝑖= 0, finite variances and 𝑠𝑢𝑝1𝐸|𝑉𝑗|𝑟< ∞(𝑟 > 2). Assume that {𝑎𝑘𝑖, 𝑘, 𝑖 = 1, ⋯ , 𝑛} is a sequence of real numbers such that 𝑠𝑢𝑝1,𝑘|𝑎𝑘𝑖| = 𝑂(𝑛−𝑝1) for some 0 < 𝑝1< 1 and ∑𝑛𝑗=1𝑎𝑗𝑖= 𝑂(𝑛𝑝2) for 𝑝2𝑚𝑎𝑥(0,2/𝑟 − 𝑝1).

Then

max1 | ∑𝑛𝑘=1𝑎𝑘𝑖𝑉𝑘| = 𝑂(𝑛−𝑠log𝑛)𝑎. 𝑠. 𝑓𝑜𝑟 𝑠 = (𝑝1− 𝑝2)/2.

Lemma 5.3 (Liu and Gan(2003)and Gan(2006)) Assume 𝑎𝑛 is a array of positive real numbers, and

𝑛=1𝜎𝑛2/𝑎𝑛2< ∞, where 𝜎𝑛2= 𝑉𝑎𝑟(𝑒𝑛). If 0 < 𝑎𝑛↑ ∞. Then

𝑛𝑖=1 𝑒𝑖

𝑎𝑛= 𝑜(1)𝑎. 𝑠.

Lemma 5.4 (Han-Ying Liang, Volker Mammitzsch and Josef Steinebach (2006)and Volker(2006), Lemma 4.4(ii)) Let {𝑎𝑛𝑖, 1, 𝑛1} be an array of real numbers and set 𝛥𝑛2 = 𝑉𝑎𝑟(∑𝑛𝑖=1𝑎𝑛𝑖𝑒𝑖). Assume that

𝑗:|𝑘−𝑗||𝑐𝑜𝑣(𝑒𝑘, 𝑒𝑗)| → 0𝑎𝑠𝑢 → ∞ uniformly for 𝑘1, and 𝑚𝑎𝑥1|𝑎𝑛𝑖| = 𝑜(𝛥𝑛), ∑𝑛𝑖=1𝑎𝑛𝑖2 = 𝑂(𝛥𝑛2). If

𝑛𝑖=1|𝑎𝑛𝑖| = 𝑂(1), Then

𝑛𝑖=1𝑎𝑛𝑖𝑒𝑖

Δ𝑛𝐷 𝑁(0,1).

The proof of the Lemmas 5.5 and 5.6, we can reference the Zhang and Liang (2011)[24] and Zhang and Liang (2013)[19].

Lemma 5.5

• Assumptions (A1), (A2) and (A3), one can imply that 𝑛−1𝑛𝑖=1𝜉̃𝑖2→ Σ0, max1|𝜉̃𝑖| = 𝑜(𝑛−1/2) and 𝑆𝑛−2𝑛𝑖=1|𝜉̃𝑖|;

• Using (A1), (A2) and (A3), imply that 𝐶1−1𝑛𝑖=1𝜎𝑖−2𝜉̃𝑖22 and 𝑇𝑛−2𝑛𝑖=1|𝜎𝑖−2𝜉̃𝑖|;

• Let 𝐴̃𝑖= 𝐴(𝑡𝑖) − ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝐴(𝑡𝑗), where 𝐴(⋅) = 𝑓(⋅), 𝑔(⋅) or ℎ(⋅). Then (A2)(ii) and (A3)(ii) imply that max1|𝐴̃𝑖| = 𝑜(𝑛−1/4).

Lemma 5.6 Under the condition of Lemma 5.5 and (A0), (A3), we have 𝑆1𝑛2 → 𝑆𝑛2𝑎. 𝑠.

6. PROOF OF MAIN RESULTS

In the sequel, we use the Abel Inequality (Härdle et al. (2000)[21], page 183). Let 𝐴1, 𝐴2, ⋯ , 𝐴𝑛; 𝐵1, 𝐵2, ⋯ , 𝐵𝑛(𝐵12𝑛0) to be two sequence of real numbers, and 𝑆𝑘 = ∑𝑘𝑖=1𝐴𝑖, 𝑀1= min1𝑆𝑘, 𝑀2= max1𝑆𝑘. Then, 𝐵1𝑀1𝑛𝑘=1𝐴𝑘𝐵𝑘1𝑀2. Let 𝐸𝑖, 𝐹𝑖(1) to be arbitrary real numbers and (𝑗1, 𝑗2, ⋯ , 𝑗𝑛) to be a permutation of (1, ⋯ , 𝑛) such that 𝐹𝑗1𝑗2𝑗𝑛. Then from the above equation, we have

| ∑𝑛𝑖=1𝐸𝑖𝐹𝑖| = | ∑𝑛𝑖=1𝐸𝑗𝑖𝐹𝑗𝑖|| ∑𝑛𝑖=1𝐸𝑗𝑖(𝐹𝑗𝑖− 𝐹𝑗𝑛)| + | ∑𝑛𝑖=1𝐸𝑗𝑖𝐹𝑗𝑛| 𝐶max1 |𝐹𝑖|max

1 | ∑𝑚𝑖=1𝐸𝑗𝑖|. (1)

(7)

Proof of Theorem 3.1. We prove only (a), as the proof of (b) is analogous. From (1) and (??), write that 𝛽̂𝐿− 𝛽 = 𝑆1𝑛−2[∑𝑛𝑖=1(𝜉̃𝑖+ 𝜇̃𝑖)(𝑦̃𝑖− 𝜉̃𝑖𝛽 − 𝜇̃𝑖𝛽) + 𝑛Ξ𝜇2𝛽]

= 𝑆1𝑛−2{∑𝑛𝑖=1[(𝜉̃𝑖+ 𝜇̃𝑖)(𝜀̃𝑖− 𝜇̃𝑖𝛽) + Ξ𝜇2𝛽] + ∑𝑛𝑖=1𝜉̃𝑖𝑔̃𝑖+ ∑𝑛𝑖=1𝜇̃𝑖𝑔̃𝑖}

= 𝑆1𝑛−2{∑𝑛𝑖=1[(𝜉̃𝑖+ 𝜇𝑖)(𝜀𝑖− 𝜇𝑖𝛽) + Ξ𝜇2𝛽] + ∑𝑛𝑖=1𝜉̃𝑖𝑔̃𝑖+ ∑𝑛𝑖=1𝜇̃𝑖𝑔̃𝑖

+ ∑𝑛𝑖=1𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜉̃𝑖𝜇𝑗𝛽 − ∑𝑛𝑖=1𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜉̃𝑖𝜀𝑗− ∑𝑛𝑖=1𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜀𝑖𝜇𝑗

− ∑𝑛𝑖=1𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑖𝜀𝑗+ 2 ∑𝑛𝑖=1𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑖𝜇𝑗𝛽

+ ∑𝑛𝑖=1𝑛𝑗=1𝑛𝑘=1𝑊𝑛𝑗(𝑡𝑖)𝑊𝑛𝑘(𝑡𝑖)𝜇𝑗𝜀𝑘− ∑𝑛𝑖=1𝑛𝑗=1𝑛𝑘=1𝑊𝑛𝑗(𝑡𝑖)𝑊𝑛𝑘(𝑡𝑖)𝜇𝑗𝜇𝑘𝛽}

: = 𝑆1𝑛−210𝑙=1𝐴𝑙𝑛. (2)

Thus. Using Lemma 5.6, in order to prove 𝑆𝑛2(𝛽̂𝐿− 𝛽)/Σ𝑛𝐷 𝑁(0,1), we verify that

𝐴1𝑛

Σ𝑛𝐷 𝑁(0,1)𝐴𝑘𝑛

Σ𝑛𝑃 0𝑓𝑜𝑟𝑘 = 2,3,4,5,8,10.𝐴𝑘𝑛

Σ𝑛𝑃 0𝑓𝑜𝑟𝑘 = 6,7,9.

Step 1. we prove that 𝐴1𝑛𝑛𝐷 𝑁(0,1).

Set 𝜔𝑖= (𝜉̃𝑖+ 𝜇𝑖)(𝜀𝑖− 𝜇𝑖𝛽) + Ξ𝜇2𝛽 and 𝑍𝑛𝑖= 𝜔𝑖𝑛. According to Zhang and Liang (2013)[19] we have Σ𝑛2𝐶𝑛. Using (A0), Lemma 5.5, Σ𝑛2𝐶𝑛. We deduce that 𝐸𝑍𝑛𝑖= 0, V𝑎𝑟(∑𝑛𝑖=1𝑍𝑛𝑖) = 1 and 𝐸|𝑍𝑛𝑖|2+𝛿< ∞. Owing to {𝜀𝑖} are sequence of zero mean stationary NA random variables, {𝑒𝑖𝜎𝑖− 𝜇𝑖𝛽} are also sequence of zero mean stationary NA random variables, {𝜉̃𝑖+ 𝜇𝑖} are sequence of i.i.d. random variables. Using Definition (1), we know (𝜉̃𝑖+ 𝜇𝑖)(𝜀𝑖− 𝜇𝑖𝛽) are sequence of 𝜌-mixing random variables, and the mixing coefficients 𝜌(𝑛) = 0. In this situation, we can know 𝜌-mixing is also a squence of strong mixing from Fan and Yao (2003)[25], and we have 0𝛼(𝑛)𝜌(𝑛)/4 = 0. Therefore, (𝜉̃𝑖+ 𝜇𝑖)(𝜀𝑖− 𝜇𝑖𝛽) are sequences of strong mixing random variables with the mixing coefficients 𝛼(𝑛) = 0. Thus. By the proof of Theorem 3.1 of Zhang and Liang (2013)[19], we accept the conclusion as correct.

Step 2. We prove that 𝐴𝑘𝑛𝑛→ 0 for 𝑘 = 2,3,4,5,8,10.

From (A0)(i), (A3) and Lemma 5.2, we can verity that

𝑛𝑖=1(𝜁𝑖− 𝐸𝜁𝑖) = 𝑂(𝑛12𝑙𝑜𝑔𝑛)𝑎. 𝑠. max

1 | ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗| = 𝑂(𝑛14𝑙𝑜𝑔𝑛)𝑎. 𝑠. (3) where 𝜁𝑖= |𝜇𝑖|, 𝜇𝑖2𝑜𝑟𝜇𝑖.

Since the {𝜀𝑖, 𝑖 = 1,2, . . . , 𝑛} are sequence of zero mean stationary NA random errors, the {𝜀𝑖+, 𝑖 = 1,2, . . . 𝑛} and {𝜀𝑖, 𝑖 = 1,2, . . . 𝑛} are all NA sequence. From Lemma 5.3, one can get 1/𝑛 ∑𝑛𝑖=1𝜀𝑖+= 𝑜(1)𝑎. 𝑠. ,1/

𝑛 ∑𝑛𝑖=1𝜀𝑖= 𝑜(1)𝑎. 𝑠. And |𝜀𝑖| = 𝜀𝑖++ 𝜀𝑖, we have

1

𝑛𝑛𝑖=1|𝜀𝑖| = 𝑜(1)𝑎. 𝑠. (4)

Hence, by applying (A0)(i) and (A3), Lemma 5.1 and 𝑎𝑛= 𝑊𝑛𝑗(𝑡𝑖), 𝛼 = 4, one can obtain that

max1 | ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜀𝑗| = 𝑜(𝑛14)𝑎. 𝑠. (5)

So. From (A0)(i), (A1), (A2), (A3), Lemma 5.5, (??), (3), (5) we deduce that

|𝐴2𝑛

Σ𝑛 | 𝐶

√𝑛| ∑𝑛𝑖=1𝜉̃𝑖𝑔̃𝑖| 𝐶

√𝑛{| ∑𝑛𝑖=1ℎ̃𝑖𝑔̃𝑖| + | ∑𝑛𝑖=1𝑣𝑖𝑔̃𝑖| + | ∑𝑛𝑖=1[∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝑣𝑗]𝑔̃𝑖|}

𝐶

√𝑛[𝑛 ⋅ max

1 |ℎ̃𝑖| ⋅ max

1 |𝑔̃𝑖| + max

1 |𝑔̃𝑖| ⋅ max

1 | ∑𝑛𝑖=1𝑣𝑘𝑖| +max1𝑛𝑖=1𝑊𝑛𝑗(𝑡𝑖) ⋅ max

1 |𝑔̃𝑖| ⋅ max

1 | ∑𝑛𝑗=1𝑣𝑘𝑗|]

= 𝑜(1) + 𝑜(𝑛−1/4log𝑛) = 𝑜(1).

𝐸(𝐴3𝑛

Σ𝑛)2 𝐶

𝑛{𝐸(∑𝑛𝑖=1𝑔̃𝑖𝜇𝑖)2+ 𝐸[∑𝑛𝑖=1𝑔̃𝑖𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗]2}

=𝐶

𝑛{∑𝑛𝑖=1𝑔̃𝑖2+ ∑𝑛𝑗=1[∑𝑛𝑖=1𝑊𝑛𝑗(𝑡𝑖)𝑔̃𝑖]2} = 𝑜(𝑛−1/2)

𝐴4𝑛= ∑𝑛𝑖=1ℎ̃𝑖𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗𝛽 + ∑𝑛𝑖=1𝑣𝑖𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗𝛽 −

𝑛𝑖=1𝑛𝑠=1𝑊𝑛𝑠(𝑡𝑖)𝑣𝑠𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗𝛽 : = 𝐷1𝑛+ 𝐷2𝑛+ 𝐷3𝑛. 𝐸(𝐷1𝑛

Σ𝑛)2 𝐶

𝑛𝑛𝑗=1[∑𝑛𝑖=1𝑊𝑛𝑗(𝑡𝑖)ℎ̃𝑖]2⋅ max

1 |ℎ̃𝑖|2⋅ max

1 | ∑𝑛𝑖=1𝑊𝑛𝑗(𝑡𝑖)|2= 𝑜(𝑛−1/2).

𝐸(𝐷2𝑛

Σ𝑛)2 𝐶

𝑛𝑛𝑗=1[∑𝑛𝑖=1𝑊𝑛𝑗(𝑡𝑖)𝑣𝑖]2⋅ max

1 | ∑𝑚𝑖=1𝑣𝑘𝑖|2⋅ max

1,𝑗 𝑊𝑛𝑗2(𝑡𝑖) = 𝑜(1).

|𝐷3𝑛| Σ𝑛

𝐶

√𝑛max

1 | ∑𝑚𝑠=1𝑣𝑘𝑠| ⋅ max

1𝑛𝑖=1𝑊𝑛𝑠(𝑡𝑖) ⋅ max

1 | ∑𝑛𝑗=1𝑊𝑛𝑗(𝑡𝑖)𝜇𝑗| = 𝑜𝑝(1).

References

Related documents

In addition, our estimates of the change in the effective tax rate (which hold the interest rate constant) were consistent with Administration estimates. This work could be extended

Finally, in comparison with neat PVA nanofibers presented in the literature, the resulting PVA/silica nanofiber mats provide considerable superiority, such as improved thermal

Bacterial colonization of disposable soft contact lenses is greater during corneal infiltrative events than during asymptomatic extended lens wear.. Keay L, Edwards K, Naduvilath

Fink’s model of an integrated course design ( Fink, 2003a, 2007 ) proposes more exciting and challenging learning goals. Pertaining to the six categories of learning in

coordinators have used a number of questionnaires as a substitute for the TAP3 which has been undergoing several phases of development in Year One. Development of a lengthy TAP3

Borrowing from Norton’s (Darvin and Norton 2015, p.48) explanation of the concept of Bourdieu’s (1986) “sens pratique” that she uses to provide a guide for learners in

Figure 10: Total Early-Stage Entrepreneurial Activity and Established Business Ownership in the United States, Percentage of Adults in Each Ethnic Group, GEM 2015.. 12% 14% 10% 9%