• No results found

Capturing Cross-Sectional Correlation with Time Series: with an Application to Unit Root Test

N/A
N/A
Protected

Academic year: 2021

Share "Capturing Cross-Sectional Correlation with Time Series: with an Application to Unit Root Test"

Copied!
21
0
0

Loading.... (view fulltext now)

Full text

(1)

Capturing Cross-Sectional Correlation with Time

Series: with an Application to Unit Root Test

Chor-yiu SIN

Wang Yanan Institute of Studies in Economics (WISE)

Xiamen University, Xiamen, Fujian, P.R.China, 361005

Email: [email protected]

January 2, 2007

Abstract

Traditional panel data analysis assumes cross-sectional uncorrelatedness. This is plausible when the cross-sectional units are households. Notably in the study of growth empirics or gravity models, panel data analysis is re-cently applied to cross-country data, in which cross-sectional dependence and unit root are common. Existing asymptotic theory either assumes that (i) T (the number of time-series units) goes to in nity while N (the number of cross-sectional units) is xed; or (ii) the cross-sectional units are well-ordered or well-indexed with "economic distance". In this paper, we assume that N goes to in nity while T is xed (as long as T is greater than q, where q is the number of restrictions under the null), which is more plausible in many panel data sets. On the other hand, no prior knowledge of the ordering or the indexation is assumed. Using the central limit theorem for stationary mixing random variables, we rst show thepN-consistency of an OLS estimator. We then construct a simple robust testing procedure that is insensitive to many possible cross-sectional correlations. The asymptotic critical values of this test can be simulated via Monte Carlo method. A number of Monte Carlo experiments suggest that our test has reasonable sizes in nite samples, even when N andT are as small as 50 and 2 respectively. It has non-trivial power when T is greater than or equal to 10.

Key Words: Correlation-insensitive test; Cross-sectional correlation; Eco-nomic distance; Robust testing; Stationary mixing random elds; Unit root.

JEL Classi cationC12; C21; C23

Acknowledgments: SIN thanks the comments from the participants at the Workshop on Se-quential Analysis, Time Series and Related Topics held in Academia Sinica on December 27-28, 2004; and those at Departmental Seminar of National Taipei University on January 29, 2007. The usual disclaimers apply.

(2)

1

Introduction

Throughout the paper, we consider the following linear regression model:

yit=xitβ+uit, (1.1)

where i= 1, . . . , N and t = 1, . . . , T, T 2, xit is a kx1-vector while both yit and

uit are scalars.

By now there is a huge literature covering cases that the time-series dimension

T goes to infinity. In this paper, we focus on cases that the cross-section dimension

N goes to infinity. Given this assumption, it turns out that our analyses are much easier when we assume that T is fixed. We maintain these two assumptions in the balance of the paper.

One major drawback in making inference on the parameter β is that as far as (asymptotic) distributions are concerned, it is hard to model and estimate the cross-sectional correlations. More precisely, in one or the other estimators, one may need to model and estimate, for t= 1, . . . , T, the following N(N 1)/2 terms:

E[xituitujtxjt], (1.2)

where i < j, and i, j = 1, . . . , N.

In a purely time-series context, the time-series correlations can be modelled using the natural ordering (that is, time) of the series. In a purely cross-section context, Conley (1999) models the cross-sectional correlations using a metric of

economic distance. In virtue of the use of economic distance may be controversial, recently voluminous papers in the literature, in one way or the other, capture the cross-sectional correlations by further assuming that T also goes to infinity. See, for instance, Bai (2003). While the models suggested in many papers are found applicable to many data sets, it is unclear if they perform well when T is rather small.

(3)

In this paper, following the lines in Conley (1999), we first invoke some moment conditions and some mixing conditions to justify the asymptotic normal of our esti-mators. For making statistical inference onβ, instead of using a metric of economic distance, we capture the cross-sectional correlation with the time-series units. In principle, T can be as small as 2. This contrasts tremendously with the existing literature in which T is assumed to go to infinity.

After constructing a generic Wald test with an OLS estimator in Section 2, we show in Section 3 that a unit root test and a cointegration test can be cast as special cases of a Wald test. Some generalization and some extension can be found in Section 4. Unlike the conventional Wald test, our test does not distributed as a

χ2. The critical values need to be simulated by Monte Carlo method. Some values of certain special cases are tabulated in Section 5. We close this paper with some Monte Carlo experiments in Section 6 and conclusions and discussions in Section 7.

2

The OLS Estimator and the Wald Test

yit=xitβ+uit, (2.1)

where i= 1, . . . , N and t = 1, . . . , T, T 2, xit is a kx1-vector while both yit and

uit are scalars.

Assumption (a). N → ∞and T is fixed. 2

In an extension of this paper, we will letT → ∞ at an appropriate rate ofN. For sake of exposition, we first consider the case in which the time period is divided into two parts. The first part (with T1 observations) is for estimating the parameter β (denoted as ˆβ) while the second part (with T2 observations) is for estimating the ”variance-covariance” matrix of ˆβ. Note T = T1 +T2. As one can see in the subsequent discussions, it is possible to have the estimation period and the testing period overlap to each other, though we still require that T 2.

(4)

Assumption (b). For t = 1, . . . , T, N−1/2 N i=1xituit −→LΓWtk, (2.2)

where Γ is a positive definite matrix andWtk is akdimensional standard normal random vector. 2

Though not necessary, at some point we may assume thatWtk’s areindependent

across t.

In a purely cross-sectional context, Conley (1999) shows that Assumption (b) holds in an even more general setting (see the proof of Proposition 2 there). Conley (1999) applies the CLT (central limit theorem) of the stationary mixing random field suggested in Bolthausen (1982).

In an extension to, say a fluctuation test, we will modify Assumption (b) as:

Assumption (b*). For t= 1, . . . , T, N−1/2 [rN] i=1xituit −→LΓWtk(r),∀r∈[0,1].2 Assumption (c). For t = 1, . . . , T, N−1 N i=1xitx it →Ma.s., (2.3)

where M is an kxk- invertible constant matrix. 2

Note Assumption (c) of homogeneity can be relaxed a little at the expense of a more complicated estimation for the ”variance-covariance” matrix.

Given Assumption (c), for N sufficiently large, (T1

s=1Ni=1xisxis)1 exists. We

may consider the usualOLS estimator forβin this context (see, for instance, Hsiao, 2003): ˆ β= ( T1 s=1 N i=1xisx is)1( T1 s=1 N i=1xisyis ). (2.4)

It is not difficult to see that:

(N1 T1 s=1 N i=1xisx is) N( ˆββ) = T1 s=1N 1/2N i=1xisuis −→L Γ T1 s=1W k s.

(5)

Thus we have the following theorem for the limiting distribution of ˆβ.

Theorem 2.1. Suppose Assumptions (a)-(c) hold.

N( ˆββ)−→LM1Γ( 1 T1 T1 s=1W k s). (2.5) 2

The difficult part in (2.5) is the estimation for Γ. In a time series context, there are numerous methods of estimation. See, for instance, de Long and Davidson (2000) and the references therein. Those references essentially assume some kind of mixing conditions in an ordered time series. In a cross-sectional context, Conley (1999) models the spatial correlation with a metric of economic distance; while Bai (2003) (and the reference therein) assumes that the time-series dimension (T in our context) goes to . In this paper, we do not model the economic distance on the one hand, and we allow T to be fixed on the other hand.

First note that given the OLS estimator ˆβ, we may define the residual in a straightforward way:

ˆ

uit =yit−xitβ,ˆ

where t= 1, . . . , T. For t=T1+ 1, . . . , T1+T2, consider the following term:

xituˆit = xit(yit−xitβˆ) = xit(uit+xit βxitβˆ) = xituitxitxit( ˆββ) = xituitxitxit( T1 s=1 N i=1xisx is)1( T1 s=1 N i=1xisuis ).

Therefore, in view of Assumptions (b)-(c),

N−1/2 N i=1xit ˆ uit = N1/2 N i=1xituit (N1 N i=1xitx it)( T1 s=1N 1N i=1xisx is)1(N−1/2 T1 s=1 N i=1xisuis )

(6)

= N1/2 N i=1xituit− 1 T1(N 1/2T1 s=1 N i=1xisuis ) +op(1) = (N1/2 N i=1xituit 1 T1 T1 s=1N 1/2N i=1xisuis ) +op(1) −→L Γ(Wtk− 1 T1 T1 s=1W k s). (2.6)

By (2.5)-(2.6), we are about to construct a Wald test statistic for the null hypothesis

H0 :β =β0. We need an additional assumption: Assumption (d). Tt=T1+1(Wtk− T1 1 T1 s=1Wsk)(Wk t T11 T1 s=1Wsk) is positive definite a.s. 2

Assumption (d) is non-trivial. Consider the simple case that T1 = T2 = 1. If

W1k =W2k a.s., the term (Wtk−T1

1 T1

s=1Wsk) is identically zero a.s. and Assumption

(d) is not satisfied. In fact Assumption (d) tells us that our method does not apply to one single time point. We require that T 2.

Assumption (d) assures that a.s., [Tt=T1+1(N1/2Ni=1xituˆit)(N1/2Ni=1xituˆit)]1 exists for N sufficiently large. Thus we may consider the following Wald test statis-tic: ˆ W =N( ˆββ0)Vˆ1N( ˆββ0), (2.7) where ˆV1 = (N1T1 s=1Ni=1xisxis)[Tt=T1+1(N−1/2 N i=1xituˆit)(N−1/2Ni=1xituˆit)]1 (N1T1 s=1Ni=1xisxis).

Now we are able to state the major theorem in this section:

Theorem 2.2. Suppose Assumptions (a)-(d) hold. ˆ W −→L T1 s=1W k s [ T t=T1+1 (Wtk− 1 T1 T1 s=1W k s)(Wk t 1 T1 T1 s=1W k s )]1 T1 s=1W k s. (2.8) 2

It should be noted despite the fact that aCLT is assumed for N1/2Ni=1xituit (see Assumption (b)), unlike the conventional Wald test, the limiting distribution is not χ2k even when T1 = 1. It is because unlike the usual estimator for the variance-covariance matrix of N( ˆββ), ˆV around (2.7) does not converge in probability

(7)

to a constant matrix. Instead, as with a term in Abadir and Paruolo (1997), it converges in distribution to:

M−1Γ[ 1 T12 T t=T1+1 (Wtk− 1 T1 T1 s=1W k s)(Wk t 1 T1 T1 s=1W k s )]ΓM−1 . (2.9)

As a result, the critical values from the limiting distribution in Theorem 2.2 needs to be simulated. Some critical values of certain special cases will be given in Section 5, after we discuss a general version of OLS and an extension toIV.

For a general null hypothesis H0 : =r0, where R is a qxk- matrix with full row rank of q and r0 is a qx1- vector, qk. It is easy to see from (2.5) that under the null: N(Rβˆr0)−→LRM1Γ( 1 T1 T1 s=1W k s).

RM−Wskis normally distributed with mean zero and varianceRM1ΓΓM1R. Note there exists aqxqpositive semi-definite matrix Λ such that ΛΛ =RM1ΓΓM1R. Abusing the notation, we can write:

N(Rβˆr0)−→L Λ( 1 T1 T1 s=1W q s), (2.10)

where Wsq is a q-dimensional standard normal random vector. Similar to (2.9),

ˆ VR R(N−1 T1 s=1 N i=1xisx is)1[ T t=T1+1 (N1/2 N i=1xit ˆ uit)(N−1/2 N i=1xit ˆ uit)] ·(N1 T1 s=1 N i=1xisx is)1R −→L RM−1Γ[ 1 T12 T t=T1+1 (Wtk− 1 T1 T1 s=1W k s)(Wk t 1 T1 T1 s=1W k s )]ΓM−1 R.

By the arguments similar to those for (2.10),

ˆ VR−→LΛ[ 1 T12 T t=T1+1 (Wtq− 1 T1 T1 s=1W q s)(Wq t 1 T1 T1 s=1W q s )]Λ. (2.11)

All in all, instead of Assumption (d), we impose the following condition which suffices for, a.s., the invertibility of ˆVR.

(8)

Assumption (e). Λ defined around (2.10) is positive definite. Tt=T1+1(Wtq− 1 T1 T1 s=1Wsq)(Wq

t T11 Ts=11 Wsq) is positive definite a.s. 2

Thus, for the general null hypothesisH0 : =r0, we may consider the following Wald test statistic:

ˆ

WR=

N(Rβˆr0)VˆR1N(Rβˆr0). (2.12) The limiting distribution of the test statistic is presented in the next theorem.

Theorem 2.3. Suppose Assumptions (a)-(c) and Assumption (e) hold.

ˆ WR −→L T1 s=1W q s [ T t=T1+1 (Wtq− 1 T1 T1 s=1W q s)(Wq t 1 T1 T1 s=1W q s )]1 T1 s=1W q s. (2.13) 2

3

Unit Root Test and Test for Cointegration

In this section, we investigate a unit root test for the time series {wit}. Following the lines in Fuller (1996), we assume that for each i, {wit} follows an AR(k). In other words, we have a linear regression model:

wit=xitβ+uit, (3.1)

where xit = (wit1, wit1, . . . , witk+1), t= 1, . . . , T and i= 1, . . . , N.

In other words, the Augmented Dickey-Fuller test in this setting is simply testing

β1 = 0 or = 0, where R is a 1xk- vector with the first element equals 1 and all

other elements equal 0.

If Assumption (a) holds, following the lines in Conley (1999), we may assume that some moment conditions and some mixing conditions hold for the cross-section series

{xituit}, t = 1, . . . , T. In other words, Assumptions (b)-(c) hold. In addition, if we

are willing to assume Assumption (e) holds, Theorem 2.3 applies. In fact, although the first part of Assumption (e) (the part about Λ) is hard to check, it may be easy to justify Assumption (d), which implies the second part of Assumption (e). First

(9)

we let FtN =σ{. . . , uNt1, utN}, where uNt = (u1t, . . . , uNt). It is plausible to assume that for all i= 1, . . . , N,

E[uNt |FtN1] = 0. Thus, for all t, s∈ T,t =s,

LimN→∞E[(N−1/2 N i=1xituit )(N1/2 N i=1xisuis )] = 0, (3.2)

which justifies the independence between the Wtk’s and thus Assumption (d). With a similar setting, Levin and Lin (1992, 1993), Quah (1994) and Levin, Lin and Chu (2002) construct similar tests, with the assumption that both N and T go to infinity. While their approaches may not be applicable to cases in which T is rather small, more importantly, all the papers mentioned above assume that the data are identically and independently distributed across i. The approach adopted in this paper dispenses with the assumption of independence across i. Furthermore, it is our conjecture that with more elaborated analyses, the assumption of identical distribution in this paper can be relaxed, following the lines in Im, Pesaran and Shin (2003).

It is interesting to note that unlike the pure time-series analysis, the rate of convergence of this unit root test is the same as that of tests of other parameters

β2, . . . , βk. In this paper we assume thatT is fixed and only N goes to infinity. It is

unclear if we obtain the some results on convergence, when T also goes to infinity. Next we consider a test for cointegration among a (k + 1)x1- vector wit (wit0, wit1, . . . , witk). Following the lines in Phillips and Durlauf (1986), as well as a long series of subsequent papers, we first consider the following linear regression model:

wit0 =xitβ+uit, (3.3)

(10)

Presumably all the elements ofwit are I(1). Based on (3.3), oneform of testing for no cointegration can be cast as H0 :β = 0. As a result, provided that Assump-tions (a)-(d) hold, the limiting distribution of the Wald test defined in (2.7) (with

β0 = 0) can be found in Theorem 2.2. Having said that, as suggested in Phillips and

Durlauf (1986), it is difficult to justify (3.2) and thus likely the Wtk’s in Theorem 2.2 are independent. Our results here are similar to many in the literature. See, for instance, Pedroni (2004). However, as we argue in the discussions on our unit root test, those papers are deficient in two aspects, namely, assuming that T also goes to infinitely; and assuming that the data are independent across i.

4

Generalization and Extension

In Section 2, we consider the case in which the time period is divided into two parts, the first part of which is used to estimate the parameter β (the OLS estimator is denoted as ˆβ), while the second part is used to estimate the ”variance-covariance” matrix of ˆβ. In this section, we first generalize our test to a more flexible case. Then we consider an IV (instrumental variable) estimator.

Define T ≡ {1, . . . , T}, where T 2. Consider two subsets of T, T1 and T2, not necessarily disjoint. The numbers of elements inT1 andT2, denoted as #T1 and #T2 respectively, are non-zero.

Instead of theOLS estimator in (2.4) above, we may consider an alternative one: ˆˆ β = ( s∈T1 N i=1xisx is)1( s∈T1 N i=1xisyis ). (4.1)

We have the following theorem for the limiting distribution ofβˆˆ.

Theorem 4.1. Suppose Assumptions (a)-(c) hold.

N(βˆˆβ)−→LM1Γ( 1 #T1 s∈T1 Wsk). (4.2) 2

(11)

Correspondingly, we define the residual as: ˆˆ

uit =yit−xitβ,ˆˆ

where t= 1, . . . , T. Correspondingly we replace Assumption (e) with the following:

Assumption (e’). Λ defined around (2.10) is positive definite. t∈T2(Wtq

1 #T1 s∈T1Wsq)(W q t #1T1 s∈T1Wq

s ) is positive definite a.s. 2

A Wald test statistic for the null hypothesisH0 : =r0 is defined as: ˆˆ WR= N(Rβˆˆr0)VˆˆR1N(Rβˆˆr0), (4.3) whereVˆˆR =R(N1s∈T1Ni=1xisxis)1[t∈T2(N1/2Ni=1xituˆˆit)(N1/2Ni=1xitˆuˆit)] (N1s∈T1Ni=1xisxis)1R.

The limiting distribution of Wˆˆ is presented in the next theorem.

Theorem 4.2. Suppose Assumptions (a)-(c) and Assumption (e’) hold. ˆˆ W −→L s∈T1 Wsq[ t∈T2 (Wtq− 1 #T1 s∈T1 Wsq)(Wtq 1 #T1 s∈T1 Wsq)]1 s∈T1 Wsq. (4.4) 2

Next we consider a Wald test derived from an IV estimator. Instead of the

OLS estimator in (4.1) above, suppose we have an instrument zit, which is also a

kx1-vector. Define the following IV estimator: ˜ β = ( s∈T1 N i=1zisx is)1( s∈T1 N i=1zisyis ). (4.5)

Correspondingly, we replace Assumptions (b) and (c) with the followings:

Assumption (b’). For t= 1, . . . , T, N−1/2 N i=1zituit −→L ΓWtk, (4.6)

where Γ is a positive definite matrix andWtk is akdimensional standard normal random vector. 2 Assumption (c’). For t= 1, . . . , T, N−1 N i=1zitx it →Ma.s., (4.7)

(12)

where M is an kxk- invertible constant matrix. 2

We have the following theorem for the limiting distribution of ˜β.

Theorem 4.3. Suppose Assumptions (a), and Assumptions (b’)-(c’) hold.

N( ˜ββ)−→LM1Γ( 1 #T1 s∈T1 Wsk). (4.8) 2

Define the residual as:

˜

uit =yit−xitβ,˜

where t = 1, . . . , T. A Wald test statistic for the null hypothesis H0 : = r0 is defined correspondingly as:

˜ WR= N(Rβ˜r0)V˜R1N(Rβ˜r0), (4.9) where ˜VR=R(N1s∈T1iN=1xiszis )1[t∈T2(N1/2Ni=1zitu˜it)(N1/2Ni=1zitu˜it)] (N1s∈T1Ni=1zisxis)1R.

The limiting distribution of ˜WR is presented in the next theorem.

Theorem 4.4. Suppose Assumption (a), Assumptions (b’)-(c’), and Assump-tion (e’) hold.

˜ WR−→L s∈T1 Wsq[ t∈T2 (Wtq− 1 #T1 s∈T1 Wsq)(Wq t #1T 1 s∈T1 Wsq)]1 s∈T1 Wsq. (4.10) 2

It should be noted that the limiting distribution in Theorem 4.4 is the same as that in Theorem 4.2, which is free from the nuisance parameters Γ and M. Nevertheless, it depends on the dependence between the Wtq’s as well as the way we choose T1 and T2. In the next section, assuming that the Wtq’s are independent across t,q = 1,2, we tabulate the critical values of the following two cases: (i) The disjoint case. More precisely, T1 ={1, . . . , T1} and T2 ={T1 + 1, . . . , T1 +T2}; (ii) The fully overlapping case. More precisely, T1 =T2 =T.

(13)

5

Simulating the Critical Values

TABLE 5.1

Quantiles of the Limiting Distribution in (5) or (8), k = 1.

α−th simulated quantiles T rv .800 .900 .950 .980 .990 2 DISJ 2.806 10.502 40.500 267.384 1063.563 OV ER 18.948 79.502 320.144 2118.335 8564.449 2z12 18.948 79.733 322.885 2025.152 8104.427 3 DISJ 9.273 36.517 147.250 947.310 3452.401 OV ER 5.375 12.882 27.866 74.468 151.616 3 2z22 5.335 12.790 27.774 72.767 147.758 4 DISJ 1.775 3.593 7.110 17.652 35.444 OV ER 3.579 7.386 13.491 27.004 44.591 4 3z32 3.577 7.382 13.500 27.494 45.490 5 DISJ 3.225 6.918 14.079 34.709 70.060 OV ER 2.927 5.680 9.597 17.531 26.328 5 4z42 2.938 5.682 9.633 17.550 26.496 6 DISJ 1.639 2.906 4.834 9.117 14.410 OV ER 2.625 4.853 7.914 13.631 19.782 6 5z52 2.614 4.872 7.932 13.588 19.508 7 DISJ 2.423 4.411 7.426 14.067 22.642 OV ER 2.412 4.410 6.974 11.525 15.841 7 6z62 2.419 4.404 6.986 11.525 16.032 8 DISJ 1.627 2.719 4.119 7.006 10.005 OV ER 2.233 3.978 6.058 10.145 13.845 8 7z72 2.288 4.104 6.392 10.272 13.992 9 DISJ 2.156 3.693 5.742 9.869 15.029 OV ER 2.177 3.827 5.812 9.132 12.295 9 8z82 2.195 3.892 5.982 8.858 12.663 10 DISJ 1.615 2.637 3.957 6.100 8.419 OV ER 2.090 3.694 5.617 8.636 11.601 10 9z92 2.125 3.733 5.685 8.842 11.736 20 DISJ 1.614 2.570 3.607 5.020 6.104 OV ER 1.835 3.073 4.518 6.654 8.273 20 19z192 1.856 3.147 4.611 6.786 8.616 30 DISJ 1.600 2.560 3.577 4.964 6.215 OV ER 1.761 2.963 4.232 6.082 7.589 30 29z292 1.778 2.986 4.326 6.270 7.857 50 DISJ 1.602 2.572 3.676 5.066 6.084 OV ER 1.715 2.854 4.108 5.852 7.247 50 49z492 1.722 2.868 4.121 5.902 7.329 100 DISJ 1.627 2.643 3.732 5.208 6.266 OV ER 1.667 2.782 3.981 5.692 6.824 100 99z992 1.681 2.785 3.977 5.648 6.968 121 DISJ 1.643 2.721 3.846 5.249 6.173 OV ER 1.670 2.792 4.023 5.702 7.004 121 120z1202 1.652 2.772 3.953 5.606 6.906 χ2 1 1.642 2.706 3.841 5.412 6.635

(14)

6

Monte Carlo experiments

TABLE 6.1(a)

Rejection Percentage under H0 :β1 = 0, ρ= 0. Size T T est 10% 5% 1% 2 DISJ 10.00 4.65 0.75 OV ER 9.85 4.65 0.70 W HIT E 11.45 6.95 1.75 10 DISJ 10.00 5.05 1.00 OV ER 10.15 4.80 0.90 W HIT E 10.65 4.80 1.25 50 DISJ 10.45 5.15 0.90 OV ER 10.05 4.25 1.05 W HIT E 9.95 5.15 0.75 TABLE 6.1(b)

Rejection Percentage underH0 :β1 = 0, ρ= 0.5. Size T T est 10% 5% 1% 2 DISJ 9.75 5.60 1.35 OV ER 9.65 5.35 1.35 W HIT E 22.35 15.85 5.80 10 DISJ 10.80 5.65 1.05 OV ER 10.40 5.60 1.55 W HIT E 20.65 13.20 4.55 50 DISJ 11.25 5.75 1.45 OV ER 11.30 5.20 1.25 W HIT E 20.80 13.55 4.85

(15)

TABLE 6.1(c)

Rejection Percentage underH0 :β1 = 0, ρ= 0.9. Size T T est 10% 5% 1% 2 DISJ 12.45 6.45 1.10 OV ER 11.85 6.05 1.00 W HIT E 62.05 55.45 44.30 10 DISJ 12.10 7.40 2.25 OV ER 11.35 5.85 1.65 W HIT E 58.00 51.70 39.60 50 DISJ 11.05 6.10 2.10 OV ER 10.75 5.80 1.35 W HIT E 57.30 49.70 36.70 TABLE 6.2(a)

Rejection Percentage under Ha :β1 = 0.1, ρ= 0. Size T T est 10% 5% 1% 2 DISJ 15.10 7.35 1.35 OV ER 14.60 7.15 1.40 W HIT E 29.50 20.00 7.80 10 DISJ 49.25 37.65 14.95 OV ER 67.70 52.80 24.35 W HIT E 73.50 62.65 39.40 50 DISJ 95.70 92.90 83.75 OV ER 99.95 99.90 98.65 W HIT E 99.95 99.95 99.20

(16)

TABLE 6.2(b)

Rejection Percentage under Ha :β1 = 0.5, ρ= 0. Size T T est 10% 5% 1% 2 DISJ 56.05 31.45 5.60 OV ER 57.20 31.15 5.45 W HIT E 99.95 99.65 99.05 10 DISJ 100.00 100.00 99.50 OV ER 100.00 100.00 100.00 W HIT E 100.00 100.00 100.00 50 DISJ 100.00 100.00 100.00 OV ER 100.00 100.00 100.00 W HIT E 100.00 100.00 100.00 TABLE 6.2(c)

Rejection Percentage under Ha :β1 = 0.9, ρ= 0. Size T T est 10% 5% 1% 2 DISJ 80.25 52.60 11.50 OV ER 83.80 53.30 11.20 W HIT E 100.00 100.00 100.00 10 DISJ 100.00 100.00 100.00 OV ER 100.00 100.00 100.00 W HIT E 100.00 100.00 100.00 50 DISJ 100.00 100.00 100.00 OV ER 100.00 100.00 100.00 W HIT E 100.00 100.00 100.00

(17)

7

Conclusions and Discussions

In this paper, we propose a Wald test for the parameters in a linear regression model, in which the correlations between N cross-sectional units (whereN goes to infinity) are hard to model, and are hard to estimate. We need T time-series unit, where T is fixed and in principle it may be as small as 2. This is in contrast to cases in the literature that either (i) the correlations between the cross-sectional units do not exist, see, for instance, Anderson (1978), Anderson and Hsiao (1981), Holz-Eakon, Newey and Rosen (1988), and Moon and Phillips (1999); (ii) the cross-sectional correlations are modelled with the geographicaldistance (see, for instance, Kelejian and Prucha, 1999 and the references in the field of spatial statistics therein) or the

economic distance proposed by Conley (1999); or (iii) T also goes to infinity, see, for instance, Kao (1999), Bai and Ng (2002), and Bai (2003).

In Section 3, we also consider a unit root test and a test for cointegration when we have data in both the time-series dimension and the cross-sectional dimension. Both topics are overwhelming in the field of economics over the past ten years. See, for instance, Levin and Lin (1992, 1993), Quah (1994), Levin, Lin and Chu (2002), Im, Pesaran and Shin (2003), and Pedroni (2004), all of which assume that both N and T go to infinity. As in the aforementioned papers, in Section 3 of this paper, these two tests are found to be special cases of the Wald test in the linear regression model developed in Section 2. On the other hand, we extend our OLS estimation to IV (instrumental variable) estimation in Section 4.

The method proposed in this paper has an interesting analogy with the classical

z−test for the population mean. Consider a special case of Model (1.1), in which

k = 1, xit = 1 and T1 =T2 =T ={1, . . . , T}:

yit =β+uit. (7.1)

(18)

get: vNt= +N1/2 N i=1uit, (7.2)

where vNt N1/2Ni=1yit. Given Assumption (b), for t= 1, . . . , T,

vNt− N β =N1/2 N i=1uit −→L ΓWt1, (7.3)

where Γ2 = LimN→∞E(N1/2Ni=1uit)2. Refer to the Wald test statistic in (2.7). First, we re-write the null hypothesis as H0 :N β =N β0. Second, it is easy to see that: ˆ = 1 T T s=1vNs ¯ vN, ˆ V = 1 T2 T t=1 (vNtv¯N)2.

And if we further assume the Wt1 in (7.3) be identically distributed, by Theorem 4.1 (for the case when k = 1), asN → ∞,

TvN −√N β0) T t=1(vNt−¯vN)2 = T T 1 (¯vN −√N β0) T t=1(vNt−v¯N)2/(T 1) −→L T T 1tT−1, (7.4) where tT1 denotes a random variable which is t distributed with T 1 degrees of freedom.

In spite of the analogy, hypotheses with the linear regression model in (1.1) is much more general than a classical z test. In the further study, we will extend Model (1.1) to cases with more heterogeneity. In particular, we will allow some fixed effects, random effects, random coefficients (see, for instance, Hsiao, 2003), and common factors (see, for instance, Bai, 2003).

In a classical ztest, as T → ∞, the limiting distribution is standard normal. It is interesting to derive the limiting distribution of our Wald test statistic, when apart from N → ∞, #T1 and/or #T2 also do. Furthermore, it has been shown

(19)

that under certain assumptions, the classical z test is optimal in testing for the population mean. It is also interesting to see the optimality results, especially those refer to the choice of T1 and T2.

(20)

REFERENCES

Abadir, K.M., Paruolo, P., 1997. Two Mixed Normal Densities from Cointegration Analysis. Econometrica, 65, 671-680.

Anderson, T.W., 1978. Repeated Measurement on Autoregressive Processes. Jour-nal of the American Statistical Association, 73, 371-378.

Anderson, T.W., Hsiao, C., 1981. Estimation of Dynamic Models with Error Com-ponents. Journal of the American Statistical Association, 76, 598-606.

Bai, J., 2003. Inference on Factor Models of Large Dimension. Econometrica, 71, 135-171.

Bai, J., Ng, S., 2002. Determining the Number of Factors in Approximate Factor Models. Econometrica, 70, 191-221.

Bolthausen, E., 1982. On the Central Limit Theorem for Stationary Mixing Ran-dom Fields. The Annals of Probability, 10, 1047-1050.

Conley, T.G., 1999. GMM Estimation with Cross Sectional Dependence. Journal of Econometrics, 92, 1-45.

Fuller, W.A., 1996. Introduction to Statistical Time Series, 2nd Edition. New York: Wiley.

de Jong, R.M., Davidson, J., 2000. Consistency of Kernel Estimators of Het-eroscedastic and Autocorrelated Covariance Matrices. Econometrica, 68, 407-423.

Holz-Eakon, D., Newey, W., Rosen, H., 1988. Estimating VARs with Panel Data.

Econometrica, 56, 1371-1395.

Hsiao, C., 2003. Analysis of Panel Data, 2nd Edition. New York: Cambridge University Press.

Im, K., Pesaran, H., Shin, Y., 2003. Testing for Unit Roots in Heterogeneous Panels. Journal of Econometrics, 115, 53-74.

Kao, C., 1999. Spurious Regression and Residual-Based Tests for Cointegration in Panel Data when the Cross-Section and Time Series Dimensions are Compa-rable. Journal of Econometrics, 90, 1-44.

(21)

Kelejian, H.H., Prucha, I.R., 1999. A Generalized Moments Estimator for the Autoregressive Parameter in a Spatial Model. International Economic Review, 40, 509-533.

Kotz, S., Balakrishnan, N., Johnson, N.L., 2000. Continuous Multivariate Distri-butions, New York: Wiley.

Levin, A., Lin, C.-f., 1992. Unit Root Tests in Panel Data: Asymptotic and Finite-Sample Properties. UCSD Department of Economics Discussion Paper 92-23, downloadable from http://www.econ.ucsd.edu/papers/dp93.html#92-23. Levin, A., Lin, C.-f., 1993. Unit Root Tests in Panel Data: New Results. UCSD

Department of Economics Discussion Paper 93-56, downloadable from http://www.econ.ucsd.edu/papers/dp93.html#93-56.

Levin, A., Lin, C.-f., Chu, C.-s., 2002. Unit Root Tests in Panel Data: Asymptotic and Finite-Sample Properties. Journal of Econometrics, 108, 1-24.

Newey, W.K., West, K.D., 1987. A Simple, Positive Semi-definite, Heteroskedas-ticty and Autocorrelation Consistent Covariance Matrix. Econometrica, 55, 703-708.

Pedroni, P., 2004. Panel Cointegration: Asymptotic and Finite Sample Proper-ties of Pooled Time Series Tests with an Application to the PPP Hypothesis.

Econometric Theory, 20, 597-625.

Phillips, P.C.B., Durlauf, S.N., 1986. Multiple Time Series Regression with Inte-grated Processes, Review of Economic Studies, 53, 473-495.

Phillips, P.C.B., Moon, H., 1999. Linear Regression Limit Theory for Nonstation-ary Panel Data. Econometrica, 67, 1057-1112.

Quah, D., 1994. Exploiting Cross-Section Variation for Unit Root Inference in Dynamic Data. Economics Letters, 44, 9-19.

White, H., 1980. A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity. Econometrica, 48, 817-838.

References

Related documents

The purpose of the plutocratic cost of living index is to provide a single summary measure of the amount of price change over a well defined domain of definition of commodities that

To that end, the Zone falls outside of the areas eligible to receive state aid subsidy however negotiations are currently in train between the Local Planning Authority,

logrecord.name Name of the logger logrecord.levelno Numeric logging level logrecord.levelname Name of logging level logrecord.pathname Path of source

We demonstrated that a heterogeneous distribution of the axial velocity component in the whole filament cross-section, with multiple bands of flow in opposite fila- ment

Ahuja’s longitudinal study of the patents and formalized collaborative projects among 97 large international companies in Western Europe, the US and Japan reveals that links

There were also a higher number of passes made in Brazilian soccer and a less direct style of play identified which could relate to the lower levels of aerobic power also

The University is very aware of the challenges facing international students in moving away from home to study, so we have a range of services and support to help you in the

3 years in Computer Software development Preferred Education 4 year college degree or equivalent technical study. Role Description  Facilitate the implementation of