Estimation of Higher-Order Spatial
Autoregressive Panel Data Error
Component Models
H
ARALD
B
ADINGER
P
ETER
E
GGER
CES
IFO
W
ORKING
P
APER
N
O
.
2556
C
ATEGORY10:
E
MPIRICAL ANDT
HEORETICAL METHODSF
EBRUARY2009
An electronic version of the paper may be downloaded β’ from the SSRN website: www.SSRN.com β’ from the RePEc website: www.RePEc.org β’ from the CESifo website: Twww.CESifo-group.org/wpT
CESifo Working Paper No. 2556
Estimation of Higher-Order Spatial
Autoregressive Panel Data Error
Component Models
Abstract
This paper develops an estimator for higher-order spatial autoregressive panel data error component models with spatial autoregressive disturbances, SARAR(R,S). We derive the moment conditions and optimal weighting matrix without distributional assumptions for a generalized moments (GM) estimation procedure of the spatial autoregressive parameters of the disturbance process and define a generalized two-stages least squares estimator for the regression parameters of the model. We prove consistency of the proposed estimators, derive their joint asymptotic distribution, and provide Monte Carlo evidence on their small sample performance.
JEL Code: C13, C21, C23.
Keywords: higher-order spatial dependence, generalized moments estimation, two-stages least squares, asymptotic statistics.
Harald Badinger Department of Economics Vienna University of Economics and
Business Administration Althanstrasse 39 - 45 1090 Vienna Austria [email protected] Peter Egger
Ifo Institute for Economic Research and Center for Economic Studies at
the University of Munich Poschingerstrasse 5
81679 Munich Germany [email protected]
I. Introduction
This paper considers the estimation of panel data models with higher-order spatially autocorrelated error components and spatially autocorrelated dependent variables. Spatial interactions in data may originate from various sources such as strategic interaction between jurisdictions (to attract firms or other mobile agents) and firms (in their price, quantity, or quality setting) or general equilibrium effects which disseminate with spatial decay due to their transmission through trade flows, migration, or input-output relationships.1 Data sets used in empirical studies often share three features: first, they are available in the form of panel data, with a large cross-sectional and a small time series dimension; second, spatial interactions of various kinds co-exist β such as geography-related, trade-related, migration-related interactions β or the decay function of a single spatial interaction is unknown; third, it is unclear whether spatial interactions are local β and affect only immediate neighbors β or global β and affect second third and other neighbors with repercussions. The estimator proposed here addresses the mentioned three features in a unified framework. It allows for panel data with a fixed but arbitrary number of channels or decay segments of spatial interaction in both the error components and the dependent variable, referred to as SARAR(R,S).
In developing the estimator, we build on the SARAR(1,1) generalized moments (GM) framework in Kelejian and Prucha (1999) for a single cross-section and the SARAR(0,1) model in Kapoor, Kelejian, and Prucha (2007) for panel data error components models. Obvious advantages of the GM framework is that it does not rely on distributional assumptions and that it can be applied to large data-sets without imposing any restrictions on the matrices of spatial interdependence. We derive GM estimators for the spatial regressive parameters of the disturbance process based on alternative weighting schemes for the moments. We then define a feasible generalized two-stages least squares (FGTSLS) estimator for the modelβs regression parameters. We determine asymptotic properties of the estimators for the case where the time dimension of the panel and the number of spatial interactions is fixed while the number of cross-sectional units approaches infinity. In particular, we prove that the proposed GM and FGTSLS procedures obtain consistent estimators of the model parameters and we derive their joint asymptotic distribution.
1
See Cliff and Ord (1973, 1981), Anselin (1988), and Cressie (1993) for classic references about spatial econometric models in general. Recent theoretical contributions of spatial panel data models include Baltagi, Song, and Koh (2003), Baltagi and Li (2004), Baltagi, Song, Jung, and Koh (2007), Kapoor, Kelejian, and Prucha (2007), Korniotis (2008), Baltagi, Egger, and Pfaffermayr (2008), and Lee and Yu (2008). Recent applications of spatial panel data models include Druska and Horrace (2004), Arbia, Basile, and Piras (2005), Egger, Pfaffermayr, and Winner (2005), Baltagi, Egger, and Pfaffermayr (2007), and Badinger and Egger (2008a).
The remainder of the paper is organized as follows. Section II introduces the basic model specification and some notation. Section III proposes GM estimators of the parameters of spatial dependence in the error components based on alternative weighting schemes of the moments. Section IV derives a two-stages least-squares routine to estimate the regression parameters of the model and derives the asymptotic variance-covariance matrix of all model parameters. The latter enables Wald tests on the structure and decay of spatial interactions in the SARAR(R,S) model. Section V presents the results of a Monte Carlo simulation exercise. Section VI summarizes our main findings and concludes. The appendix sketches the proofs of consistency and the asymptotic distribution of the model parameters, whereas the full details of the proofs are relegated to a technical appendix.
II. Basic Model Specification and Notation
The basic set-up of the error components model with spatially correlated error terms represents a generalization of Kapoor, Kelejian, and Prucha (2007), who consider a panel data error components model with nonstochastic explanatory variables and first-order spatial autoregressive disturbances, i.e., a SAR(1) model. The present paper delivers the following contributions. First, we allow for an R-th order spatial autoregressive process in the dependent variable and an S-th order spatial process in the disturbances cum error components, i.e., we consider a SARAR(R,S) panel data error components model. As we show below, this also covers the case of endogenous explanatory variables other than spatial lags of the dependent variable.2 Second, we prove consistency of proposed generalized moments (GM) estimators of the model parameters and derive their joint asymptotic distribution. In particular, we also relax the normality assumption used, for instance, in Kapoor, Kelejian, and Prucha (2007) to obtain a simplified version of the optimal weighting matrix for the moment conditions. Third, we provide some Monte Carlo evidence with a special emphasis on the spatial model parameter point estimates and the rejection probabilities of Wald tests of the SARAR(R,S) model against interesting alternatives such as the SARAR(1,1), SARAR(0,S), SARAR(R,0), and the non-spatial model.
The basic model comprises i=1,...,N cross-sectional units and t=1,...,T time periods. For time period t, the model reads
) ( ) ( ) ( ) ( 1 , , t t t t R N r rN rN N N N N X Ξ² W y u y = +
β
+ =Ξ»
, (1a) or 2See Lee and Liu (2008) and Badinger and Egger (2008b) for cross-sectional spatial models with a SARAR(R,S) process.
) ( ) ( ) (t N t N N t N Z Ξ΄ u y = + , (1b)
where yN(t) is an NΓ1 vector with cross-sectional observations of the dependent variable in year t, )XN(t is an NΓK matrix of observations on K non-stochastic explanatory variables, i.e., XN(t)=[x1,N(t),...,xK,N(t)] with each NΓ1 vector xk,N(t) denoting the observations on the k-th explanatory variable. The structure of spatial dependence in yN(t) is determined by the time-invariant NΓN matrices Wr,N, r=1,...,R, whose elements wij,r,N are assumed to
be known and will often (but need not) be specified as a decreasing function of geographical distance between the cross-sectional units i and j. The expression yr,N(t)=Wr,NyN(t) is referred to as the r-th spatial lag of yN. The specification of a higher-order process allows the strength of spatial interdependence in the dependent variable (reflected in the spatial autoregressive parameters
Ξ»
r,N, r=1,...,R) to vary across a fixed number of R subsets of relations between cross-sectional units.In equation (1b), the NΓ(K+R) design matrix is given by ZN(t)=[XN(t),YN(t)], with )] ( ),..., ( [ ) (t 1,N t R,N t N y y
Y = , and Ξ΄N =(Ξ²β²N,Ξ»β²N)β², where the KΓ1 parameter vector of the exogenous variables is given by Ξ²N =(Ξ²1,N,...,Ξ²K,N)β² and the RΓ1 vector of spatial autoregressive parameters of yN is defined as Ξ»N =(
Ξ»
1,N,...,Ξ»
R,N)β².The NΓ1 vector of error terms uN(t)=[u1,N(t),...,uN,N(t)]β² is assumed to follow a spatial autoregressive process given by
) ( ) ( ) ( 1 , , t t t S N m N N m N m N M u Ξ΅ u =
β
+ =Ο
, (1c) ) ( ) (t N N t N ΞΌ v Ξ΅ = + , (1d)where
Ο
m,N and Mm,N denote the time-invariant, unknown parameters and the known NΓN matrix of spatial interdependence, respectively. The structure of spatial correlation in the disturbances is determined by the S different, time-invariant NΓN matrices Mm,N. As in equation (1a), the specification of a higher-order process allows the strength of spatial interdependence in the disturbances (reflected in the parametersΟ
m,N, m=1,...,S) to vary across a fixed number of S subsets of relations between cross-sectional units. The expression) ( ) ( , ,N t mN N t m M u
u = is referred to as the m-th spatial lag of uN. The SΓ1 vector of the spatial autoregressive parameters of uN(t) is defined as ΟN =(
Ο
1,N,...,Ο
S,N)β².Finally, the NΓ1 vector of error terms Ξ΅N(t) consists of two components, ΞΌN and vN(t). As indicated by the notation, ΞΌN is time-invariant while vN(t) is not. The typical elements of
) (t
N
Ξ΅ and vN(t) are the scalars
Ξ΅
it,N and vit,N, respectively, and the NΓ1 vector of unit-specific error components is given by ΞΌN =(ΞΌ1,N,...,ΞΌN,N)β².
Stacking observations for all time periods such that t is the slow index and i is the fast index with all vectors and matrices, the model reads
N N N N N N X Ξ² Y Ξ» u y = + + , (2a) or N N N N Z Ξ΄ u y = + , (2b)
with the NTΓK regressor matrix XN =[Xβ²N(1),...,Xβ²N(T)]β², and YN =(y1,N,...,yR,N), where ] ) ( ),..., 1 ( [ , , ,N = β²rN β²rN T β² r y y
y is the NTΓ1 vector of observations on the r-th spatial lag of the
dependent variable yr,N. The NTΓ1 vector of disturbances uN =[uβ²N(1),...,uβ²N(T)] for the spatial autoregressive process of order S is given by
N S m mN T mN N N I M u Ξ΅ u =
β
β + =1 , , ( )Ο
, (2c)where IT is an identity matrix of dimension TΓT. The NTΓ1 vector Ξ΅N =[Ξ΅β²N(1),...,Ξ΅β²N(T)]β² is specified as N N N T N e I ΞΌ v Ξ΅ =( β ) + , (3a)
where eT is a unit vector of dimension TΓ1 and IN is an identity matrix of dimension N
NΓ . In light of (2c), the error term can also be written as
β
β
= = β β = β β = S m N N m N m N T S m N N m T N m N N 1 , , 1 , , (I M )u I (I M )u u Ξ΅Ο
Ο
. (3b) It follows thatβ
= β β β = S m N N m N m N T N 1 1 , , ) ] ( [I I M Ξ΅ uΟ
, (4a)and N R r N r N r N T N N R r N r N r N T N I I W X t Ξ² I I W u y [ ( ) ] ( ) [ ( ) 1] 1 , , 1 1 , , β = β =
β
β
+ β β β β =Ξ»
Ξ»
, (4b)The following assumptions are maintained throughout this paper.
Assumption 1.
Let T be a fixed positive integer. (a) For all 1β€tβ€T and 1β€iβ€N,N β₯1, the error components vit,N are identically and (mutually) independently distributed with E(vit,N)=0,
2 2
, )
(vitN v
E =Ο , where 0<
Ο
v2 <bv <β, and Evit,N4+Ξ· <β for someΞ·
>0. (b) For all 1,
1β€iβ€N N β₯ , the unit-specific error components
ΞΌ
i,N are identically and (mutually) independently distributed with E(ΞΌ
i,N)=0, E(ΞΌi2,N)=ΟΞΌ2, where 0<ΟΞΌ2 <bΞΌ <β, andβ < +Ξ·
ΞΌ
4 ,N iE for some
Ξ·
>0. (c) The processes {vit,N} and {ΞΌ
i,N} are independent of each other. Assumption 1 as maintained here is slightly stronger than that in Kapoor, Kelejian, and Prucha (2007), since it requires not only the fourth but also the (4+Ξ·
)-th moments of the error components to be finite for someΞ·
>0. This is required for the central limit theorem of Kelejian and Prucha (2008) to apply, which will be used to derive the asymptotic distribution of the parameter estimates in Section III.Assumption 1 implies that
2 2 ,
, )
( itN jsN v
E
Ξ΅
Ξ΅
=Ο
ΞΌ +Ο
for i= jand t =s, (5a)2 , , ) (
Ξ΅
itNΞ΅
jsN =Ο
ΞΌ E for i= j and tβ s, (5b) 0 ) ( it,N js,N = EΞ΅
Ξ΅
, otherwise. (5c)As a consequence, the variance-covariance matrix of the stacked error term Ξ΅N reads
NT v N T N N N E Ξ΅ Ξ΅ J I I β¦Ξ΅ 2 2 , = ( β² )=
Ο
ΞΌ( β )+Ο
, (6a)where JT =eTeβ²T is a TΓT matrix with unitary elements and INT is an identity matrix of dimension NT Γ NT. Equation (6a) can also be written as
N N v N 1, 2 1 , 0 2 , Q Q β¦Ξ΅ =
Ο
+Ο
, (6b)where
Ο
12 =Ο
v2 +TΟ
ΞΌ2. The two matrices Q0,N and Q1,N, which are central to the estimationof error component models and the moment conditions of the GM estimator, are defined as
N T T N T I J I Q0, =( β )β , (7) N T N T I J Q1, = β . (8)
Notice that Q0,N and Q1,N are both of order NT Γ NT, symmetric, idempotent, orthogonal to each other, and sum up to INT.3
Assumption 2.
(a) All diagonal elements of the matrices Wr,N, r=1,...,R, and Ms,N, s=1,...,S, are zero. (b) The admissible parameter space is restricted as follows:
) , ( , r r N N N r a a Ξ» Ξ»
Ξ»
β β , with <aΞ»Nr aNΞ»r β€aΞ»,r β€aΞ» <β , 0 , r = 1, β¦, R, andβ
< <β = λλ
A R r 1 r N , .The first part of Assumption (2b) requires the parameters
Ξ»
r,N, r =1,...,R to be finite. We take aΞ» such that max( , ),..., 1 r R r a aΞ» Ξ» =
= holds; the expression aΞ» will be used to denote an RΓ1 vector with elements aΞ». In the second part of Assumption (2b), the scalar AΞ» generally depends on the properties of the weights matrices Wr,N. For example, with row-normalized
matrices Wr,N, r=1,...,R, assuming that AΞ» =1 ensures that ( )
1 , ,
β
= β R r N r N r N W IΞ»
is invertible,as required in Assumption (2c). If the matrices Wr,N are not row-normalized, Assumption
(2c) is implied by 1 , ,..., 1 max β = ββ β β β β = rN R r
AΞ» W for some matrix norm β (see Horn and Johnson,
1985, p. 301). Analogous assumptions are made for the parameters of the spatial autoregressive error process:
) , ( , s s N N N s a a Ο Ο
Ο
β β , with <aNΟs aNΟs β€aΟ,s β€aΟ <β , 0 , s = 1, β¦, S, andβ
< <β = ΟΟ
A S m N m 1 , .We take aΟ such that max( , )
,..., 1 s S s a aΟ Ο =
= holds; the expression aΟ will be used to denote an
1 Γ
S vector with elements aΟ. As above, with row-normalized matrices Ms,N, s=1,...,S, the
3
Observe that pre-multiplying an NT Γ 1 vector with Q0,N transforms its elements into
deviations from cross-section specific sample means taken over time, and that pre-multiplying a vector by Q1,N transforms its elements into cross-section specific sample means. See
second part of this assumption ensures invertibility of ( ) 1 , ,
β
= β S m mN mN N M IΟ
if AΟ =1. If thematrices Ms,N are not row-normalized, Assumption (2c) is implied by
1 , ,..., 1 max β = ββ β β β β = sN S s AΟ M
for some matrix norm β .
(c) The matrices ( ) 1 , ,
β
= β R r N r N r N W IΞ»
and ( ) 1 , ,β
= β S m N m N m N MI
Ο
are nonsingular for) , ( r r N N r a aΞ» Ξ»
Ξ»
β β and ( r, r) N N s a aΟ ΟΟ
β β , respectively. This ensures that uN and yN areuniquely identified through equations (4a) and (4b).
Assumption 3.
The row and column sums of the matrices Wr,N, r=1,...,R, Ms,N, s=1,...,S,
1 1 , , ) ( β =
β
β R r rN rN N W IΞ»
, and 1 1 , , ) ( β =β
β S m mN mN N MI
Ο
are bounded uniformly in absolute value.(See Remark A.1 in Appendix A for a definition of row and column sum boundedness.)
In light of Assumptions 1-3 and Remark A.1 in the Appendix, it follows that E(uN)=0 and the variance-covariance matrix of uN is given by
β
β
= β = β β β β² β β = β² = S m mN mN N T S m mN mN N N T N N N E 1 1 , , 1 , 1 , , , (u u ) [I (I M ) ]β¦ [I (I M ) ] β¦uΟ
Ξ΅Ο
, (9a) andβ
β
= β = β β β² β + = β² S m mN mN N S m mN mN N v N N t t E 1 1 , , 1 1 , , 2 2 ) ( ) )( ( )] ( ) ( [u uΟ
ΞΌΟ
IΟ
M IΟ
M . (9b)Note that all variables and parameters except for the variances of the error components are allowed to depend on sample size N. Such a specification is consistent, for example, with models where the weights matrix is row-normalized and the number of neighbours of a given cross-sectional unit depends on sample size (see Kapoor, Kelejian, and Prucha, 2007, p. 102) or where the strength of interdependence (in terms of the spatial autoregressive parameters) changes with the number of neighbours. Note that XN is allowed to depend on sample size and may thus also contain spatial lags of exogenous variables. As a result, the model specification in equations (1a)-(1c) is fairly general, allowing for higher-order spatial dependence in the dependent variable, the explanatory variables, and the disturbances.
III. GM Estimation of a SAR(S) Model
In the following, we consider GM estimators for the spatial autoregressive parameters of the disturbance process in equation (1c) and derive the asymptotic joint distribution of all model parameters.
1. Moment Conditions
With an S-th order process (SAR(S), with S >1), the GM estimators of the parameters
N S N , ,
1 ,...,
Ο
Ο
,Ο
v2, andΟ
12 can be obtained by recognizing that β under Assumptions 1 and 2 β the moment conditions used by Kapoor, Kelejian, and Prucha (2007) hold for each matrixN s,
M , s=1,...,S. In particular, define for each Ms,N, s=1,...,S
] ) ( )[ ( ) ( 1 , , , , ,
β
= β β β = β = S m N N m T N m N N s T N N s T N s I M Ξ΅ I M u I M u Ξ΅Ο
. (10)A word on notation is in order here. In equation (10), subscript s has been introduced together with m to indicate that, with higher-order spatial processes, Ms,N and Mm,N meet in Ξ΅s,N. While we will use index s to refer to the matrix Ms,N, by which Ξ΅N is pre-multiplied in equation (10), index m is required for the summation over the terms
Ο
m,NMm,N.The moment conditions are then given by
Ma 0, 0, ] 2 ) 1 ( 1 [ ] ) 1 ( 1 [ N N N N N N v T N E T N E β² =
Ο
β = β² β Ξ΅ Q Ξ΅ v Q v , (11) M1,s ( ) 1 ] ) ( ) 1 ( 1 [ ] ) 1 ( 1 [ s,N 0,N s,N N 0,N T s,N s,N 0,N N v2 tr s,N s,N N T N E T N E Ξ΅ Q Ξ΅ vβ²Q I βMβ² M Q v = Mβ² M β = β² βΟ
, M2,s ( ) ] 0 ) 1 ( 1 [ ] ) 1 ( 1 [ , 0, β² 0, β β², 0, = β = β² β sN N N N N T sN N N T N E T N E Ξ΅ Q Ξ΅ v Q I M Q v , Mb 1, 1, ) 12 1 ( ] ) ( 1 [ ) 1 ( β²N N N = β²N Tβ² T β N N + β²N N N =Ο
N E N E N E Ξ΅ Q Ξ΅ ΞΌ e e I ΞΌ v Q v , M3,s ( ) ] 1 [ ] ) ( 1 [ ) 1 ( s,N 1,N s,N N T T s,N s,N N N 1,N T s,N s,N 1,N N N E N E N E Ξ΅β² Q Ξ΅ = ΞΌβ² eβ²e βMβ² M ΞΌ + vβ²Q I βMβ² M Q v ) ( 1 , , 2 1 tr sN sN N Mβ² M =Ο
, M4,s (1 β²s,N 1,N N)= [1 β²N( Tβ² T β β²s,N) N]+ [1 β²N 1,N( T β β²s,N) 1,N N]=0 N E N E N E Ξ΅ Q Ξ΅ ΞΌ e e M ΞΌ v Q I M Q v ,where
Ο
12 =Ο
v2+TΟ
ΞΌ2. The moment conditions associated with matrices Ms,N, s=1,...,S, through (10), are indexed with subscripts 1 to 4. The remaining two moment conditions,which do not depend on s, are denoted as Ma and Mb. For an S-th order process as given by equation (2c), we thus have (4S+2) moment conditions.4
Substituting equations (3b), (10), and (1c) into the 4S+2 moment conditions (11) yields a (4S+2) equation system in ( ,..., , , 12)
2 , ,
1
Ο
Ο
Ο
Ο
N S N v , which can be written as0
Ξ
Ξ³N β NbN = , (12)
where bN is a [2S+S(Sβ1)/2+2]Γ1 vector, given by
) , , ,..., ,..., , ,..., , ,..., ( 12 2 , , 1 , , 1 , 2 , 1 2 , 2 , 1 , , 1 β² =
Ο
NΟ
SNΟ
NΟ
SNΟ
NΟ
NΟ
NΟ
SNΟ
Sβ NΟ
SNΟ
vΟ
N b ,i.e., bN contains S linear terms
Ο
m,N, m=1,...,S, S quadratic termsΟ
m2,N, m=1,...,S,2 / ) 1 (Sβ
S cross products
Ο
m,NΟ
l,N, m=1,...,S β1 ,l=m+1,...,S, as well as 2 vΟ
andΟ
12. Forlater reference, we define the (S+2)Γ1 vector of all parameters as
) , , ,..., ( ) , , ( 2 1 2 , , 1 2 1 2 β²= β² β² = N
Ο
vΟ
Ο
NΟ
SNΟ
vΟ
N Ο ΞΈ . NΞ³ is a (4S+2)Γ1 vector with elements [
Ξ³
i,N], i=1,...,(4S+2), and ΞN is a )2 4
( S+ Γ[2S+S(Sβ1)/2+2] matrix with elements [
Ξ³
i,j,N], i=1,...,(4S+2), ] 2 2 / ) 1 ( 2 [ ,..., 1 + β + = S S Sj . The elements
Ξ³
i,N andΞ³
i,j,N will be defined below. Therow-index of the elements Ξ³N and ΞN will be chosen such that the equation system (12) has the following order. The first four rows correspond to the moment restrictions M1,1 to M4,1 associated with matrix M1,N through (10); rows five to eight correspond to M1,2 to M4,2
associated with matrix M2,N, and so forth; rows (Sβ4) to 4 correspond to the MS 1,S to M4,S associated with the matrix MS,N. Finally, rows (4S+1) and (4S+2) correspond to the moment conditions Ma and Mb, respectively, which do not depend on s.
4
Notice that further moment conditions are available through pre- and post-multiplying Q0,N
and Q1,N with Ξ΅r,β²N and Ξ΅s,N, rβ s, respectively, in moment conditions M1 and M3. The associated efficiency gain will depend on the properties of the weights matrices. If the two involved weights matrices are orthogonal, the corresponding moment condition is trivially satisfied and does not add any information. For the sake of brevity, we use the moment conditions given in (11), and leave an assessment of the potential efficiency gains from exploiting further moment conditions for future research.
The sample analogue to equation system (12) is given by ) ( ~ ~ N N N N N Ξ ΞΈ Ξ³ β b =
Ο
, (13)where the elements of ~ and Ξ³N Ξ~N are equal to those of Ξ³N and ΞN with the expectations operator suppressed and the disturbances uN replaced by (consistent) estimates u~ . N
GM estimates of the parameters
Ο
1,N,...,Ο
S,N, 2 vΟ
andΟ
12are then obtained as the solution to)] ( ~ ) ( [ )] ~ ~ ( ~ ) ~ ~ [( min arg 2 1 2 2 1, ,.., , , N N N N N N N N N N N N v S ΞΈ Ξ ΞΈ Ξ Ξ³ Ξ Ξ Ξ³ Ο Ο Ο Ο Ο Ο Ο β² = β β² β b b , (14)
i.e., the parameter estimates can be obtained from a (weighted) non-linear least squares regression of ~ on the columns of Ξ³N Ξ~N; )ΟN(ΞΈN can then be viewed as a vector of regression residuals. The optimal choice of the (4S+2)Γ(4S+2) weighting matrix ΞN and its estimation will be discussed below.
In the following, we define the elements of Ξ³N and ΞN, grouped by the corresponding moment conditions. Thereby, we use the following notation:
N N s T N s I M u u, =( β , ) , s=1,...,S, and (15a) N N m N s T N N m T N s T N sm I M I M u I M M u u , =( β , )( β , ) =( β , , ) , s=1,...,S, m=1,...,S. (15b) At this point, we need to introduce an index l=1,...,S for proper definition of the elements of
N
Ξ³ and ΞN.
Moment condition M1,s delivers s=1,...,S rows of equation system (12), appearing in rows
1 ) 1 (
4 sβ + with the following elements of Ξ³N and ΞN:
) 1 ( 1 , 1 ) 1 ( 4 β = + β T N N s Ξ³ E(usβ²,NQ0,Nus,N), (16a) ) ( ) 1 ( 2 , , 0 , , , 1 ) 1 ( 4 s mN E sN N smN T N β uβ² Q u = + β Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 1 ) 1 ( 4sβ + S+mN =βN Tβ E usmβ² NQ NusmN Ξ³ , m=1,...,S, ) ( ) 1 ( 2 , , 0 , , 2 / ) 1 ( ) 1 ( , 1 ) 1 ( 4 s S m mm l mN E smN N slN T N β uβ² Q u β = β + β β + + β Ξ³ , m=1,...,Sβ1, l=m+1,...,S,
) ( 1 , , , 1 2 / ) 1 ( 2 , 1 ) 1 ( 4s S S S N tr sN sN N Mβ² M = + β + + β
Ξ³
, 0 , 2 2 / ) 1 ( 2 , 1 ) 1 ( 4sβ + S+S Sβ + N =Ξ³
.Moment condition M2,s consists of s=1,...,S rows of equation system (12), appearing in rows
2 ) 1 (
4 sβ + with the following elements of Ξ³N and ΞN:
) ( ) 1 ( 1 , 0 , , 2 ) 1 ( 4 s N E sN N N T N β uβ² Q u = + β Ξ³ , (16b) ) ( ) 1 ( 1 , , 0 , , 0 , , , 2 ) 1 ( 4s mN E smN N N sN N mN T N β uβ² Q u +uβ² Q u = + β Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 2 ) 1 ( 4sβ + S+mN =βN T β E usmβ² NQ NumN Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 0 , , 2 / ) 1 ( ) 1 ( , 2 ) 1 ( 4s S m mm l mN E slN N mN smN N lN T N β uβ² Q u +uβ² Q u β = β + β β + + β Ξ³ , m=1,...,Sβ1, S m l= +1,..., , 0 , 1 2 / ) 1 ( 2 , 2 ) 1 ( 4sβ + S+S Sβ + N =
Ξ³
, 0 , 2 2 / ) 1 ( 2 , 2 ) 1 ( 4sβ + S+S Sβ + N =Ξ³
.Moment condition M3,s corresponds to s=1,...,S rows of equation system (12), appearing in
rows 4(sβ1)+3 with the following elements of Ξ³N and ΞN:
N N s 1 , 3 ) 1 ( 4 β + =
Ξ³
E(usβ²,NQ1,Nus,N), (16c) ) ( 2 , , 1 , , , 3 ) 1 ( 4 s mN E sN N smN N uβ² Q u = + βΞ³
, m=1,...,S, ) ( 1 , , 1 , , , 3 ) 1 ( 4s S mN E smN N smN N uβ² Q u β = + + βΞ³
, m=1,...,S, ) ( 2 , , 1 , , 2 / ) 1 ( ) 1 ( , 3 ) 1 ( 4sβ + S m+ βmmβ +lβmN =βN E usmβ² NQ NuslNΞ³
, m=1,...,Sβ1, l=m+1,...,S, 0 , 1 2 / ) 1 ( 2 , 3 ) 1 ( 4sβ + S+S Sβ + N =Ξ³
, ) ( 1 , , , 2 2 / ) 1 ( 2 , 3 ) 1 ( 4 s S S S N tr sN sN N Mβ² M = + β + + βΞ³
.Moment condition M4,s represents s=1,...,S rows of equation system (12) appearing in rows
4 ) 1 (
) ( 1 , 1 , , 4 ) 1 ( 4s N E sN N N N uβ² Q u = + β
Ξ³
, (16d) ) ( 1 , , 1 , , 1 , , , 4 ) 1 ( 4sβ + mN = N E usmβ² NQ NuN +usβ²NQ NumNΞ³
, m=1,...,S, ) ( 1 , , 1 , , , 4 ) 1 ( 4s S mN E smN N mN N uβ² Q u β = + + βΞ³
, m=1,...,S, ) ( 1 , , 1 , , , 1 , , 2 / ) 1 ( ) 1 ( , 4 ) 1 ( 4 s S m mm l mN E slN N mN smN N lN N uβ² Q u +uβ² Q u β = β + β β + + βΞ³
, m=1,...,Sβ1, l=m+1,...,S, 0 , 1 2 / ) 1 ( 2 , 4 ) 1 ( 4sβ + S+S Sβ + N =Ξ³
, 0 , 2 2 / ) 1 ( 2 , 4 ) 1 ( 4sβ + S+S Sβ + N =Ξ³
.Moment condition Ma reflects one equation of the system in (12), appearing in row (4S+1) with the following elements of Ξ³N and ΞN:
) ( ) 1 ( 1 , 0 , 1 4S N E N N N T N β uβ² Q u = + Ξ³ , (16e) ) ( ) 1 ( 2 , 0 , , , 1 4S+ mN = N T β E umβ² NQ NuN Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 1 4S S mN E mN N mN T N β uβ² Q u β = + + Ξ³ , m=1,...,S, ) ( ) 1 ( 2 , , 0 , , 2 / ) 1 ( ) 1 ( , 1 4S S m mm l mN E mN N lN T N β uβ² Q u β = β + β β + + Ξ³ , m=1,...,Sβ1, l=m+1,...,S, 1 , 1 2 / ) 1 ( 2 , 1 4S+ S+S Sβ + N =
Ξ³
, 0 , 2 2 / ) 1 ( 2 , 1 4S+ S+S Sβ + N =Ξ³
.Moment condition Mb is associated with one equation of the system in (12), appearing in row )
2 4
( S+ with the following elements of Ξ³N and ΞN:
) ( 1 , 1 , 2 4S N E N N N N uβ² Q u = +
Ξ³
, (16f) ) ( 2 , 1 , , , 2 4S mN E mN N N N uβ² Q u = +Ξ³
, m=1,...,S, ) ( 1 , , 1 , , , 2 4S+ S+mN =βN E uβ²mNQ NumNΞ³
, m=1,...,S, ) ( 2 , , 1 , , 2 / ) 1 ( ) 1 ( , 2 4S S m mm l mN E mN N lN N uβ² Q u β = β + β β + +Ξ³
, m=1,...,Sβ1, l=m+1,...,S, 0 , 1 2 / ) 1 ( 2 , 2 4S+ S+S Sβ + N =Ξ³
, 1 , 2 2 / ) 1 ( 2 , 2 4S+ S+S Sβ + N =Ξ³
.This completes the specification of the elements of the matrices Ξ³N and ΞN. The similarity of the structure between the expressions resulting from the moment conditions Ma, M1,s, and M2,s on the one hand and Mb, M3,s, and M4,s on the other hand is apparent. First, they differ by the normalization factor and the corresponding matrix of quadratic forms, Q0,N and Q1,N,
respectively. Second, the rows in (12) associated with Ma, M1,s and M2,s,s=1,...,S, do not
depend on Ο12 whereas the rows associated with Mb, M3,s, and M4,s, s=1,...,S, do not depend
on
Ο
v2. This fact will be used to define an initial GM estimator, which is based on a subset of moment conditions (Ma, M1,s, and M2,s) only, in order to obtain an estimate of the optimalweighting matrix ΞN.
For future reference, we define the (2S+1)Γ1 vector Ξ³0N as the sub-vector containing rows s and (s+1), s=1,...,S and row (4S+1)of Ξ³N, corresponding to M1,s, M2,s, and Ma.
Moreover, we define the (2S+1)Γ[2S+S(Sβ1)/2+1] matrix Ξ0N as the sub-matrix containing rows s and (s+1), s=1,...,S, and row (4S+1) of ΞN, corresponding to M1,s, M2,s, and Ma.
Analogously, we define the (2S+1)Γ1 vector Ξ³1N as the sub-vector containing rows 2s, )
1 2
( s+ , s=1,...,S, and row (4S+2) of Ξ³N, corresponding to M3,s, M4,s, and Mb. Finally, we
define the (2S+1)Γ[2S+S(Sβ1)/2+1] matrix Ξ1N as the sub-matrix containing rows 2s, )
1 2
( s+ , s=1,...,S, and (4S+2) of ΞN, corresponding to M3,s, M4,s, and Mb.
2. Definition of GM Estimators
We next define three alternative GM estimators for the spatial autoregressive parameters of the disturbance process given by (1c) and the variances of the error components.5
2.1. Initial GM Estimation
The initial GM estimator is a special case of (14), using the identity matrix as weighting matrix ΞN and a subset of moment conditions (Ma, M1,s and M2,s) only. It is thus based on the
the vector Ξ³0N and the matrix Ξ0N. Define ΞΈ0N as the corresponding parameter vector that excludes Ο12, i.e., ΞΈ0 =(Οβ²N,Οv2)=(Ο1,N,...,ΟS,N,Οv2), and accordingly
) , ,..., ,..., , ,..., , ,..., ( 2 , , 1 , , 1 , 2 , 1 2 , 2 , 1 , , 1 0 = β² β N SN v S N S N N N N S N N S N N Ο Ο Ο Ο Ο Ο Ο Ο Ο Ο Ο b .
The initial GM estimator is then obtained as the solution to
5
See Kapoor, Kelejian, and Prucha (2007) for analogous conditions under SARAR(0,1) estimation, assuming only nonstochastic regressors in equation (1a).
} ] , 0 [ , ), ( ) ( min{ arg ) , ,..., ( 2, 0 0 0 0 2 , , 1,N ΟSN ΟvN = ΟN N β²ΟN N β β€ β€ ΟvNβ bv Ο) ) ) ΞΈ ΞΈ aΟ Ο aΟ , (17a) with ΟN0(ΞΈ0)=ΟN0(Ο,Οv2)=(~Ξ³0N βΞ~0Nb0).
Using these initial estimates of (
Ο
1,N,...,Ο
S,N) and2 v
Ο
, the parameter Ο12 can be estimatedfrom moment condition Mb:
β
β
= = β β² β = S m mN mN N N S m mN mN N N N 1 , , , 1 1 , , 2 , 1 ) ~ ~ ( ) ~ ~ ( 1 u u Q u uΟ
Ο
Ο
) ) ) (17b) ... ~ ~ ... ~ ~ 2 , 1 1 , 2 4 , , 2 4 , 1 1 , 2 4 2 4S Ξ³ S N Ξ³ S S S N Ξ³S S N Ξ³ + β +Ο
) β β +Ο
) β + +Ο
) = . ~ ... ~ ~ , , 1 2 / ) 1 ( 2 , 2 4 , 2 , 1 1 2 , 2 4 2 , 2 , 2 4S S SN Ξ³ S S N N Ξ³ S S S S S N SN Ξ³ +Ο
) β + +Ο
)Ο
) β β + + βΟ
) βΟ
) β 2.2. Weighted GM EstimationWhile the initial GM estimator as defined in (17) is consistent (as will be shown below), it is inefficient. First, it ignores the information contained in moment conditions (Mb, M3,s and
M4,s).
6
Second, it is well known from the literature on generalized method of moments estimation, that it is optimal to use as weighting matrix the inverse of the (properly normalized) variance-covariance matrix of the moments, evaluated at true parameter values. Denote the optimal weighting matrix, which will be derived in Subsection 3.2, by Ξ¨βN1. In general, the optimal weighting matrix is unknown and has to be estimated, e.g., using the results from the initial GM estimation. In Subsection 3.3 we derive a consistent estimator of
1 β N
Ξ¨ , referred to as Ξ¨~βN1. The optimally weighted GM estimator is based on all (4S+2) moment conditions and uses Ξ~N =Ξ¨~βN1 as the weighting matrix for the moment conditions. It is defined as: } ] , 0 [ ], , 0 [ , ), ( ~ ) ( { min arg ) ~ , ~ , ~ ,..., ~ ( 12 2 2 1, 2 , , 1,N
Ο
SNΟ
vNΟ
N =Ο
N β² NΟ
N β β€ β€Ο
vβ bvΟ
β cΟ
ΞΈ Ξ ΞΈ aΟ Ο aΟ , with cβ₯bv+TbΞΌ, andΟ
N(ΞΈ)=Ο
N(Ο,Ο
v2,Ο
12)= (~Ξ³N β~ΞNb). (18)As already mentioned, the optimal weighting matrix is derived without distributional assumptions. As a consequence it involves third and fourth moments of the error components
6
This does not mean that any GM estimator using all moment conditions is necessarily superior. In fact, Kapoor, Kelejian, and Prucha (2007) show in a Monte Carlo study on their SAR(1) model that the initial GM estimator performs much better (in terms of bias and RMSE) than the unweighted GM estimator using all moment conditions. Their results suggest that a proper weighting of the moment conditions, in particular the weighting of moment conditions Ma, M1, and M2 relative to moment conditions Mb, M3, and M4 is of crucial importance.
N it
v , and
ΞΌ
i,N. Kapoor, Kelejian, and Prucha (2008) use the assumption that Ξ΅it,N is normally distributed to obtain a simplified weighting matrix as an approximation of the true optimal weighting matrix. For comparison, we also consider this simplified optimal weighting matrix, which is shown to be a special case of Ξ¨βN1 in the Appendix and referred to as (Ξ¨oN)β1. The simplified weighted GM estimator is defined as the weighted GM estimator given in (18), using Ξ~N =(Ξ¨~oN)β1.3. Asymptotic Properties of the GM Estimator for ΞΈN
3.1 Consistency
In order to prove consistency, the following additional assumptions are required:
Assumption 4.
Assume that u~N βuN =DNΞN, i.e., u~i,N βui,N =di.,NΞN, for i=1,...,NT,7 where DN is an P
NTΓ matrix, the 1ΓP vector di.,N denotes the i-th row of DN and ΞN is a PΓ1 vector. Let dij,N be the j-th element of di.,N. For some Ξ΄ >0, we assume that β€ <β
+ d N ij t c d E , ( )2 Ξ΄ ,
where cd does not depend on N, and that N1/2 ΞN =Op(1).
Assumption 4 will be fulfilled in many settings, e.g., if model (1a) contains endogenous variables (such as spatial lags of yN) and is estimated using two-stages least squares. In that case, ΞN denotes the difference between the parameter estimates and the true parameter values and di.,N is the (negative of the) i-th row of the design matrix ZN (compare Lemma 1 in Subsection 2 of Section IV). Under certain conditions, Assumption 4 will also be satisfied if model (1a) involves a non-linear specification (see Kelejian and Prucha, 2008, p. 12). Finally, observe that Assumption 4 implies that
β
= + β NT i N i NT 1 2 ., 1 ) ( d Ξ΄ is Op(1). Assumption 5.
(a) The smallest eigenvalues of Ξ0Nβ²Ξ0N and 1 1 N NΞ
Ξ β² are bounded away from zero, i.e., 0 ) ( * min β² β₯
Ξ»
>Ξ»
i N i N ΞΞ for i = 1, 2. (b) Ξ~N βΞN =op(1), where ΞN are (4S+2)Γ(4S+2) nonstochastic, symmetric, positive definite matrices. (c) The largest eigenvalues of ΞN are bounded uniformly from above, and the smallest eigenvalues of ΞN are bounded uniformly away from zero.
7
Note that we use single indexation i=1,...,NT to refer to the elements of the vectors that are stacked over time periods. (See the remark on notation in Appendix A.)
Assumption 5 implies that the smallest eigenvalues of Ξβ²NΞN and Ξβ²NΞNΞN are bounded uniformly away from zero, ensuring that the true parameter vector ΞΈN is identifiable unique. Moreover, by the equivalence of matrix norms, it follows from Assumption 5 that ΞN and
1 β N
Ξ are O(1).
Assumptions 1-5 ensure consistency of the GM estimators for ( , , 12)
2
Ο
Ο
v N N ΟΞΈ = β² . We
summarize these results in the following theorems, which are proved in Appendix B.
Theorem 1a.Consistency of Initial GM Estimator ~0
N
ΞΈ
Suppose Assumptions 1-5 hold. Then, provided the optimization space contains the parameter space, the initial GM estimators ( 1, ,..., , 2, )
0 = β² N v S N N Ο) Ο) Ο) )
ΞΈ defined by (17a), and Ο)12,N, defined
by (17b) are consistent for
Ο
1,N,...,Ο
S,N, 2 vΟ
, and Ο12, i.e., 0 , s, p N s N βΟ
βΟ
) , s=1,...,S, 2 2 0 , p v N v βΟ
βΟ
) , and 2 0 1 2 , 1 p N βΟ
βΟ
) as N ββ.Theorem 1b.Consistency of Weighted GM Estimator ΞΈ~N
Suppose Assumptions 1-5 hold. Then, provided the optimization space contains the parameter space, the weighted GM estimators ΞΈ~N(Ξ~N)=[Ο~1,N(Ξ~N),...,Ο~S,N(Ξ~N),
Ο
~v2,N(Ξ~N),Ο
~12,N(Ξ~N)]β² defined by (18) are consistent forΟ
1,N,...,Ο
S,N,Ο
v2, andΟ
12, i.e.,0 ) ~ ( ~ , s, p N s N N β
Ο
βΟ
Ξ , s=1,...,S,Ο
~v2,N(Ξ~N)βΟ
v2βp 0 , andΟ
~12,N (Ξ~N)βΟ
12βp 0 as N ββ. This result holds for an arbitrary weighting matrix (that satisfies Assumption 5). Hence, it applies to both the optimally weighted GM estimator defined by (18) with Ξ~N =(Ξ¨~N)β1and the simplified optimally weighted GM estimator ΞΈ~No with Ξ~N =(Ξ¨~oN)β1.3.2 Asymptotic Distribution of GM Estimator for ΞΈN
In the following we consider the asymptotic distribution of the optimally weighted GM estimator ΞΈ~N. To establish asymptotic normality of ~ (~ , ~ ,~12, )
2 ,N N v N N Ο
Ο
Ο
ΞΈ = , we need some additional assumptions. Assumption 6.Let DN be defined as in Assumption 4, such that u~N βuN =DNΞN. For any real NΓN matrix AN, whose row and column sums are bounded uniformly in absolute value, it holds that )Nβ1Dβ²NANuN βNβ1E(Dβ²NANuN)=op(1 .
A sufficient condition for Assumption 6 is, e.g., that the columns of DN are of the form
N N N Ξ Ξ΅
Ο + , where the elements of ΟN are bounded uniformly in absolute value and the row and column sums of Ξ N are bounded uniformly in absolute value (see Kelejian and Prucha, 2008, Lemma C.2). This will be the case in many applications, e.g., for the model in equation (1a), if DN equals (the negative of) matrix ZN (compare Lemma 1 in Section IV).
Assumption 7.
Let ΞN be defined as in Assumption 4. Then, ) 1 ( ) ( ) (NT 1/2ΞN = NT β1/2Tβ²NΞΎN +op , with TN =(Tvβ²,N,TΞΌβ²,N)β², )ΞΎN =(vβ²N,ΞΌβ²N β², i.e., ) 1 ( ) ( ) ( ) (NT 1/2ΞN = NT β1/2Tvβ²,NvN + NT β1/2TΞΌβ²,NΞΌN +op ,
where TN is an (NT+N)ΓP-dimensional real nonstochastic matrix whose elements are bounded uniformly in absolute value; Tv,N is of dimension (NTΓP) and TΞΌ,N is of dimension (NΓP). As remarked above, ΞN typically denotes the difference between the parameter estimates and the true parameter values. Assumption 7 will be satisfied by many estimators. In Section IV, we verify that it holds if the model in equation (1a) is estimated by two-stages least squares (TSLS) or feasible generalized two-stages least squares (FGTSLS).
The limiting distribution of the GM estimator of ΞΈNwill be seen to depend on (the inverse of) the matrix Jβ²NΞNJN and the variance-covariance matrix of a vector of quadratic forms in vN and ΞΌN, denoted as qN. We consider each of these expressions in the following. The
) 2 ( ) 2 4
( S+ Γ S+ matrix JN of derivatives of the (4S+2)Γ1 vector of moment conditions in (11) is given by ΞΈ Ξ Ξ³ ΞΈ J β² β β β = ( ) ) ( N N N N N b ) , , ,..., ( ,1, , , , , , , 1 N i N i N S i N i j j j j v Ο Ο = , with (19a) = N s i j, , s N N i N i
Ο
β β β(Ξ³., Ξ., b ) , i=1,...,(4S+2), s=1,...,S, = N i v j,Ο , v N N i N iΟ
β β β(Ξ³., Ξ., b ) , i=1,...,(4S+2), = N i j,Ο1, 1 ., ., ) (Ο
β β β Ξ³i N Ξi NbN , i=1,...,(4S+2),Using 0 ΞΈ Ξ³ = β² β β N
and ignoring the negative sign, we have
N N N N N Ο ΞΈ Ξ b Ξ B J = β² β β = ) ( , (19b)
where ΞN is defined above and of dimension (4S+2)Γ[2S+S(Sβ1)/2+2] and BN is a ) 2 ( ] 2 2 / ) 1 ( 2
[ S+S Sβ + Γ S+ matrix of the form
) , , , ( 1 β²2, β²3, β²4, β² = N N N N B B B B B , (20a)
with B1 =(IS,0SΓ2), )]B2,N =[diagsS=1(2
Ο
s,N),0SΓ2 , and B3,N =(Bβ²3,1,N,...,B3β²,Sβ1,N)β² is an SS
S( β1)/2Γ matrix, consisting of (Sβ1) vertically arranged blocks B3,m,N, ) 1 ( ,..., 1 β = S
m , which have the following structure:
) , , , ( , , , ( ) 2 , , 3mN = CmN dmN EmN 0SβmΓ B , (20b)
where Cm,N is a (Sβm)Γ(mβ1) matrix of zeros,8 dm,N is a (Sβm)Γ1 vector, defined as )
,...,
( 1, ,
,N m N S N
m =
Ο
+Ο
d , and Em,N =
Ο
m,NISβm. Finally, B4,N is a 2Γ(S +2)matrix, defined as β₯ β¦ β€ β’ β£ β‘ = + Γ Γ 1 , 0 , 1 , 1 1 1 , 4 S S N 0 0 B . (20c)For later reference, note that BN has full column rank (S+2); as a consequence, the )
2 ( ) 2
(S+ Γ S+ matrix Bβ²NBN is positive definite (see, e.g., Greene, 2003, p. 835).
We next consider the vector qN and its limiting distribution. First, define qN(ΞΈN,ΞN) as the 1
) 2 4
( S+ Γ vector of sample moments as given by (11) with the expectation operator suppressed, evaluated at the true parameter values, and ignoring the deterministic constants:
8
= ) , ( N N N ΞΈ Ξ q β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β¦ β€ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β£ β‘ β² β² β² β² β² β² β² β² β² β² β N N b N N N a N N N S N N N S N N N S N N N S N N N N N N N N N N N N N N u C u u C u u C u u C u u C u u C u u C u u C u u C u u C u ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ . ~ ~ ~ ~ ~ ~ ~ ~ , , , , 4 , , 3 , , 2 , , 1 , 1 , 4 , 1 , 3 , 1 , 2 , 1 , 1 1 , (21) where
β
β
= = β β β² β β² β β β = S m N m N m N T N N s N s T N S m N m N m N T N s T 1 , , , 0 , , , 0 1 , , , , 1 [ ( )] ( ) [ ( )] ) 1 ( 1 M I I Q M M I Q M I I CΟ
Ο
,β
β
= = β β β² + β β² β β β = S m mN mN N T N N s N s T N S m mN mN N T N s T 1 , , , 0 , , , 0 1 , , , , 2 [ ( )] [ ( )] [ ( )] ) 1 ( 2 1 M I I Q M M I Q M I I CΟ
Ο
,β
β
= = β β β² β β² β β = S m mN mN N T N N s N s T N S m mN mN N T N s 1 , , , 1 , , , 1 1 , , , , 3 [I (I M )]Q (I M M )Q [I (I M )] CΟ
Ο
,β
β
= = β β β² + β β² β β = S m mN mN N T N N s N s T N S m mN mN N T N s 1 , , , 1 , , , 1 1 , , , , 4 [ ( )] [ ( )] [ ( )] 2 1 I I M Q I M M Q I I M C Ο Ο ,β
β
= = β β β² β β β = S m N m N m N T N S m N m N m N T N a T 1 , , , 0 1 , , , [ ( )] [ ( )] ) 1 ( 1 M I I Q M I I C Ο Ο ,β
β
= = β β β² β β = S m N m N m N T N S m N m N m N T N b 1 , , , 1 1 , , , [I (I M )]Q [I (I M )] CΟ
Ο
. (22)By Assumption 3 and Remark A.1 in Appendix A, the row and column sums of the symmetric NTΓNT matrices Cp,s,N, p=1,...,4, s=1,...,S, Ca,N, and Cb,N are bounded
uniformly in absolute value. Also, note that C3,s,N, C4,s,N, Cb,N differ from C1,s,N, C2,s,N,
N a,
C only by the normalization and the use of Q1,N versus Q0,N.
In light of (21) and Lemma B.1 (see Appendix B), the elements of N1/2qN(ΟN,ΞN) can be expressed as
= ) , ( 2 / 1 N N N N q ΞΈ Ξ (1) . 2 / 1 , , 2 / 1 2 / 1 , , 2 / 1 2 / 1 , , 4 , , 4 2 / 1 2 / 1 , , 3 , , 3 2 / 1 2 / 1 , , 2 , , 2 2 / 1 2 / 1 , , 1 , , 1 2 / 1 2 / 1 , 1 , 4 , 1 , 4 2 / 1 2 / 1 , 1 , 3 , 1 , 3 2 / 1 2 / 1 , 1 , 2 , 1 , 2 2 / 1 2 / 1 , 1 , 1 , 1 , 1 2 / 1 p N N b N N a N N N a N N a N N N S N N S N N N S N N S N N N S N N S N N N S N N S N N N N N N N N N N N N N N N N N N N N N o N N N N N N N N N N N N N N N N N N N N + β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β₯ β¦ β€ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β’ β£ β‘ β² + β² β² + β² β² + β² β² + β² β² + β² β² + β² β² + β² β² + β² β² + β² β² + β² β β β β β β β β β β Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u Ξ Ξ± u C u , (23) where 2 ( ,, ) 1 , ,sN N psN N p N E D C u Ξ± = β β² , p=1,...,4, s=1,...,S, 2 ( , ) 1 ,N N aN N a N E D C u Ξ± = β β² , and ) ( 2 , 1 ,N N bN N b N E D C u Ξ± = β β²
. By Lemma B.1 the elements of the PΓ1 vectors Ξ±p,s,N,
4 ,..., 1
=
p , s=1,...,S, Ξ±a,N and Ξ±b,N are bounded uniformly