• No results found

Estimation and Testing of Higher-Order Spatial Autoregressive Panel Data Error Component Models

N/A
N/A
Protected

Academic year: 2021

Share "Estimation and Testing of Higher-Order Spatial Autoregressive Panel Data Error Component Models"

Copied!
43
0
0

Loading.... (view fulltext now)

Full text

(1)

ePub

WU

Institutional Repository

Harald Badinger and Peter Egger

Estimation and Testing of Higher-Order Spatial Autoregressive Panel Data

Error Component Models

Article (Accepted for Publication) (Refereed)

Original Citation:

Badinger, Harald and Egger, Peter (2013) Estimation and Testing of Higher-Order Spatial

Autoregressive Panel Data Error Component Models. Journal of Geographical Systems, 15 (4).

pp. 453-489. ISSN 1435-5930

This version is available at: http://epub.wu.ac.at/5468/

Available in ePubWU: March 2017

ePubWU, the institutional repository of the WU Vienna University of Economics and Business, is

provided by the University Library and the IT-Services. The aim is to enable open access to the scholarly output of the WU.

This document is the version accepted for publication and — in case of peer review — incorporates referee comments.

(2)

Estimation and Testing of Higher-Order Spatial Autoregressive

Panel Data Error Component Models

Harald Badinger

Department of Economics, Vienna University of Economics and Business

Peter Egger

Department of Management, Technology, and Economics at ETH Zürich; CEPR

May 2012

Abstract: This paper develops an estimator for higher-order spatial autoregressive panel data error component models with spatial autoregressive disturbances, SARAR(R,S). We derive the moment conditions and optimal weighting matrix without distributional assumptions for a generalized moments (GM) estimation procedure of the spatial autoregressive parameters of the disturbance process and define a generalized two-stage least squares estimator for the regression parameters of the model. We prove consistency of the proposed estimators, derive their joint asymptotic distribution, and provide Monte Carlo evidence on their small sample performance.

JEL-code: C13, C21, C23

Keywords: Higher-order spatial dependence; Generalized moments estimation; Two-stage least squares; Asymptotic statistics

(3)

I. Introduction

This paper considers the estimation of panel data models with higher-order spatially autocorrelated error components and spatially autocorrelated dependent variables (SARAR). Spatial interactions in data may originate from various sources such as strategic interaction between jurisdictions (to attract firms or other mobile agents) and firms (in their price, quantity, or quality setting) or general equilibrium effects which disseminate with spatial decay due to their transmission through trade flows, migration, or input-output relationships.1

Data sets used in empirical studies often share two features: first, they are available in the form of panel data, with a large cross-sectional and a small time series dimension. Second, spatial interactions of various kinds co-exist – such as geography-related, trade-related, migration-related interactions – or the decay function of a single spatial interaction is unknown. The estimator proposed here addresses these two features in a unified framework and provides a flexible setup for applied work, allowing specification tests, estimation, and inference in higher-order random effects panel data models.

There are two main motivations for the use of higher-order models. First, the distance between two cross-sectional units is not necessarily (only) geographical in nature, a point prominently made in the political science literature by Beck, Gleditsch, and Beardsley (2006). Possible candidates for channels of spatial dependence beyond adjacency or geographical distance between units (such as countries) relate to i) economic distance (e.g., trade, cross-border lending, migration, input-output relationships, profit shifting of multinational firms), ii) socio-economic distance (e.g., differences in per capita income, age structure, ethnic composition of population), iii) cultural distance (language, index of individualism, religion), or iv) political/institutional distance (electoral systems, degree of federalism, voting in international organisations). A higher-order approach allows including several weights matrices that are based on alternative concepts of distance, whose relative importance to each other is unknown. Apart from the fact that assessing the relevance of alternative transmission channels of spillovers is of interest in itself, wrongly imposing a first-order spatial regressive process may misattribute part of the spatial dependence in the data (due to omitted transmission channels) to the single transmission channel included in the model. As a consequence, the estimates of the spatial regressive parameters and their standard errors will then be biased and inconsistent.

Second, even with only one channel of interdependence (e.g., related to geographical distance), the functional form of the true distance decay function, reflected in the elements of the weights matrix, is typically unknown. The standard approach to assume a known spatial

1

See Anselin (2007) and Pinkse and Slade (2010) for surveys on the past, present, and future of spatial econometrics. A recent textbook on the matter is Le Sage and Pace (2009).

(4)

weights matrix with binary elements (e.g., for nearest neighbours) or elements that are specified as a decreasing function of distance (with known functional form and known distance decay parameter) seems highly restrictive. In fact, the distance decay function may exhibit discontinuities (e.g., border effects for interactions between units of different jurisdictions or countries) or a decay which is different from the one that is assumed by the researcher. Then, allowing subsets of the elements of the weights matrix to bear different spatial regressive parameters may significantly reduce the bias rooting in the assumption of an inadequate, preimposed decay function in a single weights matrix

The specification of higher-order models introduces non-trivial issues regarding the specification, interpretation, and estimation of these models. See Elhorst, Lacombe, and Piras (2012) and LeSage and Pace (2012) for a discussion of potential pitfalls in the use of higher-order models. However, these complications can be dealt with, while the alternative to stick with a misspecified first-order model for the sake of simplicity may result in inferior estimates. At the very least, the robustness of the results from a first order specification should be thoroughly explored, not only against variations in the specification of a single weights matrix (as is common in applied work), but also against the inclusion of further weights matrices, whenever economic theory suggests various channels of interdependence.

Estimation and testing of both random and fixed effects spatial regressive panel data models has been considered in the recent literature by Baltagi, Song, and Koh (2003) and Lee and Yu (2010) in a maximum likelihood framework and by Kapoor, Kelejain and Prucha (2007) as well as Mutl and Pfaffermayr (2011) using the generalized moments (GM) approach introduced by Kelejian and Prucha (1999). Obvious advantages of GM over ML estimation are that it does not rely on distributional assumptions and its computational simplicity. Moreover, comprehensive (cross-sectional) Monte Carlo Evidence by Arraiz, Drukker, Kelejian and Prucha (2010) shows that the large sample distribution provides a good approximation to the actual small sample distribution of the GM estimators of spatial regressive models.

The present paper builds on Kapoor, Kelejian, and Prucha (2007). They propose a GM estimator for the parameters of the spatial regressive error process in a random effects panel data model without endogenous explanatory variables (such as spatial lags of the dependent variable), derive a simplified weighting matrix for the moment conditions under the assumption of normally distributed error components, and prove consistency of the GM estimates. They also establish the asymptotic distribution of the regression parameters of the feasible generalized least squares (FGLS) estimates of the parameters of the main equation.

The present paper extends the estimation framework in Kapoor, Kelejian, and Prucha (2007) in several respects. In particular, it makes the following contributions:

(5)

estimates of the regression parameters and the GM estimates of the parameters of the spatial regressive disturbance process.

 Second, we allow for endogenous variables, including spatial lags of the dependent variable in the main equation, which is shown to affect the optimal weighting matrix for the moment conditions and the distribution of the GM estimates.

 Third, we dispense with the assumption of normally distributed error components, used by Kapoor, Kelejian, and Prucha (2007) to derive a simplified weighting matrix of the moments, retaining one of the main advantages of the GM approach of maximum-likelihood estimation.

 Fourth, we allow for higher-order rather than only first-order spatial regressive processes in both the dependent variable and the error process. This enables a more flexible design of the ‘spatial’ interdependence decay function and allows for the co-existence of more than one mode of interdependence as often suggested by economic theory (see Lee and Liu, 2010; and Badinger and Egger, 2011; for a treatment of higher-order spatial models with cross-section data).

 Finally, we provide some Monte Carlo evidence on the small sample performance of the proposed estimation procedure.

The remainder of the paper is organized as follows. Section II introduces the basic model specification. Section III proposes GM estimators of the parameters of spatial dependence in the error components. Section IV derives a two-stage least-squares (TSLS) routine to estimate the regression parameters of the model and derives the asymptotic distribution of all model parameters. Section V presents the results of a Monte Carlo simulation and section VI concludes. The detailed proofs are relegated to a technical appendix.

II. Basic Model Specification and Notation

The basic specification is a generalization of Kapoor, Kelejian, and Prucha (2007), who consider a panel data error components model with nonstochastic explanatory variables and first-order spatial autoregressive disturbances, i.e., a SAR(1) model. The present paper allows for an R-th order spatial autoregressive process in the dependent variable and an S-th order spatial process in the disturbances, i.e., we consider a SARAR(R,S) panel data error components model with i1,...,N cross-sectional units and t1,...,T time periods.2 For time period t, the model reads

2

Except for the ones on error components, the catalogue of assumptions in this paper extends to the case of fixed effects estimation (see Mundlak, 1978, for an early treatment of "within" parameter estimation in the context of an error components model).

(6)

) ( ) ( ) ( ) ( 1 , , t t t t N R r N N r N r N N N X β W y u y  

   , or (1a) ) ( ) ( ) (t N t N N t N Z δ u y   , (1b)

where yN(t) is an N1 vector with cross-sectional observations of the dependent variable in

year t, XN(t) is an NK matrix of observations on K non-stochastic explanatory variables, i.e., XN(t)[x1,N(t),...,xK,N(t)] with each N1 vector xk,N(t) denoting the observations on the k-th explanatory variable. The structure of spatial dependence in yN(t) is determined by

the time-invariant NN matrices Wr,N, r1,...,R, whose elements wij,r,N are assumed to be known (and often specified as decreasing function of geographical distance). The expression yr,N(t)Wr,NyN(t) is referred to as the r-th spatial lag of yN. The specification

of a higher-order process allows the strength of spatial interdependence in the dependent variable (reflected in the spatial autoregressive parameters r,N, r1,...,R) to vary across a fixed number of R subsets of relations between cross-sectional units.

In Equ. (1b), the N(KR) design matrix is given by ZN(t)[XN(t),YN(t)], with )] ( ),..., ( [ ) (t 1,N t R,N t N y y

Y  , and δN (βN,λN), where the K1 parameter vector of the

exogenous variables is given by βN (β1,N,...,βK,N) and the R1 vector of spatial autoregressive parameters of yN is defined as λN (1,N,...,R,N).

The N1 vector of error terms uN(t)[u1,N(t),...,uN,N(t)] is assumed to follow a spatial

autoregressive process given by

) ( ) ( ) ( 1 , , t t t N S m N N m N m N M u ε u

   , (1c) ) ( ) (t N N t N μ v ε   , (1d)

where m,N and Mm,N denote the time-invariant, unknown parameters and the known NN

matrix of spatial interdependence, respectively. The structure of spatial correlation in the disturbances is determined by the S different, time-invariant NN matrices Mm,N. The expression um,N(t)Mm,NuN(t) is referred to as the m-th spatial lag of uN. The S1 vector of the spatial autoregressive parameters of uN(t) is defined as ρN (1,N,...,S,N).

(7)

) (t

N

ε and vN(t) are the scalars it,N and vit,N, respectively, and the N1 vector of unit-specific error components is given by μN (μ1,N,...,μN,N).

Stacking observations for all time periods such that t is the slow index and i is the fast index with all vectors and matrices, the model reads

N N N N N N X β Y λ u y    , or (2a) N N N N Z δ u y   , (2b)

with the NTK regressor matrix XN [XN(1),...,XN(T)], and YN (y1,N,...,yR,N), where ] ) ( ),..., 1 ( [ , , ,N  rNrN Tr y y

y is the NT1 vector of observations on the r-th spatial lag of the dependent variable yr,N. The NT1 vector of disturbances uN [uN(1),...,uN(T)] for the spatial autoregressive process of order S is given by

N S m N N m T N m N I M u ε u

  1 , , ( )  , (2c)

where IT is an identity matrix of dimension TT. The NT1 vector εN [εN(1),...,εN(T)]

is specified as N N N T N e I μ v ε (  )  , (3a)

where eT is a unit vector of dimension T1 and IN is an identity matrix of dimension N

N . In light of (2c), the error term can also be written as

        S m N N m N m N T S m N N m T N m N N 1 , , 1 , , (I M )u I (I M )u u ε   . (3b) It follows that

     S m N N m N m N T N 1 1 , , ) ] ( [I I M ε u  , and (4a) N R r N r N r N T N N R r N r N r N T N I I W X t β I I W u y [ ( ) ] ( ) [ ( ) 1] 1 , , 1 1 , ,    

        , (4b)

The following assumptions are maintained throughout this paper. Assumption 1.

(8)

Let T be a fixed positive integer. (a) For all 1tT and 1iN,N1, the error components vit,N are identically and (mutually) independently distributed with E(vit,N)0,

2 2

, )

(vitN v

E  , where 0v2 bv , and Evit,N4  for some 0. (b) For all

1 ,

1iN N , the unit-specific error components i,N are identically and (mutually) independently distributed with E(i,N)0,

2 2 , ) (iN  E , where 02 b , and     4 ,N i

E for some  0. (c) The processes {vit,N} and {i,N} are independent of each other. Assumption 1 is slightly stronger than that in Kapoor, Kelejian, and Prucha (2007), since it requires not only the fourth but also the (4)-th moments of the error components to be finite for some 0. This is required to invoke the central limit theorem of Kelejian and Prucha (2010) in the derivation of the asymptotic distribution in section III.

Assumption 1 implies that

2 2 ,

, )

( itN jsN v

E     for ijand ts, (5a)

2 , , ) (itNjsN  E for ij and ts, (5b) 0 ) ( it,N js,NE   , otherwise. (5c)

As a consequence, the variance-covariance matrix of the stacked error term εN reads

NT v N T N N N E ε ε J I I Ωε 2 2 ,  (  )(  ) , (6a)

where JTeTeT is a TT matrix with unitary elements and INT is an identity matrix of

dimension NTNT. Eq. (6a) can also be written as

N N v N 1, 2 1 , 0 2 , Q Q Ωε   , (6b)

where 12 v2 T2. The two matrices Q0,N and Q1,N, which are central to the estimation

of error component models and the moment conditions of the GM estimator, are defined as

N T T N T I J I Q0, (  ) , (7) N T N T I J Q1,   . (8)

(9)

Notice that Q0,N and Q1,N are both of order NTNT, symmetric, idempotent, orthogonal to each other, and sum up to INT.

Assumption 2.

(a) All diagonal elements of Wr,N, r1,...,R, and Ms,N, s1,...,S, are zero. Without loss of generality, we assume that they are row-normalized in the following. b) The parameters r,N, r = 1, …, R, and s,N, s = 1, …, S, are finite and contained in the admissible parameter spaces

) , ( , r r N N N r a a      and , ( s, s) N N N s a a  

   ; with row-normalized matrices, we have 1

1 , 

R r N r  and 1 1 , 

S s N s  .3

Assumption 2 ensures invertibility of ( )

1 , ,

  S m N m N m N M I  and ( ) 1 , ,

  R r N r N r N W I  and thus

that y and N u are uniquely identified by (4a) and (4b). N

We emphasize that all results of the present paper hold under alternative normalizations of the weights matrices with corresponding modifications of the admissible parameter space (Lee and Liu, 2010). Notice further that the assumptions regarding the admissible parameters space given here and in Lee and Liu (2010) are sufficient but not necessary and might be overly restrictive. A detailed discussion of a possible relaxation of the constraints on the admissible parameter space is provided by Koch (2011a,b), Elhorst, Lacome, and Piras (2012), and LeSage and Pace (2012).

Assumption 3.

The row and column sums of Wr,N, r1,...,R, Ms,N, s1,...,S, 1

1 , , ) (  

R r N r N r N W I  , and 1 1 , , ) (  

S m N m N m N M

I  are bounded uniformly in absolute value.

By Assumptions 1-3 and Remark A.1 in the Appendix, it follows that E(uN)0 and the variance-covariance matrix of uN is given by

         S m N m N m N T S m N N m N m N T N N N E 1 1 , , 1 , 1 , , , (u u ) [I (I M ) ]Ω [I (I M ) ] Ωu    , and (9a) 3

All results of the present paper hold under alternative normalizations of the weights matrices with corresponding modifications of the admissible parameter space (Lee and Liu, 2010).

(10)

        S m N m N m N S m N m N m N v N N t t E 1 1 , , 1 1 , , 2 2 ) ( ) )( ( )] ( ) ( [u uIM IM . (9b)

All variables (including XN) and parameters except for the variances of the error components are allowed to depend on sample size N. As a result, the model specification in Eqs. (1a)-(1c) allows for higher-order spatial dependence in the dependent variable, the explanatory variables, and the disturbances.

III. GM Estimation of a SAR(S) Model

Below, we derive GM estimators for the spatial autoregressive parameters of the disturbance process (1c) and the asymptotic joint distribution of all model parameters.

1. Moment Conditions

With an S-th order process (SAR(S), with S 1), GM estimators of 1,N,...,S,N,

2

v

 , and 12

are obtained by recognizing that – under Assumptions 1 and 2 – the moment conditions used by Kapoor, Kelejian, and Prucha (2007) hold for each matrix Ms,N, s1,...,S. Define for each Ms,N, s1,...,S ] ) ( )[ ( ) ( 1 , , , , ,

       S m N N m T N m N N s T N N s T N s I M ε I M u I M u ε  . (10)

The moment conditions are then given by

Ma 2 , 0 , 0 ] ) 1 ( 1 [ ] ) 1 ( 1 [ N N N N N N v T N E T N E       ε Q ε v Q v , (11) M1,s ( ) 1 ] ) ( ) 1 ( 1 [ ] ) 1 ( 1 [ s,N 0,N s,N N 0,N T s,N s,N 0,N N v2 tr s,N s,N N T N E T N E ε Q ε vQ IMM Q vMM      , M2,s ( ) ] 0 ) 1 ( 1 [ ] ) 1 ( 1 [ , 0,0,  , 0,      sN N N N N T sN N N T N E T N E ε Q ε v Q I M Q v , Mb 1, 1, ) 12 1 ( ] ) ( 1 [ ) 1 ( N N N  NT TN N  N N N  N E N E N E ε Q ε μ e e I μ v Q v , M3,s ( ) ] 1 [ ] ) ( 1 [ ) 1 ( s,N 1,N s,N N T T s,N s,N N N 1,N T s,N s,N 1,N N N E N E N E εQ εμeeMM μvQ IMM Q v ) ( 1 , , 2 1 tr sN sN N MM  , M4,s (1 s,N 1,N N) [1 N( TT  s,N) N] [1 N 1,N( T  s,N) 1,N N]0 N E N E N E ε Q ε μ e e M μ v Q I M Q v ,

(11)

where 12 v2 T2. The moment conditions associated with matrices Ms,N, s1,...,S,

through (10), are indexed with subscripts 1 to 4. The remaining two moment conditions are independent of s and denoted as Ma and Mb. For an S-th order process as in (2c), we thus have (4S2) moment conditions.

It is apparent that under Assumptions 1-3 from M1,s and M3,s that there are potentially

) 1 (S

S further moment conditions, namely s s tr N T N E sN N sNvsN sN     (  ), 1 ] ) 1 ( 1 [ , , 2 , , 0 , Q ε M M ε  from M1,s and s s tr N N EsN N sN  ( sN sN),   1 ) 1 ( , , 2 1 , , 1 , Q ε M M ε  from M3,s 4

. For simplicity, we use only the moment conditions in (11) below, which are always available for weights matrices satisfying Assumptions 1 and 2. However, the results carry over to the more general estimator using all

2 ) 1 (

4SS S  moment conditions.

Substituting (3b), (10), and (1c) into the 4S2 moment conditions (11) yields a (4S 2) equation system in ( ,..., , , 12)

2 , ,

1   

N S N v , which can be written as

0 Γ

γNNbN  , (12)

where bN is a [2SS(S1)/22]1 vector, given by

) , , ,..., ,..., , ,..., , ,..., ( 12 2 , , 1 , , 1 , 2 , 1 2 , 2 , 1 , , 1    NSNNSNNNNSNS NSNvN b ,

i.e., bN contains S linear terms m,N, m1,...,S, S quadratic terms m2,N, m1,...,S,

2 / ) 1 (S

S cross products m,Nl,N, m1,...,S1,lm1,...,S, as well as v2 and 12. For later reference, we define the (S2)1 vector of all parameters as

) , , ,..., ( ) , , ( 12 2 , , 1 2 1 2     Nv   NSNvN ρ θ . N

γ is a (4S2)1 vector with elements [i,N], i1,...,(4S2), and ΓN is a )

2 4

( S [2SS(S1)/22] matrix with elements [i,j,N], i1,...,(4S2),

4

The efficiency gain from using these additional moment conditions depends on the properties of the weights matrices. If two weights matrices are orthogonal, i.e., Ms,NMs,N0, the corresponding moment condition is trivially satisfied for any set of (finite) parameter values and does not add any information.

(12)

] 2 2 / ) 1 ( 2 [ ,..., 1     S S S

j . The elements i,N and i,j,N will be defined below. The row-index of the elements γN and ΓN will be chosen such that the equation system (12) has the following order. The first four rows correspond to moment restrictions M1,1 to M4,1 associated with matrix M1,N through (10); rows five to eight correspond to M1,2 to M4,2 associated with matrix M2,N, and so forth; rows (S4) to 4S correspond to the M1,S to M4,S associated with matrix MS,N . Finally, rows (4S1) and (4S2) correspond to moment conditions Ma and

Mb, respectively, which are independent of s.

The sample analogue to (12) is given by

) ( ~ ~ N N N N N Γ θ γ  b  , (13)

where the elements of γ~ and N ΓN

~

are equal to those of γN and ΓN with the expectations

operator suppressed and the disturbances uN replaced by (consistent) estimates u~ . N

GM estimates of parameters 1,N,...,S,N,

2

v

 and 12are then obtained as the solution to

)] ( ~ ) ( [ )] ~ ~ ( ~ ) ~ ~ [( min arg 2 1 2 2 1, ,.., , , N N N N N N N N N N N N v S θ Θ θ Γ γ Θ Γ γ             b b , (14)

i.e., the parameter estimates can be obtained from a (weighted) non-linear least squares regression of γ~ on the columns of N ΓN

~

. The optimal choice of the (4S2)(4S2)

weighting matrix ΘN will be discussed below.

Below, we define the elements of γN and ΓN, grouped by the corresponding moment conditions, using N N s T N s I M u u, (  , ) , s1,...,S, and (15a) N N m N s T N N m T N s T N sm I M I M u I M M u u , (  , )(  , ) (  , , ) , s1,...,S, m1,...,S. (15b) M1,s delivers s1,...,S rows 4(s1)1 in (12): ) 1 ( 1 , 1 ) 1 ( 4    T N N sE(us,NQ0,Nus,N), (16a) ) ( ) 1 ( 2 , , 0 , , , 1 ) 1 ( 4 s mN E sN N smN T NuQ u     , m1,...,S,

(13)

) ( ) 1 ( 1 , , 0 , , , 1 ) 1 ( 4s S mN E smN N smN T NuQ u       , m1,...,S, ) ( ) 1 ( 2 , , 0 , , 2 / ) 1 ( ) 1 ( , 1 ) 1 ( 4s S m mm l mN E smN N slN T NuQ u           , m1,...,S1, lm1,...,S, ) ( 1 , , , 1 2 / ) 1 ( 2 , 1 ) 1 ( 4s S S S N tr sN sN N MM        , 0 , 2 2 / ) 1 ( 2 , 1 ) 1 ( 4s  SS S  N   . M2,s consists of s1,...,S rows 4(s1)2 in (12): ) ( ) 1 ( 1 , 0 , , 2 ) 1 ( 4s N E sN N N T NuQ u     , (16b) ) ( ) 1 ( 1 , , 0 , , 0 , , , 2 ) 1 ( 4s mN E smN N N sN N mN T NuQ uuQ u     , m1,...,S, ) ( ) 1 ( 1 , , 0 , , , 2 ) 1 ( 4s S mN E smN N mN T NuQ u       , m1,...,S, ) ( ) 1 ( 1 , , 0 , , , 0 , , 2 / ) 1 ( ) 1 ( , 2 ) 1 ( 4s S m mm l mN E slN N mN smN N lN T NuQ uuQ u           , m1,...,S1, S m l 1,..., , 0 , 1 2 / ) 1 ( 2 , 2 ) 1 ( 4s  SS S  N   , 0 , 2 2 / ) 1 ( 2 , 2 ) 1 ( 4s  SS S  N   . M3,s corresponds to s1,...,S rows 4(s1)3 in (12): N N s 1 , 3 ) 1 ( 4     E(us,NQ1,Nus,N), (16c) ) ( 2 , , 1 , , , 3 ) 1 ( 4 s mN E sN N smN N uQ u     , m1,...,S, ) ( 1 , , 1 , , , 3 ) 1 ( 4s S mN E smN N smN N uQ u       , m1,...,S, ) ( 2 , , 1 , , 2 / ) 1 ( ) 1 ( , 3 ) 1 ( 4s S m mm l mN E smN N slN N uQ u           , m1,...,S1, lm1,...,S, 0 , 1 2 / ) 1 ( 2 , 3 ) 1 ( 4s  SS S  N   , ) ( 1 , , , 2 2 / ) 1 ( 2 , 3 ) 1 ( 4s S S S N tr sN sN N MM        . M4,s represents s1,...,S rows 4(s1)4 in (12): ) ( 1 , 1 , , 4 ) 1 ( 4s N E sN N N N uQ u     , (16d) ) ( 1 , , 1 , , 1 , , , 4 ) 1 ( 4s mN E smN N N sN N mN N uQ uuQ u     , m1,...,S,

(14)

) ( 1 , , 1 , , , 4 ) 1 ( 4s S mN E smN N mN N uQ u       , m1,...,S, ) ( 1 , , 1 , , , 1 , , 2 / ) 1 ( ) 1 ( , 4 ) 1 ( 4s S m mm l mN E slN N mN smN N lN N uQ uuQ u           , m1,...,S1, lm1,...,S, 0 , 1 2 / ) 1 ( 2 , 4 ) 1 ( 4s  SS S  N   , 0 , 2 2 / ) 1 ( 2 , 4 ) 1 ( 4s  SS S  N   .

Ma reflects the equation in row (4S1) of (12):

) ( ) 1 ( 1 , 0 , 1 4S N E N N N T NuQ u    , (16e) ) ( ) 1 ( 2 , 0 , , , 1 4S mN E mN N N T NuQ u    , m1,...,S, ) ( ) 1 ( 1 , , 0 , , , 1 4S S mN E mN N mN T NuQ u      , m1,...,S, ) ( ) 1 ( 2 , , 0 , , 2 / ) 1 ( ) 1 ( , 1 4S S m mm l mN E mN N lN T NuQ u          , m1,...,S1, lm1,...,S, 1 , 1 2 / ) 1 ( 2 , 1 4SSS S  N   , 0 , 2 2 / ) 1 ( 2 , 1 4SSS S  N   .

Mb is associated with row (4S2) of (12):

) ( 1 , 1 , 2 4S N E N N N N uQ u    , (16f) ) ( 2 , 1 , , , 2 4S mN E mN N N N uQ u    , m1,...,S, ) ( 1 , , 1 , , , 2 4S S mN E mN N mN N uQ u      , m1,...,S, ) ( 2 , , 1 , , 2 / ) 1 ( ) 1 ( , 2 4S S m mm l mN E mN N lN N uQ u          , m1,...,S1, lm1,...,S, 0 , 1 2 / ) 1 ( 2 , 2 4SSS S  N   , 1 , 2 2 / ) 1 ( 2 , 2 4SSS S  N   .

For future reference, we define the (2S1)1 vector γ0N as the sub-vector containing rows s and (s1), s 1,...,S and row (4S1)of γN, corresponding to M1,s, M2,s, and Ma. Moreover, we define the (2S1)[2SS(S1)/21] matrix Γ0N as the sub-matrix

containing rows s and (s1), s 1,...,S, and row (4S1) of ΓN, corresponding to M1,s, M2,s, and Ma.

(15)

Analogously, we define the (2S1)1 vector γ1N as the sub-vector containing rows 2s,

) 1 2

( s , s1,...,S, and row (4S2) of γN, corresponding to M3,s, M4,s, and Mb. Finally, we

define the (2S1)[2SS(S1)/21] matrix Γ1N as the sub-matrix containing rows 2s,

) 1 2

( s , s1,...,S, and (4S2) of ΓN, corresponding to M3,s, M4,s, and Mb.

2. Definition of GM Estimators

We next define three alternative GM estimators for the spatial autoregressive parameters of the disturbance process given by (1c) and the variances of the error components.5

2.1. Initial GM Estimation

The initial GM estimator is a special case of (14), using the identity matrix as weighting matrix ΘN and a subset of moment conditions (Ma, M1,s and M2,s) only. It is based on γ0N and

0

N

Γ . Define θ0N as the corresponding parameter vector that excludes 12, i.e., ) , ,..., ( ) , ( 1, , 2 2 0 v N S N v N        ρ θ , and accordingly ) , ,..., ,..., , ,..., , ,..., ( 1, , 12, 2, 1, 2, 1, , 1, , 2 0    N SN v S N S N N N N S N N S N N            b .

The initial GM estimator is then obtained as the solution to

} ] , 0 [ , ), ( ) ( min{ arg ) , ,..., ( 2, 0 0 0 0 2 , , 1,NS NvN  N N N N    vNbv    a ρ a θ θ    , (17a) with N0(θ0)N0(ρ,2v) ) ~ ~ (γN0 Γ0Nb0 .

Using these initial estimates of (1,N,...,S,N) and v2, 12 can be estimated from moment condition Mb:

      S m N m N m N N S m N m N m N N N 1 , , , 1 1 , , 2 , 1 ) ~ ~ ( ) ~ ~ ( 1 u u Q u u      (17b) ... ~ ~ ... ~ ~ 2 , 1 1 , 2 4 , , 2 4 , 1 1 , 2 4 2 4S γ S N γ S S SN γ S S N γ        . ~ ... ~ ~ , , 1 2 / ) 1 ( 2 , 2 4 , 2 , 1 1 2 , 2 4 2 , 2 , 2 4S S SN γ S S N N γ S S S S S N S N γ          2.2. Weighted GM Estimation

While the initial GM estimator as defined in (17) is consistent, it is inefficient. First, it ignores the information contained in moment conditions (Mb, M3,s and M4,s). Second, as is known from the literature on GMM-estimation, it is optimal to use as weighting matrix the inverse of the (properly normalized) variance-covariance matrix of the moments, evaluated at true

5

See Kapoor, Kelejian, and Prucha (2007) for analogous conditions under SARAR(0,1) estimation, assuming only nonstochastic regressors in equation (1a).

(16)

parameter values. Denote the optimal weighting matrix, which will be derived in Subsection 3.2, by ΨN1 and its estimate by Ψ~N1. The optimally weighted GM estimator is based on all

) 2 4

( S moment conditions and uses Θ~NΨ~N1 as the weighting matrix. It is defined as } ] , 0 [ ], , 0 [ , ), ( ~ ) ( { min arg ) ~ , ~ , ~ ,..., ~ ( 12 2 2 1, 2 , , 1,NSNvNN  NNN    vbv   c    a ρ a θ Θ θ , with cbvTb, and ( ) ( , , ) 2 1 2    N θ N ρ v ) ~ ~ (γNΓNb . (18)

As already mentioned, the optimal weighting matrix is derived without distributional assumptions and involves third and fourth moments of the error components vit,N and i,N. Kapoor, Kelejian, and Prucha (2008) use the assumption that εit,N is normally distributed to obtain a simplified weighting matrix as an approximation of the true optimal weighting matrix. For comparison, we also consider such a weighting matrix, which is a special case of

1

N

Ψ (see the Appendix) and referred to as (ΨN)1. The simplified weighted GM estimator is defined as the weighted GM estimator given in (18), using Θ~N (Ψ~N)1.

3. Asymptotic Properties of the GM Estimator for θ N 3.1 Consistency

For proving consistency, the following additional assumptions are introduced:

Assumption 4.

Assume that u~NuNDNΔN, i.e., u~i,Nui,Ndi.,NΔN, for i1,...,NT,6 where DN is an P

NT matrix, the 1P vector di.,N denotes the i-th row of DN and ΔN is a P1 vector. Let dij,N be the j-th element of di.,N. For some  0, we assume that  

d N ij t c d E , ( )2  ,

where cd does not depend on N, and that N1/2 ΔNOp(1).

Assumption 4 will hold in many settings, e.g., if model (1a) contains endogenous variables (such as spatial lags of yN) and is estimated using 2SLS. In that case, ΔN denotes the

difference between the parameter estimates and the true parameter values and di.,N is the (negative of the) i-th row of the design matrix ZN (compare Lemma 1 in Subsection 2 of Section IV).

Assumption 5.

6

(17)

(a) The smallest eigenvalues of Γ0NΓ0N and Γ1NΓ1N are bounded away from zero, i.e., 0 ) ( * min     i N i N Γ

Γ for i = 1, 2. (b) Θ~NΘNop(1), where ΘN are (4S2)(4S2)

nonstochastic, symmetric, positive definite matrices. (c) The largest eigenvalues of ΘN are bounded uniformly from above, and the smallest eigenvalues of ΘN are bounded uniformly away from zero.

Assumption 5 implies that the smallest eigenvalues of ΓNΓN and ΓNΘNΓN are bounded uniformly away from zero, ensuring that the true parameter vector θN is identifiable unique. Moreover, by the equivalence of matrix norms, it follows from Assumption 5 that ΘN and

1

N

Θ are O(1).

Assumptions 1-5 ensure consistency of the GM estimators for θN (ρN,v2,12) as summarized in the following theorems (see Appendix B for a proof).

Theorem 1a. Consistency of Initial GM Estimator ~θN0

Suppose Assumptions 1-5 hold. Then, provided the optimization space contains the parameter space, the initial GM estimators θ0N (ρ1,N,...,ρS,v2,N) defined by (17a), and 12,N, defined by (17b) are consistent for 1,N,...,S,N, v2, and 12, i.e.,

0 , s, p N s N    , s1,...,S, v2,N v2p 0 , and 12,N 12p 0 as N.

Theorem 1b. Consistency of Weighted GM Estimator θ~N

Suppose Assumptions 1-5 hold. Then, provided the optimization space contains the parameter space, the weighted GM estimators ~ (~ ) [~ (~ ),...,~ (~ ),~ (~ ),~2 (~ )]

, 1 2 , , , 1   N N SN N vN N N N N N Θ ρ Θ ρ Θ Θ Θ θ  

defined by (18) are consistent for 1,N,...,S,N,v2, and

2 1  , i.e., 0 ) ~ ( ~ , s, p N s N N    Θ , s1,...,S, ~2, (~ ) 2 0 p v N N v    Θ , and ~ (~ ) 12 0 2 , 1 p N N    Θ as N .

This result holds for an arbitrary weighting matrix (satisfying Assumption 5). Hence, it applies to both the optimally weighted GM estimator defined by (18) with Θ~N (Ψ~N)1and its simplified variant θ~N with Θ~N (Ψ~N)1.

3.2 Asymptotic Distribution of GM Estimator for θN

In the following we consider the asymptotic distribution of the optimally weighted GM estimator θ~N. To establish asymptotic normality of θ~N (~ρN,~v2,N,~12,N), we introduce some additional assumptions.

(18)

Assumption 6.

Let DN be defined as in Assumption 4, such that u~NuNDNΔN. For any real NTNT

matrix AN, whose row and column sums are bounded uniformly in absolute value, it holds

that N 1 N N NN 1E( N N N)op(1)   u A D u A D .

A sufficient condition for Assumption 6 is, e.g., that the columns of DN are of the form

N N

N Π ε

π  , where the elements of πN are bounded uniformly in absolute value and the row and column sums of ΠN are bounded uniformly in absolute value (see Kelejian and Prucha, 2010, Lemma C.2). This will be the case in many applications, e.g., for the model in Eq. (1a), if DN equals (the negative of) matrix ZN (compare Lemma 1 in Section IV).

Assumption 7.

Let ΔN be defined as in Assumption 4. Then, ) 1 ( ) ( ) (NT 1/2ΔNNT 1/2TNξNop , with TN (Tv,N,T,N), ξN (vN,μN), i.e., ) 1 ( ) ( ) ( ) (NT 1/2ΔNNT 1/2Tv,NvNNT 1/2T,NμNop ,

where TN is an (NTN)P-dimensional real nonstochastic matrix whose elements are

bounded uniformly in absolute value; Tv,N is of dimension (NTP) and T,N is of dimension (NP). As remarked above, ΔN typically denotes the difference between the parameter estimates and the true parameter values. Assumption 7 will be satisfied by many estimators. In Section IV, we verify that it holds if the model in Eq. (1a) is estimated by TSLS or feasible generalized TSLS (FGTSLS).

The limiting distribution of the GM estimator of θNwill be shown to depend on (the inverse

of) the matrix JNΘNJN and the variance-covariance matrix of a vector of quadratic forms in

N

v and μN, denoted as qN. We consider each of these expressions in the following. The ) 2 ( ) 2 4

( S  S matrix JN of derivatives of the (4S2)1 vector of moment conditions in

(11) is given by θ Γ γ θ J      ( ) ) ( N N N N N b ) , , ,..., ( ,1, , , , , , , 1 N i N i N S i N i j j j j v    , with (19a)  N s i j, , s N N i N i    (γ., Γ., b ) , i1,...,(4S2), s1,...,S,

(19)

N i v j, , v N N i N i    (γ., Γ., b ) , i1,...,(4S2),  N i j,1, 1 ., ., ) (     γi N Γi NbN , i1,...,(4S2),

where γi.,N and Γi.,N denote the i-th row of γN and ΓN respectively.

Using 0 θ γ     N

and ignoring the negative sign, we have

N N N N N Γ b Γ B θ ρ J      ) ( , (19b)

where ΓN is defined above and of dimension (4S2)[2SS(S1)/22] and BN is a

) 2 ( ] 2 2 / ) 1 ( 2

[ SS S   S matrix of the form

) , , , ( 1 2, 3, 4,   N N N N B B B B B , (20a) with B1 (IS,0S2), 2, [ 1(2 s,N), S2)] S s N diag0 B , and ] , ) ,..., [( 3,1, 3, 1, ( 1)/2 2 , 3N  B N BSN0S S  B is an S(S1)/2(S2) matrix. The (S1) vertically arranged blocks, B3,m,N, m1,...,(S1), have the following structure:

) , , ( , , , , , 3mN CmN dmN EmN B  , (20b)

where Cm,N is a (Sm)(m1) matrix of zeros,7 dm,N is a (Sm)1 vector, defined as )

,...,

( 1, ,

,NmN S N

m  

d , and Em,N m,NISm. Finally, B4,N is a 2(S2)matrix, defined as           1 , 0 , 1 , 1 1 1 , 4 S S N 0 0 B . (20c)

We next consider the vector qN and its limiting distribution. First, define qN(θN,ΔN) as the

1 ) 2 4

( S  vector of sample moments as given by (11) with the expectation operator suppressed, evaluated at the true parameter values, and ignoring the deterministic constants:

7

(20)

 ) , ( N N N θ Δ q                                              N N b N N N a N N N S N N N S N N N S N N N S N N N N N N N N N N N N N N u C u u C u u C u u C u u C u u C u u C u u C u u C u u C u ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ . ~ ~ ~ ~ ~ ~ ~ ~ , , , , 4 , , 3 , , 2 , , 1 , 1 , 4 , 1 , 3 , 1 , 2 , 1 , 1 1 , (21) where

           S m N m N m N T N N s N s T N S m N m N m N T N s T 1 , , , 0 , , , 0 1 , , , , 1 [ ( )] ( ) [ ( )] ) 1 ( 1 M I I Q M M I Q M I I C   ,

            S m N m N m N T N N s N s T N S m N m N m N T N s T 1 , , , 0 , , , 0 1 , , , , 2 [ ( )] [ ( )] [ ( )] ) 1 ( 2 1 M I I Q M M I Q M I I C   ,

          S m N m N m N T N N s N s T N S m N m N m N T N s 1 , , , 1 , , , 1 1 , , , , 3 [I (I M )]Q (I M M )Q [I (I M )] C   ,

           S m N m N m N T N N s N s T N S m N m N m N T N s 1 , , , 1 , , , 1 1 , , , , 4 [ ( )] [ ( )] [ ( )] 2 1 M I I Q M M I Q M I I C   ,

         S m N m N m N T N S m N m N m N T N a T 1 , , , 0 1 , , , [ ( )] [ ( )] ) 1 ( 1 M I I Q M I I C   ,

        S m N m N m N T N S m N m N m N T N b 1 , , , 1 1 , , , [I (I M )]Q [I (I M )] C   . (22)

By Assumption 3 and Remark A.1 in Appendix A, the row and column sums of the symmetric NTNT matrices Cp,s,N, p1,...,4, s1,...,S, Ca,N, and Cb,N are bounded uniformly in absolute value. Also, note that C3,s,N, C4,s,N, Cb,N differ from C1,s,N, C2,s,N,

N a,

C only by the normalization and the use of Q1,N versus Q0,N.

In light of (21) and Lemma B.1 (see Appendix B), the elements of N1/2qN(ρN,ΔN) can be expressed as

(21)

 ) , ( 2 / 1 N N N N q θ Δ (1) . 2 / 1 , , 2 / 1 2 / 1 , , 2 / 1 2 / 1 , , 4 , , 4 2 / 1 2 / 1 , , 3 , , 3 2 / 1 2 / 1 , , 2 , , 2 2 / 1 2 / 1 , , 1 , , 1 2 / 1 2 / 1 , 1 , 4 , 1 , 4 2 / 1 2 / 1 , 1 , 3 , 1 , 3 2 / 1 2 / 1 , 1 , 2 , 1 , 2 2 / 1 2 / 1 , 1 , 1 , 1 , 1 2 / 1 p N N b N N a N N N a N N a N N N S N N S N N N S N N S N N N S N N S N N N S N N S N N N N N N N N N N N N N N N N N N N N N o N N N N N N N N N N N N N N N N N N N N                                                                              Δ α u C u Δ α u C u Δ α u C u Δ α u C u Δ α u C u Δ α u C u Δ α u C u Δ α u C u Δ α u C u Δ α u C u , (23) where 2 ( ,, ) 1 , ,sN N psN N p N E D C u α    , p1,...,4, s1,...,S, 2 ( , ) 1 ,N N aN N a N E D C u α    , and ) ( 2 , 1 ,N N bN N b N E D C u α   

. By Lemma B.1 the elements of the P1 vectors αp,s,N,

4 ,..., 1

p , s1,...,S, αa,N and αb,N are bounded uniformly in absolute value.

Using (22), (3c), Assumption 7, and Q0,NεNQ0,NvN we obtain:

) , ( 2 / 1 N N N N q θ Δ (1) ) 1 ( 1 . )] ( [ 2 1 ) ( )] ( [ ) 1 ( 2 1 ) ( ) 1 ( 1 , , 1 , , 0 , , 4 , 1 , , , 1 , , 3 , 1 , , , 1 , , 2 , 0 , , , 0 , , 1 , 0 , , , 0 2 / 1 p N N b N N N N N a N N N N N s N N N s N s T N N N N s N N N s N s T N N N N s N N N s N s T N N N N s N N N s N s T N N o T T T N                                                                 ξ a ε Q ε ξ a v Q v ξ a ε Q M M I Q ε ξ a ε Q M M I Q ε ξ a v Q M M I Q v ξ a v Q M M I Q v , (25)

for s1,...,S. The (NTN)1 vector ξN (vN,μN), ap,s,NT1TNαp,s,N, p1,...,4, S

s1,..., , aa,NT1TNαa,N, and ab,NT1TNαb,N, which can also be written as ] ) ( , ) [( ) , ( ,, , , 1 , ,, , ,, , ,        N s p N N s p N v N s p v N s p N s p a a T T α T α a , s1,...,S, p1,...,4, and ] ) ( , ) [( ) , ( , , 1 , , , , ,        N a N N a N v N a v N a N a a a T T α T α a , ] ) ( , ) [( ) , ( , , , , 1 , , ,        N b N N b N v

References

Related documents

Almighty and everlasting God, you govern all things both in heaven and on earth: Mercifully hear the supplications of your people, and in our time grant us your peace; through Jesus

In this paper, we analyze new variants of SMPSO, a multi-objective swarm optimization technique, based on different archiving strategies in the scope of a benchmarking set of

Abstract : Using evidence on variation in the gender gap in labor force participation rates (LFPR) across home country groups in the United States, this paper attempts to assess

Reset fixture. Contact Martin service personnel if problem continues. color wheel will not index correctly, sensor circuit defective or magnet missing). After the time-out,

The questions that ask about your background will be used only to describe the types of students completing this survey.. The information will not be used to find out

Here, we demonstrate that cell surface perlecan HSPG mediates a distinct pathway for ligand internalization and lysosomal de- livery that is kinetically and biochemically distinct

Even drive with the licence expiry date site good driving license expiry date online using dmv is with the renewal in person at a web page with dates.. Nsw licence is this

This research was conducted with the aim to investigate influence of loneliness and grieving process among older people living in an elderly care facility and people living in