• No results found

Estimation of Higher-Order Spatial Autoregressive Panel Data Error Component Models

N/A
N/A
Protected

Academic year: 2021

Share "Estimation of Higher-Order Spatial Autoregressive Panel Data Error Component Models"

Copied!
111
0
0

Loading.... (view fulltext now)

Full text

(1)

Estimation of Higher-Order Spatial

Autoregressive Panel Data Error

Component Models

H

ARALD

B

ADINGER

P

ETER

E

GGER

CES

IFO

W

ORKING

P

APER

N

O

.

2556

C

ATEGORY

10:

E

MPIRICAL AND

T

HEORETICAL METHODS

F

EBRUARY

2009

An electronic version of the paper may be downloaded β€’ from the SSRN website: www.SSRN.com β€’ from the RePEc website: www.RePEc.org β€’ from the CESifo website: Twww.CESifo-group.org/wpT

(2)

CESifo Working Paper No. 2556

Estimation of Higher-Order Spatial

Autoregressive Panel Data Error

Component Models

Abstract

This paper develops an estimator for higher-order spatial autoregressive panel data error component models with spatial autoregressive disturbances, SARAR(R,S). We derive the moment conditions and optimal weighting matrix without distributional assumptions for a generalized moments (GM) estimation procedure of the spatial autoregressive parameters of the disturbance process and define a generalized two-stages least squares estimator for the regression parameters of the model. We prove consistency of the proposed estimators, derive their joint asymptotic distribution, and provide Monte Carlo evidence on their small sample performance.

JEL Code: C13, C21, C23.

Keywords: higher-order spatial dependence, generalized moments estimation, two-stages least squares, asymptotic statistics.

Harald Badinger Department of Economics Vienna University of Economics and

Business Administration Althanstrasse 39 - 45 1090 Vienna Austria [email protected] Peter Egger

Ifo Institute for Economic Research and Center for Economic Studies at

the University of Munich Poschingerstrasse 5

81679 Munich Germany [email protected]

(3)

I. Introduction

This paper considers the estimation of panel data models with higher-order spatially autocorrelated error components and spatially autocorrelated dependent variables. Spatial interactions in data may originate from various sources such as strategic interaction between jurisdictions (to attract firms or other mobile agents) and firms (in their price, quantity, or quality setting) or general equilibrium effects which disseminate with spatial decay due to their transmission through trade flows, migration, or input-output relationships.1 Data sets used in empirical studies often share three features: first, they are available in the form of panel data, with a large cross-sectional and a small time series dimension; second, spatial interactions of various kinds co-exist – such as geography-related, trade-related, migration-related interactions – or the decay function of a single spatial interaction is unknown; third, it is unclear whether spatial interactions are local – and affect only immediate neighbors – or global – and affect second third and other neighbors with repercussions. The estimator proposed here addresses the mentioned three features in a unified framework. It allows for panel data with a fixed but arbitrary number of channels or decay segments of spatial interaction in both the error components and the dependent variable, referred to as SARAR(R,S).

In developing the estimator, we build on the SARAR(1,1) generalized moments (GM) framework in Kelejian and Prucha (1999) for a single cross-section and the SARAR(0,1) model in Kapoor, Kelejian, and Prucha (2007) for panel data error components models. Obvious advantages of the GM framework is that it does not rely on distributional assumptions and that it can be applied to large data-sets without imposing any restrictions on the matrices of spatial interdependence. We derive GM estimators for the spatial regressive parameters of the disturbance process based on alternative weighting schemes for the moments. We then define a feasible generalized two-stages least squares (FGTSLS) estimator for the model’s regression parameters. We determine asymptotic properties of the estimators for the case where the time dimension of the panel and the number of spatial interactions is fixed while the number of cross-sectional units approaches infinity. In particular, we prove that the proposed GM and FGTSLS procedures obtain consistent estimators of the model parameters and we derive their joint asymptotic distribution.

1

See Cliff and Ord (1973, 1981), Anselin (1988), and Cressie (1993) for classic references about spatial econometric models in general. Recent theoretical contributions of spatial panel data models include Baltagi, Song, and Koh (2003), Baltagi and Li (2004), Baltagi, Song, Jung, and Koh (2007), Kapoor, Kelejian, and Prucha (2007), Korniotis (2008), Baltagi, Egger, and Pfaffermayr (2008), and Lee and Yu (2008). Recent applications of spatial panel data models include Druska and Horrace (2004), Arbia, Basile, and Piras (2005), Egger, Pfaffermayr, and Winner (2005), Baltagi, Egger, and Pfaffermayr (2007), and Badinger and Egger (2008a).

(4)

The remainder of the paper is organized as follows. Section II introduces the basic model specification and some notation. Section III proposes GM estimators of the parameters of spatial dependence in the error components based on alternative weighting schemes of the moments. Section IV derives a two-stages least-squares routine to estimate the regression parameters of the model and derives the asymptotic variance-covariance matrix of all model parameters. The latter enables Wald tests on the structure and decay of spatial interactions in the SARAR(R,S) model. Section V presents the results of a Monte Carlo simulation exercise. Section VI summarizes our main findings and concludes. The appendix sketches the proofs of consistency and the asymptotic distribution of the model parameters, whereas the full details of the proofs are relegated to a technical appendix.

II. Basic Model Specification and Notation

The basic set-up of the error components model with spatially correlated error terms represents a generalization of Kapoor, Kelejian, and Prucha (2007), who consider a panel data error components model with nonstochastic explanatory variables and first-order spatial autoregressive disturbances, i.e., a SAR(1) model. The present paper delivers the following contributions. First, we allow for an R-th order spatial autoregressive process in the dependent variable and an S-th order spatial process in the disturbances cum error components, i.e., we consider a SARAR(R,S) panel data error components model. As we show below, this also covers the case of endogenous explanatory variables other than spatial lags of the dependent variable.2 Second, we prove consistency of proposed generalized moments (GM) estimators of the model parameters and derive their joint asymptotic distribution. In particular, we also relax the normality assumption used, for instance, in Kapoor, Kelejian, and Prucha (2007) to obtain a simplified version of the optimal weighting matrix for the moment conditions. Third, we provide some Monte Carlo evidence with a special emphasis on the spatial model parameter point estimates and the rejection probabilities of Wald tests of the SARAR(R,S) model against interesting alternatives such as the SARAR(1,1), SARAR(0,S), SARAR(R,0), and the non-spatial model.

The basic model comprises i=1,...,N cross-sectional units and t=1,...,T time periods. For time period t, the model reads

) ( ) ( ) ( ) ( 1 , , t t t t R N r rN rN N N N N X Ξ² W y u y = +

βˆ‘

+ =

Ξ»

, (1a) or 2

See Lee and Liu (2008) and Badinger and Egger (2008b) for cross-sectional spatial models with a SARAR(R,S) process.

(5)

) ( ) ( ) (t N t N N t N Z Ξ΄ u y = + , (1b)

where yN(t) is an NΓ—1 vector with cross-sectional observations of the dependent variable in year t, )XN(t is an NΓ—K matrix of observations on K non-stochastic explanatory variables, i.e., XN(t)=[x1,N(t),...,xK,N(t)] with each NΓ—1 vector xk,N(t) denoting the observations on the k-th explanatory variable. The structure of spatial dependence in yN(t) is determined by the time-invariant NΓ—N matrices Wr,N, r=1,...,R, whose elements wij,r,N are assumed to

be known and will often (but need not) be specified as a decreasing function of geographical distance between the cross-sectional units i and j. The expression yr,N(t)=Wr,NyN(t) is referred to as the r-th spatial lag of yN. The specification of a higher-order process allows the strength of spatial interdependence in the dependent variable (reflected in the spatial autoregressive parameters

Ξ»

r,N, r=1,...,R) to vary across a fixed number of R subsets of relations between cross-sectional units.

In equation (1b), the NΓ—(K+R) design matrix is given by ZN(t)=[XN(t),YN(t)], with )] ( ),..., ( [ ) (t 1,N t R,N t N y y

Y = , and Ξ΄N =(Ξ²β€²N,Ξ»β€²N)β€², where the KΓ—1 parameter vector of the exogenous variables is given by Ξ²N =(Ξ²1,N,...,Ξ²K,N)β€² and the RΓ—1 vector of spatial autoregressive parameters of yN is defined as Ξ»N =(

Ξ»

1,N,...,

Ξ»

R,N)β€².

The NΓ—1 vector of error terms uN(t)=[u1,N(t),...,uN,N(t)]β€² is assumed to follow a spatial autoregressive process given by

) ( ) ( ) ( 1 , , t t t S N m N N m N m N M u Ξ΅ u =

βˆ‘

+ =

ρ

, (1c) ) ( ) (t N N t N ΞΌ v Ξ΅ = + , (1d)

where

ρ

m,N and Mm,N denote the time-invariant, unknown parameters and the known NΓ—N matrix of spatial interdependence, respectively. The structure of spatial correlation in the disturbances is determined by the S different, time-invariant NΓ—N matrices Mm,N. As in equation (1a), the specification of a higher-order process allows the strength of spatial interdependence in the disturbances (reflected in the parameters

ρ

m,N, m=1,...,S) to vary across a fixed number of S subsets of relations between cross-sectional units. The expression

) ( ) ( , ,N t mN N t m M u

u = is referred to as the m-th spatial lag of uN. The SΓ—1 vector of the spatial autoregressive parameters of uN(t) is defined as ρN =(

ρ

1,N,...,

ρ

S,N)β€².
(6)

Finally, the NΓ—1 vector of error terms Ξ΅N(t) consists of two components, ΞΌN and vN(t). As indicated by the notation, ΞΌN is time-invariant while vN(t) is not. The typical elements of

) (t

N

Ξ΅ and vN(t) are the scalars

Ξ΅

it,N and vit,N, respectively, and the NΓ—1 vector of unit-specific error components is given by ΞΌN =(ΞΌ1,N,...,ΞΌN,N)β€².

Stacking observations for all time periods such that t is the slow index and i is the fast index with all vectors and matrices, the model reads

N N N N N N X Ξ² Y Ξ» u y = + + , (2a) or N N N N Z Ξ΄ u y = + , (2b)

with the NTΓ—K regressor matrix XN =[Xβ€²N(1),...,Xβ€²N(T)]β€², and YN =(y1,N,...,yR,N), where ] ) ( ),..., 1 ( [ , , ,N = β€²rN β€²rN T β€² r y y

y is the NTΓ—1 vector of observations on the r-th spatial lag of the

dependent variable yr,N. The NTΓ—1 vector of disturbances uN =[uβ€²N(1),...,uβ€²N(T)] for the spatial autoregressive process of order S is given by

N S m mN T mN N N I M u Ξ΅ u =

βˆ‘

βŠ— + =1 , , ( )

ρ

, (2c)

where IT is an identity matrix of dimension TΓ—T. The NTΓ—1 vector Ξ΅N =[Ξ΅β€²N(1),...,Ξ΅β€²N(T)]β€² is specified as N N N T N e I ΞΌ v Ξ΅ =( βŠ— ) + , (3a)

where eT is a unit vector of dimension TΓ—1 and IN is an identity matrix of dimension N

NΓ— . In light of (2c), the error term can also be written as

βˆ‘

βˆ‘

= = βˆ’ βŠ— = βŠ— βˆ’ = S m N N m N m N T S m N N m T N m N N 1 , , 1 , , (I M )u I (I M )u u Ξ΅

ρ

ρ

. (3b) It follows that

βˆ‘

= βˆ’ βˆ’ βŠ— = S m N N m N m N T N 1 1 , , ) ] ( [I I M Ξ΅ u

ρ

, (4a)
(7)

and N R r N r N r N T N N R r N r N r N T N I I W X t Ξ² I I W u y [ ( ) ] ( ) [ ( ) 1] 1 , , 1 1 , , βˆ’ = βˆ’ =

βˆ‘

βˆ‘

+ βŠ— βˆ’ βˆ’ βŠ— =

Ξ»

Ξ»

, (4b)

The following assumptions are maintained throughout this paper.

Assumption 1.

Let T be a fixed positive integer. (a) For all 1≀t≀T and 1≀i≀N,N β‰₯1, the error components vit,N are identically and (mutually) independently distributed with E(vit,N)=0,

2 2

, )

(vitN v

E =Οƒ , where 0<

Οƒ

v2 <bv <∞, and Evit,N4+η <∞ for some

Ξ·

>0. (b) For all 1

,

1≀i≀N N β‰₯ , the unit-specific error components

ΞΌ

i,N are identically and (mutually) independently distributed with E(

ΞΌ

i,N)=0, E(ΞΌi2,N)=σμ2, where 0<σμ2 <bΞΌ <∞, and

∞ < +η

ΞΌ

4 ,N i

E for some

Ξ·

>0. (c) The processes {vit,N} and {

ΞΌ

i,N} are independent of each other. Assumption 1 as maintained here is slightly stronger than that in Kapoor, Kelejian, and Prucha (2007), since it requires not only the fourth but also the (4+

Ξ·

)-th moments of the error components to be finite for some

Ξ·

>0. This is required for the central limit theorem of Kelejian and Prucha (2008) to apply, which will be used to derive the asymptotic distribution of the parameter estimates in Section III.

Assumption 1 implies that

2 2 ,

, )

( itN jsN v

E

Ξ΅

Ξ΅

=

Οƒ

ΞΌ +

Οƒ

for i= jand t =s, (5a)

2 , , ) (

Ξ΅

itN

Ξ΅

jsN =

Οƒ

μ E for i= j and t≠s, (5b) 0 ) ( it,N js,N = E

Ξ΅

Ξ΅

, otherwise. (5c)

As a consequence, the variance-covariance matrix of the stacked error term Ξ΅N reads

NT v N T N N N E Ξ΅ Ξ΅ J I I ΩΡ 2 2 , = ( β€² )=

Οƒ

ΞΌ( βŠ— )+

Οƒ

, (6a)

where JT =eTeβ€²T is a TΓ—T matrix with unitary elements and INT is an identity matrix of dimension NT Γ— NT. Equation (6a) can also be written as

N N v N 1, 2 1 , 0 2 , Q Q ΩΡ =

Οƒ

+

Οƒ

, (6b)
(8)

where

Οƒ

12 =

Οƒ

v2 +T

Οƒ

ΞΌ2. The two matrices Q0,N and Q1,N, which are central to the estimation

of error component models and the moment conditions of the GM estimator, are defined as

N T T N T I J I Q0, =( βˆ’ )βŠ— , (7) N T N T I J Q1, = βŠ— . (8)

Notice that Q0,N and Q1,N are both of order NT Γ— NT, symmetric, idempotent, orthogonal to each other, and sum up to INT.3

Assumption 2.

(a) All diagonal elements of the matrices Wr,N, r=1,...,R, and Ms,N, s=1,...,S, are zero. (b) The admissible parameter space is restricted as follows:

) , ( , r r N N N r a a Ξ» Ξ»

Ξ»

∈ βˆ’ , with <aΞ»Nr aNΞ»r ≀aΞ»,r ≀aΞ» <∞ , 0 , r = 1, …, R, and

βˆ‘

< <∞ = λ

Ξ»

A R r 1 r N , .

The first part of Assumption (2b) requires the parameters

Ξ»

r,N, r =1,...,R to be finite. We take aΞ» such that max( , )

,..., 1 r R r a aΞ» Ξ» =

= holds; the expression aΞ» will be used to denote an RΓ—1 vector with elements aΞ». In the second part of Assumption (2b), the scalar AΞ» generally depends on the properties of the weights matrices Wr,N. For example, with row-normalized

matrices Wr,N, r=1,...,R, assuming that AΞ» =1 ensures that ( )

1 , ,

βˆ‘

= βˆ’ R r N r N r N W I

Ξ»

is invertible,

as required in Assumption (2c). If the matrices Wr,N are not row-normalized, Assumption

(2c) is implied by 1 , ,..., 1 max βˆ’ = ⎟⎠ ⎞ ⎜ ⎝ βŽ› = rN R r

AΞ» W for some matrix norm β‹… (see Horn and Johnson,

1985, p. 301). Analogous assumptions are made for the parameters of the spatial autoregressive error process:

) , ( , s s N N N s a a ρ ρ

ρ

∈ βˆ’ , with <aNρs aNρs ≀aρ,s ≀aρ <∞ , 0 , s = 1, …, S, and

βˆ‘

< <∞ = ρ

ρ

A S m N m 1 , .

We take aρ such that max( , )

,..., 1 s S s a aρ ρ =

= holds; the expression aρ will be used to denote an

1 Γ—

S vector with elements aρ. As above, with row-normalized matrices Ms,N, s=1,...,S, the

3

Observe that pre-multiplying an NT Γ— 1 vector with Q0,N transforms its elements into

deviations from cross-section specific sample means taken over time, and that pre-multiplying a vector by Q1,N transforms its elements into cross-section specific sample means. See

(9)

second part of this assumption ensures invertibility of ( ) 1 , ,

βˆ‘

= βˆ’ S m mN mN N M I

ρ

if Aρ =1. If the

matrices Ms,N are not row-normalized, Assumption (2c) is implied by

1 , ,..., 1 max βˆ’ = ⎟⎠ ⎞ ⎜ ⎝ βŽ› = sN S s Aρ M

for some matrix norm β‹… .

(c) The matrices ( ) 1 , ,

βˆ‘

= βˆ’ R r N r N r N W I

Ξ»

and ( ) 1 , ,

βˆ‘

= βˆ’ S m N m N m N M

I

ρ

are nonsingular for

) , ( r r N N r a aΞ» Ξ»

Ξ»

∈ βˆ’ and ( r, r) N N s a aρ ρ

ρ

∈ βˆ’ , respectively. This ensures that uN and yN are

uniquely identified through equations (4a) and (4b).

Assumption 3.

The row and column sums of the matrices Wr,N, r=1,...,R, Ms,N, s=1,...,S,

1 1 , , ) ( βˆ’ =

βˆ‘

βˆ’ R r rN rN N W I

Ξ»

, and 1 1 , , ) ( βˆ’ =

βˆ‘

βˆ’ S m mN mN N M

I

ρ

are bounded uniformly in absolute value.

(See Remark A.1 in Appendix A for a definition of row and column sum boundedness.)

In light of Assumptions 1-3 and Remark A.1 in the Appendix, it follows that E(uN)=0 and the variance-covariance matrix of uN is given by

βˆ‘

βˆ‘

= βˆ’ = βˆ’ βŠ— βˆ’ β€² βˆ’ βŠ— = β€² = S m mN mN N T S m mN mN N N T N N N E 1 1 , , 1 , 1 , , , (u u ) [I (I M ) ]Ω [I (I M ) ] Ωu

ρ

Ξ΅

ρ

, (9a) and

βˆ‘

βˆ‘

= βˆ’ = βˆ’ βˆ’ β€² βˆ’ + = β€² S m mN mN N S m mN mN N v N N t t E 1 1 , , 1 1 , , 2 2 ) ( ) )( ( )] ( ) ( [u u

Οƒ

ΞΌ

Οƒ

I

ρ

M I

ρ

M . (9b)

Note that all variables and parameters except for the variances of the error components are allowed to depend on sample size N. Such a specification is consistent, for example, with models where the weights matrix is row-normalized and the number of neighbours of a given cross-sectional unit depends on sample size (see Kapoor, Kelejian, and Prucha, 2007, p. 102) or where the strength of interdependence (in terms of the spatial autoregressive parameters) changes with the number of neighbours. Note that XN is allowed to depend on sample size and may thus also contain spatial lags of exogenous variables. As a result, the model specification in equations (1a)-(1c) is fairly general, allowing for higher-order spatial dependence in the dependent variable, the explanatory variables, and the disturbances.

(10)

III. GM Estimation of a SAR(S) Model

In the following, we consider GM estimators for the spatial autoregressive parameters of the disturbance process in equation (1c) and derive the asymptotic joint distribution of all model parameters.

1. Moment Conditions

With an S-th order process (SAR(S), with S >1), the GM estimators of the parameters

N S N , ,

1 ,...,

ρ

ρ

,

Οƒ

v2, and

Οƒ

12 can be obtained by recognizing that – under Assumptions 1 and 2 – the moment conditions used by Kapoor, Kelejian, and Prucha (2007) hold for each matrix

N s,

M , s=1,...,S. In particular, define for each Ms,N, s=1,...,S

] ) ( )[ ( ) ( 1 , , , , ,

βˆ‘

= βŠ— βˆ’ βŠ— = βŠ— = S m N N m T N m N N s T N N s T N s I M Ξ΅ I M u I M u Ξ΅

ρ

. (10)

A word on notation is in order here. In equation (10), subscript s has been introduced together with m to indicate that, with higher-order spatial processes, Ms,N and Mm,N meet in Ξ΅s,N. While we will use index s to refer to the matrix Ms,N, by which Ξ΅N is pre-multiplied in equation (10), index m is required for the summation over the terms

ρ

m,NMm,N.

The moment conditions are then given by

Ma 0, 0, ] 2 ) 1 ( 1 [ ] ) 1 ( 1 [ N N N N N N v T N E T N E β€² =

Οƒ

βˆ’ = β€² βˆ’ Ξ΅ Q Ξ΅ v Q v , (11) M1,s ( ) 1 ] ) ( ) 1 ( 1 [ ] ) 1 ( 1 [ s,N 0,N s,N N 0,N T s,N s,N 0,N N v2 tr s,N s,N N T N E T N E Ξ΅ Q Ξ΅ vβ€²Q I βŠ—Mβ€² M Q v = Mβ€² M βˆ’ = β€² βˆ’

Οƒ

, M2,s ( ) ] 0 ) 1 ( 1 [ ] ) 1 ( 1 [ , 0, β€² 0, βŠ— β€², 0, = βˆ’ = β€² βˆ’ sN N N N N T sN N N T N E T N E Ξ΅ Q Ξ΅ v Q I M Q v , Mb 1, 1, ) 12 1 ( ] ) ( 1 [ ) 1 ( β€²N N N = β€²N Tβ€² T βŠ— N N + β€²N N N =

Οƒ

N E N E N E Ξ΅ Q Ξ΅ ΞΌ e e I ΞΌ v Q v , M3,s ( ) ] 1 [ ] ) ( 1 [ ) 1 ( s,N 1,N s,N N T T s,N s,N N N 1,N T s,N s,N 1,N N N E N E N E Ξ΅β€² Q Ξ΅ = ΞΌβ€² eβ€²e βŠ—Mβ€² M ΞΌ + vβ€²Q I βŠ—Mβ€² M Q v ) ( 1 , , 2 1 tr sN sN N Mβ€² M =

Οƒ

, M4,s (1 β€²s,N 1,N N)= [1 β€²N( Tβ€² T βŠ— β€²s,N) N]+ [1 β€²N 1,N( T βŠ— β€²s,N) 1,N N]=0 N E N E N E Ξ΅ Q Ξ΅ ΞΌ e e M ΞΌ v Q I M Q v ,

where

Οƒ

12 =

Οƒ

v2+T

Οƒ

ΞΌ2. The moment conditions associated with matrices Ms,N, s=1,...,S, through (10), are indexed with subscripts 1 to 4. The remaining two moment conditions,
(11)

which do not depend on s, are denoted as Ma and Mb. For an S-th order process as given by equation (2c), we thus have (4S+2) moment conditions.4

Substituting equations (3b), (10), and (1c) into the 4S+2 moment conditions (11) yields a (4S+2) equation system in ( ,..., , , 12)

2 , ,

1

ρ

Οƒ

Οƒ

ρ

N S N v , which can be written as

0

Ξ“

Ξ³N βˆ’ NbN = , (12)

where bN is a [2S+S(Sβˆ’1)/2+2]Γ—1 vector, given by

) , , ,..., ,..., , ,..., , ,..., ( 12 2 , , 1 , , 1 , 2 , 1 2 , 2 , 1 , , 1 β€² =

ρ

N

ρ

SN

ρ

N

ρ

SN

ρ

N

ρ

N

ρ

N

ρ

SN

ρ

Sβˆ’ N

ρ

SN

Οƒ

v

Οƒ

N b ,

i.e., bN contains S linear terms

ρ

m,N, m=1,...,S, S quadratic terms

ρ

m2,N, m=1,...,S,

2 / ) 1 (Sβˆ’

S cross products

ρ

m,N

ρ

l,N, m=1,...,S βˆ’1 ,l=m+1,...,S, as well as 2 v

Οƒ

and

Οƒ

12. For

later reference, we define the (S+2)Γ—1 vector of all parameters as

) , , ,..., ( ) , , ( 2 1 2 , , 1 2 1 2 β€²= β€² β€² = N

Οƒ

v

Οƒ

ρ

N

ρ

SN

Οƒ

v

Οƒ

N ρ θ . N

Ξ³ is a (4S+2)Γ—1 vector with elements [

Ξ³

i,N], i=1,...,(4S+2), and Ξ“N is a )

2 4

( S+ Γ—[2S+S(Sβˆ’1)/2+2] matrix with elements [

Ξ³

i,j,N], i=1,...,(4S+2), ] 2 2 / ) 1 ( 2 [ ,..., 1 + βˆ’ + = S S S

j . The elements

Ξ³

i,N and

Ξ³

i,j,N will be defined below. The

row-index of the elements Ξ³N and Ξ“N will be chosen such that the equation system (12) has the following order. The first four rows correspond to the moment restrictions M1,1 to M4,1 associated with matrix M1,N through (10); rows five to eight correspond to M1,2 to M4,2

associated with matrix M2,N, and so forth; rows (Sβˆ’4) to 4 correspond to the MS 1,S to M4,S associated with the matrix MS,N. Finally, rows (4S+1) and (4S+2) correspond to the moment conditions Ma and Mb, respectively, which do not depend on s.

4

Notice that further moment conditions are available through pre- and post-multiplying Q0,N

and Q1,N with Ρr,′N and Ρs,N, r≠s, respectively, in moment conditions M1 and M3. The associated efficiency gain will depend on the properties of the weights matrices. If the two involved weights matrices are orthogonal, the corresponding moment condition is trivially satisfied and does not add any information. For the sake of brevity, we use the moment conditions given in (11), and leave an assessment of the potential efficiency gains from exploiting further moment conditions for future research.

(12)

The sample analogue to equation system (12) is given by ) ( ~ ~ N N N N N Ξ“ ΞΈ Ξ³ βˆ’ b =

Ο‘

, (13)

where the elements of ~ and Ξ³N Ξ“~N are equal to those of Ξ³N and Ξ“N with the expectations operator suppressed and the disturbances uN replaced by (consistent) estimates u~ . N

GM estimates of the parameters

ρ

1,N,...,

ρ

S,N, 2 v

Οƒ

and

Οƒ

12are then obtained as the solution to

)] ( ~ ) ( [ )] ~ ~ ( ~ ) ~ ~ [( min arg 2 1 2 2 1, ,.., , , N N N N N N N N N N N N v S ΞΈ Θ ΞΈ Ξ“ Ξ³ Θ Ξ“ Ξ³ Ο‘ Ο‘ Οƒ Οƒ ρ ρ ρ β€² = βˆ’ β€² βˆ’ b b , (14)

i.e., the parameter estimates can be obtained from a (weighted) non-linear least squares regression of ~ on the columns of Ξ³N Ξ“~N; )Ο‘N(ΞΈN can then be viewed as a vector of regression residuals. The optimal choice of the (4S+2)Γ—(4S+2) weighting matrix ΘN and its estimation will be discussed below.

In the following, we define the elements of Ξ³N and Ξ“N, grouped by the corresponding moment conditions. Thereby, we use the following notation:

N N s T N s I M u u, =( βŠ— , ) , s=1,...,S, and (15a) N N m N s T N N m T N s T N sm I M I M u I M M u u , =( βŠ— , )( βŠ— , ) =( βŠ— , , ) , s=1,...,S, m=1,...,S. (15b) At this point, we need to introduce an index l=1,...,S for proper definition of the elements of

N

Ξ³ and Ξ“N.

Moment condition M1,s delivers s=1,...,S rows of equation system (12), appearing in rows

1 ) 1 (

4 sβˆ’ + with the following elements of Ξ³N and Ξ“N:

) 1 ( 1 , 1 ) 1 ( 4 βˆ’ = + βˆ’ T N N s Ξ³ E(usβ€²,NQ0,Nus,N), (16a) ) ( ) 1 ( 2 , , 0 , , , 1 ) 1 ( 4 s mN E sN N smN T N βˆ’ uβ€² Q u = + βˆ’ Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 1 ) 1 ( 4sβˆ’ + S+mN =βˆ’N Tβˆ’ E usmβ€² NQ NusmN Ξ³ , m=1,...,S, ) ( ) 1 ( 2 , , 0 , , 2 / ) 1 ( ) 1 ( , 1 ) 1 ( 4 s S m mm l mN E smN N slN T N βˆ’ uβ€² Q u βˆ’ = βˆ’ + βˆ’ βˆ’ + + βˆ’ Ξ³ , m=1,...,Sβˆ’1, l=m+1,...,S,

(13)

) ( 1 , , , 1 2 / ) 1 ( 2 , 1 ) 1 ( 4s S S S N tr sN sN N Mβ€² M = + βˆ’ + + βˆ’

Ξ³

, 0 , 2 2 / ) 1 ( 2 , 1 ) 1 ( 4sβˆ’ + S+S Sβˆ’ + N =

Ξ³

.

Moment condition M2,s consists of s=1,...,S rows of equation system (12), appearing in rows

2 ) 1 (

4 sβˆ’ + with the following elements of Ξ³N and Ξ“N:

) ( ) 1 ( 1 , 0 , , 2 ) 1 ( 4 s N E sN N N T N βˆ’ uβ€² Q u = + βˆ’ Ξ³ , (16b) ) ( ) 1 ( 1 , , 0 , , 0 , , , 2 ) 1 ( 4s mN E smN N N sN N mN T N βˆ’ uβ€² Q u +uβ€² Q u = + βˆ’ Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 2 ) 1 ( 4sβˆ’ + S+mN =βˆ’N T βˆ’ E usmβ€² NQ NumN Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 0 , , 2 / ) 1 ( ) 1 ( , 2 ) 1 ( 4s S m mm l mN E slN N mN smN N lN T N βˆ’ uβ€² Q u +uβ€² Q u βˆ’ = βˆ’ + βˆ’ βˆ’ + + βˆ’ Ξ³ , m=1,...,Sβˆ’1, S m l= +1,..., , 0 , 1 2 / ) 1 ( 2 , 2 ) 1 ( 4sβˆ’ + S+S Sβˆ’ + N =

Ξ³

, 0 , 2 2 / ) 1 ( 2 , 2 ) 1 ( 4sβˆ’ + S+S Sβˆ’ + N =

Ξ³

.

Moment condition M3,s corresponds to s=1,...,S rows of equation system (12), appearing in

rows 4(sβˆ’1)+3 with the following elements of Ξ³N and Ξ“N:

N N s 1 , 3 ) 1 ( 4 βˆ’ + =

Ξ³

E(usβ€²,NQ1,Nus,N), (16c) ) ( 2 , , 1 , , , 3 ) 1 ( 4 s mN E sN N smN N uβ€² Q u = + βˆ’

Ξ³

, m=1,...,S, ) ( 1 , , 1 , , , 3 ) 1 ( 4s S mN E smN N smN N uβ€² Q u βˆ’ = + + βˆ’

Ξ³

, m=1,...,S, ) ( 2 , , 1 , , 2 / ) 1 ( ) 1 ( , 3 ) 1 ( 4sβˆ’ + S m+ βˆ’mmβˆ’ +lβˆ’mN =βˆ’N E usmβ€² NQ NuslN

Ξ³

, m=1,...,Sβˆ’1, l=m+1,...,S, 0 , 1 2 / ) 1 ( 2 , 3 ) 1 ( 4sβˆ’ + S+S Sβˆ’ + N =

Ξ³

, ) ( 1 , , , 2 2 / ) 1 ( 2 , 3 ) 1 ( 4 s S S S N tr sN sN N Mβ€² M = + βˆ’ + + βˆ’

Ξ³

.

Moment condition M4,s represents s=1,...,S rows of equation system (12) appearing in rows

4 ) 1 (

(14)

) ( 1 , 1 , , 4 ) 1 ( 4s N E sN N N N uβ€² Q u = + βˆ’

Ξ³

, (16d) ) ( 1 , , 1 , , 1 , , , 4 ) 1 ( 4sβˆ’ + mN = N E usmβ€² NQ NuN +usβ€²NQ NumN

Ξ³

, m=1,...,S, ) ( 1 , , 1 , , , 4 ) 1 ( 4s S mN E smN N mN N uβ€² Q u βˆ’ = + + βˆ’

Ξ³

, m=1,...,S, ) ( 1 , , 1 , , , 1 , , 2 / ) 1 ( ) 1 ( , 4 ) 1 ( 4 s S m mm l mN E slN N mN smN N lN N uβ€² Q u +uβ€² Q u βˆ’ = βˆ’ + βˆ’ βˆ’ + + βˆ’

Ξ³

, m=1,...,Sβˆ’1, l=m+1,...,S, 0 , 1 2 / ) 1 ( 2 , 4 ) 1 ( 4sβˆ’ + S+S Sβˆ’ + N =

Ξ³

, 0 , 2 2 / ) 1 ( 2 , 4 ) 1 ( 4sβˆ’ + S+S Sβˆ’ + N =

Ξ³

.

Moment condition Ma reflects one equation of the system in (12), appearing in row (4S+1) with the following elements of Ξ³N and Ξ“N:

) ( ) 1 ( 1 , 0 , 1 4S N E N N N T N βˆ’ uβ€² Q u = + Ξ³ , (16e) ) ( ) 1 ( 2 , 0 , , , 1 4S+ mN = N T βˆ’ E umβ€² NQ NuN Ξ³ , m=1,...,S, ) ( ) 1 ( 1 , , 0 , , , 1 4S S mN E mN N mN T N βˆ’ uβ€² Q u βˆ’ = + + Ξ³ , m=1,...,S, ) ( ) 1 ( 2 , , 0 , , 2 / ) 1 ( ) 1 ( , 1 4S S m mm l mN E mN N lN T N βˆ’ uβ€² Q u βˆ’ = βˆ’ + βˆ’ βˆ’ + + Ξ³ , m=1,...,Sβˆ’1, l=m+1,...,S, 1 , 1 2 / ) 1 ( 2 , 1 4S+ S+S Sβˆ’ + N =

Ξ³

, 0 , 2 2 / ) 1 ( 2 , 1 4S+ S+S Sβˆ’ + N =

Ξ³

.

Moment condition Mb is associated with one equation of the system in (12), appearing in row )

2 4

( S+ with the following elements of Ξ³N and Ξ“N:

) ( 1 , 1 , 2 4S N E N N N N uβ€² Q u = +

Ξ³

, (16f) ) ( 2 , 1 , , , 2 4S mN E mN N N N uβ€² Q u = +

Ξ³

, m=1,...,S, ) ( 1 , , 1 , , , 2 4S+ S+mN =βˆ’N E uβ€²mNQ NumN

Ξ³

, m=1,...,S, ) ( 2 , , 1 , , 2 / ) 1 ( ) 1 ( , 2 4S S m mm l mN E mN N lN N uβ€² Q u βˆ’ = βˆ’ + βˆ’ βˆ’ + +

Ξ³

, m=1,...,Sβˆ’1, l=m+1,...,S, 0 , 1 2 / ) 1 ( 2 , 2 4S+ S+S Sβˆ’ + N =

Ξ³

, 1 , 2 2 / ) 1 ( 2 , 2 4S+ S+S Sβˆ’ + N =

Ξ³

.
(15)

This completes the specification of the elements of the matrices Ξ³N and Ξ“N. The similarity of the structure between the expressions resulting from the moment conditions Ma, M1,s, and M2,s on the one hand and Mb, M3,s, and M4,s on the other hand is apparent. First, they differ by the normalization factor and the corresponding matrix of quadratic forms, Q0,N and Q1,N,

respectively. Second, the rows in (12) associated with Ma, M1,s and M2,s,s=1,...,S, do not

depend on Οƒ12 whereas the rows associated with Mb, M3,s, and M4,s, s=1,...,S, do not depend

on

Οƒ

v2. This fact will be used to define an initial GM estimator, which is based on a subset of moment conditions (Ma, M1,s, and M2,s) only, in order to obtain an estimate of the optimal

weighting matrix ΘN.

For future reference, we define the (2S+1)Γ—1 vector Ξ³0N as the sub-vector containing rows s and (s+1), s=1,...,S and row (4S+1)of Ξ³N, corresponding to M1,s, M2,s, and Ma.

Moreover, we define the (2S+1)Γ—[2S+S(Sβˆ’1)/2+1] matrix Ξ“0N as the sub-matrix containing rows s and (s+1), s=1,...,S, and row (4S+1) of Ξ“N, corresponding to M1,s, M2,s, and Ma.

Analogously, we define the (2S+1)Γ—1 vector Ξ³1N as the sub-vector containing rows 2s, )

1 2

( s+ , s=1,...,S, and row (4S+2) of Ξ³N, corresponding to M3,s, M4,s, and Mb. Finally, we

define the (2S+1)Γ—[2S+S(Sβˆ’1)/2+1] matrix Ξ“1N as the sub-matrix containing rows 2s, )

1 2

( s+ , s=1,...,S, and (4S+2) of Ξ“N, corresponding to M3,s, M4,s, and Mb.

2. Definition of GM Estimators

We next define three alternative GM estimators for the spatial autoregressive parameters of the disturbance process given by (1c) and the variances of the error components.5

2.1. Initial GM Estimation

The initial GM estimator is a special case of (14), using the identity matrix as weighting matrix ΘN and a subset of moment conditions (Ma, M1,s and M2,s) only. It is thus based on the

the vector Ξ³0N and the matrix Ξ“0N. Define ΞΈ0N as the corresponding parameter vector that excludes Οƒ12, i.e., ΞΈ0 =(ρ′N,Οƒv2)=(ρ1,N,...,ρS,N,Οƒv2), and accordingly

) , ,..., ,..., , ,..., , ,..., ( 2 , , 1 , , 1 , 2 , 1 2 , 2 , 1 , , 1 0 = β€² βˆ’ N SN v S N S N N N N S N N S N N ρ ρ ρ ρ ρ ρ ρ ρ ρ ρ Οƒ b .

The initial GM estimator is then obtained as the solution to

5

See Kapoor, Kelejian, and Prucha (2007) for analogous conditions under SARAR(0,1) estimation, assuming only nonstochastic regressors in equation (1a).

(16)

} ] , 0 [ , ), ( ) ( min{ arg ) , ,..., ( 2, 0 0 0 0 2 , , 1,N ρSN ΟƒvN = Ο‘N N β€²Ο‘N N βˆ’ ≀ ≀ ΟƒvN∈ bv ρ) ) ) ΞΈ ΞΈ aρ ρ aρ , (17a) with Ο‘N0(ΞΈ0)=Ο‘N0(ρ,Οƒv2)=(~Ξ³0N βˆ’Ξ“~0Nb0).

Using these initial estimates of (

ρ

1,N,...,

ρ

S,N) and

2 v

Οƒ

, the parameter Οƒ12 can be estimated

from moment condition Mb:

βˆ‘

βˆ‘

= = βˆ’ β€² βˆ’ = S m mN mN N N S m mN mN N N N 1 , , , 1 1 , , 2 , 1 ) ~ ~ ( ) ~ ~ ( 1 u u Q u u

ρ

ρ

Οƒ

) ) ) (17b) ... ~ ~ ... ~ ~ 2 , 1 1 , 2 4 , , 2 4 , 1 1 , 2 4 2 4S Ξ³ S N Ξ³ S S S N Ξ³S S N Ξ³ + βˆ’ +

ρ

) βˆ’ βˆ’ +

ρ

) βˆ’ + +

ρ

) = . ~ ... ~ ~ , , 1 2 / ) 1 ( 2 , 2 4 , 2 , 1 1 2 , 2 4 2 , 2 , 2 4S S SN Ξ³ S S N N Ξ³ S S S S S N SN Ξ³ +

ρ

) βˆ’ + +

ρ

)

ρ

) βˆ’ βˆ’ + + βˆ’

ρ

) βˆ’

ρ

) βˆ’ 2.2. Weighted GM Estimation

While the initial GM estimator as defined in (17) is consistent (as will be shown below), it is inefficient. First, it ignores the information contained in moment conditions (Mb, M3,s and

M4,s).

6

Second, it is well known from the literature on generalized method of moments estimation, that it is optimal to use as weighting matrix the inverse of the (properly normalized) variance-covariance matrix of the moments, evaluated at true parameter values. Denote the optimal weighting matrix, which will be derived in Subsection 3.2, by Ξ¨βˆ’N1. In general, the optimal weighting matrix is unknown and has to be estimated, e.g., using the results from the initial GM estimation. In Subsection 3.3 we derive a consistent estimator of

1 βˆ’ N

Ξ¨ , referred to as Ξ¨~βˆ’N1. The optimally weighted GM estimator is based on all (4S+2) moment conditions and uses Θ~N =Ξ¨~βˆ’N1 as the weighting matrix for the moment conditions. It is defined as: } ] , 0 [ ], , 0 [ , ), ( ~ ) ( { min arg ) ~ , ~ , ~ ,..., ~ ( 12 2 2 1, 2 , , 1,N

ρ

SN

Οƒ

vN

Οƒ

N =

Ο‘

N β€² N

Ο‘

N βˆ’ ≀ ≀

Οƒ

v∈ bv

Οƒ

∈ c

ρ

ΞΈ Θ ΞΈ aρ ρ aρ , with cβ‰₯bv+TbΞΌ, and

Ο‘

N(ΞΈ)=

Ο‘

N(ρ,

Οƒ

v2,

Οƒ

12)= (~Ξ³N βˆ’~Ξ“Nb). (18)

As already mentioned, the optimal weighting matrix is derived without distributional assumptions. As a consequence it involves third and fourth moments of the error components

6

This does not mean that any GM estimator using all moment conditions is necessarily superior. In fact, Kapoor, Kelejian, and Prucha (2007) show in a Monte Carlo study on their SAR(1) model that the initial GM estimator performs much better (in terms of bias and RMSE) than the unweighted GM estimator using all moment conditions. Their results suggest that a proper weighting of the moment conditions, in particular the weighting of moment conditions Ma, M1, and M2 relative to moment conditions Mb, M3, and M4 is of crucial importance.

(17)

N it

v , and

ΞΌ

i,N. Kapoor, Kelejian, and Prucha (2008) use the assumption that Ξ΅it,N is normally distributed to obtain a simplified weighting matrix as an approximation of the true optimal weighting matrix. For comparison, we also consider this simplified optimal weighting matrix, which is shown to be a special case of Ξ¨βˆ’N1 in the Appendix and referred to as (Ξ¨oN)βˆ’1. The simplified weighted GM estimator is defined as the weighted GM estimator given in (18), using Θ~N =(Ξ¨~oN)βˆ’1.

3. Asymptotic Properties of the GM Estimator for ΞΈN

3.1 Consistency

In order to prove consistency, the following additional assumptions are required:

Assumption 4.

Assume that u~N βˆ’uN =DNΞ”N, i.e., u~i,N βˆ’ui,N =di.,NΞ”N, for i=1,...,NT,7 where DN is an P

NTΓ— matrix, the 1Γ—P vector di.,N denotes the i-th row of DN and Ξ”N is a PΓ—1 vector. Let dij,N be the j-th element of di.,N. For some Ξ΄ >0, we assume that ≀ <∞

+ d N ij t c d E , ( )2 Ξ΄ ,

where cd does not depend on N, and that N1/2 Ξ”N =Op(1).

Assumption 4 will be fulfilled in many settings, e.g., if model (1a) contains endogenous variables (such as spatial lags of yN) and is estimated using two-stages least squares. In that case, Ξ”N denotes the difference between the parameter estimates and the true parameter values and di.,N is the (negative of the) i-th row of the design matrix ZN (compare Lemma 1 in Subsection 2 of Section IV). Under certain conditions, Assumption 4 will also be satisfied if model (1a) involves a non-linear specification (see Kelejian and Prucha, 2008, p. 12). Finally, observe that Assumption 4 implies that

βˆ‘

= + βˆ’ NT i N i NT 1 2 ., 1 ) ( d Ξ΄ is Op(1). Assumption 5.

(a) The smallest eigenvalues of Ξ“0Nβ€²Ξ“0N and 1 1 N NΞ“

Ξ“ β€² are bounded away from zero, i.e., 0 ) ( * min β€² β‰₯

Ξ»

>

Ξ»

i N i N Ξ“

Ξ“ for i = 1, 2. (b) Θ~N βˆ’Ξ˜N =op(1), where ΘN are (4S+2)Γ—(4S+2) nonstochastic, symmetric, positive definite matrices. (c) The largest eigenvalues of ΘN are bounded uniformly from above, and the smallest eigenvalues of ΘN are bounded uniformly away from zero.

7

Note that we use single indexation i=1,...,NT to refer to the elements of the vectors that are stacked over time periods. (See the remark on notation in Appendix A.)

(18)

Assumption 5 implies that the smallest eigenvalues of Ξ“β€²NΞ“N and Ξ“β€²NΘNΞ“N are bounded uniformly away from zero, ensuring that the true parameter vector ΞΈN is identifiable unique. Moreover, by the equivalence of matrix norms, it follows from Assumption 5 that ΘN and

1 βˆ’ N

Θ are O(1).

Assumptions 1-5 ensure consistency of the GM estimators for ( , , 12)

2

Οƒ

Οƒ

v N N ρ

ΞΈ = β€² . We

summarize these results in the following theorems, which are proved in Appendix B.

Theorem 1a.Consistency of Initial GM Estimator ~0

N

ΞΈ

Suppose Assumptions 1-5 hold. Then, provided the optimization space contains the parameter space, the initial GM estimators ( 1, ,..., , 2, )

0 = β€² N v S N N ρ) ρ) Οƒ) )

ΞΈ defined by (17a), and Οƒ)12,N, defined

by (17b) are consistent for

ρ

1,N,...,

ρ

S,N, 2 v

Οƒ

, and Οƒ12, i.e., 0 , s, p N s N βˆ’

ρ

β†’

ρ

) , s=1,...,S, 2 2 0 , p v N v βˆ’

Οƒ

β†’

Οƒ

) , and 2 0 1 2 , 1 p N βˆ’

Οƒ

β†’

Οƒ

) as N β†’βˆž.

Theorem 1b.Consistency of Weighted GM Estimator ΞΈ~N

Suppose Assumptions 1-5 hold. Then, provided the optimization space contains the parameter space, the weighted GM estimators θ~N(Θ~N)=[ρ~1,N(Θ~N),...,ρ~S,N(Θ~N),

Οƒ

~v2,N(Θ~N),

Οƒ

~12,N(Θ~N)]β€² defined by (18) are consistent for

ρ

1,N,...,

ρ

S,N,

Οƒ

v2, and

Οƒ

12, i.e.,

0 ) ~ ( ~ , s, p N s N N βˆ’

ρ

β†’

ρ

Θ , s=1,...,S,

Οƒ

~v2,N(Θ~N)βˆ’

Οƒ

v2β†’p 0 , and

Οƒ

~12,N (Θ~N)βˆ’

Οƒ

12β†’p 0 as N β†’βˆž. This result holds for an arbitrary weighting matrix (that satisfies Assumption 5). Hence, it applies to both the optimally weighted GM estimator defined by (18) with Θ~N =(Ξ¨~N)βˆ’1and the simplified optimally weighted GM estimator ΞΈ~No with Θ~N =(Ξ¨~oN)βˆ’1.

3.2 Asymptotic Distribution of GM Estimator for ΞΈN

In the following we consider the asymptotic distribution of the optimally weighted GM estimator ΞΈ~N. To establish asymptotic normality of ~ (~ , ~ ,~12, )

2 ,N N v N N ρ

Οƒ

Οƒ

ΞΈ = , we need some additional assumptions. Assumption 6.

Let DN be defined as in Assumption 4, such that u~N βˆ’uN =DNΞ”N. For any real NΓ—N matrix AN, whose row and column sums are bounded uniformly in absolute value, it holds that )Nβˆ’1Dβ€²NANuN βˆ’Nβˆ’1E(Dβ€²NANuN)=op(1 .

(19)

A sufficient condition for Assumption 6 is, e.g., that the columns of DN are of the form

N N N Ξ  Ξ΅

Ο€ + , where the elements of Ο€N are bounded uniformly in absolute value and the row and column sums of Ξ N are bounded uniformly in absolute value (see Kelejian and Prucha, 2008, Lemma C.2). This will be the case in many applications, e.g., for the model in equation (1a), if DN equals (the negative of) matrix ZN (compare Lemma 1 in Section IV).

Assumption 7.

Let Ξ”N be defined as in Assumption 4. Then, ) 1 ( ) ( ) (NT 1/2Ξ”N = NT βˆ’1/2Tβ€²NΞΎN +op , with TN =(Tvβ€²,N,TΞΌβ€²,N)β€², )ΞΎN =(vβ€²N,ΞΌβ€²N β€², i.e., ) 1 ( ) ( ) ( ) (NT 1/2Ξ”N = NT βˆ’1/2Tvβ€²,NvN + NT βˆ’1/2TΞΌβ€²,NΞΌN +op ,

where TN is an (NT+N)Γ—P-dimensional real nonstochastic matrix whose elements are bounded uniformly in absolute value; Tv,N is of dimension (NTΓ—P) and TΞΌ,N is of dimension (NΓ—P). As remarked above, Ξ”N typically denotes the difference between the parameter estimates and the true parameter values. Assumption 7 will be satisfied by many estimators. In Section IV, we verify that it holds if the model in equation (1a) is estimated by two-stages least squares (TSLS) or feasible generalized two-stages least squares (FGTSLS).

The limiting distribution of the GM estimator of ΞΈNwill be seen to depend on (the inverse of) the matrix Jβ€²NΘNJN and the variance-covariance matrix of a vector of quadratic forms in vN and ΞΌN, denoted as qN. We consider each of these expressions in the following. The

) 2 ( ) 2 4

( S+ Γ— S+ matrix JN of derivatives of the (4S+2)Γ—1 vector of moment conditions in (11) is given by ΞΈ Ξ“ Ξ³ ΞΈ J β€² βˆ‚ βˆ’ βˆ‚ = ( ) ) ( N N N N N b ) , , ,..., ( ,1, , , , , , , 1 N i N i N S i N i j j j j v Οƒ Οƒ = , with (19a) = N s i j, , s N N i N i

ρ

βˆ‚ βˆ’ βˆ‚(Ξ³., Ξ“., b ) , i=1,...,(4S+2), s=1,...,S, = N i v j,Οƒ , v N N i N i

Οƒ

βˆ‚ βˆ’ βˆ‚(Ξ³., Ξ“., b ) , i=1,...,(4S+2), = N i j,Οƒ1, 1 ., ., ) (

Οƒ

βˆ‚ βˆ’ βˆ‚ Ξ³i N Ξ“i NbN , i=1,...,(4S+2),
(20)

Using 0 ΞΈ Ξ³ = β€² βˆ‚ βˆ‚ N

and ignoring the negative sign, we have

N N N N N ρ ΞΈ Ξ“ b Ξ“ B J = β€² βˆ‚ βˆ‚ = ) ( , (19b)

where Ξ“N is defined above and of dimension (4S+2)Γ—[2S+S(Sβˆ’1)/2+2] and BN is a ) 2 ( ] 2 2 / ) 1 ( 2

[ S+S Sβˆ’ + Γ— S+ matrix of the form

) , , , ( 1 β€²2, β€²3, β€²4, β€² = N N N N B B B B B , (20a)

with B1 =(IS,0SΓ—2), )]B2,N =[diagsS=1(2

ρ

s,N),0SΓ—2 , and B3,N =(Bβ€²3,1,N,...,B3β€²,Sβˆ’1,N)β€² is an S

S

S( βˆ’1)/2Γ— matrix, consisting of (Sβˆ’1) vertically arranged blocks B3,m,N, ) 1 ( ,..., 1 βˆ’ = S

m , which have the following structure:

) , , , ( , , , ( ) 2 , , 3mN = CmN dmN EmN 0Sβˆ’mΓ— B , (20b)

where Cm,N is a (Sβˆ’m)Γ—(mβˆ’1) matrix of zeros,8 dm,N is a (Sβˆ’m)Γ—1 vector, defined as )

,...,

( 1, ,

,N m N S N

m =

ρ

+

ρ

d , and Em,N =

ρ

m,NISβˆ’m. Finally, B4,N is a 2Γ—(S +2)matrix, defined as βŽ₯ ⎦ ⎀ ⎒ ⎣ ⎑ = + Γ— Γ— 1 , 0 , 1 , 1 1 1 , 4 S S N 0 0 B . (20c)

For later reference, note that BN has full column rank (S+2); as a consequence, the )

2 ( ) 2

(S+ Γ— S+ matrix Bβ€²NBN is positive definite (see, e.g., Greene, 2003, p. 835).

We next consider the vector qN and its limiting distribution. First, define qN(ΞΈN,Ξ”N) as the 1

) 2 4

( S+ Γ— vector of sample moments as given by (11) with the expectation operator suppressed, evaluated at the true parameter values, and ignoring the deterministic constants:

8

(21)

= ) , ( N N N ΞΈ Ξ” q βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ ⎦ ⎀ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎣ ⎑ β€² β€² β€² β€² β€² β€² β€² β€² β€² β€² βˆ’ N N b N N N a N N N S N N N S N N N S N N N S N N N N N N N N N N N N N N u C u u C u u C u u C u u C u u C u u C u u C u u C u u C u ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ . ~ ~ ~ ~ ~ ~ ~ ~ , , , , 4 , , 3 , , 2 , , 1 , 1 , 4 , 1 , 3 , 1 , 2 , 1 , 1 1 , (21) where

βˆ‘

βˆ‘

= = βˆ’ βŠ— β€² βŠ— β€² βˆ’ βŠ— βˆ’ = S m N m N m N T N N s N s T N S m N m N m N T N s T 1 , , , 0 , , , 0 1 , , , , 1 [ ( )] ( ) [ ( )] ) 1 ( 1 M I I Q M M I Q M I I C

ρ

ρ

,

βˆ‘

βˆ‘

= = βˆ’ βŠ— β€² + βŠ— β€² βˆ’ βŠ— βˆ’ = S m mN mN N T N N s N s T N S m mN mN N T N s T 1 , , , 0 , , , 0 1 , , , , 2 [ ( )] [ ( )] [ ( )] ) 1 ( 2 1 M I I Q M M I Q M I I C

ρ

ρ

,

βˆ‘

βˆ‘

= = βˆ’ βŠ— β€² βŠ— β€² βˆ’ βŠ— = S m mN mN N T N N s N s T N S m mN mN N T N s 1 , , , 1 , , , 1 1 , , , , 3 [I (I M )]Q (I M M )Q [I (I M )] C

ρ

ρ

,

βˆ‘

βˆ‘

= = βˆ’ βŠ— β€² + βŠ— β€² βˆ’ βŠ— = S m mN mN N T N N s N s T N S m mN mN N T N s 1 , , , 1 , , , 1 1 , , , , 4 [ ( )] [ ( )] [ ( )] 2 1 I I M Q I M M Q I I M C ρ ρ ,

βˆ‘

βˆ‘

= = βˆ’ βŠ— β€² βˆ’ βŠ— βˆ’ = S m N m N m N T N S m N m N m N T N a T 1 , , , 0 1 , , , [ ( )] [ ( )] ) 1 ( 1 M I I Q M I I C ρ ρ ,

βˆ‘

βˆ‘

= = βˆ’ βŠ— β€² βˆ’ βŠ— = S m N m N m N T N S m N m N m N T N b 1 , , , 1 1 , , , [I (I M )]Q [I (I M )] C

ρ

ρ

. (22)

By Assumption 3 and Remark A.1 in Appendix A, the row and column sums of the symmetric NTΓ—NT matrices Cp,s,N, p=1,...,4, s=1,...,S, Ca,N, and Cb,N are bounded

uniformly in absolute value. Also, note that C3,s,N, C4,s,N, Cb,N differ from C1,s,N, C2,s,N,

N a,

C only by the normalization and the use of Q1,N versus Q0,N.

In light of (21) and Lemma B.1 (see Appendix B), the elements of N1/2qN(ρN,Ξ”N) can be expressed as

(22)

= ) , ( 2 / 1 N N N N q ΞΈ Ξ” (1) . 2 / 1 , , 2 / 1 2 / 1 , , 2 / 1 2 / 1 , , 4 , , 4 2 / 1 2 / 1 , , 3 , , 3 2 / 1 2 / 1 , , 2 , , 2 2 / 1 2 / 1 , , 1 , , 1 2 / 1 2 / 1 , 1 , 4 , 1 , 4 2 / 1 2 / 1 , 1 , 3 , 1 , 3 2 / 1 2 / 1 , 1 , 2 , 1 , 2 2 / 1 2 / 1 , 1 , 1 , 1 , 1 2 / 1 p N N b N N a N N N a N N a N N N S N N S N N N S N N S N N N S N N S N N N S N N S N N N N N N N N N N N N N N N N N N N N N o N N N N N N N N N N N N N N N N N N N N + βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ βŽ₯ ⎦ ⎀ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎒ ⎣ ⎑ β€² + β€² β€² + β€² β€² + β€² β€² + β€² β€² + β€² β€² + β€² β€² + β€² β€² + β€² β€² + β€² β€² + β€² βˆ’ βˆ’ βˆ’ βˆ’ βˆ’ βˆ’ βˆ’ βˆ’ βˆ’ βˆ’ Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u Ξ” Ξ± u C u , (23) where 2 ( ,, ) 1 , ,sN N psN N p N E D C u Ξ± = βˆ’ β€² , p=1,...,4, s=1,...,S, 2 ( , ) 1 ,N N aN N a N E D C u Ξ± = βˆ’ β€² , and ) ( 2 , 1 ,N N bN N b N E D C u Ξ± = βˆ’ β€²

. By Lemma B.1 the elements of the PΓ—1 vectors Ξ±p,s,N,

4 ,..., 1

=

p , s=1,...,S, Ξ±a,N and Ξ±b,N are bounded uniformly

References

Related documents