Chapter 10: Basic Linear Unobserved Effects Panel Data Models:
Microeconomic Econometrics I Spring 2010
10.1 Motivation: The Omitted Variables Problem
We are interested in the partial effects of the observable explanatory variables xjin the popu- lation regression function
E( y|x1, x2, ..., xk, c) (10.1)
• An unobserved,time-constant variable is called an unobserved effect in panel data analy- sis.
• Such a data set is usually called a balanced panel because the same time periods are available for all cross section units.
10.2 Assumptions about the Unobserved Effects and Explanatory Variables
10.2.1 Random or Fixed Effects?
• The basic unobserved effects model (UEM) can be written, for a randomly drawn cross section observation i , as
yit= xitβ+ ci+ uit, t = 1,2,..., T (10.11)
• In addition to unobserved effect, there are many other names given to ciin applications:
unobserved component, latent variable, and unobserved heterogeneity are common.
• If i indexes individuals, then ci is sometimes called an individual effect or individual heterogeneity; analogous terms apply to families, firms, cities, and other cross-sectional units.
• The uit are called the idiosyncratic errors or idiosyncratic disturbances because these change across t as well as across i .
• Nevertheless, later we will label two different estimation methods random effects esti- mation and fixed effects estimation.
10.2.2 Strict Exogeneity Assumptions on the Explanatory Variables
With an unobserved effect the most revealing form of the strict exogeneity assumption is
E( yit|xi1, xi2, ..., xiT, ci) = E(yit|xit, ci) = xitβ+ ci (10.12)
• When assumption (10.12) holds, we say that the xit: t = 1,2,..., T are strictly exogenous conditional on the unobserved effect ci.
• Example10.1(Program Evaluation):
log(wa geit) =θt+ zitγ+δ1pro git+ ci+ uit (10.16)
• Example 10.2 (Distributed Lag Model):
patentsit=θt+ zitγ+δ0RDit+δ1RDi,t−1+ · · · +δ5RDi,t−5+ ci+ uit (10.17)
• Example 10.3 (Lagged Dependent Variable):
log(wa geit) =β1log(wa gei,t−1) + ci+ uit, t = 1,2,..., T (10.18)
10.3 Estimating Unobserved Effects Models by Pooled OLS
Under certain assumptions, the pooled OLS estimator can be used to obtain a consistent esti- mator ofβin model (10.11). Write the model as
yit= xitβ+υit, t = 1,2,..., T (10.21)
whereυit≡ ci+ uit, t = 1,2,...T are the composite errors.
10.4 Random Effects Methods
10.4.1 Estimation and Inference under the Basic Random Effects Assumptions
• As with pooled OLS, a random effects analysis puts ci into the error term.
• Assumption RE.1:
1. E(uit|xi, ci) = 0, t = 1,..., T.
2. E(ci|xi) = E(ci) = 0 where xi≡ (xi1, xi2, . . . , xiT).
• Assumption RE.2: rank E(X0iΩ−1Xi) = K.
• WhenΩ has the form (10.31), we say it has the random effects structure.
• Assumption RE.3:
1. E(uiu0i|xi, ci) =σ2uIT. 2. E(c2i|xi) =σ2c
• In a panel data context, the FGLS estimator that uses the variance matrix (10.33) is what is known as the random effects estimator:
βˆRE=³XN
i=1
X0iΩˆ−1Xi´−1³XN
i=1
X0iΩˆ−1yi´
(10.34)
• Example 10.4(RE Estimation of the Effects of Job Training Grants):
log( ˆscra p) = .415 − .093 d88 − .270 d89 + .548 union − .215 grant − .377 grant−1
(.243) (.109) (.132) (.411) (.148) (.205)
10.4.2 Robust Variance Matrix Estimator
10.4.3 A General FGLS Analysis
If the idiosyncratic errors uit: t = 1,2,..., T are generally heteroskedastic and serially corre- lated across t, a more general estimator ofΩ can be used in FGLS:
Ω = Nˆ −1XN
i=1
ˆˆviˆˆv0i (10.38)
10.4.4 Testing for the Presence of an Unobserved Effect
From equation (10.37), we base a test of H0:σ2c= 0 on the null asymptotic distribution of
N−1/2
N
X
i=1 T−1X
t=1 T
X
s=t+1
υˆitυˆis (10.39)
which is essentially the estimator ˆσ2c scaled up byp N.
10.5 Fixed Effects Methods
10.5.1 Consistency of the Fixed Effects Estimator
Again consider the linear unobserved effects model for T time periods:
yit= xitβ+ ci+ uit, t = 1,..., T (10.41)
• Assumption FE.1: E(uit|xi, ci) = 0, t = 1,2,..., T.
• In this section we study the fixed effects transformation, also called the within transfor- mation.
• The fixed effects (FE) estimator, denoted by ˆβF E,is the pooled OLS estimator from the regression
¨yiton ¨xit, t = 1,2,..., T; i = 1,2,..., N (10.48)
• This set of equations can be obtained by premultiplying equation (10.42) by a time- demeaning matrix.
• Assumption FE.2: rank³ PT
t=1E( ¨x0it¨xit)´
= rankh
E( ¨X0iX¨i)i
= K.
• It is also called the within estimator because it uses the time variation within each cross section.
• The between estimator, which uses only variation between the cross section observations, is the OLS estimator applied to the time-averaged equation (10.45).
10.5.2 Asymptotic Inference with Fixed Effects
• Assumption FE.3: E(uiu0i|xi, ci) =σ2uIT.
• To see how to estimateσ2u, we use equation (10.51) summed across t :PT
t=1E( ¨u2it) = (T − 1)σ2u, and so [N(T − 1)]−1PN
i=1
PT
t=1E( ¨u2it) =σ2u.Now, define the fixed effects residuals as
ˆ
uit= ¨yit− ¨xitβˆF E, t = 1,2,..., T; i = 1,2,..., N (10.55)
which are simply the OLS residuals from the pooled regression (10.48).
• Example 10.5(FE Estimation of the Effects of Job Training Grants):
log( ˆscra p) = −.080 d88 − .247 d89 − .252 grant − .422 grant−1
(.109) (.133) (.151) (.210)
10.5.3 The Dummy Variable Regression
• The estimator ofβobtained from regression (10.57) is, in fact, the fixed effects estimator.
This is why ˆβF E is sometimes referred to as the dummy variable estimator.
10.5.4 Serial Correlation and the Robust Variance Matrix Estimator
Applying equation (7.26), the robust variance matrix estimator of ˆβF Eis
Ava ˆr( ˆβF E) = ( ¨X0X )¨ −1¡
N
X
i=1
X¨0iuˆiuˆ0iX¨i¢( ¨X0X )¨ −1 (10.59)
• Example 10.5 (continued):
log( ˆscra p) = −.080 d88 − .247 d89 − .252 grant − .422 grant−1
(.109) (.133) (.151) (.210) [.096] [.193] [.140] [.276]
10.5.5 Fixed Effects GLS
• Assumption FEGLS.3: E(uiu0i|xi, ci) =Λ, a T × T positive definite matrix.
• The fixed effects GLS (FEGLS) estimator is defined by
βˆF EGLS=¡
N
X
i=1
X¨i0Ωˆ−1X¨i¢−1¡
N
X
i=1
X¨i0Ωˆ−1¨yi¢
where ¨Xiand ¨yiare defined with the last time period dropped.
• Assumption FEGLS.2: rankE( ¨X0iΩˆ−1X¨i) = K.
10.5.6 Using Fixed Effects Estimation for Policy Analysis
10.6 First Differencing Methods
10.6.1 Inference
• Assumption FD.1: Same as Assumption FE.1.
Lagging the model (10.41) one period and subtracting gives
∆yit=∆xitβ+∆uit, t = 2,3,..., T (10.63)
where∆yit= yit− yi,t−1,∆xit= xit− xi,t−1, and∆uit= uit− ui,t−1.
• As with the FE transformation, this first-differencing transformation eliminates the un- observed effect ci.
• The first-difference (FD) estimator, ˆβFD, is the pooled OLS estimator from the regression
∆yiton∆xit, t = 2,..., T; i = 1,2,..., N (10.65)
• Assumption FD.2: rank³ PT
t=2E(∆x0it∆xit)´
= K.
• Assumption FD.3: E(eie0i|xi1, . . . , xiT, ci) =σ2eIT−1, where ei is the (T − 1) × 1 vector con- taining eit,t = 2,..., T.
10.6.2 Robust Variance Matrix
If Assumption FD.3 is violated, then, as usual, we can compute a robust variance matrix. The estimator in equation (7.26) applied in this context is
A ˆvar( ˆβF D) = (∆X0∆X)−1³XN
i=1
∆X0iˆeiˆe0i∆Xi
´
(∆X0∆X)−1 (10.70)
where∆X denotes the N(T −1)× K matrix of stacked first differences of xit.
• Example 10.6 (FD Estimation of the Effects of Job Training Grants):
∆log(scrapit) =δ1+δ2d89t+β1∆grantit+β2∆granti,t−1+∆uit
10.6.3 Testing for Serial Correlation
The regression is based on T − 2 time periods:
ˆeit= ˆp1ˆei,t−1+ errorit, t = 3,4,..., T; i = 1,2,..., N (10.71)
• Example10.6 (continued):
10.6.4 Policy Analysis Using First Differencing
In this one case,∆progi=∆progi2,and the first-differenced equation can be written as
∆yi2=θ2+∆zi2γ+δ1 pro gi2+∆ui2 (10.72)
• When ∆zi2 is omitted, the estimate of δ1 from equation (10.72) is the difference-in- differences (DID) estimator (see Problem 10.4): ˆ∆1=∆ytreat −∆ycortrol.
10.7 Comparison of Estimators
10.7.1 Fixed Effects versus First Differencing
With more than two time periods, a test of strict exogeneity is a test of H0:γ= 0 in the expanded equation
∆yt=∆xtβ+ wtγ+∆ut, t = 2,..., T
where wtis a subset of xt(that would exclude time dummies).
10.7.2 The Relationship between the Random Effects and Fixed Effects Estimators
Using the fact that j0TjT= T, we can writeΩ under the random effects structure as
Ω =σ2uIT+σ2cjTj0T=σ2uIT+ Tσ2cjT( j0TjT)−1j0T=σ2uIT+ Tσ2cPT= (σ2u+σ2c)(PT+ηQT)
where PT≡ IT− QT= jT( j0TjT)−1j0T andη≡σ2u/(σ2u+ Tσ2c).
• Equation (10.76) shows that the random effects estimator is obtained by a quasitime demeaning: rather than removing the time average from the explanatory and dependent variables at each t, random effects removes a fraction of the time average.
• Example 10.7 (Job Training Grants):
10.7.3 The Hausman Test Comparing the RE and FE Estimators