8.4 Weighted Least Squares Estimation
Before the existence of heteroskedasticity-robust statistics, one needed to know the form of heteroskedasticity
-Het was then corrected using WEIGHTED LEAST SQUARES (WLS)
-This method is still useful today, as if heteroskedasticity can be correctly modeled, WLS becomes more efficient than OLS
-ie: WLS becomes BLUE
8.4 Known Heteroskedasticity
-Assume first that the form of heteroskedasticity is known and expressed as:
) (
)
|
( u X 2 h X
Var
-Where h(X) is some function of the independent variables
-since variance must be positive, h(X)>0 for all valid combinations of X -given a random sample, we can write:
i i
i
i 2 Var ( u | X ) 2 h
8.4 Known Het Example
-Assume that sanity is a function of econometrics knowledge and other factors:
u rs
otherfacto econ
crazy 0 1
-However, by studying econometrics two things happen: either one becomes more sane as one understands the world, or one becomes more crazy as one is pulled into a never-ending vortex of causal
relationships. Therefore:
i i
i X econ
u
Var ( | ) 2
8.4 Known Heteroskedasticity
-Since h is a function of x, we know that:
i i
i i
h X
u E X
X h
u E
2 2
i | ) ( | )
Var(u
and
0 )
| /
(
-Therefore
| h ) (
] )
| /
[( i 2
2 2
2
i i
i i
i h X E u h h
u E
-So inclusion of the h term in our model can solve
heteroskedasticity
8.4 Fixing Het – And Stay Down!
-We therefore have the modified equation:
i i i
ik k
i i i
i i
h u h
x h
x h
h
y ...
1 1
0
-Or alternately:
(8.26)
... * *
* 1 1
0
*
i ik
k i
i x x u
y
-Note that although our estimates for B
Jwill change (and their standard errors become valid), their interpretation is the same as the
straightforward OLS model (don’t try to bring h into your
interpretation)
8.4 Het Fixing – “I am the law”
-(8.26) is linear and satisfied MLR.1
-if the original sample was random, nothing chances so MLR.2 is satisfied -If no perfect collinearity existed before, MLR.3 is still satisfied now -E(ui*|Xi*)=0, so MLR.4 is satisfied
-Var(ui*|Xi*)=σ2, so MLR.5 is satisfied
-if ui has a normal distribution, so does ui*, so MLR. 6 is satisfied
-Thus if the original model satisfies everything but het, the new model satisfies MLR. 1 to 6
8.4 Het Fix – Control the Het Pop
-These BJ* estimates are different from typical OLS estimates and are examples of GENERALIZED LEAST SQUARES (GLS) ESTIMATORS -this GLS estimation provides standard errors, t statistics and F statistics that are valid
-Since these estimates satisfy all 6 CLM assumptions, and because they are BLUE, GLS is always more efficient than OLS -Note that OLS is a special case of GLS where hi=1
8.4 Het Fix – Who broke it anyhow?
-Note that the R2 obtained from this regression is useful for F statistics but is NOT useful for its typical interpretation -this is due to the fact that it explains how much X* explains y*, not how much X explains y
-when GLS estimators are used to correct for heteroskedasticity, they are called WEIGHTED LEAST SQUARES (WLS) ESTIMATORS -most econometric programs have commands to minimize the weighted sum of squared residuals:
i ik
k i
i
x x h
y ... ) /
(
min
0
1 1
28.4 Incorrect Correcting?
What happens if h(x) is misspecified and WLS is run (ie: if one expects x
1to cause het but x
3actually causes het)
1) WLS is still unbiased and consistent (similar to OLS)
2) Standard Errors (thus t and F tests) are no longer valid
-to avoid this, one can always apply a fully robust
inference for WLS (as we say for OLS in 8.2)
-this can be tedious
8.4 Incorrect Correcting?
WLS is often criticized as being better than OLS ONLY IF the form of het is correctly chosen
-one may argue that making some correction for het is better than none at all
-there is always the option of using robust WLS estimation
-in cases of doubt, both robust WLS and robust
OLS results can be reported
8.4 Averages and Het
Heteroskedasticity will always exist when AVERAGES are used
-when using averages, each observation is the sum of all individual observations divided by group size:
i ii
x m
x /
-Therefore in our true regression, our error term is the sum of all individual observations’ error terms divided by group size:
i ii
u m
u /
8.4 Averages and Het
If the individual model is homoskedastic, and no correlation exists between groups, then the average equation is
heteroskedastic with a weight of h
i=1/m
i-In this way larger groups receive more weight in the regression and is due to the fact that
i
m
iu
Var ( )
2/
For example, assume that we run a regression on how math knowledge impacts grades in econ classes.
Bigger classes (Econ 299) would be weighted to give
more information than smaller classes (Econ 349.5 –
Love and Econ.)
8.4 Feasible GLS
-In the previous section we assumed that we knew the form of the heteroskedasticity, h
i(x)
-often this is not the case an we need to use data to estimate h
ihat
-this yields an estimator called FEASIBLE GLS (FGLS) or ESTIMATED GLS (EGLS)
-Although h(x) can be measured many ways, we assume that
k k
k k
x x
x x
e X
h
e X
u Var
...
2 ...
1 1 0
1 1 0
) (
(8.30)
)
|
(
8.4 Feasible GLS
-Note that while the BP test for Het assumed Het was linear, here we allow for non-linear Het
-although testing for linear Het is effective,
correcting for Het has issues with linear models as h(X) could be negative, making Var(u|X)
negative
-since delta is unknown, it must be estimated -using (8.30),
v e
u 2 2 0 1 x 1 ... k x k
-Where v, conditional on X, has a mean of unity
8.4 Feasible GLS
-If we assume v is independent of X,
e x
x
u ) ... k k log( 2 0 1 1
-Where e has zero mean and is independent of X
-note that the intercept changes, which is unavoidable but not drastically important
-as usual, we only have residuals, not errors, so we run the regression and obtain fitted values
k k x x
u ˆ ) ˆ ˆ ... ˆ
g(
oˆ
l 2 0 1 1
-To obtain: ˆ l oˆ g( u i 2 )
i e
h
8.4 FGLS
To use FGLS to correct for Heteroskedasticity,
1) Regress y on all x’s and obtain residuals uhat 2) Create log(uhat
2)
3) Regress log(uhat
2) on all x’s and obtain fitted values ghat
4) Estimate hhat=exp(ghat)
5) Run WLS using weights 1/hhat
8.4 FGLS
If we used the actual h(X), our estimator would be unbiased and BEST
-since h(X) is estimated using the same data as FGLS, it is biased and therefore not BEST
-however, FGLS is consistent and asymptotically more efficient than OLS
-therefore FGLS is a good alternative to OLS in large samples
-note that FGLS estimates are interpreted the same as OLS
-note also that heteroskedasticity-robust standard
errors can always be calculated in cases of doubt
8.4 FGLS Alternative
One alternative is to estimate ghat as:
2 2
1 0
2 ) ˆ ˆ ˆ ˆ ˆ
g( ˆ oˆ
l u y y
Using fitted y values from the OLS equation
-This changes step 3 above, but the remaining steps are the same
-Note that the Park (1996) test is based on FGLS but is inferior to our previous tests due to FGLS only being consistent