Table 6:5 SME definition by EU (Commission Recommendations, 2003)
6.3.1.4 Dynamic Panel Data (DPD) Estimators
An alternative is provided by an array of papers by Harris and Moffat. They argue that if one estimates ๐๐๐ก without other variables that potentially may derive a biased estimation because of an omitted variable problem. They tend to include all variables that may cause changes in productivity and estimate the lagoritmic Cobb Dauglas productivity estimation with system GMM estimator.
173
Given that the primary goal of this study is to investigate the effect of SBRR, other variables, which were defined in the previous section (6.3.1.3) are also included. Briefly, SBRR refers to the Small Business Rates Relief and its lags for each firm i in period t. The variables in vector X were defined in Section 6.3.1.3. Shortly, ๐ captures the initial34 effects
of receiving any relief or the uplift in relief, irrespective of level. Other variables are firm age (a), the regions (r) and broad sectors (s), immediate foreign ownership (IO), ultimate foreign ownership (FO), high growth firm dummy (HGF). To at least partly control for the competition, the model includes Marshall Specialisation (PS), Jacob Diversity (PD) and Herfindahl-Hirschman Index (HHI).
Hence, a logarithmic variable, ๐๐๐ก, is introduced:
๐๐๐ก = โ4๐=0(๐๐ต๐ ๐ ๐กโ๐),๐๐๐ก, ๐๐๐ก, ๐๐๐ก, ๐ ๐๐ก, ๐๐๐๐ก, ๐๐ท๐๐ก, ๐ป๐ป๐ผ๐๐ก, ๐ &๐ท๐๐ก, ๐ป๐บ๐น๐๐ก, ๐น๐๐๐ก, ๐ผ๐๐๐ก Then, the model becomes:
๐๐๐บ๐๐ด๐๐ก = ๐ฝ๐ฟ๐๐๐ฟ๐๐ก+ ๐ฝ๐๐๐๐พ๐๐ก+ ๐๐พ๐๐๐พ๐(๐กโ1)+ ๐๐๐๐๐๐(๐กโ1)+ ๐๐ฟ๐๐๐ฟ๐(๐กโ1)+ ๐๐๐๐๐๐๐ก+ ๐ง๐๐ก๐ + ๐๐๐ก+ ๐๐๐ก
To at least partly deal with unobserved heterogeneity, it is common to apply the within (demeaning) transformation, as in one-way fixed effects models, or to take first differences if the second dimension of the panel is a time series, as carried out by Harris and Moffat (2016) and Harris et al. (2015). Given the large observed time and relatively small number of firms, the lag feature was exploited within this analysis by converting the productivity function to a dynamic form. In other words, controlling for the lagged dependent variable and estimating the equation with GMM system estimator:
๐๐๐บ๐๐ด๐๐ก = ๐ฝ๐บ๐๐ด๐๐๐บ๐๐ด๐(๐กโ1)+ ๐ฝ๐ฟ๐๐๐ฟ๐๐ก+ ๐๐ฟ๐๐๐ฟ๐(๐กโ1)+ ๐ฝ๐๐๐๐พ๐๐ก+ ๐๐พ๐๐๐พ๐(๐กโ1)+ ๐๐๐๐๐๐(๐กโ1)+ ๐๐๐๐๐๐๐ก+ ๐ง๐๐ก๐ + ๐๐๐ก + ๐๐๐ก
The unique feature of DPD models is their capability of first differencing to remove unobserved heterogeneity. Nickell (1981) shows that the demeaning process creates a correlation between the regressor and error because it subtracts the individualโs mean value of the outcome variable and each independent variable from the respective variable. This correlation makes coefficients of the lag dependent variable biased. In other words, the mean of the lagged dependent variable is likely to contain observations of 0 through the period before on y, and the mean error, which is being subtracted from each error term,
174
contains contemporaneous values of error. One of the solutions to this may be first differences of the original model.
6.3.1.4.1 AndersonโHsiao (AH) estimator
However, there is still a correlation between the disturbance process (first-order moving average) and the differenced lagged dependent variable because the disturbance process contains lagged error term. At this point, the AndersonโHsiao estimator could be used to remove individual fixed effects with instrumental variables estimator by constructing instruments for the lagged dependent variable with the second/third lags. Anderson and Hsiaoโs (1982) approach is based on a different form of the original equation: ๐๐๐บ๐๐ด๐๐ก = ๐ฝ๐บ๐๐ด๐๐๐บ๐๐ด๐(๐กโ1)+ ๐ฝ๐(๐ฝ๐๐กโ ๐ฝ๐(๐กโ1)) + ๐ผ๐+ ๐๐๐ก , where ๐ฝ is all independent variables in the model and ๐ผ๐ is individual specific fixed effects. This model in AHโs framework becomes:
๐๐๐บ๐๐ด๐๐กโ ๐ฝ๐บ๐๐ด๐๐๐บ๐๐ด๐(๐กโ1)= (๐๐๐บ๐๐ด๐(๐กโ1)โ ๐ฝ๐บ๐๐ด๐๐๐บ๐๐ด๐(๐กโ2)) + ๐ฝ๐(๐ฝ๐๐กโ ๐ฝ๐(๐กโ1)) + +๐๐๐ก โ ๐๐(๐กโ1)
This cancels individual effects assumed to correlate with exogenous variable, but the difference of lagged endogenous variable is correlated with the error term (๐๐๐ก + ๐๐(๐กโ1)).
AH suggest (๐๐๐บ๐๐ด๐(๐กโ1)โ ๐ฝ๐บ๐๐ด๐๐๐บ๐๐ด๐(๐กโ2)) instrumenting with lagged difference (๐๐๐บ๐๐ด๐(๐กโ2)โ ๐ฝ๐บ๐๐ด๐๐๐บ๐๐ด๐(๐กโ3)) or level instruments (๐๐๐บ๐๐ด๐(๐กโ2)) because these differences should not be correlated with the differenced error term:๐ธ(๐ฆ๐,(๐กโ2)๐๐๐๐ก) = 0 and ๐ธ(๐๐ฆ๐,(๐กโ2)๐๐๐๐ก) = 0.
Later, Holtz-Eakin et al. and Arellano (1989) found level instruments (๐๐๐บ๐๐ด๐(๐กโ2)) to be superior because they both had smaller variance and no points of singularities. Furthermore, when level instruments are used, one year less is lost because of lags.
6.3.1.4.2 ArellanoโBond (AB) or Difference Estimator
In empirical work looking at productivity after Wooldridgeโs (2009) one-step estimation was introduced, Generalised Method of Moments firstly suggested by Holtz- Eakin et al. and populated by Arellano and Bond (1991) has become increasingly popular. The main idea behind the estimator is that the instrumental variables approach does not use all available information, so by including more information, more efficient estimates may be found. They separate ๐ฝ๐๐ก into two parts: ๐ฝ2๐๐ก and ๐ฝ3๐๐ก , where ๐ฝ2๐๐ก consists of strictly exogenous regressors and ๐ฝ3๐๐ก are predetermined regressors (could include lags of ๐๐๐บ๐๐ด as well) and endogenous regressors possibly correlated with the unobserved individual
175
effect. First differencing, as performed in AH estimator also removes individual effects and its associated omitted variable bias:
๐๐๐บ๐๐ด๐๐ก = ๐ฝ๐(๐ฝ2๐๐กโ ๐ฝ2๐(๐กโ1)) + ๐ฝ๐(๐ฝ3๐๐กโ ๐ฝ3๐(๐กโ1)) + ๐๐๐ก โ ๐๐(๐กโ1)
In standard 2SLS, so as well as AH estimator, the first observation is lost by applying the twice-lagged level in the instrument matrix:
( . ๐๐๐บ๐๐ด๐1 โฎ ๐๐๐บ๐๐ด๐(๐กโ๐) ),
where n is all lagged periods.
If the twice lagged instrument is included, two observations are lost:
( . ๐๐๐บ๐๐ด๐1 ๐๐๐บ๐๐ด๐2 โฎ ๐๐๐บ๐๐ด๐(๐กโ2) .. ๐๐๐บ๐๐ด๐1 โฎ ๐๐๐บ๐๐ด๐(๐กโ3) )
To reduce the loss of degrees of freedom, AB constructs a set of instruments from the second lag of ๐๐๐บ๐๐ด , one instrument pertaining to each time period:
( 0 ๐๐๐บ๐๐ด๐1 0 โฎ 0 0 0 ๐๐๐บ๐๐ด๐2 โฎ 0 โฏ โฏ โฏ โฑ โฆ 0 0 0 โฎ ๐๐๐บ๐๐ด๐(๐กโ2))
The columns of this instrument matrix are orthogonal to the transformed errors because the resulting moment conditions correspond to an expectation ๐ธ(๐๐๐บ๐๐ด๐(๐กโ2), ๐น๐ท๐๐๐๐๐๐ ) = 0.
This solution implies that all available lags can be used as instruments, for endogenous variables twice lagged or higher and for predetermined variables that are not strictly exogenous once lagged variables are also valid because they are only correlated with errors dated t-2 or earlier. Therefore, the instrumental matrix becomes:
( 0 ๐๐๐บ๐๐ด๐1 0 0 โฎ 0 0 ๐๐๐บ๐๐ด๐2 0 โฎ 0 0 0 ๐๐๐บ๐๐ด๐3 โฎ 0 0 0 ๐๐๐บ๐๐ด๐2 โฎ 0 0 0 ๐๐๐บ๐๐ด๐1 โฎ โฆ โฆ โฆ โฆ โฑ )
6.3.1.4.3 ArellanoโBover and Blundell and Bond (ABBB) or system estimator
Later, Arellano and Bover (1995) and Blundell and Bond (1998) have shown that the lagged levels might be slightly wrong instruments for first differenced variables, particularly if they follow a random walk, so they provide a modification which includes
176
lagged levels and lagged differences. The chief drawback of the System GMM estimator is the additional restrictions on the initial conditions of the process generating.
6.3.1.4.4 Instrumental variables in general
The motivation to use instrumental variables is to isolate โas good as randomโ variation in the treatment variable, so that selection and unobservable problems could be solved. Revisiting the final equation of the dynamic model:
๐๐๐บ๐๐ด๐๐ก = ๐ฝ๐บ๐๐๐ ๐ ๐๐๐บ๐๐ด๐(๐กโ1)+ ๐ฝ๐ฟ๐๐๐ฟ๐๐ก+ ๐ฝ๐๐๐๐พ๐๐ก+ ๐๐พ๐๐๐พ๐(๐กโ1)+ ๐๐๐๐๐๐(๐กโ1)+ ๐๐ฟ๐๐๐ฟ๐(๐กโ1)+ ๐๐๐๐๐๐๐ก+ ๐ง๐๐ก๐ + ๐๐๐ก+ ๐๐๐ก
Given that the capital was measured, it is directly related to investment: ๐ฝ๐๐๐๐พ๐๐ก = ๐ฝ1๐๐๐ผ๐๐ก+ ๐๐๐ก๐ผ โ ๐ฝ๐๐๐๐พ๐๐ก = ๐ฝ1๐๐๐ผ๐๐ก+ ๐ฝ2๐๐๐2๐๐ก + ๐๐๐ก๐ผ
The main concern is that ๐ธ(๐๐๐ก๐๐๐ก๐ผโ ๐ฝ๐๐๐๐ผ๐๐ก) โ 0, so the path diagram of the endogeneity problem is that ๐ฝ๐๐๐๐พ๐๐ก affects ๐๐๐บ๐๐ด๐๐ก but the error term (๐๐๐ก) influences both ๐ฝ๐๐๐๐พ๐๐ก and ๐๐๐บ๐๐๐ ๐ ๐๐ก.
The typical instrumental variable approach is to find variables that belong to the second equation but not the first one. Therefore, the following assumptions should be kept: a valid, nontrivial, first stage coefficient (๐ฝ2) for observables that belong in the participation equation but not the outcome equation (๐๐๐2๐๐ก) and a valid exclusion restriction (๐ธ[๐ฝ2๐๐๐2๐๐กโ ๐๐๐ก๐ผโ ๐๐๐ผ๐๐ก] = 0)
6.3.1.4.5 Sensitivity Analysis -Different Transformations, Steps and Ways
One way, one step system GMM estimator will be compared to the two ways, two steps system GMM estimator and one way, one step difference estimator. As previously discussed at the beginning of this section, some of these are expected to reduce observations and possibly weaker relationships. For instance, the difference GMM approach transforms the data to remove the fixed effects deals to reduce the inherent endogeneity. More specifically, it uses the first difference transformation. This seems a better choice than the within transformation (discussed in Appendix 10.2.2.1), as that transformation is likely to make each observation in the transformed data endogenous for a firm. The one shortcoming of this transformation is that it increases gaps in unbalanced panels. If some values of a variable are not available, then both values around the value will be missing in the transformed data. This motivates an alternative transformation: the forward orthogonal deviations (FOD) transformation, proposed by Arellano and Bover (1995).
177 6.3.2 Survival Analysis
Up until now, Section 6.3 focused on productivity. It defined the central issues in estimation (Subsection 6.3.1.1) and introduced two ways to estimate factors affecting productivity (6.3.1.2, 6.3.1.3 and 6.3.1.4). The other part of Section 6.3 is devoted to the survival analysis. It was defined and fundamental mechanisms were explained in the Methodology Review Chapter (Section 5.2). The chapter concluded that Cox proportional hazards (CPH) model is the preferred approach to be applied to this study. An advancement in survival analysis made it possible to reduce the amount of limiting assumptions and correct for various estimation flows. It became one of the most frequently applied techniques for the survival analysis because of its semi-parametric nature and theoretical foundations. The first part of this section will focus on implementation and the tests that were employed to test the model.
The Methodology Review Chapter (Section 5.3.1.2) proposed to supplement more standard Cox Regression with survival trees (ST). It also introduced the key concepts of ST. Section 7.3.2 further supplements this by extending the description of the ST for left- truncated and right-censored data which were found to be preferable for this analysis owing to the data structure and their ability to accommodate the longitudinal data.