• No results found

Dynamic Panel Data (DPD) Estimators

Table 6:5 SME definition by EU (Commission Recommendations, 2003)

6.3.1.4 Dynamic Panel Data (DPD) Estimators

An alternative is provided by an array of papers by Harris and Moffat. They argue that if one estimates ๐œ”๐‘–๐‘ก without other variables that potentially may derive a biased estimation because of an omitted variable problem. They tend to include all variables that may cause changes in productivity and estimate the lagoritmic Cobb Dauglas productivity estimation with system GMM estimator.

173

Given that the primary goal of this study is to investigate the effect of SBRR, other variables, which were defined in the previous section (6.3.1.3) are also included. Briefly, SBRR refers to the Small Business Rates Relief and its lags for each firm i in period t. The variables in vector X were defined in Section 6.3.1.3. Shortly, ๐œŒ captures the initial34 effects

of receiving any relief or the uplift in relief, irrespective of level. Other variables are firm age (a), the regions (r) and broad sectors (s), immediate foreign ownership (IO), ultimate foreign ownership (FO), high growth firm dummy (HGF). To at least partly control for the competition, the model includes Marshall Specialisation (PS), Jacob Diversity (PD) and Herfindahl-Hirschman Index (HHI).

Hence, a logarithmic variable, ๐‘‹๐‘–๐‘ก, is introduced:

๐‘‹๐‘–๐‘ก = โˆ‘4๐‘—=0(๐‘†๐ต๐‘…๐‘…๐‘กโˆ’๐‘—),๐œŒ๐‘–๐‘ก, ๐‘Ž๐‘–๐‘ก, ๐‘Ÿ๐‘–๐‘ก, ๐‘ ๐‘–๐‘ก, ๐‘ƒ๐‘†๐‘–๐‘ก, ๐‘ƒ๐ท๐‘–๐‘ก, ๐ป๐ป๐ผ๐‘–๐‘ก, ๐‘…&๐ท๐‘–๐‘ก, ๐ป๐บ๐น๐‘–๐‘ก, ๐น๐‘‚๐‘–๐‘ก, ๐ผ๐‘‚๐‘–๐‘ก Then, the model becomes:

๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘ก = ๐›ฝ๐ฟ๐‘™๐‘›๐ฟ๐‘–๐‘ก+ ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก+ ๐œ•๐พ๐‘™๐‘›๐พ๐‘–(๐‘กโˆ’1)+ ๐œ•๐‘€๐‘™๐‘›๐‘€๐‘–(๐‘กโˆ’1)+ ๐œ•๐ฟ๐‘™๐‘›๐ฟ๐‘–(๐‘กโˆ’1)+ ๐œ•๐‘‹๐‘™๐‘›๐‘‹๐‘–๐‘ก+ ๐‘ง๐‘–๐‘ก๐œ‡ + ๐œ‰๐‘–๐‘ก+ ๐‘’๐‘–๐‘ก

To at least partly deal with unobserved heterogeneity, it is common to apply the within (demeaning) transformation, as in one-way fixed effects models, or to take first differences if the second dimension of the panel is a time series, as carried out by Harris and Moffat (2016) and Harris et al. (2015). Given the large observed time and relatively small number of firms, the lag feature was exploited within this analysis by converting the productivity function to a dynamic form. In other words, controlling for the lagged dependent variable and estimating the equation with GMM system estimator:

๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘ก = ๐›ฝ๐บ๐‘‰๐ด๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’1)+ ๐›ฝ๐ฟ๐‘™๐‘›๐ฟ๐‘–๐‘ก+ ๐œ•๐ฟ๐‘™๐‘›๐ฟ๐‘–(๐‘กโˆ’1)+ ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก+ ๐œ•๐พ๐‘™๐‘›๐พ๐‘–(๐‘กโˆ’1)+ ๐œ•๐‘€๐‘™๐‘›๐‘€๐‘–(๐‘กโˆ’1)+ ๐œ•๐‘‹๐‘™๐‘›๐‘‹๐‘–๐‘ก+ ๐‘ง๐‘–๐‘ก๐œ‡ + ๐œ‰๐‘–๐‘ก + ๐‘’๐‘–๐‘ก

The unique feature of DPD models is their capability of first differencing to remove unobserved heterogeneity. Nickell (1981) shows that the demeaning process creates a correlation between the regressor and error because it subtracts the individualโ€™s mean value of the outcome variable and each independent variable from the respective variable. This correlation makes coefficients of the lag dependent variable biased. In other words, the mean of the lagged dependent variable is likely to contain observations of 0 through the period before on y, and the mean error, which is being subtracted from each error term,

174

contains contemporaneous values of error. One of the solutions to this may be first differences of the original model.

6.3.1.4.1 Andersonโ€“Hsiao (AH) estimator

However, there is still a correlation between the disturbance process (first-order moving average) and the differenced lagged dependent variable because the disturbance process contains lagged error term. At this point, the Andersonโ€“Hsiao estimator could be used to remove individual fixed effects with instrumental variables estimator by constructing instruments for the lagged dependent variable with the second/third lags. Anderson and Hsiaoโ€™s (1982) approach is based on a different form of the original equation: ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘ก = ๐›ฝ๐บ๐‘‰๐ด๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’1)+ ๐›ฝ๐‘—(๐ฝ๐‘–๐‘กโˆ’ ๐ฝ๐‘–(๐‘กโˆ’1)) + ๐›ผ๐‘–+ ๐‘’๐‘–๐‘ก , where ๐ฝ is all independent variables in the model and ๐›ผ๐‘– is individual specific fixed effects. This model in AHโ€™s framework becomes:

๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘กโˆ’ ๐›ฝ๐บ๐‘‰๐ด๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’1)= (๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’1)โˆ’ ๐›ฝ๐บ๐‘‰๐ด๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2)) + ๐›ฝ๐‘—(๐ฝ๐‘–๐‘กโˆ’ ๐ฝ๐‘–(๐‘กโˆ’1)) + +๐‘’๐‘–๐‘ก โˆ’ ๐‘’๐‘–(๐‘กโˆ’1)

This cancels individual effects assumed to correlate with exogenous variable, but the difference of lagged endogenous variable is correlated with the error term (๐‘’๐‘–๐‘ก + ๐‘’๐‘–(๐‘กโˆ’1)).

AH suggest (๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’1)โˆ’ ๐›ฝ๐บ๐‘‰๐ด๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2)) instrumenting with lagged difference (๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2)โˆ’ ๐›ฝ๐บ๐‘‰๐ด๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’3)) or level instruments (๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2)) because these differences should not be correlated with the differenced error term:๐ธ(๐‘ฆ๐‘–,(๐‘กโˆ’2)๐‘‘๐‘’๐‘–๐‘ก) = 0 and ๐ธ(๐‘‘๐‘ฆ๐‘–,(๐‘กโˆ’2)๐‘‘๐‘’๐‘–๐‘ก) = 0.

Later, Holtz-Eakin et al. and Arellano (1989) found level instruments (๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2)) to be superior because they both had smaller variance and no points of singularities. Furthermore, when level instruments are used, one year less is lost because of lags.

6.3.1.4.2 Arellanoโ€“Bond (AB) or Difference Estimator

In empirical work looking at productivity after Wooldridgeโ€™s (2009) one-step estimation was introduced, Generalised Method of Moments firstly suggested by Holtz- Eakin et al. and populated by Arellano and Bond (1991) has become increasingly popular. The main idea behind the estimator is that the instrumental variables approach does not use all available information, so by including more information, more efficient estimates may be found. They separate ๐ฝ๐‘–๐‘ก into two parts: ๐ฝ2๐‘–๐‘ก and ๐ฝ3๐‘–๐‘ก , where ๐ฝ2๐‘–๐‘ก consists of strictly exogenous regressors and ๐ฝ3๐‘–๐‘ก are predetermined regressors (could include lags of ๐‘™๐‘›๐บ๐‘‰๐ด as well) and endogenous regressors possibly correlated with the unobserved individual

175

effect. First differencing, as performed in AH estimator also removes individual effects and its associated omitted variable bias:

๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘ก = ๐›ฝ๐‘—(๐ฝ2๐‘–๐‘กโˆ’ ๐ฝ2๐‘–(๐‘กโˆ’1)) + ๐›ฝ๐‘—(๐ฝ3๐‘–๐‘กโˆ’ ๐ฝ3๐‘–(๐‘กโˆ’1)) + ๐‘’๐‘–๐‘ก โˆ’ ๐‘’๐‘–(๐‘กโˆ’1)

In standard 2SLS, so as well as AH estimator, the first observation is lost by applying the twice-lagged level in the instrument matrix:

( . ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–1 โ‹ฎ ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’๐‘›) ),

where n is all lagged periods.

If the twice lagged instrument is included, two observations are lost:

( . ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–1 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–2 โ‹ฎ ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2) .. ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–1 โ‹ฎ ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’3) )

To reduce the loss of degrees of freedom, AB constructs a set of instruments from the second lag of ๐‘™๐‘›๐บ๐‘‰๐ด , one instrument pertaining to each time period:

( 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–1 0 โ‹ฎ 0 0 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–2 โ‹ฎ 0 โ‹ฏ โ‹ฏ โ‹ฏ โ‹ฑ โ€ฆ 0 0 0 โ‹ฎ ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2))

The columns of this instrument matrix are orthogonal to the transformed errors because the resulting moment conditions correspond to an expectation ๐ธ(๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’2), ๐น๐ท๐‘’๐‘Ÿ๐‘Ÿ๐‘œ๐‘Ÿ๐‘ ) = 0.

This solution implies that all available lags can be used as instruments, for endogenous variables twice lagged or higher and for predetermined variables that are not strictly exogenous once lagged variables are also valid because they are only correlated with errors dated t-2 or earlier. Therefore, the instrumental matrix becomes:

( 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–1 0 0 โ‹ฎ 0 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–2 0 โ‹ฎ 0 0 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–3 โ‹ฎ 0 0 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–2 โ‹ฎ 0 0 0 ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–1 โ‹ฎ โ€ฆ โ€ฆ โ€ฆ โ€ฆ โ‹ฑ )

6.3.1.4.3 Arellanoโ€“Bover and Blundell and Bond (ABBB) or system estimator

Later, Arellano and Bover (1995) and Blundell and Bond (1998) have shown that the lagged levels might be slightly wrong instruments for first differenced variables, particularly if they follow a random walk, so they provide a modification which includes

176

lagged levels and lagged differences. The chief drawback of the System GMM estimator is the additional restrictions on the initial conditions of the process generating.

6.3.1.4.4 Instrumental variables in general

The motivation to use instrumental variables is to isolate โ€œas good as randomโ€ variation in the treatment variable, so that selection and unobservable problems could be solved. Revisiting the final equation of the dynamic model:

๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘ก = ๐›ฝ๐บ๐‘Ÿ๐‘œ๐‘ ๐‘ ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–(๐‘กโˆ’1)+ ๐›ฝ๐ฟ๐‘™๐‘›๐ฟ๐‘–๐‘ก+ ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก+ ๐œ•๐พ๐‘™๐‘›๐พ๐‘–(๐‘กโˆ’1)+ ๐œ•๐‘€๐‘™๐‘›๐‘€๐‘–(๐‘กโˆ’1)+ ๐œ•๐ฟ๐‘™๐‘›๐ฟ๐‘–(๐‘กโˆ’1)+ ๐œ•๐‘‹๐‘™๐‘›๐‘‹๐‘–๐‘ก+ ๐‘ง๐‘–๐‘ก๐œ‡ + ๐œ‰๐‘–๐‘ก+ ๐‘’๐‘–๐‘ก

Given that the capital was measured, it is directly related to investment: ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก = ๐›ฝ1๐‘™๐‘›๐ผ๐‘–๐‘ก+ ๐‘’๐‘–๐‘ก๐ผ โ†’ ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก = ๐›ฝ1๐‘™๐‘›๐ผ๐‘–๐‘ก+ ๐›ฝ2๐‘™๐‘›๐‘‹2๐‘–๐‘ก + ๐‘’๐‘–๐‘ก๐ผ

The main concern is that ๐ธ(๐‘’๐‘–๐‘ก๐‘’๐‘–๐‘ก๐ผโƒ“ ๐›ฝ๐‘˜๐‘™๐‘›๐ผ๐‘–๐‘ก) โ‰  0, so the path diagram of the endogeneity problem is that ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก affects ๐‘™๐‘›๐บ๐‘‰๐ด๐‘–๐‘ก but the error term (๐‘’๐‘–๐‘ก) influences both ๐›ฝ๐‘˜๐‘™๐‘›๐พ๐‘–๐‘ก and ๐‘™๐‘›๐บ๐‘Ÿ๐‘œ๐‘ ๐‘ ๐‘–๐‘ก.

The typical instrumental variable approach is to find variables that belong to the second equation but not the first one. Therefore, the following assumptions should be kept: a valid, nontrivial, first stage coefficient (๐›ฝ2) for observables that belong in the participation equation but not the outcome equation (๐‘™๐‘›๐‘‹2๐‘–๐‘ก) and a valid exclusion restriction (๐ธ[๐›ฝ2๐‘™๐‘›๐‘‹2๐‘–๐‘กโˆ— ๐‘’๐‘–๐‘ก๐ผโƒ“ ๐‘™๐‘›๐ผ๐‘–๐‘ก] = 0)

6.3.1.4.5 Sensitivity Analysis -Different Transformations, Steps and Ways

One way, one step system GMM estimator will be compared to the two ways, two steps system GMM estimator and one way, one step difference estimator. As previously discussed at the beginning of this section, some of these are expected to reduce observations and possibly weaker relationships. For instance, the difference GMM approach transforms the data to remove the fixed effects deals to reduce the inherent endogeneity. More specifically, it uses the first difference transformation. This seems a better choice than the within transformation (discussed in Appendix 10.2.2.1), as that transformation is likely to make each observation in the transformed data endogenous for a firm. The one shortcoming of this transformation is that it increases gaps in unbalanced panels. If some values of a variable are not available, then both values around the value will be missing in the transformed data. This motivates an alternative transformation: the forward orthogonal deviations (FOD) transformation, proposed by Arellano and Bover (1995).

177 6.3.2 Survival Analysis

Up until now, Section 6.3 focused on productivity. It defined the central issues in estimation (Subsection 6.3.1.1) and introduced two ways to estimate factors affecting productivity (6.3.1.2, 6.3.1.3 and 6.3.1.4). The other part of Section 6.3 is devoted to the survival analysis. It was defined and fundamental mechanisms were explained in the Methodology Review Chapter (Section 5.2). The chapter concluded that Cox proportional hazards (CPH) model is the preferred approach to be applied to this study. An advancement in survival analysis made it possible to reduce the amount of limiting assumptions and correct for various estimation flows. It became one of the most frequently applied techniques for the survival analysis because of its semi-parametric nature and theoretical foundations. The first part of this section will focus on implementation and the tests that were employed to test the model.

The Methodology Review Chapter (Section 5.3.1.2) proposed to supplement more standard Cox Regression with survival trees (ST). It also introduced the key concepts of ST. Section 7.3.2 further supplements this by extending the description of the ST for left- truncated and right-censored data which were found to be preferable for this analysis owing to the data structure and their ability to accommodate the longitudinal data.