• No results found

Time-invariant Technical Inefficiency Models

3. Chapter 3: Efficiency Measurement, Methods, Estimation, Model Specification

3.8 Panel Data Models of Efficiency Measurement

3.8.2 Time-invariant Technical Inefficiency Models

The stochastic panel model with time-invariant inefficiency can be estimated under either the fixed effects or random effects framework (Wooldridge, 2010). The selection of the framework is dependent on the level of relationship permitted between inefficiency and the explanatory variables of the model (Parmeter and Kumbhakar, 2014). The fixed effects approach permits correlation between π‘₯π‘˜π‘‘ and π‘’π‘˜ , whereas the random effects approach does not.

The received literature on the fixed effects model in the frontier modelling framework is based on Schmidt and Sickles’s (1984) modification on the linear regression model which incorporates a unit-specific intercept in the basic linear model framework. They propose a model that estimates the persistent part of the inefficiency without specifying an explicit distribution of the inefficiency, labelled the distribution free approach.

96 This is not the case with OLS (COLS, MOLS).

97 A key question related to latent heterogeneity (the time-invariant individual effects) is whether the individual effects represent

(persistent) inefficiency, or whether the effects are independent of the inefficiency and reflect (persistent) unobserved heterogeneity.

98 Some of the strong distributional assumptions used to disentangle the separate effects of inefficiency and noise can be relaxed

(Coelli et al., 2005).

75 | P a g e

Classic fixed effects models take advantage of the panel structure to increase the explanatory power of the model by incorporating a unit-specific, time-invariant effect. So, the linear fixed effects model can be reinterpreted as:

ln(qkt) = 𝛽0+ βˆ‘ Ξ²ixkti+ π‘£π‘˜π‘‘ N i=1 βˆ’ π‘’π‘˜ ln(qkt) = 𝛽0βˆ’ π‘’π‘˜+ βˆ‘ Ξ²ixkti+ π‘£π‘˜π‘‘ N i=1 ln(qkt) = π›Όπ‘˜+ βˆ‘ Ξ²ixkti+ π‘£π‘˜π‘‘ N i=1

Which can be estimated consistently and efficiently by OLS after including individual dummies as regressors100 for π›Όπ‘˜β‰‘ 𝛽0βˆ’ π‘’π‘˜. The model is reinterpreted by treating π›Όπ‘˜

as the firm-specific inefficiency effect. The purpose of the effect is to capture the impact of all the factors that are specific to the unit and constant over time, and which have not been included already in the model. This means that the time-invariant units’ heterogeneity, e.g. location characteristics (urban or rural), a person’s ability (when modelling income), prevailing environmental conditions, etc. is reflected in π›Όπ‘˜. To retain the flavor of the frontier model once π›ΌΜ‚π‘˜ is available, the DMUs are compared on the basis of the following transformation to obtain an estimated value of π‘’π‘˜(Schmidt and Sickles, 1984):

For production functions: 𝑒̂ = maxπ‘˜

π‘˜ {𝛼̂ } βˆ’ π›Όπ‘˜ Μ‚π‘˜, π‘˜ = 1, … … … … , 𝐾, where π›ΌΜ‚π‘˜ is the

π‘˜ βˆ’th fixed effects estimate in the within-groups fixed effects linear regression model. This formulation implicitly assumes that the most efficient unit in the sample is 100 percent efficient. Therefore, the estimated inefficiency in the fixed-effects model is relative to the best unit in the sample. The unit-specific TE estimate equals π‘‡πΈπ‘˜= exp (βˆ’π‘’Μ‚)π‘˜ . For cost functions: 𝑒̂ = π›Όπ‘˜ Μ‚ βˆ’ minπ‘˜

π‘˜ {𝛼̂ }π‘˜ , π‘˜ = 1, … … … … , 𝐾 and the cost efficiency estimate equals πΆπΈπ‘˜ = exp (βˆ’π‘’Μ‚)π‘˜ .

The fixed effects approach empoweres the model with an important implication to allow for correlation101 between π‘₯

π‘˜π‘‘ and π‘’π‘˜.This may be a desirable property for empirical applications in which inefficiency is believed to be correlated with the inputs used (Mundlak, 1961). However, the model does not allow for separate identification of

100 This technique is often referred to as the least square dummy variable (LSDV) method. The coefficients of the dummies are the

estimates of π›Όπ‘˜.

101 An important limitation of the FE is that no other time-invariant variables, such as gender, race, region, etc., can be included in

76 | P a g e

inefficiency and individual heterogeneity, and this constitutes an important limitation that the recent literature aimed to capture, i.e. the true fixed effects (TFE) and true random effects (TRE) models (Greene, 2002).

The model proposed by Schmidt and Sickles (1984) is extended by Cornwell et al. (1990) in order to include a time-varying effect, without specifying an explicit distribution of the inefficiency. Early work on the model suggested direct manipulation of the fixed effects term; in other words, the time-varying part of the inefficiency term is defined as:

π›Όπ‘˜π‘‘ = πœƒπ‘˜0+ πœƒπ‘˜1𝑑 + πœƒπ‘˜2𝑑2

Despite the desirable decomposition between the time-invariant component πœƒπ‘˜0 and the time-varying component πœƒπ‘˜1𝑑 + πœƒπ‘˜2𝑑2 some further caveats need to be mentioned. First, it does not leave space for time-invariant heterogeneity that is not inefficiency; second, it assumes a unit-specific quadratic function of time102 to explain the time-

varying part, which might be quite restrictive. Recent panel data literature has tried to relax the assumption of a time-invariant inefficiency in two components (Cornwell and Schmidt, 1996).

Turning to the random effects model developed by Pitt and Lee (1981) the inefficiency term π‘’π‘˜ is assumed to be constant through time and randomly distributed since it must be uncorrelated with independent variables. If so, time-invariant regressors such as gender, race, etc., can be included in the model without leading to collinearity with

π›Όπ‘˜. Also, the RE framework facilitates cases in which independent variables show very little variation between time periods; in such cases, the FE may fail to identify the statistical significance of those variables since all variation is captured by the effects. However, it should be stressed that the RE model is restrictive in the sense that is does not allow for correlation between the RE unobserved time-invariant inefficiency and the independent variables (i.e. the regressors) and noise. When the assumption of no correlation between the covariates and university efficiency is indeed correct, then estimation of the stochastic frontier panel data model by using the RE framework provides more efficient estimates than estimation under the FE framework.

According to Pitt and Lee (1981), distributional assumptions on the random components of the model can be imposed and then estimation of the parameters of the model is feasible through ML.103 The relevant log-likelihood for a random effects model with a

half-normal distribution is derived by Lee and Tyler (1978) and discussed further by Battese and Coelli (1988). Inefficiency is not directly estimated via MLE, so once the parameters are estimated, JLMS-type conditional mean estimators can be used to receive an estimate for the unit-specific inefficiency (Kumbhakar, 1987).

102 Han et al. (2005) propose factor analytic forms for modelling π‘Ž

π‘˜π‘‘

103 An alternative to MLE is the use of generalised least squares (GLS) estimator; see Baltagi (2013) for an analytical review of the

77 | P a g e

The likelihood function for the π‘˜ βˆ’ th observation is Pitt and Lee (1981):

π‘™π‘›πΏπ‘˜ = π‘π‘œπ‘›π‘‘π‘Žπ‘›π‘‘ + 𝑙𝑛Φ ( πœ‡π‘˜βˆ— πœŽβˆ— ) +1 2ln(πœŽβˆ— 2) βˆ’1 2{ π›΄π‘‘πΈπ‘˜π‘‘2 πœŽπ‘£2 + (πœ‡ πœŽπ‘’ ) 2 βˆ’ (πœ‡π‘˜βˆ— πœŽβˆ— ) 2 } βˆ’ 𝑇𝑙𝑛(πœŽπ‘£) βˆ’ ln(πœŽπ‘’) βˆ’ 𝑙𝑛Φ (πœ‡ πœŽπ‘’) Withπœ‡π‘˜βˆ— = πœ‡πœŽπ‘£2βˆ’πœŽπ‘’2π›΄π‘‘πΈπ‘˜π‘‘ πœŽπ‘£2+π‘‡πœŽπ‘’2 andπœŽβˆ— 2 = πœŽπ‘£2πœŽπ‘’2

πœŽπ‘£2+π‘‡πœŽπ‘’2, Ξ€ sample size. For RE no assumptions on the

PDF of the inefficiency are made other than that inefficiency is an independently distributed random variable with non-negative values, i.e. π‘’π‘˜ ∼ 𝑖𝑖𝑑 β‰₯ 0 i.e. a half normal distribution. The stochastic error component is distributed as π‘£π‘˜π‘‘ ∼ 𝑖𝑖𝑑 𝑁(0, πœŽπ‘£2). The estimated parameters from the MLE estimation process are utilised in the next step, where the extended JLMS estimator of inefficiency is used to obtain an estimate of inefficiency. 𝐸(π‘’π‘˜|πΈπ‘˜) = πœ‡π‘˜βˆ—+ πœŽβˆ—[ πœ‘ (βˆ’πœ‡πœŽπ‘˜βˆ— βˆ— ) 1 βˆ’ Ξ¦ (βˆ’πœ‡π‘˜βˆ— πœŽβˆ—) ]

If π‘’π‘˜ ∼ |π‘ˆπ‘˜|, π‘ˆπ‘˜ ∼ 𝑁[0, πœŽπ‘’2] is distributed half-normally,104 then πœ‡ = 0. Also, in some

cases, πœ‡ may be a function of covariates of exogenous determinants of inefficiency, i.e.

πœ‡ = (𝑧′ π‘˜π›Ώ).

As previously mentioned, the inefficiency term here has a time-invariant interpretation since inefficiency levels may vary for different individuals, but they do not change over time therefore, time variation is an issue to be accommodated in the literature. The array of models introduced in the next section aim to give a time-varying dimension to inefficiency since the implications of the time-invariant hypothesis are too restrictive. This implies that an inefficient unit (e.g., a university) does learn over time; therefore, if the latent goal is productivity and efficiency improvements, a time-varying inefficiency framework should be structured.