Model evaluation - Materials and Methods - Quantification of land use and land cover in a Monso

3.2 Materials and Methods

3.2.5 Model evaluation

3.2.5.1 Overall regression performance

We used the cross-validation error metrics instead of the default out-of-bag (OOB) error of

CHAPTER 3. MAPPING FRACTIONAL LAND USE AND LAND COVER IN A MONSOON REGION:

THE EFFECTS OF DATA PROCESSING OPTIONS 75

between in-bag training samples and out-of-bag test samples, especially for spatial models. Instead, we used cross-validationRMSE to evaluate regression performance. TheRMSE of the

LULC type i is calculated as

RM SEi = v u u t Pn j=1(fi,j−fˆi,j∗ )2 n (3.5)

where fi,j is the observed and ˆfi,j∗ is the predicted LULC fraction for the type i in pixel j, and

n is the total number of pixels.

Furthermore, we used the coefficient of determination (R2) and Spearman’s rank correlation

coefficient (ρ) (Gibbons et al., 2003) also based on cross-validation. TheR2 _{was used to compare}

our results with the previous studies on fractional cover estimation (e.g. Fernandes et al., 2004). Spearman’s ρ was used to estimate the association between observed and predicted fractions

(Gibbons et al., 2003).

3.2.5.2 Relative contribution of data-processing options

Additionally to cross-validation error, we examined the relationship between the data-processing options and the performance of the fractional cover regression models. For this analysis, we built a linear model explaining the RMSE of the regression model for each LULC type by the

different data-processing options:

RMSEi =β0+β1Op+β2Ot+β3Os+ (3.6)

where RMSEi is the RMSE of the type i; Op is a categorical variable denoting the chosen

predictor set option, Ot time interval option, and Os smoothing option; is the error term.

Note that we did not include interaction terms based on the preliminary model selection using F-statistics (not shown here).

We assumed that the ‘relative contribution’ (‘relative importance’ in Grömping (2006)) of an option is that of the corresponding regressor to the linear model. Then we quantified relative contributions of the regressors by decomposing the amount of explained variance of the linear model due to regressors. We used proportional marginal variance decomposition (PMVD) method (Feldman, 2005; Grömping, 2006) which decomposes the explained variance of the linear model into non-negative contributions, which sum to the total variance explained. PMVD is able to deal with correlated regressors by averaging over different orderings. Moreover, it has desirable properties such as ‘admissibility’.

Each linear model (per type) was estimated based on the 16 samples from all 16 scenarios. Statistical significance of the type-wise models were tested using F-statistics to validate the model structure.

3.2.5.3 Marginal performance of data-processing options

The efficacy of a data-processing option was estimated by average regression performance of the scenarios using the option. We will call it ‘marginal performance’ in the following. The marginal performance (M) of a data-processing option k for a performance metricq is calculated as

Mk,q =

x∈skq(x)

|sk_| (3.7)

where sk is a set of scenarios using the option_k and _|_sk_| is the number of elements of _sk.

3.2.5.4 Relative importance of spectral bands and acquisition dates

We quantified the relative importance of spectral bands and acquisition dates to identify the most relevant ones for the regression performance. RF provides two importance metrics for quantifying the influence of input features (Breiman, 2001; Segal et al., 2011). Among the metrics, we used the increased mean square error (IMSE), which is a permutation-based measure. Another metric

namely increased node purity (INP) is measured by node purity, in case of regression the residual

sum of squares. We avoided to use INP because of the possible bias due to the random sub-

spacing (i.e. random selection of features). For classification, the INP is known to be biased as

the impurity measure (Gini index) favours predictor variables with many categories (Genuer et al., 2010; Strobl et al., 2007). IMSE of a featuref is derived as

IMSEf = Pntree k=1(MSEk−MSEf,k) ntree ×q 1 s2_/n tree (3.8) where ntree is the size of the forest, MSEk is the mean squared OOB error of tree k, M SEf,k

is the error after permuting the feature f and s2 is the standard deviation of the differences

between the two errors; ifs2 is zero, the division is omitted. Due to the cross-validation scheme,

we computed IMSEf in each cross-validation fold and averaged them. Note that the variable

importance metric is calculated from the OOB samples.

Our goal was to assess the relative importance of spectral bands and acquisition dates on the regression model. As we used the time series of multiple spectral bands, input features can be grouped either by band or by acquisition date.

CHAPTER 3. MAPPING FRACTIONAL LAND USE AND LAND COVER IN A MONSOON REGION:

THE EFFECTS OF DATA PROCESSING OPTIONS 77

We defined the importance of a band as the sum of the importance metrics of the features belonging to the band. Let a predictor setX ={x1, ..., xl}have l features some of which belong

to a spectral band b. We calculated importance of the band b as

IMSEb =

x∈bIMSEx

lb (3.9)

where lb is the number of the features belonging to the band.

To facilitate comparisons between different bands, we normalized IMSEb as

NIMSEb =

IMSEb

Pnband

b=1 IMSEb

(3.10) where nband is the number of the bands in a predictor set. To derive NIMSEb we used the two

groups of the scenarios: scenarios using ‘SR’ predictor set (S3, S7, S11, and S15) and scenarios using ‘Full’ predictor set (S4, S8, S12, and S16). As they are different in the number of spectral bands, we calculated two sets of NIMSEb. For each group individually, we calculated the mean

importance measures from the included scenarios.

Likewise the importance of an acquisition date is defined as the sum of the importance metrics of the features acquired at a particular date d:

IMSEd =

x∈dIMSEx

(3.11) where ld is the number of the features acquired at the date d. To derive IMSEd we used the

‘Full’ predictor set based scenarios (S4, S8, S12 and S16). As 8-day and 16-day data differ in the number of data points, we extracted two seasonal IMSEd curves individually by interval.

3.2.6 Software

We used GNU R 3.1.2 (R Core Team, 2014) and the R packages randomForest version 4.6–7

(Liaw, 2012),raster version 2.3–40 (Hijmans, 2014), andrelaimpo version 2.2–2 (Grömping,

2006). The geometry engineGEOS 3.4.2 (GEOS Development Team, 2014) was used via the R

packagergeos0.3–8 (Bivand et al., 2014) and the software TIMESAT 3.1 (Eklundh et al., 2012)

In document Quantification of land use and land cover in a Monsoon agricultural mosaic from space (Page 104-108)