The Effect of Temporal Gradient of Vegetation Indices on Early-Season Wheat Area Estimation Using Random Forest Classification

(1)

Article

1

The Effect of Temporal Gradient of Vegetation

2

Indices on Early-Season Wheat Area Estimation

3

Using Random Forest Classification

4

Mousa Saei Jamal Abad 1, Ali A. Abkar 2 and Barat Mojaradi 3,*

5

1_{Faculty of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran 19667-15433, Iran;}

6

[email protected]

7

2 _{Faculty of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran 19667-15433, Iran and}

8

AgriWatch BV, Enscede, the Netherlands ; [email protected]

9

3_{Department of Geomatics, School of Civil Engineering, Iran University of Science and Technology, Tehran,}

10

Iran ; [email protected]

11

* Correspondence: [email protected]; Tel.: +98-912-215-2274

12

Abstract: The early-season area estimation of the winter wheat crop as a strategic product is

13

important for decision makers. Classification of multi-temporal images is an approach which is

14

affected by many factors like appropriate training sample size, proper frequency and acquisition

15

times, vegetation indices (VIs) type, temporal gradient of spectral bands and VIs, appropriate

16

classifier and missed values because of cloudy conditions. This paper addresses the impact of

17

appropriate frequency and acquisition times and VIs type along with the spectral and VI gradient

18

on random forest (RF) classifier when missed values exist in multi-temporal images. To investigate

19

the appropriate temporal resolution for image acquisition, the study area was selected on an

20

overlapping area between two LDCM paths. In our developed method, the miss values of cloudy

21

bands for each pixel are retrieved by the mean of k-nearest ordinary pixels. Then the multi-temporal

22

image analysis is performed by considering different scenarios provided by decision makers in

23

terms of desired crop types that should be extracted at early-season in the study areas. The

24

classification results obtained by the RF decrease by 1.6% when temporally missed values retrieved

25

by the proposed method, which is an acceptable result. Moreover, the experimental results

26

demonstrated that if temporal resolution of Landsat 8 increased to one week the classification task

27

can be conducted earlier with almost better results in terms of OA and kappa. The impact of

28

incorporating VIs along with the temporal gradients of spectral bands and VIs as new features in

29

RF demonstrated that the OA and Kappa are improved 3.1% and 6.6%, respectively. Furthermore,

30

the obtained result showed that if only one image from seasonal changes of crops is available, the

31

temporal gradient of VIs and spectral bands play the main role to discriminate remarkably wheat

32

from barley. The experiments also demonstrated that if both wheat and barley merge to a single

33

class the crop area can be estimated two months earlier with 97.1 and 93.5 in terms of OA and kappa,

34

respectively.

35

Keywords: Wheat classification; Random Forest; Spectral gradient difference; Vegetation indices

36

37

1. Introduction

38

The global demand for food will increase due to increasing of the global population in this

39

century [1]. It is therefore necessary to accurately estimate the crop area to determine if the demand

40

for food can be met [2]. Farmers usually decide to change the crop type of their agricultural fields

41

due to the regional demand and drought condition. Therefore, the cultivated area of a given crop

42

alters from a growing season to next season. Consequently, early-season crop area forecasting, in

43

particular wheat crop, is an essential information for policy makers to manage yields for domestic

44

use, imports or exports.

45

(2)

The classification of remotely sensed data can provide regional distribution of individual crop

46

types for government agencies [3,4] and decision makers of commodity trade. Evidently, the

47

classification of agricultural crops using only one image captured at a given date has significant

48

shortcomings [5]. Firstly, the crop types may have different growth stages and secondly, different

49

crops may have similar spectra (e.g. wheat and barley in this case study) during the growing season

50

in a complex agricultural area [6]. It is therefore difficult to distinguish crop types at a certain point

51

in the growth cycle by an image in a given time. In contrast, multi-temporal images provide temporal

52

signatures of crops and represent the growth cycle by capturing data during the growing season [7].

53

Therefore, the temporal signatures of crops may be considered to discriminate crop types [8], and

54

maps of cropland distributions are usually obtained by the supervised classification of

multi-55

temporal images throughout the growing season [9,10]. A significant advantage of using

multi-56

temporal data is that the proportions of different crop types can be discriminated using the spectral

57

differences at different points in the growing season [6,11,12]. Recently, crop areas from non-crop

58

areas have been separated by detecting changes in greenness using time-series normalized difference

59

vegetation index (NDVI) data in [13]. Moreover, an approach based on object-based Normalized

60

Difference Vegetation Index (NDVI) time series analysis is used for regional-scale mapping of

61

agricultural land-use systems [14]. The crop area in early-season was also determined using

62

enhanced vegetation index (EVI) of multi-temporal Moderate Resolution Imaging Spectroradiometer

63

(MODIS) in [11,15]. In their study, the green-up rate of the crop canopy, i.e. the accumulated

64

difference of two consecutive images, is used. The obtained results indicated that multi-temporal

65

remote sensing approach could be used with confidence in early-season crop area prediction at least

66

one to two months ahead of anthesis. Moreover, with increasing number of images, applying

67

different VIs and their differences, the training sample size increases dramatically to overcome the

68

curse of dimensionality for supervised classification [16]. In the study which conducted by [17]

69

different data types including reflectance data, spectral indices, texture indices and DEM features are

70

used in a combined random forest (RF) and object bases image analysis scheme for agricultural area

71

mapping. But these approaches require cloud-free imagery and processing of multiple images is

72

prohibitive for operational implementation over large areas. In this regard, acquiring such frequent

73

cloud-free imagery is a serious limitation in estimating crop area [18,19]

74

Indeed, classification of multi-temporal images is a challenging task which is affected by many

75

factors that limit the accuracy. In particular, these issues are appropriate training sample size, proper

76

frequency and acquisition times and suitable VIs. The effect of these factors also depends on the

77

choice of the classifier [20]. Some studies are conducted to address some of these issues[21-24]. For

78

example, the optimal number and dates of images for land cover classification are identified with

79

feature importance yielded by RF classifier on remote sensing time-series [25]. In another study the

80

sensitivity of the RF classification to select training sample points, including sample size, spatial

81

autocorrelation and proportions of classes within the training sample were investigated [21]. In [26]

82

the effect of the time series length on crop mapping is investigated. In this study RF is used to

83

calculate the important scores for VIs and Jeffries –Matusita (JM) distance is used to measure class

84

separability for each time series. In a comprehensive study also five strategies that can be applied to

85

high resolution multi-temporal optical imagery to produce accurate crop type maps have been

86

selected and benchmarked at the global scale [27]. These strategies are RF, support vector machine

87

(SVM) with a redial basis function kernel, Dempster–Shafer fusion of the previous approaches, best

88

classifier with a mean-shift smoothing, best classifier with a temporal regular resampling. The

89

obtained classification results on the 12 test sites over the globe demonstrated that RF achieved

90

superior results. The RF also is used for winter wheat mapping by multi temporal images of LDCM

91

and Gao Fen 1 satellites [28].

92

In aforementioned circumstances, the study of optimal number of images to find critical

93

acquisition times and temporal resolution along with VIs and their gradient is necessary. This paper

94

will address to this issue that how much importance the temporal resolution of image acquisition for

95

multi temporal images classification and what happen when temporal resolution of Landsat 8

96

increase to one week? For this purpose, the study area is selected on overlap area between two

(3)

Landsat paths. Moreover, missed values problem in cloudy times is another matter which limits the

98

application of multi-temporal image classification. Indeed, many pixels of training or non-training

99

data are ignored if only one cloudy image exist in the time series. Although considerable research has

100

been previously conducted on the time-series classification, the problem of miss values caused by

101

cloudy pixels in such images is still an issue to which less attention has been paid. Indeed, the pixels

102

of data are either cloudless pixels or miss data pixels associated with miss values (i.e. bands with no

103

data) due to the availability of cloudy images in time series. One of the main problems in the

multi-104

temporal image classification is that some pixels of training/testing areas may be covered by cloud

105

during the analysis period. Hence, those pixels are ignored and neither incorporated as training data

106

in the classifier learning process nor as test data for the classification accuracy. On the other hand,

107

the rest of cloudy pixels cannot be classified by classifier. This paper remedies this problem for those

108

training/test data and for rest of cloudy pixels by retrieve of the miss values trough k-nearest neighbor

109

cloudless pixels. This paper proposes a method to retrieve the miss values and study the effect of

110

miss value retrieval of training data on classification results of multi temporal images.

111

This paper also addresses the impact of the appropriate frequency and acquisition times and VIs

112

along with the spectral and VI gradient on the RF classifier when miss values are available in the

113

multi-temporal images. The objective of this paper is therefore to investigate the practical

114

considerations of the multi-temporal image classification approach for consultants and agronomists.

115

The specific goals of this study are: (1) To assess the effect of miss values in training areas, due to

116

cloudy times, on RF classification learning; (2) To determine the appropriate data acquisition times

117

and temporal resolution; (3) To assess the effect of VIs and their temporal gradient along with

118

temporal gradient of spectral bands in different times; and (4) To determine the optimum number of

119

classes to be distinguished in early season. To our knowledge, there are currently no other

120

comprehensive assessments of multi-temporal image classification which takes into consideration

121

these practical aspects.

122

2. Materials and Methods

123

2.1. Study Area and Data: The study area is an agricultural region with an area of 755 km2,

124

located between latitudes 29° 49'–30° 05'N and longitudes 52° 39'–52° 55'E, in Southwest of Iran

125

(Figure 1. The area has various climates during the growing season and is covered by different

126

vegetation, soil, and rocky terrain types. The study area is situated on overlapping area between two

127

paths of 162 and 163, which is covered by 26 scenes of Landsat Data Continuity Mission (LDCM) data

128

captured from 8 Nov 2014 until 26 May 2015 during growing season. In this regard, the temporal

129

resolution in the study area increases from 2 weeks to one week to investigate the appropriate

130

temporal resolution for image acquisition.Among all images, 10 images having less than 20% cloud

131

in the study area are selected which 8 images belonging to path 162 and 2 images to path 163. The

132

acquisition times of these images are given in Table 1.

133

Table 1. Acquisition times of images.

134

Image No. Acquisition Date Image No. Acquisition Date T1 11-JAN-15 T6 17-APR-15

T2 03-FEB-15 T7 03-MAY-15

T3 28-FEB-15 T8 19-MAY-15

T4 07-MAR-15 T9 04-JUN-15

T5 16-MAR-15 T10 20-JUN-15

135

(4)

Figure 1. The study area situated in the overlap area between two paths 162 and 163 of LDCM.

137

The training data also were captured in 264 agriculture fields. The spatial distribution of the

138

training and test areas for the three classes of wheat, barley and other-classes are given in Figure 2

139

and Table 2. As shown, training and test data are selected in disjoint agricultural fields to consider

140

real situation.

141

Figure 2. The spatial distribution of the training and test data for three classes of Wheat, Barley and

142

(5)

Table 2. The number of training and test data in the study area.

144

Class # of training fields # of test fields # of training pixels # of test pixels

Wheat 128 280 559 1265

Barely 15 51 93 307

Other 121 252 1419 3027

2.2. Methodology

145

As mentioned above, the classification of multi-temporal images is affected by many factors such

146

as miss values in cloudy images, optimum training sample size, proper frequency and acquisition

147

times that limit the accuracy. Moreover, extraction of appropriate features (e.g. temporal gradient)

148

from the temporal signatures of crop types along with suitable times such that discriminate crops are

149

two major issues that should be considered in the multi-temporal image classification. In this regard,

150

different features like VIs and the temporal gradient of VIs and spectral bands are taken into account

151

as new features. To address aforementioned issues, we developed following methodology (Figure 3).

152

153

Figure 3. Flow chart describing the method developed in this study for multi-temporal image

154

classification

155

Step 1: A set of images with acceptable cloud percent, 20% in this study, is selected from input

156

time-series images. Radiometric correction is then conducted on the selected images (i.e. 10 images

157

for this study area) and cloudy pixels in each image are set to no data as missed values for all bands.

158

Hence, for each pixel, a feature vector with a dimension of 80 elements (8×10) is achieved.

159

Step 2: The missed values problem limits the use of training/test data to learn the classifier and

160

the rest of cloudy pixels also cannot be classified in the imaging scene. To overcome this shortcoming

161

for ground data, the bands of a pixel corresponding to cloudy times in a given training/testing class

(6)

(e.g. wheat), which contain no data are removed from the feature vector and flagged as cloudy pixels.

163

The corresponding bands from the feature vector of the cloudless pixels of that class are also removed

164

(i.e. the corresponding bands of cloudy times are removed in cloudless pixels of that class) to achieve

165

the same dimensionality with the cloudy pixels. In the reduced multi-dimensional feature space, for

166

each cloudy pixel, k-nearest neighbor cloudless pixels are then found. Therefore, the bands of those

167

pixels with no data are reconstructed by mean of k-nearest cloudless pixels to yield a feature vector

168

with full dimensionality. In other words, the miss values of cloudy bands are filled with the mean of

169

k-nearest from cloudless pixels which are spectrally similar to cloudy pixels in feature space. This is

170

also conducted for the whole cloudy pixels of the scene after the appropriate multi-temporal

171

classification model is selected.

172

Step 3: After miss value retrieval, the VIs are generated. In the current study 9 indices including

173

NDVI, SAVI, SR, EVI, STVI1, STVI3, STVI4, MSAVI1 and MNDVI are generated based on the

174

corresponding equation listed in Table 3, as new features.

175

Table 3. Remote sensing vegetation indices used in this study.

176

No. Index Formula Reference

1 Normalized Difference Vegetation Index

NIR- R

NDVI= _{NIR R}₊ [29]

2 Simple Ratio SR =NIR_R [30]

3 STress related

Vegetation Index1 STVI1=M IR×

(

R N IR

)

[31]

4 STress related

Vegetation Index 3 STVI3=NIR R

(

+MIR

)

[32]

5 STress related

Vegetation Index 4 4

R MIR

STVI NIR

NIR MIR

×

= −

+ [33,34]

6 Enhanced Vegetation

Index 2.5

–

6 7.5 1

NIR

E R

NIR R B

VI

+

 

= ×_ ₋ ₊ _ [8,35]

7 Modified Soil Adjusted Vegetation Index

(

)

(

)

(

2 1? 1 2 ? ?

)

2

NIR NIR NIR R

MSAVI= + + [36]

8

Modified Normalized Difference Vegetation

Index

(

)

(

min

)

– max

max

SWIR SWIR

MNDVI NDVI

SWIR SWIR

= ×

+

[37]

9 Soil Adjusted

Vegetation Index (1 )

NIR R

SAVI L

NIR R L

−

= +

+ + [38]

177

Step 4: Because temporal signature plays an important role in the interpretation of the growing

178

steps of crops, temporal gradient is used to quantitatively describe the seasonal change. The temporal

179

gradient of a given spectral band, TGSB, at the ith ( )and jth( ) times are then generated to obtain

180

the temporal changes of crops as follows.

181

182

TGSB ( , ) = SB ( ) - SB ( ) (10)

The temporal gradients of VIs, TGVI, are also computed and considered as new features, which

183

means for a given VI at the ith ( )and jth( ) times, a set of VI gradients in time sequence is generated

184

as follows:

185

186

TGVI ( , ) =VI ( ) - VI ( ) (11)

Step 5: The RF algorithm is conducted to explore the achieved features including spectral bands

187

at different times, VIs and their temporal gradients to yield an ensemble of decision trees. As

188

mentioned, 8 images from path 162 and 2 images from path 163, having less than 20% cloud cover

(7)

are selected. Therefore, for each pixel, a feature vector is obtained that includes spectral bands, VIs

190

along with their temporal gradients. To investigate the effect of multi-temporal image classification

191

issues by the RF classification, different scenarios are considered according to Table 4.

192

The RF algorithm can generally handle high dimensional data and use a large number of trees

193

in the ensemble. In RF, each tree in the forest is built and tested independently from other trees.

194

During training, each tree receives a new bootstrapped training set generated from the original

195

training set by subsampling with replacement [39]. Those samples which are not included during the

196

training of a tree are called Out-Of-Bag (OOB) samples of that tree. These samples can be used to

197

compute the Out-Of-Bag-Error (OOBE) of the tree as well as the ensemble which is an unbiased

198

estimate of the generalization error [40-42].

199

In the experiments, replacement and OOB are set to 15 times and 33%, respectively. First, the

200

optimum number of training sample size along with the number of trees are investigated. Then, the

201

effect of each issue is assessed to achieve an appropriate model for the multi-temporal classification.

202

Table 4. The different scenarios for the RF classification.

203

Experimental setup

Forest size (# of trees) 10 up to 80

Training sample size 10% up to 50%

Miss values Yes No

VIs Yes No

Temporal gradient of VIs and spectral bands Yes No

# Images 2 up to 10

3. Results and Discussion

204

Indeed, decision makers aim at achieving crop area from remotely sensed data in earlier season

205

with appropriate accuracy. Therefore, according to Table 5, based on the desired classes two cases of

206

classification are determined by the decision makers.

207

Table 5. Two cases of classification.

208

Case # of classes desired classes

1 3 wheat, barely, other

2 2 (wheat & barely),other

3.1 Experiment 1: Optimum Training Sample Size

209

In this experiment the sample size of the training data, set to 10% up to 50% without considering

210

missed values and only spectral bands of 8 images captured on path 162, without their gradient and

211

VIs are used for classification. The RF classification is performed with 15 times replacement and with

212

50 trees (forest size) in case 1. The averages of the overall accuracy (OA) and Kappa coefficient [43]

213

are computed and given in Figure 4.

(8)

Figure 4. Overall accuracy and Kappa coefficient changes to obtain optimum training sample size.

216

With increasing training sample size, the OA and the Kappa coefficient are improved (Figure 4).

217

This improvement is not remarkable when the training sample size become more than 30%. Hence,

218

we selected 30% as appropriate training size due to the cost consuming of filed works.

219

3.2 Experiment 2: Optimum Forest Size

220

Most of experiments used RF classification are conducted with the large number of trees which

221

is time consuming[26,44]. Hence, the optimum forest size for each dataset should be investigated and

222

determined. For this purpose, after achieving the optimum sample size in experiment 1, the optimum

223

forest size is also investigated through increasing trees from 10 up to 80 trees. In this experiment, the

224

RF classification is also conducted with 15 times replacement in case 1 using only the spectral bands

225

of all images. The average values of OAs are given in Figure 5. As demonstrated in Figure 4, the OA

226

becomes almost steady when the number of trees exceeds 50. Hence in this dataset, the optimum

227

forest size and optimum number of training sample size set to 50 trees and 30%, respectively.

228

229

(a) (b)

Figure 5. Effect of the number of trees on (a) overall accuracy (OA) and (b) Kappa coeficient

230

3.3 Experiment 3: The Effect of Miss Values in the Training Data

231

In fact, the pixels of ground data in some images are cloudless while at other times they may be

232

cloudy. In multi-temporal image classification, keeping these pixels is valuable in terms of being time

233

and cost consuming. Moreover, the classification of those pixels which have missed values due to the

234

cloudy times is important when multi-temporal images are used. As mentioned is step2 of the

235

methodology, the miss values (i.e. cloudy bands) of pixels are replaced by the average of k-nearest

236

cloudless pixels. To investigate the effect of the data retrieval method, the retrieval cloudy pixels of

237

the ground data are incorporated to train the RF. Moreover, the retrieval cloudy pixels also are

238

incorporated in test data and are classified. Table 6 shows that the OA value decreases by 1.6 % due

239

to the retrieval miss values by the mean of k-nearest neighbor algorithm. Therefore, it can be

240

concluded that when miss values in the cloudy pixels are retrieved and classified by the RF classifier,

(9)

the certainty of obtaining labels for these pixels are less than those of cloudless pixels. Note that all

242

time-series images are used in this experiment and k is set to 7 in this study.

243

Table 6. The effect of miss values on the overall accuracy in the RF classifier.

244

miss value OA Kappa Product Accuracy wheat

Product Accuracy barely

Product Accuracy others

Yes 93.2 85.3 91.5 51.1 97.7

No 95.4 90.3 96.5 62.6 98.0

3.4 Experiment 4: Appropriate Acquisition Times and Temporal Resolution

245

One of the challenges in using multi temporal data is to provide early information about

246

cultivation area and crop types for decision makers with a few images. Hence, at first only different

247

sequential times of path 162 were selected to assess which acquisition times were suitable for early

248

classification. Then classification was conducted on case 1 using different combinations of acquisition

249

time sequences. The summary of using 8 combinations of acquisition times that yield acceptable

250

results are given in Table 7.

251

Table 7. The (OA) and kappa of the RF classifier using combinations of acquisition times.

252

C o m b . No . 11 -JA N -1 5 03 -F EB-15 28 -F EB-15 07 -M AR-15 16 -M AR-15 17 -AP R -1 5 03 -M AY -1 5 19 -M AY -1 5 04 -JU N -1 5 20 -JU N -1 5 OA Kappa

1  -  -       93.2 85.3

2  -  -      - 93.1 85.2

3  -  -     - - 95.5 90.4

4  -  -    - - - 94.3 87.8

5  -  -   - - - - 91.2 81.2

6  -  -  - - - 86.8 71.5

7  - - - -  - - - - 94.1 87.5

8  - - - -   - - - 95.3 90.1

9     - - - 87.4 72.8

253

The superior result is obtained in Comb.3 when 6 images are used (Table 7). In contrast, an

254

acceptable accuracy in the early season is obtained for decision makers when only thefirst three

255

images of all images are used (i.e. Comb.6). Moreover, a suitable result is also achieved when only 2

256

images are used (i.e. Comb.7). The results may seem remarkable from the processing time standpoint

257

due to using only 2 images.

258

To demonstrate the effect of increasing the temporal resolution of LDCM to one week, the

259

images of path 163 (i.e. dates 03-FEB-15 and 07-MAR-15) are incorporated into the experiment.

260

Therefore, the temporal resolution of LDCM on case study area (i.e. overlap area of two paths of

261

Landsat 8) increased to one week. The experimental results demonstrated that, in this case study, the

262

one-week temporal resolution of LDCM (i.e. Comb. 9), compared to two weeks temporal resolution

263

of LDCM (i.e. Comb.6) achieved better results in terms of OA and kappa. This is due to the fact that

264

only two images from path 163 captured in the first steps of growing season are included. Moreover,

265

early classification (i.e. 07th_{of March) with better result can be yield if the temporal resolution of}

266

Landsat increases to one week. This is promising so that more images with high temporal resolution

267

like Sentinel2A and Sentinel2B can be employed. Although using a constellation of satellites with

268

high spatial and temporal resolution increases the accuracy, certainly a balance between some factors

269

like ground sampling size, processing time, accuracy and early season crop area estimation should

270

be considered.

(10)

3.5 Experiment 5: The Effect of VIs and their Gradient

272

This experiment aims at demonstrating the effect of VIs and their gradients on the classification

273

accuracy improvement and their capabilities to decrease scheduled time for image acquisition. For

274

this purpose, at first NDVI and STVI3 as examples of VIs are considered and temporal signatures of

275

wheat and barley are computed and depicted in Figure 6 for training data. As shown in Fig. 6 wheat

276

and barley have similar reflectance in the early stage of growing, making their discrimination difficult

277

in early season. In contrast to wheat, barley yields faster and loses its chlorophyll almost 3 weeks

278

earlier. Then temporal gradient of wheat and barley is computed and given in Figure 6. Although the

279

temporal signatures of these crops in the first stages of growing is almost the same and have high

280

spectral similarity, as shown in Fig. 6, the temporal gradient at the end of the season can provide

281

appropriate features for better discrimination. In particular, the temporal gradient between images

282

of T6 and T7 which acquired at 17 Apr. and 3 May respectively, could provide the first significant

283

difference between these similar classes for discrimination. In other words, from 17 Apr. to 3 May the

284

wheat stands on the late stage of its grows while barely lost chlorophyll more rapidly.

285

286

(a) (b)

(c) (d)

Figure 6. The NDVI and STVI3 indices and their temporal gradient for: (a) NDVI; (b) STVI3; (c)

287

temporal gradient of NDVI; (d) temporal gradient of STVI3, during growing season.

288

After illustrating the VIs and their temporal gradients, the gradient of spectral bands and the

289

aforementioned VIs (Table 2) along with their temporal gradients (i.e. Eqs. (10 & 11)) are incorporated

290

in the RF classification. Figure 6 and Figure 7 shows the results obtained for 28 combinations of

291

acquisition times when only spectral bands are used. In addition to spectral bands, the effect of

292

incorporating VIs and their spectral gradients obtained by Eq. (10) and Eq. (11) is given in Fig.7 in

293

terms of OA and Kapa coefficient. As demonstrated in Fig. 7 by utilizing the temporal gradient of

294

spectral bands and VIs along with their gradients the OA and Kappa are improved up to 5% and 9%,

295

respectively.

(11)

(a)

(b)

Figure 7. The classification results when only spectral bands are used versus spectral bands along with VIs

298

and their gradients are used: (a) overall accuracy (OA). (b) Kapa coefficient

299

Table 8. The overall accuracy (OA), kappa and producer’s accuracy (PA) of appropriate times with

300

their Vis and gradient by the RF classifier.

301

C

o

m

b

. No

.

11

-JA

N

-1

5

03

-F

EB-15

28

-F

EB-15

07

-M

AR-15

16

-M

AR-15

17

-AP

R

-1

5

03

-M

AY

-1

5

19

-M

AY

-1

5

04

-JU

N

-1

5

20

-JU

N

-1

5

VIs

TGVI&TGS

B

OA

Kappa

PA W

h

eat

PA B

arely

P

A

Ot

he

r

1  -  -  - - - 86.8 71.5 82.4 9.8 95.9

2  -  -  - - -  - 88.1 73.9 83.9 10.9 96.7

3  -  -  - - -   89.9 78.1 86.8 13.0 98.5

4  - - - -   - - - 95.3 90.1 97.7 45.5 99.0

5  - - - -   - - -  - 96.4 92.3 98.2 49.3 100

6  - - - -   - - -   99.3 98.4 99.4 92.5 99.9

(12)

From 28 combinations, 3 combinations which incorporate minimum acceptable number of

302

images such that discriminate classes of case 1 in early season are selected for comparison. To

303

demonstrate the impact of VIs and their gradients the OA, Kappa and product accuracies (PA) of

304

classes are given in Table 8. As shown, when VIs added to comb. 1 (i.e. early season with 2 weeks

305

temporal resolution) as new features the OA and Kappa improves 1.3%. While this improvement is

306

1.8% when utilizing TGVI and TGSB. Therefore, the use of VIs, TGVI and TGBS as new features the

307

OA and Kappa improve 3.1% and 6.6%, respectively. Moreover, when VIs, TGVI and TGBS features

308

added in Comb.4 not only improve the OA and Kappa but also the TGVI and TGBS features

309

particularly discriminate barley from wheat and improves the PA of barley 43.2%. In contrast to

310

Comb.4, in Comb.3 the TGVI and TGBS do not improve PA of barley significantly. This is due to the

311

similarity between wheat and barley in the early stage of growing. Therefore, the results

312

demonstrated that the appropriate model is Comb.6 which utilizes only 1 image from the first stage

313

of growing season and two images from seasonal change point of wheat and barley. It is worthy to

314

note that compared to Comb.3 the scheduled time decreases to 07-MAR-15 in Comb.7 and yield better

315

results in terms of OA and kappa when temporal resolution increase to one week (i.e. two images of

316

path 163 are used).

317

3.6 Experiment 6: The Effect of the Number of Classes

318

In practice, the classification of all crop types is time and cost consuming.Also when different

319

scenarios for image classification arise in terms of the number of classes, different classification

320

accuracies can be obtained. Moreover, discriminating two similar classes like wheat and barley

321

demands images from a seasonal change point in the temporal signature of those crops. On the other

322

hand, decision makers are usually demand to get information about the cultivated area in early

323

season even for only one or two crop types. Therefore, the number of desired classes to be determined

324

effect on early season detection by the RF classifier. In this experiment, the RF classification is

325

performed on classification case 2 of Table 5 using the appropriate models obtained through

326

experiment 5 (i.e. Comb.3, Comb.6 and Comb.7 of Table 8). The comparative results are given in Table

327

9. The achieved results demonstrated that, compared to the classification case 1, the classification

328

accuracy increased up to 7% in case 2 in terms of OA (i.e. when two classes of wheat and barley are

329

merged into one class). As a result, when decision makers demand to distinguish cultivate area of

330

both wheat and barley, the Comb.7 is an appropriate model which utilizes 4 images from the first

331

stage of growing season with one weak temporal resolution. In this regard, the crop area (Wheat

332

along with Barely) can be estimated with appropriate results almost two months earlier (Table 9).

333

Table 9. The overall accuracy (OA), kappa and producers’ accuracy (PA) when wheat and barley

334

are merged to one class by the RF classifier.

335

C ase no . 11 -JA N -1 5 03 -F EB-15 28 -F EB-15 07 -M AR-15 16 -M AR-15 17 -AP R -1 5 03 -M AY -1 5 19 -M AY -1 5 04 -JU N -1 5 20 -JU N -1 5 VIs Grad ient OA Kappa PA (Wh eat + B are ly\ ) PA O th er

1  -  -  - - -   95.9 90.7 92.1 97.8

2  - - - -   - - -   99.6 99.0 99.0 99.8

3     - - -   97.1 93.5 94.9 98.2

4. Conclusions

336

In this study, multi-temporal images were used to discriminate wheat, barley and other classes

337

by the RF classifier with the aims of using a few images acquired in the early stages of the growing

338

season rather than using those of the entire growing season. In the study area, 10 images from the

339

overlapping area between two over paths of 162 and 163 captured during 6 months were examined.

(13)

The experimental results demonstrated that 30% of the data and 50 trees were suitable as the

341

optimum training sample size and forest size, respectively in the study area. The miss values due to

342

the cloudy pixels in multi-temporal images were retrieved by the average of the k-nearest cloudless

343

pixels. The achieved results demonstrated that the OA decreased by 1.6 % due to the retrieval miss

344

values. Therefore, it is concluded that when miss values in the cloudy pixels are retrieved and

345

classified by the RF classifier, the certainty of obtaining labels for these pixels is less than the certainty

346

of labels of cloudless pixels. To determine appropriate acquisition times for identifying

347

aforementioned classes, different combinations of acquisition times were generated and evaluated.

348

The experiments demonstrated that the superior result was obtained when 6 images were used while

349

an acceptable accuracy in early season was gained when only thefirst three images of all images were

350

utilized. To demonstrate the effect of increasing the temporal resolution, the images of path 163 were

351

incorporated into the experiment. The experimental results showed that if temporal resolution of

352

Landsat 8 increased to one week the classification task can be conducted earlier with almost better

353

results in terms of OA and kappa.

354

Furthermore, VIs along with TGVI and TGSB were generated as informative features to

355

discriminate wheat and barley. In this regard, the impact of incorporating these generated features

356

was assessed with the aim of decreasing period of time series and improving the accuracy. The results

357

demonstrated that utilizing these new features improve the OA and Kappa 3.1% and 6.6%,

358

respectively. Furthermore, the obtained result showed that TGVI and TGSB play the main role to

359

discriminate remarkably wheat from barley if only one image from seasonal changes of crops is

360

available. The experiments also demonstrated that if the early season detection of both wheat and

361

barley as one class rather more important than separate them, the crop area can be estimated two

362

months earlier with 97.1 and 93.5 in terms of OA and kappa, respectively.

363

The results obtained by this study are promising such that images with high temporal resolution

364

like Sentinel2A and Sentinel2B can be used to achieve accurate discrimination in the early stage of

365

the growing season. Although using a constellation of satellites with high spatial and temporal

366

resolutions increases the accuracy, a balance between some factors like ground sampling size,

367

processing time, accuracy and early season crop area estimation should be certainly considered.

368

Author Contributions: All the authors listed contributed equally to the work presented in this paper.

369

Conflicts of Interest: The authors declare no conflict of interest.

370

References

371

1.

Godfray, H.C.J.; Beddington, J.R.; Crute, I.R.; Haddad, L.; Lawrence, D.; Muir, J.F.;

372

Pretty, J.; Robinson, S.; Thomas, S.M.; Toulmin, C. Food security: The challenge of

373

feeding 9 billion people. science

2010

, 327, 812-818.

374

2.

Padilla, F.; Maas, S.; González-Dugo, M.; Mansilla, F.; Rajan, N.; Gavilán, P.;

375

Domínguez, J. Monitoring regional wheat yield in southern spain using the grami

376

model and satellite imagery. Field Crops Research

2012

, 130, 145-154.

377

3.

Allen, R.; Hanuschak, G.; Craig, M. History of remote sensing for crop acreage in

378

usda's national agricultural statistics service.

2002

.

379

4.

Pan, Y.; Li, L.; Zhang, J.; Liang, S.; Zhu, X.; Sulla-Menashe, D. Winter wheat area

380

estimation from modis-evi time series data using the crop proportion phenology

381

index. Remote Sensing of Environment

2012

, 119, 232-242.

382

5.

Ghamisi, P.; Plaza, J.; Chen, Y.; Li, J.; Plaza, A. Advanced supervised spectral

383

classifiers for hyperspectral images: A review. IEEE Geoscience and Remote Sensing

384

Magazine (GRSM)

2017

.

(14)

6.

Vieira, C.; Mather, P.; McCULLAGH, M. The spectral-temporal response surface

386

and its use in the multi-sensor, multi-temporal classification of agricultural crops.

387

International Archives of Photogrammetry and Remote Sensing

2000

, 33, 582-589.

388

7.

Wardlow, B.D.; Egbert, S.L.; Kastens, J.H. Analysis of time-series modis 250 m

389

vegetation index data for crop classification in the us central great plains.

Remote

390

Sensing of Environment

2007

, 108, 290-310.

391

8.

Cammarano, D.; Fitzgerald, G.J.; Casa, R.; Basso, B. Assessing the robustness of

392

vegetation indices to estimate wheat n in mediterranean environments.

Remote

393

Sensing

2014

, 6, 2827-2844.

394

9.

de Colstoun, E.C.B.; Story, M.H.; Thompson, C.; Commisso, K.; Smith, T.G.; Irons,

395

J.R. National park vegetation mapping using multitemporal landsat 7 data and a

396

decision tree classifier. Remote Sensing of Environment

2003

, 85, 316-327.

397

10.

Langley, S.K.; Cheshire, H.M.; Humes, K.S. A comparison of single date and

398

multitemporal satellite image classifications in a semi-arid grassland. Journal of Arid

399

Environments

2001

, 49, 401-411.

400

11.

Carrão, H.; Gonçalves, P.; Caetano, M. Contribution of multispectral and

401

multitemporal information from modis images to land cover classification. Remote

402

2008

, 112, 986-997.

403

12.

Chen, J.; Lu, M.; Chen, X.; Chen, J.; Chen, L. A spectral gradient difference based

404

approach for land cover change detection.

ISPRS journal of photogrammetry and

405

remote sensing

2013

, 85, 1-12.

406

13.

Zheng, B.; Myint, S.W.; Thenkabail, P.S.; Aggarwal, R.M. A support vector machine

407

to identify irrigated crop types using time-series landsat ndvi data.

International

408

Journal of Applied Earth Observation and Geoinformation

2015

, 34, 103-112.

409

14.

Bellón, B.; Bégué, A.; Lo Seen, D.; de Almeida, C.A.; Simões, M. A remote sensing

410

approach for regional-scale mapping of agricultural land-use systems based on ndvi

411

time series. Remote Sensing

2017

, 9, 600.

412

15.

Lobell, D.B.; Asner, G.P. Cropland distributions from temporal unmixing of modis

413

data. Remote Sensing of Environment

2004

, 93, 412-422.

414

16.

Nitze, I.; Schulthess, U.; Asche, H. Comparison of machine learning algorithms

415

random forest, artificial neural network and support vector machine to maximum

416

likelihood for supervised crop type classification. Proc. of the 4th GEOBIA

2012

,

7-417

9.

418

17.

Lebourgeois, V.; Dupuy, S.; Vintrou, É.; Ameline, M.; Butler, S.; Bégué, A. A

419

combined random forest and obia classification scheme for mapping smallholder

420

agriculture at different nomenclature levels using multisource data (simulated

421

sentinel-2 time series, vhrs and dem). Remote Sensing

2017

, 9, 259.

422

18.

Chen, J.; Jönsson, P.; Tamura, M.; Gu, Z.; Matsushita, B.; Eklundh, L. A simple

423

method for reconstructing a high-quality ndvi time-series data set based on the

424

savitzky–golay filter. Remote sensing of Environment

2004

, 91, 332-344.

425

19.

Potgieter, A.; Apan, A.; Hammer, G.; Dunn, P. Early-season crop area estimates for

426

winter crops in ne australia using modis satellite imagery.

ISPRS Journal of

427

(15)

20.

Du, P.; Xia, J.; Zhang, W.; Tan, K.; Liu, Y.; Liu, S. Multiple classifier system for

429

remote sensing image classification: A review. Sensors

2012

, 12, 4764-4792.

430

21.

Millard, K.; Richardson, M. On the importance of training data sample selection in

431

random forest image classification: A case study in peatland ecosystem mapping.

432

Remote sensing

2015

, 7, 8489-8515.

433

22.

Fletcher, R.S. Using vegetation indices as input into random forest for soybean and

434

weed classification. American Journal of Plant Sciences

2016

, 7.

435

23.

Eisavi, V.; Homayouni, S.; Yazdi, A.M.; Alimohammadi, A. Land cover mapping

436

based on random forest classification of multitemporal spectral and thermal images.

437

Environmental monitoring and assessment

2015

, 187, 1-14.

438

24.

Pelletier, C.; Valero, S.; Inglada, J.; Champion, N.; Dedieu, G. Assessing the

439

robustness of random forests to map land cover with high resolution satellite image

440

time series over large areas. Remote Sensing of Environment

2016

, 187, 156-168.

441

25.

Nitze, I.; Barrett, B.; Cawkwell, F. Temporal optimisation of image acquisition for

442

land cover classification with random forest and modis time-series.

International

443

Journal of Applied Earth Observation and Geoinformation

2015

, 34, 136-146.

444

26.

Hao, P.; Zhan, Y.; Wang, L.; Niu, Z.; Shakir, M. Feature selection of time series

445

modis data for early crop classification using random forest: A case study in kansas,

446

USA. Remote Sensing

2015

, 7, 5347-5369.

447

27.

Inglada, J.; Arias, M.; Tardy, B.; Hagolle, O.; Valero, S.; Morin, D.; Dedieu, G.;

448

Sepulcre, G.; Bontemps, S.; Defourny, P. Assessment of an operational system for

449

crop type map production using high temporal and spatial resolution satellite optical

450

imagery. Remote Sensing

2015

, 7, 12356-12379.

451

28.

Liu, J.; Feng, Q.; Gong, J.; Zhou, J.; Liang, J.; Li, Y. Winter wheat mapping using a

452

random forest classifier combined with multi-temporal and multi-sensor data.

453

International Journal of Digital Earth

2017

, 1-20.

454

29.

Rouse JW, H.R., Schell JA, Deering DW Monitoring vegetation, systems in the great

455

plains with erts.

In Proceeding of Third Earth Resources,Technology Satellite

456

Symposium 1, Greenbelt, USA

1974

.

457

30.

Jordan, C.F. Derivation of leaf

‐

area index from quality of light on the forest floor.

458

Ecology

1969

, 50, 663-666.

459

31.

Ridao, E.; Conde, J.R.; Mı́nguez, M.I. Estimating fapar from nine vegetation indices

460

for irrigated and nonirrigated faba bean and semileafless pea canopies.

Remote

461

1998

, 66, 87-100.

462

32.

Pearson, R.L.; Miller, L.D. In

Remote mapping of standing crop biomass for

463

estimation of the productivity of the shortgrass prairie, Remote Sensing of

464

Environment, VIII, 1972; p 1355.

465

33.

Thenkabail, P.S.; Ward, A.D.; Lyon, J.G.; Merry, C.J. Thematic mapper vegetation

466

indices for determining soybean and corn growth parameters.

Photogrammetric

467

Engineering and Remote Sensing

1994

, 60, 437-442.

468

34.

Jafari, R.; Lewis, M.; Ostendorf, B. Evaluation of vegetation indices for assessing

469

vegetation cover in southern arid lands in south australia.

The Rangeland Journal

470

(16)

35.

Liu, H.Q.; Huete, A. A feedback based modification of the ndvi to minimize canopy

472

background and atmospheric noise. IEEE Transactions on Geoscience and Remote

473

Sensing

1995

, 33, 457-465.

474

36.

Qi, J.; Chehbouni, A.; Huete, A.; Kerr, Y.; Sorooshian, S. A modified soil adjusted

475

vegetation index. Remote sensing of environment

1994

, 48, 119-126.

476

37.

Nemani, R.; Pierce, L.; Running, S.; Band, L. Forest ecosystem processes at the

477

watershed scale: Sensitivity to remotely-sensed leaf area index estimates.

478

International journal of remote sensing

1993

, 14, 2519-2534.

479

38.

Huete, A.R. A soil-adjusted vegetation index (savi). Remote sensing of environment

480

1988

, 25, 295-309.

481

39.

Freund, Y.; Schapire, R.E. In Experiments with a new boosting algorithm, Icml, 1996;

482

pp 148-156.

483

40.

Leistner, C.; Saffari, A.; Santner, J.; Bischof, H. In Semi-supervised random forests,

484

2009 IEEE 12th International Conference on Computer Vision, 2009; IEEE: pp

506-485

513.

486

41.

Breiman, L. Out-of-bag estimation; Citeseer: 1996.

487

42.

Breiman, L. Random forests. Machine learning

2001

, 45, 5-32.

488

43.

Congalton, R.G.; Green, K.

Assessing the accuracy of remotely sensed data:

489

Principles and practices. CRC press: 2008.

490

44.

Wang, L.a.; Zhou, X.; Zhu, X.; Dong, Z.; Guo, W. Estimation of biomass in wheat

491

using random forest regression algorithm and remote sensing data. The Crop Journal

492

Creative Commons CC BY