Performance Evaluation of Cluster Validity Indices (CVIs) on Multi/Hyperspectral Remote Sensing Datasets

(1)

Article

Performance Evaluation of Cluster Validity Indices

(CVIs) on Multi/Hyperspectral Remote

Sensing Datasets

Huapeng Li1,*, Shuqing Zhang1, Xiaohui Ding1,2, Ce Zhang3and Patricia Dale4

1 _{Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130012,} China; [email protected] (S.Z.); [email protected] (X.D.)

2 _{University of Chinese Academy of Sciences, Beijing 100049, China}

3 _{Lancaster Environment Centre, Lancaster University, Lancaster LA1 4YQ, UK; [email protected]} 4 _{Environmental Futures Research Institute, School of Environment, Griffith University, Brisbane, QLD 4111,}

Australia; [email protected]

* Correspondence: [email protected]; Tel.: +86-431-8554-2230

Academic Editors: Guoqing Zhou, Qihao Weng and Prasad S. Thenkabail Received: 21 December 2015; Accepted: 21 March 2016; Published: 30 March 2016

Abstract:The number of clusters (i.e., the number of classes) for unsupervised classification has been recognized as an important part of remote sensing image clustering analysis. The number of classes is usually determined by cluster validity indices (CVIs). Although many CVIs have been proposed, few studies have compared and evaluated their effectiveness on remote sensing datasets. In this paper, the performance of 16 representative and commonly-used CVIs was comprehensively tested by applying the fuzzy c-means (FCM) algorithm to cluster nine types of remote sensing datasets, including multispectral (QuickBird, Landsat TM, Landsat ETM+, FLC1, and GaoFen-1) and hyperspectral datasets (Hyperion, HYDICE, ROSIS, and AVIRIS). The preliminary experimental results showed that most CVIs, including the commonly used DBI (Davies-Bouldin index) and XBI (Xie-Beni index), were not suitable for remote sensing images (especially for hyperspectral images) due to significant between-cluster overlaps; the only effective index for both multispectral and hyperspectral data sets was the WSJ index (WSJI). Such important conclusions can serve as a guideline for future remote sensing image clustering applications.

Keywords:cluster validity index; remote sensing; image clustering; cluster number of image

1. Introduction

Land use/cover data is crucial for diverse disciplines (e.g., ecology, geography, and climatology) since it serves as a basis for various “real world” applications [1–3]. Remote sensing technique have become the mainstream means to acquire land use/cover data, owing to its specific advantages, including synoptic views and cost-effectiveness [4,5]. Remote sensing image clustering, which utilizes only the statistical information inherent in the image without human interference, is one of the most widely used methods to produce land cover information [6,7]. It is also valued because of its high efficiency (i.e., it does not use training samples) [8].

The success of clustering (unsupervised classification) depends greatly on the proper determination of cluster number (i.e., the optimal number of classes) [9]: if the number of classes selected is less than the actual number, one or more separate classes would be merged into other classes; conversely, if larger, one or more homogeneous classes would be separated into different classes. The consequence is that the information contained in the raw data is incorrectly explored and used and the classification results will not be coincident with the “real” situation [10]. In this circumstance, the role

(2)

Remote Sens.2016,8, 295 2 of 22

of the cluster validity index (CVI), which is designed to detect the optimal cluster number for a given dataset, therefore, becomes critical [11].

Generally, a CVI is comprised of two indicators, namely compactness and separation. Compactness, which indicates the concentration of data points that belong to the same cluster, is usually measured by the distance between each data point and its cluster center [10]: the smaller the distance, the better the compactness of the cluster. Separation, which expresses the degree of isolation among clusters, is usually measured by the distance between cluster centroids: the larger the distance, the stronger the isolation of clusters [12]. Ideally, a dataset is partitioned with high compactness within each cluster and large separation between each pair of clusters. However, the two indicators are often mutually conflicting [13]; with increasing cluster number, the compactness becomes larger while the separation becomes smaller. Therefore, a good balance between the two indicators is required in the design of CVIs. To date, researchers from different disciplines have proposed a large number of CVIs for various types of applications.

In the remote sensing field, CVIs such as the Davies-Bouldin index (DBI) and the Xie-Beni index (XBI) have been widely used in image clustering applications. For example, DBI was employed to evaluate the fitness of candidate clustering by Bandyopadhyay and Maulik [9], and to guide satellite image clustering by Daset al.[14]; XBI was used to determine the optimal cluster number of IRS image by Maulik and Saha [15]; and was applied for multi-objective automatic image clustering [16,17]. However, in the absence of systematic and comprehensive evaluation of CVIs for remote sensing applications, CVIs are usually subjectively selected. This means that, without evaluation, they cannot necessarily be relied on. In fact, remote sensing data is well known for its complexity and uncertainty, with the specific characteristics as follows: (1) fuzzy and nonlinear class boundaries; (2) significant overlap among pixels from different classes (the overlap problem) [18]; and (3) high dimensionality and huge quantities of data. An appropriate CVI should, therefore, be designed taking account of these properties of remote sensing data.

To draw some general conclusions, although some efforts have been made to compare or evaluate the performance of CVIs in different environments [10,19–21], little attention has been paid to remote sensing data. Thus, the question remains as to how to select appropriate CVIs for remote sensing image clustering. Such a question can only be answered through an extensive evaluation of CVIs on various types of remote sensing data sets. However, to the best of our knowledge, few studies have addressed this issue. The objective of this paper is to fill that gap and identify one or several CVIs that are generally suitable for remote sensing datasets from a total of sixteen CVIs. The commonly used fuzzy c-means (FCM) and K-means algorithms were applied in this paper to cluster nine types of remote sensing datasets, including five types of multispectral and four types of hyperspectral images. This is of great significance since it can serve as a guideline for future remote sensing image clustering with diverse data types.

The remainder of this paper is organized as follows. In Section2, the clustering problem and the FCM and K-means algorithms are briefly outlined; the sixteen CVIs evaluated in this paper are reviewed and detailed in Section3; the experiments and results are provided in Section4; the results are analyzed and discussed in Section5; and conclusions are drawn in Section6.

2. The Clustering Problem

In this section, we briefly review the clustering problem and the classical fuzzy c-means algorithm. 2.1. The Clustering Problem

Clustering is widely used in many fields to derive information on distributions and patterns in raw data [11]. It aims at partitioning a given data set into groups (clusters) according to a predefined criteria (usually the Euclidean distance) [20]. LetX“ tx1,x2, . . . ,xNube a possible given dataset (with

Npoints), andKthe number of clusters (i.e., patterns) of the data. The purpose of clustering is to evolve a partition matrixUpXqof the data to determine a partitionC“ tC1,C2, . . . ,CKu, in which the

(3)

points in the same cluster are as close (i.e., have high similarity) as possible while those in different clusters are dispersed as far (i.e., have high dissimilarly) as possible. The partition matrix can be denoted asU“ rµi js, 1ďiďK, 1ďjď N, whereµijis the grade of membership of pointxjto cluster

Cipi“1, . . . ,Kq.

Clustering can be performed in two forms: crisp and fuzzy. In crisp clustering, any one point of the given dataset belongs to only one class of clusters, that isµij“ 1 ifxjPCi; otherwiseµij“0.

In fuzzy clustering, a point may belong to several or all classes with a certain grade of membership. In this case, the partition matrixUpXqis represented asU “ rµi js, whereµij P r0, 1s. It should be

noted that crisp clustering is a special version of fuzzy clustering in which the grade of membership of a point to a cluster is either 0 or 1. Once a fuzzy clustering structure is determined by a specific algorithm, each point of the given data will be assigned to the most likely cluster (i.e., with the largest grade of membership for that point). Through this process, the fuzzy clustering can be transformed into crisp clustering for real applications.

2.2. The Fuzzy C-Means Algorithm

The classical fuzzy c-means (FCM) algorithm proposed by Bezdek [22] has been successfully used in a wide domain of applications, such as agricultural engineering, image analysis, and target recognition, among others [20,23,24]. The objective of FCM is to evolve a set of cluster centers through minimizing the weighted within-cluster sum of squared error functionJm, which is defined as:

Jm“ N ÿ j“1 K ÿ i“1 pµijqm||xj´zi||2, 0ă N ÿ j“1 µijăN,iP t1, 2, . . . , Ku (1)

whereZ“ pz1,z2, . . . ,zKqis a group of cluster centers,ziPRd(dis the number of features included

in each point). || . . . || is a Euclidean norm measuring the similarity between a point and the corresponding cluster center. The weighting exponent m controls the fuzziness of the grade of membership. The partition matrixµijand the cluster center setZin the functionJmcan be calculated

using the following equations:

µij “ » – K ÿ i“1 p||xj´zi|| 2 ||xj´zk||2 q 1{pm´1qfi fl ´1 ,iP t1, 2, . . . ,Ku,jP t1, 2, . . . , Nu (2) and zi“ N ř j“1 pµijqmxj N ř j“1 pµijqm ,iP t1, 2, . . . , Ku (3)

The FCM algorithm iteratively searches the fuzzy partition matrix and the cluster centers with a greedy searching strategy, until either no more changes are found in the cluster centers or the differences between two successive cluster centers fall below a predefined threshold. Normally, the FCM algorithm consists of the following steps:

Step 1: Determine the number of clusterKand the weighting exponentm, initialize the cluster centersZpz1,z2, . . . ,zKqrandomly, and define a threshold of iteration terminationε.

Step 2: Update the fuzzy partition matrix using Equation (2). Step 3: Recalculate the cluster center setZnewusing Equation (3).

Step 4: If ||Znew´Z||ďε, stop the iteration and output the clustering result; otherwise, go to step 2.

(4)

Remote Sens.2016,8, 295 4 of 22

2.3. The K-Means Algorithm

The K-means algorithm is one of the most commonly used methods for unsupervised image classification [2]. Similar to FCM, the objective of K-means is to determine a set of cluster centers through minimizing the clustering metricM, which is defined as

M“ K ÿ i“1 ÿ x_jPC_i kxj´zi k (4)

whereCirepresents a cluster withzias its cluster center.

A greedy searching strategy is also employed in K-means to search for the optimal set of cluster centers, until a predefined termination condition is met. The main steps of the algorithm are as follows: Step 1: Determine the cluster numberKand the maximum iteration numberMax_iterto generate the initial cluster centers randomly.

Step 2: Assign pixelxjto clusterCiif ||xj´zi||ă||xj´zk||,kP t1, 2, . . . ,Ku, andi‰ k.

Step 3: Calculate new cluster center (znew_i ) for clusterCiasznew_i “

1 Ni

ř xjPCi

xj, whereNidenotes the

number of pixels in clusterCi.

Step 4: IfMax_iteris reached, terminate the cycle and output the clustering result; otherwise, go to Step 2.

3. Cluster Validity Indices (CVIs)

Broadly, current fuzzy CVIs (for fuzzy clustering) can be classified into two forms: one (called simple CVIs) only considers the fuzzy grades of membership to a class of the data (e.g., the partition coefficient), the other (called advanced CVIs) takes both fuzzy grads of membership and the geometrical properties (i.e.,the structure) of the original data into account (e.g., the well-known XBI) [10]. In fact, crisp CVIs (for crisp clustering) which only consider the geometrical properties of the original data (e.g., the well-known DBI) are special versions of advanced CVIs, and can also be used in fuzzy image clustering analysis [12]. In this study, a total of 16 representative and commonly used CVIs of different forms were chosen for evaluation, including three simple CVIs, and thirteen advanced CVIs.

It is noteworthy that some CVIs (e.g., XBI) indicate the optimal cluster number of data by using the maximum value, while the others use the minimum value. For convenience, we subsequently denote the former (the larger, the better CVI) as CVI+_{, and the latter (the lower, the better CVI) as CVI}´_. 3.1. Simple CVIs

(1) The partition coefficient (PC+_{) [}₂₅_{] evaluates the compactness by using the averaged strength} of belongingness of data, and is defined as:

PCpKq “ 1 N K ÿ i“1 N ÿ j“1 µ2_ij (5)

(2) The partition entropy (PE´_{) is formed based upon the logarithmic form of}_PC_[₂₂_{], and is} defined as: PEpKq “ ´1 N K ÿ i“1 N ÿ j“1 µijlog2puijq (6)

(3) The modification ofPC(MPC+) [26] is designed to reduce the monotonic tendency ofPCand PE. The index is defined as:

MPCpKq “1´ K

(5)

3.2. Advanced CVIs

(1) The Davies-Bouldin Index (DBI´_{) [}₂₇_{] estimates the ratio of within-cluster compactness to} between-cluster separation, which is defined as:

DBIpKq “ 1 K K ÿ i“1 maxt Si`Sk ||zi´zk||2 u,i‰k, (8) whereSi“ 1 Ni ř xjPCi||xj´zi|| 2_,_N

idenotes the number of data points in theith cluster (Ci).

(2) The Dunn Index (DI+) [28] evaluates a clustering by taking the minimum distance between-cluster as separation and the maximum distance between each pair of within-cluster points as compactness. The original index is defined as [28]:

DunnpKq “ min 1ďpďK ¨ ˝ min s`1ďqďK´1p dispCp,Cqq max 1ďiďKdiapCiq q ˛ ‚, (9)

wheredispCp,Cqqrefers to the distance between thepth andqth clusters, is calculated asdispCp,Cqq “

min

x_jPCp,xlPCq

p||xj´xl||q;diapCiqdenotes the maximum distance between any pair of within-cluster

points, which is measured asdiapCiq “ max xj,xlPCi

p||xj´xl||q.

(3) The Calinski-Harabasz Index (CHI+) [29] is a ratio-type index in which compactness is measured by the distance (WK) between each within-cluster point to its centroid, and separation

is based on the distance (BK) between each centroid to the global centroid (z),i.e.,:

CH IpKq “ BK K´1{ WK N´K, (10) whereBK“ K ř i“1 Ni||zi´z||2,WK“ K ř i“1 ř xjPCi ||xj´zi||2.

(4) The Fukuyama and Sugeno Index (FSI´_{) [}₃₀_{] is designed to measure the discrepancy between} fuzzy compactness and fuzzy separation,i.e.,:

FSIpKq “ K ÿ i“1 N ÿ j“1 um_ij||xj´zi||2´ K ÿ i“1 N ÿ j“1 um_ij||zi´ ´ z|| 2 (11)

(5) The Xie and Beni Index (XBI´_{) [}₃₁_{] is also a ratio-type index, which measures the average} within-cluster fuzzy compactness against the minimum between-cluster separation,i.e.,:

XBIpKq “ K ř i“1 N ř j“1 µ2_ij||xj´zi||2 N¨min i‰kt||zi´zk|| 2 u (12)

(6) The Kwon Index (KI´_{) [}₃₂_{] aims to overcome the shortcoming of XBI that decreases} monotonically when the cluster number approaches the actual cluster number of data. Here, a penalty function was introduced to the numerator of XBI,i.e.,:

KIpKq “ K ř i“1 N ř j“1 µ2_ij||xj´zi||2` 1 K K ř i“1 ||zi´ ´ z|| 2 min i‰kt||zi´zk|| 2_u (13)

(6)

Remote Sens.2016,8, 295 6 of 22

(7) The Tang Index (TI´_{) [}₃₃_{] also introduced a similar penalty function to the numerator} of XBI,i.e.: T IpKq “ K ř i“1 N ř j“1 µ2_ij||xj´zi||2` 1 KpK´1q K ř i“1 K ř k“1 k‰i ||zi´zk||2 min i‰k||zi´zk|| 2_`₁_{_K (14)

(8) The SC Index (SCI+) [34] measures the fuzzy compactness/separation ratio of clustering by using the difference between two functions,SC1andSC2,i.e.,:

SCIpKq “SC1pKq ´SC2pKq, (15)

whereSC1(Equation (16)) evaluates the compactness/separation ratio by considering the grades of membership and the original data: the larger theSC1, the better the clustering:

SC1pKq “ p1 K K ř i“1 ||zi´z||q K ř i“1 p N ř j“1 µm_ij||xj´zi||2{ N ř j“1 µijq , (16)

whileSC2(Equation (17)) measures the ratio by using the grades of membership only: the smaller the SC2, the better the clustering:

SC2pKq “ K´1 ř i“1 K ř k“i`1 p N ř j“1 pminpµij,µkjq2q{njkq p N ř j“1 max 1ďiďKµ 2 ijq{p N ř j“1 max 1ďiďKµijq (17) wherenjk “ N ř j“1 minpµij,µkjq.

(9) The Compose Within and Between scattering Index (CWBI´_{) [}₁₉_{] assesses the average} compactness and separation of fuzzy clustering by using the sum of two functions,i.e.,:

CWBIpKq “αScatpKq `DispKq, (18)

whereαis a weighing factor which equalsDispKmaxq, theDispKqwith the maximum cluster number; andScatpKqrefers to the average scattering (i.e., compactness) forKclusters, which is defined as:

ScatpKq “ 1 K K ř i“1 ||σpziq|| ||σpXq|| , (19)

where ||x|| “ pxT¨xq1{2; σpXq denotes the variance of data, which is defined as σpXq “ 1

N

N ř j“1

pxj´zq2; σpziq denotes the fuzzy variation of cluster i, which is defined as σpziq “

1 N N ř j“1 µijpxj´ziq2.

The smaller the value ofScatpKq, the better the compactness of the clustering.

The distance functionDispKqmeasuring the separation between clusters is defined as:

DispKq “ Dmax Dmin K ÿ i“1 p K ÿ k“1 ||zi´zk||q ´1 , (20)

(7)

whereDmax“maxt||zi´zk||u,Dmin“mint||zi´zk||u,i,kP{2, 3, . . . K}.

The smaller the value ofDispKq, the better the separation of clusters.

(10) The WSJ Index (WSJI´_{) [}₁₃_{], inspired by the CWBI, also uses a linear combination of averaged} fuzzy compactness and separation to measure clustering, which is defined as:

WSJ IpKq “ScatpKq ` SeppKq

SeppKmaxq (21)

where ScatpKq is given by Equation (19); SeppKq denotes the between-cluster separation, which is defined as SeppKq “ D 2 max D_min2 K ř i“1 p K ř k“1 ||zi´zk||2q ´1

, where Dmax “ maxt||zi´zk||u, Dmin “ mint||zi´zk||u;SeppKmaxqrefers to theSeppKqwith the maximum cluster number.

(11) The PBMF index (PBMFI+) [20] estimates within-cluster compactness and large separation between clusters of fuzzy clustering,i.e.,:

PBMFIpKq “ max i‰k t||zi´zk||u ˆ N ř j“1 µj1||xi´z1|| K K ř i“1 N ř j“1 µm_ij||xj´zi|| . (22)

(12) The SVF index (SVFI+) [35] emphasizes on low within-cluster variation (i.e., high compactness) and large separation between clusters,i.e.,:

SVFIpKq “ K ř i“1 min i‰k||zi´zk|| K ř i“1 maxxjPCiµ m ij||xj´zi|| . (23)

(13) The WL Index (WLI´_{) [}₁₂_{] measures both within-cluster compactness and between-cluster} separation of fuzzy clustering. Specifically, it takes both the minimum and the median distances between clusters as separation, which retains the clusters whose centroids are close to each other. The index is defined as:

W LIpKq “ W Ln 2W Ld

(24) where W Ln denotes the fuzzy compactness of clusters, which is defined as W Ln “

K ř i“1 p N ř j“1 µ2_ij||xj´zi||2 N ř j“1 µij

q;W Ldrefers to the separation between clusters, which is defined asW Ld “

1

2pmini‰kt||zi´zk||

2

u `median

i‰k t||zi´zk||

2_{uq, where min}

i‰kt||zi´zk||

2

u and mediant||zi´zk||2u

denote, respectively, the minimum distance and median distance between any pair of clusters. 4. Experiments and Results

In this section, the performance of the 16 CVIs introduced in Section3was evaluated using five types of multispectral, and four types of hyperspectral, remote sensing datasets (detailed below). For image clustering, the FCM and K-means algorithms were utilized here. The operational parameters in FCM were designated in line with previous studies [13]: threshold of iteration terminationε“ e´5, weighting exponentm“2, and the maximum iteration numberMax_iter“500; while the operational parameters in K-means as: the pixel change threshold = 0%, and the maximum iteration number Max_iter“500. For each of the images, the two algorithms were implemented with cluster number

(8)

Remote Sens.2016,8, 295 8 of 22

K “2, 3, . . . , 10, respectively. To overcome the shortcoming of the two algorithms that often trap on local optima, depending on the initial solutions [36], each implementation of the clustering was repeated five times and the best clustering result (with the minimum value ofJm(Equation (1)) orM

(Equation (4)) was retained for CVIs evaluation. 4.1. Datasets

The five multispectral data sets include QuickBird [37], Landsat TM, Landsat ETM+, GaoFen-1 [38], and FLC1 [39]. Their true/false color maps, the corresponding ground reference maps and the spectral curves of land use/cover classes were shown in Figure1. The four hyperspectral datasets include Hyperion [40], HYDICE [41], ROSIS [42] and AVIRIS [43]. Their false color maps, ground reference maps and spectral curves of land cover/use classes were presented in Figure2. The basic information on the remote sensing datasets employed in our experiments was detailed in Table1.

Remote Sens. 2016, 8, 295 8 of 23 d n

WL

K

WLI

2

)

(

=

₍₂₄₎

where

WL

n denotes the fuzzy compactness of clusters, which is defined as



= = = − = K i N j ij N j i j ij n z x WL 1 1 1 2 2 ) ( μ μ

;

WL

d refers to the separation between clusters, which is defined as

})

{

}

{

min

(

2

1

2 2 k i k i k i k i d

z

median

z

WL

=

−

+

−

≠ ≠ , where

min

{

}

2 k i k i_≠

z

−

z

and

}

{

z

_i

z

_k 2

median

−

denote, respectively, the minimum distance and median distance between any

pair of clusters.

4. Experiments and Results

In this section, the performance of the 16 CVIs introduced in Section 3 was evaluated using five types of multispectral, and four types of hyperspectral, remote sensing datasets (detailed below). For image clustering, the FCM and K-means algorithms were utilized here. The operational parameters in FCM were designated in line with previous studies [13]: threshold of iteration termination

e

=

ε

− 5, weighting exponent

m

=

2

, and the maximum iteration number

Max

_

iter

=

500

; while the operational parameters in K-means as: the pixel change threshold = 0%, and the maximum iteration number

Max

_

iter

=

500

. For each of the images, the two algorithms were implemented with cluster number

K

=

2

,

3

,

...,

10

, respectively. To overcome the shortcoming of the two algorithms that often trap on local optima, depending on the initial solutions [36], each implementation of the clustering was repeated five times and the best clustering result (with the minimum value of

J

m (Equation (1)) or

M

(Equation (4)) was retained for CVIs

evaluation.

4.1. Datasets

The five multispectral data sets include QuickBird [37], Landsat TM, Landsat ETM+, GaoFen-1 [38], and FLC1 [39]. Their true/false color maps, the corresponding ground reference maps and the spectral curves of land use/cover classes were shown in Figure 1. The four hyperspectral datasets include Hyperion [40], HYDICE [41], ROSIS [42] and AVIRIS [43]. Their false color maps, ground reference maps and spectral curves of land cover/use classes were presented in Figure 2. The basic information on the remote sensing datasets employed in our experiments was detailed in Table 1.

(a) (b) (c) Figure 1.Cont. Remote Sens. 2016, 8, 295 9 of 23 (d) (e) (f) (g) (h) (i) (j) (k) (l) (m) (n) (o)

Figure 1. The multispectral images. (a–c) the true/false color map, the ground reference map and the corresponding spectral curves of ground truth classes of QuickBird datasets; (d–f) the corresponding maps of Landsat TM datasets; (g–i) the corresponding maps of Landsat ETM+ datasets; (j–l) the corresponding maps of GaoFen-1 datasets; (m–o) the corresponding maps of FLC1 datasets. (a) True color map; (b) false color map (7, 5, 3); (c) false color map (7, 5, 3); (d) true color map; and (e) false color map (bands 12, 9, and 1).

(9)

Remote Sens.2016,8, 295 9 of 22 (d) (e) (f) (g) (h) (i) (j) (k) (l) (m) (n) (o)

Figure 1. The multispectral images. (a–c) the true/false color map, the ground reference map and the corresponding spectral curves of ground truth classes of QuickBird datasets; (d–f) the corresponding maps of Landsat TM datasets; (g–i) the corresponding maps of Landsat ETM+ datasets; (j–l) the corresponding maps of GaoFen-1 datasets; (m–o) the corresponding maps of FLC1 datasets. (a) True color map; (b) false color map (7, 5, 3); (c) false color map (7, 5, 3); (d) true color map; and (e) false color map (bands 12, 9, and 1).

Figure 1.The multispectral images. (a–c) the true/false color map, the ground reference map and the corresponding spectral curves of ground truth classes of QuickBird datasets; (d–f) the corresponding maps of Landsat TM datasets; (g–i) the corresponding maps of Landsat ETM+ datasets; (j–l) the corresponding maps of GaoFen-1 datasets; (m–o) the corresponding maps of FLC1 datasets. (a) True color map; (b) false color map (7, 5, 3); (c) false color map (7, 5, 3); (d) true color map; and (e) false color map (bands 12, 9, and 1).

Remote Sens. 2016, 8, x 10 of 23

(a) (b) (c)

(d) (e) (f)

(g) (h) (i)

(j) (k) (l)

Figure 2. The hyperspectral images. (a–c) the false color (FC) map, the corresponding ground reference map and the corresponding spectral curves of ground truth classes of Hyperion data sets; (d–f) the corresponding maps of HYDICE datasets; (g–i) the corresponding maps of ROSIS datasets; (j–l) the corresponding maps of AVIRIS datasets. (a) FC map (bands 93, 60, 10); (b) FC map (bands 120, 90, 10); (c) FC map (bands 90, 60, 10); and (d) FC map (bands 111, 90, 12).

Table 1. Basic information of the remote sensing data sets.

D S Y L R B W S GT

QuickBird Multi-spectral camera 2005

Yalvhe farm,

China 2.4 4 0.45–0.90 100 × 100

Road, paddy field, and farmland Landsat TM Thematic mapper 2005 JingYuetan reservoir, China 30 6 0.45–2.35 296 × 295 Forest, farmland, water, and town Landsat ETM+ Enhanced thematic mapper 2001 Zhalong reserve, China 30 6 0.45–2.35 150 × 139

Marsh, forest, water, and farmland

Gaofen-1 Wide filed imager 2015

Sanjiang Plain,

China 16 4 0.45–0.89 200 × 200

Water1, water2, grass, soil, and sand

(10)

Remote Sens.2016,8, 295 10 of 22 Remote Sens. 2016, 8, x 10 of 23 (a) (b) (c) (d) (e) (f) (g) (h) (i) (j) (k) (l)

Figure 2. The hyperspectral images. (a–c) the false color (FC) map, the corresponding ground reference map and the corresponding spectral curves of ground truth classes of Hyperion data sets; (d–f) the corresponding maps of HYDICE datasets; (g–i) the corresponding maps of ROSIS datasets; (j–l) the corresponding maps of AVIRIS datasets. (a) FC map (bands 93, 60, 10); (b) FC map (bands 120, 90, 10); (c) FC map (bands 90, 60, 10); and (d) FC map (bands 111, 90, 12).

Table 1. Basic information of the remote sensing data sets.

D S Y L R B W S GT

QuickBird Multi-spectral camera 2005

Yalvhe farm,

China 2.4 4 0.45–0.90 100 × 100

Road, paddy field, and farmland Landsat TM Thematic mapper 2005 JingYuetan reservoir, China 30 6 0.45–2.35 296 × 295 Forest, farmland, water, and town Landsat ETM+ Enhanced thematic mapper 2001 Zhalong reserve, China 30 6 0.45–2.35 150 × 139

Marsh, forest, water, and farmland

Gaofen-1 Wide filed

imager 2015

Sanjiang Plain,

China 16 4 0.45–0.89 200 × 200

Water1, water2, grass, soil, and sand

Figure 2.The hyperspectral images. (a–c) the false color (FC) map, the corresponding ground reference map and the corresponding spectral curves of ground truth classes of Hyperion data sets; (d–f) the corresponding maps of HYDICE datasets; (g–i) the corresponding maps of ROSIS datasets; (j–l) the corresponding maps of AVIRIS datasets. (a) FC map (bands 93, 60, 10); (b) FC map (bands 120, 90, 10); (c) FC map (bands 90, 60, 10); and (d) FC map (bands 111, 90, 12).

Table 1.Basic information of the remote sensing data sets.

D S Y L R B W S GT

QuickBird Multi-spectral_camera 2005 Yalvhe farm,_China 2.4 4 0.45–0.90 100ˆ100 Road, paddy field, and farmland Landsat TM Thematic_mapper 2005 JingYuetanreservoir,

China 30 6 0.45–2.35 296

ˆ295 _{water, and town}Forest, farmland,

Landsat ETM+ Enhanced thematic mapper 2001 Zhalong reserve, China 30 6 0.45–2.35 150

ˆ139 Marsh, forest, water,_{and farmland}

Gaofen-1 Wide filed_imager 2015 Sanjiang

Plain, China 16 4 0.45–0.89 200ˆ200

Water1, water2, grass, soil, and sand FLC1 M7 scanner 1966 Tippecanoe_{County, US} 30 12 0.40–1.00 84ˆ183 wheat andSoybeans, oats, corn,

red clover Hyperion Hyperion 2001 OkavangoDelta,

Botswana

30 145 0.40–2.50 126ˆ146 Woodland, islandinterior, water and floodplain grasses HYDICE HYDICE 1995 Washington_{DC, US} 2 191 0.40–2.40 126ˆ82 Roads, trees, trail_{and grass}

ROSIS ROSIS 2001 Universityof Pavia, Italy

1.3 103 0.43–0.86 125ˆ148 Meadows, trees,asphalt, bricks and shadows

AVIRIS AVIRIS 1998 _{Valley, USA}Salinas 3.7 204 0.41–2.45 117ˆ143

Vineyard untrained, celery, fallow smooth, fallow plow and stubble

Note: D, datasets; S, sensor; Y, year; L, location; R, resolution (m); B, number of bands; W, spectral wavelength (µm); S, size of image (pixel by pixels); GT, ground truth classes.

4.2. Results

The nine types of images were clustered by FCM and K-means algorithms respectively, and each clustering result was evaluated using the corresponding ground-truth data (Figures1and2). Table2 shows the classification accuracies of the images achieved by the two algorithms. Similarly, both FCM and K-means generated good classification results, with the overall accuracy greater than 90%

(11)

for seven images. However, considering the length limitation of the paper, the clustering results by K-means and the corresponding cluster validity result for each image were not presented in as much detail as those by FCM, but were summarized at the end of the results.

Table 2. Classification accuracies of the remote sensing images acquired by FCM and

K-means algorithms.

Datasets K# Overall Accuracies (%) Kappa Coefficient

FCM K-Means FCM K-Means QuickBird 3 96.06 96.10 0.9354 0.9361 Landsat TM 4 95.78 95.27 0.9433 0.9363 Landsat ETM+ 4 94.41 96.30 0.9253 0.9565 Gaofen-1 5 98.34 98.79 0.9791 0.9848 FLC1 5 83.10 84.48 0.7847 0.8016 Hyperion 4 87.09 86.79 0.8260 0.8219 HYDICE 4 94.88 96.00 0.9238 0.9403 ROSIS 5 93.85 93.25 0.9129 0.9044 AVIRIS 5 99.63 99.63 0.9946 0.9946

Tables3–11illustrated the variations of the 16 CVIs with the number of clusters ranging from two to 10 by FCM for each image. The optimal cluster numbers of each image are indicated by the CVIs, shown in bold font. The clustering results of multispectral and hyperspectral datasets by FCM, respectively, are illustrated in Figures3and4. Note that only four clustering results for each image are presented, including the optimal one (underlined), one or two close to the optimal, and those indicated by many CVIs (usually larger than 4) (bold). For example, Figure3e–h illustrates the clustering results of Landsat TM image, in which the optimal clustering (Figure3g) is underlined, the two near-optimal clustering results (Figure3f,h) and the obviously-incorrect clustering indicated by many CVIs (Figure3e) are also presented.

Table 3.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the QuickBird image.

CVIs Cluster Number

2 3 * 4 5 6 7 8 9 10 PC+ 0.754 0.800 0.728 0.699 0.658 0.623 0.626 0.593 0.579 PE´ _0.565 _0.532 _0.749 _0.866 _1.008 _1.134 _1.150 _1.272 _1.341 MPC+ 0.509 0.700 0.637 0.624 0.590 0.560 0.572 0.542 0.533 DBI´ _0.822 _0.559 _0.692 _0.750 _0.779 _0.845 _0.802 _0.882 _0.915 DI+(e-3) 2.486 3.591 2.539 2.539 2.614 2.614 3.439 3.439 3.439 CHI+(e4) 1.142 2.041 2.026 2.030 1.908 1.779 2.116 1.998 1.971 FSI´_(e7) _´_0.535 _´_6.644 _´_7.247 _´_7.331 _´_7.093 _´_6.902 _´_7.440 _´_7.260 _´_7.092 XBI´ _0.160 _0.103 _0.218 _0.236 _0.231 _0.287 _0.237 _0.325 _0.277 KI´_(e3) _1.601 _1.027 _2.184 _2.370 _2.312 _2.883 _2.382 _3.266 _2.790 TI´_(e3) _1.601 _1.029 _2.189 _2.376 _2.320 _2.894 _2.395 _3.284 _2.808 SCI+ 0.477 2.477 2.516 2.752 2.837 2.554 3.546 3.456 3.349 CWBI´_(e-2) _4.076 _2.826 _4.284 _4.964 _5.399 _6.902 _6.592 _8.628 _8.499 WSJI´ _0.365 _0.171 _0.256 _0.339 _0.399 _0.648 _0.618 _1.060 _1.023 PBMFI+(e3) 1.588 3.214 0.635 0.336 0.068 0.060 0.022 0.034 0.013 SVFI+ 1.422 2.415 2.621 2.910 3.063 3.252 3.517 3.699 3.885 WLI´ _0.339 _0.252 _0.325 _0.418 _0.426 _0.453 _0.346 _0.374 _0.388

Note: *denotes the actual cluster number of the image; figures in bold face denote the optimal cluster numbers of the image identified by the CVIs; the data in the brackets of the first column is a multiplying factor (e.g., e-3 followed DI+_{) of the corresponding line.}

(12)

Remote Sens.2016,8, 295 12 of 22 Remote Sens. 2016, 8, 295 12 of 23 (a) (b) (c) (d) (e) (f) (g) (h) (i) (j) (k) (l) (m) (n) (o) (p) (q) (r) (s) (t)

Figure 3. Clustering results of the multispectral images (each color represents a cluster). (a) QuickBird,

K = 2; (b) QuickBird, K = 3; (c) QuickBird, K = 4; (d) QuickBird, K = 5; (e) Landsat TM, K = 2; (f) Landsat TM, K = 3; (g) Landsat TM, K = 4; (h) Landsat TM, K = 5; (i) Landsat ETM+, K = 2; (j) Landsat ETM+,

K = 3; (k) Landsat ETM+, K = 4; (l) Landsat ETM+, K = 5; (m) GaoFen-1, K = 2; (n) GaoFen-1, K = 4; (o) GaoFen-1, K = 5; (p)GaoFen-1, K = 6; (q) FLC1, K = 2; (r) FLC1, K = 3; (s) FLC1, K = 5; and (t) FLC1, K

= 6.

Figure 3.Clustering results of the multispectral images (each color represents a cluster). (a) QuickBird, K = 2; (b) QuickBird,K= 3; (c) QuickBird, K = 4; (d) QuickBird, K= 5; (e)Landsat TM, K= 2; (f) Landsat TM, K= 3; (g) Landsat TM,K= 4; (h) Landsat TM, K = 5; (i)Landsat ETM+, K = 2; (j) Landsat ETM+,K= 3; (k) Landsat ETM+,K= 4; (l) Landsat ETM+,K= 5; (m)GaoFen-1, K= 2; (n) GaoFen-1,K = 4; (o) GaoFen-1,K= 5; (p) GaoFen-1,K = 6; (q)FLC1, K= 2; (r)FLC1, K= 3; (s) FLC1,K= 5; and (t) FLC1,K= 6.

(13)

(a) (b) (c) (d)

(e) (f) (g) (h)

(i) (j) (k) (l)

(m) (n) (o) (p)

Figure 4. Clustering results of the hyperspectral images by FCM (each color represents a cluster). (a) Hyperion, K = 2; (b) Hyperion, K = 3; (c) Hyperion, K = 4; (d) Hyperion, K = 5; (e) HYDICE, K = 3; (f) HYDICE, K = 4; (g) HYDICE, K = 5; (h) HYDICE, K = 6; (i) ROSIS, K = 3; (j) ROSIS, K = 4; (k) ROSIS, K

= 5; (l) ROSIS, K = 6; (m) AVIRIS, K = 3; (n) AVIRIS, K = 4; (o) AVIRIS, K = 5; and (p) AVIRIS, K = 6.

Table 3. Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the QuickBird image.

CVIs Cluster Number

2 3 * 4 5 6 7 8 9 10 PC+_0.754_0.800 _{0.728 0.699 0.658 0.623 0.626 0.593 0.579} PE−_0.565_0.532 _{0.749 0.866 1.008 1.134 1.150 1.272 1.341} MPC+_0.509_0.700 _{0.637 0.624 0.590 0.560 0.572 0.542 0.533} DBI− _0.822 _0.559 _{0.692 0.750 0.779 0.845 0.802 0.882 0.915} DI+_(e-3) _2.486 _3.591 _{2.539 2.539 2.614 2.614 3.439 3.439 3.439} CHI+_{(e4) 1.142 2.041 2.026 2.030 1.908 1.779}_2.116 _{1.998 1.971} FSI−_(e7) ₋_0.535 ₋_6.644 ₋_7.247 ₋_7.331 ₋_7.093 ₋_6.902 ₋_7.440 ₋_7.260 ₋_7.092 XBI−_0.160_0.103 _{0.218 0.236 0.231 0.287 0.237 0.325 0.277} KI−_{(e3) 1.601}_1.027 _{2.184 2.370 2.312 2.883 2.382 3.266 2.790} TI−_{(e3) 1.601}_1.029 _{2.189 2.376 2.320 2.894 2.395 3.284 2.808} SCI+ _{0.477 2.477 2.516 2.752 2.837 2.554}_3.546 _{3.456 3.349} CWBI−_{(e-2) 4.076}_2.826 _{4.284 4.964 5.399 6.902 6.592 8.628 8.499} WSJI−_0.365_0.171 _{0.256 0.339 0.399 0.648 0.618 1.060 1.023}

Figure 4. Clustering results of the hyperspectral images by FCM (each color represents a cluster). (a)Hyperion,K= 2; (b) Hyperion, K = 3; (c) Hyperion,K= 4; (d) Hyperion, K = 5; (e)HYDICE,

K= 3; (f) HYDICE,K= 4; (g) HYDICE,K= 5; (h) HYDICE,K= 6; (i)ROSIS,K= 3; (j) ROSIS,K= 4; (k) ROSIS,K= 5; (l) ROSIS,K = 6; (m) AVIRIS,K = 3; (n)AVIRIS, K= 4; (o) AVIRIS,K= 5; and (p) AVIRIS,K= 6.

Table 4.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the Landsat TM image.

CVIs Cluster Number

2 3 4 * 5 6 7 8 9 10 PC+ _0.922 _0.796 _0.794 _0.689 _0.657 _0.610 _0.587 _0.571 _0.472 PE´ _0.204 _0.521 _0.587 _0.856 _0.998 _1.168 _1.281 _1.354 _1.593 MPC+ _0.844 _0.695 _0.726 _0.612 _0.589 _0.545 _0.528 _0.517 _0.413 DBI´ _0.282 _0.764 _0.601 _0.999 _0.913 _1.055 _1.064 _1.185 _1.522 DI+_(e-3) _8.00 _7.548 _7.783 _8.042 _8.498 _8.893 _8.893 _8.893 _8.893 CHI+_(e5) _2.883 _2.608 _3.261 _2.914 _2.831 _2.568 _2.393 _2.348 _2.053 FSI´_(e7) _´_6.762 _´_7.850 _´_8.743 _´_8.423 _´_8.342 _´_8.132 _´_8.027 _´_7.945 _´_5.888 XBI´ _0.044 _0.177 _0.092 _0.599 _0.450 _0.626 _0.564 _0.661 _2.025 KI´_(e4) _0.334 _1.335 _0.693 _4.523 _3.402 _4.731 _4.259 _4.996 _15.300 TI´_(e4) _0.334 _1.335 _0.693 _4.515 _3.397 _4.721 _4.251 _4.985 _15.188 SCI+ _3.463 _3.319 _3.876 _3.609 _3.407 _2.916 _3.204 _2.362 _3.439 CWBI´_(e-2) _0.151 _0.142 _0.114 _0.291 _0.299 _0.403 _0.425 _0.457 _0.762 WSJI´ _0.167 _0.094 _0.057 _0.154 _0.159 _0.274 _0.307 _0.372 _1.015 PBMFI+_(e3) _1.014 _0.296 _0.018 _0.013 _0.006 _0.006 _0.001 _0.001 _0.001 SVFI+ _1.922 _2.287 _2.982 _3.367 _3.816 _4.106 _4.349 _4.224 _3.137 WLI´ 0.079 0.131 0.172 0.247 0.327 0.361 0.463 0.395 0.293

(14)

Remote Sens.2016,8, 295 14 of 22

Table 5.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the ETM+ image.

2 3 4 * 5 6 7 8 9 10 PC+ _0.404 _0.812 _0.760 _0.725 _0.703 _0.663 _0.628 _0.617 _0.593 PE´ _0.284 _0.503 _0.673 _0.802 _0.877 _1.012 _1.144 _1.204 _1.300 MPC+ _0.775 _0.717 _0.680 _0.656 _0.644 _0.607 _0.575 _0.569 _0.548 DBI´ _0.404 _0.603 _0.674 _0.731 _0.745 _0.861 _0.952 _0.937 _1.055 DI+_(e-3) _5.803 _7.595 _8.256 _8.889 _8.889 _0.104 _0.107 _0.107 _0.114 CHI+_(e5) _0.818 _0.909 _0.929 _0.929 _0.896 _0.893 _0.847 _0.847 _0.822 FSI´_(e8) _´_0.409 _´_0.483 _´_0.477 _´_0.461 _´_0.461 _´_0.439 _´_0.426 _´_0.422 _´_0.413 XBI´ _0.047 _0.090 _0.117 _0.169 _0.183 _0.202 _0.234 _0.216 _0.293 KI´_(e4) _0.100 _0.188 _0.244 _0.352 _0.382 _0.421 _0.489 _0.452 _0.613 TI´_(e4) _0.099 _0.189 _0.244 _0.353 _0.383 _0.422 _0.490 _0.454 _0.615 SCI+ _2.463 _3.259 _3.388 _3.723 _4.715 _5.137 _4.922 _4.779 _4.884 CWBI´ _0.054 _0.059 _0.076 _0.104 _0.112 _0.133 _0.164 _0.174 _0.226 WSJI´ _0.162 _0.351 _0.126 _0.214 _0.248 _0.348 _0.515 _0.601 _1.012 PBMFI+_(e3) _1.058 _0.156 _0.085 _0.088 _0.004 _0.005 _0.002 _0.014 _0.003 SVFI+ _2.173 _2.413 _3.027 _0.760 _3.155 _3.339 _3.521 _3.607 _3.648 WLI´ _0.093 _0.174 _0.307 _0.265 _0.244 _0.229 _0.259 _0.208 _0.279

Note:*denotes the actual cluster number of the image.

Table 6.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the GaoFen-1 image.

CVIs Cluster Number

2 3 4 5 * 6 7 8 9 10 PC+ _0.850 _0.751 _0.764 _0.779 _0.735 _0.710 _0.688 _0.669 _0.645 PE´ _0.374 _0.643 _0.663 _0.658 _1.183 _0.900 _0.978 _1.056 _1.140 MPC+ _0.700 _0.627 _0.620 _0.724 _0.682 _0.662 _0.643 _0.628 _0.606 DBI´ _0.587 _0.832 _0.767 _0.536 _0.701 _0.740 _0.789 _0.803 _0.896 DI+_(e-3) _4.715 _1.478 _1.470 _2.298 _2.348 _2.688 _3.028 _2.860 _2.965 CHI+_(e5) _0.764 _0.656 _0.681 _1.314 _1.185 _1.262 _1.270 _1.257 _1.197 FSI´_(e9) _´_0.662 _´_1.038 _´_1.253 _´_1.603 _´_1.571 _´_1.539 _´_1.519 _´_1.490 _´_1.462 XBI´ _0.100 _0.135 _0.136 _0.077 _0.198 _0.158 _0.140 _0.154 _0.285 KI´_(e4) _0.399 _0.541 _0.546 _0.309 _0.791 _0.634 _0.560 _0.617 _1.141 TI´_(e4) _0.399 _0.541 _0.546 _0.309 _0.792 _0.635 _0.561 _0.618 _1.141 SCI+ _1.031 _1.156 _1.364 _4.658 _3.899 _4.468 _4.470 _4.333 _3.992 CWBI´_(e-3) _0.147 _0.137 _0.144 _0.139 _0.240 _0.264 _0.268 _0.311 _0.445 WSJI´ _0.226 _0.934 _0.128 _0.114 _0.282 _0.348 _0.362 _0.487 _1.015 PBMFI+_(e4) _1.049 _0.243 _0.075 _0.053 _0.063 _0.019 _0.011 _0.040 _0.056 SVFI+ _2.315 _2.610 _3.027 _3.707 _3.739 _4.193 _4.009 _4.245 _4.165 WLI´ _0.201 _0.342 _0.307 _0.154 _0.174 _0.202 _0.196 _0.208 _0.226

Table 7.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the FLC1 image.

2 3 4 5 * 6 7 8 9 10 PC+ _0.760 _0.680 _0.584 _0.602 _0.555 _0.497 _0.451 _0.429 _0.398 PE´ _0.549 _0.834 _1.139 _1.160 _1.343 _1.556 _1.724 _1.831 _1.974 MPC+ _0.519 _0.520 _0.446 _0.503 _0.466 _0.414 _0.372 _0.358 _0.331 DBI´ _0.887 _0.909 _1.052 _0.896 _0.937 _1.008 _1.432 _1.410 _1.475 DI+_(e-2) _0.806 _1.330 _1.048 _1.613 _1.365 _1.495 _1.259 _1.259 _1.542 CHI+_(e4) _1.274 _1.537 _1.273 _1.616 _1.521 _1.354 _1.249 _1.194 _1.105 FSI´_(e6) _´_0.666 _´_5.029 _´_5.479 _´_8.809 _´_8.573 _´_7.979 _´_7.511 _´_7.200 _´_6.794 XBI´ _0.206 _0.186 _0.380 _0.224 _0.268 _0.334 _0.636 _0.616 _0.576 KI´_(e4) _0.299 _0.270 _0.552 _0.325 _0.390 _0.485 _0.924 _0.896 _0.837 TI´_(e4) _0.299 _0.270 _0.552 _0.325 _0.390 _0.485 _0.924 _0.896 _0.838 SCI+ _0.307 _0.624 _0.383 _0.913 _1.089 _0.672 _0.639 _0.815 _0.681 CWBI´ _0.126 _0.098 _0.123 _0.109 _0.128 _0.154 _0.221 _0.230 _0.238 WSJI´ _0.410 _0.598 _0.294 _0.241 _0.305 _0.425 _0.877 _0.968 _1.049 PBMFI+ _186.178 _84.915 _27.891 _5.076 _16.638 _5.969 _5.374 _2.140 _0.932 SVFI+ _1.185 _1.874 _2.389 _3.040 _3.424 _3.680 _3.439 _3.679 _3.803 WLI´ _0.415 _0.535 _0.708 _0.523 _0.584 _0.709 _7.591 _0.727 _0.777

(15)

Table 8.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the Hyperion image.

CVIs Cluster Number

2 3 4 * 5 6 7 8 9 10 PC+ _0.867 _0.759 _0.682 _0.658 _0.596 _0.568 _0.530 _0.494 _0.476 PE´ _0.337 _0.626 _0.869 _0.973 _1.176 _1.293 _1.435 _1.577 _1.666 MPC+ _0.735 _0.638 _0.576 _0.573 _0.515 _0.496 _0.463 _0.430 _0.417 DBI´ _0.472 _0.651 _0.732 _0.726 _0.853 _0.856 _0.946 _1.060 _1.032 DI+_(e-2) _2.169 _2.979 _2.837 _2.900 _3.489 _3.332 _3.518 _2.971 _3.916 CHI+_(e4) _4.690 _5.329 _5.328 _5.779 _5.485 _5.383 _5.158 _4.925 _4.846 FSI´_(e11) _´_4.610 _´_6.600 _´_6.902 _´_7.003 _´_6.808 _´_6.619 _´_6.414 _´_6.209 _´_6.039 XBI´ _0.061 _0.139 _0.157 _0.149 _0.217 _0.193 _0.228 _0.275 _0.239 KI´_(e3) _1.131 _2.552 _2.888 _2.750 _3.995 _3.560 _4.201 _5.064 _4.400 TI´_(e3) _1.131 _2.555 _2.892 _2.755 _4.004 _3.569 _4.213 _5.080 _4.416 SCI+ _2.241 _3.109 _3.161 _4.201 _4.012 _4.586 _4.574 _4.469 _4.605 CWBI´_(e-3) _0.440 _0.485 _0.597 _0.651 _0.900 _0.929 _1.118 _1.347 _1.333 WSJI´ _0.244 _0.681 _0.207 _0.244 _0.446 _0.491 _0.706 _1.019 _1.020 PBMFI+_(e6) _4.131 _6.247 _1.732 _0.319 _0.528 _0.325 _0.184 _0.009 _0.005 SVFI+ _2.204 _2.459 _2.867 _3.124 _3.307 _3.536 _3.697 _3.764 _3.885 WLI´ _1.222 _0.185 _0.241 _0.231 _0.242 _0.244 _0.254 _0.281 _0.307

Table 9.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the HYDICE image.

2 3 4 * 5 6 7 8 9 10 PC+ _0.752 _0.729 _0.669 _0.621 _0.587 _0.554 _0.541 _0.511 _0.502 PE´ _0.573 _0.717 _0.922 _1.106 _1.246 _1.382 _1.471 _1.598 _1.656 MPC+ _0.504 _0.594 _0.558 _0.526 _0.505 _0.479 _0.475 _0.450 _0.447 DBI´ _0.888 _0.669 _0.747 _0.824 _0.828 _0.899 _0.827 _0.888 _0.888 DI+_(e-2) _1.194 _1.070 _1.165 _1.236 _1.081 _1.190 _1.897 _1.098 _1.089 CHI+_(e4) _1.057 _1.494 _1.473 _1.618 _1.606 _1.508 _1.634 _1.587 _1.594 FSI´_(e12) _0.004 _´_1.258 _´_1.511 _´_1.592 _´_1.627 _´_1.603 _´_1.608 _´_1.579 _´_1.567 XBI´ _0.196 _0.105 _0.149 _0.168 _0.231 _0.258 _0.223 _0.315 _0.260 KI´_(e3) _2.025 _1.084 _1.545 _1.733 _2.393 _2.671 _2.306 _3.258 _2.694 TI´_(e3) _2.026 _1.085 _1.547 _1.737 _2.398 _2.678 _2.313 _3.268 _2.704 SCI+ _0.391 _1.878 _1.906 _1.997 _2.151 _1.733 _1.911 _1.691 _1.901 CWBI´_(e-4) _2.538 _1.761 _2.017 _2.395 _3.082 _3.508 _3.585 _4.598 _4.398 WSJI´ _0.422 _1.082 _0.229 _0.299 _0.482 _0.621 _0.671 _1.111 _1.034 PBMFI+_(e6) _27.978 _28.242 _7.311 _1.490 _2.274 _1.228 _0.250 _0.655 _0.118 SVFI+ _1.560 _2.490 _2.928 _3.298 _3.566 _3.689 _4.202 _4.352 _4.349 WLI´ _0.390 _0.263 _0.288 _0.323 _0.388 _0.406 _0.475 _0.474 _0.387

Table 10.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the ROSIS image.

CVIs Cluster Number

2 3 4 5 * 6 7 8 9 10 PC+ _0.703 _0.664 _0.615 _0.594 _0.548 _0.504 _0.477 _0.461 _0.443 PE´ _0.661 _0.878 _1.076 _1.204 _1.381 _1.541 _1.672 _1.775 _1.874 MPC+ _0.406 _0.495 _0.486 _0.492 _0.457 _0.421 _0.403 _0.393 _0.381 DBI´ _1.305 _0.796 _0.894 _0.876 _1.001 _1.464 _1.405 _1.371 _1.349 DI+_(e-2) _0.580 _0.656 _0.711 _0.678 _0.676 _0.606 _0.628 _0.562 _0.562 CHI+_(e4) _0.834 _1.323 _1.217 _1.109 _1.000 _0.980 _0.905 _0.844 _0.777 FSI´_(e11) _2.514 _´_1.127 _´_1.939 _´_2.526 _´_2.678 _´_2.627 _´_2.636 _´_2.671 _´_2.637 XBI´ _0.427 _0.158 _0.273 _0.213 _0.481 _0.715 _0.666 _0.623 _0.591 KI´_(e4) _0.790 _0.293 _0.506 _0.394 _0.890 _1.324 _1.233 _1.154 _1.094 TI´_(e4) _0.790 _0.293 _0.506 _0.394 _0.891 _1.325 _1.234 _1.155 _1.095 SCI+ _´_0.011 _0.316 _0.287 _0.390 _0.206 _0.333 _0.179 _0.147 _0.591 CWBI´_(e-3) _0.731 _0.424 _0.493 _0.451 _0.700 _0.872 _0.946 _1.009 _1.082 WSJI´ _0.502 _0.743 _0.261 _0.224 _0.432 _0.660 _0.789 _0.913 _1.058 PBMFI+_(e6) _2.883 _1.809 _0.068 _0.014 _0.010 _0.015 _0.007 _0.003 _0.002 SVFI+ _0.569 _1.720 _2.187 _2.958 _3.290 _3.163 _3.658 _4.011 _4.288 WLI´ _0.859 _0.453 _0.624 _0.665 _0.872 _0.678 _0.848 _0.857 _0.848

(16)

Remote Sens.2016,8, 295 16 of 22

Table 11.Variations of the 16 CVIs with cluster numbers ranging from 2 to 10 for the AVIRIS image.

2 3 4 5 * 6 7 8 9 10 PC+ _0.843 _0.838 _0.856 _0.732 _0.763 _0.700 _0.681 _0.661 _0.631 PE´ _0.395 _0.451 _0.451 _0.760 _0.691 _0.889 _0.945 _1.014 _1.135 MPC+ _0.686 _0.757 _0.808 _0.665 _0.715 _0.650 _0.636 _0.619 _0.590 DBI´ _0.690 _0.557 _0.383 _0.689 _0.617 _0.802 _0.864 _0.949 _0.962 DI+_(e-2) _0.615 _1.524 _1.567 _0.871 _1.198 _1.236 _1.523 _1.297 _1.312 CHI+_(e4) _2.105 _2.991 _5.686 _4.681 _6.460 _5.642 _6.716 _6.230 _5.556 FSI´_(e11) _´_1.098 _´_4.465 _´_6.317 _´_5.999 _´_6.803 _´_6.631 _´_6.331 _´_6.151 _´_6.004 XBI´ _0.117 _0.125 _0.065 _0.797 _0.562 _0.948 _0.696 _0.643 _0.829 KI´_(e4) _0.195 _0.210 _0.109 _1.335 _0.943 _1.589 _1.168 _1.080 _1.394 TI´_(e4) _0.195 _0.210 _0.109 _1.338 _0.946 _1.595 _1.173 _1.086 _1.401 SCI+ _1.113 _1.958 _4.715 _3.844 _5.684 _4.813 _5.351 _5.869 _6.006 CWBI´_(e-3) _0.982 _0.545 _0.407 _1.257 _1.367 _2.082 _1.938 _1.997 _2.376 WSJI´ _0.360 _0.665 _0.079 _0.276 _0.335 _0.742 _0.676 _0.721 _1.016 PBMFI+_(e5) _21.751 _14.564 _0.855 _4.139 _0.950 _1.279 _1.925 _0.910 _1.002 SVFI+ _1.942 _2.653 _3.414 _3.544 _3.620 _4.129 _3.422 _3.220 _3.007 WLI´ _0.235 _0.213 _0.127 _0.173 _0.149 _0.200 _0.158 _0.152 _0.135

Figure3a–d shows the clustering results of the simple QuickBird image with the cluster number K“2, 3, 4, 5. The three ground truth classes (road, paddy field, and farmland) of the image were well identified with cluster numberK“3 (Figure3b). As listed in Table3, the majority of CVIs correctly indicated the actual cluster number of this simple image (except CHI, FSI, SCI, and SVFI).

Figure3e-h illustrates the clustering results of the Landsat TM image with the cluster number K“2, 3, 4, 5. Among them, the clustering withK“4 succeeded in separating the four ground truth classes of the image (forest, farmland, water, and town) (Figure3g). The clustering withK“2 was obviously incorrect since three ground truth classes,i.e., forest, farmland, and town were merged into one class (Figure3e). Unfortunately, as shown in Table4most indices (DBI, PC, PE, MPC, XBI, KI, TI, PBMFI, and WLI) underestimated the real situation, which preferred two as the cluster number of the image; whereas a clear overestimation was given by DI and SVFI; only five CVIs including CHI, FSI, SCI, CWBI, and WSJI provided the actual cluster number of the image.

Figure3i–l portrays the clustering results of the Landsat ETM+ image with the cluster number K “2, 3, 4, 5, respectively. The four ground truth classes of the image (marsh, forest, water, and farmland) were well separated with cluster numberK“4 (Figure3k). However, similar to the Landsat TM experiment, most indices (PE, MPC, XBI, KI, TI, CWBI, PBMFI, and WLI) recommended two clusters as the optimal partitioning of the image (Table5). CHI and WSJI were the only two indices that correctly indicated the cluster number of the image.

Figure3m–p presents the clustering results of the GaoFen-1 image with the cluster number K“2, 4, 5, 6. The five ground truth classes of the image, namely water1 (light colored), water2 (dark colored), grass, soil, and sand, were well distinguished with cluster numberK“5 (Figure3o). This was correctly indicated by most CVIs including DBI, CHI, MPC, FSI, XBI, KI, TI, SCI, WSJI, and WLI (Table6). For the rest of the CVIs that erroneously indicated the cluster number, most of them (DI, PC, PE, and PBMFI) suggested two.

Figure3q–t demonstrates the clustering results of the FLC1 image with the cluster number K“2, 3, 5, 6. The five ground truth classes of the image (soybeans, oats, corn, wheat, and red clover) were fairly well identified with cluster numberK “5 (Figure3s). For the cases of clustering with K“2 andK“3, obvious misclassifications were observed, with some separated classes being merged into one class (Figure3q,r). As shown in Table7, there were four CVIs (DI, CHI, FSI, and WSJI) that provided the actual cluster number (K “5) for the image. But five CVIs (DBI, PC, PE, PBMFI, and WLI) and five others (MPC, XBI, KI, TI, and CWBI) erroneously supported the clustering with cluster numberK“2 andK“3, respectively.

Figure4a–d depicts the clustering results of Hyperion data with the cluster numberK“2, 3, 4, 5. The four ground truth classes (woodland, island interior, water, and floodplain grasses) were well classified with cluster numberK“4 (Figure4c). Clustering results with other cluster numbers were

(17)

obviously not satisfactory. For example, in the case of clustering withK“2, three (without water) of the four classes were wrongly merged into one class (Figure4a). This was chosen by half the total CVIs, namely DBI, PC, PE, MPC, XBI, KI, TI, and CWBI (Table8). In fact, all of the CVIs, except WSJI, failed to detect the actual cluster number of the image.

Figure4e–h provides the clustering results of HYDICE data with the cluster numberK“3, 4, 5, 6, of which the clustering withK“4 properly separated the four ground truth classes (roads, trees, trail, and grass) (Figure4f). In the case of clustering withK“3, there were clear errors due to the incorrect merging of trees and roads (Figure4e). However, it was still suggested by half of the CVIs, including DBI, MPC, XBI, KI, CWBI, TI, PBMFI, and WLI (Table9). Similar to the experiment on Hyperion, WSJI was the only index returning the correct information about cluster number.

Figure4i–l lists the clustering results of ROSIS data with the cluster number K “ 3, 4, 5, 6. Among them, the clustering withK“5 successfully classified the five ground truth classes (meadows, trees, asphalt, bricks, and shadows) (Figure4k). For the case ofK“3, trees and shadows were not distinguished (Figure4i). However, this incorrect suggestion was also made by as many as half of the CVIs, including DBI, CHI, MPC, XBI, KI, TI, CWBI, and WLI (Table10). Once again, only WSJI correctly indicated the actual cluster number of the image.

Figure4m–p shows the clustering results of AVIRIS data with the cluster numberK“3, 4, 5, 6, in which the five ground truth classes (vineyard untrained, celery, fallow smooth, fallow plow, and stubble) were well classified withK“5 (Figure4o). However, no CVI was able to indicate the actual cluster number of the image (Table11). Instead, most of them (DBI, DI, PC, MPC, XBI, KI, TI, CWBI, WSJI, and WLI) preferred four clusters for the image, which merged the classes of fallow smooth and fallow plow (Figure4n).

Figure5illustrates the percentage of successes (correct guesses) achieved by the 16 CVIs. Table12 summarizes the cluster validity results of the 16 CVIs by FCM on nine types of remote sensing image datasets, in which the actual cluster number of each image is listed in column K# while those indicated by CVIs are shown in other columns. From the table it can be seen that WSJI was the only index that correctly recognized the actual cluster numbers of all of the datasets (including multispectral and hyperspectral data), except for the AVIRIS image. Thus WSJI, was the most effective and stable index of all. CHI and FSI succeeded in multispectral datasets but failed in hyperspectral datasets. The DBI, DI, MPC, SCI, XBI, KI, TI, CWBI, and WLI indices were only effective for two multispectral images. CVIs including PC, PE, and PBMFI failed, generally, except for the simple QuickBird experiment. SVFI failed for all images.

Remote Sens. 2016, 8, 295 18 of 23

(meadows, trees, asphalt, bricks, and shadows) (Figure 4k). For the case of

K

=

3

, trees and shadows were not distinguished (Figure 4i). However, this incorrect suggestion was also made by as many as half of the CVIs, including DBI, CHI, MPC, XBI, KI, TI, CWBI, and WLI (Table 10). Once again, only WSJI correctly indicated the actual cluster number of the image.

Figure 4m–p shows the clustering results of AVIRIS data with the cluster number

6

,

5

,

4

,

3

=

K

, in which the five ground truth classes (vineyard untrained, celery, fallow smooth, fallow plow, and stubble) were well classified with

K

=

5

(Figure 4o). However, no CVI was able to indicate the actual cluster number of the image (Table 11). Instead, most of them (DBI, DI, PC, MPC, XBI, KI, TI, CWBI, WSJI, and WLI) preferred four clusters for the image, which merged the classes of fallow smooth and fallow plow (Figure 4n).

Figure 5 illustrates the percentage of successes (correct guesses) achieved by the 16 CVIs. Table 12 summarizes the cluster validity results of the 16 CVIs by FCM on nine types of remote sensing image datasets, in which the actual cluster number of each image is listed in column K# while those indicated by CVIs are shown in other columns. From the table it can be seen that WSJI was the only index that correctly recognized the actual cluster numbers of all of the datasets (including multispectral and hyperspectral data), except for the AVIRIS image. Thus WSJI, was the most effective and stable index of all. CHI and FSI succeeded in multispectral datasets but failed in hyperspectral datasets. The DBI, DI, MPC, SCI, XBI, KI, TI, CWBI, and WLI indices were only effective for two multispectral images. CVIs including PC, PE, and PBMFI failed, generally, except for the simple QuickBird experiment. SVFI failed for all images.

Figure 5. The overall performance of CVIs by applying FCM algorithm to cluster nine types of remote sensing datasets.

Table 12. The optimal cluster numbers indicated by the CVIs by FCM for each remote sensing image.

Images K# PC PE MPC DBI DI CHI FSI SCI

Multispectral image QuickBird 3 3 * 3 * 3 * 3 * 3 * 8 8 8 Landsat TM 4 2 2 2 2 7 4 * 4 * 4 * Landsat ETM+ 4 3 2 2 2 5 4 * 3 7 GaoFen-1 5 2 2 5 * 5 * 2 5 * 5 * 5 * FLC1 5 2 2 3 2 5 * 5 * 5 * 6 Hyperspectral image Hyperion 4 2 2 2 2 10 5 6 10 HYDICE