Texture Classification Based on Complex
Network Model with Spatial Information
著者
Thewsuwan Srisupang, Horio Keiichi
journal or
publication title
International Journal of Innovative Computing,
Information and Control
volume
14
number
3
page range
1113-1121
year
2018-06
URL
http://hdl.handle.net/10228/00006770
doi: info:doi/10.24507/ijicic.14.03.1113Computing, Information and Control ICIC International c⃝2018 ISSN 1349-4198 Volume14, Number3, June2018 pp. 1113–1121
TEXTURE CLASSIFICATION BY LOCAL SPATIAL PATTERN MAPPING BASED ON COMPLEX NETWORK MODEL
Srisupang Thewsuwan and Keiichi Horio Graduate School of Life Science and Systems Engineering
Kyushu Institute of Technology
2-4 Hibikino, Wakamatsu, Kitakyushu 808-0196, Japan
[email protected]; [email protected] Received August 2017; revised December 2017
Abstract. This paper proposes a method for image texture classification based on a complex network model. Finding relevant and valuable information in an image texture is an essential issue for image classification and remains a challenge. Recently, a com-plex network model has been used for texture analysis and classification. However, with current analysis methods, important empirical properties of image texture such as spa-tial information are discarded from consideration. Accordingly, we propose local spaspa-tial pattern mapping (LSPM) method for manipulating the spatial information in an image texture with multi-radial distance analysis to capture the texture pattern. In experiments, the feature properties under the traditional complex network model and those with the pro-posed method are analyzed by using the Brodatz, UIUC, and Outex databases. As results, the proposed method is shown to be effective for texture classification, providing an im-proved classification rate as compared to the traditional complex network model.
Keywords: Complex network, Spatial arrangement, Spatial information, Texture anal-ysis, Texture classification
1. Introduction. Texture is important information for characterizing the appearance of an image. The attributes of a texture can be described qualitatively in such terms as coarseness, orientation, and spatial relationship appearing as uniformity of image intensity [1, 2]. Coarseness and directionality are two important attributes of textures which aid in discrimination. Coarseness relates to the size of the texture elements, whereas direction-ality corresponds to the orientation and spatial arrangement of the texture elements [2]. Texture analysis has played an important role in texture discrimination by quantifying image properties. There are a growing number of techniques described in the literature for texture characterization. These techniques are able to extract and to characterize texture information using first-order features [1], Fourier descriptors [3], Gabor filters [4], graph theory [5, 6], and complex networks (CN) [7].
In recent years, graph-based representation and complex network model have been effi-ciently applied in texture analysis [8, 9]. Backes et al. [9] proposed a traditional complex network model for texture analysis and classification. Graph-based representations have been used to characterize the topological structure of networks, including image pixels [6]. Vertex measurement is obtained in terms of the distribution vertex degree or number of edges incident on a particular vertex. This numerical measure of connectivity between a vertex and its neighbors can be used to characterize the texture attributes of an im-age. The coarseness and orientation of an image structure can be described in terms of the topological properties of the network. However, other empirical properties of image texture such as spatial arrangement are discarded from the statistical properties by the conventional vertex measurement.
1114 S. THEWSUWAN AND K. HORIO
Based on the flexibility of complex networks for characterizing textural structures, it is interested in applying standard pattern recognition techniques to the complex network model of [9] in order to improve texture descriptors. Regarding the pattern analysis tech-niques, local binary pattern (LBP) operator is employed in this work. Ojala et al. [10, 11] proposed the LBP texture operator for classification, which analyzes differences between a central pixel and its neighbors by a thresholding which considers the intensities as binary numbers. The original LBP has been extended to improve the rotation invariance which calls this operator LBPriu2 [12]. The LBPriu2 operator is great measures of the spatial structure of local image texture and therefore the advantage of the LBPriu2 operator has been used in this paper.
This paper proposes a method to characterize texture primitives by considering spatial information based on the complex network model of [9] for texture classification. The LBPriu2 method is adopted to analyze the spatial arrangements of spatial patterns in the texture elements. Three standard texture databases, Brodatz [13], UIUC [14], and Outex [15] are used for evaluation. The experimental results show the effectiveness of the proposed method compared to the traditional complex network model [9] and texture analysis based on conventional methods. The remainder of the paper is organized as follows. Section 2 explains our proposed method, which is based on image graph-based representation and includes the spatial information analysis. The feature descriptors are explained in Section 3 followed by experimental results and discussions in Section 4. Finally, Section 5 concludes this work.
2. Proposed Method.
2.1. Image graph-based representation.
2.1.1. Weights of edges. Image textures can be represented as pixel networks based on graph theory [6]. Figure 1 shows an example of the image graph-based representation. Suppose, image I with a resolution of M ×N pixels. Let I(i, j) = g, i = 1, . . . , M and
Figure 1. Image pixel representation based on graph theory: (a) each pixel of the image is a vertex in the graph, (b) two vertices are connected if d(vi, vj)≤r (r= 3 in this example), whereas a weighted graph is defined
by Equation (1), (c) a threshold t is applied to imitating a transformation in the network (t = 0.245 in this example), and (d) the binary pattern transformation after passing thresholding
j = 1, . . . , N, where i and j are the pixel indices of I(i, j). Let G= (V, E) be the graph comprising the set of vertices V and the set of edges E. Each pixel in the image is a vertex vi ∈ V. The two vertices are connected by a non-directed edge e ∈ E, e = eij,
when the Euclidean radial distance between them is less than or equal to r (see Figure 1(b)). Based on [9], the weight of the edges eij is defined as
W(eij) = ∥ri−rj∥+r2|I(i)−LI(j)| r2+r2 if d(vi, vj)≤r 0 otherwise, (1)
where ri and rj are the coordinates of pixels i and j in image I. The intensities of pixels i and j are denoted by I(i) and I(j). L is the maximum value of intensity at a radial distancer from either vi and vj.
2.1.2. Threshold. A threshold (t) is a parameter related to the property of being an edge in graph theory [5, 6]. In the traditional complex network model [9], a set of thresholds is used to construct a network that imitates dynamic transformation for the purpose of texture analysis. Thus, we can determine the spatial relationships of the attributes of textures when applying threshold parameters in the complex network model. The threshold t value is applied to the original set of edges, as illustrated in Figure 1(c). In this study, threshold values are obtained through an experiment. Then, the binary pattern transformation process is performed by converting the vertices whose weights are less than or equal to threshold t to 1, while the remaining vertices are converted to 0 as in Figure 1(d). This process is defined as follows:
Wb(t)(eij) =
{
1 if W(eij)≤t
0 otherwise. (2)
2.2. Spatial pattern analysis. In the traditional complex network model [9], the tex-ture properties are characterized by the distribution of vertex degree. However, some informative properties such as spatial arrangement should be determined among the ex-tracted topological features. To our knowledge, spatial information was never previously been analyzed in texture analysis based on a complex network model. Therefore, spatial arrangement is the focus of this paper.
2.2.1. Local spatial pattern mapping. Multi-radial distance analysis is employed for feature representation as defined below. After the binary pattern transformation, the neighbors of a vertex vi which have Euclidean radial distances r1, r2 and r3 equal to 1, 2, and 3 are constructed by the radial symmetric neighborhood as in Figure 2. This approach enables us to describe local context information about pixel surroundings, (indicated in gray in the figure). The results of these binary row records are used for encoding the spatial arrangement in the next process.
A set of binary features for a radial distance r is denoted by
k(t)(eij) = [ Wb(t)(ei1), W (t) b (ei2), . . . , W (t) b (eip1) ] if d(vi, vj)< r1 [ Wb(t)(ei1), W (t) b (ei2), . . . , W (t) b (eip2) ] if r1 ≤d(vi, vj)< r2 [ Wb(t)(ei1), W (t) b (ei2), . . . , W (t) b (eip3) ] if r2 ≤d(vi, vj)< r3 0 otherwise, (3)
where j = 1,2, . . . , pn, n = 1,2 and 3. The pn is denoted as vertices neighborhood of
1116 S. THEWSUWAN AND K. HORIO
Figure 2. Local spatial pattern mapping with the radially symmetric neighbor sets for different (p, r)
neighborhood p1,p2, and p3 are equal to 4, 8, and 16, whereas the radial distances r1, r2 and r3 equal to 1, 2 and 3, respectively.
A spatial arrangement analysis is performed by local spatial pattern mapping or LSPM. The uniformity of LBP mapping [10, 12] is adapted for spatial mapping at different radial distances. The LSPM method is used to describe the uniformity of texture primitives when the binary pattern of a binary row record contains at most two bit-wise transitions between 0 and 1 in the same way as uniformity in LBP theory [10]. We define the LSPM method as follows: lspm(eij) = pn ∑ j=1 k(t)(eij)2(j−1), (4)
where pn = p1, p2 and p3, which are equal to 4, 8, and 16, respectively. For considering the uniformity of lspm, the following equation is used:
LSPM(pn, rn) =
{
lspm(eij) if U(lspm(eij))≤2
p+ 1 otherwise, (5)
where U is the uniform pattern ofk(t)(e
ij) in Equation (3), which is determined when the
binary pattern of a binary row record contains at most two bit-wise transitions between 0 and 1. For example, the pattern of 00000000 shows the U value of 0, whereas the binary pattern of 11000001 shows U equal to 2 as justified by [10]. This equation means that if the lspm(eij) has U > 2, it defines for non-uniform pattern. This step enables
us to analyze the uniform pattern of pixel surroundings, which can refer to local texture analysis. In practice, LSPM is implemented by using a look-up table of 2pn elements. In
histogram for the radial analyses LSPM(pn, rn) are defined as follows: F(t)(vi) = LSPM(4,1) LSPM(8,2) LSPM(16,3) ⊤ . (6)
3. Feature Descriptors. The feature properties in the system analyzed herein are given by (6). A probability density function of histograms is computed to obtain statistical properties. The set of statistical descriptors is given by [9] and comprises the mean, contrast, energy, and entropy, which are defined by
p(i) = ∑pLSPM(pn, rn)
n+2
i=1 LSPM(pn, rn)
, n= 1,2,3, (7)
where p1, p2, and p3 are equal to 4, 8, and 16, whereas r1, r2 and r3 equal to 1, 2 and 3, respectively. For vertex characterization using the histogram of LSPM, the following set of features was evaluated:
µ= p∑n+2 i=0 ip(i) C= p∑n+2 i=0 p(i)i2 E = p∑n+2 i=0 [ p(i)2] H =− p∑n+2 i=0 p(i)log2[p(i)]. (8)
The set of complex network properties in terms of LSPM with a threshold set{t0, t1, . . .,
tend} is given by Mean = [ µ(F(t0)(v i) ) , µ(F(t1)(v i) ) , . . . , µ(F(tend)(v i) ) ] Contrast = [ C(F(t0)(v i) ) , C(F(t1)(v i) ) , . . . , C(F(tend)(v i) ) ] Energy = [ E(F(t0)(v i) ) , E(F(t1)(v i) ) , . . . , E(F(tend)(v i) ) ] Entropy = [ H(F(t0)(v i) ) , H(F(t1)(v i) ) , . . . , H(F(tend)(v i) ) ] . (9)
In order to evaluate our proposed method, a discrimination function for texture classi-fication was generated by a support vector machine (SVM). In the implementation of this work, the Classification Learner app of MATLAB 2016a version with default parameter values was used for multi-class classification with the one-versus-all strategy. An SVM with a quadratic kernel was integrated for evaluation following 10-fold cross-validation.
4. Experimental Results. For the present study, three experiments were conducted to compare the results between the traditional complex network texture descriptor (CNTD) [9] and our proposed method. The first experiment was a comparison of threshold sets as listed in Table 1. The objective of this experiment was the selection of the best threshold set by using Brodatz as the validation database. The results are shown in Table 2. The second experiment examined combinations of feature descriptors by using the threshold set which was selected based on the first experiment. Along with Brodatz, this experiment used the additional two texture databases, UIUC and Outex for the evaluation. The last experiment was a comparison with other conventional methods which include Gabor filters, Fourier descriptors, LBP, LBPriu2, and CNTD.
1118 S. THEWSUWAN AND K. HORIO
Table 1. Threshold sets for experiments
Set Thresholds CNTD [9] Proposed
t0 tstep tend No. feature No. feature T1 0.2 0.015 0.335 10 30 T2 0.2 0.015 0.485 20 60 T3 0.2 0.015 0.590 27 81
Table 2. Results for Brodatz database by threshold set
Feature Descriptors
Threshold CNTD [9] Proposed Mean Contrast Energy Entropy Set No.
features Success rate (%) No. feature Success rate (%) X − − − T1 10 36.22 30 76.76 X − − − T2 20 38.83 60 77.84 X − − − T3 27 42.25 81 80.36 − X − − T1 10 37.30 30 77.12 − X − − T2 20 44.23 60 79.37 − X − − T3 27 46.31 81 81.98 − − X − T1 10 62.07 30 63.78 − − X − T2 20 70.63 60 65.68 − − X − T3 27 73.24 81 77.21 − − − X T1 10 62.16 30 63.42 − − − X T2 20 75.14 60 63.96 − − − X T3 27 77.84 81 79.28
4.1. Databases. Three standard texture databases were used for evaluation in this study. First, the Brodatz texture album [13] is a benchmark for evaluating methods. This dataset is composed of 111 classes, each class containing 10 grayscale samples of 100 × 100 pixels which are 10 non-overlapping of sub-images. Second, the UIUC database [14] is a very challenging database for evaluation of texture recognition because the images were obtained in an uncontrolled environment, viewpoint, and scale. For each of 25 classes, 128×128-pixel 40 grayscales images were considered. Finally, the Outex database Suite (Outex TC 0013) [15] has 68 classes, each class containing 20 images, totaling 1360 grayscales images, each 128 × 128 pixels.
4.2. Comparison of results from different threshold sets. The first experiment determined the appropriate set of threshold values to be used. The most meaningful information could be extracted when t ≤0.590. We followed [9] in defining tstep as 0.015. Table 1 defines the threshold sets T1, T2, and T3 and lists the numbers of features separately for CNTD and the proposed method. The proposed method generates three times as many features compared with the CNTD method because the radial distance
r here sets equal to 3, as shown in Equation (6). Using the threshold sets in Table 1, an experiment on statistical descriptors was performed using the Brodatz database. The results in Table 2 indicate that the number of features affects the success rate. Based on the success rates, we chose the T3 configuration for the CNTD method in the second experiment, whereas the T1 configuration was chosen for the proposed method because the numbers of features and success rates were close to those of CNTD with T3. Table 2 shows the proposed method produced significantly better results for all feature descriptors. In a spectrum of textures from regular textures to stochastic textures on the Brodatz
database, the proposed method can describe irregular, near-stochastic, and stochastic textures better than the CNTD method. These reasons imply the efficiency of using spatial arrangement to improve texture classification based on a complex network model.
4.3. Results by combination of feature descriptors. The selected threshold config-urations as explained in Section 4.2 were used in the second experiment for evaluating combinations of feature descriptors. Table 3 lists the numbers of features by combination of feature descriptors, and Figure 3 shows the classification results for the Brodatz, UIUC, and Outex databases for CNTD and the proposed method. For the Brodatz database, good performance was obtained for all combinations of descriptors for the same reasons as discussed with reference to Table 2. In order to clarify the performance of the proposed method, this second experiment evaluated it with respect to the UIUC database. The UIUC database is more challenging because it was collected the database in uncontrolled environments including viewpoints (rotations) and scale changes. Based on the UIUC re-sults in Figure 3, the proposed method outperformed CNTD. This is because LSPM can extract more important information from local image texture. For example, an arrange-ment of neighbors at the same radial distance is identified as a radially symmetric set for encoding the spatial arrangement. Moreover, the uniformity of texture patterns has been
Table 3. Number of features by combination of feature descriptors Feature Descriptors CNTD [9] Proposed No. Mean Contrast Energy Entropy No. feature (T3) No. feature (T1)
1 X X − − 54 60 2 X − X − 54 60 3 X − − X 54 60 4 − X X − 54 60 5 − X − X 54 60 6 − − X X 54 60 7 X X X − 81 90 8 X X − X 81 90 9 X − X X 81 90 10 − X X X 81 90 11 X X X X 108 120
1120 S. THEWSUWAN AND K. HORIO
analyzed based on the LBPriu2 mapping. Thus, LSPM can characterize the texture of sig-nificant feature patterns by using their spatial arrangement. As a result, we can conclude that the proposed method is invariant to scale and rotation. For the Outex database, the performance was obtained in the same scale as compared with CNTD method was ob-tained for all combinations of descriptors as shown in Figure 3. The texture images of the Outex database are obtained under the various illumination condition which influenced the performance. Although this condition has an effect on the system, this approach was good to compare with other texture analysis methods.
4.4. Comparison with other methods. The success rates and numbers of features of the third experiment are listed in Table 4. The success rates of the CNTD and the proposed method are shown for descriptors set No. 9 (Table 3). For more evaluating our proposed method, the additional conventional texture analysis methods are chosen for comparison which include, Gabor filters [4], Fourier descriptors [3], LBP and LBPriu2 texture operators [10, 11] and CNTD [9]. For the experiments, we used a total of 40 filters (combination of 8 rotation filters and 5 scale filters) a frequency range from 1.2 to 1.4 by using energy as a descriptor for the Gabor filters method. The Fourier descriptors are computed 90 descriptors from the shifted spectrum. For LBP and LBPriu2 operators, the histograms were concatenated for (P, R) = (2,8) to characterize texture, resulting in a total of 256 and 10 descriptors, respectively.
Table 4. Comparison of the proposed and other methods
Method No. features Success rate (%) Brodatz UIUC Outex Gabor filters 40 76.58 60.00 74.26 Fourier descriptors 90 78.02 72.40 72.13 LBP 256 82.52 51.30 71.47 LBPriu2 10 81.90 75.90 73.81 CNTD 81 69.55 74.60 84.49 Proposed method 90 81.08 81.70 82.28
As the results in Table 4, the LBP and LBPriu2 methods outperformed the other meth-ods in the Brodatz database. In this work, we built the new Brodatz dataset by cropping ten subsections with non-overlapping of a larger Brodatz image. Thus, some image is difficult to distinguish between each class. The complex network model can extract the meaningful information on the local textural pattern by a set of thresholds as we described in Section 4.2, whereas the result is shown to be effective in the Outex database. On the other hand, the LBP and LBPriu2methods extract local features using the sign of the dif-ference between a central pixel and its neighbors for thresholding, whereas the results turn to be effective in the Brodatz database more than the CNTD and the proposed method. These points can indicate the set of thresholds has influences on the performance of the CNTD and the proposed method, which can be a promising in the future work. For the UIUC database, the proposed method outperformed the other methods. The different viewpoints with respective distortion and non-rigid transformation are manipulated by the proposed method. Accordingly, these results confirm that the proposed method can perform efficiently for traditional complex networks applied to challenging environments relative to conventional methods.
5. Conclusion. In this paper, we proposed a method for image texture classification based on a complex network model. We used an approach which we refer to as local spatial pattern mapping (LSPM) to describe the spatial arrangement of neighbors in a network. The experimental results show that the performance of our method in analyzing spatial information based on a complex network model improves the accuracy of texture classification as compared to the original and other methods. Therefore, spatial informa-tion analysis based on a complex network is a promising direcinforma-tion for further research.
REFERENCES
[1] A. Materka and M. Strzelecki, Texture Analysis Methods – A Review, Tech. Rep., Institute of Elec-tronic, Technical University of Lodz, 1998.
[2] T. Acharya and A. K. Ray,Image Processing: Principles and Applications, Wiley-Interscience Pub-lishers, 2005.
[3] R. Azencott, J. Wang and L. Younes, Texture classification using windowed Fourier filters, IEEE Trans. Pattern Analysis and Machine Intelligence, vol.19, no.2, pp.148-153, 1997.
[4] M. Idrissa and M. Acheroy, Texture classification using Gabor filters, Pattern Recognition Letters, vol.23, no.9, pp.1095-1102, 2002.
[5] J. J. de Mesquita S´a Junior, P. C. Cortez and A. R. Backes, Texture analysis and classification using shortest paths in graphs,Pattern Recognition Letters, vol.34, no.11, pp.1314-1319, 2013.
[6] M. E. J. Newman, The structure and function of complex networks, SIAM Review, vol.45, pp.167-256, 2003.
[7] L. F. Costa, F. A. Rodrigues, G. Travieso and P. R. V. Boas, Characterization of complex networks: A survey of measurements,Advances in Physics, vol.56, no.1, pp.167-242, 2007.
[8] L. F. S. Scabini, W. N. Gon¸calves and A. A. Castro, Texture analysis by bag-of-visual-words of com-plex networks,Proc. of the 20th Iberoamerican Congress (CIARP), Montevideo, Uruguay, pp.485-492, 2015.
[9] A. R. Backes, D. Casanova and O. M. Bruno, Texture analysis and classification: A complex network-based approach,Information Sciences, vol.219, pp.168-180, 2013.
[10] T. Ojala, M. Pietikainen and T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Trans. Pattern Analysis and Machine Intelligence, vol.24, no.7, pp.971-987, 2002.
[11] T. Ojala, M. Pietikainen and D. Harwood, A comparative study of texture measures with classifica-tion based on featured distribuclassifica-tions,Pattern Recognition, vol.29, no.1, pp.51-59, 1996.
[12] M. Pietik¨ainen, A. Hadid, G. Zhao and T. Ahonen,Computer Vision Using Local Binary Patterns, Springer London Publishers, London, 2011.
[13] P. Brodatz, Textures: A Photographic Album for Artists and Designers, Dover Publications, New York, 1966.
[14] S. Lazebnik, C. Schmid and J. Ponce, A sparse texture representation using local affine regions, IEEE Trans. Pattern Analysis and Machine Intelligence, vol.27, no.8, pp.1265-1278, 2005.
[15] T. Ojala, T. Maenpaa, M. Pietikainen, J. Viertola, J. Kyllonen and S. Huovinen, Outex – New framework for empirical evaluation of texture analysis algorithms,Object Recognition Supported by User Interaction for Service Robots, vol.1, pp.701-706, 2002.