• No results found

Lee04b

N/A
N/A
Protected

Academic year: 2020

Share "Lee04b"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

Network Center Selection

Dae-Won Lee and Jaewook Lee Department of Industrial Engineering, Pohang University of Science and Technology,

Pohang, Kyungbuk 790-784, Korea. {woosuhan,jaewookl}@postech.ac.kr

Abstract. In this paper, we propose a new method for selecting RBF centers. The strength of our method is to determine the number and the locations of RBF centers automatically without any priori assumption about the number of centers. The proposed method consists of three phases. The first phase is to partition the input patterns into the several subsets according to their output labels. In the second and third phase, the number and the locations of RBF centers are determined using bi-section algorithm and weighted mean centering. These second and third phase are iteratively repeated until to reach the goal error. The proposed method is applied to several benchmark data sets. The numerical results show that our method is robust and efficient for determining the number and the locations of centers.

1

Introduction

Radial basis function network (RBFN), due to the simplicity of its single hidden layer structure and universal property, has been widely used for nonlinear func-tion approximafunc-tion and pattern classificafunc-tion. Training of the RBFNs consists of selecting centers of the hidden neurons and estimating the weights that connect the hidden and the output layers. Once centers have been fixed, the network weights will be directly estimated by using the least squares algorithm.

The generalized radial basis function network(GRBFN) involves searching for a suboptimal solution in a lower-dimensional space that approximates the interpolation solution where the approximated solutionF(x) can be expressed as follows

F∗(x) = m

i=1

wiφ(xti) (1)

where the set of RBF centers{ti|i= 1, . . . , m} is to be determined.

One of the key issues in the design of RBFN specially is how to determine the number and the locations of the RBF centers. In the recent researches, a variety of ways for determining the locations of centers have been proposed. Previously reported approaches for RBF center selection include random selection from

F. Yin, J. Wang, and C. Guo (Eds.): ISNN 2004, LNCS 3173, pp. 350–355, 2004. c

(2)

input patterns [1], and selection of centers based on clustering algorithms [2], [3]. These methods have some drawbacks in that they cannot determine the number and the locations of centers at the same time and do not consider the information of the supervised data.

In this paper, to overcome such drawbacks, the proposed method consists of three basic ingredients. Firstly, input patterns are partitioned into several subsets according to their output labels and the RBF centers are determined separately for each subset in order to reflect information of the output class label. Then these centers are combined to form a larger superset to be used in the final RBFN. Secondly, the proposed algorithm determines the number of the centers by using bi-section algorithm. Finally, the proposed algorithm determines the optimal locations of the centers by employing a weighted mean centering.

The organization of this paper is as follows : In Section 2 a new method for RBF center selection was explained. Section 3 presents an algorithm for the proposed method, and Section 4 presents experimental results applied to benchmark problems, followed by conclusions in Section 5.

2

The Proposed Method

2.1 Phase I: Partitioning Supervised Data

The proposed method is basically similar to the selection of centers based on clustering algorithm which is most widely used for center selection. However this kind of approaches deal with input patterns as unsupervised data during the center selection step even though supervised data are available. When the output class label is not considered, some of centers is often found to be located in the boundary between several classes, in which case these centers fail to play a role of a good feature extractor in the GRBFN.

In phase I, we partition training data intoCdisjoint subsets{Di}Ci=1 accord-ing to its class to reflect the distribution of input pattern for each output class. Then, with each subset, cluster centers are found using clustering algorithm to be explained in Section 2.2 and 2.3. Finally, we use C disjoint sets of cluster centers as GRBFN centers.

2.2 Phase II: Determining the Number of Centers by Bi-section Algorithm

(3)

Fig. 1.Flow diagram for center selection (Phase II and III). Where Error(λ) is a mis-classified rate with theλnumber of centers

2.3 Phase III: Determining the Locations of Centers Using Weighted Mean Centering

Determining the locations of the RBF centers is based on optimizing the follow-ing objective function with respect to (t,w).

E= 1 2

N

i=1

diki j=1

wjφ(xi−tj)  

2

(2)

From the necessary optimality condition, positions of centers is given by

∂E tj =

1

σ2wj

N

i=1

(dΦw)iΦij(xitj) = 0 (3)

tj = N

i=1(d−Φw)iΦijxi

N

i=(d−Φw)iΦij

= N

i=1

(dΦw)iΦij N

i=1(d−Φw)iΦij

xi

= N

i=1

λj(xi,t,w)xi, j= 1, . . . , ki

(4)

whereNi=1λj(xi,t,w) = 1, j= 1, . . . , ki,Φ= [φ(xi,tj)]i=1,... ,ni, j=1,... ,ki,

d= [d1, d2, . . . , dN]T and w = [w1, w2, . . . , wm1]T. It shows that the estimate

(4)

tj directly. Instead, we employ a so-calledweighted mean centering scheme to implement this, which consists of two steps. In the first step, center positions

tj of Eq. (4) are approximated as a simple average of the xi that has higher absolute value of the numerator of λj(xi,t,w). This process is repeated until no change is made. (See Section 3 for more details about this.) In the case of non-convex data set, the obtained centers in the first step are often found to be located in the regions of other classes, even though we have partitioned data sets according to their classes. To avoid this problem, in the second step, we modify RBF centers into the nearest points within the partitioned subsetDi.

3

Algorithm

An algorithm of center selection for the proposed method is as follows. % Phase I: Partitioning supervised data

1. SeparateN training data intoCdisjoint subsetsDi containingnielements, according to class.

{(xi, di)}iN=1→ D1∪ D2∪, . . . ,∪DC

Di={(x1, i),(x2, i), . . . ,(xni, i)} fori= 1, . . . , C

% Selectingki centers for each subsetDi 2. fori= 1 toC do

% Phase II: bi-section algorithm

2.1. Determine the number of centers (ki) using bi-section algorithm as explained in Fig. 1: initially,ki= 1+2ni.

% Phase III: weighted mean centering

2.2. Determine the locations ofki centers using weighted means centering. 2.2.1. Choose random values for the initial centersTi={tj}kij=1from input

space. Wheretj is thejth cluster center of subsetDi. % Adjust the centers

2.2.2. forl= 1toni do

2.2.2.1. Let Ii(xl) denote the index of the best-matching center for the input vectorxl∈ Di as follows.

Ii(xl) = arg maxj (d−Φw)lΦlj, j= 1,2, . . . , ki

2.2.2.2. Adjust the centers using the update rule

tj tj+η(xltj), j=Ii(xl) (5) where η is a learning step size.

2.2.3. Continue the center adjusting procedure (step 2.2.2) until no change are observed in the centers{tj}kij=1

(5)

Table 1.Benchmark data description

Input dimension Number of classes Number of patterns

2-Spirals 2 2 388 [194,194]

Sonar 60 2 104 [55,49]

Heart 13 2 180 [98,82]

Vowel 10 11 528[48,48,48,48,48,48,48,48,48,48,48]

bracketed numbers mean the number of patterns for each class.

tj= the nearest pointx∈ Di forj= 1,2, . . . , ki 2.3. Repeat Step 2.1.2.2. until converge to goal error.

% Complete the RBFN training

3. Combine the centers ofCdisjoint subsets{Di}Ci=1and construct generalized RBF network by using pseudo-inverse.

T ={tj}Kj=1← T1∪ T2∪, . . . ,∪TC

w= (ΦTΦ+λΦ0)1ΦTd

where Φ = [φ(xi,tj)]i=1,... ,N, j=1,... ,K, Φ0 = [φ(xi,tj)]i,j=1,... ,K, and K =

C i=1ki .

4

Simulation Results

The algorithm described in the previous section has been simulated on four kinds of benchmark data sets (2-spiral, sonar, heart, vowel). Description of the benchmark data sets is given in Table 1.

The performance of the proposed method is compared with two widely used center selection methods. In Table 2, KM is the k-means based center selec-tion without partiselec-tioning andRS israndom selection from training data without partitioning. For these two methods, the number of centers is determined by in-creasing centers one by one until they achieve the goal error. For the comparison we adopted three criterion: the number of centers, the mis-classified rate, and the computing time. Simulation results are shown in Table 2. The results show that the proposed method achieves better accuracy with a slightly fewer number of RBFN centers while significantly reducing computing time.

5

Concluding Remarks

(6)

Table 2.Simulation results on four benchmark problems

Method KM RS Proposed

m E T m E T m E T

2-Spiral 84 0.069 5650 88 0.080 5925 80 [35,45] 0.064 297 Sonar 37 0.096 1570 37 0.096 1112 34 [17,17] 0.096 21 Heart 129 0.97 11876 131 0.99 9723 129 [68,61] 0.093 125 Vowel 66 0.099 3658 64 0.099 2465 59 [2,5,5,2,8,7,3,2,9,2,14] 0.099 188

mis the number of centers and bracketed numbers are the number of centers for each class. E is mis-classified error rate and T is computing time to construct the GRBFN.

the information of output class label. Finally, it is robust to a data set with non-convex distribution.

Experimental results show that the proposed method is competitive with the previously reported approaches for RBF center selection. Other methods to improve efficiency of RBF center selection, such as Homotopy method [5], [6] can be also be investigated.

Acknowledgement. This work was supported by the Korea Research Founda-tion under grant number KRF-2003-041-D00608.

References

1. Mao, K.Z.: RBF Neural Network Center Selection Based on Fisher Ratio Class Separability Measure. IEEE Trans. Neural Networks, Vol. 13(5) (2002) 1211-1217 2. Gomm, J.B., Yu, D.L.: Selection Radial Basis Function Network Centers with

Recur-sive Orthogonal Least Squares Training. IEEE Trans. Neural Networks, Vol. 11(2) (2000) 306-314

3. Haykin, S.: Neural Networks: A Comprehensive Doundation. Prentice Hall, New York (1999)

4. Cover, T.M.: Geometrical and Statistical Properties of Systems of Linear Inequali-ties with Applications in Pattern Recognition. IEEE Trans. Electronic Computers, Vol. EC-14 (1965) 326-334

5. Lee, J., Chiang, H.-D.: Constructive Homotopy Methods for Finding All or Multiple DC Operating Points of Noninear Circuits and Systems. IEEE Trans. on Circuits and Systems- Part I, Vol. 48-(1) (2001) 35-50

Figure

Fig. 1. Flow diagram for center selection (Phase II and III). Where Error(classified rate with theλ) is a mis- λ number of centers

References

Related documents

While the Saudi state has allowed women to enable themselves through education and gain some status, there has always been tension between the state and religious

We obtain these results by comparing four security designs: (i) the status quo , in which sovereign bonds are neither pooled across nation-states nor tranched for safety; (ii)

Starting in October 2012, Medicare will reward hospitals that provide high quality care for their patients through the new Hospital Value-Based Purchasing

quer cetin, t etrah ydr ox ymetho xyfla vone , apigenin meth yl ether , dimeth ylquer - cetigetin, lut eolin Alum W ool Quer cetin, lut eolin meth yl ether , t etrah y- dr ox

Reading this book with the title And Then There's This: How Stories Live And Die In Viral Culture By Bill Wasik will certainly let you understand a lot more points.. From

To estimate the cost of changing the FCTC and the Caregiver Credit, the Conference Board calculated the difference between the current amount paid out by the government for each

The first trials using the three different wave packets to interrogate the edge breaking defect were not successful as stated above, but allowed the observation of the sensitivity

Table 4.3 Showing mean, variance and ANOVA data for sampled macro invertebrate orders of various months at Umoni River, Delta State, Nigeria from February to July, 2017...