• No results found

Wa Language Syllables Classification by Support Vector Machine Based on Genetic Algorithm

N/A
N/A
Protected

Academic year: 2020

Share "Wa Language Syllables Classification by Support Vector Machine Based on Genetic Algorithm"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

2017 2nd International Conference on Artificial Intelligence: Techniques and Applications (AITA 2017) ISBN: 978-1-60595-491-2

Wa Language Syllables Classification by Support Vector Machine

Based on Genetic Algorithm

Mei-jun FU, Jie SU and Wen-lin PAN*

School of Mathematics & Computer Science, Yunnan Minzu University

*Corresponding author

Keywords: Wa language, Support Vector Machine (SVM), Genetic Algorithm (GA), K-Crossover Validation Method (K-CV).

Abstract. In this paper, propose the method to classify Wa syllables by support vector machine (SVM) based on genetic algorithm (GA). Firstly, select all the Wa syllables from the speech corpus recorded. Secondly, extract pitch and formant frequencies by vowel main body extension method and linear predictive coding method respectively. Thirdly, describe every syllable with a 12-dimensional feature vector composed of its mean values, minima and maxima, belonging to its pitch frequency sequence and its first three formant frequency sequences respectively. Fourthly, divide the obtained data set into training set and test set, the former use to train SVM, and the latter test SVM. Fifthly, normalize data set into [0,1], and select the radial basis function as kernel function. Finally, use the K crossover validation method (K-CV) based on genetic algorithm (GA) to get the best parameter values train SVM. As a result, the predictive accuracy of Wa syllables classification by test set arrives at 84.83%, which has shown that the proposed method is effective and feasible on Wa syllables classification.

Introduction

The Wa nationality[1, 2] is one of Chinese nations. They live mainly in the southwest border of

Yunnan Province, China and northern Myanmar. And the native language they use to communicate with each other is Wa language. Wa language belongs to Va-De'ang branch, Mon-Khmer languages, Austro-Asiatic family. China Wa language can be divided into three kinds of dialects that are "Barao dialect", "Lawa dialect" and "Wa dialect". Currently, Wa language is based on the Barao dialect and the voice of YanShuai is standard pronunciation. The speech corpus used here was all recorded in YanShuai.

It is well known that pitch and formant frequencies[3-5] are the two most important parameters of

speech signal. The pitch changing pattern is called pitch tone that plays a very important role in distinguishing the meaning of different syllables. Formant includes two important parameters: resonant frequency and its width of band width (bandwidth), which are important parameters that can distinguish different vowels, including voiced consonants.

Syllable classification, in essence, still belongs to pattern recognition[6], which can be summarized as

"feature extraction syllable pattern recognition". For every Wa syllables, after speech signal

preprocessing, we extract its each frame of pitch frequency and of the first three formant frequencies. Then, we take the mean values, minima and maxima belonging to pitch frequency sequence and the first three formant frequency sequences respectively, to form a 12-dimensional feature vector as its classification pattern. We will classify these obtained syllable classification patterns by support vector machine (SVM) based on genetic algorithm (GA).

(2)

traditional pattern recognition methods, SVM is more suitable for dealing with nonlinear and high

dimensional pattern recognition problems with small samples. For the above reason[7-11], SVM is very

suitable for Wa syllables pattern recognition under the small sample condition. However, in the process of recognizing Wa syllables by SVM, penalty parameters and kernel function parameters will directly affect the recognition ability to SVM. Not only are the traditional methods computationally complex but also difficult to obtain the global optimal solution, which makes it difficult to improve the adaptive

identification ability to SVM. Genetic Algorithm (GA)[12] is a kind of random searches and

optimization methods, possessing the global parallel search ability. Therefore, in this paper, in order to improve the adaptive identification ability to SVM, we will use the K-cross validation method (K-CV) based on Genetic Algorithm (GA) to search the global optimal solution more efficiently. And experiment to find out that the predictive accuracy on Wa syllables classification arrives at 85.23%, which has shown that proposed method is effective and feasible on Wa syllables classification.

Extract Wa-language Syllable Classification Patterns

The main steps to construct Wa syllable patterns are expressed in the following (Figure 1).

1)Selecting all the Wa syllables speech signal from the corpus.

2)Preprocessing Wa syllables, including eliminating direct current (DC) components[13], eliminating

trend items[14], framing, windowing, and pre-emphasising.

3)Endpoint detection by improved spectral entropy method[15].

4)Pitch extraction by vowel main body extension method. Formant extraction by linear predictive

coding (LPC)[15] method based on adaptive variational mode decomposition (AVMD)[16].

5)For any given Wa syllables, taking the mean values, minima and maxima belonging to pitch

frequency sequence and the first three formant frequency sequences respectively to form a 12-dimensional feature vector as its syllables classification pattern.

Figure 1. Extraction process of syllable classification patterns.

C-Support Vector Machine(C-SVM)

Here, mainly introduce the C-support vector machine model (C-SVM) with binary classes. For two

classes of sample set:

{

}

1

( , )

i i l i

x y

= ,

x

i

n,

y

i

∈ −

{

1,1

}

,where

y

iis the category label corresponding

to the i-th input eigenvector

x

i(

i

=

1,

,

l

) , selecting appropriately the kernel functionK

( )

i i, and

parameter

C

for SVM, we can construct the following optimization model:

(

)

(

)

1 1 1

1

0 , 1, , ;

max 1 2 , , . .

0, i

l l l

l

i i j i j i j

i i j i i

i

C i l

y y K x x s t

y α

α α α

α

= = =

=

≤ ≤ =

 

− 

=

 

∑∑

which is a convex quadratic programming problem. Hence, there exists the optimal solution. Solving the

above, we get the optimal solution: α∗=

(

α1∗, ,αl

)

T

where T represents transpose of matrix. Take a

positive component αj

from

α

∗satisfying0 αj C

(3)

( )

(

)

1

sgn ,

l

i i i

i

f x α∗y K x x b∗ =

 

=  + 

Similarly, we can obtain a multi-class support vector machine. by modifying directly the above objective function. Certainly, on the other hand, we can construct a multi-class support vector machine by combining some binary class support vector machines. Here, we will use multi-class C-support vector machine (C-SVM).

It is well known that the necessary and sufficient condition as a kernel function of SVM is that the function satisfies Mercer theorem. The choice of kernel function directly affects the generalization ability and recognition precision. In a SVM, the kernel function used commonly includes mainly four types: linear function, polynomial function, gaussian radial basis function, sigmoid function.

C-support Vector Machine (C-SVM) Based on Genetic Algorithm (GA)

Wa-language Syllable Classification Based on C-SVM

Data Preprocessing. Based on three different normalization ways, the predicted classification accuracy of the same test set is shown respectively in Table 1.

Selecting Kernel Functions. For four types of different kernel functions, the predicted classification accuracies of the same test set are shown in Table 2, when the data set is normalized into [0, 1].

Table 1. The Predicted Classification Accuracies Corresponding to Three Different Normalization Ways.

Normalization way Accuracy (%) SVM parameter options Un-normalization 4.6512 ‘-t 2 –c 3 –g 4’ [0,1] normalization 81.3953 ‘-t 2 –c 3 –g 4’ [-1,1] normalization 76.7442 ‘-t 2 –c 3 –g 4’

Table 2. The Predicted Classification Accuracies Corresponding to Kernel Functions.

Kernel function Accuracy (%) SVM parameter options Linear function 69.7646 ‘-t 0 –c 3’ Polynomial function 65.1163 ‘-t 1 –c 3 –d 3’ Radial basis function 81.3953 ‘-t 2 –c 3 –g 4’

Sigmoid function 4.6512 ‘-t 3 –c 3 –g 4 –r 0’

From Table1 and Table 2, we found that when we normalize the data set into [0, 1] and use Gaussian kernel function, we get the highest accuracy with 81.3953%.

Parameters Optimization

It is necessary to adjust the relevant parameters (mainly the penalty parameter c and the radial basis function parameter g) so as to obtain the best prediction accuracy when predicting classification by SVM. In Model 1, the penalty parameter c and the radial basis function parameter g are given by means of test experience. Generally speaking, using the cross validation (CV) method to find the best parameters c and g under a certain sense, we can effectively avoid under-fitting and overfitting. In this paper, we use the K- CV method to optimize our C-SVM related parameters.

At first, we search roughly the best parameters log2cand log2g in a large grid such aslog2c×log2g

∈[ 8,8]− ×[ 8,8]− , according to specific step sizes ∆(log2 )c = 0.5and (log2 g) = 0.5.Here, log 2x

denotes base-2 logarithm of x, and∆(log2 )x the step size of log 2x. In this case, the best c=27.8576 and

(4)

Support Vector Machine Based on Genetic Algorithm

In the K-CV sense, we can find the best parameters c and g by grid search method. Therefore, we are able to obtain the highest classification accuracy in the K-CV sense, that is, the global optimal solution. Sometimes, however, we find that our search for the optimal range of parameters c and g will be very time-consuming. Using the heuristic algorithm, it is possible to find the global optimal solution without traversing all the parameter points in the grid.

Genetic Algorithm (GA). Genetic algorithm (GA) is an evolutionary algorithm, using selection, crossover and mutation operators to search the optimal solution. Its global search ability is strong while its local search ability is weak. Generally speaking, we get only the suboptimal solution of our problem, rather than the optimal solution. The basic elements of genetic algorithm include chromosome coding method, fitness function, genetic operation and operating parameters. The basic steps of genetic algorithm include: coding, initial population generation, selection, crossover, mutation. The operation parameters of the genetic algorithm in this paper are shown in Table 3.

Table 3.Genetic Algorithm Operation Parameters Setting.

Population size 40

Maximum Generation 200 The number binary bits of the variables 20

Crossover probability 0.7 Mutation probability 0.01 Generation probability 0.9

Figure 2. Algorithm of optimizing SVM parameters by GA. Figure 3. The fitness (accuracy) curve based on GA.

Search the Best Parameter Based on GA. Here, the accuracy of K-CV in training set is regarded as the fitness function value in GA. The overall algorithm process of optimizing SVM parameters by GA is shown in Figure 2. Use this method, we have the best penalty parameter c=23.2021 and the best kernel function parameter g=0.50926 (Figure 3), different from those by K-CV Method based on grid.

Conclusion

In this paper, we proposed the SVM method based on GA to classify Wa-language syllables. From the experimental results, the classification accuracy both reaches 84.83%, if data set is normalized into [0, 1] and SVM is equipped with Gaussian radial basis function, using the best parameters c and g obtained based on whether grid or GA. This same result shows that our method is effective on Wa-language syllable classification. Of course, as we pointed out that our SVM method based on GA is more efficient.

0 20 40 60 80 100 120 140 160 180 200 50

55 60 65 70 75 80 85

Evolutionary algebra

A

d

a

p

ta

b

il

it

y

Best c=23.2021 g=0.50926 CVAccuracy=84.4828%

(5)

In the future, we will study more efficient methods for pitch and formant extraction to improve the accuracy of GA-SVM.

Acknowledgments

This work is financially supported by Yunnan Provincial Department of Education Science Research Fund Project (2016YJS078), Yunnan Minzu University Overseas Masters Program (3019901).

References

[1]. Tu Fang, "The Study of Wa", National Language, 1995 (3): pp. 23-23.

[2]. Xuehua Qing, A Review of Studies on Qinghua and Wa Language Studies, National Translation, 2014 (1).

[3]. Jiangping Kong. Experimental phonetics basic course[M]. Peking University Press, 2015.

[4]. Li Zhao, Speech Signal Processing - 2nd ed. 2009: Mechanical Industry Press.

[5]. Zhiyong Song. Application of MATLAB in Speech Signal Analysis and Synthesis. 2013: Beijing University of Aeronautics and Astronautics Press.

[6]. Cherkassky, V., The Nature Of Statistical Learning Theory. IEEE Transactions on Neural Networks, 1997. 8(6): pp. 1564.

[7]. Ao, H., et al., A Roller Bearing Fault Diagnosis Method Based on LCD Energy Entropy and ACROA-SVM. Shock and Vibration, 2014, (2014-2-24), 2014. 2014(1): pp. 1-12.

[8]. Ben-Hur, A. and W.S. Noble, Kernel methods for predicting protein-protein interactions. Bioinformatics, 2005. 21 Suppl 1(21 Suppl 1): pp. i38-46.

[9]. Zheng, D., J. Wang and Y. Zhao, Non-flat function estimation with a multi-scale support vector regression. Neurocomputing, 2006. 70(1): pp. 420-429.

[10]. Rakotomamonjy, A., F. Bach and Y. Grandvalet. More efficiency in multiple kernel learning. 2007.

[11]. Schölkopf, B., Choosing multiple parameters for Support Vector Machines. 2008.

[12]. Holland, J.H., Adaptation in Natural and Artificial Systems: An Introductory Analysis With Applications to Biology, Control, and Artificial Intelligence. Quarterly Review of Biology, 1975. 6(2): pp. 126–137.

[13]. Du, X., et al., Effect analysis and eliminating scheme for DC components to sinusoidal amplitude integrators based synchronization signal detection method. Zhongguo Dianji Gongcheng Xuebao/proceedings of the Chinese Society of Electrical Engineering, 2014. 34(24): pp. 4084-4091.

[14]. Zhongshen, Y.U., et al., Passive Acoustic Signal Trend items Removal Based on SPA. Journal of Detection & Control, 2015.

[15]. Chung, K. and S.Y. Oh, Improvement of speech signal extraction method using detection filter of energy spectrum entropy. Cluster Computing, 2015. 18(2): pp. 629-635.

References

Related documents

five SMEPMS formulations with either propanol or ethanol as the cosurfactant in all saccharide solutions, TT values of the FD FFB samples reconstituted in either DI water or the

Fig. 1 Flow chart of the IDEAL trial*Defunctioning ileostomy, CR: continuity restoration.. with the disease); initial and cumulative 6-month length of hospital stay; 6-month

The interpretation of this finding should be taken within the context of this particular study. In general participants reported being engaged with the survey and although many

ANXA1: Annexin A1; BCAC: Breast Cancer Association Consortium; BCSS: Breast cancer-specific survival; CC1: Cell Conditioning 1; CFMPB: Core Facility Molecular Pathology and

Aside from the pickup in volume during the market turmoil in the third quarter of 1998, the volume of TIIS transactions among primary dealers reported to the New York Reserve Bank

The services of MPN consist of the collecting service (there are four methods, i.e. the online method, the information link method, the direct method and the direct debit data

[r]

In addition, the fine granularity property can be exploited by the intermediate network nodes (including base stations, in case of wireless networks) in order to