• No results found

The Performance Evaluation of the Breast Mass Classification CAD System Basedon DWT, SNE AND SVM

N/A
N/A
Protected

Academic year: 2020

Share "The Performance Evaluation of the Breast Mass Classification CAD System Basedon DWT, SNE AND SVM"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

581

The Performance Evaluation of the Breast Mass Classification

CAD System Based

on DWT, SNE AND SVM

S Mohan Kumar

1

, Dr. G. Balakrishnan

2

1

Research Scholar, Karpagam University, Coimbatore, Tamil Nadu, India

2Director, Indra Ganesan College of Engineering, Trichy, Tamil Nadu, India Abstract- Mammogram is measured the most consistent

method for early detection of breast cancer. Computer-aided diagnosis system is also able to support radiologist to detect abnormalities earlier and more rapidly. In this paper the performance evaluation of the computer aided diagnostic system for the classification of mass classification in digital mammogram based on Discrete Wavelet Transform (DWT), Stochastic Neighbor Embedding (SNE) and the Support Vector Machine (SVM) is presented in this paper. This system classifies the mammogram images into normal or abnormal, and the abnormal severity into benign or malignant. Mammography Image Analysis society (MIAS) database is used to evaluate the proposed system. The average classification rate achieved is very satisfied.

Keywords-- Discrete Wavelet Transform, Stochastic Neighbor Embedding, Digital mammograms, Mass classification

I. INTRODUCTION

Breast cancer is the most extensive cancerous pathology among women. It is also an important public health problem in the world. As causes of its onset are still unknown, there are no efficient ways to avoid breast cancer. For this reason, an efficient diagnosis in its early stage can give women a better chance of full healing and survival. Therefore, early detection of breast cancer is the key for reducing the associated morbidity and death rates.

To study the human breast, Mammography is widely used as a diagnostic and a screening tool that uses X-rays. The objective of mammography is the premature revealing of breast cancer, usually through detection of characteristic microcalcifications and/or masses. Mammography is the only effective and viable technique to detect breast cancer in particular in the case of minimal tumors. Computer aided diagnosis system is also able to support radiologist to detect abnormalities earlier and faster. Mammography is believed to decrease mortality from breast cancer. This research work deals with classification of mass classification in digital mammograms based on Discrete Wavelet Transform (DWT) and Stochastic Neighborhood Embedding (SNE).

II. METHODOLOGY

The proposed system for the classification of microcalcification in digital mammograms is built based on DWT, SNE and by applying KNN and SVM for building the classifiers. In this following section the theoretical background of all the approaches are introduced.

A. Discrete Wavelet Transform

Nowadays, wavelets have been used quite frequently in image processing and used for feature extraction, denoising, compression, face recognition, and image super-resolution. The decomposition of images into different frequency ranges permits the isolation of the frequency components introduced by “intrinsic deformations” or “extrinsic factors” into certain sub-bands. This process results in isolating small changes in an image mainly in high frequency sub-band images.

[image:1.612.336.555.435.568.2]

(a) (b)

Figure 2.1: (a) Sample mammogram image (b) 2-D Wavelet decomposition

(2)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

582

B. Stochastic Neighbor Embedding

SNE is a probabilistic approach to the task of placing objects, described by high-dimensional vectors or by pair-wise dissimilarities in a low-dimensional space in a way that preserves neighbor identities. A Gaussian is centered on each object in the high-dimensional space and the densities under this Gaussian (or the given dissimilarities) are used to define a probability distribution over all the potential neighbors of the object.

C. Support Vector Machine

Support vector machines (SVMs) are a set of related supervised learning methods that analyze data and recognize patterns, used for classification and regression analysis. The standard SVM is a non-probabilistic binary linear classifier, i.e. it predicts, for each given input, which of two possible classes the input is a member of. A classification task usually involves with training and testing data which consists of some data instances. Each instance in the training set contains one “target value” (class labels) and several “attributes” (features) . SVM has an extra advantage of automatic model selection in the sense that both the optimal number and locations of the basic functions are automatically obtained during training. The performance of SVM largely depends on the kernel.

In this research, two techniques for building a computer aided diagnostic system for the classification of microcalcification in digital mammograms based on DWT and SNE are presented. The SNE applied to wavelet transformed image and also applied on sub-bands of wavelet transformed image individually. SNE is essentially used for reducing high dimensionality data into relatively low dimensional data, efficiently. Then classifier system based on Support Vector Machine (SVM) is constructed and the same is compared with the results of the KNN Classifier. Experiments are conducted on Mammography Image Analysis society (MIAS) database.

This system classifies the mammogram images into normal or abnormal, and the abnormal severity into benign or malignant. The proposed methods are implemented in MATLAB and the performances of these methods are also analyzed productively. Finally, in order to serve the cancer patients with due diagnosis, the classification accuracy rate is sensibly derived from our proposed system.

III. EXPERIMENTAL RESULT AND DISCUSSION

In this section, many computer simulations are run based on the proposed classification system explained in chapter 2. To assess the performance of the proposed system, experiments with MIAS database images are performed. The performance of the proposed system is carried on 99 normal images and 25 microcalcification images. Among the 25 abnormal images, there are 12 benign and 13 malignant images available. All the images are considered for the classification test. The wavelet decomposition levels are varied from 2 to 6 and for each level the performance of the proposed techniques are evaluated.

The performance evaluation of the proposed mass classification system is carried on using MIAS database images. In this study, 99 normal mammograms are selected randomly for training and testing the classifier and all the mass images are considered for the performance study. The KNN and SVM classifiers are used to classify the normal image and also the abnormal severity into benign and malignant.

(3)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

[image:3.612.205.407.135.285.2]

583

[image:3.612.207.410.310.463.2]

Figure 3.1 Histogram plot of 2-level wavelet coefficients of a mass benign image

Figure 3.2 Histogram plot of reduced 2-level wavelet coefficients of a mass- benign image by SNE

[image:3.612.206.407.487.639.2]
(4)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

[image:4.612.205.405.134.280.2]

584

Figure 3.4 Histogram plot of reduced 2-level wavelet coefficients of a mass- malignant image by SNE

The histogram plots shows the variation in the extracted features. Hence the SNE based reduced DWT coefficients are used in the proposed approach as features and fed into the classifier for further classification.

The number of samples used for training the SVM classifier and testing images for classification is given in Table I.

Table I.

Number of Training and Testing samples for microcalcification classification

Category No. of Training Set No. of testing Set

Normal 66 99

Abnormal 37 56

Masses(Benign) 25 37

Masses(Malignant) 13 19

To analyze the performance of the SNE, the proposed system is tested with the state of art technique Principal Component Analysis (PCA) using KNN classifier.

Initially the dimension of the feature is reduced for the wavelet transformed image. The classification accuracy obtained by the proposed system using KNN classifier for first stage and final stage are show in Table II and III.

Table II.

Classification accuracy of initial stage using KNN classifier based on wavelet

Level of decomposition

Wavelet + PCA + KNN

Wavelet + SNE + KNN Normal

(%)

Abnormal (%)

Normal (%)

Abnormal (%)

2 89.90 89.29 95.96 83.93

3 91.91 87.50 94.94 85.72

4 88.89 83.93 91.92 91.07

5 91.92 89.29 95.96 78.57

6 89.90 91.07 95.96 83.92

The initial stage classifier classifies the given unknown mammogram into normal or abnormal image. The above table shows the classification accuracy by varying the decomposition level from 2 to 6. The Bold values indicate the higher classification accuracy achieved by the proposed approach.

(5)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

585

Table III.

Classification accuracy of final stage using KNN classifier based on wavelet

Level of decomposition

Wavelet + PCA + KNN

Wavelet + SNE + KNN Benign

(%)

Malignant (%)

Benign (%)

Malignant (%)

2 94.60 78.95 100 68.43

3 83.78 73.68 97.30 78.95

4 91.89 68.42 100 78.95

5 91.89 68.42 97.30 78.95

6 91.89 73.68 100 78.95

It is concluded from the output of the final stage classifier using KNN classifier; the benign cases are classified with no error. However, performance against malignant cases needs to be improved.

[image:5.612.166.442.367.493.2]

To improve the accuracy, the proposed approach is tested by applying the SNE and PCA to all the wavelet sub-bands separately. Table IV. and Table V. show the classification accuracy of the proposed system for first and final stage respectively using wavelet sub-bands reduction.

Table IV.

Classification accuracy of initial stage using KNN classifier based on wavelet sub-bands

Level of decomposition

Wavelet Sub-bands + PCA + KNN

Wavelet Sub-bands + SNE + KNN Normal

(%)

Abnormal (%)

Normal (%)

Abnormal (%)

2 88.89 80.36 87.88 78.57

3 89.90 83.93 94.95 78.57

4 89.19 84.21 97.98 73.21

5 83.78 73.68 98.99 76

6 94.59 78.95 100 76

Table V.

Classification accuracy of final stage using KNN classifier based on wavelet sub-bands

Level of decomposition

Wavelet Sub-bands + PCA + KNN

Wavelet Sub-bands + SNE + KNN Benign

(%)

Malignant (%)

Benign (%)

Malignant (%)

2 86.49 73.68 91.89 84.21

3 100 78.95 97.30 89.47

4 92.93 85.71 91.89 84.21

5 95.96 80 91.89 94.74

6 93.94 76 89.19 94.74

The variation in the average classification accuracy of initial stage is not much more while using the PCA or SNE in the wavelet decomposed image as well as the wavelet bands. However, dimension reduction in wavelet sub-bands produces higher average classification accuracy of 91.67% in the final stage classifier.

(6)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

586

Table VI.

Classification accuracy of initial stage using SVM classifier based on wavelet

Level of decomposition

Wavelet + PCA + SVM

Wavelet + SNE + SVM Normal

(%)

Abnormal (%)

Normal (%)

Abnormal (%)

2 90.91 85.71 95.96 85.71

3 89.90 87.50 94.95 87.50

4 90.91 89.29 95.96 83.93

5 86.87 83.93 94.95 82.14

[image:6.612.172.442.159.290.2]

6 90.91 85.71 97.30 89.47

Table VII.

Classification accuracy of final stage using SVM classifier based on wavelet

Level of decomposition

Wavelet + PCA + SVM

Wavelet + SNE + SVM Benign

(%)

Malignant (%)

Benign (%)

Malignant (%)

2 86.49 89.47 97.30 78.95

3 89.19 68.42 100 84.21

4 91.89 78.95 97.30 84.21

5 89.19 84.21 100 84.21

6 89.19 78.98 95.96 85.70

From the above two tables, 5.12 and 5.13, the maximum average classification accuracy of 93.39% and 92.10% is achieved by SNE at initial and final stage respectively.

The classification accuracy of initial stage while using the SVM classifier is show from Table VIII. to Table IX.

Table VIII.

Classification accuracy of initial stage using SVM classifier based on wavelet sub-bands

Level of decomposition

Wavelet Sub-bands + PCA + SVM

Wavelet Sub-bands + SNE + SVM Normal

(%)

Abnormal (%)

Normal (%)

Abnormal (%)

2 95.96 76.79 91.92 78.57

3 89.90 75 95.96 82.14

4 95.96 73.21 94.95 83.93

5 93.94 75 97.98 78.57

6 95.96 73.21 96.97 76.79

From the Table 5.14, it is clearly observed that abnormal cases are classified correctly by SNE compared to PCA reduction technique. The SNE classified the abnormal images more than 10% than PCA reduction technique.

[image:6.612.165.449.487.596.2]
(7)

International Journal of Emerging Technology and Advanced Engineering

Website: www.ijetae.com (ISSN 2250-2459,ISO 9001:2008 Certified Journal, Volume 3, Issue 10, October 2013)

[image:7.612.168.444.158.267.2]

587

Table IX.

Classification accuracy of final stage using SVM classifier based on wavelet sub-bands

Level of decomposition

Wavelet Sub-bands + PCA + SVM

Wavelet Sub-bands + SNE + SVM Benign

(%)

Malignant (%)

Benign (%)

Malignant (%)

2 97.30 68.42 100 84.21

3 89.19 78.95 100 84.21

4 100 68.42 97.30 84.21

5 100 68.42 97.30 89.47

6 94.59 73.68 100 78.95

From the tables VI. to IX, it is clearly found that the proposed SNE based method outperforms the PCA method in all aspect. As the decomposition level increases, the average classification accuracy increases for both the approaches.

IV. CONCLUSION

In this paper, two techniques for building a computer aided diagnostic system for the classification of mass classification in digital mammogram based on DWT are proposed. SNE is directly applied to wavelet transformed image in the first technique and the second technique is applied on sub-bands of wavelet transformed image. The high dimensional data from the mammogram image is relatively reduced into low dimensional data set by using the SNE. The reduced data set is used as features to classify the given mammogram images into normal or abnormal as well as benign or malignant. The proposed classification scheme is carried on MIAS database image. Experimental results show that . As the decomposition level increases, the average classification accuracy increases for both the approaches and also the classification rate of the proposed two techniques is satisfied.

REFERENCES

[1] Pelin Gorgel, Ahmet Sertbas, Niyazi Kilic, Osman N. Ucan and Onur Osman, “Mammographic Mass Classification Using Wavelet Based Support Vector Machine”, Journal of Electrical & Electronics Engineering, Vol. No 9(1), 2009, pp 867-875

[2] S. Baeg and N. Kehtarnavaz, “Texture Based Classification of Mass Abnormalities in Mammograms”, Proceedings of 13th IEEE Symposium on Computer-Based Medical Systems, 2000

[3] Yuan Chen and Chein I Chang, “New Texture Shape Feature Coding-Based Computer Aide Diagnostic Methods for Classification of Masses on Mammograms”, Proceedings of the 26th Annual International Conference of the IEEE, September 2004

[4] Maurice Samulski, Nico Karssemeijer, Peter Lucas and Perry Groot, “Classification of mammographic masses using support vector machines and Bayesian networks”, Proceedings of SPIE on Medical Imaging, 2007

[5] Matteo Masotti, “Exploring Rank lets Performances in Mammographic Mass Classification Using Recursive Feature Elimination”, Proceedings of the 16th IEEE Signal Processing Society Workshop on machine learning for signal processing, 2006 pp 265-269

[6] Mohammed J. Islam, Majid Ahmad and Maher A. Sid-Ahmed, “An Efficient Automatic Mass Classification Method in Digitized Mammograms Using Artificial Neural Network”, International Journal of Artificial Intelligence & Applications (IJAIA), Vol. No 1(3), July 2010

[7] Mohamed Meselhy Eltoukhy and Ibrahima Faye, “Curvelet Based Feature Extraction Method for Breast Cancer Diagnosis in Digital Mammogram”, IEEE International Conference on Intelligent and Advanced Systems, June 2010, pp 1-5.

[8] H.D. Cheng, X.J. Shi, R. Min, L.M. Hu, X.P. Cai, H.N. Du (2006) “Approaches for automated detection and classification of masses in mammograms”, Pattern Recognition, Vol. No 39, pp 646-668. [9] B. Sahiner, N. Petrick, H.P. Chan (2001) “Computer-aided

characterization of mammographic masses: accuracy of mass segmentation and its effects on characterization”, IEEE Transactions on Medical Imaging, Vol. No 20(12), pp 1275–1284

[10] J.L. Viton, M. Rasigni, G. Rasigni, A. Liebaria (1996) “Method for characterizing masses in digital mammograms”, Journal of Optical Engineering, Vol. No 35(12), pp 3453–3459

[11] G. Hinton and S. Roweis, “Stochastic neighbor embedding,” in Advances in Neural Information Processing Systems, vol. 15, MIT Press 2003, pp 833-840.

[12] J.S.Leena Jasmine,S.Baskaran,A Govardhan “ An automated Mass classification system in Digital Mammograms using Contourlet Transform and Support Vector Machine”, International Journal of Computer Applications, Vol No. 31 Issue No 9,October 2011. [13] Smola A. J., Scholkopf B., and Muller K. R., “The connection

Figure

Figure 2.1: (a) Sample mammogram image (b) 2-D Wavelet decomposition
Figure 3.1 Histogram plot of 2-level wavelet coefficients of a mass benign image
Figure 3.4 Histogram plot of reduced 2-level wavelet coefficients of a mass- malignant image by SNE
Table IV.  Classification accuracy of initial stage using KNN classifier based on wavelet sub-bands
+3

References

Related documents

The anti- oxidant activity was performed by DPPH free radical scavenging method using ascorbic acid as standard and compound IIa, IIc and IId showed significant free

MAP: Milk Allergy in Primary Care; CMA: cow’s milk allergy; UK: United Kingdom; US: United States; EAACI: European Academy of Allergy, Asthma and Clinical Immunology; NICE:

AIRWAYS ICPs: integrated care pathways for airway diseases; ARIA: Allergic Rhinitis and its Impact on Asthma; COPD: chronic obstructive pulmonary disease; DG: Directorate General;

It was decided that with the presence of such significant red flag signs that she should undergo advanced imaging, in this case an MRI, that revealed an underlying malignancy, which

diagnosis of heart disease in children. : The role of the pulmonary vascular. bed in congenital heart disease.. natal structural changes in intrapulmon- ary arteries and arterioles.

∗ Balint geht sogar so weit, eine „apostolische Funktion“ zu beschreiben, nach welcher der Arzt sich ständig unbewusst verpflichtet fühle, „die Unwissenden und Ungläubigen

– During free fall, the rotational velocity should remain constant, with zero absolute acceleration; this analysis must be performed when there is relatively little rota- tion,

on deeply held misconceptions and fears of people who hold different beliefs, strategies of conceptual change theory, most often found in science education, were