Voxel Selection Framework with Feature Extraction for Classification of Brain Activity in fMRI

(1)

Vol. 28, No. 7, (2019), pp. 33-37

Voxel Selection Framework with Feature Extraction for Classification of Brain Activity in fMRI

1 D.M.Yadav

1 Universal College of Engineering and research Pune,Maharashtra,India

[email protected]

2S.V.Raut

2Rajarshi Shahu College of Engineering, Pune,Maharashtra,India

[email protected]

3S.S.Kanse

3Rajarshi Shahu College of Engineering, Pune,Maharashtra,India

[email protected]

Abstract—This paper presents fMRI (functional Magnetic Resonance Imaging) signal analysis methodology using Principal Component Analysis (PCA) and Mutual Information (MI) based voxel selection framework. Previously, the fMRI signal analysis has been carried out either using Principal Component Analysis (PCA) model or voxel selection on raw fMRI signal. The first methodology does feature extraction that makes voxel selection process easy while the latter methodology does selection of relevant voxels (or features)

.

Both these advantages are added by our methodology in which Principal Component Analysis (PCA) is used for feature extraction to decrease the dimension of fMRI data. The proposed methodologies are adopted for classification of brain activity. Experimentations are carried out in the publicly available fMRI dataset of six subjects and comparisons are made with existing PCA model and voxel selection framework. The superiority of the proposed methodology gets validated by the comparative results.

Keywords— fMRI, voxel, decomposition, PCA, MI.

I. INTRODUCTION

Functional magnetic resonance imaging (fMRI) is one of the most popularly used techniques for the study of human brain function. It is possible to safely, non invasively observe correlations of neural activity across the entire human with good spatial resolution [3]. The brain functions are spatially divided into different brain regions. In functional brain studies, it is shown by Ogawa et. that by measuring the neural activity using fMRI, one can localize the brain functions. This measurement of neural activity is based on the Blood Oxygenation Level Dependent (BOLD) contrast. In the BOLD effect, a neural activity in the brain caused by some sensory or motor tasks produces localized changes in the blood flow and

activity of brain [3]. fMRI is popular technique in the field of neuroscientific as well as cognitive studies [5] from last two decades. The objective of fMRI signal analysis is to find the activation areas in the brain, which corresponds to stimulus in time axis. The fMRI signal analysis achieved either Signal detection or Characterization of Haemodynamic response function. In signal detection, the area of brain that gets activated after giving a stimulus is identified [6]. On the other hand, the Haemodynamic response gives us changes in the blood flow that occur after the stimulus is given [7]. There are many factors that affect the analysis of the fMRI signal like signal acquisition [8], pre-processing [9] as well as de-noising algorithms [12] [13], enhancement procedures [10] [11], evaluation of Haemodynamics [14] [15] and post-processing schemes [16] [17]. All the above methodologies, as a whole, can be grouped into two main methods (1) The hypothesis- driven methods and (2) The data-driven methods [3].

Traditional hypothesis driven methods include Markov random field (MRF) models [18], [19] and hidden Markov models (HMM) [20].These models require more information about the stimulus and are complex [3]. General linear model (GLM) is used in these models [21] and the analysis is highly univariate [22]. These models consider both the hemodynamic response function (HRF) and the stimulus timing function while generating reference functions. In contrast, for analysis of fMRI signals the data-driven methods handle multivariate approach. They contain both the supervised [23] [24] and the unsupervised methods [25] [26] [27] [28]. In opposite to the hypothesis-driven approaches, data-driven approaches do not require prior knowledge about the stimulus timings also it make very less or no assumptions about the HRF shape [3].

The supervised methods have many advantages than

(2)

Vol. 28, No. 7, (2019), pp. 33-37 II. PROPOSED METHOD

In investigating fMRI data to classify brain activity, feature selection (voxel selection) and feature based classification are two cardinal and essential methodologies. The selection of applicable voxels is difficult task in fMRI signal analysis.

Selection of informative voxels for efficient classifications is called as Feature selection. This step is more important because it directly affects the fMRI prediction. Each fMRI time series dataset represents thousands of voxels. Not all of them contain specific information related to stimulus. There could be noise and immaterial, information. If we consider all these voxels for training our classification algorithm there may be overfitting problem. Thus it is important to select only those voxels that would contain information about stimulus.

For voxel selection the analysis of variance (ANOVA) method is largely used. The voxels selected should have maximum relevance with the fMRI signals. In multivariate classification methods multiple voxel correlations and the obtained structural patterns are integrated.

For fMRI multivariate pattern analysis, Gaussian Naïve Bayes (GNB), Neural Networks (NNs) and Support Vector Machine (SVM) are few eminent classification algorithms are nominated. As per literature, these algorithms are used together with the informative voxel selection framework [35].

More than ten millions voxels in the raw fMRI data make the fMRI-related pattern recognition difficult to classify brain activity. In the multivariate signal analysis there is need to add feature extraction methodology [32][33]. By introducing feature extraction using PCA methodology before voxel selection framework, multivariate fMRI pattern analysis can be performed in a more precise way than the existing methodologies.

The algorithm of our methodology contain following three steps:

1. PCA allows us to compute a linear transformation which portrays data from a high dimensional space to a low dimensional space. Brain activity classification using fMRI is performed with PCA based feature extraction..

2. Informative voxels are selected after feature extraction of the fMRI signal, depending on its mutual information. Informative voxels plays important role for achieving accurate prediction of neural activity.

3. This paper focuses the challenges in fMRI analysis and by integrating two contributions achieved improved accuracy. Principal Component Analysis (PCA) [36]

and MI – based voxel selection framework [35] are our two contributions. The above methodologies are well known for its accurate solutions, still there is necessity to improve its performance. By incorporating these two methods individual performance can be improve.

This paper is presented as follows. In section III, we illustrate the concepts and algorithm of PCA used for feature extraction. In section IV we explain the concepts of Mutual Information applied for feature selection. In section V, Experimental results are shown on a standard dataset and

compare with existing algorithms in the literature followed by discussion and conclusion.

III Feature Extraction using PCA

PCA is transformed based feature extraction method which transforms the data from high dimensional coordinate system to a new low dimensional coordinate system.

Let Z be the d*n matrix of measured fMRI data, containing n samples of a d-dimensional vector.

Z=d*n (1)

r is the desired number of principal components.

Z_pca=r*n (2)

Z_pca matrix containing r principal components scaled to variance 1 of the input samples.

Eigenvectors is a (d*r) matrix containing the scaled eigenvectors of the sample covariance of Z.

In a typical use the top r components with the largest eigenvalues are selected.

Fig.1 fMRI decomposition using Principal Component Analysis

IV.VOXEL SELECTION FRAMEWORK FOR BRAINACTIVITY CLASSIFICATION

A. Decomposed Voxel Selection Framework

Depending on the mutual information between the voxels this paper proposed a voxel selection framework for the decomposed signal [35]. fMRI signal suffers due to noise and signal redundancy if conventional MI-based voxel selection process is operated on fMRI signal. In our methodology we applied PCA algorithm to get decomposed signal which is robust against noise and redundancy [33].In the proposed methodology voxels are selected from this decomposed signal, based on their mutual information these voxels are selected.

B. MI – based Voxel Selection:

Mutual Information (MI) is characterizes a quantity that measures relationship between two random variables that are sampled simultaneously. In this algorithm, we used MI which will calculate mutual information between features or between class label (stimulus condition) and a feature. Consider MI between the class random variable X and the j^th feature Y_j The entropy of Y is defined as ^H⁽^Y⁾^^



^P⁽^y⁾^log^P⁽^y⁾^dy

and the MI between X and Y is defined as the Kullback-Leibler divergence between the distributions P(X,Y) and P(X) * P(Y):

(3)

Vol. 28, No. 7, (2019), pp. 33-37 y dxdy

P x P

y x y P

x P Y

X MI

j j y

x

j

j ( ) ( )

) , log ( ) , ( )

, (

,



^

 (9)

Where P(x,yj) is a joint distribution of continuous variables X and Y_jwhich can be calculated more conveniently using the chain rule P(x,y_j)=P(x)P(y_j|x). For greater MI value it gives higher statistical relevancy between the stimulus condition and the voxel. In this methodology we have used three classification methods which are widely used for MVPA.

1) Gaussian Naïve Bayes (GNB).For each feature this classifier learns a class conditional Gaussian generative model. Naïve Bayes classifiers x apply Bayes theorem with intense individual assumptions between the features.

2) Multilayer Perceptron Neural Network Classifier (NN): It is a class of feedforward artificial neural network.MLP is a supervised learning technique. NN extracts features that are fed to other algorithms for clustering and classification.

3) Sparse Multinomial Logistic Regression Classifier (SMLR). SMLR classifier gives the solution for binary or multi class classification problem .Weight parameters of classifier are learned in sparse way. Because of this unique feature SMLR is used to solve high dimensional classification problems. Due to this feature selection which is very time consuming becomes easy to user.

V. EXPERIMENTAL RESULTS A. fMRI Data

The fMRI dataset used were downloaded from http://data.pymvpa.org/datasets/haxby2001. Data is acquired for six subjects. The data procurement process was divided into trials/ intervals. In some intervals, the subject was at rest or directed to look on the screen. In half of the trials the picture was presented first, followed by the sentence while in the remaining trials, the sentence was presented first, followed by the picture, subject pressed the mouse button to indicate whether the sentence correctly described the picture. Total 80 trials performed for every subject in the first forty trials, picture was shown first followed by the sentence and in the remaining trials sentence was shown first, followed by the picture. In this trial, pictures are like geometric arrangement of symbols like _,*, /, $. The first stimulus (sentence or picture) was presented for 4 seconds, succeeded by a blank screen for 4 seconds. The second stimulus was then presented for up to 4 seconds, after that subject pressed the button to specify if according to picture the sentence is correctly narrate. A rest period of 15 seconds was placed before the next trial.

Sentences were illustrations like ―It is true that the plus is below the dollar.‖ Half of the sentences were negated (e.g., ―It is not true that the star is above the plus.‖) and the other half were validated sentences. fMRI images were captured for every 500 msec. Our aim is to train a classifier to determine, the subject is viewing either a sentence or picture during the 8 second interval of fMRI data. In eight second of interval classifier contains 16 images which contain 160,000 voxels before feature selection. Here fMRI time series consider

region of interest (ROI) rather that complete brain which is defined as 25-30 ROI.

B. Adopted Methodologies

In this paper we have explored four methodologies for the analysis of fMRI data and accuracy is predicted. First, without any decomposition and voxel selection framework, simple classifier model has been used. Second, the MI – based voxel selection framework [9] has been developed for which shows enhancement in accuracy than traditional classification model.

Third on fMRI signal PCA applied and get decomposed signal that has been used for training to classifier and last the proposed methodology has been developed, voxels are selected using MI on fMRI signal after feature extraction using PCA.

The proposed methodology is compared with other three methodologies in terms of its performance. MATLAB is used to develop these methodologies.

Subject Classifier Methodologies

Without MI

With

MI PCA PCA +MI

(Proposed) Subject1 MLP

Neural 0.88 0.83 0.84 0.90

Subject2 0.84 0.90 0.90 0.90

Subject3 0.76 0.88 0.76 0.86

Subject4 0.76 0.7 0.70 0.79

Subject5 0.79 0.75 0.73 0.81

Subject6 0.80 0.76 0.76 0.86

Subject1 NBayes 0.80 0.80 0.78 0.80

Subject2 0.80 0.80 0.78 0.81

Subject3 0.80 0.80 0.79 0.79

Subject4 0.80 0.80 0.79 0.81

Subject5 0.83 0.80 0.80 0.81

Subject6 0.80 0.80 0.84 0.86

Subject1 SMLR 0.88 0.88 0.85 0.87

Subject2 0.82 0.84 0.89 0.89

Subject3 0.84 0.84 0.89 0.89

Subject4 0.84 0.84 0.84 0.86

Subject5 0.95 0.95 0.88 0.91

Subject6 0.80 0.80 0.79 0.82

Fig.2. Classification accuracies from classifiers for six subjects.

C. Discussion

After applying PCA algorithm we get decomposed signal and redundant voxels get removed and the voxels who are having values greater than threshold are kept. Feature selection of fMRI signal improved by using decomposition framework. We tested relative performance of three classifiers Fig. 2 presents the performance of six subjects in terms of accuracies using three classifiers. The performance of these classifiers is varying with respect to subjects. SMLR gives better accuracy while GNB yields worst accuracy, MLP Neural classifier gives better results. These results show that by using informative voxels using MI improved overall classification performance.

(4)

Vol. 28, No. 7, (2019), pp. 33-37 VI. CONCLUSION

In this paper, we used multi voxel pattern analysis (MVPA) to discriminate and classify fMRI cognitive states across multiple human subjects. To extract features from raw fMRI signal we have used PCA. To identify informative voxels from feature extracted decomposed fMRI signal a voxel selection framework has been introduced. Using decomposition framework and active feature selection, classification accuracy gets improved. In the future we can use this methodology for identification of healthy brain and disorder brain [31]. For task based fMRI this methodology can apply to detect facial expressions with improved accuracy.

REFERENCES

[1] Barbé, K., Van Moer, W. and Nagels, G., ―Fractional-Order Time Series Models for Extracting the Haemodynamic Response From Functional Magnetic Resonance Imaging Data‖, IEEE Transactions on Biomedical Engineering, Volume 59, Issue 8, pp 2264 – 2272, 2012.

[2] K. K. Kwong and D. A. Chesler, ―Functional MRI, in Medical Devices and Systems‖. Boca Raton: CRC Press, 2006, pp. 22–

30.

[3] Katwal,S.B.,Gore J.C., Marois R. and Rogers B.P.,

―Unsupervised Spatiotemporal Analysis of FMRI Data Using Graph-Based Visualizations of Self-Organizing Maps‖, IEEE Transactions on Biomedical Engineering, Volume 60, Issue 9, pp 2472 - 2483, 2013.

[4] Yuanqing Li , Namburi, P. , Zhuliang Yu , Cuntai Guan , Jianfeng Feng and Zhenghui Gu, ―Voxel Selection in fMRI Data Analysis Based on Sparse Representation‖, IEEE Transactions on Biomedical Engineering, Volume 56, Issue 10, pp 2439 - 2451, 2009.

[5] Michel, V.Gramfort, A. Varoquaux, G. Eger, E.and Thirion, B.,

―Total Variation Regularization for fMRI-Based Prediction of Behavior‖, IEEE Transactions on Medical Imaging, Volume 30, Issue 7, pp 1328 - 1340, 2011.

[6] C. Genovese, N. Lazar, and T. Nichols, ―Thresholding of statistical maps in functional neuroimaging using the false discovery rate,‖ NeuroImage, Volume 15, pp. 870–878, 2002.

[7] M. Lindquist, J. Loh, L. Atlas, and T. Wager, ―Modeling the hemodynamic response function in fMRI: Efficiency, bias and mis-modeling,‖ NeuroImage, Volume 45, no. 1, pp. S187–S196, 2009.

[8] B. Hu, G. Varma, C. Randell, S. Keevil, T. Schaeffter, and P.

Glover, ―A novel receive-only liquid nitrogen (LN2)-cooled RF coil for high-resolution in vivo imaging on a 3-tesla whole-body scanner,‖, IEEE Transactions on Instrumentation and Measurement , Volume 61, Issue. 1, pp. 129–139, Jan. 2012.

[9] S. Strother, ―Evaluating fMRI preprocessing pipelines—Review of pre-processing steps for BOLD fMRI,‖ IEEE Engineering in Medicine and Biology Magazine, Volume 25, Issue 2, pp. 27–

41, Mar./Apr. 2006.

[10] D. A. Karras and G. B. Mertzios, ―New PDE-based methods for image enhancement using SOM and Bayesian inference in various discretization schemes,‖ Measurement Science and Technology, Volume. 20, Issue 10, 2009.

[11] V. Rallabandi and P. Roy, ―Magnetic resonance image enhancement using stochastic resonance in Fourier domain,‖

Magnetic Resononance Imaging, Volume. 28, Issue 9, pp. 1361–

1373, 2010.

[12] X. Yang and B. Baowei Fei, ―A wavelet multiscale denoising algorithm for magnetic resonance (MR) images,‖ Measurement Science and Technology, Volume 22, no. 2, 2011.

[13] J. Sijbers, D. Poot, A. J. den Dekker and W. Pintjens,

―Automatic estimation of the noise variance from the histogram of a magnetic resonance image,‖Physics in Medicine and Biology, Volume. 52, pp. 1335–1348, 2007.

[14] C. Goutte, F. A. Nielsen, and L. K. Hansen, ―Modeling the haemodynamic response in fMRI using smooth FIR filters,‖

IEEE Trans. Med. Imag., Volume 19, no. 12, pp. 1188–1201, Dec. 2000.

[15] R. Gibbons, N. Lazar, D. Bhaumik, S. Sclove, H. Chen, K.

Thulborn, J. Sweeney, K. Hur, and D. Patterson, ―Estimation and classification of fMRI hemodynamic response patterns,‖

Neuroimage, vol. 22, pp. 804– 814, 2004.

[16] A. den Dekker, D. Poot, R. Bos, and J. Sijbers, ―Likelihood- based hypothesis tests for brain activation detection from MRI data disturbed by colored noise: A simulation study,‖ IEEE Trans. Med. Imag., vol. 28, no. 2, pp. 287–296, Feb. 2009.

[17] K. Barb´e, W. Van Moer, and L. Lauwers, ―Functional magnetic resonance imaging: An improved short record signal model,‖

IEEE Trans. Instrum. Meas., vol. 60, no. 5, pp. 1724–1731, May 2011.

[18] X. Descombes, F. Kruggel, and D. Y. von Cramon, ―Spatio- temporal fMRI analysis using Markov random fields,‖ IEEE Trans. Med. Imag., vol. 17, no. 6, pp. 1028–1039, Dec. 1998.

[19] M. Svens´en, F. Kruggel, and D. Y. Von Cramen, ―Probabilistic modeling of single trial fMRI data,‖ IEEE Trans. Med. Imag., vol. 19, no. 1, pp. 25–36, Jan. 2000.

[20] S. Faisan, L. Thoraval, J. P. Armspach, J. R. Foucher, M. N.

Metz-Lutz, and F. Heitz, ―Hidden Markov event sequence models: Toward unspervised functional MRI brain,‖ Acad.

Radiol., vol. 12, no. 1, pp. 25–36, Jan. 2005.

[21] K. J. Friston, A. P. Holmes, K. J. Worsley, J. P. Poline, C. D.

Frith, and R. S. J. Frackowiak, ―Statistical parametric maps in functional imaging: A general linear approach,‖ Human Brain Mapping, vol. 2, no. 4, pp. 189–210, 1994.

[22] E. Zarahn, G. K. Aguirre, and M. D’Esposito, ―Empirical analyses of BOLD fMRI statistics,‖ Neuroimage, vol. 5, no. 3, pp. 179–197, Apr. 1997.

[23] L. K. Hansen, J. Larsen, F. A. Nielsen, S. C. Strother, E.

Rostrup, R. Savoy, N. Lange, J. Sidtis, C. Svarer, and O. B.

Paulson, ―Generalizable patterns in neuroimaging: How many principal components,‖ Neuroimage,vol.9, no. 5, pp. 534–544, May 1999.

[24] K. H. Chuang, M. H. Chiu, C.C. Lin, and J. H. Chen, ―Model- free functional MRI analysis using Kohonen clustering neural network and fuzzy c-means,‖ IEEE Trans. Med. Imag., vol. 28, no. 12, pp. 1117–1128, Dec. 1999.

[25] Jingyu Liu, Lai Xu , Caprihan, A. and Calhoun V.D.,

―Extracting principle components for discriminant analysis of FMRI images‖, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2008, pp 449 – 452, 2008.

[26] S. J. Peltier, T. A. Polk and D. C. Noll, ―Detecting low- frequency functional connectivity in fMRI using a self-

(5)

Vol. 28, No. 7, (2019), pp. 33-37 organizing map (SOM) algorithm,‖ Human Brain Mapping, vol.

20, no. 4, pp. 220–226, Aug. 2003.

[27] W. Liao, H. Chen, Q. Yang, and X. Lei, ―Analysis of fMRI data using improved self-organizing map and spatio-temporal metric hierarchical clustering,‖ IEEE Trans. Med. Imag., vol. 27, no.

10, pp. 1472–1483, Oct. 2008.

[28] H. Chen, H. Yuan, D. Yao, L. Chen, and W. Chen, ―An integrated neighborhood correlation and hierarchical clustering approach of functional MRI,‖ IEEE Trans. Biomed. Eng., vol.

53, no. 3, pp. 452–458, Mar. 2006.

[29] T. H. Le and X. Hu, ―Potential pitfalls of principal component analysis in fMRI,‖ presented at the Int. Soc. Mag. Reson. Med.

3, Nice, France, 1995, p. 2008.

[30] M. J. McKeown, S. Makeig, G. G. Brown, T. P. Jung, S. S.

Kindermann, A. J. Bell, and T. J. Sejnowski, ―Analysis of fMRI data by blind separation into independent spatial components,‖

Human Brain Mapping,vol.6, no. 3, pp. 160–188, 1998.

[31] Honorio, J., Tomasi, D., Goldstein, R.Z., Leung, H.-C. and Samaras, D., ―Can a Single Brain Region Predict a Disorder?‖, IEEE Transactions on Medical Imaging, Volume 31, Issue: 11, pp 2062 – 2072, 2012.

[32] Paithane A.N. and D.S.Bormane.‖Electrocardiogram signal analysis using Empirical Mode Decomposition and Hilbert Spectrum‖, Pervasive Computing (ICPC), 2015 International conference on IEEE 2015.

[33] Fan Deng , Dajiang Zhu ,Jinglei Lv ; Lei Guo ,Tianming Liu,

―FMRI Signal Analysis Using Empirical Mean Curve Decomposition‖, IEEE Transactions on Biomedical Engineering, Volume 60, Issue 1, Part: 1, pp 42 – 54, 2013 [34] Paithane A.N.and D.S.Bormane. ―Analysis of nonlinear and

non-stationary signals to extract features using Hilbert Haung Transform‖, Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on IEEE, 2014.

[35] Chun-An Chou Kampa, K. , Mehta, S.H. , Tungaraza, R.F., Chaovalitwongse, W.A. , Grabowski, T.J., ―Voxel Selection Framework in Multi-Voxel Pattern Analysis of fMRI Data for Prediction of Neural Response to Visual Stimuli‖, IEEE Transactions on Medical Imaging, Volume 33, Issue 4, pp 925 – 934, 2014.

[36] Jiang Zhang, Xianguo Tuo, Zhen Yuan, Wei Liao, and Huafu Chen,‖Analysis of fMRI Data Using an Integrated Principal Component Analysis and Supervised Affinity Propagation Clustering Approach IEEE Transactions on Biomedical Engineering, Volume 58, Issue 11, pp 3184 – 3195, 2011.

[37] Paithane A.N., D.S.Bormane and U.G.Patil.‖Novel Algorithm for Feature Extraction and Feature Selection from Electrocardiogram Signal‖, International Journal of Computer Applications 134.9(2016):6-9.

[38] S.V. Raut, D.M.Yadav,‖ A Review on fMRI Signal Analysis and Brain Mapping Methodologies‖, AISC series of Springer International Conference on Computational Intelligence and Informatics-2016(ICCII-2016).