A random forest model based classification scheme for neonatal amplitude-integrated EEG

(1)

R E S E A R C H

Open Access

A random forest model based classification

scheme for neonatal amplitude-integrated EEG

Weiting Chen

1

, Yu Wang

1

, Guitao Cao

1*

, Guoqiang Chen

2*

, Qiufang Gu

2

From

IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2013)

Shanghai, China. 18-21 December 2013

* Correspondence: [email protected]. edu.cn

1_{Software Engineering Institute,}

East China Normal University, 3663 North Zhongshan Road, 200062, Shanghai, China

2

Shanghai Children’s Hospital of Fudan University, 399 Wanyuan Road, 201102, Shanghai, China

Abstract

Background:Modern medical advances have greatly increased the survival rate of infants, while they remain in the higher risk group for neurological problems later in life. For the infants with encephalopathy or seizures, identification of the extent of brain injury is clinically challenging. Continuous amplitude-integrated

electroencephalography (aEEG) monitoring offers a possibility to directly monitor the brain functional state of the newborns over hours, and has seen an increasing application in neonatal intensive care units (NICUs).

Methods:This paper presents a novel combined feature set of aEEG and applies random forest (RF) method to classify aEEG tracings. To that end, a series of experiments were conducted on 282 aEEG tracing cases (209 normal and 73 abnormal ones). Basic features, statistic features and segmentation features were extracted from both the tracing as a whole and the segmented recordings, and then form a combined feature set. All the features were sent to a classifier afterwards. The significance of feature, the data segmentation, the optimization of RF parameters, and the problem of imbalanced datasets were examined through experiments. Experiments were also done to evaluate the performance of RF on aEEG signal classifying, compared with several other widely used classifiers including SVM-Linear, SVM-RBF, ANN, Decision Tree (DT), Logistic Regression(LR), ML, and LDA.

Results:The combined feature set can better characterize aEEG signals, compared with basic features, statistic features and segmentation features respectively. With the combined feature set, the proposed RF-based aEEG classification system achieved a correct rate of 92.52% and a high F1-score of 95.26%. Among all of the seven classifiers examined in our work, the RF method got the highest correct rate, sensitivity, specificity, andF1-score, which means that RF outperforms all of the other

classifiers considered here. The results show that the proposed RF-based aEEG classification system with the combined feature set is efficient and helpful to better detect the brain disorders in newborns.

(2)

Background

Over the past decades, modern medical advances have greatly increased the survival rate of term and preterm infants [1]. Based on modern medical research, brain permanent damage can be minimized before it becomes irreversible [2]. Amplitude-integrated elec-troencephalography is an important tool for the neurological assessment of critically ill newborns [3]. Compared with imaging techniques such as Magnetic Resonance Imaging (MRI), aEEG is more suitable to continuously monitor the brain activity, which could record tracking changes and the maturation process of brain. Benefiting from the non-intrusive nature and high availability of aEEG, it is easy to be applied to portable bedside equipment.

The cerebral function monitor (CFM) was created in the 1960s by Douglas Maynard and first applied clinically by Pamela Prior [4]. In 1970s and early 1980s, Ingmar Rosén and Nils Svenningsen introduced the CFM in the intensive monitoring of brain func-tion in newborns [5][6]. Later, Lena Hellström-Westas started to evaluate the method in the neonatal intensive care unit (NICU) [7].

AEEG signal is derived from a reduced EEG which can be captured by CFM. Unlike the standard EEG, whose setting up and interpreting are labor intensive, aEEG signals are recorded from limited channels with symmetric parietal electrodess [8]. The aEEG proces-sing scenario includes an asymmetric band pass filter with pass band of 2-15Hz, semi-logarithmic amplitude compression and time compression. The filtering will minimize artifacts from sweating, movements, muscle activity and electrical interference. The ampli-tude is semilogarithmic ampliampli-tude compression (linear display 0-10 µV; logarithmic display 10-100µV). Continuous aEEG monitoring offers a possibility to directly monitor the functional state of the brain over hours and days. Toet et al.[9] gave a comparison between amplitude integrated electroencephalogram and standard electroencephalogram in neonates and pointed out CFM is a reliable tool for monitoring background patterns (especially normal and severely abnormal ones). Brain monitoring with aEEG is also reported to can better define brain injury and predict out-come than many other methods [3,10].

AEEG tracings are described and classified in several different ways, depending on whether normal or abnormal circumstances are evaluated and whether term or pre-term infants are studied [9]. A number of publications have described the normal development of aEEG patterns in full-term and preterm infants [10][11][12]. Figure 1 illustrates two typical background activities of normal and abnormal aEEG traces. Clin-ical aEEG monitoring can reveal abnormal brain activities, but there is a potential pos-sibility that the abnormality would pass unrecognized by users [8]. A study on an automatic method for detecting the cerebral activity based on aEEG can be helpful to avoid such unrecognition. In [13], a statistical distribution feature of aEEG signal was proposed. In [14], an algorithm for the automatic detection of seizures in aEEG was proposed, based on a sudden increase of the aEEG lower boundary which is the char-acteristic change caused by electrographic neonatal seizures.

(3)

reported that RF performed better in classification tasks for complex data [11]. In the previous work presented in the 2013 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) [12], we explored a random forest model with combined fea-tures for aEEG classification. The experiment results showed that RF achieved better performance than other machine learning algorithms, indicating it is a promising algo-rithm for the automatic aEEG signal interpretation. This paper is an extension to our previous work, focusing on the optimizing the configuration of the classification scheme.

Methods

The aEEG classification process is described in details in this section, including data description, the algorithm of random forest, feature extraction, classification and eva-luation. Figure 2 gives the block diagram of the RF-based classification system.

Data description

282 aEEG signals were acquired from Shanghai Children’s Hospital of Fudan Univer-sity, using the Olympic CFM 6000 (Olympic Medical Inc, Seattle, WA). Raw EEG signals were recorded through a pair of biparietal electrodes, and were then filtered, rectified, smoothed and selectively amplified to get aEEG. The positions of the recording electrodes were equivalent to the P3 and P4 electrode positions of the international 10-20 system. The aEEG samples with impedance greater than 10kΩ were discarded. The 282 cases include 209 normal cases and 73 abnormal ones, and the duration of each recording was 3 hours. All the aEEG tracings were interpreted to normal or abnormal ones by experienced clinicians independently.

(4)

Random forest model description

Random forest (RF) developed by Leo Breiman in 2001 has been proved to be a powerful approach with excellent performance in classification tasks [15][16]. Introdu-cing both bagging and random variable selection for tree building, RF utilizes an ensemble of classification trees, which are built on the bootstrap sample of the data. At each split, variable candidate set is randomly selected from the whole variable set. Randomness is injected by growing each tree on different random subsamples and determining splitter partly at random. Each tree is grown fully to obtain a low-bias. Both bagging and random variable selection assure the low correlation for individual trees. Through the averaging over a large ensemble of low-bias, high-variance but low correlation trees, the Algorithm 1 yields an ensemble forest [15].

In this paper, several algorithmic issues were examined, including parameter optimi-zation and imbalanced dataset processing.

Algorithm 1Algorithm of Random Forest

Input:

T: Training set(x1,y1), (x2,y1),. . ., (xn,yn);

Ntree: the number of trees to be built;

Mtry : the number of variables chosen for splitting at each node;

Training:

foreachb= 1 :Ntreedo

Draw a bootstrap sample X_bfrom the given training setT.

At each node of treetr_b, selectM_try variables randomly and determine the best split among these M_tryvariables.

Construct an unpruned treetr_busing the above bootstrapped samples.

end for Classification:

Classify by majority vote among theNtrees. Compute

favg(X) := (p1(X),. . .,pk(X)) := 1

N

b=1

fi(X)

fRF(X) :=argmaxk{p1(X),. . .,pk(X)}

Parameter optimization

In order to achieve desired performance, two important parameters need to be opti-mized in the RF algorithm. One is the number of input variables M_trytried at each split, and the other is the number of trees to grow (Ntree) for each forest.Mtry

consid-ered at each split is a real parameter in the sense that its optimal value depends on the data. The default value (the square root of the number of input variables) is often a good choice for Mtry [17]. Generally speaking, the number of treesNtree in the forest

should increase with the number of candidate predictorsMtry, so that each predictor

has enough opportunities to be selected. To get an appropriate value of Ntree, we can

try several increasing values and select the value when the prediction error stabilizes.

Imbalanced datasets processing

(5)

misclassifying of the minority abnormal class is much higher than the cost of other misclassifications, the imbalanced dataset problem is one of the important issues we need to consider to insure a satisfying result.

In this paper, we attempt to make the classifier more robust to the problem of class imbalance by using class weights. A heavier penalty is given when the RF misclassifies the minority class because the classifier tends to be biased towards the majority class [18]. Each class is set a weight, with the minority class given a larger one. Class weights are applied in two places. The first one is in the tree building procedure, where class weights are used to weight the Gini criterion for split point finding. The second one lies in the prediction procedure to produce a “weighted majority vote”by each terminal node. In such a weighted RF model, the final prediction is determined by aggregating the weighted vote from each individual tree. As essential tuning para-meters to achieve desired performance, the class weights can be selected through the out-of-bag estimate of the accuracy of RF model [19].

Feature extraction

Three kinds of features were extracted to characterize the aEEG signals, including basic features, the histogram features from the signal as a whole, and the segment features got from segmented aEEG recordings.

Basic Features

Basic features were extracted from the initial 3-hour-length aEEG signal, including minimum amplitude, maximum amplitude, mean value of amplitude and percentage of the lower margin values under 5µV . For a 3-hour-length recording, we can get four features.

Histogram features

According to the clinical diagnosis criteria, the distribution of aEEG amplitude means a lot for interpretation of the signal [13][20][21][22]. In this work, a histogram of ampli-tude was calculated to reveal the distribution of aEEG ampliampli-tudes.

As aEEG classification is more sensitive to lower amplitude than the higher ones, in our experiments, 1µV was used as the width of interval in lower amplitude areas (≤ 50µV ) and 10µV as the width of interval in higher amplitude areas (> 50µV ). Thus we can get from one 3-hour-length aEEG recording 55 features carrying the histogram information. Four different histogram of normal and abnormal aEEG sig-nals are illustrated in Figure 3. We can observe that, the histogram of normal aEEG looks like the normal distribution (shown in (a)), when those of abnormal aEEG sig-nals are more irregular (shown in (b)-(d)).

Segment features

(6)

of the segmentation. The upper and lower boundaries are derived from the envelope of the segmented aEEG. The second order envelope was calculated for each segment. Averaging the upper and lower envelop, we got the upper boundary and the lower boundary of one segment (illustrated in Figure 5).

Approximate entropy [23][24] is a good description of aEEG signals helpful in detecting brain disorders of the newborn. ApEn can be briefly described as follow: Given a time-series u(1), u(2), . . . , u(N ), a vector in m dimensions is defined as

X_im= [u(i),u(i+ 1),. . .,u(i+m+ 1)], wheremis the length of comparing window. And a sequence of vectorsXm1,X2m, ,. . .XmN−m+1can be constructed. For eachi, 1≤i≤N−m+ 1, Figure 3Amplitude histograms of normal and abnormal aEEG signals: (a) normal; (b)(c)(d) abnormal.

(7)

letCm_r(r)be (n−m+ 1)−1 times the number of vectorsX_jmwithin tolerancerofX_im. Then we can defineFm(r) as the following formula fromCm_r(r):

m(r) =

N−m+1

i=1 lnCmi (r)

(N−m+ 1) , (1)

wherelnis the natural logarithm. Given a fixed positive integermand a positive real numberr, ApEn(m,r) is defined by

ApEn(m,r) = lim N→∞[

m₍_r₎₋m+1₍_r_)].

(2)

For a fixedNdata points, it is defined as

ApEn(m,r,N) =m(r)−m+1(r). (3)

For one 3-hour-length aEEG recording, 80 segments were observed. And for each segment, we can get four features: the upper boundary, the lower boundary, the mean value and ApEn. Thus for one 3-hour-length recording, we can get 320 features. Obviously it is time consuming if all these features are sent into a classifier. To speed up the classification processing, it’s wise to reduce the dimension of the feature vector by ignoring those unimportant ones. According to our previous work [23], ApEn with higher or lower values may more likely indicate the abnormality of a signal. So the seg-ment features were firstly sorted in an ascending order according to the values of ApEn, and then only those segments with high and low values of ApEn are selected. Through experiments, we picked up the segment features with the ten top and the five bottom values of ApEn, and thus we got a 60-dimensional feature vector for one 3-hour-length recording.

After the basic features, histogram features and segment features had been got respectively, they were integrated into one combined feature set with 119 features.

Classification

The weighted RF was applied to classify the 282 aEEG signals based on the feature sets got above. To evaluate the performance of RF on aEEG classification, other widely

(8)

used classifiers were also tested on the same data sets and the identical feature sets. The compared classification methods include the support vector machine with RBF kernel (SVM-RBF), support vector machine with linear kernel (SVM-Linear) and artifi-cial neural network (ANN). As a reference, we also considered the Maximum Likeli-hood (ML), Decision Tree using CART (DT), Logistic Regression (LR), Linear Discriminant Analysis (LDA) algorithm, four of the most popular traditional supervised classification methods.

Performance evaluation

Instead of using cross validation or estimating from a separate testing, an unbiased error can be estimated internally in random forest [25]. Each tree is constructed under a dif-ferent bootstrap sample. About one-third of the samples are left out of the bootstrap sampling and not used in the construction of the tree, so the left out samples can be put into the tree as test samples. At the end of the procedure, we took the number to be the class that got most of the votes every time. By calculating the proportion of misclassified samples over all cases, we can get the OOB error estimation, which has been proved to be an unbiased error estimation method for random forest [26].

Specificity, sensitivity andF1-score were applied to evaluate the performance of the

classifiers. The specificity is defined as the percentage of the number of true negatives over the sum of the number of true negatives and that of false positives. The sensitivity refers to the percentage of the number of true positives over the sum of the number of true positives and that of false negatives. The F1-score can be interpreted as a

harmo-nic compromise of precision and recall, which reaches its best value at 1 and worst score at 0 [27].

Results

We conducted a series of experiments on 282-subject dataset to achieve an optimum configuration of the RF-based classifier. The experimental study can be divided into three parts. The first set of experiments examined the effects of parameters N_tree, M_try and evaluated the candidate feature sets. In the second set of experiments, we dealt with the problem of imbalanced datasets. And the third set of experiments compared the performance of RF-based classifier with those of other classifiers.

Parameters tuning and feature evaluation

As there are two relevant parameters to be optimized in RF algorithm, we have to try out one of the parameter with the other one supposed to be given.

To select an appropriate value of N_tree, we initially built forest with a default para-meter of Mtry=

√

119≈11, and then the average Out of Bag (OOB) error was exam-ined in different forest sizes Ntree. Figure 6 illustrates the change of OOB error with

increasing forest size Ntree under default Mtry . It can be observed that the OOB

error tends to be stable at 0.083 when Ntreeincreases to about 220. As RF works

bet-ter with greabet-ter Ntree, we selected 1000 as the forest size in our further experiments.

After Ntreehad been set, forests were built with varying values ofMtryfrom 1 to 119.

(9)

experiments, a final random forest model was generated using the parameters Ntree = 1000 and Mtry= 11.

To assess the feature sets, we worked out the significance of feature as following: In every tree grown in the forest, first write down the OOB cases and count the number of votes for the correct class, and then permute the values of feature x_iin the OOB cases randomly and put these cases into the tree classifier. For each tree t, subtract the number of correct votes in the feature-x_i-permuted OOB data from that of correct votes in the unpermuted OOB data:

FIt(xi) =

i∈ ¯BtI(yi=yˆti) |Bt_| −

i∈ ¯BtI(yi=ˆyit,πj)

|Bt_| (4)

Figure 6Random Forest parameter selection. The average Out of Bag (OOB) error with defaultMtryfor different forest sizes. The size should be greater than 220 with a stable OOB error of 0.083.

Figure 7Random Forest parameter selection. The average OOB error for different value ofMtrywith

Ntree= 1000. The optimal value ofMtrywith the lowest OOB error occurs near the default parameter of

(10)

ˆ

yt_i=ft(xi) : predicted class before permuting; (5)

ˆ

yt_i_,_π_j=ft(xi,πj) : predicted class after permuting ,πi; (6)

The average ofFIt(x_i) over all trees in the forest is the raw significance score for fea-turex_ias in Equation 7:

Ixi =

Ntree

t=1 FIt(xi)

Ntree

. (7)

The significances of all the four kinds of features are shown in Figure 8. Most fea-tures’significances in the original feature set are greater than 0.05. To find the most valuable features, the basic, the histogram and the segment features, as well as the combined features were sent to a RF-based classifier respectively. The classification results based on the four kinds of feature sets are depicted in Figure 9. It is obvious that the combined feature set works the best among the four, with a minimum median and narrowest variation of classification error.

Imbalanced datasets processing

For weighted random forest, we tuned the class weight for final prediction: when we raise the minority class weight, the cost of misclassification of the minority class goes up, thus we can get a higher true positive rate and a lower true negative rate. To bal-ance the sensitivity and specificity, the geometric mean (G-mean) was applied. A dis-tinctive property of the G-mean measure is that it is independent of the distribution of classes. It will reach its best value when sensitivity and specificity are performed well at same time. G-mean can be calculated by Equation 8:

g=Sensitivity∗Specificity. (8)

Figure 10 gives the correct rate and G-mean of the weighted random forest with dif-ferent weights. The results show that the model works the best when the weight of the abnormal and normal class is assigned to 3:1.

(11)

Classification

To appraise the performance of the RF-based classifier, some widely used classifiers, including the SVM-RBF, SVM-Linear, ANN, DT, LR, ML, and LDA, were also applied to classify the identical data based on the identical feature sets. OOB method and 10-fold cross validation method were utilized to evaluate the prediction ability of RF and those of other classifiers respectively. Further more, we compared the performance of RF build on different feature sets. Based on previous analysis, we can select part of fea-tures with high significance to build our model for acceptable accuracy and efficiency.

Table 1 describes the results of classification. In this table, RF with the optimal para-meters outperforms all of the other classifiers. The RF method gets the highest correct rate, sensitivity, specificity, as well as F1-score among the seven classifiers. Table 2

shows the performances of RF model trained with different feature sets. During the model training, we selected the top n% (from 10% to 100%) features based on its sig-nificance score to build our model.

Figure 9The classification error of the four kinds of feature sets.

(12)

Conclusions

In this paper, we proposed a RF-based method for aEEG classification and defined a combined feature set. Basic features, statistical features and segment features were extracted from the whole signal as well as from signal segmentations. The combined feature set consisting of the three kinds of features was then sent to the RF classifier. The significance of feature, the data segmentation, parameter optimization of RF algo-rithm, and the problem of imbalanced datasets were examined. Experiments were also conducted to evaluate the performance of RF on aEEG classification, compared with several other widely used classifiers. Results show that, outperforming other widely used classifiers examined here, random forest with the combined feature set is efficient and can help better detect the brain disorders in newborns.

Competing interests

The authors declare that they have no competing interests.

Authors_’contributions

The work presented here was carried out in collaboration among all the authors. WC, GTC and YW designed the methods and experiments. WC and YW carried out the experiments and wrote the manuscript. GQC, QFG recorded and analyzed the data, interpreted the results and helped to write the manuscript. All the authors approved the manuscript.

Acknowledgements

All the aEEG data were provided by Shanghai Children’s Hospital of Fudan University;

Declarations

Publication of this article has been funded by the National Natural Science Foundation of China (Grant No. 81101119), Natural Science Foundation of China (Grant No. 61340036), the Open Project of Software/Hardware Co-design Engineering Research Center MoE, and National Key Basic Research Program (Grant No. 2011CB707104). This article has been published as part ofBioMedical Engineering OnLineVolume 13 Supplement 2, 2014: Selected articles from the IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2013): BioMedical Engineering OnLine. The full contents of the supplement are available online at http://www.biomedical-engineering-online.com/supplements/13/S2.

Table 1 Performance comparison for different classifiers

Method Correct Rate Sensitivity Specificity F1-score

RF 92.52% 93.78% 87.50% 95.26%

SVM RBF 89.67% 91.56% 82.14% 93.52%

SVM Linear 86.12% 89.19% 74.58% 91.03%

ANN 91.10% 92.00% 87.50% 94.31%

DT 87.90% 85.83% 61.76% 71.84%

LR 85.40% 88.31% 68.29% 77.03%

ML 75.44% 78.67% 62.50% 83.69%

LDA 72.95% 73.33% 71.43% 81.28%

Table 2 Performance comparison for feature set under different sizes

Selected feature set size(%) Correct Rate Sensitivity Specificity F1-score

100 92.17% 93.33% 87.50% 95.07%

80 91.81% 92.89% 85.71% 94.78%

60 90.62% 90.19% 87.58% 93.03%

40 88.67% 91.11% 83.93% 91.37%

20 83.33% 96.33% 32.14% 89.59%

(13)

Authors’details

1_{Software Engineering Institute, East China Normal University, 3663 North Zhongshan Road, 200062, Shanghai, China.} 2

Shanghai Children’s Hospital of Fudan University, 399 Wanyuan Road, 201102, Shanghai, China.

Published: 11 December 2014

References

1. Selton D, Andre M, Hascoet J:Normal eeg in very premature infants: reference criteria.Clinical neurophysiology2000,

111(12):2116-2124.

2. Colditz P, Buck L, Foster K, Lingwood B:Can signal processing help prevent brain damage in the newborn?Signal Processing and Its Applications, 1999 ISSPA’99 Proceedings of the Fifth International Symposium On1999,1:345-350, IEEE. 3. Burdjalov VF, Baumgart S, Spitzer AR:Cerebral function monitoring: a new scoring system for the evaluation of

brain maturation in neonates.Pediatrics2003,112(4):855-861.

4. Maynard D, Prior PF, Scott D:Device for continuous monitoring of cerebral activity in resuscitated patients.British medical journal1969,4(5682):545.

5. Bjerre I, Hellström-Westas L, Rosén I, Svenningsen N:Monitoring of cerebral function after severe asphyxia in infancy.

Archives of disease in childhood1983,58(12):997-1002.

6. Viniker D, Maynard D, Scott D:Cerebral function monitor studies in neonates.Clinical EEG (electroencephalography) 1984,15(4):185-192.

7. Hellströ-Westas L, Rosén I, Svenningsen N:Cerebral complications detected by eeg-monitoring during neonatal intensive care.Acta Paediatrica1989,78(s360):83-86.

8. Spitzmiller RE, Phillips T, Meinzen-Derr J, Hoath SB:Amplitude-integrated eeg is useful in predicting

neurodevelopmental outcome in full-term infants with hypoxic-ischemic encephalopathy: a meta-analysis.Journal of child neurology2007,22(9):1069-1078.

9. Toet MC, van der Meij W, de Vries LS, Uiterwaal CS, van Huffelen KC:Comparison between simultaneously recorded amplitude integrated electroencephalogram (cerebral function monitor) and standard electroencephalogram in neonates.Pediatrics2002,109(5):772-779.

10. Friberg H, Westhall E, Rosén I, Rundgren M, Nielsen N, Cronberg T:Clinical review: Continuous and simplified electroencephalography to monitor brain recovery after cardiac arrest.Crit Care2013,17:233.

11. Sculley D,et al:Results from a Semi-Supervised Feature Learning Competition.2012.

12. Wang Y, Chen W, Huang K, Gu Q:Classification of neonatal amplitude-integrated eeg using random forest model with combined feature.Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference On2013, 285-290, IEEE.

13. Wong L:Quantitative continuity feature for preterm neonatal eeg signal analysis.PhD thesis, ResearchSpace@ Auckland2009.

14. Lommen C, Pasman J, Van Kranen V, Andriessen P, Cluitmans P, Van Rooij L, Bambang Oetomo S:An algorithm for the automatic detection of seizures in neonatal amplitude-integrated eeg.Acta Paediatrica2007,96(5):674-680. 15. Breiman L:Random forests.Machine learning2001,45(1):5-32.

16. Segal MR:Machine learning benchmarks and random forest regression.2004. 17. Liaw A, Wiener M:Classification and regression by randomforest.R news2002,2(3):18-22. 18. Chen Chao LB:Andy Liaw: Using random forest to learn imbalanced data.2004.

19. Lin WJ, Chen JJ:Class-imbalanced classifiers for high-dimensional data.Briefings in bioinformatics2013,14(1):13-26. 20. Hellström-Westas L, Rosen I, De Vries L, Greisen G:Amplitude-integrated eeg classification and interpretation in

preterm and term infants.NeoReviews2006,7(2):76-87.

21. Chalak LF, Sikes NC, Mason MJ, Kaiser JR:Low-voltage aeeg as predictor of intracranial hemorrhage in preterm infants.Pediatric neurology2011,44(5):364-369.

22. Hellström-Westas L, De Vries LS, Rosén I:An Atlas of Amplitude-integrated EEGs in the Newborn.Parthenon Publishing Group, London; 2003:1:1-17.

23. Li L, Chen W, Shao X, Wang Z:Analysis of amplitude-integrated eeg in the newborn based on approximate entropy.Biomedical Engineering, IEEE Transactions on2010,57(10):2459-2466.

24. Pincus SM:Approximate entropy as a measure of system complexity.Proceedings of the National Academy of Sciences 1991,88(6):2297-2301.

25. Kohavi R,et al:A study of cross-validation and bootstrap for accuracy estimation and model selection.IJCAI1995,

14:1137-1145.

26. Breiman L:Out-of-bag estimation.Technical report, Citeseer1996.

27. Sokolova M, Japkowicz N, Szpakowicz S:Beyond accuracy, f-score and roc: a family of discriminant measures for performance evaluation.2006, 1015-1021.

doi:10.1186/1475-925X-13-S2-S4