Wavelet Based Dictionaries for Dimensionality Reduction of ECG Signals

(1)

arXiv:1811.05517v2 [eess.SP] 10 Jun 2019

Wavelet Based Dictionaries for Dimensionality Reduction of

ECG Signals

Laura Rebollo-Neira

Mathematics Department Aston University

B3 7ET, Birmingham, UK

Dana ˇ

Cern´a

Technical University of Liberec

Liberec, Czech Republic

June 12, 2019

Abstract

Dimensionality reduction of ECG signals is considered within the framework of sparse rep-resentation. The approach constructs the signal model by selecting elementary components from a redundant dictionary via a greedy strategy. The proposed wavelet dictionaries are built from the multiresolution scheme, but translating the prototypes within a shorter step than that corresponding to the wavelet basis. The reduced representation of the signal is shown to be suitable for compression at low level distortion. In that regard, compression results are superior to previously reported benchmarks on the MIT-BIH Arrhythmia data set.

1 Introduction

Electrocardiography (ECG) is the process of recording the electrical activity of the heart over a period of time. The overall goal of the test is to obtain information about the structure and function of that organ. Dimensionality reduction of ECG signals is relevant to techniques for analysis and classification of this type of data. Such techniques are subject of ongoing research [1–9], which includes methodologies based on the theory of compressive sensing [10–14].

Compressive sensing enhances the concept of sparsity by associating it to a new framework for digitalization [15,16]. Producing sparse representation of ECG signals is the central aim of the present work. This implies to represent an ECG record as superposition of as few elementary components as possible. The same goal was traditionally accomplished by disregarding the least significant terms in the wavelet transform of the signal [17]. However, as shown in this work, much less elementary components are necessary if they are selected from a redundant wavelet dictionary rather than from

(2)

a wavelet basis. This allows us to project an ECG record onto a subspace of lower dimensionality, in comparison to the dimension of the signal, and still reconstruct the record at low level distortion.

The main contributions of the paper are listed below:

• A number of wavelets dictionaries, based on existing families of wavelet bases, are developed.

• Each dictionary is shown to render much higher levels of sparsity for representing ECG signals

than the corresponding basis. The tests are performed on the MIT-BIH Arrhythmia data set consisting of 48 records each of which of 30 min length.

• The metric of local sparsity is shown to be relevant to the detection of non-stationary noise or

significant distortion in the patters of an ECG record.

• A strategy for storing the reduced representation of an ECG signal is considered. The resulting

file is shown to render higher compression ratio than previously reported results within the state of the art for the same data set.

• MATLAB software for constructing the dictionaries, as well as for reproducing results, has been

made available on a dedicated website [18].

The paper is organized as follows: Sec. 2 presents all the elements to build the proposed model for piecewise dimensionality reduction of an ECG record. Sec. 3 describes a strategy to encode the reduced representation of the signal. Sec. 4 presents and discusses numerical results. The conclusions are summarized in Sec. 5.

2 Piecewise ECG signal model

Assuming that an ECG record is given as an array f of dimension N, we divide the signal into Q

disjoint segmentsfq ∈RNb, q= 1, . . . , Qand construct an approximationfqkq for each segment as the

kq-term ‘atomic decomposition’

fkq q = kq X n=1 cq(n)dℓqn, (1)

where the elementsd_ℓq

n, called ‘atoms’, are chosen from a redundant dictionaryD={dn,∈R

Nb,kd

nk=

1}through a greedy pursuit strategy. Here for the selection process we adopt the Optimized

Orthog-onal Matching Pursuit (OOMP) method [19,20] which is stepwise optimal in the sense of minimizing

the residual normkfq−f

kq

q k at each iteration. The algorithm evolves by selecting the atoms one by

(3)

2.1 The OOMP Method

Since ECG records are normally superimposed to a smooth background, we initialize the OOMP

algorithm by assuming that the constant atom d₁ is always one of the elements in (1). Hence, for

each q we set kq = 0, Γq = ∅, ℓq1 = 1, and w

q

1 = b 1,q

1 = d1. Accordingly fq1 =d1hfq,d1i (where h·,·i

indicates the Euclidean inner product), and r1_q =f_q−f_q1.

From the above initialization the OOMP approach for selecting from D the elements d_ℓq

n, n =

2, . . . , kqin (1), and calculating the corresponding coefficientscq(n), n = 1, . . . , kq, iterates as follows.

1) Upgrade the set Γq←Γq∪ℓqkq+1, increase kq ←kq+ 1, and select the index of a new atom for

the approximation as ℓq_k_q₊₁ = arg max n=1,...,M n /∈Γq |hdn,rkqqi|2 1−Pkq i=1|hdn,w˜qii|2 , (2) with ˜wq_i = wqi kwq ik.

2) Orthogonalization step. Compute the corresponding new vector wq_k_q₊₁ as

w_kq q+1 =dℓ q kq+1 − kq X n=1 w_nq kwqnk2 hwq n,dℓq_kq₊₁i (3)

including, for numerical accuracy, the re-orthogonalization step:

w_kq_q₊₁ ←wq_k_q₊₁− kq X n=1 wq n kwnqk2 hwq n,w q kq+1i. (4)

3) Biorthogonalization step. Forn = 1, . . . , kq upgrade vectors bknq,q as

bkq+1,q n =bknq,q −b kq+1,q kq+1 hdℓ q kq+1,b kq+1,q n i, bkq+1,q kq+1 = w_kq_q₊₁ kwq_k_q₊₁k2. (5) 4) Calculate rkq+1 q = rkqq− hwqkq+1,fqi w_kq q+1 kwq_k_q₊₁k2. (6)

5) If for a given ρ the condition krkq+1

q k < ρ has been met compute the coefficients cq(n) =

hbkq+1

n ,fqi, n = 1, . . . , kq+ 1 to write the approximation fqkq of fq as in (1). Otherwise repeat

(4)

Once all the segmentsfq have been approximated, the approximation of the whole signal is obtained

by assembling the approximation of the segments as fr = ˆJQ_q₌₁fkq

q , where ˆJ is the concatenation

operation.

The complexity of the method, dominated by the selection of atoms, is O(KNbM) with K =

PQ q=1kq.

A suitable model for dimensionality reduction of the signal must involve significantly fewer atoms in their atomic decomposition than the dimension of the signal. The success in achieving such a goal depends on the dictionary being used. Next we discuss wavelet dictionaries which are fit for the purpose.

2.2 Wavelet Based Dictionaries

The proposed wavelet dictionaries are inspired on the construction of dictionaries for Cardinal Spline

Spaces (CSS) as discussed in previous works [21–23]. Let us denote Vj, j ≥0 to the CSS of order m

with simple knots at the equidistant partition of the interval [c, d] having distance 2−j _{between two}

adjacent knots. A basis forVj arises from the restriction of the functions

φj,k(x) := 2j/2φ(2jx−k), k ∈Z (7)

to the interval [c, d], which we expressed as 2j/2_φ₍₂j_x₋_k₎_|

[c,d]. The function φ(x)≡φ0,0(x) is called

scaling function. For a CSS φ(x) is the cardinal B-spline of order m associated with the uniform

simple knot sequence 0,1, . . . , m [24]

φ(x) = 1 m! m X i=0 (−1)i m i (x−i)m₊−1, (8)

where (x−i)m₊−1 is equal to (x−i)m−1 _if _x₋_{i >}_{0 and 0 otherwise.}

The complementary wavelet subspace Wj on [c, d] is constructed in order to fulfil

Vj+1 =Vj ⊕Wj, j ∈Z+, (9)

where⊕ indicates the direct sum, i.e. Vj∩Wj ={0}. Hence,

Vj+1 =V0⊕W0⊕W1⊕ · · ·Wj. (10)

For a given value ofj a wavelet basis for Wj arises as

ψj,k(x) = 2j/2ψ(2jx−k)|[c,d], k ∈Z (11)

and the elimination of some redundancy introduced by the cutting process at the borders of the

(5)

0 2 4 6 8 −6 −4 −2 0 2 4 0 2 4 6 8 −6 −4 −2 0 2 4

Figure 1: Wavelet functions taken from a basis for a CSS (left graph) and from a dictionary spanning the same CSS (right graph).

Within the multi-resolution framework, dictionaries for a CSS are simply constructed from the

result asserting that if for l∈Z+

Wj,l={2j/2ψ(2jx−

k

2l)|[c,d], k ∈Z}, (12)

then [22]

span{Wj,l}=Vj+l. (13)

This result facilitates a tool for designing dictionaries of wavelets of different support spanning the

same CSS. A multi-resolution-like dictionaryDj,l spanning Vj is constructed as

Dj,ℓ=V0,l∪ W0,l∪ W1,l ∪ · · · ∪ Wj−l,l, (14)

with

V0,l ={φ(x−

k

2l)|[c,d], k∈Z}. (15)

The top graph of Fig. 1 shows consecutive wavelets in a cubic Chui-Wang4 [25] wavelet basis and consecutive wavelets in a dictionary for the same CSS. Notice that the wavelet functions at the finest scale in the dictionary are broader than those at the finest scale in the basis.

For the sake of comparison we have built dictionaries for the following wavelet families.

Semi-orthogonal spline waveletswere mentioned above. Their construction was proposed by C. Chui and J. Wang in [25]. Semi-orthogonal wavelets, also called prewavelets, are not orthogonal if the wavelets are at the same scale level, but orthogonality of wavelets at the different levels is

(6)

0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2 0 100 200 300 400 500 −0.2 −0.1 0 0.1 0.2

Figure 2: Wavelet prototypes. From left top to right bottom the graphs correspond to the families

CW4, CW2, CDF97, CDF53,Db4, Coif, Sym,and Short3.

respectively. The wavelet prototypes corresponding to this family are given the 1st and 2nd graphs in the 1st row Fig. 2.

Cohen-Daubechies-Feauveau biorthogonal wavelets were built in [26]. In this case both primal and dual wavelets have compact support. Wavelets from this family with 9 and 7 taps filters (CDF97) and with 5 and 3 taps filters (CDF53) are used in the JPEG2000 compression standard. The corresponding prototypes are given in the 3rd and 4th graphs in the 1st row of Fig. 2.

Daubechies wavelets were built by I. Daubechies as first orthonormal compactly supported continuous wavelets in [27]. They have the shortest possible support for the given number of vanishing moments and they have extremal phase. Here we consider the four vanishing moments and indicate

it as Db4. This wavelet prototype is displayed in the 1st graph of the 2nd row of Fig. 2.

Coiflets were suggested by R. Coifman as orthonormal wavelet systems with vanishing moments for both scaling functions and wavelets. They were first constructed by I. Daubechies in [28]. We consider two vanishing moments used exact values of scaling and wavelet parameters from [29] and

indicate the family as Coif. The mother wavelet is displayed in the 2nd graph in the 2nd row of

Fig. 2.

Symlets are modified version of Daubechies wavelets. They have also the shortest possible support for the given number of vanishing moments, see [28]. We consider four vanishing moments.

The prototype wavelet for the familySym is given in the 3rd graph in the 2nd row of Fig. 2.

Spline wavelets with short support are obtained if the local support of dual wavelets is not required. They were designed in [30, 31]. Here we consider quadratic splines and indicate them as

Short3. The prototype for this family is plotted in last graph of the 2nd row of Fig. 2.

In the discrete case dictionaries can be constructed in a very simple manner: A redundant

(7)

such that N of them are linearly independent and M > N. Inspired by the results for CSS we

construct a dictionary for RN simply by discretization of the functions in the wavelet dictionary.

As already mentioned, ECG signals are commonly superimposed to a smooth background, hence we enrich a wavelet dictionary adding a few low frequency components from a discrete cosine basis.

Thus the whole dictionary we consider is built asD = DC_{∪ D}W_{, with} _DW _{a discrete wavelet basis}

(or dictionary) for RNb and

DC ₌ _{_w c(n) cos π(2i−1)(n−1) 2M , i= 1, . . . , Nb} M n=1,

where each wc(n) is a normalization constant. For the numerical tests we fix M = 10 for DC and

for DW _{we consider different wavelet dictionaries, all obtained from the aboved mentioned wavelet}

bases.

3 Encoding the signal model

For the reduced representation of the signal (1) to be suitable for storage and transmission purposes it is necessary to encode it within a small file in relation to the size of the original record. For

this end, firstly the absolute value coefficients |cq(n)|, n= 1. . . , kq, q = 1, . . . , Q, are converted into

integers as follows:

c∆_q(n) =⌊|cq(n)|

∆ +

1

2⌋, (16)

where ⌊x⌋ indicates the largest integer number smaller or equal to x and ∆ is the quantization

parameter. The signs of the coefficients, represented as sq, q = 1, . . . , Q, are encoded separately

using a binary alphabet. For creating the strings of numbers to encode the the signal model we proceed as in [32, 33].

The indices of the atoms in the atomic decompositions of each blockf_qare first sorted in ascending

orderℓq_i →ℓ˜q_i, i= 1, . . . , kq, which guarantees that, for eachqvalue, ˜ℓqi <ℓ˜

q

i+1, i= 1, . . . , kq−1. The

order of the indices induces order in the unsigned coefficients, c∆

q → ˜c∆q and in the corresponding

signs sq → ˜sq. The ordered indices are stored as smaller positive numbers by taking differences

between two consecutive values. By definingδ_iq= ˜ℓq_i −ℓ˜q_i₋₁, i= 2, . . . , kq the follow string stores the

indices for blockq with unique recovery ˜ℓq₁, δ₂q, . . . , δ_kq_q. The number ‘0’ is then used to separate the

string corresponding to different blocks and entropy code a long string,stind, which is built as

stind = [˜ℓ1₁, . . . , δ1_k1,0,ℓ˜ 2 1, . . . , δk22,0,· · ·,ℓ˜ kQ 1 , . . . , δ Q kQ]. (17)

(8)

follows:

stcf = [˜c∆₁(1), . . . ,c˜∆₁(k1),· · · ,˜c∆_k_Q(1), . . . ,˜c∆_k_Q(kQ)]. (18)

Using ‘0’ to store a positive sign and ‘1’ to store negative one, the signs are placed in the string,stsg

as

stsg = [˜s1(1), . . . ,s1˜(k1),· · · ,s˜kQ(1), . . . ,s˜kQ(kQ)]. (19)

4 Results

The numerical tests we present here use the full MIT-BIH Arrhythmia database [35] which contains

48 ECG records. Each of these records is of 30 min length, consisting ofN = 650000 11-bit samples

at a frequency of 360 Hz.

All our results have been obtained in the MATLAB environment using a notebook 2.9GHz dual core i7 3520M CPU and 4GB of memory.

4.1 Assessment Metrics

The quality of the signal approximation is assessed with respect to the PRD calculated as follows,

PRD = kf −f

r_k

kfk ×100% (20)

where, f is the original signal and fr is the signal reconstructed from the approximated segments.

Since the PRD strongly depends on the background of the signal, the PRDN as defined below is also a relevant metric.

PRDN = kf −f

r_k

kf −fk ×100%, (21)

where, f indicates the mean value of f.

For a fixed value of PRD the sparsity of a representation is assessed by the sparsity ratio (SR)

SR = N

K, (22)

where N is the total length of the signal and K = PQ_q₌₁kq, with kq the number of atoms in the

atomic decomposition (1) of each segment of length Nb. For detection of nonstationary noise in

patterns it is useful to define the measure of local sparsity sr(q) as follows:

sr(q) = Nb

kq

, q= 1, . . . Q. (23)

The compression performance depends of the size of the file storing the signal representation and is assessed by the Compression Ratio (CR) as given by

CR = Size of the uncompressed file

(9)

In the numerical tests below we use the notation CR to denote the compression ratio obtained when

the string stind, stcf and stsg are stored in hierarchical data format [34], simply using the MATLAB

instructionsave. If a Huffman coding step is included the corresponding compression ratio is denoted

as CRHf

.

The quality score (QS), reflecting the tradeoff between compression performance and reconstruc-tion quality, is the ratio:

QS = CR

PRD. (25)

Since the PRD is a global quantity, in order to detect possible local changes in the visual quality of

the recovered signal, we consider the local PRD as follows. Each signal is partitioned in Qsegments

fq, q = 1. . . , Q of Nb samples. The local PRD with respect to every segment in the partition is

indicate as prd(q), q= 1, . . . Q and calculated as

prd(q) = kfq−f

r

qk

kfqk

×100%, (26)

where f_qr is the recovered portion of the signal corresponding to the segment q. In all the numerical

test of the next section the OOMP approximation is stopped through a fixed valueρso as to achieve

the same value of prd for all the segments in the records. Assuming that the target prd before

quantization is prd0 we set ρ= prd0kfqk/100.

4.2 Test I

The purpose of the first numerical tests is to demonstrate the significant gain in dimensionality

reduction (high values of SR) achieved if the component DW _{of the dictionary} _D _{is a wavelet}

dic-tionary instead of a wavelet basis. The wavelet basis arises using as translation parameter b = 1

and the wavelet dictionary using a translation parameter b = 1/4. A wavelet basis contains scales

corresponding to the values j = 3, . . . ,8 while the corresponding dictionary contains one less scale

i.e. j = 3, . . . ,7. The exact redundancy of the dictionary depends of the wavelet family but in all

the cases is close to two. The records are approximated by partitioning the signal into Q = 1300

segments ofNb = 500 data points each.

The second column in Table 1 gives the mean value SR, after quantization, with respect to the 48 records in the data set. The standard deviation is given in the 3rd column. The approximation with all the families is accomplished to obtain the same mean value of PRD (PRD = 53). For this test the quantization parameter is fixed. For all the records and all the dictionaries ∆ = 35. The 4th column gives the mean value CR and the fifth column the corresponding standard deviation. For further improvement in CR we have added an entropy coding step to represent the strings described

(10)

Table 1: Mean values of SR and CR with respect to the 48 records in the MIT-BIH Arrhythmia database. In all the cases the reconstructed signals produce PRD = 53.

DW _SR _std _CR _std _CRHf _std CW4B 10.13 2.42 11.72 3.14 16.70 3.78 CW4D 16.62 4.13 13.31 3.51 17.80 4.40 CW2B 11.34 2.91 12.08 3.48 17.78 4.34 CW2D 16.76 4.95 13.91 4.04 18.37 5.25 CDF97B 14.59 3.90 14.90 4.20 20.54 4.99 CDF97D 20.60 5.79 16.13 4.57 21.26 5.82 CDF53B 14.72 4.25 14.84 4.29 20.46 5.31 CDF53D 20.18 6.36 16.02 4.88 21.04 6.27 Db4B 12.81 3.28 13.49 3.63 18.81 4.46 Db4D 17.88 4.94 14.24 4.05 18.97 5.15 CoifB 11.50 2.99 12.66 3.47 17.41 4.29 CoifD 15.27 4.57 12.58 3.68 16.73 4.74 SymB 12.83 3.30 13.65 3.68 18.96 4.47 SymD 18.51 5.26 14.70 4.22 19.26 5.77 Short3B 12.78 3.18 15.09 4.31 19.68 4.59 Short3D 19.84 5.25 16.00 4.24 20.95 5.46 FWT[37] 11.22 1.75 23.17 6.57 26.34 6.99

in Sec. 3 at a fixed bit depth using Huffman coding. The implementation of the step is realized using the off-the-shelf MATLAB function Huff06 available on [36]. The corresponding outcomes, indicated

as CRHf, are given in the sixth column of Table 1. The 1st column lists the different wavelet families

which are considered to build the dictionary DW_{. The subscript B is used to indicate the wavelet}

basis and the subscript D to indicate the corresponding dictionary, i.e. CDF97B indicates the 9/7

Cohen-Daubechies-Feauveau biorthogonal wavelet basis and CDF97D the corresponding wavelet

dictionary. For further of comparison the last row of the table gives the results produced when a

Fast Wavelet Transform (FWT) is applied on the whole record, as proposed in [37].

As seen in Table 1 the best results in relation to dimensionality reduction (largest values of SR)

are for CDF97D,CDF53D and Short3D and all the wavelet bases perform poorly in comparison

to the dictionaries. The difference in CR is not so pronounced though. This is because, in spite of the fact that the approximation using a dictionary involves much less elementary components in the signal decomposition, the dispersion of indices corresponding to the atoms in the decomposition is larger if using a dictionary. Notice that, as far as compression is concerned using a FWT to transform

(11)

the whole signal is more effective than a piecewise transformation. This is true not only in terms of the achieved CR but also in regard to computational time (the approach advanced in [37] is very fast). Contrarily, as indicated by the values of SR in Table 1 the method in [37] is not as effective for dimensionality reduction as it is for compression. However, dimensionality reduction is relevant in its own right to methodologies for classification and recognition tasks [38, 39].

Table 2: Performance metrics for each of the 48 records in the MIT-BIH Arrhythmia database listed in the first column.

Rec PRD SR CRHf QS PRDN 100 0.51 27.19 28.27 55.75 12.64 101 0.51 25.39 25.89 50.82 9.45 102 0.51 24.43 24.74 48.27 13.13 103 0.52 21.93 22.06 42.73 7.86 104 0.53 19.00 19.52 37.21 10.27 105 0.53 17.37 18.15 34.22 6.43 106 0.53 18.63 18.87 35.97 7.07 107 0.55 12.64 12.66 23.01 3.18 108 0.52 21.13 22.74 43.52 8.51 109 0.52 19.32 19.42 37.05 5.16 111 0.51 23.22 24.32 47.45 9.90 112 0.53 22.72 24.47 46.18 10.20 113 0.52 20.21 19.80 38.18 6.26 114 0.50 32.21 33.13 66.17 14.52 115 0.53 20.18 20.52 38.98 6.73 116 0.58 12.33 12.90 22.10 3.72 117 0.52 27.31 28.82 55.27 9.33 118 0.59 11.06 12.51 21.22 5.90 119 0.56 16.15 16.50 29.78 4.44 121 0.51 33.27 33.61 66.10 7.29 122 0.56 16.42 17.62 31.80 6.50 123 0.53 21.94 22.93 43.11 7.94 124 0.53 23.28 23.09 43.92 4.94 200 0.53 15.56 16.69 31.33 7.05 201 0.49 34.72 35.05 71.66 12.52 Rec PRD SR CRHf QS PRDN 202 0.51 26.24 26.28 51.87 8.44 203 0.55 12.75 13.86 25.14 5.54 205 0.51 25.25 26.83 52.54 12.50 207 0.51 25.93 25.66 50.80 7.08 208 0.54 15.09 15.69 29.16 5.55 209 0.54 15.65 16.83 31.40 9.92 210 0.51 23.84 24.62 48.18 9.72 212 0.56 12.12 13.56 24.40 8.35 213 0.56 11.71 12.06 21.71 4.09 214 0.52 19.22 19.15 36.59 5.54 215 0.55 14.07 15.34 28.20 9.60 217 0.53 16.86 16.47 31.06 4.30 219 0.54 18.06 18.00 33.69 4.43 220 0.53 19.18 19.65 37.07 7.65 221 0.52 22.73 23.00 44.69 8.50 222 0.51 26.52 27.53 54.42 13.53 223 0.53 18.56 18.70 35.09 6.06 228 0.52 18.97 20.31 38.91 7.53 230 0.52 19.12 19.47 37.29 7.30 231 0.51 25.67 26.20 51.84 9.23 232 0.50 29.72 31.59 63.83 14.91 233 0.54 14.26 14.81 27.43 4.96 234 0.52 19.81 20.50 39.28 7.68 mean 0.53 20.60 21.26 40.76 7.99 std 0.02 5.79 5.82 12.47 2.92

For further information in Table 2 we provide the figures of the evaluation metrics for each of the 48 records in the dataset. The 2nd column of Table 2 shows the values of PRD for the records listed

in the 1st column. The 3rd column shows the values of SR after quantization. The CRHf

is given in the 4th column and the corresponding QS in the 5th column. The last column corresponds to the

values of PRDN. All the results are obtained using the dictionary CDF97D. The average time for

(12)

Fig. 3 depicts the approximation and raw data (indistinguishable in the scale of the graphs) of 2000 points in the records 107 and 116.

0 500 1000 1500 2000 0 400 800 1200 1600 0 500 1000 1500 2000 0 400 800 1200 1600

Figure 3: The waveforms in the graphs are the raw data and the approximations corresponding to 2000 points in the records 107 (left) and 116 (right). The bottom lines represent the absolute value of the difference between the data and the approximation.

4.3 Test II

In this test we compare compression results with previously reported benchmarks on the MIT-BIH Arrhythmia database. Since the PRD strongly depends on the baseline in the ECG record, when using the relation CR vs PRD for comparison one needs to be able to confirm that the values of PRD reported in previous work either, do not include any subtraction of baseline, or the subtracted value is also reported. From the information given in the papers we are comparing with (the relation between the values of PRD and PRDN) we can guarantee that the results correspond to the same

database [35] and without any subtraction of baseline. In the range [0.31 0.67] the comparison is

realized with the classical benchmark [40]. This method is based on peak detection to perform the Discrete Cosine Transform (DCT) between consecutive peaks and Huffman encoding for lossless

compression. In the range [0.80 1.71] the comparison is with the methods in [41] and [42]. Both

these methods are based on Adaptive Fourier Decomposition (AFD) and overperform those in [40] and [43] for values of PRD in the range being considered. In [42] the lossless compression is realized using Huffman coding while [41] applies symbol substitution. Comparisons are also produced with the results arising when a FWT is applied on the full record [37].

Table 3 compares CRs obtain with CDF97D and CDF53D for values of PRD in the range

[0.31 0.81], Table 4 for values in the range [0.91 1.06], and Table 5 in the range [0.14 1.71]. The last

two rows in all the tables give the values of the quantization parameter, ∆, and the approximation

(13)

publications we are comparing with.

Table 3: Comparison between the average compression performance of the proposed method and the methods in [40], [41], [42] and [37], for values of PRD in the range [0.31 0,81].

PRD 0.81 0.80 0.67 0.48 0.31 CR [40] 11.30 9.28 6.22 CR [41] 18.00 CR [42] 16.85 CR [37] 33.32 33.05 27.79 19.84 10.59 CRHf [37] 36.54 36.26 31.01 23.09 15.00 CR CDF97D 24.03 23.81 20.52 14.79 9.37 CRHfCDF97D 31.64 31.07 26.71 19.59 12.50 CR CDF53D 24.76 24.50 20.67 14.67 8.89 CRHfCDF53D 32.59 31.93 26.01 19.25 11.93 prd₀ 0.750 0.740 0.645 0.435 0.275 ∆ 60 60 40 30 16

Table 4: Comparison between the average compression performance of the proposed method and the

methods in [41], [42], [43] and [37], for values of PRD in the range [0.91 1.06].

PRD 1.06 1.05 1.03 0.94 0.91 CR [41] 25.667 22.27 CR [42] 22.80 20.38 CR [43] 18.59 CR [37] 41.31 41.24 41.00 38.52 37.68 CRHf [37] 44.42 44.23 44.04 41.93 41.08 CR CDF97D 30.36 30.12 29.44 27.29 26.27 CRHfCDF97D 40.25 39.96 39.11 35.94 34.42 CR CDF53D 31.93 31.66 30.84 28.46 27.28 CRHfCDF53D 42.73 42.39 41.40 37.81 35.70 ∆ 100 100 100 80 80 prd0 0.940 0.930 0.900 0.850 0.805

(14)

Table 5: Comparison between the average compression performance of the proposed method and the

methods in [41], [42], and [37] for values of PRD in the range [1.14 1.71].

PRD 1.71 1.47 1.29 1.18 1.14 CR [41] 38.46 33.85 28.21 CR [42] 42.27 3.53 30.21 25.99 CR [37] 62.48 56.78 49.60 47.04 45.75 CRHf [37] 63.92 58.59 52.19 49.59 48.47 CR CDF97D 52.01 43.83 37.63 33.88 32.69 CRHfCDF97D 66.99 56.44 48.77 44.45 42.98 CR CDF53D 57.19 47.43 40.66 36.27 34.62 CRHfCDF53D 73.27 61.00 52.56 47.19 45.51 ∆ 150 120 110 110 110 prd0 1.660 1.430 1.220 1.070 1.020

4.4 Discussion

From the results shown in Tables 4 - 5 we can assert that the dictionary-based approach presented in this paper, which is powerful for dimensionality reduction at low level distortion, is also relevant to compression. In that sense the adopted strategy produces results which significantly improve upon the state of the art for the full MIT-BIH Arrhythmia data set [40–43]. In regard to CR the results are close to those recently reported in [37]. In that publication the compression results are obtained with an equivalent strategy as that described in Sec. 3, but applying a FWT on the full signal instead of using a piecewise model as proposed here. With respect to processing time the approach in [37] is far more efficient though. Indeed, with the same equipment the construction of Table 2 in this paper takes 30 s per record. The equivalent table with the approach of [37] takes 0.14 s per record. However, the central aim of the present work is to accomplish dimensionality reduction. In that sense the dictionary approach is much more effective. As shown in the second column of Table 1, to

achieve PRD = 53 theCDF97D method uses 84% less components than theFWT one. Moreover,

the dictionary approach yields a piecewise model which can be useful for analysis. It is interesting to note, for instance, that abrupt variations of the local sparsity ratio (c.f. (23)) gives information about the presence of nonstationary noise or significant distortions in patters. The left graph in

Fig. 4 plots the values 1/sr(q), q= 1, . . . ,1300 for record 117. The middle top graph shows the piece

(15)

0 5 10 15 20 25 30 0.02 0.04 0.06 0.08 0.1 0.12 0.14 t (min) 1/sr 12 12.02 12.04 12.06 12.08 12.1 12.12 400 600 800 1000 1200 t (min) f 2.74 2.76 2.78 2.8 2.82 2.84 400 600 800 1000 1200 t (min) f

Figure 4: The left graph shows the inverse local sparsity values for each segment in the signal partition

of record 117. The middle graph is the piece of signal where the peak of 1/sr takes place. The right

graph is a piece of signal of the same length but at different time interval.

length, outside that time interval.

5 Conclusions

A method for dimensionality reduction of ECG signals has been proposed. The approach was de-signed within the framework of sparse representation. A number of wavelet based dictionaries have been introduced to undertake the signal model via the OOMP greedy strategy. The resulting repre-sentation was shown to require significantly less elementary component than those needed when using a wavelet basis. The method was proven to be useful for ECG compression at low level distortion. The compression results largely surpass benchmarks within the state of the art for the MIT-BIH Arrhythmia database.

The MATLAB software for constructing the dictionaries and reproducing the results in the paper has been made available on a dedicated website. Although the numerical construction of the signal model is computationally intensive it can be effectively realized, even in the MATLAB environment, where it typically takes 30 s to compress a 30 min record and 1 s to recover the signal from the compressed file. Moreover, since the approximation is carried out independently in every segment of the signal partition, there is room for straightforward parallelization using multiprocessors. The sensitivity of local sparsity to nonstationary noise or significant distortion of patterns leads to con-clude that the model might be useful to support the development of techniques for analysis of ECG records.

Acknowledgments

We are indebted to Karl Skretting for making available the MATLAB functions Huff06 which has

(16)

References

[1] S. Banerjee, M. Mitra, “Application of Cross Wavelet Transform for ECG pattern analysis and

classification”, IEEE Transactions on Instrumentation and Measurement, 63 326–333 (2014).

[2] R.G. Afkhami, G. Azarnia, M.A. Tinati, “Cardiac arrhythmia classification using statistical and

mixture modeling features of ECG signals, Pattern Recognition Letters, 70, 45– 51 (2016).

[3] S. Shadmand, B. Mashoufi, “A new personalized ECG signal classification algorithm using

block-based neural network and particle swarm optimization, Biomedical Signal Processing and

Control, 25, 12–23 (2016).

[4] E.J.D. Luz, W.R. Schwartz, G. Camara-Chavez, D. Menotti, ECG-based heartbeat classification

for arrhythmia detection: a survey, Computer Methods and Programs in Biomedicine,127 144–

164 (2016).

[5] D. P. Tob´on V., T. H. Falk, M. Maier, “MS-QI: A Modulation Spectrum-Based ECG Quality

Index for Telehealth Applications”, IEEE Transactions on Biomedical Engineering, 63, 1613–

1622 (2016).

[6] S. Gutta, Q. Cheng, “Joint feature extraction and classifier design for ECG-based biometric

recognition”, Journal of Biomedical and Health Informatics,20, 460–468 (2016).

[7] H. Li, D. Yuan, Y. Wang, D. Cui, and L. Cao, “Arrhythmia Classification Based

on Multi-Domain Feature Extraction for an ECG Recognition System”, Sensors, 16,

doi:10.3390/s16101744 (2015).

[8] M. Abdelazez, P. X. Quesnel, A. D. C. Chan, H. Yang, “Signal Quality Analysis of

Ambula-tory Electrocardiograms to Gate False Myocardial Ischemia Alarms”, IEEE Transactions on

Biomedical Engineering, 64, 1318 – 1325 (2018).

[9] S. K. Berkaya, A. K. Uysal, E. S. Gunal, S. Ergin., S. Gunal, M. B. Gulmezoglu, “A survey on

ECG analysis”, Biomedical Signal Processing and Control, 43, 216 – 235 (2018).

[10] H. Mamaghanian, N. Khaled, D. Atienza and P. Vandergheynst, “Compressed Sensing for

Real-Time Energy-Efficient ECG Compression on Wireless Body Sensor Nodes”, IEEE Transactions

(17)

[11] Z. Zhang, T-P Jung, S. Makeig and B. D. Rao, “Compressed Sensing for Energy-Efficient

Wire-less Telemonitoring of Noninvasive Fetal ECG via Block Sparse Bayesian Learning”, IEEE

Transactions on Biomedical Engineering, 60, 300–309 (2013).

[12] L. F. Polan´ıa, R. E. Carrillo, Manuel Blanco-Velasco and K. E. Barner, “Exploiting Prior

Knowl-edge in Compressed Sensing Wireless ECG Systems”, IEEE Journal of Biomedical and Health

Informatics, 19,508–519 (2015).

[13] G. Da Poian, R. Bernardini, R. Rinaldo, ‘RSeparation and Analysis of Fetal-ECG Signals From

Compressed Sensed Abdominal ECG Recordings”, IEEE Transactions on Biomedical

Engi-neering,63, 1269 – 1279 (2016).

[14] L. F. Polan´ıa and R. I. Plaza, “Compressed Sensing ECG using Restricted Boltzmann Machines”,

Biomedical Signal Processing and Control, 45, 237–45 (2018).

[15] R. Baraniuk, “Compressive sensing”,IEEE Signal Processing Magazine, 24, 118–121, (2007).

[16] E. Cand`es and M. Wakin, “An introduction to compressive sampling”,IEEE Signal Processing

Magazine, 25, 21 – 30 (2008).

[17] M. S Manikandan and S. Dandapat, “Wavelet-based electrocardiogram signal compression

meth-ods and their performances: A prospective review”, Biomedical Signal Processing and Control,

14,73–107 (2014).

[18] http://www.nonlinear-approx.info/examples/node011.html (Last access May 2018).

[19] L. Rebollo-Neira and D. Lowe, “Optimized orthogonal matching pursuit approach”,IEEE Signal

Process. Letters, 9, 137–140 (2002).

[20] L. Rebollo-Neira, “Cooperative greedy pursuit strategies for sparse signal representation by

partitioning”, Signal Processing,125, 365–375 (2016).

[21] M. Andrle and L. Rebollo-Neira, “Cardinal B-spline dictionaries on a compact interval”, Appl.

Comput. Harmon. Anal. , 18, 336-346 (2005).

[22] M. Andrle and L. Rebollo-Neira, “From cardinal spline wavelet bases to highly coherent

dictio-naries”, Journal of Physics A 41 (2008) 172001.

[23] L. Rebollo-Neira and Z. Xu, “Adaptive non-uniform B-spline dictionaries on a compact interval”,

(18)

[24] L. L. Schumaker, Spline Functions: Basic Theory, Wiley, New-York, 1981.

[25] C. Chui and J. Wang, “On compactly supported spline wavelets and a duality principle”,Trans.

Amer. Math. Soc., 330, 903–915 (1992).

[26] A. Cohen, I. Daubechies, and J.C. Feauveau, “Biorthogonal bases of compactly supported

wavelets”, Comm. Pure and Appl. Math., 45, 485–560 (1992).

[27] I. Daubechies, “Orthonormal bases of compactly supported wavelets”, Commun. Pure Appl.

Math.,41, 909–996 (1988).

[28] I. Daubechies, “Orthonormal bases of compactly supported wavelets II, variations on a theme”,

SIAM J. Math. Anal.,24, 499–519 (1993).

[29] D. ˇCern´a, V. Finˇek, and K. Najzar, “On the exact values of coefficients of coiflets”, Cent.

European J. Math.6, 159–169 (2008).

[30] D. Chen, “Spline wavelets of small support”,SIAM J. Math. Anal., 26, 500–517 (1995).

[31] B. Han and Z. Shen, “Wavelets with short support”,SIAM J. Math. Anal.,38, 530–556 (2006).

[32] L. Rebollo-Neira and I. Sanches, , “Simple scheme for compressing sparse representation of

melodic music”, Electronics Letters (2017) DOI:10.1049/el.2017.3908

[33] L. Rebollo-Neira, “A competitive scheme for storing sparse representation of X-Ray medical

images” (2018) PLOS ONE. https://doi.org/10.1371/journal.pone.0201455.

[34] L. A. Wasser, “Hierarchical Data Formats - What is HDF5?”,

https://www.neonscience.org/about-hdf5 (2015)

[35] https://physionet.org/physiobank/database/mitdb/ (Last access May 2018).

[36] K. Skretting,http://www.ux.uis.no/∼karlsk/proj99 (Last access May 2018).

[37] L. Rebollo-Neira, “Effective high compression of ECG signals at low level distortion”, Scientific

Reports 9, Article number: 4564 (2019).

[38] J. Wang, M. She, S. Nahavandi, A. Kouzani, “Human Identification From ECG Signals Via

(19)

[39] S. Raj, K. C. Ray, “Sparse representation of ECG signals for automated recognition of cardiac

arrhythmias”, Expert Systems with Applications, 105, 49–64 (2018).

[40] S.J. Lee, J. Kim and M. Lee,“A Real-Time ECG Data Compression and Transmission Algorithm

for an e-Health Device”,IEEE Transactions on Biomedical Engineering,58, 2448–2455 (2011).

[41] J.L. Ma, T.T. Zhang and M. C. Dong, “A Novel ECG Data Compression Method Using Adaptive

Fourier Decomposition With Security Guarantee in e-Health Applications”, IEEE Journal of

Biomedical and Health Informatics,19, 986–994 (2015).

[42] C. Tan, L. Zhang and H. Wu,“A Novel Blaschke Unwinding Adaptive Fourier

Decomposi-tion based Signal Compression Algorithm with ApplicaDecomposi-tion on ECG Signals”, IEEE Journal

of Biomedical and Health Informatics 10.1109/JBHI.2018.2817192, 22 March 2018.

[43] A. Pandey, B. Singh, S Sain, and N. Sood, “A joint application of optimal threshold based discrete cosine transform and ASCII encoding for ECG data compression with its inherent