Manifold Learning of COPD distributions - Quantitative lung CT analysis for the study and diagn

4.3 Data

5.2.3 Manifold Learning of COPD distributions

The goal of manifold learning lies in learning a low-dimensional representation to model variability in the data. In this chapter, I have presented the concept of local disease and local deformation distributions. It is possible that one aspect in the heterogeneity of COPD is due to the different mechanisms of emphysema and fSAD and its effect on local lung biomechanics. The presented distributions were developed to capture these processes. I used manifold learning to model variations of the distributions in a large population of

J

0.2 0.4 0.6 0.8 1

µ(J)

f

(µ

(J

))

Figure 5.3: Local deformation distribution quantification. © 2017 Springer Nature. Reprinted, with permission from F. Bragman et al., Manifold learning of COPD, Medical Image Com- puting and Computer Assisted Interventions, 2017.

(a) Z 0 0.2 0.4 0.6 0.8 1 f (v ) v

(b) f(v) for PRMemph, PRMf SAD

and PRMnormal (c) Z 0 0.2 0.4 0.6 0.8 1 f (v ) v

(d) f(v) for PRMemph, PRMf SAD

and PRMnormal

Figure 5.4: Local disease distributions for patient with equal levels of emphysema in the lung (µ(PRMemph=30%). Top row patient: FEV1%predicted=28.3 and bottom row patient:

COPD patients. This led to separate embeddings for emphysema, fSAD and volume change. Fusion of these embeddings was then performed to create a single model of these processes to analyse COPD progression.

5.2.3.1 Distribution distances

Inter-patient differences can be computed by considering the distance (_{L) between two fea-} ture distributions ( f1(v) and f2(v)). As the distance metric employed is an important deter-

minant in comparing distributions, I considered the following: 1) the_L1-norm, 2) theL2-

norm, 3) the Earth-Movers Distance (LEMD) and 4) the Quadratic-Chi histogram dissimilar-

ity (_LQC). The first two are bin-to-bin distance measures and the latter are cross-bin metrics.

The local distributions ( f1(v) and f2(v)) are quantised into histograms (H(v), K(v)∈ RNb)

with Nbbins. These histograms are normalised such that they sum to one. They have equal

mass and are probability mass functions on equal intervals ([0, 1]). The probability mass at a bin i for the histograms H and K are defined as hiand ki.

LpDistances TheL1andL2-norm are defined as follows:

Lp(H, K) = Nb

∑

i |hi− ki|p !1/p (5.2)

with p= 1 and p = 2 corresponding respectively to the_L1andL2-norm. They are

considered bin-by-bin measures as the distances are computed by matching bins at the same index. A drawback of_Lp-style distances is their dependence on the bin number.

If the number of bins is low, the resulting histogram will not be discriminative. If the number of bins is too high, quantisation of the data into bins with small widths can yield undesirable discontinuities in the histogram leading to a non-robust distance.

Earth-Movers Distance The Earth-Movers Distance (LEMD) [175] is a metric based on

the principle of optimal transport. It measures the minimum amount of work needed to transform one distribution into another. The EMD operates on signatures of histograms sj ={mj, wj}, which represents a set of feature clusters. Each cluster is

represented by its mean mj and by the fraction of wj of voxels that belong to that

cluster. The Earth-Movers distance is defined as

LEMD(H, K) = min { fi, j} (∑i, j fi, jdi, j) ∑i, j fi, j (5.3)

where di, j is the ground distance and fi, j is the flow between histogram bins i and

j. The aim ofLEMD is to find the minimal flow from H to K. Importantly, if one

assumes that the histograms are one-dimensional, have equal mass and have equal number of bins, the EMD has a closed-form solution equivalent to theL1-norm on

the cumulative distribution function of P(H) and P(K) [110] such that

LEMD(H, K) =L1(P(H), P(K)) (5.4)

Quadratic-Chi Distance The Quadratic-Chi distance_LQCis another example of a cross-

bin distance measure [160]. It combines the Quadratic-Form distance (_LQF) and the

Chi-squared distance (Lχ2) such that

LQF(H, K) = q (H− K)T_A(H_{− K)} _(5.5a) Lχ2(H, K) = 1 2

∑

_i (hi− ki)2 (hi+ ki) (5.5b) LQC(H, K) = v u u t

_∑

i, j (hi− ki) (∑c(hc+ kc)Ac,i)m ! (hj− kj) (∑c(hc+ kc)Ac, j)m ! Ai, j (5.5c)

where A is a bin-to-bin similarity matrix and m is a normalisation factor. The au- thors [160] recommend m= 0.9 for best results. In the context of the Quadratic-Chi distance, the similarity matrix is defined as:

Ai, j= 1−

D_{i, j} max

i, j (Di, j)

(5.6)

where D is a distance matrix between histogram bins.

The Quadratic-Chi distance exploits advantages of bothLQFandL_χ2. The Quadratic-

form distance takes cross-bin relationships through the similarity matrix A whilst the χ2 distance reduces the effects of bin-to-bin distances caused by bins with large values.

5.2.3.2 Manifold Learning and fusion

Manifold learning is used to model emphysema, fSAD and Jacobian distributions. The aim is to capture variations in the distributions in a population of COPD patients. The

progression of emphysema and fSAD occur synchronously yet the pathological processes are distinct. Their combination is likely to affect the lungs ability to deform. The aim is to capture both processes and their effect on lung deformation to analyse COPD. The manifold fusion framework presented by Aljabar et al. [4] is employed to create a single representation of these processes (Figure5.5).

For P subjects, the classified volumesZ1,··· ,ZPand their respective Jacobian deter-

minant maps J= J1,··· ,JPare obtained. The local disease and deformation distributions are

created and quantised using Nbbins into their respective histograms hp,v(emph), hp,v( f SAD)and

h_p,J. Pairwise measures in the population are obtained using any of the distance functions L discussed above in Section5.2.3.1. This yields the pairwise matricesMemph_,_Mf SAD_and

MJ _{for emphysema, fSAD and the Jacobian respectively.}

The pairwise matrices can be visualised as connected graphs where each node represents a patient and the edge length between nodes is the distance _{L. Various linear and} non-linear dimensionality reduction methods can be applied to the matricesM(·)_{. Since}

the edge-weights in_M(·)are distances, the Isomap algorithm [209] is a natural choice as it interprets edge-weights inM(·)_{as distances.}

Th goal of Isomap is to find the mapping function f : _M(·)_{→ R}d, where_M(·)_{∈ R}D such thatM(·)_{is projected to a low-dimensional space}_Rd_{. The main assumption of Isomap}

is that f is an isometric chart, which means the distances are preserved as they are mapped to the new space. If xiand xjare points on the manifold andD(xi, xj) is the geodesic distance

between them, the following relationship is obeyed:

f (xi)− f (xj)

= D(xi, xj) (5.7)

A further assumption of the Isomap algorithm is that the geodesic distance between nearby points is linear and can be approximated by the Euclidean distance. For points which cannot be approximated by a linear function, the Euclidean distance will not be a valid approximation. A different approach is therefore needed to compute the distance between these points. The Isomap algorithm first constructs a sparse representation of_M(·) by searching for the K-nearest neighbours (k-NN) weighted by Euclidean distances between each node. Local structure can thus be modelled by an appropriate choice of the parameter k. To estimate the manifold structure, a full pairwise geodesic matrixD is es- timated by identifying the shortest path in the k-NN graph of _M(·). This is performed

using Djikstra’s shortest-path algorithm [51]. Given the matrix_D(·), Isomap then seeks the lower-dimensional embedding coordinates y(p·), p = 1,··· ,P that satisfies Equation5.7 by

minimising the cost function:

min

_∑

p, j D(·)p, j− y(·)p − y(·)_j 2 (5.8)

using Multi-Dimensional Scaling. The Isomap algorithm is applied separately to Memph_, _Mf SAD _and_MJ_{, which yields the respective embeddings y}e_{, y}f _{and y}J _{with di-}

mension de, df and dJ that need to be selected (Figure5.5).

The coordinates y(·)can boe fused together in any combination to create various models and investigate different combinations of features implicated in COPD. For simplicity, I consider the fusion of all embeddings. The coordinates are uniformly scaled with scaling factors se, sf and sJsuch that the first component of each embedding y(·)_p,1has a unit variance. These are concatenated to yield the new set Yp= (seyep, sfy

p, sJyJp) with dimension de+ df+

dJ. A new distance matrixDc_{is obtained by calculating the Euclidean distance between all}

pairs of coordinates in Y . A second Isomap step is applied to yield the combined coordinate embedding ycwith dimension dc(Figure5.5). The new embedding coordinates (yc) can then be used in a classification framework to find clusters of patients with similar presentations or in regression to test associations with various clinical metrics and outcomes.

5.3 Experiments and results

In document Quantitative lung CT analysis for the study and diagnosis of Chronic Obstructive Pulmonary Disease (Page 108-113)