In a graph-based segmentation, a graph ๐บ = (๐, ๐ธ, ๐) represents the relationship of nodes (vertices) ๐ connected by edges ๐ธ with weights ๐ associated on each edge. In the MNcut, the graph ๐บ has the pixels as graph nodes ๐, and pixels within distance less than ๐ are connected by a graph edge in ๐ธ. The relationship of pixel connection is computed through a mixed weight value matrix ๐พ๐ด๐๐๐๐ (equation (3.21)), which measures the likelihood of pixels ๐ and ๐ belonging to the same coherent region. The ๐พ๐ด๐๐๐๐ is a combined two grouping cues, pairwise pixel affinities of intensity ๐พ๐ฐ (equation (3.19)) and pairwise pixel affinities of intervening contour ๐พ๐ช (equation (3.20)).
๐๐ผ(๐, ๐) = โก ๐โ โฅ๐๐โ๐๐โฅ2 ๐๐ฅ โกโโก โฅ๐ผ๐โ๐ผ๐โฅ2 ๐๐ผ (3.19) ๐๐ถ(๐, ๐) = โก ๐โ๐๐๐ฅ๐ฅโ๐๐๐๐(๐,๐)โฅ๐ธ๐๐๐(๐ฅ)โฅ 2 ๐๐ถ (3.20) ๐๐๐๐ฅ๐๐(๐, ๐) = โ๐๐ผ(๐, ๐) ร ๐๐ถ(๐, ๐) + โก๐ผ๐๐ถ(๐, ๐) (3.21)
where ๐๐ and ๐ผ๐ is the location and intensity of pixel ๐; ๐๐ฅ, ๐๐ผ, and ๐๐ถ is a variance of pixel location, intensity, and edge; ๐๐๐๐(๐, ๐) is a straight line joining pixels ๐ and ๐; ๐ธ๐๐๐(๐ฅ) is the edge strength at location ๐ฅ; ๐ผ is a parameter, contributing to the magnitude of ๐๐๐๐ฅ๐๐(๐, ๐).
The bipartition of a graph ๐ = ๐ดโก โช ๐ต is based on the Normalised Cuts algorithm, partitioning the graph by maximising the ratio of affinities with a group to that across groups, as defined in equation (3.22) and (3.23).
๐๐๐ข๐ก(๐ด, ๐ต) = ๐ถ๐ข๐ก(๐ด, ๐ต)
๐๐๐๐ข๐๐(๐ด) ร ๐๐๐๐ข๐๐(๐ต) (3.22)
where ๐ถ๐ข๐ก(๐ด, ๐ต) = โก โ ๐
๐โ๐ด,๐โ๐ต
(๐, ๐) (3.23)
The binary group vector is defined as ๐๐ด โ {0,1}๐, with ๐๐ด,๐ = 1 if pixel ๐ belongs to segment ๐ด, otherwise 0 if the pixel belongs to segment ๐ต. The segmentation follows the criteria: ๐๐๐ฅ๐๐๐๐ ๐โก1 2โ ๐ฟ๐๐ป๐พ๐ฟ ๐ ๐ฟ๐๐ป๐ซ๐ฟ ๐ 2 ๐=1 (3.24)
where ๐ซ is a diagonal matrix, ๐ท(๐, ๐) = โก โ ๐(๐, ๐)๐ , ๐พ is the mixed weight values of affinity graph.
In multiscale graph segmentation, the graphs are decomposed into different scales and are connected by three different types of matrices: the multiscale affinity matrix, the cross-scale interpolation matrix, the cross-scale constraint matrix. To connect between scales, pixels are interpolated by using the cross-scale interpolation matrices ๐ช๐,๐+๐ defined in equation (3.25). ๐ถ๐ ,๐ +1(๐, ๐) = โก { 1 |๐๐|โกโก๐๐โก๐ โ โก ๐๐ 0โก๐๐กโ๐๐๐ค๐๐ ๐ (3.25)
where ๐๐ is sampling neighbourhood of ๐.
To partition graph into different scales, the matrix ๐ช๐ ,๐ +1 together with the cross-scale constraint matrices ๐ช are employed to control the relationship between nodes in layer ๐ฐ๐ and ๐ฐ๐+๐. The matrix ๐ช has values in a diagonal direction as shown in equation (3.26).
๐ช = โก (๐ช๐,๐ 0 โก โฑ โก โ๐ฐ๐ ๐ช๐บโ๐,๐บโก โฑ โก 0 โ๐ฐ๐บโกโกโก) (3.26)
The cross-scale constraint equation satisfies the condition in equation (3.27).
๐ช๐ฟ = โก0 (3.27)
where ๐ฟ = โก ( ๐1
โฎ
๐๐ )โก and ๐๐ โก โ {0,1}โก
๐๐ โโกร๐พ is the partitioning matrix at scale ๐ , ๐
๐ โ = โก โ ๐๐ ๐ ,
๐๐ +1(๐) = โก|๐1
๐|โ๐โ๐๐๐๐ (๐). The neighbourhood ๐๐ specifies the projection of ๐ โ ๐ฐ๐+๐โกon the finer layer ๐ฐ๐.
The multiscale affinity matrix covers the full range of affinity matrix in each scale and is defined as the addition of affinity matrix from each scale as shown in equation (3.28).
This ๐พ๐ญ๐๐๐ is memory inefficient and become unmanageable when an image size is increasing. As a result a compressed multiscale affinity matrix ๐พ๐๐๐๐๐๐๐๐๐๐๐ is reconstructed by using cross-scale interpolation matrix ๐ช๐,๐+๐ and recomputing ๐พ๐ by either sub-sample image or sampling values from the affinity matrix ๐พ๐โ๐. The reconstruction affinity matrix is computed by using equation (3.29).
๐พ๐๐๐๐๐๐๐๐๐๐๐= ๐พ๐+ ๐ช๐,๐๐ป ๐พ ๐ ๐๐ช
๐,๐+ โฏ + ๐ช๐,๐+๐๐ป ๐พ๐๐๐ช๐,๐+๐ (3.29) For the multiple partition, a generalized K-way Ncut function can be similarly defined using ๐ฟ = [๐1, โฆ ๐๐พ]. The optimal multiscale normalised cut partitioning can be solved by computing the ๐พ eigenvectors, corresponding to the ๐พ largest eigenvalues, with the maximising criteria of equation (3.30). The ๐พ largest eigenvalues is determined by the number of segments altered by a user. Selection of ๐พ values depends on the complexity of tumour structure. The higher the number of ๐พ values, the finer the composition is acquired. It is obvious that the higher the ๐พ values, the longer the computation time.
๐๐๐ฅ๐๐๐๐ ๐โก1 ๐พโ ๐ฟ๐๐ป๐พ๐ฟ๐ ๐ฟ๐๐ป๐ซ๐ฟ ๐ ๐พ ๐=1 (3.30)
Graphically, one-dimensional view of multiple-scale graph decomposition with ๐ = 1 is decomposed into three scales, ๐ โ {1, 2, 3}, as shown in Figure 3.4. At each scale, pixels are connected by the relationship defined by the affinity matrices: ๐พ๐, ๐พ๐ and ๐พ๐ for scales 1, 2 and 3 respectively. The connections between scales are defined by the cross- scale interpolation matrices: ๐ช๐,๐ and ๐ช๐,๐. Pixels at each scale are sampled at (2๐ + 1)๐ โ1. The value ๐ determines the relationship of two pixels in a graph, i.e. the two pixels are connected if they are within distance ๐ . The length of ๐ offers distinct segmentation results. Generally, a larger ๐ produces better segmentation because a long range graph
connections facilitate propagation of local grouping cues across larger image regions. However, the larger ๐ requires longer segmentation time.
Figure 3.4: Three scales of the MNcut graph decomposition with ๐ = 1. Relationship between pixels at each scale and cross scales are defined by affinity matrices ๐๐ and cross-scale interpolation matrices ๐ถ๐ ,๐ +1.