• No results found

In a graph-based segmentation, a graph ๐บ = (๐‘‰, ๐ธ, ๐‘Š) represents the relationship of nodes (vertices) ๐‘‰ connected by edges ๐ธ with weights ๐‘Š associated on each edge. In the MNcut, the graph ๐บ has the pixels as graph nodes ๐‘‰, and pixels within distance less than ๐‘… are connected by a graph edge in ๐ธ. The relationship of pixel connection is computed through a mixed weight value matrix ๐‘พ๐‘ด๐’Š๐’™๐’†๐’… (equation (3.21)), which measures the likelihood of pixels ๐‘Ž and ๐‘ belonging to the same coherent region. The ๐‘พ๐‘ด๐’Š๐’™๐’†๐’… is a combined two grouping cues, pairwise pixel affinities of intensity ๐‘พ๐‘ฐ (equation (3.19)) and pairwise pixel affinities of intervening contour ๐‘พ๐‘ช (equation (3.20)).

๐‘Š๐ผ(๐‘Ž, ๐‘) = โก ๐‘’โˆ’ โˆฅ๐‘‹๐‘Žโˆ’๐‘‹๐‘โˆฅ2 ๐œŽ๐‘ฅ โกโˆ’โก โˆฅ๐ผ๐‘Žโˆ’๐ผ๐‘โˆฅ2 ๐œŽ๐ผ (3.19) ๐‘Š๐ถ(๐‘Ž, ๐‘) = โก ๐‘’โˆ’๐‘š๐‘Ž๐‘ฅ๐‘ฅโˆˆ๐‘™๐‘–๐‘›๐‘’(๐‘Ž,๐‘)โˆฅ๐ธ๐‘‘๐‘”๐‘’(๐‘ฅ)โˆฅ 2 ๐œŽ๐ถ (3.20) ๐‘Š๐‘€๐‘–๐‘ฅ๐‘’๐‘‘(๐‘Ž, ๐‘) = โˆš๐‘Š๐ผ(๐‘Ž, ๐‘) ร— ๐‘Š๐ถ(๐‘Ž, ๐‘) + โก๐›ผ๐‘Š๐ถ(๐‘Ž, ๐‘) (3.21)

where ๐‘‹๐‘Ž and ๐ผ๐‘Ž is the location and intensity of pixel ๐‘Ž; ๐œŽ๐‘ฅ, ๐œŽ๐ผ, and ๐œŽ๐ถ is a variance of pixel location, intensity, and edge; ๐‘™๐‘–๐‘›๐‘’(๐‘Ž, ๐‘) is a straight line joining pixels ๐‘Ž and ๐‘; ๐ธ๐‘‘๐‘”๐‘’(๐‘ฅ) is the edge strength at location ๐‘ฅ; ๐›ผ is a parameter, contributing to the magnitude of ๐‘Š๐‘€๐‘–๐‘ฅ๐‘’๐‘‘(๐‘Ž, ๐‘).

The bipartition of a graph ๐‘‰ = ๐ดโก โˆช ๐ต is based on the Normalised Cuts algorithm, partitioning the graph by maximising the ratio of affinities with a group to that across groups, as defined in equation (3.22) and (3.23).

๐‘๐‘๐‘ข๐‘ก(๐ด, ๐ต) = ๐ถ๐‘ข๐‘ก(๐ด, ๐ต)

๐‘‰๐‘œ๐‘™๐‘ข๐‘š๐‘’(๐ด) ร— ๐‘‰๐‘œ๐‘™๐‘ข๐‘š๐‘’(๐ต) (3.22)

where ๐ถ๐‘ข๐‘ก(๐ด, ๐ต) = โก โˆ‘ ๐‘Š

๐‘Žโˆˆ๐ด,๐‘โˆˆ๐ต

(๐‘Ž, ๐‘) (3.23)

The binary group vector is defined as ๐‘‹๐ด โˆˆ {0,1}๐‘, with ๐‘‹๐ด,๐‘Ž = 1 if pixel ๐‘Ž belongs to segment ๐ด, otherwise 0 if the pixel belongs to segment ๐ต. The segmentation follows the criteria: ๐‘š๐‘Ž๐‘ฅ๐‘–๐‘š๐‘–๐‘ ๐‘’โก1 2โˆ‘ ๐‘ฟ๐’๐‘ป๐‘พ๐‘ฟ ๐’ ๐‘ฟ๐’๐‘ป๐‘ซ๐‘ฟ ๐’ 2 ๐‘™=1 (3.24)

where ๐‘ซ is a diagonal matrix, ๐ท(๐‘Ž, ๐‘Ž) = โก โˆ‘ ๐‘Š(๐‘Ž, ๐‘)๐‘— , ๐‘พ is the mixed weight values of affinity graph.

In multiscale graph segmentation, the graphs are decomposed into different scales and are connected by three different types of matrices: the multiscale affinity matrix, the cross-scale interpolation matrix, the cross-scale constraint matrix. To connect between scales, pixels are interpolated by using the cross-scale interpolation matrices ๐‘ช๐’”,๐’”+๐Ÿ defined in equation (3.25). ๐ถ๐‘ ,๐‘ +1(๐‘Ž, ๐‘) = โก { 1 |๐‘๐‘Ž|โกโก๐‘–๐‘“โก๐‘ โˆˆ โก ๐‘๐‘Ž 0โก๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’ (3.25)

where ๐‘๐‘Ž is sampling neighbourhood of ๐‘Ž.

To partition graph into different scales, the matrix ๐‘ช๐‘ ,๐‘ +1 together with the cross-scale constraint matrices ๐‘ช are employed to control the relationship between nodes in layer ๐‘ฐ๐’” and ๐‘ฐ๐’”+๐Ÿ. The matrix ๐‘ช has values in a diagonal direction as shown in equation (3.26).

๐‘ช = โก (๐‘ช๐Ÿ,๐Ÿ 0 โก โ‹ฑ โก โˆ’๐‘ฐ๐Ÿ ๐‘ช๐‘บโˆ’๐Ÿ,๐‘บโก โ‹ฑ โก 0 โˆ’๐‘ฐ๐‘บโกโกโก) (3.26)

The cross-scale constraint equation satisfies the condition in equation (3.27).

๐‘ช๐‘ฟ = โก0 (3.27)

where ๐‘ฟ = โก ( ๐‘‹1

โ‹ฎ

๐‘‹๐‘ )โก and ๐‘‹๐‘ โก โˆˆ {0,1}โก

๐‘๐‘ โˆ—โกร—๐พ is the partitioning matrix at scale ๐‘ , ๐‘

๐‘ โˆ— = โก โˆ‘ ๐‘๐‘  ๐‘ ,

๐‘‹๐‘ +1(๐‘Ž) = โก|๐‘1

๐‘Ž|โˆ‘๐‘โˆˆ๐‘๐‘Ž๐‘‹๐‘ (๐‘). The neighbourhood ๐‘๐‘– specifies the projection of ๐‘Ž โˆˆ ๐‘ฐ๐’”+๐Ÿโกon the finer layer ๐‘ฐ๐’”.

The multiscale affinity matrix covers the full range of affinity matrix in each scale and is defined as the addition of affinity matrix from each scale as shown in equation (3.28).

This ๐‘พ๐‘ญ๐’–๐’๐’ is memory inefficient and become unmanageable when an image size is increasing. As a result a compressed multiscale affinity matrix ๐‘พ๐’“๐’†๐’„๐’๐’๐’”๐’•๐’“๐’–๐’„๐’• is reconstructed by using cross-scale interpolation matrix ๐‘ช๐’”,๐’”+๐Ÿ and recomputing ๐‘พ๐’” by either sub-sample image or sampling values from the affinity matrix ๐‘พ๐’”โˆ’๐Ÿ. The reconstruction affinity matrix is computed by using equation (3.29).

๐‘พ๐’“๐’†๐’„๐’๐’๐’”๐’•๐’“๐’–๐’„๐’•= ๐‘พ๐Ÿ+ ๐‘ช๐Ÿ,๐Ÿ๐‘ป ๐‘พ ๐Ÿ ๐’„๐‘ช

๐Ÿ,๐Ÿ+ โ‹ฏ + ๐‘ช๐’”,๐’”+๐Ÿ๐‘ป ๐‘พ๐’”๐’„๐‘ช๐’”,๐’”+๐Ÿ (3.29) For the multiple partition, a generalized K-way Ncut function can be similarly defined using ๐‘ฟ = [๐‘‹1, โ€ฆ ๐‘‹๐พ]. The optimal multiscale normalised cut partitioning can be solved by computing the ๐พ eigenvectors, corresponding to the ๐พ largest eigenvalues, with the maximising criteria of equation (3.30). The ๐พ largest eigenvalues is determined by the number of segments altered by a user. Selection of ๐พ values depends on the complexity of tumour structure. The higher the number of ๐พ values, the finer the composition is acquired. It is obvious that the higher the ๐พ values, the longer the computation time.

๐‘š๐‘Ž๐‘ฅ๐‘–๐‘š๐‘–๐‘ ๐‘’โก1 ๐พโˆ‘ ๐‘ฟ๐’๐‘ป๐‘พ๐‘ฟ๐’ ๐‘ฟ๐’๐‘ป๐‘ซ๐‘ฟ ๐’ ๐พ ๐‘™=1 (3.30)

Graphically, one-dimensional view of multiple-scale graph decomposition with ๐‘… = 1 is decomposed into three scales, ๐‘  โˆˆ {1, 2, 3}, as shown in Figure 3.4. At each scale, pixels are connected by the relationship defined by the affinity matrices: ๐‘พ๐Ÿ, ๐‘พ๐Ÿ and ๐‘พ๐Ÿ‘ for scales 1, 2 and 3 respectively. The connections between scales are defined by the cross- scale interpolation matrices: ๐‘ช๐Ÿ,๐Ÿ and ๐‘ช๐Ÿ,๐Ÿ‘. Pixels at each scale are sampled at (2๐‘… + 1)๐‘ โˆ’1. The value ๐‘… determines the relationship of two pixels in a graph, i.e. the two pixels are connected if they are within distance ๐‘…. The length of ๐‘… offers distinct segmentation results. Generally, a larger ๐‘… produces better segmentation because a long range graph

connections facilitate propagation of local grouping cues across larger image regions. However, the larger ๐‘… requires longer segmentation time.

Figure 3.4: Three scales of the MNcut graph decomposition with ๐‘… = 1. Relationship between pixels at each scale and cross scales are defined by affinity matrices ๐‘Š๐‘  and cross-scale interpolation matrices ๐ถ๐‘ ,๐‘ +1.

Evaluation of Segmentation