• No results found

Image Segmentation/Registration: a Variational Framework for 2-D and 3-D Applications

N/A
N/A
Protected

Academic year: 2020

Share "Image Segmentation/Registration: a Variational Framework for 2-D and 3-D Applications"

Copied!
128
0
0

Loading.... (view fulltext now)

Full text

(1)

and 3-D Applications. (Under the direction of Professor Hamid Krim).

Segmentation lands itself in the middle level of a computer vision system that it extracts boundary features in images. An accurate extraction of features will lead to the success of later higher level processes such as classification and recognition. Registration, on the other hand, is intimately intertwined with segmentation. An accurate allocation of edges, which are used as feature points, may increase the performance of registration. Whenever a single modality is not sufficient for segmentation and the resort to multi-spectral images is needed, and perfect alignment of these multi-spectral images will also facilitate the segmentation task.

In this thesis we propose segmentation and registration methods corresponding to different real applications. In the first biomedical application we propose a constrained Mumford-Shah type energy functional incorporated with an information-theoretic view and tuning weights. This model characterizes higher-order statistical properties of data and give a probabilistic flavor to our segmentation. It successfully segmented T1-Maps and T1-weighted images in both 2-D and 3-D. Validation of experts’ manual segmentations also shows our method outperform most other techniques. Moreover we propose a joint radiofrequency (RF) -inhomogeneity calibration method to correct the non-uniformity of RF filed for accurate T1-Map generation.

We propose a multi-phase joint segmentation and registration technique (MPJSR) for mid-range layered imageries in the second application. Our method in particular may bring the objects of interest in a pair of layered images into perfect alignment and delineate the boundaries simultaneously. Based on our technique, we furthermore tackle the tracking problem for layered videos. By calculating a constrained optical flow between consecutive frames, a prediction for the contour initialization may be made in the next frame to expedite and increase the segmentation performance.

(2)

by

Ping-Feng Chen

A dissertation submitted to the Graduate Faculty of North Carolina State University

in partial fullfillment of the requirements for the Degree of

Doctor of Philosophy

Electrical Engineering

Raleigh, North Carolina

2009

Approved By:

Dr. Gianluca Lazzi Dr. Kazufumi Ito

(3)

DEDICATION

(4)

BIOGRAPHY

Ping-Feng Chen received his BS degree in the Electrical Engineering department

from National Tsing Hua University, Taiwan, in 2001, and then he served in the Ministry

of National Defence Symphony Orchestra, Taiwan, for two years. In 2003 he came to the

United States and started his graduate studies in the Electrical and Computer Engineering

department at North Carolina State University, Raleigh, NC. He received his MS degree

in 2005, and then joined the Vision, Information and Statistical Signal Theories and

Ap-plication (VISSTA) group, under the direction of Dr. Hamid Krim, pursuing his Ph.D.

degree.

From 2005 to 2008 he worked as a teaching assistant and research assistant in

VISSTA group. His research interests include segmentation and registration, active contour

(5)

ACKNOWLEDGMENTS

I would like to first express my gratitude to my advisor, Dr. Hamid Krim. Without

his guidance, this thesis would not be possible. His academic suggestions and revisions of

my technical writings are most beneficial along the road of my doctoral study. I would also

like to thank Dr. Kazufumi Ito, Dr. Gianluca Lazzi, and Dr. Grif Bilbro for serving in my

committee and for their interests in my research.

Special thanks to Dr. Grant Steen for his great help, many discussions, and

polishing my writings in the first project of this thesis. I would also like to thank Dr.

Anthony Yezzi for his professional suggestions and revision of my technical writings.

Many thanks to VISSTA group members, Sajjad, Yang, Shuo, Djamila, Sheng,

and Deokwoo, for lots of healthy discussions and enjoyable moments of my research life.

Finally I would like to thank my parents for their endless support of my whole

(6)

TABLE OF CONTENTS

LIST OF TABLES . . . vi

LIST OF FIGURES . . . vii

1 Introduction . . . 1

1.1 Image Segmentation . . . 1

1.2 Relationship Between Segmentation and Registration . . . 3

1.3 Motivations and Thesis Contributions . . . 4

1.3.1 Brain MRI Image Segmentation . . . 4

1.3.2 Layered Image Registration and Segmentation . . . 5

1.3.3 3-D Model Reconstruction From Multiple Range Images . . . 5

1.4 Thesis Summary and Organization . . . 6

2 Preliminaries . . . 8

2.1 Curve Evolution and Level Set Method . . . 8

2.2 Active Contour Segmentation Methods . . . 10

2.2.1 Snakes . . . 10

2.2.2 Geodesic Active Contour . . . 11

2.2.3 Region-based Active Contour . . . 12

2.2.4 Mumford-Shah Model . . . 13

3 Joint Brain Parametric T1-Map Segmentation and RF-inhomogeneity Cal-ibration . . . 15

3.1 Introduction . . . 16

3.2 Proposed Model for Segmentation . . . 19

3.2.1 Mumford-Shah Energy and Information-Theoretic View Point . . . . 19

3.2.2 Probabilistic Assignment of Segmentation . . . 20

3.2.3 Fast Mumford-Shah Implementation . . . 21

3.3 Segmentation of a Parametric Brain T1-Map . . . 23

3.3.1 Registration of Flip-Angle Images and Generation of Brain Mask . . 23

3.3.2 Determination of Optimal Flip-Angles . . . 24

3.3.3 Brain T1-Map Segmentation Procedure . . . 25

3.4 Joint T1-Map Segmentation and RF-inhomogeneity Calibration . . . 27

3.4.1 Variable Nutation Method with RF-inhomogeneity . . . 27

3.4.2 Flip Angle Rectification and Segmentation of T1-Maps . . . 27

3.5 Experimental Results . . . 30

3.5.1 Subjects and Scan Protocol . . . 30

3.5.2 Registration of Flip-Angle Image and Brain Mask . . . 30

(7)

3.5.4 Flip-Angle Rectification and Segmentation of T1-Maps . . . 32

3.5.5 Segmentation of T1-weighted Images . . . 38

3.6 Discussion . . . 41

3.6.1 Registration of Flip-angle Images and Generation of Brain Mask . . 41

3.6.2 Segmentation of Brain Data and Probabilistic Segmentation . . . 41

3.6.3 Joint Segmentation and RF-inhomogeneity Calibration . . . 41

3.7 Conclusion . . . 42

4 Multi-phase Joint Segmentation-Registration and Object Tracking for Layered Images . . . 43

4.1 Introduction . . . 44

4.2 Camera Model and Homographies Between Multi-view . . . 47

4.3 Problem and Strategy Description . . . 51

4.3.1 Algorithm . . . 52

4.3.2 Rough Alignment of Mid-range Layered Images . . . 53

4.4 Multi-phase Active Contour with Joint Segmentation-Registration . . . 56

4.4.1 Joint Segmentation-Registration . . . 56

4.4.2 Adapted to Multi-phase Active Contour . . . 61

4.5 Optical Flow for Object Tracking . . . 67

4.5.1 Tracking in Two Image Sequences Under Constraint . . . 68

4.6 Discussion and Experimental Results . . . 71

4.6.1 MPJSR Results . . . 71

4.6.2 Tracking Results . . . 74

4.7 Conclusion . . . 79

5 Determining Canonical Views of 3-D Object . . . 80

5.1 Introduction . . . 80

5.2 A Model of Sum of Multiple Range Images . . . 82

5.3 MDL-Based Canonical View of a 3-D Object . . . 86

5.3.1 Theory . . . 86

5.3.2 Experimental Results . . . 87

5.4 A Compressive Sensing approach to Canonical View . . . 90

5.4.1 Theory . . . 90

5.4.2 Experimental Result . . . 94

5.5 conclusion . . . 96

6 Contributions and Future Work . . . 97

6.1 Contributions of The Thesis . . . 97

6.1.1 Contributions to Region-based Active Contour for Segmentation . . 97

6.1.2 Brain Segmentation and Accurate T1-Map Generation . . . 98

6.1.3 Joint RF-inhomogeneity calibration and T1-Map Segmentation . . . 98

6.1.4 Segmentation and Registration of Layered Images . . . 98

6.1.5 Canonical Views by Compressive Sensing and MDL Methods . . . . 99

(8)

6.2.1 Three-tissue Segmentation . . . 99

6.2.2 RF-inhomogeneity Calibration in the Transverse Plane . . . 100

6.2.3 Change of Phase Number in MPJSR . . . 100

6.2.4 Canonical Views . . . 100

Bibliography . . . 101

Appendices . . . 112

A Derivation of Gradient Flows For Our Proposed Energy . . . 113

(9)

LIST OF TABLES

Table 3.1 Different validation metrics for a simulated brain data of T1-weighted

(10)

LIST OF FIGURES

Figure 3.1 From left to right the weight βin increases such that the segmented region becomes smaller, and therefore we obtain purer tissues. . . 22

Figure 3.2 A carton illustrating the scanning orientation. . . 28

Figure 3.3 A flowchart for the whole procedure to jointly segment T1-Map and calibrate

RF-inhomogeneity. The procedure includes the registration of flip-angle images, brain mask generation, and our JSRIC method. . . 29

Figure 3.4 JSR technique: two active contours evolve in two flip-angle images (of angles 5◦ and 40◦) to jointly register and segment (generate the brain mask) the brain. The region outside the mask are brightened to better show the contrast. . . 31

Figure 3.5 The T1-Map generated by (a) unregistered and (b) registered flip-angle

images. It can be seen that the anatomical structure of the registered one is clearer than the other. . . 31

Figure 3.6 The error rate E versus number of flip-angles, and 10 flip-angles is at the inflection point of the fitted curve. . . 32

Figure 3.7 The T1-Maps generated by (a)2 (b)6 and (c)10 flip-angles on the top row,

and their corresponding error maps on the second row. . . 33

Figure 3.8 Histogram of a (a) T1-weighted image and a (b) T1-Map. . . 34

Figure 3.9 ”α map”: The plot ofα value versus slice number for two subjects. . . 35

Figure 3.10 . T1-Maps generated from unrectified (First row) and rectified (second row)

flip-angle images at slice number 5, 6, 14 and 15. . . 35

Figure 3.11 . Segmentation result of WM (first row) and GM (second row) for the testing subject at slice number 10 - 13. . . 36

Figure 3.12 Overlap metric of (a) WM and (b) GM segmentations for calibrated and uncalibrated T1-Maps with cubic function G(I) = I3 and tuned weights and also

for calibrated T1-Map with function G(I) =I with tuned weights. . . 37

(11)

Figure 3.14 3-D segmentation results of (a) WM and (b) GM for the testing simulated brain.. . . 39

Figure 3.15 Average OM for WM and GM segmentations on 20 normal real brain data of T1-weighted modality. The left and right columns denote the average OM of WM

and GM segmentation respectively. Some statistics are from IBSR and others are from [1]. They represent: AMAP: adpaptive MAP, BMAP: biased MAP, FUZZY: fuzzy C-means; MAP: Maximum a posteriori probability, MLC: maximum likeli-hood, TSK-MEANS: tree-structure k-means, FAST: hidden Markov method [2], ZENG: coupled-surface method [3], MPM-MAP: Bayesian method [4],and Dual-front: Dual-front method [1].. . . 40

Figure 3.16 3-D segmentation results of (a) WM and (b) GM for one real brain data of T1-weighted modality. . . 40

Figure 4.1 (a) Long-range and (a) mid-range layered imaging. . . 46

Figure 4.2 The pinhole camera model. The projected 2-D image is the intersection between the image plane and the rays illuminated from the camera. . . 49

Figure 4.3 Overlayed long-range layered images. One image is affine-transformed and overlapped on top of the other. A single affine transform can bring two images to an almost perfect alignment. . . 49

Figure 4.4 Overlayed mid-range layered images with windows focused on the objects of interest. A single affine transform can only bring two images roughly aligned. . . 50

Figure 4.5 A simple carton explaining our multi-phase joint segmentation-registration and tracking. . . 52

Figure 4.6 3-point correspondence method. 3 points on the basis and reference images are picked up and the affine transformation matrix H can be determined.. . . 54

Figure 4.7 (a.) Simulated images with a square on the left and its affine transformation on the right. The contours successfully segmented both images and captured the registration parameters, (b.) Joint segmentation-registration result for a layered EO image (grey scale image). It captured the front side wall of the building. . . 60

Figure 4.8 3-phase active contour segmentation on a simulated image.. . . 63

(12)

Figure 4.10 (Top to bottom) The curve evolution of ternary joint segmentation-registration for two different buildings. . . 72

Figure 4.11 (Top to bottom) The curve evolution of ternary joint segmentation-registration for two buildings which handles topological changes and copes with non-perfect planes.. . . 73

Figure 4.12 (a) The basis image with the located windows overlayed, (b) The transformed objects, which were segmented using ternary joint segmentation-registration, from the basis image will be in alignment with (c) The reference image. . . 74

Figure 4.13 From top to bottom, left to right. The 2 steps of tracking along 10 pair of frames along the layered image sequences. The odd rows show the end of each refinement step, where the contours attached on the boundaries and the registra-tion parameters were obtained; the even rows show the predicregistra-tion step, where the contours were propagated by the calculated velocities. The brighter windows in the even rows denote the new positions of the windows. . . 76

Figure 4.14 From top to bottom, left to right. The 2 steps of tracking along 10 pair of frames along the layered image sequences. The odd rows show the end of each refinement step, where the contours attached on the boundaries and the registra-tion parameters were obtained; the even rows show the predicregistra-tion step, where the contours were propagated by the calculated velocities. The brighter windows in the even rows denote the new positions of the windows. . . 77

Figure 4.15 The brighter contours represent the propagated contours by the calculated velocities, which are closer to the edges compared to the darker contours that are not being propagated.. . . 78

Figure 4.16 The number of approximated planes in the basis image is different from that in the reference image. . . 78

Figure 5.1 example . . . 83

Figure 5.2 example . . . 84

Figure 5.3 (a.) 3-D model, (b.) Reconstructed surface from the 11 canonical views obtained from minimum description length criterion (the poor resolution is due to computational complexity concern). . . 89

(13)

Chapter 1

Introduction

1.1

Image Segmentation

For a machine to perceive the world like human beings do, it needs additional

mechanisms to intelligently make sense out of the things it sees. A typical computer vision

system may be described as a systematic structure, in which a real world scene is first

acquired by the sensors (ex: optical lens), possibly followed by a de-noising process (ex:

filters), and a feature extraction procedure. These features are subsequently used for high

levelperception mechanisms, such as recognition and classification.

Segmentation, which detects edge features in images, is found in the middle level

of this hierarchical vision structure. It is a crucial step in the sense that accurate features

extracted in an image will be fatal to a successful recognition process. In addition to

delineating the boundaries of objects in an image, segmentation may also be instrumental

in dividing an image into subregions that separate the objects from the background, or,

discretely, in labeling each pixel as either foreground or background. Not only does it

provide features for later processing, segmentation may help for better visualization or

diagnosis purpose in medical applications or as an essential tool in tracking applications.

Numerous methods were proposed to segment images. Since the pioneering work

of Kass et al. [5], active contours have been extensively used for segmentation. In con-trast to other edge detectors such as Canny [6] or Sobel operators, which may generate

discontinuous and open curves, active contours yield continuous and closed contours. The

(14)

functional to be optimized. Minimizing this functional by a gradient descent method drives

the contours to attache onto edges in the imagery. This so-called evolution of the contours is then shown to be governed by a partial differential equation (PDE).

Depending on the type of information the energy functional incorporates, active

contour methods may further be classified into two categories: edge-based [5, 7, 8, 9, 10,

11, 12, 13, 14, 15, 9, 16] and region-based [17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27] method.

Edge-based methods, such as ”snake” [5] or the geometric active contour [8, 28], stop the

contour movement when an edge is found, which may for example be defined as a high gradient in the image intensity (for a gray scale image). Region-based methods, such as

region competition [19] oractive contour without edges [17, 18], on the other hand examine some statistics (ex: mean or variance) inside and outside the contours. By maximizing

the divergence of the statistics inside and outside the contour, segmentation is achieved.

Region-based methods have been proved to be more robust than edge-based method [19, 18]

and hence are adopted and studied in this thesis. The implementation of a contour evolution

involves the well establishedlevel set method [29], which has at least two advantages: a. it handles topological changes, b. automatically distinguishes the inside and outside region at

any time of the evolution. We will elaborate more about it in Chapter 2. In this thesis we

propose a constrained Mumford-Shah [27] energy functional (Chapter 3) (which falls in the

region-based category), to segment 2-D and 3-D biomedical images. Moreover, we propose a

method, our so-called Multi-phase Joint Segmentation-Registration (MPJSR) (Chapter 4),

to jointly segment and register objects in layered sensing imageries.

Object tracking in videos is also a direct application of segmentation. The simplest

edge-based tracking method is to segment each image in a sequence over time, such that at each frame the objects of interest are depicted. This method does not, however, utilize

any relationship between consecutive frames. A prediction step based on the optical flow (or visual motion) may hence be made to give a good contour initialization at the next

frame [30]. In this thesis we propose a simple tracking method (Chapter 4) for layered

(15)

1.2

Relationship Between Segmentation and Registration

Image registration, is the process of finding the correspondence or the

transforma-tion between two images containing the same scene, taken at different times, from different

perspectives, and/or by different sensors, and of overlaying them into one entity. The

purpose of registration may be to align multi-spectral (or modality) images to gain more

information and a better understanding of the scene. The alignment process constitutes the

bottom level of the data fusion hierarchical structure [31]. Different approaches have led

to registration methods being classified into two categories: area-based and feature-based

methods [32]. A feature-based registration method comprises of four steps: 1) feature

de-tection, 2) feature matching, 3) mapping function design, and 4) image transformation and

re-sampling. Segmentation closely relates to registration in the first step. The features

of interest are object boundaries which form the basis for contour evolution. The feature

matching step is addressed by taking into account that the contours are closed curves and

their parametrization along with their variation in multiple images closely followed. In this

sense segmentation is intimately tied to registration, and their interdependence is central

to many applications.

This is manifest in cases where a segmentation technique may not perform well

when using one modality and additional auxiliary information from other modalities (ex:

MRI scans of T1,T2,PD images) is generally beneficial, and the alignment of these modalities

also helps guide the segmentation task.

A joint segmentation and registration (JSR) technique, proposed by Yezzi et al. [33], successfully coupled these two processes in a variational framework. The ratio-nale of this methodology is to basically enforce a relationship (transformation) between two

contours evolved simultaneously in two images by minimizing the sum of their energies.

In this thesis, we adopt the same rationale and further extend it to develop our

Multi-phase Joint Segmentation-Registration (MPJSR) method (Chapter 4), which is

(16)

1.3

Motivations and Thesis Contributions

From an engineering point of view, every proposed method or solution comes from

a corresponding real world problem. This thesis, too, is application driven and the novel

segmentation, registration as well as the 3-D model reconstruction methods are proposed to

solve three real and practical problems. We briefly discuss these motivations and describe

contributions in this thesis towards addressing three applications elaborated below.

1.3.1 Brain MRI Image Segmentation

Research work on brain imaging has long been of interest and is challenging. Three

main tissues: GM (Gray Matter), WM (White Matter), and CSF (Cerebrospinal Fluid),

constitute the major parts of the brain. Segmentation of these three tissues is important for

either diagnosis or visualization purposes. The nature of the folded structure of the brain

cortex, the radiofrequency (RF)-inhomogeneity effects of MRI, and the low contrast (ex:

T1-Map) of an image all increase the difficulty for segmentation.

Our first application focuses on T1-Map modality1 with a goal of segmenting this

case of imagery. To that end, several problems need to be studied. They include: a

gen-eration of an accurate T1-Map using a proper set of flip-angle images [34], a registration

between various flip-angle images, a masking to remove extraneous parts and to preserve

the brain structure in the MR images, and a calibration of RF-inhomogeneity. We

pro-pose a constrained Mumford-Shah energy functional which 1) incorporates an

information-theoretic measure of different tissues and 2) puts weights on the various tissues, yielding a

probabilistic approach to segmentation. The proposed functional is driven by a three-tissue

segmentation of the brain. Moreover, the proposed technique simultaneously segments a

T1-Map and calibrates RF-inhomogeneities (our so-called JSRIC method). Our proposed

segmentation method will be demonstrated not only for T1-Map but also for T1-weighted

images, in both 2-D and 3-D settings, and are validated by an expert’s manual

segmenta-tions.

1T

1 (Spin Lattice Relaxation Time) is a time constant in MRI and T1-Map is a parametric map/image

(17)

1.3.2 Layered Image Registration and Segmentation

Layered images refer to the images of a common scene taken by (possibly) different

sensors at different altitudes and angles. Examples include aerial views of a city for traffic

monitoring or UAV (Unmanned Aerial Vehicle) airborne image acquisition for surveillance

or target recognition purpose [35]. Our focus differs from that of traditional aerial layered

images at high altitude and making it such that a single affine transformation will align

two images. With sensors located at mid-range altitude, our approach seeks to divide the

image into subregions and to achieve registration for each region, in a divide-and-conquer

fashion.

In this thesis we propose a multi-phase joint segmentation-registration (MPJSR)

technique to segment and register mid-range layered images. Moreover, we introduce an

object tracking procedure between a pair of layered videos to address video sequences. A

constrained optical flow (visual motion) between consecutive frames is calculated to predict

the location of object contours at the next frame.

1.3.3 3-D Model Reconstruction From Multiple Range Images

Range images are images whose pixel value denotes the distance between the sensor

and the object. Range images may be acquired by laser scanners, for example, and may

be so noisy that a 3-D object reconstruction is more than a simple overlay of all the range

images at corresponding positions. A 3-D object reconstruction from multiple range images,

may be viewed as a subsequent step to segmentation and registration. To delineate an object

in a range image requires segmentation and to merge different parts requires the knowledge

of the relative position of each range image, i.e. registration, and the reconstruction of a

3-D object is in this sense a data fusion.

We propose a novel model, for analysis purposes, and for reconstruction of a

3-D object from multiple range images, under the assumption that the segmentation of the

range image is carried out and the scanning angle is known. Moreover, we use the notion

(18)

1.4

Thesis Summary and Organization

In Chapter 2 we give a brief review of the theory and background of curve evolution

and segmentation, which will be used throughout the thesis. The curve evolution theory

and its numerical implementation, the level set method [36], are illustrated in Section 2.1.

In Section 2.2, we discuss the two categories, edge- and region-based, active contour

meth-ods for segmentation. Specifically sections 2.2.1 and 2.2.2 give two celebrated examples,

Snakes [5] and geometric (or geodesic) active contour [8, 28] respectively, of edge-based

active contour methods. Section 2.2.3 gives a region-based energy functional and its

corre-sponding curve evolution, which is a special case (piecewise constant approximation of the

image) of the well-known Mumford-Shah model in Section 2.2.4.

In Chapter 3, we propose a constrained Mumford-Shah type energy functional and

a systematic procedure for brain parametric T1-Map segmentation. Based on the proposed

segmentation method we further develop a method to jointly segment T1-Map and calibrate

radiofrequency (RF) -inhomogeneity which is present in all MR scanning processes.

Sec-tion 3.2 details our proposed energy funcSec-tional which incorporates an informaSec-tion-theoretic

point of view along with a weighting to yield a probabilistic segmentation. Section 3.3

first addresses two problems in order to generate an accurate T1-Map: the registration of

flip-angle images in 3.3.1, and the determination of optimal flip-angles in Section 3.3.2. A

systematic procedure for brain T1-Map segmentation is subsequently given in 3.3.3. In

Section 3.4 we describe the model for RF-inhomogeneity calibration in order to generate

an accurate T1-Map and propose our joint T1-Map segmentation and RF-inhomogeneity

calibration (JSRIC) technique. Section 3.5 demonstrates the results of flip-angle image registration, optimal flip-angle determination, the results of JSRIC, and the validation

per-formance. In Section 3.5.5, we further apply our segmentation method on another modality,

T1-weighted data in a 3-D setting, to demonstrate the generality of our proposed model

with validations.

In Chapter 4 we tackle another registration/segmentation problem on layered

im-ageries. Section 4.2 first reviews the camera model to understand our proposed strategy to

jointly segment and register a pair of layered images in Section 4.3. Section 4.4 describes our

proposed multi-phase joint segmentation-registration (MPJSR) technique. We first review

(19)

incorpo-rate a multi-phase method in 4.4.2. Section 4.5 extends our MPJSR technique to object

tracking in a pair of layered videos. Section 4.5.1 describes the constrained optical flow (or

visual motion) calculation which is used as a prediction of the contour position in the next

frames. The results of MPJSR are shown in Section 4.6.1, and its application leads to a

successful joint registration and segmentation of a pair of layered images, and an alignment

of the objects of interest in the images. The results of video tracking utilizing optical flow

calculation are shown in Section 4.6.2, which performs well for our methods.

Chapter 5 studies the problem of determining the canonical views from multiple

views of range images. A novel model to reconstruct a 3-D model from multiple range images is first proposed in Section 5.2. Two methods are used to determine the canonical

views. First the canonical views are determined using a MDL (minimum description length)

formulation in Section 5.3 followed by a compressive sensing procedure given in Section 5.4.1,

to parsimoniously sample a 3-D object. The respective reconstruction results are given.

Conclusions and possible directions of future works of this thesis are presented in

(20)

Chapter 2

Preliminaries

In this chapter we review the theory of curve evolution, the level set method, and

both of the edge-based and region-based segmentation methods, for the purpose of better

understanding and prelude to subsequent material in later chapters.

2.1

Curve Evolution and Level Set Method

Denote a family of continuous and closed curves by C(s, t) = (X(s, t),Y(s, t)),

which is a mapping I ∈ R×[0, T] 7→ R2, where s parameterizes a given contour and t

parameterizes the family. A general evolution may be expressed as advancing every point

on the curve in an arbitrary direction, written as a combination of the tangent and normal

directions, from an initial position of the contour C0(s)

∂C

∂t =FT(s, t)T +F(s, t)N,

C(s,0) =C0(s), (2.1)

where FT and F are both scalar functions, and T and N respectively denote the unit tangent and outward normal vector. It can be shown [37] that only the normal component

has an effect on the displacement of a point and the tangential component will only affect

the re-parametrization of the curve. Therefore Eq. (2.1) is equivalent to

∂C

(21)

Evolving a contour by Lagrangian approach involves the discretization of the

con-tour into finite nodes, and the readjustment of nodes’ spacing after each evolution. This is

required to preserve data fidelity and to reduce numerical approximation errors [38].

Another implicit representation of a contour, the level set function, was proposed

by Osher and Sethian [29] to embed a contour in a one-dimensional higher space to become

a popular scheme for curve evolution. A contour is implicitly embedded in a 0-level of a

level set function, Φ(x, y, t) :R2×[0, T]7→R, as

  

  

C ={(x, y) : Φ(x, y, t) = 0},

inside(C) ={(x, y) : Φ(x, y, t)<0},

outside(C) ={(x, y) : Φ(x, y, t)>0}.

(2.3)

While there may be an infinite choices of level set functions, a signed distance function is

preferred for its stability in numerical computations, i.e. Φ(x, y, t) = d((x, y),C), where

d((x, y),C) denotes the Euclidean distance from the point (x, y) to the contour C.

The evolution of the level set function can be derived from the constraint that, at

any time, the contour,C, corresponds to the 0-level set such that Φ(C, t) = Φ(X,Y, t) = 0.

Takeing the partial derivative of Φ with respect to t, we have

Φt+ ΦxXt+ ΦyYt = 0, Φt+ (Φx,Φy)·(Xt,Yt)T = 0,

Φt+∇Φ·Ct = 0, (2.4)

where we have used the subscript to denote a corresponding partial differentiation and

the gradient operator. Substituting the evolution of contour Ct by Eq. (2.2), we have

Φt+∇Φ·F(s, t)N = 0. (2.5)

Notice that the outward unit normal vector can be written as N =Φ/k∇Φk [29] where

k · k denotes the Euclidean norm. Replacing it into Eq. (2.5) we have

(22)

The forceF(s, t) is often a function of curvature. By letting it equal to the negative curvature, κ, we have Ct =κN, or in terms of level set function, Φt =κk∇Φk. This

flow is known as theGeometric Heat Equationand it smoothes an image. It has been shown by Grayson [39] that Heat Equation shrinks an embedded closed curve to a single point

and make it more and more circular along the way. In turns of curve evolution, it also

corresponds to thecurve shortening flow [40] which minimizes the arc-length

L= Z 1 0 ∂C ∂s ds, (2.7)

where we assume C(0, t) = C(1, t). Curvature representation for the level set is κ =

∇ ·k∇∇ΦΦk

, where ∇· denotes the divergence operator, and therefore the geometric heat

flow of the level set is

Φt=∇ ·

∇Φ

k∇Φk

k∇Φk= Φ

2

yΦxx−2ΦxΦyΦxy + Φ2xΦyy Φ2

x+ Φ22

. (2.8)

Notice that in the implementation of Eq. (2.6), the forceF(s, t) is only defined on the 0-level set, or the contour C(s, t) though. We need to define the force on every grid

points in the domain of a level set function for the level set to evolve globally. Different

methods of extending the force, such asglobal ornarrow band extension[29], then follow. To improve on computational complexity, fast implementation such asnarrow band method [41] or real-time method [42] have been proposed as well.

2.2

Active Contour Segmentation Methods

In this section, we review some active contour methods for segmentation. They

can generally be classified into two categories: edge-based and region-based methods. We

present two examples for each category.

2.2.1 Snakes

Kass et al. [5] proposed an energy minimization based framework to deform a contour. The energy can be written as the sum of two terms

(23)

whereEint and Eext denote the internal and external energy functionals respectively. The internal energy determines the regularity of the contour such as the smoothness and the

rigidity. A common choice of the internal energy may be given by

Eint=

Z 1

0

α C′(s)

C′′(s)

ds, (2.10)

where α and β are positive reals controlling the weighting between the two terms. The external energy determines the force from the image to guide the contour evolution. A

common choice is to let the edge (high gradient) attract the contour

Eext=

Z

C

1

λk∇Gσ ∗I(x)k

dx, (2.11)

where λ is a real number controlling the weights, and Gσ denotes a smoothing Gaussian filter with standard deviation σ.

In a numerical implementation, finite nodes are used to approximate the contour,

and each node is moved according to some greedy algorithm to minimize the energy. The

resulting contour locates the local minimum ofEs, and if the initial contour is close enough to the boundaries in the image, one hopes the results will reach the global minimum.

2.2.2 Geodesic Active Contour

Caselleset al.[8] and Kichenassamyet al. [28] proposed another edge-based active contour model which is called geometric orgeodesic active contour. The idea of a geodesic active contour is that instead of using a length metric kC′k in the curve shortening flow

(Eq. (2.7)), a new metric which takes into account of the image is used. The model is to

minimize

min

C

Z 1

0

g(I(C(s)))C′(s)

ds, (2.12)

where the function g(·) is usually chosen as a monotonically decreasing function of the image gradient to detect edges, such as 1+k∇Ik1 . This new metric therefore does not find the minimal classical length but minimizes a new length definition (g(C)kC′k) which takes

(24)

The gradient flow for this model can be derived as

∂C

∂t = (g(C)κ− ∇g·N)N. (2.13)

The geodesic model is also very sensitive to local minima and has to be initialized close to

the edges in order to lock onto the global minimum.

2.2.3 Region-based Active Contour

Instead of examining the image gradient in search of edges, which is very sensitive

to noise, region-based active contour looks at some statistics in the regions inside and outside the contour, and move the contour according to separating these statistics furthest away. Since region-based methods examine the statistics globally of the whole image divided

into subregions, they are proved to be more robust than local edge-based active contour

methods.

Yezzi et al. [18] proposed a global region-based approach that also puts the seg-mentation problem in an energy minimization framework. Let an image consist of two

regions, the foregroundRin and the backgroundRout = Ω\Rin, where Ω denotes the whole domain of the image. An energy functional can be constructed as the difference between

some statistics, for example the mean, of the regions Rin and Rout, and the active contour is evolved in a way to pull these two statistics furthest apart.

min

C Ey(

C) =1

2(u−v)

2, (2.14)

where u and v denote the mean inside and outside the contour C respectively. When

the active contour locks onto the edges in an image, the energy (2.14) achieves its global

minimum. In other words, this model approximates an image as a piecewise constant

function with its value equal to the means inside and outside the object.

In order to prevent the contour from enclosing outliers such as salt and pepper

type noises, a length penalizing term may be added to yield

min

C E(

C) =1

2(u−v)

2+αI

C ds, (2.15)

(25)

length. The gradient flow of the contour may be derived by calculus of variation to yield

∂C

∂t = (u−v)

Iu |Rin|

+ I−v

|Rout|

N ακN, (2.16)

where|R|represents the area of region R.

Chan and Vese [17] also proposed a similar energy

min

C ECV(

C) =

Z

Rin

(Iu)2dx + Z

Rout

(Iv)2dx +α I

C ds, (2.17)

and the resulting gradient flow is the same as the flow in (2.16) with the restriction that

the areas inside and outside the contour equal, i.e.,|Rin|=|Rout|

∂C

∂t = (u−v) (2I−u−v)N −ακN. (2.18)

The advantage of these region-based active contours is that they are capable of

zeroing in on the global minimum, regardless of the initialization displaying less sensitivity

to noises compared to edge-based methods.

2.2.4 Mumford-Shah Model

Most active contour methods for segmentation usually find the boundaries which

divide an image into regions assuming that inside each region the intensities are

homoge-neous; most nonlinear diffusion methods smooth an image within homogeneous regions but

not across the boundaries of such regions. Mumfor-Shah’s model [27] simultaneously

ad-dresses these two goals. Their model basically approximates an image by piecewise smooth

functions

E(fRin, fRout,C) =

X

i=in,out

Z

Ri

n

(fRi −I)

2+νk∇f

Rik

2o

dx +α I

C ds, (2.19)

where α, ν R+ control the weighting between various terms. fR

i approximate

the imageI for regionRi,i∈ {in, out}. fRi is smooth within each regionRi, but not across

the boundaries, and C denotes the region boundaries. By minimizing this functional we

(26)

region but not across the boundaries (second term), while penalizing excessive length of

the boundaries (last term). Several papers [43, 44, 45] have addressed the derivation of

an approximated function for Mumford-Shah model, and the results indeed achieved both

(27)

Chapter 3

Joint Brain Parametric T

1

-Map

Segmentation and

RF-inhomogeneity Calibration

In this chapter we propose a constrained and novel Mumford-Shah type energy

functional for brain magnetic resonance imaging (MRI) T1-Map segmentation. The

incor-poration of an information-theoretic view and of selective weights achieves better

segmen-tation results, and also makes three-tissue (WM, GM, and CSF) segmensegmen-tation possible.

Based on our proposed method, we propose a joint segmentation and RF-inhomogeneity

calibration (JSRIC) approach to calibrate the RF-inhomogeneity in the vertical (z-) direc-tion and simultaneously segment T1-Maps. In Section 3.2, we review the Mumford-Shah

model for segmentation, followed by an information-theoretic point of view, and then state

our adapted model, with a fast implementation method. In Section 3.3, we study two

prob-lems to generate an accurate T1-Map, by including the registration of flip-angle images and

the determination of optimal flip-angles. We propose a systematic procedure to segment a

brain T1-Map in Section 3.3.3. In Section 3.4, we propose a novel method to jointly segment

a T1-Map and calibrate RF-inhomogeneities of MRI. In Section 3.5, we show the results

of registration of flip-angle images, the optimal flip-angles, segmentation of T1-Maps with

validations, the resulting T1-Map after RF-calibration, and the 3-D segmentations on two

(28)

provide some discussion in Section 3.6, and conclude this Chapter in Section 3.7.

3.1

Introduction

Brain structure segmentation is the apportionment of brain tissue into gray matter

and white matter, based on the appearance of tissue in images produced by magnetic

resonance imaging (MRI). Because manual tracing of the boundaries between tissues in the

brain is labor intensive, difficult, error-prone, and unrealistic for large amounts of data,

an automated or semiautomated segmentation technique is needed for either visualization

or diagnosis. Different imaging modalities, such as T1-weighted, T2-weighted, or Proton

Density (PD) images, have been used for different segmentation methods. T1-weighted

images, because of their good contrast [3], have been widely tested for various segmentation

methods [46, 47, 48, 1, 49]. A T1-Map is a parametric image of pure T1 (spin lattice

relaxation time), derived from the solution of an equation describing tissue relaxation,

and a parametric T1 map which is different from a T1-weighted image. The relationship

between T1 and several diseases, such as schizophrenia or sickle cell disease, has been

studied [50, 51, 52, 53], and T1 may be used as a possible indicator of pathology. Change

in T1 of certain voxels in the brain over time, may be be an early indicator of possible

pathology [51]. Therefore, the segmentation of a parametric brain T1-Map may highlight

pathology unseen by other imaging approaches.

Past research has studied the segmentation of cortex in the brain. The three

tissues- white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF), constitute

the main parts in the brain. The goal is to find their respective boundaries. Different

methods have been proposed to achieve this goal, and they may be classified into various

categories: fuzzy segmentation methods [48, 54], Markov random field (MRF) methods [55,

56], Bayesian methods [4], active contour methods [3, 1, 49], or the combinations of two

or more techniques. Some of these combinations are as follows: Leemput et al. [47] used Expectation-maximization (EM) and MRF, Xu et al. [46] used fuzzy segmentation and deformable surfaces, and Zhang et al. [2] combined a hidden Markov random field and an EM algorithm. Our method falls in the active contour category in the region-based

formulation. It is an adaptive version of Mumford-Shah’s model [27] to systematically

(29)

Before the segmentation of a T1-Map, several issues arise for obtaining an accurate

T1-Map. A T1-Map may be calculated by a rapid method known as the variable nutation

method [57] which provides comparable precision but much faster speed over conventional

methods [58]. This method requires the acquisition of a set of flip-angle images and the

T1 information can be extracted therein. The problem of determining the set of optimal

flip-angles therefore was studied. Deoni et al. [58, 34] proposed methods to determine the optimal flip-angles by basically maximizing the signal to noise ratio (SNR) of a T1-Map.

Their first work [58] required the knowledge of average TR/T1 in advance though, and

their second work [34] introduced a weighted least-square method to estimate the angles.

We take another approach to determine the optimal flip-angles to achieve the trade-off

between acquisition time and T1 accuracy.

Since a T1-Map is generated by a set of flip-angle images, alignment of of the images

is important in order to obtain an accurate T1-Map. We therefore propose a method to

register the raw data. The registration of flip-angle images is usually ignored though because

the movement of the head in the coil is minute. However, slight registration errors affect

the resulting T1-Map dramatically, as will be shown.

Radiofrequency (RF)-inhomogeneity is another unavoidable problem encountered

in MR imaging. The nonuniform distribution of the RF field can cause the resulting images

to have low contrast and inhomogeneous intensity, which makes quantitative description

and segmentation of the image difficult [59]. The basic theory of MR imaging is that when

the RF field is applied in the scanner and interacts with the magnetic field, the spins of

the protons are tilted by a certain angle (thus the name of a flip-angle image). The spin

returns to the equilibrium state after the RF field is switched off, and the resonance energy

resulting from this return to a fully-relaxed state is what is reconstructed to make the

images [60]. RF inhomogeneity then affects the generation of T1-Map in the sense that the

spins are not tilted by the predefined nominal angles. Hence, we focus on calibrating the RF non-uniformity in order to generate an accurate T1-Map. Cheng et al. [57] calculated

an analytical form of T1errors induced by RF non-uniformity and allowed simple correction

of T1 measurements. Both Wanget al. [59] and Venkatesanet al.[61] incorporate a scaling

factor α to rectify the nominal flip-angles in their models. We propose a method, which assumes the average T1 value among a transverse plane is the same across slices, to jointly

(30)
(31)

3.2

Proposed Model for Segmentation

Active contour methods comprise a popular segmentation technique in which an

initialized contour is driven by a partial differential equation (PDE) to minimize an energy

functional designed to attract the contour toward image edges. Active contour methods

can be classified into two categories: edge-based [5, 7, 8, 9, 10, 11, 12, 13, 14, 15, 9, 16]

and region-based [17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27] methods. Edge-based methods

examine the gradient information of the image, and stop the contour whenever the gradient

is high. However, there are many situations when the edge is not clearly characterized

by the gradient, and it has been shown that region-based methods outperform edge-based

methods [19, 26, 17, 18, 23, 25]. By examining some statistics of the region inside and

outside of the active contour, and optimizing the separation of these two statistics, we

may achieve a better segmentation performance, thus making region-based methods more

attractive.

Our proposed model falls in the region-based category. Before stating our adapted

energy, we first illustrate Mumford-Shah’s [27] model. Two adaptations of Mumford-Shah’s

model constitute the novelty of our proposed technique. First is the incorporation of an

information-theoretic view, and second is a selective weighting which favors erring towards

one tissue type or another. Both of them will be illustrated in the following two subsections.

3.2.1 Mumford-Shah Energy and Information-Theoretic View Point

Mumford and Shah [27] proposed an energy functional that models an image as

a piecewise continuous function, i.e., the intensity is homogeneous within the interior and

exterior of the region of an object, but has a jump across the boundaries. Given a closed,

smooth contour C~ located within the image domain Ω, the image can be divided into two regions, the inside and the outside of the contour, Rin and Rout. The energy can then be expressed as

E(fRin, fRout, ~C) =

X

i=in,out

Z

Ri

n

(fRi−I)

2+νk∇f

Rik

2o

dx

I

~ C

(32)

where fRi approximates the image I for region Ri, i=in or i=out, and α, ν ∈ R

+ are

parameters that control the relative weighting between various terms above. fRi is smooth

within each region Ri, but not across the boundaries denoted by C~. By minimizing this energy, we search for an approximating function that is faithful to the image (initial terms),

and smooth within the region interior and exterior of the contour (middle terms) but not

across the boundaries, while penalizing excessive boundary length (last term).

In recent work by Unalet al.[23], the authors approach the segmentation problem from an information-theoretic point of view and were able to obtain a general form of the

energy. Rather than pulling some statistics (such as mean or variance) for the inside and

outside region of the contour apart [18], they instead treat the pixel value of the image as

a random variable, and pulled the probability density functions (PDF) of the two regions

apart to achieve the goal of segmentation. They maximized the so-called generalized

Jensen-Shannon (JS) divergence [62], and found that JS divergence corresponds to an energy

EJS =

r

X

j=1

Z

Rin

(uj−Gj(I))2dx

+

Z

Rout

(vj−Gj(I))2dx

, (3.2)

whereGj(·) are some functions of the image, anduj are the expectations ofGj(·),E[Gj(I)]. The constructed function Gj(·) characterizes the property of the PDF of the image. For example it may reflect the degree of skewness or kurtosis of a density relative to a Gaussian

density [63]. A proper choice ofGj(·) will capture the statistical characteristics of the data hence giving a good segmentation.

This energy is actually a generalized form of the external energy functional

pro-posed by Chan and Vese [17], which is a simplified version of the Mumford-Shah energy

with constants in place of the functionsfRin/out. Our first modification of this energy is to

replaceI by G(I) in Eq. (3.1). The choice ofG(·) will be shown in Section 3.5.

3.2.2 Probabilistic Assignment of Segmentation

Our second modification of the energy, which also motivates our method of

seg-menting different brain tissues, is to tune the weights on the inside and outside integrals

(33)

prob-abilistic segmentation differs from other probprob-abilistic or fuzzy segmentation [48, 54] in the sense that existing methods yield a measure of confidence in the membership of each pixel

(voxel for 3-D) within a certain tissue class. Our method however works by assigning some

weight, so we have a higher (or lower) chance of segmenting certain kinds of pure tissue. The reason for avoiding a hard segmentation and carrying out either a fuzzy or a proba-bilistic segmentation is because of partial volume effects [46, 1, 3], when some pixels (or voxels)actually contain a mixture of different tissue types.

Our proposed weighting of the inside versus outside energy terms, together with the

aforementioned incorporation of an information-theoretic view, yields the energy functional

E(fRin, fRout, ~C) =

X

i=in,out

βi

Z

Ri

n

(fRi−G(I))

2+νk∇f

Rik

2o

dx

I

~ C

ds, (3.3)

whereβi ∈R+,βin+βout = 1, denote the weights applied on the inside and outside energy terms. This provides a probabilistic assignment to the segmented regions. Enhancing the

weight of the interior of the evolving contour, βin, is tantamount to penalizing both the error of the difference between the approximated functionfRin, and the data fidelity term

G(I), as well as the degree of smoothness of fRin. This would yield a smaller segmented

region which is likely to be more faithful to the image, and of ”purer” tissues. Fig. 3.1

shows examples of white matter (WM) segmentation with different weights. With a larger

inside-weightβin, the segmented region is smaller and the segmented tissues are purer.

3.2.3 Fast Mumford-Shah Implementation

Minimizing the Mumford-Shah energy functional involves solving for the

approxi-mating functionsfRin/out and for the contour C~. The joint search for these infinite

dimen-sional unknowns usually entails gradient descent flows. In particular, the approximated

functions are typically modeled as a linear combination of a basis set whose dimension

equals the number of pixels in the image, i.e. each pixel is assumed to be independent [44].

The curve and the approximated functions are then evolved iteratively along the

(34)

Figure 3.1: From left to right the weight βin increases such that the segmented region becomes smaller, and therefore we obtain purer tissues.

method was in order. Alvinoet al.[45] proposed a fast implementation using a significantly smaller basis number to model the approximated functions, while still achieving sufficient

resemblance to the obtained functions when using the pixel-by-pixel basis.

We adopt their so-calledlinear heat equation basis with the change fromI toG(I) to incorporate the statistical properties of the data from an information-theoretic point of

view in Section 3.2.1. We hence have,

fRi = γ1,iG(I) +γ2,imean(G(I)), (3.4)

where mean(·) is the average function, and the coefficients γj,in(out), j = 1,2, may be similarly derived as in [45]. Our derivations are included in Appendix A.

Substituting Eq. (3.4) into Eq. (3.3), and using a classical methods of variational

calculus, the gradient descent evolution of the curve may be derived as

∂ ~C

∂t =

n

βout

h

(fRout−G(I))

2+ν

k∇fRoutk

2i

−βin

h

(fRin−G(I))

2+ν

k∇fRink

2ioN

−ακN, (3.5)

= (βoutFout−βinFin)N −ακN, (3.6)

where κ denotes the curvature of the contour C~, t an artificial time evolution parameter, andN the outward normal of the contour. Fin/out are derived by substitutingfR

in/out from

(35)

3.3

Segmentation of a Parametric Brain T

1

-Map

A brain parametric T1-Map is not a direct outcome from an MRI scanner but

is actually calculated from several flip-angle images. Because the flip-angle images are

acquired at different times, and because the subject may move during image acquisition,

registration should be carried out first to obtain an accurate T1-Map. Moreover, in order

to achieve a balance between the acquisition time and the resulting T1-Map accuracy, we

propose a method to determine optimal flip-angles. In the following subsections we will

first illustrate how we register the flip-angle images and obtain a mask as a byproduct,

then describe our method to determine a set of optimal flip-angles, and at last describe our

proposed procedure to segment a T1-Map.

3.3.1 Registration of Flip-Angle Images and Generation of Brain Mask

The value of T1is traditionally determined by acquisition methods such as

Inversion-Recovery (IR) or Saturation-Inversion-Recovery (SR). Other rapid methods, such asvariable nutation (the DESPOT method) have been proposed [57], and require acquisition of several flip-angle

images, and calculation of T1. Since these flip-angle images are acquired at different times,

registration must first be accomplished. Even though the interval between consecutive scans

may be as short as two minutes, and movement of the subject’s head inside the receiver

coil may be a few pixels (under the resolution of a 256×256 image), the effects of such

off-registration can be significant, as shown (Fig. 3.5). Here we describe a method to register

flip-angle images and jointly obtain a mask, as a byproduct, to get rid of the skull and other

structures around the brain.

We use a joint segmentation and registration (JSR) technique proposed by Yezzi

et al.[33] with an additional tuning weight, to achieve registration and to obtain the mask. The theory of JSR technique consists of evolving two contours, with a enforced relationship

(ex: rigid or affine transform) between them, in two images according to a partial

differ-ential equation (PDE) which is a result of optimizing, for example, the sum of two energy

functionals.

For our particular task, we may choose a region-based energy, such as Chan and

Vese’s model [17] incorporating weights, which arises as a special case of Eq. (3.3) with

(36)

outside the contour. This model approximates the image I by a simple piecewise constant function, which suits our goal here because it creates a mask that divides the image into two

parts- brain region and non-brain region. This uses the data I itself without considering higher-order statistics, and a piecewise constant approximating function is sufficient.

We observe that the boundary between the brain and non-brain is visually more

easily distinguished than the boundary between different tissues within the brain, for every

flip-angle image. For example, Fig. 3.4 shows two flip-angle images, and we notice that the

contrast between the tissues inside the brain is poor. We therefore put a very small weight

βin on the inside energy in Eq. (3.3) to penalize very little the difference between the data inside the contour, I, and the approximated function mean(I). Experimental results show thatβin= 0.4 yields satisfactory results.

Furthermore, our curve evolution is implemented via level-set methods [29] using

the narrow-band techniques [36], where the level set function changes its values only within

a predefined band around the contour. We may therefore initialize a sufficiently small

contour at the center of the images such that it is localized fully inside the brain, and its

evolution will eventually segment only the brain region without crossing the boundary to

the non-brain. We carry out this registration technique pairwise for all flip-angle images,

and once the flip-angel images are registered to each other, they are used to generate a

T1-Map.

3.3.2 Determination of Optimal Flip-Angles

The T1-Map is generated by a variable nutation method which is later detailed

in Section 3.4 for a better understanding of our RF-inhomogeneity calibration method.

We begin by acquiring images at a set of 19 flip-angles which spans the range of standard angles [64] and will give an optimized T1-Map. This is also confirmed by observing that the

generated T1-Map gives the best quality. We call this particular T1-Map thegold standard,

or T1G, and denote this set of angles as ~θ19= [2◦,5◦,10◦,15◦,20◦,25◦,30◦,35◦,40◦,45◦,

50◦,55,60,65,70,75,80,85,90]. We then compare the T

1-Map generated from all

combinations of the subset of~θ19with T1G. Out of these combinations we select the optimal

subset of angles. Since the data acquisition is slice-based, this study of optimal flip-angle

focuses on the central slice, which is least affected by RF-inhomogeneity [65] and is routinely

(37)

corpus callosum, and generally shows the putamen and lateral ventricle [66]. We denote the

optimaln-angles~θopt,n, which is a subset of~θ19, as those that exhibit the smallest difference

between T1G and the T1-Map generated bynflip-angles:

~

θopt,n = arg min ~ θn

X

(x,y)∈brain

T1G(x, y)−T1~θn(x, y)

, (3.7)

where n ∈ {2,3, . . . ,19}, ~θn ⊂ ~θ19, T1~θn denotes the T1-Map generated by ~θn, and the

summation of (x, y) is over the whole brain region at the central slice.

Eq. (3.7) therefore gives 18 sets, with the number of elements ranging from 2 to

19, of optimal angles. For the determination of the optimal set of flip-angles, in reaching

a compromise between acquisition time (efficiency) and T1-Map quality (accuracy), we

examine the error between T1~θopt,n and T1G and compute the error rate. The error rate is

defined as

E = X

(x,y)∈brain

e(x, y)/A(brain), (3.8)

whereA(brain) is the area of the brain, ande(x, y) is the error, defined as 1 if the difference between T1G(x, y) andT1~θopt,n(x, y) is greater than some thresholdε, and 0 otherwise. The

threshold ε is defined as the minimum of the standard deviation among the T1 values of

three brain tissues (WM, GM, and CSF) manually segmented by an expert. The summation

of e(x, y) is over the whole brain at the central slice.

The plot of the error rate versus the number of flip-angles is shown in Fig. 3.6,

where the inflection point of the fitted curve is at 10 flip-angles (~θopt,10 = [2◦,5◦,55◦,60◦,

65◦,70,75,80,85,90]). We hereafter routinely use these 10 flip-angles to generate the T1-Map, a compromise between efficiency and accuracy.

3.3.3 Brain T1-Map Segmentation Procedure

Once the brain mask is obtained (Section 3.3.1), it is used to segment away the

skull, leaving only three major tissues in the image: WM, GM, and CSF. Notice that

the curve evolution corresponding to the energy introduced in Section 3.2 always results

in a ”binary segmentation”, where we will have regions inside (foreground) and outside (background) the contour(s). We can not simultaneously segment the three tissues, even

(38)

weights (Section 3.2.2), to penalize the error between the data term and the approximated

function (Eq. (3.3)) to segment one tissue at a time, with a progression analogous to that

of ”peeling an onion”.

We illustrate our T1-Map segmentation procedure for ahard segmentationof three

tissues, and the probabilistic segmentation is obtained by varying the weights around the

value of the trained weight (Section 3.5.4). This procedure is applicable to both 2-D and

3-D data sets.

A T1-Map segmentation procedure consists of three steps, where the first two steps

are to evolve the contours by minimizing the energy in Eq. (3.3) with different weights, and

the third step is just a simple subtraction. The procedure is: 1) treat WM as the foreground,

everything else as the background, and letβin be the trained weightβin,W M in Eq. (3.3) to segment WM in the interior region of the contour, 2) treat WM and GM as the foreground,

CSF and everything else as the background, and let βin be the trained weight βin,CSF in Eq. (3.3) to obtain CSF in the exterior region of the curve filtered by the brain mask, and 3)

GM is obtained by subtracting WM and CSF from the whole brain. The procedure is based

on the anatomical observation that GM is enclosed by CSF, and that CSF is separated from

WM [47, 3], such that we may peel off one layer at a time.

The choice of function G(I) in Eq. (3.3), which is chosen to better capture the statistical property of T1-Map (and for other modality), will be shown in Section 3.5. The

values of βin,W M and βin,CSF are determined through a training process. Suppose an ex-pert’s manual segmentation is regarded as the ground truth. IfRedenotes the segmentation region by the expert, andRβin denotes the segmentation region by the weightβinwith some

fixed G(I), for some tissue, then the value ofβin,tissue is determined by minimizing

βin,tissue = arg min

βin

1 |Re∩Rβin|

|Rβin|+|Re| − |Re∩Rβin|

, (3.9)

(39)

3.4

Joint T

1

-Map Segmentation and RF-inhomogeneity

Cal-ibration

RF-inhomogeneity is an unavoidable problem in MR imaging. The strength of the

RF field varies within the MR scanner, such that the resulting image may be of low contrast

or of non-uniform intensity. In what follows, we will first describe how a T1-Map is calculated

from a set of flip-angle images, then how the T1-Map is affected by RF-inhomogeneity and

then show our proposed method to correct it.

3.4.1 Variable Nutation Method with RF-inhomogeneity

A T1-Map is calculated by variable nutation [57], in that images are acquired

using a FLASH sequence at different flip-angles, and T1 is calculated from the slope of a

least-square fitted line to the pair of data hsins(θ)θ,tans(θ)θi, where s(θ) is the signal strength of the flip-angle image expressed as a function of the flip-angleθ.

RF-inhomogeneity affects the computation of T1 in that the spin of protons at

some spatial positions are not tilted by the angle θ because the strength of the RF field is not as predefined. This phenomenon is more prevalent at the periphery of the receiver coil.

Calibration of RF-inhomogeneity can produce a more accurate T1-Map, since this enables

us to correct the flip-angles. Therefore a spatially dependent scaling factor α has to be introduced to adjust the tilted angle [59, 61], i.e. replacingθ byαθ

s(αθ) sin(αθ),

s(αθ) tan(αθ)

, (3.10)

and T1 is extracted from the slope of the line fitted to the above space. Our imaging

set-ting is slice-based across different transverse planes (Fig. 3.2), and since by observation the

RF-inhomogeneity effect is most significant across the z-direction, our proposed joint seg-mentation and RF-inhomogeneity calibration (JSRIC) is as an initial step to easily calibrate

the flip-angles vertically.

3.4.2 Flip Angle Rectification and Segmentation of T1-Maps

Our method is based on the assumption that the ”average” T1 of WM should be

(40)

Figure 3.2: A carton illustrating the scanning orientation.

a slice [67], we have no evidence that WM T1 varies within or between slices.

Our joint segmentation and RF-inhomogeneity calibration (JSRIC) method

re-quires first taking the averaged T1 value for segmented WM at the central slice (which is

least affect by RF-inhomogeneity [65]) as the reference, and then iteratively segmenting

and searching for the scaling factor α in Eq. (3.10) for all other slices. This is a three-step iterative process (as shown inside the dashed box on the flowchart Fig. 3.3)

1. Segment WM for thecentral slice by the method proposed in Section 3.3.3, compute the average T1 value of WM, and denote the average asM

2. For slice m, search α by letting it equal αmin+i∆α in Eq. (3.10), and calculate the

corresponding T1-Map, denoted as T1(α). Ifα goes beyondαmax, claim the current

slice as uncalibratable and repeat this step for the next slicem+ 1

3. Segment WM for T1(α), compute the average T1 of WM, and denote it byL(α). If

M 6=L(α), go back to step 2 and increase α (by incrementing i), otherwise claim it is done for the current slice

4. Iterate step 2 and 3 through all the slices

(41)

Figure 3.3: A flowchart for the whole procedure to jointly segment T1-Map and calibrate

Figure

Figure 3.1: From left to right the weight βin increases such that the segmented regionbecomes smaller, and therefore we obtain purer tissues.
Figure 3.2: A carton illustrating the scanning orientation.
Figure 3.7: The T1-Maps generated by (a)2 (b)6 and (c)10 flip-angles on the top row, andtheir corresponding error maps on the second row.
Figure 3.10: . T1-Maps generated from unrectified (First row) and rectified (second row)flip-angle images at slice number 5, 6, 14 and 15.
+7

References

Related documents