Unsupervised segmentation of multi-echo MR images with an ART-based neural network

(1)

MURDOCH RESEARCH REPOSITORY

http://dx.doi.org/10.1109/ICNN.1995.487819

Li, W. and Attikiouzel, Y. (1995) Unsupervised segmentation of

multi-echo MR images with an ART-based neural network. In:

Proceedings of the IEEE International Conference on Neural

Networks, 27 November - 1 December, Perth,

Western Australia pp. 2600 - 2604 vol.5.

http://researchrepository.murdoch.edu.au/20631/

Copyright © 1995 IEEE

Personal use of this material is permitted. However, permission to reprint/republish

this material for advertising or promotional purposes or for creating new collective

(2)

Unsupervised Segmentation of Multi-echo

MR

Images

With an

ART-Based

Neural Network

Wanqing Li

Yianni

Attikiouzel

Centre for Intelligent Information Processing Systems The University of Westem Australia, WA 6009, Australia

[email protected], [email protected]

ABSTRACT

This paper investigates the suitability of an ART-based neural network for unsupervised segmentation of multi-echo MR images. T h e ART2A network was used to segment standard dual-echo

MR

images. Two problems were identified with the basic ART2A: one, the network was hardly convergent; and two, the categorization depended on the order of presentation

of the patterns. In order to solve these two problems, a dynamic learning parameter and

a random pattern presentation method were introduced. Results using a number of actual dual-echo MR images with the modified ART2A network show that ART-based networks can be used for segmentation of multi-echo MR images.

1. Introduction

Multi-echo or multispectral MR images are com- monly used in routine diagnosis as they provide more distinguishable patterns for a variety of tissues and contain more pathological information than single echo MR images. The standard MR images depict dual-echoes: the proton density weighted

(PDW) and the spin-spin relaxation time weighted (T2W). Since all echo images are registered spatially over the object space, the information extracted by means of image processing from these multi-echo images is obviously more valuable and reliable than information extracted from singleecho image and then simply adding it together.

The purpose of segmenting human head MR images is t o quantify the brain tissues. This can provide radiologists with pathological information, and the ability t o visualize the shape of tissues and their spatial relationships. It also provides surgeons and radiation therapists with a method for simu- lating or applying treatment. Currently, most of the proposed segmentation methods [l, 15, 16, 101

are based on multi-dimensional d a t a analysis techniques; some of them are supervised and others are unsupervised.

Supervised segmentation [l, 15, 16, 101 methods require intensive user interaction t o collect training samples for each tissue and t o correct the segmentation results. In contrast, unsupervised segmentation

[I,

15, 10, 13, 12, 9, 11, 171 provides an auto- matic approach for extracting tissues from images

without any prior information and thus increasing productivity.

Adaptive Resonance Theory (ART) [3] was introduced by Carpenter and Grossgerg as a theory of human cognitive information processing. This theory led t o a family of real-time neural networks [3, 2, 6, 8, 7, 4, 51 for unsupervised learning and pattern recognition. Among them, ART2 [2] and ART2A [6] can stably learn t o categorize either analog or binary input patterns presented in an arbitrary order (ART2A is an asymptotic case of ART2).

This paper provided a study of the suitability of an ART-based neural network to automatically segment multi-echo MR images. An ART2A network was used t o segment standard dual-echo MR images. Two problems were identified with the basic ART2A proposed by Carpenter and Grossberg [6]. One was that the network was hardly convergent; and the other was that the categorization depended on the presentation order of the patterns. In order to solve these problems, a dynamic learning parameter and a random pattern presentation method were introduced.

Section 2 overviews the proposed method, in- cluding the learning and execution cycles of an ART2A network and the organization of patterns. Section 3 discusses the problems encountered with a

(3)

dual,-echo

MR

images with the proposed improved ART2A network together with the conclusions.

2. Method

2.1. ART2A neural network

The principal components of ART neural networks [3] a.re an attentional subsystem, which contains an input field F1, a category representation field F2, and an orienting subsystem, which interacts with the izttentional subsystem t o carry out an internally controlled search process. The two fields are linked by both a bottom-up,

+

F z , adaptive filter and a top-down, F2

+

F l , adaptive filter. A path from the i'th Fl node t o j ' t h F2 node contains a Long Term Memory (LTM) trace or weight, zij, a path from the j ' t h FI node t o the

i'th

F2 node contains a

LTM weight zji. The states of F1 and F2 are known as Short Term Memory (STM).

[n ART2 or ART2A model, another preprocess- ing field FO is added to carry out contrast enhancement, noise deduction and normalization of input signals before the input goes into field F1. From software simulation point of view, the major parameters characterizing an ART2A network are d ,

a threshold for signal enhancement on Fo; p , a vigilance parameter, 0

5

p

5

1; and @, a learning parameter. Its major operations are

(1) enhancement and normalization on FO of an input pattern Io

where q is a normalizing operation} q x E

&

and

xi if xi

>

d 0 otherwise (F0x)z

=

(2) choice of winning node

J

on F 2 when TJ

=

ma.xj(Tj) and TJ

2

p;

(3) updating or learning of bottom-up LTM z;

I

if J is an uncommitted node

if J is a committed node

denotes the value of z; at the start of (2)

* ( o l d )

wh,ere z J

the input presentation.

* ( o l d )

Ii

if zJi

>

t3

0 otherwise

*<

=

2.2. MRI Data

Multi-echo MR images are normally scanned at,

different echo times (TEs) with a tailored pulse sequence of a certain repetition time (TR). Each echo image represents the magnetic property of a,

thin slice, say 5mm, of human body at a same position along horizontal (coaxial) or saggital or

verticofrontal direction A volume element, called voxel, in the thin slice corresponds t o a pixel in each echo image and its magnetic property appears as the densities of the pixels in all echoes.

As all echo images are spatially registered, if

a voxel is considered as a pattern in a multidimensional framework the p echoes of MRI, each of which is of size M x N , can be represented as

M x N patterns, X = ( x ~ , x ~ , ~ ~ - , x ~ ~ N ) . Each

pattern is characterized as a feature vector in W ,

xi = ( x i l , x i z , . . .

,

x i p ) , where xij represents the density of i'th pixel in j'th echo image.

Our data set consists of 20 slices of actual standard dual-echo head MR images with T R == 1800ms. One echo is PDW scanned at T E = 20ms; the other is T2W scanned at T E = 80ms. The size of each slice is 256 x 256. The resolution is l m m in intraslices and 5mm in interslices.

2.3. Pattern transformation

When a pattern is presented to an ART2A network, its magnitude information is lost after normalization by Fo. However, this information is crucial for

characterization of tissue types in MRI. Hence, the pattern should be transformed so as t o keep the magnitude information before it is used as an input to the network. Here, we use z-axis normalization

~ 4 1 .

Given a pdimensional pattern vector x := (z1

,

22, * *

. ,

xp),

where xi is the density of the i'th echo of MRI, the transformation consists of two steps. The first step is to scale each component zi of the pattern vector x in the range [ a l l ] by dividing

xi by the maximum density of the i'th echo,

Di.

Then

The second step is to apply z-Axis normalization t o the vector x'. Here, an extra dimension is created to encode the magnitude information of XI.

(4)

I P

p

-

xi2

li

i=l

X p + l =

2.4. Segmentation procedures

For N pattern vectors from p echoes:

Step 1 Initialize the network parameters 6,

p,

p , and the maximum number of iterations, IT. Step 2 Select one pattern x from the N patterns. Step 3 Transform x t o x using Eqs.(3)-(7). Step 4 Input x t o the network. The network en-

ters its competing and learning cycle as de- scribed in Section 2.1

for intermediate-learning when ,8 was small, for example 0.01, the initial number of categories learned by the network did not depend on the vigilance parameter p though it should have, as p is one of the most important properties of ART networks. As the network learned gradually from the input patterns iteratively, the number of categories would (7)

Step 5 If all N patterns have been learned by the network, go t o next Step 6; Otherwise, go t o Step 2.

Step 6 If the categorization by the network is stable or the number of iteration is greater than I T , go t o Step 7; Otherwise, increase the number of iteration and go t o Step 2. Step 7 Classify all N patterns with the learned

network and form a coarse segmentation of the MR images.

Step 8 Refine the coarse segmentation.

The values of ,fl and p control the network learning speed and degree of categorization as discussed in [6]. In contrast t o [6], we constrain the value of 6 in the range of (0, &], where Db is the average density of background and D is maximum density of foreground.

In Step 7, a stable categorization learned by the network means that the categories of all patterns keep identical between two successive iterations.

-

be expected t o depend on the vigilance parameter

p . This learning process resulted in a waste of nodes

on F2. A number of nodes on F 2 were only of tem- porary use and could not be accessed any more at the latter stage of learning. Therefore, the number of nodes on F 2 had t o be very large even though the final categories were small.

The second problem, which was intrinsic t o all ART networks, was that the final categories of patterns depended on the order in which the pattern were presented to the network.

3.2. Improvements

For the first problem, a dynamically decreased learning parameter was introduced into the learning procedures; and for the second problem, this could be solved by the use of random selection of input patterns. With these two new approaches, the related steps were modified as follows:

Step 2 Randomly select one pattern from the N

patterns.

Step 6 If the categorization by the network is stable or the number of iteration is greater than

I T , go to Step 7; Otherwise, increase the number of iteration and go to Step 6a. Step 6a Modify the learning parameter

t

IT (8)

Step 8 refines the coarse segmentation so as to p e w ) = (1 - -) x

p W )

correct some misclassification of pixels by Step 7

since only density information is used so far. Nor- mally it is a procedure to check the spatial depen- dency of a pixel on its neighbours.

where, t is the current iteration, pinit is the initial value of

p.

4.

Results and Conclusion

3. Problems and Improvements

3.1. Problems

Implementing the segmentation method by software simulation and applying it to a pair of our dual-echo MR images, it was observed that the results were not as expected. Tracing through the learning process of the network, two problems were identified.

The first one was related to the choice of learning parameter

p.

For fast-learning, when ,8 was large, for example 0.85, the network was unstable;

A number of dual-echo MR images chosen from our data set were used for Segmentation. The results were satisfactory, one is presented in Figure 1. The network parameters used were 6 = 0.05,

p

= 0.8, 0

=

0.80,0.85 for (b) and ( c ) respectively.

(5)

tissue, as shown in Figure l(b); when p = 0.85, these two matters were well separated, as shown in Figure l ( c ) . Finally, the improved network was stable. It can be expected that for our 256 x 256 patterns, the network can reach its stable status within several hundred iterations in comparison with over

2000 iterations of the original ART2A network for

only 46 patterns [6].

ACKNOWLEDGMENT

Wainqing Li is supported by the Overseas Postgrad- uate Research Scholarship (OPRS) of Australia and the University Research Studentship

(URS)

of The University of Western Australia

References

J.C. Bezdek, L.O. Hall, and L.P. Clarke. Re- view of MR image segmentation techniques using pattern recognition. A m . Assoc. Phys. Med.

,

20(4): 1033-48, Jul/Aug 1993.

Gail A. Carpenter and Stephen Grossberg. Art 2: Self-organization of stable category recognition codes for analog input patterns. Applied

Optics, 26(23):4919-30, December 1987. Gail A. Carpenter and Stephen Grossberg. A massively parallel architecture for a self- organizing neural pattern recognition machine.

Computer Vision, Graphics and Image Pro-

cessing, 37(1):54-115, 1987.

Gail A. Carpenter, Stephen Grossberg, and John H. Reynolds. Artmap: Supervised real- time learning and classification of nonstation- ary data by a self-organizng neural network.

Neural Networks, 4:565-588, 1991.

Gail A. Carpenter, Stephen Gross- berg, John H. Reynolds, and David B . Rosen. Fuzzy artmap: A neural network architecture for incremental supervised learning of analog multidimensional maps. IEEE Transactions on Neural Networks, 3(5):698-712, September 1992.

Gail A. Carpenter, Stephen Grossberg, and David B. Rosen. Art 2-a: An adaptive r e s onance algorithm for rapid category learning and recognition. Neural Networks, 4:493-504, 1991.

Gail A. Carpenter, Stephen Grossberg, and David B. Rosen. Fuzzy art: Fast stable learning and categorization of analog patterns by an adaptive resonance system. Neural Networks,

Gail A. Crapenter and Stephen Grossberg. Art 3: Hierarchical search using chemical trans- mitters in self-organizing pattern recognition architectures. Neural Networks, 3 (2) : 129-152, 1990.

4:759-771, 1991. Fig. 1: Segmentation of _{the improved}_ARTZA_{network with}an actual dual-echo _B₌_0.05,pMR images with _{= 0.80.}_(a)

original images; (b) segmentation into 4 tissues at p = 0.810;

(6)

[9] Richard 0. Duda and Peter E. Hart. Pattern Classificatzon and Scene Analysis. John Wiley and Sons, New York, 1973.

[lo] Lawrence 0. Hall, Amine M . Bensaid, and Laurence P. Clarke. A comparison of neural network and fuzzy clustering techniques in segmenting magnetic resonance images of the brain. IEEE Trans. Neural Networks, 3( 5):672-82, Sept 1992.

[11] Teuvo Kohonen. Self-Organization and Asso- caative Memory. Springer-Verlag, Berlin Hei- delberg, second edition, 1988.

[12] Zhengrong Liang, James R. MacFall, and Don- ald P. Harrington. Parameter estimation and tissue segmentation from multispectral mr images. lEEE Trans Medical Imaging, 13(3):441- 449, Sept 1994.

[13] Z Linag, R.J. Jaszczak, and R.E. Coleman. Parameter estimation of finite mixture using the em algorithm and information criteria with application to medical image processing. IEEE Trans Nuclear Science, 39(4):1126-33, 1992.

Practical Neural Network Recipes in C Plus Plus. Academic Press, Inc., 1993.

[15] M Morrison and Y Attikiouzel. An introduction t o the segmentation of megnetic resonance medical images. The Austmlian Computer Journal, 26(3):90-98, Aug 1994.

[ l G ] Mehmed Ozkan, Benoid M. Dawant, and Robert J. Maciunas. Neural-netwrok-based segmentation of multi-modal medical images: A comparative and prospective study. IEEE

Trans. Medical Imaging, 12(3):534-44, Sept 1993.

[17] Nikhil R Pal, James C Bezdek, and Eric C.-K.

Tsao. Generalized clustering networks and k e honen’s self-organizing scheme. IEEE Transac- tions on Neural Networks, 4(4):549-557, July 1993.