• No results found

Cloud Computing Service for Managing Large Medical Image Data-Sets Using Balanced Collaborative Agents

N/A
N/A
Protected

Academic year: 2021

Share "Cloud Computing Service for Managing Large Medical Image Data-Sets Using Balanced Collaborative Agents"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

Cloud Computing Service for Managing Large

Medical Image Data-Sets Using Balanced

Collaborative Agents

Raúl Alonso-Calvo, José Crespo, Víctor Maojo, Alberto Muñoz, Miguel García-Remesal, and David Pérez-Rey

Abstract. Managing large medical image coltections is an increasingiy demanding important issue in many hospitals and other medica) settings. A huge amount of this informaüon is daily generated, which requires robust and agüe systems. In this paper we present a distributed multi-agent system capable of managing very large medical image dataseis. In this approach, agents extract low-Ievel informaüon from ímages and store them in a data structure impiemented in a relational datábase. The data structure can also store semantic información related to images and particular regions. A distinctive aspect of our work is that a single image can be divided so that the resultant sub-images can be stored and managed separately by different agents to improve performance in data accessing and processing. The system also offers the possibüity of applying some region-based operations and filters on images, fa-cilitating image classification. These operations can be performed directly on data structures in the datábase.

1 Introduction

Different medical imaging devices and techniques used in clinical routine in many settings, such as CT, MRI, PET, X-Rays, etc., genérate a huge quantity of informa-tion every day. Moreover, continuous advances in the optical resoluinforma-tion of medical capturing instruments and sensors imply that image sizes are growing, introducing larger síorage requirements.

Raúl Alonso-Calvo • José Crespo • Victor Maojo • Miguel García-Remesal • David Pérez-Rey DLSIÍS & DÍA - Biomedical Informados Group,

Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo, Avda Monteprfncipe, S/N, 28660 Boadiila del Monte, Spain

e-maii: [email protected] . f i .upm. es Alberto Muñoz

(2)

Building effective storage and indexation systems for medical images poses ¡j challenging problem. The standard approach Pie ture Archiving and Communicib üon Systems (PACS) aims to address and solve important issues like robuslnes% scalability, interoperability, and simple access to data. For this purpose there are scv> eral projeets that use image databases with Content-Based Image Retrievai (CBIR) techniques.

Different kinds of CBIR have been proposed. CBIR systems usually utiiize lew-level features (such as, for exainple, color histogram techniques and textureanalysi^O for performing image matching. In this group we can include well-known systtíim like QBIC [7], Virage [3], VisualSeek [11] and Blobworld [4]. However, medien! images and their contents are usually linked to semantic information such as patio ni data, dale, image type, and additional information (such as the body parts shown in the image). By using such textual information, software systems can refine contení based image searches.

In this context, storing and managing this medical imaging information becomes a crucial task for most medical organizalions. Besides, inlegrating this multimedia information with other textual sources is useful for difierent aspeets related to clin ical workflows, such as diagnosis, therapeutiedecisions, research and teaching.

In this paper we present a distributed multi-agent prototype system capable of storing and indexing very large image collections. This approach extends previous work on our group in ontology-based datábase integration of structured [2] and un structured sources [10], [8] and in image processing [5], [6].

Distributing large image databases is commonly used for improving scalabiiiiy and systems response. A distinctive aspect of our work with respect to other systems previously mentioned is that, in those other systems, data distribution is limita! to spreading images to different servers, whereas in our sysíem an image can be divided so that image parts can be stored in two or more diííerent agents. This aims to improve performance.

This paper is organized as follows. In the next section we describe the methods used to créate our multiagent system for storing large collections of images. Next, we present and discuss the results obtained for querying and retrieving information from stored images in the system. Finally, a conclusión section ends the paper.

2 Methods

2.1 Cloud Computing Service

As is well known, cloud computing [9] isintendedioofferan interfaceto use virtual resources accessibie on demand, and they can be controlled by the service owner. A goal of our work is to offer a cloud computing service for storing, analyzing and retrieving medical multimedia information. Institutions and medical professionais that do not have enough storage and computing resources couid manage medical data through appHcations buiit on top of these types of services. This kind of Virtual PACS based on the Cioud is different to other previously reponed systems.

(3)

The Cioud computing service is accessed by users having an authorízed account. images loaded by users are normally prívate, although they can be set as public if desired. By using Cloud services for accessmg images, prívate PACS belonging to concrete medica! organizations can be extended in a relativeiy easy manner.

2.2 Multi-Agent System Architecture

A system prototype for storing and managing images has been implemenled foliow-ing the agent-oriented paradígm. There are three agent roles in the system:

(i) The Cloud Computing Service Access Point Agent is a unique agent in the system. This agent is an entry point for final registered users. It offers methods for (a) storing images in the system, (b) applying operations on stored images, and (c) querying for images in the system.

(ii) The Resource Index Agent can be seen as a directory service. This agent con-tains information about location, current load, estimation funcíion for perfor-mance prediction, and image processing status, of each working agent of the system.

(iii) Working Agents are at the core of the system. They contaín all the functionali-ties implemented in the system, namely extracting image low-level and seman-tic features, storing images and its associated information, retrieving them, and applying filters and operations.

Each working agent manages a relational datábase impíementing the region-based datastructure created for storing images. Besides, they have methods implemented to be capable of:

• Dividinganinput image. As resultof this división asetofprocessedregionsand two untreated subimages are obtained. Detailed explanation of the algorithm used can be found in our previous work [ 1 ].

• Extracting regions, low-level descriptors, relationships between regions, and textual semantic information (contained, if available, in DICOM headers) from the input image.

• Storing processed information in the graph-based data-stiucture. By ínseriing the data obtained in a relational datábase with the graph-based datastructure logical schema.

• Sending untreated parts of the input image, obtained in the división step, to other agents to be processed.

• Obtaining and updating information from the resource Índex agent for load balancing.

• Communicating to other agents to synchronize information of divided images. • Applying filters and operations that have been implemented in the system. We

(4)

2.3 Managing Images in the Prototype: Agent Behaviors

We can dislinguish three differení behaviors of agents in the system depending on the kind of task that has to be performed: (i) processing and storing a new imago, (ii) performing a filter or operation on an image, and (iii) querying the system for image contents (this query can be a query by image-content, or simply accessing lo parts of a given image).

(i) When a new image is introduced in the system to be stored, the itiput image is sent to a working agent thal follows an algorithm with six main steps, depicteij in Figure i. Note: An ínput image is divided if its área is larger tharj a giveti thresholdflj.

(ii) When a petition of appiying a filter or operation on an image is received in the cloud computing service access point agent, it spread the message to thoso working agents that contains sub-images of the processed image. Then those working agents initialize the requested operation in their stored sub-images be-longing ío the selected image. These filters and operations are directly applied to the daía-structure stored in the agent iocal datábase. Distributed adaptations of some most representative morphological region-based operations has been developed the system, namely erosión, dilation, and watershed.

(iii) When the system is queried by a final user for obtaining a parí of the information stored (through the cloud computing service access point), the query is propa-gaíed toali agents of the system (asín project Ontofusion [2], [10}). Each agent returns information matching with the user query restriclions that are stored in its datábase. Note that results are not necessarüy entire images. All resuíts are returned by the cioud computing service access point agent ío the final user.

(5)

3 Evaluation and Discussion

We have conducted various image retrieval experiments using a set of images stored in the system. As ülustrative cases that correspond to three selected images of dif-ieren t sizes, we have displayed in some figures execution times for three images: ímage 1 is 1000x1000, Image 2 is 3000x3000, and Image 3 is 5000K5000. Results in seconds for some example quenes in the system are shown when the number of agents (which are deployed in different servers) is increased from 1 to 3. Images are divided among the availabie agents. When more than one agent is invoived, the execution time is the máximum of all agents.

Figure 2 shows the time reduction in a crop operaüon (where a 300x600 pixel rectangie is retrieved from the image, with the corresponding label of the región to which it belongs).

Finaily, Figure 3 visualices the execution time of a more compíex image multi-criteria query. Particularly, the query has retrieved ail regions having (a) an área less than a constant, (b) an intensity valué greater than a threshold, and (c) both a major and a minor axis smaller than 90% the image size (such a muki-criteria query is often used to elimínate background regions).

As can be observed the time needed for executing íhese quenes decrease when the number of agents in the system íncreases. The data división and distributed manage-ment among different agents, has decreased the time needed by datábase searches and operations.

Many of the existing datábase CBIR use distributed systems for storing images. This image distribution in different servers reduces the time needed for datábase quenes. Besides this kind of paraliehzation, our system can aiso divide input images and store subimages in a distributed manner to further reduce the time for quedes. This paper has focused on this second, novel type of reduction. Another important feature of our system is that operations are applied directly to databases.

18 , í ;

¡Agcnl ÍAgenl 3Agcnl 1 Aflenl 2 Agón! SAgonl

Fig. 2 Crop operation Fig. 3 MuUi-criteria query

4 Conclusions and Future Work

In this paper we have presented a scalabie multi-agent system prototype capabie of storing very large image collections. The system offers methods for managing,

(6)

anaiyzing and querying images by contení through a cloud computing service, Tho system uses a distributed region-oriented data-structure that can be stored, in a div tributed manner, in one or more databases. Some experimental results have bcen provided that show thc performance increase in normaHy used operations when th<; number of agents is increased.

First, we plan to extend the image operators implemented by adding new ones, such as región merging. Another planned improvement is to créate methods in (he cioud computing service for offermg inteliigent streaming of image contents, which can be used to visualize complex medical image data.

Acknowledgements

This work has bcen supporíed in parí by "Ministerio de Ciencia e Innovación" of Spain (Reí.: TIN2007-61768).

References

1. Aionso-Caivo, R., Crespo, J., García-Remesal, M„ Anguila, A.: On dislributing load in cloud computing: A real application for very-large image dataseis. In: Inter-national Conference on Computationai Science, vol. 1{¡), pp. 2663-2671 (2010), doi:]0.1Ü16/j.procs.20i0.04.300

2. Aionso-Caívo, R., Maojo, V., Biílhardt, H., Marlín-Sánchez, F, García-Remesal, M., Pérez-Rey, D.: An agent- and ontology-based system for inlegrating publie gene, protein, and disease databases. Journal of Biomedical Informatics 40(1), 17-29 (2007)

3. Bach, J.,et al.: Vírage Image search engine: An open framework for image managemeni. In: SPIE Storage and Retrieval for image and Video Databases, vol. 2670, pp. 76-87 (19%)

4. Carson, CThomas, M., Belongie, S., Hcllcrstein, J.M., Malik, J.: Blobworid: A system for region-based image indexing and retrieval. In: Huijsmans, D.P., Smeuíders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 509-517. Springer, Heidelberg (1999) 5. Crespo, J., Serra, J., Schafer, R.W.: Thcoretical aspeéis of morphoiogical fiílers by

re-construcción. Signai Process. 47(2), 201-225 (1995)

ó. Crespo, J., Schafer, R.W., Sena, J., Gratín, C , Meyer, R: The fial zone approach: a general low-levei región merging segmentaiion method. Signa! Process. 62(1), 37-60 (1997)

7. Flickner, M., eí al: Query by Image and Video Contení: The QBIC System. IEEE Com-puter 28(9), 23-32 (1995)

8. Maojo, V., García-Remesal, M., Biílhardt, H., Alonso-Calvo, R., Pérez-Rey, D., Martín-Sánchez, F.: Designing new methodologies for inlegrating biomedical informalion in clinicai triáis. Methods of Information in Medicine 45(2), 180-185 (2006)

9. NIST. definilion of Cloud Computing, vol. 15 NÍST (2010),

h t t p ; / / c s r c .n i s t . g o v / g r o u p s / S N S / c l o u d - c o m p u t i n g /

(cited April201I)

10. Pérez-Rey, D., Maojo, V., García-Remesal, M., Alonso-Calvo, R., Bilíhardt, H., Martín-Sánchez, F., Sonsa, A,: Ontology-based iniegration of genomic and clinicai databases. Compuí. Biol. Med. 36(7-8), 712-730 (2006)

11. Smiíh, J., Chang, S.: Blobworid: VisualSeek: A fully Automated conteni-based image query system. In: ACM International Conference on Multimedia, pp. 87-98 (1996)

References

Related documents

addressing girl-friendly WASH facilities, providing biologically accurate and skills- based health and hygiene information on menstruation to boys and girls and ensuring access

8 "% Z 8Q # bahw, o de “T” invertida que se observa en la crujía norte del Palacio de Comares, en donde la gran Sala de la Barca, de planta rectangular, hace las veces de nártex

Abstract: Detrital zircons from pre-orogenic Upper Ordovician to Devonian low-grade metasedimentary rocks have been dated by laser ablation–inductively coupled plasma mass

O Form will be signed by PI/Co-PI, Department Chair, and Dean O Submit the Grant form to BIRS at least 2 weeks prior to the.

Figures: (1) cylindrical plastic cage to maintain adults; (2) sexual dichromism: male (left) and female (right); (3) detail of cotton disc (indicated by the arrow) for oviposition

This strategy is problematic because it is at odds with the mechanismic commitment to the ‘details of scientific practice’ ( MDC, 2000, p. 2 ), given that evolutionary biologists

The presentation has not been updated since it was originally presented, and does not constitute a commitment by any CDF entity to underwrite, subscribe for or place any securities or

Activity data for each individual sector were collected from various sources based on activity type (e.g. electricity usage from utility companies, fuel consumption from the