A Service for Data-Intensive Computations on Virtual Clusters

(1)

A Service for

Data-Intensive Computations

on Virtual Clusters

Rainer Schmidt, Christian Sadilek, and Ross King

[email protected]

(2)

Planets Project

“Permanent Long-term Access through NETworked Services”

Addresses the problem of digital preservation

driven by National Libraries and Archives

Project instrument: FP6 Integrated Project

5. IST Call

Consortium: 16 organisations from 7 countries

Duration: 48 months, June 2006 – May 2010

Budget: 14 Million Euro

(3)

Outline

Digital Preservation

Need for eScience Repositories

The Planets Preservation Environment

Distributed Data and Metadata Integration

Data-Intensive Computations

Grid Service for Job Execution on AWS

Experimental Results

(4)

Drivers for Digital Preservation

Exponential growth of digital data across all sectors of society

Large quantities of often irreplaceable information

Research and scientific data

Data becomes significantly more complex and diverse

Digital information is fragile

Ensure long term access and interpretability

Requires data curation and preservation

Development and assessment of preservation strategies

(5)

The Planets Environment

A collaborative research infrastructure for the systematic

development and evaluation and of preservation strategies.

A decision-support system for the development of

preservation plans

An integrated environment for dynamically providing,

discovering and accessing a wide range of preservation tools,

services, and technical registries.

A workflow environment for the data integration, process

execution, and the maintenance of preservation information.

(6)

Service Gateway Architecture

Preservation Planning (Plato) Experimentation Testbed Application Notification and Logging System Workflow Execution UI Workflow Execution and Monitoring Experiment Data and Metadata Repository Service and Tool Registry Application Services Execution Services Data Storage Services Administration UI Authentication and Authorization User Applications Portal Services

Application Execution and Data Services

(7)

Data and Metadata Integration

Support distributed and remote storage facilities

data registry and metadata repository

Integrate different data sources and encoding standards

Support range of protocols and description schemas

Connect distributed data repositories and archives.

Development of common digital object model

Dynamic mapping of objects from different institutions

(memory inst., research collections, content networks)

Recording of provenance information, preservation,

integrity, and descriptive metadata

(8)

Data Intensive Applications

Development Planets Job Submission Services

Allow Job Submission to a PC cluster (e.g. Hadoop, Condor)

Standard grid protocols/interfaces (SOAP, HPC-BP, JSDL)

Demand for massive compute and storage resources

Instantiate cluster on top of (leased) cloud resources based on AWS EC2 + S3, (or alternatively in-house in a PC lab.)

Allows computation to be moved to data or vice-versa

On-demand cluster based on virtual machine images

Many Preservation Tools are/rely on 3rd party applications

(9)

Experimental Architecture

JSDL

Virtual Node (Xen)

JSS

Data Service

Cloud Infrastructure (EC2)

Storage Infrastructure

(S3) Virtual Cluster and File System (Apache Hadoop)

Digital Objects

Job Description File

Job HPCBP Workflo w Envir onment

(10)

A Light-Weight Job Execution Service

TLS/WS Security WS - Container BES/HPCBP API Account Manager JSDL Parser Session Handler Exec. Manager Data Resolver Input Generator Job Manager

(11)

A Light-Weight Job Execution Service

TLS/WS Security WS - Container BES/HPCBP API Account Manager JSDL Parser Session Handler Exec. Manager Data Resolver Input Generator Job Manager HDFS S3 Hadoop

(12)

The MapReduce Application

Map-Reduce implements a framework and prog. model for processing large documents (Sorting, Searching, Indexing) on multiple nodes.

Automated decomposition (split)

Mapping to intermediary pairs (map), optionally (combine)

Merge output (reduce)

Provides implementation for data parallel + i/o intensive applications

Migrating a digital object (e.g web page, folder, archive)

Decompose into atomic pieces (e.g. file, image, movie)

On each node, process input splits

(13)

Experimental Setup

Amazon Elastic Compute Cloud (EC2)

1 – 150 cluster nodes

Custom image based on Fedora 8 i386

Amazon Simple Storage Service (S3)

max. 1TB I/O per experiment

All measurements based on a single core per virtual machine

Apache Hadoop (v.0.18)

MapReduce Implementation

Preinstalled command line tools

Quantitative evaluation of VMs (AWS) based on execution time, number of tasks, number of nodes, physical size

(14)

Experimental Results 1 – Scaling Job Size

0,00 1,00 2,00 3,00 4,00 5,00 6,00 7,00 8,00 9,00 10,00 1 10 100 1000 tasks ti m e [m in ] EC2 0,07 MB EC2 7,5 MB EC2 250 MB SLE 0,07 MB SLE 7,5 MB SLE 250 MB number of nodes = 5 x(1k) = 3,5 x(1k) = 4,4 x(1k) = 3,6 x(1k) = t_seq / t_parallel and tasks = 1000

(15)

Experimental Results 2 – Scaling #nodes

0,00 5,00 10,00 15,00 20,00 25,00 30,00 35,00 40,00 0 50 100 150 nodes ti m e [m in ] EC2 1000 x 0,07 MB SLE 1000 x 0,07 MB X X X X _X X n=1, t=36, s1 = 0.72, e=72% n=5, t=4.5, s=3.3, e=66% n=10, t=4.5, s=5.5, e=55% n=50, t=1.68, s=16, e=31% n=100, t=1.03, s=26, e=26% n=1 (local), t=26

(16)

Results and Observations

AWS + Hadoop + JSS

Robust and fault tolerant, scalable up to large numbers of nodes

~32,5MB/s download / ~13,8MB/s upload (cloud internally)

Also small clusters of virtual machines reasonable

S = 4,4 for p = 5, with n=1000 and s=7,5MB

Ideally, size of cluster grows/shrinks on demand

Overheads:

SLE vs. Cloud (p=1, n=1000) 30% due to file system master

In average, less then 10% due to S3

(17)

Conclusion

Challenges in digital preservation are diversity and scale

Focus: scientific data, arts and humanities

Repositories Preservation systems need to be embedded with large-scale distributed environments

grids, clouds, eScience environments

Cloud Computing provides a powerful solution for getting on-demand access to an IT infrastructure.

Currently integration issues: Policies, Legal Aspects, SLAs

We have presented current developments in the context of the EU project Planets on building an integrated research environment for the development and evaluation of preservation strategies

A Service for Data-Intensive Computations on Virtual Clusters

A Service for

Data-Intensive Computations

on Virtual Clusters

Planets Project

Outline



Digital Preservation



Need for eScience Repositories



The Planets Preservation Environment



Distributed Data and Metadata Integration



Data-Intensive Computations



Grid Service for Job Execution on AWS



Experimental Results

Drivers for Digital Preservation



Exponential growth of digital data across all sectors of society



Large quantities of often irreplaceable information



Research and scientific data



Data becomes significantly more complex and diverse



Digital information is fragile



Ensure long term access and interpretability



Requires data curation and preservation



Development and assessment of preservation strategies

The Planets Environment



A collaborative research infrastructure for the systematic

development and evaluation and of preservation strategies.



A decision-support system for the development of

preservation plans



An integrated environment for dynamically providing,

discovering and accessing a wide range of preservation tools,

services, and technical registries.



A workflow environment for the data integration, process

execution, and the maintenance of preservation information.

Service Gateway Architecture

Data and Metadata Integration



Support distributed and remote storage facilities



data registry and metadata repository



Integrate different data sources and encoding standards



Support range of protocols and description schemas



Connect distributed data repositories and archives.



Development of common digital object model



Dynamic mapping of objects from different institutions

(memory inst., research collections, content networks)



Recording of provenance information, preservation,

integrity, and descriptive metadata

Data Intensive Applications

Experimental Architecture

A Light-Weight Job Execution Service

A Light-Weight Job Execution Service

The MapReduce Application

Experimental Setup

Experimental Results 1 – Scaling Job Size

Experimental Results 2 – Scaling #nodes

Results and Observations