• No results found

e-infrastructures for Science and Industry

N/A
N/A
Protected

Academic year: 2021

Share "e-infrastructures for Science and Industry"

Copied!
33
0
0

Loading.... (view fulltext now)

Full text

(1)

e-Infrastructures

for Science and Industry

-Clusters, Grids, and Clouds –

Wolfgang Gentzsch, The DEISA Project and OGF

8th Int. Conference on Parallel Processing and Applied Mathematics Wroclaw, Poland, Sep 13 – 16, 2009

(2)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 2

HPC Centers

They are service providers

, for past 40 years

For research, education, and industry

Computing, storage, apps, data, services

Very professional

(3)

Grids

1998: The Grid: Blueprint for a

New Computing Infrastructure

2002: The Anatomy of the Grid

Ian Foster, Carl Kesselman, Steve Tuecke Ian Foster, Carl Kesselman

(4)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 4

Grids (Sun in 2001)

Departmental

Grids

Enterprise

Grids

Global

Grids

Clouds

(5)

Public Clouds

• Outside corporate data center • Access over the Internet

• Virtual (Vmware, Xen,...) • Abstraction of the hardware

• Service oriented: SaaS, PaaS, IaaS, HaaS • Variable cost of services (QoS)

• Pay-per-use IT services • Scaling up/down

IaaS, PaaS, SaaS

Access

Elasticity

Abstraction

Public, private, hybrid

Capex => Opex

Pay-per-use

Scaling

and

we

have

all

the

components

available

today

Clouds

(6)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 6

Benefits of moving HPC to Grids

Closer collaboration with your colleagues (VCs)

More resources allow faster/more processing

Different architectures serve more users

Failover: move jobs to another system

. . . and Clouds

No upfront cost for additional resources

CapEx => OpEx, pay-per-use

Elasticity, scaling up and down

(7)

The Cloud of Cloud Companies

• Akamai • Areti Internet • Enki • Fortress ITX • Joyent • Layered Technologies • Rackspace • Terremark • Xcalibre • Manjrasoft / Aneka • GridwiseTech / Momentum • NICE/EnginFrame • Amazon • Google • Sun • Salesforce • Microsoft • IBM • Oracle • EMC • Cloudera • Cloudsoft

(8)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 8

NICE EnginFrame

Cluster/Grid/Cloud Portal

Remote, interactive, transparent, secure access to apps & data on corporate Intranet or Internet, or in the Cloud.

Interactive Applications Intranet Clients Win LX UX Mac Intranet Clients Win LX UX Mac Virtualized Data Center Clusters

Users Batch Applications Virtualized Storage Cloud Portal / Gateway Cloud Portal / Gateway Administrators Administrators Users Administrators Administrators Users S ta n d a rd p ro to c o ls S ta n d a rd p ro to c o ls Licenses

(9)

A Scalable Data Cloud Infrastructure

Example:

GridwiseTech Momentum

(10)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 10

ANEKA Cloud Platform

Private Cloud LAN network Amazon Microsoft Google Sun Data Center Virtual Machines

Windows Mac with Mono Linux with Mono

A n e k a C lo u d P la tf o r m IaaS PaaS

SaaS Cloud applications

Social computing, Enterprise, ISV, Scientific, CDNs, ...

Cloud Programming Models & SDK

Task Model Task Model Thread Model Thread Model Map Reduce Model Map Reduce Model Third Party Models Third Party Models

Core Cloud Services

SLA Management SLA Management VM Management VM Management VM Deployment VM Deployment QoS Negotiation QoS Negotiation Job Scheduling Job Scheduling Execution Management Execution Management Pricing

Pricing BillingBilling

Admission Control Admission Control Metering Metering Data Storage Data Storage Monitoring Monitoring Workflow Model Workflow Model Private Cloud LAN network Private Cloud Private Cloud Private Cloud LAN network Amazon Microsoft Google Sun Amazon Microsoft Google Sun Data Center Virtual Machines

Windows Mac with Mono Linux with Mono

Virtual Machines

Windows

Windows Mac with MonoMac with Mono Linux with MonoLinux with Mono

A n e k a C lo u d P la tf o r m IaaS PaaS

SaaS Cloud applications

Social computing, Enterprise, ISV, Scientific, CDNs, ...

Cloud Programming Models & SDK

Task Model Task Model Thread Model Thread Model Map Reduce Model Map Reduce Model Third Party Models Third Party Models

Core Cloud Services

SLA Management SLA Management VM Management VM Management VM Deployment VM Deployment QoS Negotiation QoS Negotiation Job Scheduling Job Scheduling Execution Management Execution Management Pricing

Pricing BillingBilling

Admission Control Admission Control Metering Metering Data Storage Data Storage Monitoring Monitoring Workflow Model Workflow Model Courtesy: Manjrasoft

(11)
(12)
(13)

Animoto EC2 image usage

Day 1 Day 8

0 4000

(14)

‚My‘ current project:

DEISA: Grid or Cloud ?

Distributed European Infrastructure for Supercomputing Applications

(15)

DEISA1: May 1st, 2004 – April 30th, 2008

DEISA HPC Centers

(16)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 16 Gateway CSC Gateway ECMWF Gateway FZJ Gateway IDRIS Gateway SARA Gateway LRZ Gateway HPCX Gateway HLRS NJS CINECA IBM P5 IDB UUDB Gateway BSC Gateway CINECA NJS FZJ IBM IDB UUDB NJS RZG IBM IDB UUDB NJS ECMWF IBM P5 IDB UUDB NJS CSC Cray XT4/5 IDB UUDB NJS HPCX Cray XT4 IDB UUDB NJS LRZ SGI ALTIX IDB UUDB NJS HLRS NEC SX8 IDB UUDB CINECA user LRZ user job job NJS SARA IBM IDB UUDB NJS BSC IBM PPC IDB UUDB Gateway RZG NJS IDRIS IBM P6 IDB UUDB AIX LL-MC AIX LL LINUX PBS Pro Super-UX NQS II GridF TP LINUX Maui/Slurm UNICOS/lc PBS Pro LINUX LL AIX LL-MC AIX LL-MC UNICOS/lc PBS Pro AIX LL-MC

(17)

Technologies req ues tssu pp ort Applications Operations off ers pro du ct req u es ts con figu ra tion o ffe rs se rv ice offers technology requests development

(18)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 18 DEISA Sites Unified Unified AAA AAA Network Network connectivity connectivity Data Data transfer transfer tools tools Data staging Data staging tools tools Job Job rerouting rerouting Single Single monitor monitor system system Co Co- -reservation reservation and co and co- -allocation allocation Workflow Workflow managemnt managemnt Multiple Multiple ways to ways to access access Common Common production production environmnt environmnt WAN WAN shared shared File system File system Network and AAA layers Job manag. layer and monitor. Presen-tation layer Data manag. layer

(19)

AIX LL-MC AIX LL LINUX PBS Pro Super-UX NQS II GridF TP UNICOS/lc PBS Pro LINUX LL AIX, Linux LL-MC AIX, Linux LL-MC IBM P5

IBM P6 & BlueGene/P

IBM P6 & BlueGene/P IBM P6 Cray XT4/5 Cray XT4 SGI ALTIX NEC SX8 IBM P5+ / P6 IBM PPC

IBM P6 & BlueGene/P

UNICOS/lc

PBS Pro

AIX, Linux

LL-MC

DEISA Global File System

LINUX

Maui/Slurm

Global transparent file system based on the Multi-Cluster General Parallel File System (MC-GPFS of IBM)

(20)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 20

User management in DEISA

• A dedicated LDAP-based distributed repository administers DEISA users

• Trusted LDAP servers are authorized to access each other (based on X.509 certificates) and encrypted communication is used to maintain confidentiality

BSC CINECA CSC ECMWF EPCC FZJ HLRS IDRIS LRZ RZG SARA SARA

(21)

DEISA: Grid or Cloud ?

• Built on top of

proven

, professional infrastructure of HPC centers with expertise in implementation, operation, services.

• Ecosystem of resources, middleware, applications is

respecting

administrative, cultural and political autonomy of partners.

• Globalizing existing HPC services from local to global -according to user requirements: revolution by

evolution

.

• User support: user-friendly access to resources,

porting

user apps onto turnkey architecture.

• After EU funding, DEISA HPC ecosystem will operate in a

sustainable

way, in the interest of the ‘global scientist’, as...

(22)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 22

There are still many

Challenges with Clouds

Sustainable Sustainable Competitive Competitive Advantage Advantage CULTURAL CULTURAL TECHNICAL TECHNICAL LEGAL & LEGAL & REGULATORY REGULATORY

(23)

• Not all applications are cloud-ready or cloud-enabled

• Interoperability of clouds (standards ?)

• Sensitive data, sensitive applications (med.patient records) • Different organizations have different ROI

• Security: end-to-end from your resources to the cloud ! • Current IT culture is not predisposed to sharing resources • “Static” licensing model doesn’t embrace cloud

• Protection of intellectual property • Legal issues (FDA, HIPAA)

(24)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 24

A Cloud Checklist for HPC

When is your HPC app ready for the Cloud ?

... no issues with licenses, IP, secrecy, privacy, sensitive

data and big data movement, legal or regulatory issues, trust, . . .

...your app is architecture independent, not optimized for

specific architecture (single process, loosely-coupled low-level parallel, I/O-robust)

...it’s just one app and zillions of parameters ...latency and bandwidth are not an issue

Ideally, your meta-scheduler knows your

requirements and schedules automatically

(25)

Hybrid Grid/Cloud

Resource Management

External Cloud

Resources Department 1

Department 2

Department resource access Campus wide resource demand

Project A Team B Contractor X Project C User 1 User 2 Department 3 Department 4

Define policies according to priorities, budget, and time

(26)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 26

(27)
(28)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 28

(29)
(30)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 30

A Closer Look at HPC Load

Single parallel job, cpu-intensive, tightly-coupled,

highly scalable, peta, exa,..

Single parallel job, cpu-intensive, weakly-scalable

Capacity computing, throughput, parameter jobs

Managing massive data sets, possibly

geographically distributed

Analysis and visualization of data sets

(31)

Clouds and supercomputers:

Conventional wisdom?

Too slow

Too

expensive

Clouds/ clusters Super computers Loosely coupled applications Tightly coupled applications Courtesy Ian Foster

(32)

PPAM, Wroclaw, Sept 2009 Wolfgang Gentzsch 32

3

Loosely coupled problems

• Ensemble runs to quantify climate model uncertainty

• Identify potential drug targets by screening a database of ligand structures against target proteins

• Study economic model sensitivity to parameters

• Analyze turbulence dataset from many perspectives

• Perform numerical optimization to determine optimal resource assignment in energy problems

• Mine collection of data from advanced light sources

• Construct databases of computed properties of chemical compounds

• Analyze data from the Large Hadron Collider

• Analyze log data from 100,000-node parallel computations

(33)

Thank You

Dziekuje

References

Related documents

This document contains the following sections: • Import CA certificate to appliance • Create local certificates for appliances.. • Create VPN Tunnel (IKE using 3rd

14 PSTN side PSTN side Signalling gateway Signalling gateway Media gateway controller Media gateway controller Media gateway Media gateway IP side IP side ISUP/IP Megaco/MGCP

174. A merger is a combination of two or more businesses down below a single management. Bats rely on their hearings to navigate and to find food at night. Sugar cane, a native

are connected to the Gateway but not active, Web Console, Master and Gateway Administrator will be automatically disconnected from the Gateway, and input control will be

Presentamos un estudio llevado a cabo con estudiantes del Profesorado en Educación Inicial de la Universidad Nacional de Río Cuarto (UNRC) y del Instituto Superior María

Nevertheless, several studies assessed both phenotype and in vitro function of effector immune cells from tumor patients [ 16 – 18 ], but only limited data are available to date

Homemaker Services - hourly rate multiplied by 44 hours per week, multiplied by 52 weeks Home Health Aide Services - hourly rate multiplied by 44 hours per week, multiplied by 52

Here, we consider two models for the pattern of interconnections namely (i) a sparse random network (SRN) with fixed conductances through out the network and (ii) a fully