• No results found

CERN s Scientific Programme and the need for computing resources

N/A
N/A
Protected

Academic year: 2021

Share "CERN s Scientific Programme and the need for computing resources"

Copied!
20
0
0

Loading.... (view fulltext now)

Full text

(1)

CERN’s Scientific Programme and the need for

computing resources

Bob Jones

Head of openlab

IT dept

CERN

This document produced by

Members of the Helix Nebula consortium

is licensed under a

Creative Commons Attribution 3.0 Unported License

.

Permissions beyond the scope of this license may be available at

http://helix-nebula.eu/.

The Helix Nebula project is co-funded by the European Community Seventh Framework

Programme (FP7/2007-2013) under Grant Agreement no 312301

(2)

2

CERN founded 1954:

“Science for Peace”

~ 2300 staff

~ 1050 other paid personnel

~ 11000 users

Budget (2012) ~1000 MCHF

20 Member States:

Austria, Belgium, Bulgaria,

the Czech Republic, Denmark, Finland, France, Germany,

Greece, Hungary, Italy, Netherlands, Norway, Poland,

Portugal, Slovakia, Spain, Sweden, Switzerland and

the United Kingdom

1 Candidate for Accession:

Romania

8 Observers to Council:

India, Israel, Japan,

the Russian Federation, the United States of America,

Turkey, the European Commission and UNESCO

(3)

Community collaboration on an international scale

Tim Bell, CERN

3

OpenStack Conference, Boston

(4)

Accelerating Science and Innovation

(5)

5

200-400 MB/sec

Data flow to permanent storage: 4-6 GB/sec

1.25 GB/sec

1-2 GB/sec

(6)

CERN IT Department

CH-1211 Genève 23

Switzerland

www.cern.ch/it

6

The LHC Data Challenge

• The accelerator will run for 20

years

• Experiments are producing more

than 20 Million Gigabytes of

data each year

• LHC data analysis requires a

computing power equivalent to

~100,000 of today's fastest PC

processors

• Requires many cooperating

computer centres, as CERN can

only

provide

<20% of the

capacity

(7)

A distributed computing

infrastructure to provide the

production and analysis

environments for the LHC

experiments

Managed and operated by a

worldwide collaboration

between the experiments and

the participating computer

centres

The resources are distributed

– for funding and sociological

reasons

Our task was to make use of

the resources available to us –

no matter where they are

located

May 2012 - Frédéric Hemmer

WLCG – what and why?

Tier-0 (CERN):

• Data recording

• Initial data reconstruction

• Data distribution

Tier-1 (11 centres):

•Permanent storage

•Re-processing

•Analysis

Tier-2 (~130 centres):

• Simulation

• End-user analysis

7

(8)

0 100000000 200000000 300000000 400000000 500000000 600000000 700000000 800000000 900000000 1E+09

Ja

n-10

Fe

b-10

M

ar

-1

0

Ap

r-1

0

M

ay

-1

0

Ju

n-10

Ju

l-1

0

Au

g-10

Se

p-10

Oc

t-1

0

No

v-10

De

c-10

Ja

n-11

Fe

b-11

M

ar

-1

1

Ap

r-1

1

M

ay

-1

1

Ju

n-11

Ju

l-1

1

Au

g-11

Se

p-11

Oc

t-1

1

No

v-11

De

c-11

LHCb

CMS

ATLAS

ALICE

WLCG in 2011

22 PB data written in 2011

More than 6 GB/s to tape during HI run

Dec. 11

CMS has used close to 100% of

the Tier-1 pledge over all of 2011

Mostly Reprocessing of Data and

Simulation

Expect the trend to continue in

2012

6

0% 20% 40% 60% 80% 100% 120% 140% 160% 1 2 3 4 5 6 7 8 9 10 11 Pe rc e nt a g e o f U til iz e d H o ur s Month 2010

Usage Tier-1 Only

ALICE Percentage Use ATLAS Percentage Use CMS Percentage Use LHCb Percentage Use Total Percentage Use

Status Data Re-Processing and MC Production

12/01/11

6

Era Pass Events Total Size [TB]

Run2010A+B Apr21 Run2011A Apr13 Run2011A Apr22 Run2011A May3 Run2011A May7 Run2011A May10 Run2011A May13 Run2011A 16Jun2011 Run2011A 02Jul2011 Run2011A 05Jul2011 Run2011A 16Jul2011 Run2011A 05Aug2011 Run2011A 31Aug2011 Run2011A 16Sep2011 Run2011A 20Sep2011 Run2011A 03Oct2011 Run2011A 06Oct2011 Run2011A 21Oct2011 Run2011A 02Nov2010 Run2011A 08Nov2011 Run2011B 08Nov2011 Run2011A 15Nov2011 Run2011B 15Nov2011 Run2011B 19Nov2011 Run2011A 26Nov2011 Run2011B 26Nov2011 1,420,077,332 388.89 40,729,454 2.74 10,767,224 9.59 12,808,958 9.13 29,085,527 21.57 476,783,419 257.65 18,604,900 5.59 463,744 0.36 144,560,884 130.85 180,655,731 161.52 56,168,211 8.55 123,381,634 110.63 273,081 0.65 52,384,460 28.10 3,347,184 0.00 59,822,214 69.45 13,730 0.00 12,657,332 8.19 4,053,868 0.64 1,238,155,918 626.75 5,652,294 5.46 2,865,513 0.00 2,305,064 0.00 184,445,390 228.60 1,475,622 0.27 2,476,862 0.61 Total Total Run2010A+B Total Run2011A+B 4,084,015,550 2,075.79 1,420,077,332 388.89 2,663,938,218 1,686.90

0

750,000,000

1,500,000,000

2,250,000,000

3,000,000,000

Events

Total Run2010A+B

Total Run2011A+B

0

500

1,000

1,500

2,000

Total Size [TB]

Total Run2010A+B

Total Run2011A+B

In progress, CMSSW_4_4 In progress, CMSSW_4_4 In progress, Higgs

Status: 11/30/2011

D

at

a

R

ep

ro

ce

ss

in

g

20

11

Grid resources close to full occupation

(plot shows use vs. pledges)

10

9

HEPSPEC-hours/month

(~150 k CPU continuous use)

CPU Use continues to grow…

even during technical stops

(9)

How to evolve WLCG?

A distributed computing infrastructure to provide the production and

analysis environments for the LHC experiments

• Collaboration - The resources are distributed and provided “in-kind”

• Service - Managed and operated by a worldwide collaboration

between the experiments and the participating computer centres

• Implementation - Today general grid technology with high-energy

physics specific higher-level services

Evolve the Implementation while preserving the collaboration & service

(10)

CERN openlab in a nutshell

A science – industry partnership to drive R&D

and innovation with over a decade of success

Evaluate state-of-the-art technologies in a

challenging environment and improve them

Test in a research environment today what will

be used in many business sectors tomorrow

Train next generation of engineers/employees

Disseminate results and outreach to new

audiences

CONTRIBUTOR (2012)

(11)

A European Cloud Computing Partnership

big science teams up with big business

11

(12)

Strategic Plan

for a scientific Cloud Computing Infrastructure in Europe

Establish a sustainable multi-tenant cloud

computing infrastructure in Europe

Initially based on the needs for the European

Research Area & space agencies

Integrate commercial services from multiple IT

industry providers

(13)

13

A Collaboration Initiative

European

Cloud Computing

Strategy

User organisations

Demand-side

Commercial Service

Providers

Supply-side

European Commission

& relevant projects

(14)

Timeline

Deploy flagships

Analysis of functionality,

performance &

financial model

Success Stories

Set-up

(2011)

Pilot phase

(2012-2014)

Full-scale cloud

service market

(2014 … )

Agree strategic plan

Select flagships use cases

Identify service providers

Define governance model

More applications

More services

More users

More service providers

co-funded by EC under

grant 312301 with 1.8M€

(15)

Pilot Phase

Explore / push a series of perceived barriers to Cloud adoption

via deployment of selected flagship applications:

Security: Unknown or low compliance and security standards

Reliability: Availability of service for business critical tasks

Data privacy: Moving sensitive data to the Cloud

Scalability / Elasticity: Will the Cloud scale-up to our needs

Network performance: Data transfer bottleneck; QoS

Integration: Hybrid systems with in-house / legacy systems

Vendor lock-in: Vendor dependency once data & applications are

transferred to the Cloud

Business Models: Finding a win-win model for the suppliers and users

Legal concerns: liability, jurisdiction, intellectual property

(16)

Initial flagship use cases

Scientific challenges with societal impact

Sponsored by user organisations

Stretch what is possible with the cloud today

16

ATLAS High Energy

Physics Cloud Use

Genomic Assembly

in the Cloud

SuperSites Exploitation

Platform

To support the computing

capacity needs for the

ATLAS experiment

A new service to simplify

large scale genome

analysis; for a deeper

insight into evolution and

biodiversity

To create an Earth

Observation platform,

focusing on earthquake

and volcano research

(17)

• Proof of Concept deployments within the Pilot Phase

started January 2012

• Each flagship has been deployed with a series of

providers independently

CERN, EMBL and ESA succeeded in deploying

scientific applications each involving tens of

thousands of jobs running at data centres

operated by Atos, CloudSigma and T-Systems

Flagship deployments

First results

(18)

CERN-ATLAS flagship configuration

Monte Carlo jobs (lighter I/O)

10s MB in/out

~6-12 hours/job

Ran ~40,000 CPU days

Ramón Medrano Llamas,Fernando Barreiro, Dan van der Ster (CERN IT), Rodney Walker (LMU Munich)

Difficulties overcome

Different vision of clouds

Different APIs

(19)

Open to new members

Users

Service

Providers

Adopters

Interested

Parties

19

http://www.helix-nebula.eu

(20)

Accelerating Science and Innovation

20

References

Related documents

The set of genes (blue lines consisting of yellow pixels in Figure 2-a) is generated inside the building footprint (Red rectangle in Figure 2-a) using the DDA (Digital

With support from both the Ontario Ministry of Health and the Ontario Chiropractic Association, presentations were developed that primarily addressed the three areas of greatest

It will: define the concept of an effective remedy; establish Hong Kong’s legal responsibility to provide an effective remedy for human rights violations as a party to International

Relation to [MO13] On October 2013, we initially posted the results contained within this paper which include: a construction of a public key encryption scheme that is IND-CPA

In the risk evaluation details page, you can compare the transaction to transactions that have been registered with the same card number, BIN, IP address, email

ABSTRACT: The species belonging to the genus Ixora are amongst the plants in Indian traditional Ayurveda system of medicine for a variety of ailments. The research work carried

This Smart Citizen kit works on geo location, Internet and hardware and software for data collection and sharing and analysing it (Smart Citizen Kit - SCK, REST full API, App and,

Environmental Technology, Recreational Goods Manufacturing, Biotechnology and Pharmaceuticals, Biomedical Products, Communications, and Software and Computer Services are all