• No results found

dcache, a managed storage in grid

N/A
N/A
Protected

Academic year: 2021

Share "dcache, a managed storage in grid"

Copied!
30
0
0

Loading.... (view fulltext now)

Full text

(1)

dCache.ORG

dCache.ORG

dCache, a managed storage in grid

Patrick

for the dCache Team

(2)

dCache.ORG

dCache.ORG

Topics

Project Topology

Why do we need storage elements in the grid world ?

The idea behind the LCG (gLite) storage element.

Available Solutions

The dCache implementation

dCache in a nutshell

Weak points and outlook

Usage

(3)

dCache.ORG

dCache.ORG

Head of dCache.ORG

Patrick Fuhrmann

Vladimir Podstavkov

Ted Hesselroth

External

Martin Radicke

Maarten Lithmaath, CERN

Alex Kulyavtsev

Gerd Behrmann, NDGF

Dmitri Litvintsev

Core Team (Desy and Fermi)

Development

Support and Help

Owen Synge

Greig Cowan, gridPP

Timur Perelmutov

Tigran Mkrtchyan

Abhishek Singh Rana, SDSC

Flavia Donno, CERN

Stijn De Weirdt (Quattor)

Jonathan Schaeffer, IN2P3

Dirk Pleiter

David Melkumyan

Head of Development FNAL :

Head of Development DESY :

Neha Sharma

Project Topology :

The Team

Iryna Koslova

Bjoern Boettscher

Andrew Baranovski

(4)

dCache.ORG

dCache.ORG

What do we need a grid storage element for ?

We need to serve large amounts of data locally

Access from local Compute Element

Huge amount of simultaneously open files.

Posix like access (What does that mean ?)

We need to exchange large amount of data with remote sites

Streaming protocols.

Optimized for low latency (wide area) links.

Possibly controlling 'link reservation'.

(5)

dCache.ORG

dCache.ORG

We need to allow storage control

Space reservation to guarantee maximum streaming.

Define space properties (TAPE, ONLINE, ...)

Transport protocol negotiation.

We need to publish SE specific information

What do we need a grid storage element for ? (cont.)

Clients need to select 'best' SE or CE for a job.

Availability

Available Space (max, used, free ...)

Supported Spaces (Tape, disk ...)

(6)

dCache.ORG

dCache.ORG

The Idea of a Grid Storage Element

SE

Local Area

Access

Site B

SE

Local Area

Access

Site A

Wide area dataset transport

Possibly managed by high level tools (FTS)

CE

(7)

dCache.ORG

dCache.ORG

SE

Information Publishing

Content : GLUE

Transport : LDAP

SRM Storage Resource Management

Space/Protocol Management

Wide Area Transport Protocol

In use : gsiFtp

Discussed : http(s)

Local Access Protocol

(gsi)dCap or rfio and xRoot

(8)

dCache.ORG

dCache.ORG

Available Solutions

dCache

TSM ®

HPSS ®

DMF ®

Enstore, OSM

CASTOR

DPM

SToRM

GPFS

Common Protocols

(9)

dCache.ORG

dCache.ORG

dCache Core

Namespace

provider

ACLs

Tape Storage

OSM, Enstore Tsm, Hpss, DMF Information Protocol(s)

(NFS 4.1)

gsiFtp

(http)

xRoot

Storage Management Protocol(s)

SRM 1.1 2.2

Data & Namespace Protocols

NFS 2 / 3

Namespace ONLY

dCap

High Level Services

Resilient Manager Maintenance Module

Hopping Manager

Admin Module (ssh, jpython) Flush Manager

ftp (V2)

The dCache SE implementation

Black Box View

(10)

dCache.ORG

dCache.ORG

File system view

/pnfs/desy.de /atlas /myFile

dCache disks

External (HSM)

001067645 001067645

001067645

001067645

Strict name space and data storage separation, allowing

consistent name space operations (mv, rm, mkdir e.t.c)

consistent access control per directory resp. file

managing multiple internal and external copies of the same file

convenient name space management by nfs (or http)

(11)

dCache.ORG

dCache.ORG

In a Nutshell

dCache partitioning for very large installations

File hopping on

automated hot spot detection

configuration (read only, write only, stage only pools)

on arrival (configurable)

(12)

dCache.ORG

dCache.ORG

Read Only Cache

From Client

RAW Data

2. copy

Backup Store

Main Tertiary Store

Replicate  on high load

non RAW Data

immediately 

copied to read pools 

Write Pools

Transfer Pools

File Hopping

(13)

dCache.ORG

dCache.ORG

dCache in a Nutshell

Overload and meltdown protection

Request Scheduler.

Separate I/O queues per protocol (load balancing)

Supported protocols : (gsi)ftp , (gsi)dCap, xRoot, SRM, nfs2/3

xRoot support

Vector read

(14)

dCache.ORG

dCache.ORG

Technical Introduction

Dispatcher by

request

Attributes

Scheduler and I/O queues

Pool Candidates selected by

Protocol

Client IP number/net

Data Flow Direction

Name Space Attributes (Directory)

SRM Spaces

Dispatcher by

Pool

Cost

List of

candidates

Pool Protocol

Queues

gsiFtp

dCap

xRoot

I/O Request

Space

Manager

(15)

dCache.ORG

dCache.ORG

In the Nutshell

HSM Support

TSM, HPSS, DMF, Enstore, Osm

Automated migration and restore

Working on Central Flush facility

support of multiple, non overlapping HSM systems (NDGF approach)

Misc

Graphical User Interface

Command line interface

Jpython interface

SRM watch

(16)

dCache.ORG

dCache.ORG

Is this useful for non LCG applications ?

Weak points :

Posix like is NOT posix (file system driver)

Http(s) not really supported

(17)

dCache.ORG

dCache.ORG

Posix like is NOT posix

SE

SE

App

libC

dCap

iNodes

xfs

ext3

Linked Library

App

libC

iNodes

xfs

ext3

preload dCap

Preload Library

Application needs to be

linked with the dCap

library.

Application stays unchanged but

doesn't work in all cases.

(18)

dCache.ORG

dCache.ORG

And this is real posix

SE

SE

App

libC

iNodes

xfs

NFS 4.1

Application doesn't need to be changed.

NFS 4.1 driver comes with OS.

(19)

dCache.ORG

(20)

dCache.ORG

dCache.ORG

We are currently putting significant efforts in the NFS 4.1 protocol

NFS 4.1 is aware of distributed data

Faster (optimized) e.g.:

Compound RPC calls

'Stat' produces 3 RPC calls in v3 but only one in v4

GSS authentication

Built in

mandatory security on file system level

ACL's

OPEN / CLOSE semantic (so system can keep track on open files)

'DEAD' client discovery (by client to server pings)

Technical Advantages :

Deployment Advantages :

Clients are coming for free (provided by all major OS vendors).

(21)

dCache.ORG

dCache.ORG

Postgres Oracle ***

NFS 4.1

Server

Jdbc

Chi

Lib

Chimera

dCache

core

Chi

Lib

NFS 4.1

Client

NFS 4.1 in dCache

provided by OS vendors

(22)

dCache.ORG

dCache.ORG

Goal : Industry standards in HEP ?

SE

SE

http(s)

nfs 4.1

(23)

dCache.ORG

dCache.ORG

dCache usage

In use at 9 Tier I centers

fzk (Karlsruhe, GR)

in2p3 (Lyon, FR)

RAL (Rutherford, UK)

BNL (New York. US)

FERMILab (Chicago, US)

SARA (Amsterdam. NL)

PIC (Spain)

Triumf (Canada)

NDGF (NorduGrid)

About 40 Tier II's

dCache will hold the largest share of the LHC data.

Tier 0

CERN

Tier 1

fzk

Tier II

DESY

Tier 1

RAL

Tier 1

IN2P3

Tier 1

***

dCache is part of VDT (OSG)

We are expecting > 20 PB per site > 2011

(24)

dCache.ORG

dCache.ORG

(25)

dCache.ORG

dCache.ORG

Finland

Denmark

Sweden

The NDGF Challenge : gsiFtp Protocol Version II

Chimera

Denmark

NDGF Tier I

Norway

SRM

Head-node

CERN

(26)

dCache.ORG

dCache.ORG

Single Site approach

Flush to HSM

Restore to

any Pool

Multi Site approach

Sweden

Norway

The NDGF Challenge : Multi Site HSM support

(27)

dCache.ORG

dCache.ORG

SRM 2.2

Only if there is a lot of time left

The wonderful world of

(28)

dCache.ORG

dCache.ORG

CUSTODIAL (T1Dx)

NON-CUSTODIAL (T0D1)

Dynamic Space Reservation

late pool binding for spaces

and more

The SRM in dCache supports

SRM 2.2

(29)

dCache.ORG

dCache.ORG

SRM 2.2 ( The space token)

T1 D0

T0 D1

As it used to be ( <= 1.7 )

As it will be with 1.8

(Custodial T1) Link Group

ST

Space

Size

Retention Policy

Space Token

Remark : The space of a Space Token is assigned to a pool at the

(30)

dCache.ORG

dCache.ORG

Further reading

References

Related documents

The purpose of this article is to examine recent literature about five of the high-impact educational practices (cap- stone experiences, learning communities, service learning

Schmit (2000) learns motion or The so-called motor learning states that motion learning is a series of processes associated with training or experience that leads to

Therefore, the research on the development of individual forehand smash models based on the results of badminton training from the results of research shows that the use of a

Employment protection appears to be strongly positively correlated to the labor share in minimum wage countries, and has no significant impact in non-minimum wage countries,

This module aims to critically evaluate the impact of a changing world environment on international business and develop a range of management solutions in response to these

This study investigates the behavior of urban runoff under different long-term climate scenarios with various densities of SUDS implementation using three kinds of

HOUSEFLIES AND BAD ODOUR IF NOT PROPERLY USED PATHOGENS TO HANDLERS OF MANURE PATHOGENS TO CONSUMERS OF FOOD PATHOGENS TO THOSE EMPTYING CONTENTS OF PIT PATHOGENS TO PEOPLE

Furthermore, at each study visit, the physicians were asked to collect by specific and validated questionnaires: (a) patients’ satisfac- tion with COPD medical treatments evaluated